HG10015340 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10015340
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1666)
LocationChr02: 25947919 .. 25950072 (-)
RNA-Seq ExpressionHG10015340
SyntenyHG10015340
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATGGATGGTGAAAACATGGGGAAAACGAGGAATTCTGTTTTATGCTCTGTTTCTGTGGAAACTATCTCGTCTTTTAAAATGGGAAATTATGATTTTACTTGTGGGAAGAAAGAACTTGACATGGATAATGTTGATTCTGATCTTTCTTACATGCAACCCACATCTTTGGCTAAAATTTCTAAGGGAAGAAATTCCGGTGGCTTGAAAAGTAGTGACGAAATTCTTGATGTTGACTCAAAAAATAATGCTTCCAATGTGTTTGATTCATTGCCTGAACCAGAAGTTCAAGTGCTTTGGGAAGATTGTCCTGTTCCATCTGATTCAGAATCAGCAGGGGACTCAACCAATGGCTCCCCAAAGATCAATCATGATCAGGTTGATGATTCGTCGTGCAAAGAGATCAAGGCTGAGGAAAACAAGCAACTCGACTCTCTCAATAATTTGGCTCAGAAAGAGGAAGAATCTGTCGACGTTTCACAGAAATCAACAGAAACAATTTTGCTGGGGAAGCGCTCCATTTTAAATTATGATCATCGATATGAGCTAAACTATCTACCAGATCATCAGGATATAGTACATCAGCTTGAAATGGAGCTGAAAAATTCAAGAACTGGAGGGCTTCCAACAATTTTTGAAGAGGAAACTGAAACAGCAGAAACAATTAGTGAAAAATTTAAGTATGAAGAAGTAATGGGAGAGATACAAAAGGTCTACAGGACTTATACAGAGAAAATGTGGAAGCTTGATGTCTTAAACAATCAGACTATGCATTCCATTGGTGAGTCTTATCTATATACCATGAAAAGTACTTACATTTTCCTTTTTGTCTACTTAATATTTGAATCAATATTTTGTTATATTTCCATGAACAAGTCCCAATGATTTGTGAAGAGTTGTAGATTTAGATAATGTAGTACCACATGTACTAACACCATCATTCAATGAAGTATAGTTTAATTTCTTGATGCTTTTATCGTAGCAGTATATTTGAATTCTGCAGGTTTGCTGCAGCTGCAATATCCACTGCAATCAGTGTCGGCGCAGAACTCACATAACGCACCGCTATGGTTGGGAAAAGCTCGGAGGCTGGGAGCTGATCCAAGACTTGAATTCATAAGAGATTTACTTAGAGACATAGAACTGGTGTACGTAGGACAAGTTTGTCTTTCCTGGGAAATTCTGCAATGGCAGCTTAGGAAATCCATTGACCTGCAACGATATGACTCGCAAGGCAATCGTCATTATAATCAAGTTGCTAGTGAGTTCCAACTTTTTCAAGTTATGCTGAAAAGATTCATGGGGGAAGAAAGGTTTCAGGGCAACAGAGTTGAGAACTATGTCCAGAACCGCTGTGTATTTCGTTCTCTTCTGCTAGTTCCACCCATCAAAGGTAGAATATTCACCACATTTATTAAAGTTTGAGGTTCAGATGATGATAAACCTATGTAATTGTTCTTCAGATGATGGTTCTGCTGAAGCTGAAGGAAGAGAATGGGAAGATCATGAAGATGCCTTTTCCAGTGATTTTGTAACAGAAATCATAGAGAAATCAATGTGGGTTTTCTATGAATTTCTTCTTTCTGATAAAGATGATGTTAAAAGTATCCTGAAAATGAACAGAAAGTATCAGATTGAACTCCAAAATACAGAAAATCCACAGCTTTTGTTGGTCAACATACAAGCTCAGTTTCAGAAGGTTGGCTTACCAATCTTATTCTTCAATATCATAGCAATGTCTTTCTGTCATCTCATACCTAAAAAATTCTGTTTTCTGGTGATGTAGGCAGAGAGAAAGCTGAAAGACCTTATGAGAGGCAGCAATAGATGTTCTGTAGAGAAGTTGGGAAAGCAAGAGGAAGCTGGACTGAGTTATTCATTAATACTTCTCATTGCTCAGGTTGACCTCAAATTGATTTCAAGAGTGCTGAGAATGACAAGATTGACTGTCAACCAATTATTATGGTGCAACCAAAAACTTGATCAACTTACATTTATAAACAGACAGGTCCTCTTGGAACCCTCGCTTCTGCTTTTTCCCTTTTAACACTATTTTTTTCACAGTTTATAACAAATAACAGTATTGGAATATCTTTCTTCGTTCAAAATGTAAAGTTCCCAAACATTACACTTTCTAGATTGAATTATTAG

mRNA sequence

ATGGAAATGGATGGTGAAAACATGGGGAAAACGAGGAATTCTGTTTTATGCTCTGTTTCTGTGGAAACTATCTCGTCTTTTAAAATGGGAAATTATGATTTTACTTGTGGGAAGAAAGAACTTGACATGGATAATGTTGATTCTGATCTTTCTTACATGCAACCCACATCTTTGGCTAAAATTTCTAAGGGAAGAAATTCCGGTGGCTTGAAAAGTAGTGACGAAATTCTTGATGTTGACTCAAAAAATAATGCTTCCAATGTGTTTGATTCATTGCCTGAACCAGAAGTTCAAGTGCTTTGGGAAGATTGTCCTGTTCCATCTGATTCAGAATCAGCAGGGGACTCAACCAATGGCTCCCCAAAGATCAATCATGATCAGGTTGATGATTCGTCGTGCAAAGAGATCAAGGCTGAGGAAAACAAGCAACTCGACTCTCTCAATAATTTGGCTCAGAAAGAGGAAGAATCTGTCGACGTTTCACAGAAATCAACAGAAACAATTTTGCTGGGGAAGCGCTCCATTTTAAATTATGATCATCGATATGAGCTAAACTATCTACCAGATCATCAGGATATAGTACATCAGCTTGAAATGGAGCTGAAAAATTCAAGAACTGGAGGGCTTCCAACAATTTTTGAAGAGGAAACTGAAACAGCAGAAACAATTAGTGAAAAATTTAAGTATGAAGAAGTAATGGGAGAGATACAAAAGGTCTACAGGACTTATACAGAGAAAATGTGGAAGCTTGATGTCTTAAACAATCAGACTATGCATTCCATTGTATATTTGAATTCTGCAGGTTTGCTGCAGCTGCAATATCCACTGCAATCAGTGTCGGCGCAGAACTCACATAACGCACCGCTATGGTTGGGAAAAGCTCGGAGGCTGGGAGCTGATCCAAGACTTGAATTCATAAGAGATTTACTTAGAGACATAGAACTGGTGTACGTAGGACAAGTTTGTCTTTCCTGGGAAATTCTGCAATGGCAGCTTAGGAAATCCATTGACCTGCAACGATATGACTCGCAAGGCAATCGTCATTATAATCAAGTTGCTAGTGAGTTCCAACTTTTTCAAGTTATGCTGAAAAGATTCATGGGGGAAGAAAGGTTTCAGGGCAACAGAGTTGAGAACTATGTCCAGAACCGCTGTGTATTTCGTTCTCTTCTGCTAGTTCCACCCATCAAAGATGATGGTTCTGCTGAAGCTGAAGGAAGAGAATGGGAAGATCATGAAGATGCCTTTTCCAGTGATTTTGTAACAGAAATCATAGAGAAATCAATGTGGGTTTTCTATGAATTTCTTCTTTCTGATAAAGATGATGTTAAAAGTATCCTGAAAATGAACAGAAAGTATCAGATTGAACTCCAAAATACAGAAAATCCACAGCTTTTGTTGGTCAACATACAAGCTCAGTTTCAGAAGGCAGAGAGAAAGCTGAAAGACCTTATGAGAGGCAGCAATAGATGTTCTGTAGAGAAGTTGGGAAAGCAAGAGGAAGCTGGACTGAGTTATTCATTAATACTTCTCATTGCTCAGTTTATAACAAATAACAGTATTGGAATATCTTTCTTCGTTCAAAATGTAAAGTTCCCAAACATTACACTTTCTAGATTGAATTATTAG

Coding sequence (CDS)

ATGGAAATGGATGGTGAAAACATGGGGAAAACGAGGAATTCTGTTTTATGCTCTGTTTCTGTGGAAACTATCTCGTCTTTTAAAATGGGAAATTATGATTTTACTTGTGGGAAGAAAGAACTTGACATGGATAATGTTGATTCTGATCTTTCTTACATGCAACCCACATCTTTGGCTAAAATTTCTAAGGGAAGAAATTCCGGTGGCTTGAAAAGTAGTGACGAAATTCTTGATGTTGACTCAAAAAATAATGCTTCCAATGTGTTTGATTCATTGCCTGAACCAGAAGTTCAAGTGCTTTGGGAAGATTGTCCTGTTCCATCTGATTCAGAATCAGCAGGGGACTCAACCAATGGCTCCCCAAAGATCAATCATGATCAGGTTGATGATTCGTCGTGCAAAGAGATCAAGGCTGAGGAAAACAAGCAACTCGACTCTCTCAATAATTTGGCTCAGAAAGAGGAAGAATCTGTCGACGTTTCACAGAAATCAACAGAAACAATTTTGCTGGGGAAGCGCTCCATTTTAAATTATGATCATCGATATGAGCTAAACTATCTACCAGATCATCAGGATATAGTACATCAGCTTGAAATGGAGCTGAAAAATTCAAGAACTGGAGGGCTTCCAACAATTTTTGAAGAGGAAACTGAAACAGCAGAAACAATTAGTGAAAAATTTAAGTATGAAGAAGTAATGGGAGAGATACAAAAGGTCTACAGGACTTATACAGAGAAAATGTGGAAGCTTGATGTCTTAAACAATCAGACTATGCATTCCATTGTATATTTGAATTCTGCAGGTTTGCTGCAGCTGCAATATCCACTGCAATCAGTGTCGGCGCAGAACTCACATAACGCACCGCTATGGTTGGGAAAAGCTCGGAGGCTGGGAGCTGATCCAAGACTTGAATTCATAAGAGATTTACTTAGAGACATAGAACTGGTGTACGTAGGACAAGTTTGTCTTTCCTGGGAAATTCTGCAATGGCAGCTTAGGAAATCCATTGACCTGCAACGATATGACTCGCAAGGCAATCGTCATTATAATCAAGTTGCTAGTGAGTTCCAACTTTTTCAAGTTATGCTGAAAAGATTCATGGGGGAAGAAAGGTTTCAGGGCAACAGAGTTGAGAACTATGTCCAGAACCGCTGTGTATTTCGTTCTCTTCTGCTAGTTCCACCCATCAAAGATGATGGTTCTGCTGAAGCTGAAGGAAGAGAATGGGAAGATCATGAAGATGCCTTTTCCAGTGATTTTGTAACAGAAATCATAGAGAAATCAATGTGGGTTTTCTATGAATTTCTTCTTTCTGATAAAGATGATGTTAAAAGTATCCTGAAAATGAACAGAAAGTATCAGATTGAACTCCAAAATACAGAAAATCCACAGCTTTTGTTGGTCAACATACAAGCTCAGTTTCAGAAGGCAGAGAGAAAGCTGAAAGACCTTATGAGAGGCAGCAATAGATGTTCTGTAGAGAAGTTGGGAAAGCAAGAGGAAGCTGGACTGAGTTATTCATTAATACTTCTCATTGCTCAGTTTATAACAAATAACAGTATTGGAATATCTTTCTTCGTTCAAAATGTAAAGTTCCCAAACATTACACTTTCTAGATTGAATTATTAG

Protein sequence

MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQFITNNSIGISFFVQNVKFPNITLSRLNY
Homology
BLAST of HG10015340 vs. NCBI nr
Match: XP_038891903.1 (uncharacterized protein LOC120081255 [Benincasa hispida])

HSP 1 Score: 863.2 bits (2229), Expect = 1.2e-246
Identity = 448/514 (87.16%), Postives = 471/514 (91.63%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           M+MD EN GKT NSVLCSVSVET SSFKM N DFTCG+KE DM NVDS LS +QPT+L  
Sbjct: 58  MDMDDENNGKTGNSVLCSVSVETSSSFKMRNCDFTCGRKEFDMGNVDSALSCIQPTTLGI 117

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKGRNSG LKSS EI+DVDSKNNASNVFD LPEPEVQVLWED PV SDSESAGDSTN S
Sbjct: 118 VSKGRNSGSLKSSGEIIDVDSKNNASNVFDQLPEPEVQVLWEDYPVLSDSESAGDSTNAS 177

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PKINHDQVDDSSCKE   EENKQLDSLNNLAQKEEESV+VS+KSTETILL K SILNY H
Sbjct: 178 PKINHDQVDDSSCKETNTEENKQLDSLNNLAQKEEESVNVSEKSTETILLEKNSILNYGH 237

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
           RYELNYLPDHQDIVHQLEMELKN+RTGGLPTIFEEETE AETI+EKFKYEE+MGEIQKVY
Sbjct: 238 RYELNYLPDHQDIVHQLEMELKNARTGGLPTIFEEETEKAETINEKFKYEEIMGEIQKVY 297

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           RTY EKMWKLDVLNNQ+MH+I      GLLQLQYPLQSV AQNS+NAP+WLGKARRLGAD
Sbjct: 298 RTYAEKMWKLDVLNNQSMHAI------GLLQLQYPLQSVLAQNSYNAPIWLGKARRLGAD 357

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
           PRLEF+ DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQR+DSQG RHYNQVASEFQLFQ
Sbjct: 358 PRLEFVEDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRHDSQGIRHYNQVASEFQLFQ 417

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           V+LKRFM EERFQGNRVENYVQNRCVF SLLLVPP+KDDGS EAEGREWE HED FSS+ 
Sbjct: 418 VILKRFMEEERFQGNRVENYVQNRCVFHSLLLVPPVKDDGSGEAEGREWE-HEDGFSSNL 477

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRK+QIELQN ENPQLLLVNIQA+F KAERK
Sbjct: 478 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKHQIELQNIENPQLLLVNIQARFLKAERK 537

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           LK+LMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ
Sbjct: 538 LKNLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 564

BLAST of HG10015340 vs. NCBI nr
Match: XP_011655680.1 (uncharacterized protein LOC105435571 isoform X1 [Cucumis sativus])

HSP 1 Score: 765.8 bits (1976), Expect = 2.6e-217
Identity = 409/514 (79.57%), Postives = 439/514 (85.41%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +QPT+LA 
Sbjct: 60  MEMDCENSGKTRNSVLCSVSVETISS-------FTCGRENFEMCNADSAISRIQPTALAI 119

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKGRNSGGLK S E+LDV S+NN S+VFDSLPEPEVQV WEDCPVPSDSES GDSTNGS
Sbjct: 120 VSKGRNSGGLKCSGEVLDVGSENNGSDVFDSLPEPEVQVFWEDCPVPSDSESVGDSTNGS 179

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PKINHDQVDDSSCKEI AEENKQL+SLNNL QKEEE+V+ S+KST+   L K  I  +DH
Sbjct: 180 PKINHDQVDDSSCKEINAEENKQLESLNNLTQKEEETVNFSKKSTKMNFLEKGFISKFDH 239

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
             ELNYLPDHQDIV QLEMELKNSRTGGLPTIFEEETE+AETI EK KYEEVMGEIQKVY
Sbjct: 240 MSELNYLPDHQDIVRQLEMELKNSRTGGLPTIFEEETESAETIYEKLKYEEVMGEIQKVY 299

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           +TY EKMW LDVLNNQ MH+I      GLLQLQYPLQ VS+QNS N  LW GKARRLGAD
Sbjct: 300 KTYAEKMWNLDVLNNQCMHAI------GLLQLQYPLQPVSSQNSQNPSLWFGKARRLGAD 359

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
            RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVASEFQLFQ
Sbjct: 360 TRLEFIGDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQSIHRYNQVASEFQLFQ 419

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDG AEAEGREWED ED +SS+F
Sbjct: 420 VMLKRFMEEERFQGNRVENYVQNRCIFRSLLLVPPIKDDGFAEAEGREWED-EDGYSSNF 479

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           VTE IEKSM VFYEFLLSDKDDVKSILK+NRK+QIE QNT+  QLLL +IQAQFQKAERK
Sbjct: 480 VTETIEKSMCVFYEFLLSDKDDVKSILKLNRKHQIEFQNTD--QLLLASIQAQFQKAERK 539

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           LK+LMR  +RCS EKL K +EAGL YSLILLI Q
Sbjct: 540 LKNLMRCRHRCSAEKLRKLQEAGLRYSLILLILQ 557

BLAST of HG10015340 vs. NCBI nr
Match: XP_008446346.1 (PREDICTED: uncharacterized protein LOC103489114 isoform X1 [Cucumis melo])

HSP 1 Score: 761.5 bits (1965), Expect = 4.8e-216
Identity = 408/514 (79.38%), Postives = 434/514 (84.44%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +Q      
Sbjct: 99  MEMDCENRGKTRNSVLCSVSVETISS-------FTCGRENFEMGNADSAISLIQ------ 158

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
            SKGRNSGGLK S EILDV SKNNAS+VFDSL EPEVQV WEDCP PSDSES GDSTN S
Sbjct: 159 -SKGRNSGGLKCSGEILDVGSKNNASDVFDSLAEPEVQVFWEDCPPPSDSESVGDSTNSS 218

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PKINHDQVDD SCKEI  EENKQLDSLN LAQKEEE+V+V++KSTET LL K  IL +D+
Sbjct: 219 PKINHDQVDDLSCKEINTEENKQLDSLNKLAQKEEETVNVAEKSTETNLLEKGFILKFDN 278

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
            YELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETI EK KYEEVMGEIQK Y
Sbjct: 279 MYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETIYEKLKYEEVMGEIQKAY 338

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           +TY EKMW LDVLNNQ+MH+I      GLLQLQYP+  VS+ NS N  LW GKARRLG D
Sbjct: 339 KTYAEKMWNLDVLNNQSMHAI------GLLQLQYPMPPVSSHNSQNPSLWFGKARRLGTD 398

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
            RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVAS+FQLFQ
Sbjct: 399 TRLEFIGDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQSIHRYNQVASDFQLFQ 458

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDGSAEAEGREWED ED +SS+F
Sbjct: 459 VMLKRFMEEERFQGNRVENYVQNRCIFRSLLLVPPIKDDGSAEAEGREWED-EDGYSSNF 518

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           VTE IEKSM VFYEFLLSDKDDV+SILK+NRK+QIE Q+TEN QLLL +IQAQFQKAERK
Sbjct: 519 VTETIEKSMCVFYEFLLSDKDDVQSILKLNRKHQIEFQDTENRQLLLASIQAQFQKAERK 578

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           LKDLMR   RCS EKL K EEAGLSYSLILLI Q
Sbjct: 579 LKDLMRCRRRCSAEKLRKLEEAGLSYSLILLILQ 591

BLAST of HG10015340 vs. NCBI nr
Match: XP_023541276.1 (uncharacterized protein LOC111801499 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 739.6 bits (1908), Expect = 2.0e-209
Identity = 391/514 (76.07%), Postives = 439/514 (85.41%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FTCG KELDMDNVD  LSY+Q T+L  
Sbjct: 85  MEMDGEDNGKTRSSVFCSVSMETISSLKMGDSEFTCGIKELDMDNVDCTLSYVQSTALDI 144

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKG NS GLKS++ ILDV S+N+ASNVFD LPEPEVQVLWED P PSDSESA +ST+GS
Sbjct: 145 VSKGVNSDGLKSTNTILDVYSENDASNVFDVLPEPEVQVLWEDYPDPSDSESAEESTSGS 204

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PK NHDQVDDSSCKEI     K+LDS NN  +KE+ESV+V ++STE  L    S+LNYD 
Sbjct: 205 PKTNHDQVDDSSCKEI-----KRLDSFNNF-EKEKESVNVLEESTEASLQENPSMLNYDR 264

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
           RYELNYLPDHQDIV QLEMEL+N+RTGGLPTIFEE+TETAE I+EKFKYEEVMGEIQKVY
Sbjct: 265 RYELNYLPDHQDIVKQLEMELRNARTGGLPTIFEEDTETAEAINEKFKYEEVMGEIQKVY 324

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           R Y EKMWKLD+LNNQ MH I      GL  L+YPLQSVSAQNS ++ LWLGKARRLGAD
Sbjct: 325 RIYAEKMWKLDILNNQIMHVI------GLQHLKYPLQSVSAQNSQSSQLWLGKARRLGAD 384

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
           P L F+ DLLRDIE+VYVGQVCLSWE+LQWQLRKS++LQRYDSQG R YNQVASEFQLFQ
Sbjct: 385 PVLVFLGDLLRDIEMVYVGQVCLSWEMLQWQLRKSLELQRYDSQGIRQYNQVASEFQLFQ 444

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM  E  QGNRV NYVQNRCVFRSLL VPPI DD SAEAEGREWED  D FSS+F
Sbjct: 445 VMLKRFMEGESLQGNRVNNYVQNRCVFRSLLQVPPIIDDDSAEAEGREWEDDYD-FSSNF 504

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           + EIIEKSMWVFYEFL+SDKDD K+ILK NRK+QIELQN+ENPQLLLVN+Q  FQK+ERK
Sbjct: 505 LAEIIEKSMWVFYEFLVSDKDDAKNILKCNRKHQIELQNSENPQLLLVNVQDHFQKSERK 564

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           +KDL+  + RCS++KLGKQEEAGLSYSL+LLIAQ
Sbjct: 565 VKDLLSRNKRCSLDKLGKQEEAGLSYSLMLLIAQ 585

BLAST of HG10015340 vs. NCBI nr
Match: XP_023541275.1 (uncharacterized protein LOC111801499 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 739.6 bits (1908), Expect = 2.0e-209
Identity = 391/514 (76.07%), Postives = 439/514 (85.41%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FTCG KELDMDNVD  LSY+Q T+L  
Sbjct: 85  MEMDGEDNGKTRSSVFCSVSMETISSLKMGDSEFTCGIKELDMDNVDCTLSYVQSTALDI 144

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKG NS GLKS++ ILDV S+N+ASNVFD LPEPEVQVLWED P PSDSESA +ST+GS
Sbjct: 145 VSKGVNSDGLKSTNTILDVYSENDASNVFDVLPEPEVQVLWEDYPDPSDSESAEESTSGS 204

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PK NHDQVDDSSCKEI     K+LDS NN  +KE+ESV+V ++STE  L    S+LNYD 
Sbjct: 205 PKTNHDQVDDSSCKEI-----KRLDSFNNF-EKEKESVNVLEESTEASLQENPSMLNYDR 264

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
           RYELNYLPDHQDIV QLEMEL+N+RTGGLPTIFEE+TETAE I+EKFKYEEVMGEIQKVY
Sbjct: 265 RYELNYLPDHQDIVKQLEMELRNARTGGLPTIFEEDTETAEAINEKFKYEEVMGEIQKVY 324

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           R Y EKMWKLD+LNNQ MH I      GL  L+YPLQSVSAQNS ++ LWLGKARRLGAD
Sbjct: 325 RIYAEKMWKLDILNNQIMHVI------GLQHLKYPLQSVSAQNSQSSQLWLGKARRLGAD 384

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
           P L F+ DLLRDIE+VYVGQVCLSWE+LQWQLRKS++LQRYDSQG R YNQVASEFQLFQ
Sbjct: 385 PVLVFLGDLLRDIEMVYVGQVCLSWEMLQWQLRKSLELQRYDSQGIRQYNQVASEFQLFQ 444

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM  E  QGNRV NYVQNRCVFRSLL VPPI DD SAEAEGREWED  D FSS+F
Sbjct: 445 VMLKRFMEGESLQGNRVNNYVQNRCVFRSLLQVPPIIDDDSAEAEGREWEDDYD-FSSNF 504

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           + EIIEKSMWVFYEFL+SDKDD K+ILK NRK+QIELQN+ENPQLLLVN+Q  FQK+ERK
Sbjct: 505 LAEIIEKSMWVFYEFLVSDKDDAKNILKCNRKHQIELQNSENPQLLLVNVQDHFQKSERK 564

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           +KDL+  + RCS++KLGKQEEAGLSYSL+LLIAQ
Sbjct: 565 VKDLLSRNKRCSLDKLGKQEEAGLSYSLMLLIAQ 585

BLAST of HG10015340 vs. ExPASy TrEMBL
Match: A0A0A0KQM1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G605070 PE=4 SV=1)

HSP 1 Score: 765.0 bits (1974), Expect = 2.1e-217
Identity = 412/532 (77.44%), Postives = 446/532 (83.83%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +QPT+LA 
Sbjct: 60  MEMDCENSGKTRNSVLCSVSVETISS-------FTCGRENFEMCNADSAISRIQPTALAI 119

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKGRNSGGLK S E+LDV S+NN S+VFDSLPEPEVQV WEDCPVPSDSES GDSTNGS
Sbjct: 120 VSKGRNSGGLKCSGEVLDVGSENNGSDVFDSLPEPEVQVFWEDCPVPSDSESVGDSTNGS 179

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PKINHDQVDDSSCKEI AEENKQL+SLNNL QKEEE+V+ S+KST+   L K  I  +DH
Sbjct: 180 PKINHDQVDDSSCKEINAEENKQLESLNNLTQKEEETVNFSKKSTKMNFLEKGFISKFDH 239

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
             ELNYLPDHQDIV QLEMELKNSRTGGLPTIFEEETE+AETI EK KYEEVMGEIQKVY
Sbjct: 240 MSELNYLPDHQDIVRQLEMELKNSRTGGLPTIFEEETESAETIYEKLKYEEVMGEIQKVY 299

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           +TY EKMW LDVLNNQ MH+I      GLLQLQYPLQ VS+QNS N  LW GKARRLGAD
Sbjct: 300 KTYAEKMWNLDVLNNQCMHAI------GLLQLQYPLQPVSSQNSQNPSLWFGKARRLGAD 359

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
            RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVASEFQLFQ
Sbjct: 360 TRLEFIGDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQSIHRYNQVASEFQLFQ 419

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDG AEAEGREWED ED +SS+F
Sbjct: 420 VMLKRFMEEERFQGNRVENYVQNRCIFRSLLLVPPIKDDGFAEAEGREWED-EDGYSSNF 479

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           VTE IEKSM VFYEFLLSDKDDVKSILK+NRK+QIE QNT+  QLLL +IQAQFQKAERK
Sbjct: 480 VTETIEKSMCVFYEFLLSDKDDVKSILKLNRKHQIEFQNTD--QLLLASIQAQFQKAERK 539

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQFITNNSIGISFFVQNVKF 533
           LK+LMR  +RCS EKL K +EAGL YSLILLI Q     ++  +F +  V F
Sbjct: 540 LKNLMRCRHRCSAEKLRKLQEAGLRYSLILLILQKDPLGTLTSAFSLLTVYF 575

BLAST of HG10015340 vs. ExPASy TrEMBL
Match: A0A1S3BFI6 (uncharacterized protein LOC103489114 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489114 PE=4 SV=1)

HSP 1 Score: 761.5 bits (1965), Expect = 2.3e-216
Identity = 408/514 (79.38%), Postives = 434/514 (84.44%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +Q      
Sbjct: 99  MEMDCENRGKTRNSVLCSVSVETISS-------FTCGRENFEMGNADSAISLIQ------ 158

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
            SKGRNSGGLK S EILDV SKNNAS+VFDSL EPEVQV WEDCP PSDSES GDSTN S
Sbjct: 159 -SKGRNSGGLKCSGEILDVGSKNNASDVFDSLAEPEVQVFWEDCPPPSDSESVGDSTNSS 218

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PKINHDQVDD SCKEI  EENKQLDSLN LAQKEEE+V+V++KSTET LL K  IL +D+
Sbjct: 219 PKINHDQVDDLSCKEINTEENKQLDSLNKLAQKEEETVNVAEKSTETNLLEKGFILKFDN 278

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
            YELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETI EK KYEEVMGEIQK Y
Sbjct: 279 MYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETIYEKLKYEEVMGEIQKAY 338

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           +TY EKMW LDVLNNQ+MH+I      GLLQLQYP+  VS+ NS N  LW GKARRLG D
Sbjct: 339 KTYAEKMWNLDVLNNQSMHAI------GLLQLQYPMPPVSSHNSQNPSLWFGKARRLGTD 398

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
            RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVAS+FQLFQ
Sbjct: 399 TRLEFIGDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQSIHRYNQVASDFQLFQ 458

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDGSAEAEGREWED ED +SS+F
Sbjct: 459 VMLKRFMEEERFQGNRVENYVQNRCIFRSLLLVPPIKDDGSAEAEGREWED-EDGYSSNF 518

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           VTE IEKSM VFYEFLLSDKDDV+SILK+NRK+QIE Q+TEN QLLL +IQAQFQKAERK
Sbjct: 519 VTETIEKSMCVFYEFLLSDKDDVQSILKLNRKHQIEFQDTENRQLLLASIQAQFQKAERK 578

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           LKDLMR   RCS EKL K EEAGLSYSLILLI Q
Sbjct: 579 LKDLMRCRRRCSAEKLRKLEEAGLSYSLILLILQ 591

BLAST of HG10015340 vs. ExPASy TrEMBL
Match: A0A6J1G171 (uncharacterized protein LOC111449746 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449746 PE=4 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 1.4e-205
Identity = 388/515 (75.34%), Postives = 432/515 (83.88%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FT G KE+ MDNVD  LSY+Q T+L  
Sbjct: 85  MEMDGEDNGKTRSSVFCSVSMETISSLKMGDSEFTRGSKEVGMDNVDCTLSYVQSTALDI 144

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKG NS GLKS++ ILDV S+NNASNVFD LPEPEVQVLWED P PSDSESA +ST+GS
Sbjct: 145 VSKGLNSDGLKSTNIILDVYSENNASNVFDVLPEPEVQVLWEDYPDPSDSESAEESTSGS 204

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNL-AQKEEESVDVSQKSTETILLGKRSILNYD 180
           PK NHDQVDDSSCKEI     KQLDS NN   +KE+ESV+ S++STE  L  K S+LNYD
Sbjct: 205 PKTNHDQVDDSSCKEI-----KQLDSFNNFEKEKEKESVNASEESTEASLQEKPSMLNYD 264

Query: 181 HRYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKV 240
           HRYELNYLPDHQDIV QLEMEL+N+RTGGLPTIFEEE ETAE I+EKFKYEEVMGEIQKV
Sbjct: 265 HRYELNYLPDHQDIVQQLEMELRNARTGGLPTIFEEEAETAEAINEKFKYEEVMGEIQKV 324

Query: 241 YRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGA 300
           YR Y EKMWKLD+LNNQ MH I      GL  L+YPLQSV  QNS ++ LWLGKARRLGA
Sbjct: 325 YRIYAEKMWKLDILNNQIMHVI------GLQHLKYPLQSVPVQNSQSSQLWLGKARRLGA 384

Query: 301 DPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLF 360
           DP L F+ DL RDIE VYVGQVCLSWEILQWQLRKS++LQRYDSQG R YNQVASEFQLF
Sbjct: 385 DPILVFLGDLSRDIETVYVGQVCLSWEILQWQLRKSLELQRYDSQGIRQYNQVASEFQLF 444

Query: 361 QVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSD 420
           QVMLKRFM  ER QGNRV +YVQNRCVFRSLL VPPI DD SAEAEGRE ED  D FSS+
Sbjct: 445 QVMLKRFMEGERLQGNRVNDYVQNRCVFRSLLQVPPIIDDDSAEAEGREREDDYD-FSSN 504

Query: 421 FVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAER 480
           F+ EIIEKSMWVFYEFL+SDKD VK+ILK NRK+QIEL+N+ENPQLLLVN+Q  F K+ER
Sbjct: 505 FLAEIIEKSMWVFYEFLVSDKDHVKNILKCNRKHQIELENSENPQLLLVNVQDHFHKSER 564

Query: 481 KLKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           K+KDL+  + RCS EKLGKQEEAGLSYSL+LLIAQ
Sbjct: 565 KVKDLLSRNKRCSSEKLGKQEEAGLSYSLMLLIAQ 587

BLAST of HG10015340 vs. ExPASy TrEMBL
Match: A0A6J1G186 (uncharacterized protein LOC111449746 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111449746 PE=4 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 1.4e-205
Identity = 388/515 (75.34%), Postives = 432/515 (83.88%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FT G KE+ MDNVD  LSY+Q T+L  
Sbjct: 85  MEMDGEDNGKTRSSVFCSVSMETISSLKMGDSEFTRGSKEVGMDNVDCTLSYVQSTALDI 144

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           +SKG NS GLKS++ ILDV S+NNASNVFD LPEPEVQVLWED P PSDSESA +ST+GS
Sbjct: 145 VSKGLNSDGLKSTNIILDVYSENNASNVFDVLPEPEVQVLWEDYPDPSDSESAEESTSGS 204

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNL-AQKEEESVDVSQKSTETILLGKRSILNYD 180
           PK NHDQVDDSSCKEI     KQLDS NN   +KE+ESV+ S++STE  L  K S+LNYD
Sbjct: 205 PKTNHDQVDDSSCKEI-----KQLDSFNNFEKEKEKESVNASEESTEASLQEKPSMLNYD 264

Query: 181 HRYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKV 240
           HRYELNYLPDHQDIV QLEMEL+N+RTGGLPTIFEEE ETAE I+EKFKYEEVMGEIQKV
Sbjct: 265 HRYELNYLPDHQDIVQQLEMELRNARTGGLPTIFEEEAETAEAINEKFKYEEVMGEIQKV 324

Query: 241 YRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGA 300
           YR Y EKMWKLD+LNNQ MH I      GL  L+YPLQSV  QNS ++ LWLGKARRLGA
Sbjct: 325 YRIYAEKMWKLDILNNQIMHVI------GLQHLKYPLQSVPVQNSQSSQLWLGKARRLGA 384

Query: 301 DPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLF 360
           DP L F+ DL RDIE VYVGQVCLSWEILQWQLRKS++LQRYDSQG R YNQVASEFQLF
Sbjct: 385 DPILVFLGDLSRDIETVYVGQVCLSWEILQWQLRKSLELQRYDSQGIRQYNQVASEFQLF 444

Query: 361 QVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSD 420
           QVMLKRFM  ER QGNRV +YVQNRCVFRSLL VPPI DD SAEAEGRE ED  D FSS+
Sbjct: 445 QVMLKRFMEGERLQGNRVNDYVQNRCVFRSLLQVPPIIDDDSAEAEGREREDDYD-FSSN 504

Query: 421 FVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAER 480
           F+ EIIEKSMWVFYEFL+SDKD VK+ILK NRK+QIEL+N+ENPQLLLVN+Q  F K+ER
Sbjct: 505 FLAEIIEKSMWVFYEFLVSDKDHVKNILKCNRKHQIELENSENPQLLLVNVQDHFHKSER 564

Query: 481 KLKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           K+KDL+  + RCS EKLGKQEEAGLSYSL+LLIAQ
Sbjct: 565 KVKDLLSRNKRCSSEKLGKQEEAGLSYSLMLLIAQ 587

BLAST of HG10015340 vs. ExPASy TrEMBL
Match: A0A6J1HWY0 (uncharacterized protein LOC111467411 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467411 PE=4 SV=1)

HSP 1 Score: 725.3 bits (1871), Expect = 1.9e-205
Identity = 388/514 (75.49%), Postives = 433/514 (84.24%), Query Frame = 0

Query: 1   MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAK 60
           MEMDGE+ GKTR+SV CSV VETISS KMG+ +FTCG KELDMDNVD  LSY+Q T+L  
Sbjct: 60  MEMDGEDNGKTRSSVFCSVFVETISSLKMGDSEFTCGIKELDMDNVDCTLSYVQSTALDI 119

Query: 61  ISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDSESAGDSTNGS 120
           + KG NS GL S++ ILDV S+N+ASNVFD LPEPEVQVLWED P PSDSESA +ST+GS
Sbjct: 120 VCKGVNSDGLTSTNTILDVYSENDASNVFDVLPEPEVQVLWEDYPDPSDSESAEESTSGS 179

Query: 121 PKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDH 180
           PK NHDQVDDSSCK+I     KQLDS NNL +KE+ESV+VS++STE  L  K S+LNYDH
Sbjct: 180 PKTNHDQVDDSSCKQI-----KQLDSFNNL-EKEKESVNVSEESTEASLQEKPSMLNYDH 239

Query: 181 RYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVY 240
           RYELNYLPDHQDIV QLEMEL+N+RTGGLPTIFEE+TET E I++KFKYEEVMGEIQKVY
Sbjct: 240 RYELNYLPDHQDIVKQLEMELRNARTGGLPTIFEEDTETEEAINDKFKYEEVMGEIQKVY 299

Query: 241 RTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD 300
           R Y EKMWKLD+LNNQ MH I      GL  L++PLQSVSAQNS ++ LWLGKARRLGAD
Sbjct: 300 RIYAEKMWKLDILNNQIMHVI------GLQHLKHPLQSVSAQNSQSSQLWLGKARRLGAD 359

Query: 301 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQ 360
           P L F+ DL RDIE VYVGQVCLSWEILQWQL KS++LQRYDSQG R YNQVASEFQLFQ
Sbjct: 360 PILVFLGDLSRDIETVYVGQVCLSWEILQWQLSKSLELQRYDSQGIRQYNQVASEFQLFQ 419

Query: 361 VMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDF 420
           VMLKRFM  ER QGNRV NYVQNRCVFRSLL VP I DD SAEAEGRE ED  D FSS+F
Sbjct: 420 VMLKRFMEGERLQGNRVNNYVQNRCVFRSLLQVPHIIDDDSAEAEGREREDDYD-FSSNF 479

Query: 421 VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERK 480
           + EIIEKSMWVFYEFL+SDKDD K+ILK NRK+QIELQN+ENPQLLLVN+Q  F K+ERK
Sbjct: 480 LAEIIEKSMWVFYEFLVSDKDDAKNILKCNRKHQIELQNSENPQLLLVNVQDHFHKSERK 539

Query: 481 LKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQ 515
           +KDL+  + RCS EKLGKQEEAGLSYSL+LLIAQ
Sbjct: 540 VKDLLSRNKRCSSEKLGKQEEAGLSYSLMLLIAQ 560

BLAST of HG10015340 vs. TAIR 10
Match: AT1G69610.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 204.1 bits (518), Expect = 2.8e-52
Identity = 136/337 (40.36%), Postives = 209/337 (62.02%), Query Frame = 0

Query: 189 DHQDIVHQLEMELKNSRTGGLPTIFEE-ETETAETISEKF-----KYEEVMGEIQKVYRT 248
           +H D++ +L+ EL+ +RTGGL TI EE ET   E    K      ++++ + EI KVY+ 
Sbjct: 262 EHSDVIEKLKTELRTARTGGLCTILEESETPLQELKPLKIEPKPDQHKDRIAEIHKVYKN 321

Query: 249 YTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVS--AQNSHNAPLWLGKARRLGAD 308
           Y  KM KLDV+++QTMHSI  L    L     P ++     ++S +  +W  K   L  D
Sbjct: 322 YAVKMRKLDVIDSQTMHSISLLK---LKDSSKPSRNTDKPPKSSLHQNIWPFKKHTLECD 381

Query: 309 PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRH-YNQVASEFQLF 368
           P    +++  RD E VYVGQVCLSWE+L+WQ  K ++   +DSQ   + YN VA EFQLF
Sbjct: 382 PSERLVKEASRDFETVYVGQVCLSWEMLRWQYDKVLE---FDSQVTTYQYNLVAGEFQLF 441

Query: 369 QVMLKRFMGEERFQ-GNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSS 428
           QV+L+RF+  E FQ  +RVE Y++NR  F++ L +P ++DD S++ + R   + E A  +
Sbjct: 442 QVLLQRFVENEPFQNSSRVETYLKNRRHFQNFLQIPLVRDDRSSKKKCR--YEGEFAVKT 501

Query: 429 DFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAE 488
           + + EII +SM VF+EFL +DKD+  S++K++ + Q+  Q++ + + LL +I+   QK E
Sbjct: 502 EMLREIIRESMSVFWEFLCADKDEFTSMMKVSHQTQVSPQDSLDLE-LLTDIRTHLQKKE 561

Query: 489 RKLKDLMRGSNRCSVEKLGKQE-EAGLSYSLILLIAQ 515
           +KLK++ R S  C V+KL K E ++ +     LLIA+
Sbjct: 562 KKLKEIQR-SQSCIVKKLKKNESKSSIGVKDELLIAK 588

BLAST of HG10015340 vs. TAIR 10
Match: AT5G39785.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 198.7 bits (504), Expect = 1.2e-50
Identity = 144/447 (32.21%), Postives = 243/447 (54.36%), Query Frame = 0

Query: 91  SLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKE--IKAEENKQLDSLN 150
           S  + +++ L E+  + SDS+    S   +       + DS   E  +K  +N++ D+  
Sbjct: 128 SFKKKKIRFLTEEDFLESDSDFVDSSQTFTSNDEDGFLSDSDFAETSLKKGQNRKSDNSG 187

Query: 151 NLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEMELKNSRT-G 210
           + +  EEE  + +                         L +HQD++ QL+ME+K  +  G
Sbjct: 188 SGSDSEEEEEEDTN--------------------GFESLWEHQDLIEQLKMEMKKVKAIG 247

Query: 211 GLPTIFEEETETAE-------------TISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLN 270
           GL TI EEE E  +                +KFK+ + +GE+ K +R+Y E+M KLD+L+
Sbjct: 248 GLTTILEEEEEDDDCPKIMEDLKPWRIEEEKKFKHVDTIGEVHKFHRSYRERMRKLDILS 307

Query: 271 NQTMHSIVYLNSAGLLQLQYPLQSVSAQNSH------------NAPLWLGKARRLGADPR 330
            Q  +++      GLLQ + P Q+ S   S+            N  LW  KA++   +P 
Sbjct: 308 FQKSYAL------GLLQSKSPQQATSTLGSNPSQTSFSSVFSVNIRLW--KAKKSEIEPM 367

Query: 331 LEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVM 390
           ++F++++  ++E VYVGQ+CLSWEIL WQ  K+I+L   D  G+R YN+VA EFQ FQV+
Sbjct: 368 VQFVKEIQGELENVYVGQMCLSWEILHWQYEKAIELLESDVYGSRRYNEVAGEFQQFQVL 427

Query: 391 LKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEG---REWEDHED-AFSS 450
           L+RF+  E F+  RV++Y++ RCV R+LL +P I++DG+ + +    R++E++ D    S
Sbjct: 428 LQRFLENEPFEEPRVQHYIKRRCVLRNLLQIPVIREDGNKDKKNGRRRDYEENNDGVIKS 487

Query: 451 DFVTEIIEKSMWVFYEFLLSDKDDVKSI--LKMNRKYQIELQNTENPQLL--LVNIQAQF 502
           D + EI+E+++ +F+ F+  DK    SI   K   K QIE  + E+ + L     +++Q 
Sbjct: 488 DQLVEIMEETIRLFWRFVRCDK-LTSSIHDQKSRTKSQIEPDHEEDSEDLEMFAEVKSQL 544

BLAST of HG10015340 vs. TAIR 10
Match: AT5G39785.2 (Protein of unknown function (DUF1666) )

HSP 1 Score: 194.9 bits (494), Expect = 1.7e-49
Identity = 144/448 (32.14%), Postives = 244/448 (54.46%), Query Frame = 0

Query: 91  SLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKE--IKAEENKQLDSLN 150
           S  + +++ L E+  + SDS+    S   +       + DS   E  +K  +N++ D+  
Sbjct: 128 SFKKKKIRFLTEEDFLESDSDFVDSSQTFTSNDEDGFLSDSDFAETSLKKGQNRKSDNSG 187

Query: 151 NLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEMELKNSRT-G 210
           + +  EEE  + +                         L +HQD++ QL+ME+K  +  G
Sbjct: 188 SGSDSEEEEEEDTN--------------------GFESLWEHQDLIEQLKMEMKKVKAIG 247

Query: 211 GLPTIFEEETETAE-------------TISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLN 270
           GL TI EEE E  +                +KFK+ + +GE+ K +R+Y E+M KLD+L+
Sbjct: 248 GLTTILEEEEEDDDCPKIMEDLKPWRIEEEKKFKHVDTIGEVHKFHRSYRERMRKLDILS 307

Query: 271 NQTMHSIVYLNSAGLLQLQYPLQSVSAQNSH------------NAPLWLGKARRLGADPR 330
            Q  +++      GLLQ + P Q+ S   S+            N  LW  KA++   +P 
Sbjct: 308 FQKSYAL------GLLQSKSPQQATSTLGSNPSQTSFSSVFSVNIRLW--KAKKSEIEPM 367

Query: 331 LEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVM 390
           ++F++++  ++E VYVGQ+CLSWEIL WQ  K+I+L   D  G+R YN+VA EFQ FQV+
Sbjct: 368 VQFVKEIQGELENVYVGQMCLSWEILHWQYEKAIELLESDVYGSRRYNEVAGEFQQFQVL 427

Query: 391 LKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEG---REWEDHED-AFSS 450
           L+RF+  E F+  RV++Y++ RCV R+LL +P I++DG+ + +    R++E++ D    S
Sbjct: 428 LQRFLENEPFEEPRVQHYIKRRCVLRNLLQIPVIREDGNKDKKNGRRRDYEENNDGVIKS 487

Query: 451 DFVTEIIEKSMWVFYEFLLSDKDDVKSI--LKMNRKYQIELQNTENPQLL--LVNIQAQF 502
           D + EI+E+++ +F+ F+  DK    SI   K   K QIE  + E+ + L     +++Q 
Sbjct: 488 DQLVEIMEETIRLFWRFVRCDK-LTSSIHDQKSRTKSQIEPDHEEDSEDLEMFAEVKSQL 545

BLAST of HG10015340 vs. TAIR 10
Match: AT3G01175.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 72.0 bits (175), Expect = 1.6e-12
Identity = 38/88 (43.18%), Postives = 52/88 (59.09%), Query Frame = 0

Query: 307 RDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRF 366
           + + +D+ELVYV QVCLSWE LQ Q     D         R  + ++ EFQ FQV+L+RF
Sbjct: 309 QSMKKDLELVYVAQVCLSWEALQHQYILVRDSSNPADSRGRFDDDISREFQNFQVLLERF 368

Query: 367 MGEERFQGNRVENYVQNRCVFRSLLLVP 395
           + +ER +G RV ++VQ R    S   VP
Sbjct: 369 LEDERCEGKRVLSFVQRRFELISFFQVP 396

BLAST of HG10015340 vs. TAIR 10
Match: AT3G20260.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 68.2 bits (165), Expect = 2.4e-11
Identity = 80/324 (24.69%), Postives = 129/324 (39.81%), Query Frame = 0

Query: 189 DHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISE------------KFKYEEVMGE- 248
           D   I ++++  LK  R      +  EE E  E  S               ++ +V+ E 
Sbjct: 54  DDDFITNEVKRRLKELRRNSFMVLIPEEEEEEEEESYLDEDDDDGEDKCSSEWRDVVAEG 113

Query: 249 ------IQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPL 308
                    VY  Y E+M   D L++Q +         G+          SA    ++P 
Sbjct: 114 LQWWGGFDAVYEKYCERMLFFDRLSSQQLKE----TGIGIAPSPSTPSPRSASKKLSSPF 173

Query: 309 WLGKARRLGA-DPRLEFIR-----DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDS 368
                ++    +  +E ++     D  +D+E  YV Q+CL+WE L  Q  +   L     
Sbjct: 174 RCLSLKKFDVPEEDIEHLQPTEVDDPYQDLETAYVAQLCLTWEALHCQYTQLSHLISCQP 233

Query: 369 QGNRHYNQVASEFQLFQVMLKRFMGEERF-QGNRVENYVQNRCVFRSLLLVPPIKDDGSA 428
           +    YN  A  FQ F V+L+R++  E F QG+R E Y + R     LL  P I+     
Sbjct: 234 ETPTCYNHTAQLFQQFLVLLQRYIENEPFEQGSRSELYARARNAMPKLLQAPKIQGSDKK 293

Query: 429 EAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTEN 487
           E E    +D      +D + ++IE S+  F  FL  DK      + +   +     N+  
Sbjct: 294 EME----KDTGFMVLADDLIKVIESSILTFNVFLKMDKKKPNGGIHLFGNHNNNHVNSTT 353

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038891903.11.2e-24687.16uncharacterized protein LOC120081255 [Benincasa hispida][more]
XP_011655680.12.6e-21779.57uncharacterized protein LOC105435571 isoform X1 [Cucumis sativus][more]
XP_008446346.14.8e-21679.38PREDICTED: uncharacterized protein LOC103489114 isoform X1 [Cucumis melo][more]
XP_023541276.12.0e-20976.07uncharacterized protein LOC111801499 isoform X2 [Cucurbita pepo subsp. pepo][more]
XP_023541275.12.0e-20976.07uncharacterized protein LOC111801499 isoform X1 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KQM12.1e-21777.44Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G605070 PE=4 SV=1[more]
A0A1S3BFI62.3e-21679.38uncharacterized protein LOC103489114 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1G1711.4e-20575.34uncharacterized protein LOC111449746 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1G1861.4e-20575.34uncharacterized protein LOC111449746 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1HWY01.9e-20575.49uncharacterized protein LOC111467411 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT1G69610.12.8e-5240.36Protein of unknown function (DUF1666) [more]
AT5G39785.11.2e-5032.21Protein of unknown function (DUF1666) [more]
AT5G39785.21.7e-4932.14Protein of unknown function (DUF1666) [more]
AT3G01175.11.6e-1243.18Protein of unknown function (DUF1666) [more]
AT3G20260.12.4e-1124.69Protein of unknown function (DUF1666) [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 136..157
NoneNo IPR availableCOILSCoilCoilcoord: 467..487
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 104..133
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 110..126
NoneNo IPR availablePANTHERPTHR46741OS09G0413600 PROTEINcoord: 64..508
NoneNo IPR availablePANTHERPTHR46741:SF4FINGER FYVE DOMAIN PROTEIN, PUTATIVE (DUF1666)-RELATEDcoord: 64..508
IPR012870Protein of unknown function DUF1666PFAMPF07891DUF1666coord: 311..509
e-value: 8.0E-54
score: 182.9

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10015340.1HG10015340.1mRNA