HG10008430.1 (mRNA) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10008430.1
TypemRNA
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF789)
LocationChr10: 23057679 .. 23060861 (-)
Sequence length1188
RNA-Seq ExpressionHG10008430.1
SyntenyHG10008430.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAATTATAATCAGCAAAAGCAATCGAGGAGACCTACCAAGACCGATGAAACTGAGAGCCCATCGAGTAAAGTTGTGGCTTCTACTATAAAGCCTTCTAAGCAATTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGAGGCCACAAGGCCTTCAGTTCCAGCGCAGTATTTCTCTAAGGTAAATGTTTAGAATCAAAGAAAGTTGGTTACATATAATCTTGATTGGTTTCTTCTTAAATTATAGTTAATAACCAAATTAGGATCATAATCTTGATTCGAGTTCGGTTTCTTGTGGCCCTTTAATGGTTTTCTTGGCTATTTTCTGCATTTACCAGTTTTAGAACTCTAAGTAAATGATCCCTTCCGAGAGTTCTTTGTTTTTGTTTTTCTTTTTTCCTGGGAGATTTGAATTTGTGATATTGTCTTCTTTCCAGACAACTATGAGGGATTGGAGGACTTGTGACATTGAGTTTCAACCTTATTTCATTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATATGGCGCTGGAGTTCCTTTAGTTCTTAATGGAGGTGATTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATCCAAATATATGGTGAAGCTTCTGCACTGAGACCAGATTCTAACTCCAGGTACTTGATCTCCGTTTGGAATATGTTCATCCATATCAGTACTTGAATTCTTCTGGTTGAAATAAACTATATCTGTTTTTAATGTTCTGTGAGTGTGAGGTTTATTTTACTAAGATGTATATTCTACTCTAATGTGAGTTTTCTTTTAGATGTGATCATGAATTTTTTCCACCATTCTTAACGTATGAGAACATTGGTTTTACTCTGATGGTGATTTTTTCGATGTTTCTTTATATTAGTAGGCTGGCTAGTGAGGACAGTGATCTTGACTCTTTTAGAGATACAAGCAGCGATGGAAGCATTGACTATGAGTTTGGAAAAATCTGTAACTTTTCTAGTGAACAGTGGGTTCATCACCATCTAGCTTGTGAAAACACACTGAAAATGAGAAAGACGTCTTTAACTGATGAACATAGAACGATACAAGAAGGTTTTTCGAGTGATGATGCGGATGCAGGATATCATCGAAGTGGTTTGCTCTTTCAGTTTCTTGAGCATGATCTTCCTTATCAACGTGTACCATTGGCTGATAAAGTTAGTTGGTTTCTTATTGGTTCATTTAGTAGTTATATTGTTGCATGTTTTGCTATTTCAAGAACCGTATTAGTTTACTAATATTTTTTATGTAATGACAGATATTTGAACTTGCTTTCCAATTTCCTGGTTTGAAAACGTTAAGAAGTTGTGATATCCTGTCAGCCAGCTGGGTCTCTGTAGCATGGTAAGTATTTGCGGATTGAATGTTATTACTAGTTTTGCATACCATGCTATTCATTTCCTCCACTGGAATTGTAGGTACCCTATTTACCGTATACCCACCGGTCCGACATTAAAAGATTTGGATGCTTGCTTCTTAACATATCATTCCCTTTCCACACCAGGTAATATGGCATTCTTAGTTCTCTACTGTAACTTTTTAGGTAATCTTCAAGATATGTGTTCTCTAGGCTCCATTCACTCATTTGTCGACTAAATAAACCCATTAACTTCTGCTGAGGTTACTGCCATAAGATGAATTAGAACAAGCAATCATCGCCTAAGAAGCACGGACACTTTAGTTTGGCTAGCGTGTCTGTGTCCGATACCGACACTTGTTGGACACGTACTGGACACTTGTTAGTGCAACAAATGTGTTAAGACATGCATAGAACACTTGTTGAGTAGACTAAAAAGACACATATATGACAATAATAACAACTTTTGAGAGTGAAATACATCAAGCTAAGTTTTTTAAGCATATAAATGCATCCAACCTATTTACTTAGAATTTTCTTTCGGTATAAAAATAATATATATTTTTAAAAATGTACATTTTAATAAGTGTGTCTTTGCCGTGTCGTGTTGTAGATTTTTAAAATATGGCGTGTCCCCGTGTCCGGTCGTGTCGTATTCGTGTCTCGTATTTGTATATGTATCTGTGCTTCTTAGATCATCGCCATAGTTCTTCTGACATCTCACTCTCATAAAAATGTTGTTTCTTCTTTTCCGTAACTTTTATCCTTCTTATTGATAAAGCTTGTTGTTATTAGCCAAGTATAAGTCAACCCCATATTAACTGATTGTTGAAAGAATGACCCTTTCCCTGTGACTGGCCATGTCAGAGGTAAGTTAATTCTTTTCAGGTTCGAACTGACCAATAAATTCATTGATGTTTACATATTTGGAGCTATATTGTGTTAAATCATGGATTGTGTAAGAATTAAGATGCATTATTCTCATGTGCATACAATATCTCCATATTTTGTCTAATAAACTAAATTTAGGCTTGTGTTCATTCTTCATCGTGTCCTCACCACGCTCGCTGAAAGTTTATTCAAGATTTTGAGTGACTTTTCAAATTTTGTAAAATCTGAATAACACATTAAATATTATTTTTCCACCAACTCCTTGTACATTTCATTTTGGAGCTTATTCATTGCTTCTGTGCTCATCTTTTAAGCCTCGTTTCTGCTCGAGAAATTTTTTATTACATCAGCTTATTTCTAGAATGACTCAATATTTCGTTGTTCTGACGTTTAGCTTGTTTGTTTCAAAATATAAAATATATTATCCATTTTCTGCTTCATCCAGGGCTGTTACTTACTTGAGCCTGTCCTAGTCACTAGCCACCATTGTCTACAAAAATCCAATTTTCCTACTGTTGTCAATCATCATCTATTTGGCTTGTTCTCCACAAGTGTTGATCGTTTTCGTTGCTTCACAGGTAATGGACATGGTCTGCCACCAGTAATGATATATCCAAAGGACATTGATGGTATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAGCTGAAAGGATCGATATGGGGGCAAAATGGCGTCAACGAGCATCAAACGGCAAATTCTCTCATGCAGGCAGCAGATAAGTGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGGACATACTGGAGATGA

mRNA sequence

ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAATTATAATCAGCAAAAGCAATCGAGGAGACCTACCAAGACCGATGAAACTGAGAGCCCATCGAGTAAAGTTGTGGCTTCTACTATAAAGCCTTCTAAGCAATTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGAGGCCACAAGGCCTTCAGTTCCAGCGCAGTATTTCTCTAAGACAACTATGAGGGATTGGAGGACTTGTGACATTGAGTTTCAACCTTATTTCATTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATATGGCGCTGGAGTTCCTTTAGTTCTTAATGGAGGTGATTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATCCAAATATATGGTGAAGCTTCTGCACTGAGACCAGATTCTAACTCCAGGCTGGCTAGTGAGGACAGTGATCTTGACTCTTTTAGAGATACAAGCAGCGATGGAAGCATTGACTATGAGTTTGGAAAAATCTGTAACTTTTCTAGTGAACAGTGGGTTCATCACCATCTAGCTTGTGAAAACACACTGAAAATGAGAAAGACGTCTTTAACTGATGAACATAGAACGATACAAGAAGGTTTTTCGAGTGATGATGCGGATGCAGGATATCATCGAAGTGGTTTGCTCTTTCAGTTTCTTGAGCATGATCTTCCTTATCAACGTGTACCATTGGCTGATAAAATATTTGAACTTGCTTTCCAATTTCCTGGTTTGAAAACGTTAAGAAGTTGTGATATCCTGTCAGCCAGCTGGGTCTCTGTAGCATGGTACCCTATTTACCGTATACCCACCGGTCCGACATTAAAAGATTTGGATGCTTGCTTCTTAACATATCATTCCCTTTCCACACCAGGTAATGGACATGGTCTGCCACCAGTAATGATATATCCAAAGGACATTGATGGTATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAGCTGAAAGGATCGATATGGGGGCAAAATGGCGTCAACGAGCATCAAACGGCAAATTCTCTCATGCAGGCAGCAGATAAGTGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGGACATACTGGAGATGA

Coding sequence (CDS)

ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAATTATAATCAGCAAAAGCAATCGAGGAGACCTACCAAGACCGATGAAACTGAGAGCCCATCGAGTAAAGTTGTGGCTTCTACTATAAAGCCTTCTAAGCAATTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGAGGCCACAAGGCCTTCAGTTCCAGCGCAGTATTTCTCTAAGACAACTATGAGGGATTGGAGGACTTGTGACATTGAGTTTCAACCTTATTTCATTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATATGGCGCTGGAGTTCCTTTAGTTCTTAATGGAGGTGATTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATCCAAATATATGGTGAAGCTTCTGCACTGAGACCAGATTCTAACTCCAGGCTGGCTAGTGAGGACAGTGATCTTGACTCTTTTAGAGATACAAGCAGCGATGGAAGCATTGACTATGAGTTTGGAAAAATCTGTAACTTTTCTAGTGAACAGTGGGTTCATCACCATCTAGCTTGTGAAAACACACTGAAAATGAGAAAGACGTCTTTAACTGATGAACATAGAACGATACAAGAAGGTTTTTCGAGTGATGATGCGGATGCAGGATATCATCGAAGTGGTTTGCTCTTTCAGTTTCTTGAGCATGATCTTCCTTATCAACGTGTACCATTGGCTGATAAAATATTTGAACTTGCTTTCCAATTTCCTGGTTTGAAAACGTTAAGAAGTTGTGATATCCTGTCAGCCAGCTGGGTCTCTGTAGCATGGTACCCTATTTACCGTATACCCACCGGTCCGACATTAAAAGATTTGGATGCTTGCTTCTTAACATATCATTCCCTTTCCACACCAGGTAATGGACATGGTCTGCCACCAGTAATGATATATCCAAAGGACATTGATGGTATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAGCTGAAAGGATCGATATGGGGGCAAAATGGCGTCAACGAGCATCAAACGGCAAATTCTCTCATGCAGGCAGCAGATAAGTGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGGACATACTGGAGATGA

Protein sequence

MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
Homology
BLAST of HG10008430.1 vs. NCBI nr
Match: XP_038897708.1 (uncharacterized protein LOC120085653 [Benincasa hispida])

HSP 1 Score: 773.1 bits (1995), Expect = 1.2e-219
Identity = 373/395 (94.43%), Postives = 382/395 (96.71%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGT LQFGGIKGEDRFYIP+RARKNYNQQK SRRPTKTDETESPS KV+ASTIKPSKQL
Sbjct: 1   MLGTELQFGGIKGEDRFYIPIRARKNYNQQKPSRRPTKTDETESPSCKVLASTIKPSKQL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVLNGGDSV+QYYVPYLSGIQIYGEASA+R DSN RL SEDSDLDS RDTSS+GSID
Sbjct: 121 GVPLVLNGGDSVIQYYVPYLSGIQIYGEASAMRSDSNFRLVSEDSDLDSSRDTSSNGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           YEFGKICNFSSEQW HHHLACENTLKMRKTSLTDEHRTIQ+GFSSDD DAGY RSGLLFQ
Sbjct: 181 YEFGKICNFSSEQWAHHHLACENTLKMRKTSLTDEHRTIQKGFSSDDGDAGYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEH 360
           DACFLTYHSLST GNGHGLPPVMIYPKDID IAKVSLPVFGLASYKLKGSIWGQNG+NEH
Sbjct: 301 DACFLTYHSLSTSGNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGINEH 360

Query: 361 QTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           QTANSLMQAADKWLRSLQV+QPDFQFFASHGTYWR
Sbjct: 361 QTANSLMQAADKWLRSLQVIQPDFQFFASHGTYWR 395

BLAST of HG10008430.1 vs. NCBI nr
Match: XP_008453426.1 (PREDICTED: uncharacterized protein LOC103494138 isoform X1 [Cucumis melo] >KAA0058094.1 uncharacterized protein E6C27_scaffold274G004020 [Cucumis melo var. makuwa] >TYK28447.1 uncharacterized protein E5676_scaffold629G00790 [Cucumis melo var. makuwa])

HSP 1 Score: 713.4 bits (1840), Expect = 1.1e-201
Identity = 349/397 (87.91%), Postives = 363/397 (91.44%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES SSKVV  T KP ++L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DSN RLA EDSDLDS RDTSSDGSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           Y+ GK  N S EQW H HLACEN  KMRKTSLTDE + +QEGF SDD DAGY RSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDIL ASWVSVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GN H LPPVM+YPKDID I K+SLPVFG+ASYKLKGSIWGQNG+N
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           +HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of HG10008430.1 vs. NCBI nr
Match: XP_031736215.1 (uncharacterized protein LOC101215266 [Cucumis sativus] >KGN63839.1 hypothetical protein Csa_014275 [Cucumis sativus])

HSP 1 Score: 705.7 bits (1820), Expect = 2.3e-199
Identity = 342/397 (86.15%), Postives = 362/397 (91.18%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGT LQFGGIKGEDRFY+PVRARKNYNQQK SR PTKTDETES SSKVV  T KP ++L
Sbjct: 1   MLGTTLQFGGIKGEDRFYVPVRARKNYNQQKPSRNPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DS+ RLA EDSDLDS RDTSSDGSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSHVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           ++ GK  NFS EQW H HLACEN LKMRKTSLTDEH+ +QEGF SDD DAGY RS LLFQ
Sbjct: 181 HDLGKSFNFSREQWDHPHLACENMLKMRKTSLTDEHKMVQEGFLSDDGDAGYPRSSLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIFELA+QFPGLKTL SCDIL ASWVSVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLSSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GN H LPP+M+YPKDID I K+SLPVFG+ASYK+KGSIWGQNG++
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPIMVYPKDIDDITKISLPVFGMASYKVKGSIWGQNGIS 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           +HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of HG10008430.1 vs. NCBI nr
Match: XP_023516127.1 (uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 704.1 bits (1816), Expect = 6.7e-199
Identity = 345/397 (86.90%), Postives = 361/397 (90.93%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSSKVVAST  PSK L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS RDTSS+GSID
Sbjct: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           YEFGK CN S EQWVHHHLACE+ + MRKTSL DEH T QEGFSSDD DA Y RSGLLFQ
Sbjct: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V 
Sbjct: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           EHQ ANSLMQAA+KWLR LQV QPDFQFFAS+ TYWR
Sbjct: 361 EHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 397

BLAST of HG10008430.1 vs. NCBI nr
Match: XP_022921943.1 (uncharacterized protein LOC111430050 [Cucurbita moschata] >KAG7023306.1 hypothetical protein SDJN02_14331 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 704.1 bits (1816), Expect = 6.7e-199
Identity = 344/397 (86.65%), Postives = 361/397 (90.93%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+VVAST  PSK L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS +DTSS+GSID
Sbjct: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           YEFGK CN S EQWVHHHLACE+ + MRKTSL DEH T QEGFSSDD DA Y RSGLLFQ
Sbjct: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V 
Sbjct: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           EHQ ANSLMQAA+KWLR LQV QPDFQFFASH TYWR
Sbjct: 361 EHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR 397

BLAST of HG10008430.1 vs. ExPASy TrEMBL
Match: A0A5A7USF1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold629G00790 PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 5.3e-202
Identity = 349/397 (87.91%), Postives = 363/397 (91.44%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES SSKVV  T KP ++L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DSN RLA EDSDLDS RDTSSDGSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           Y+ GK  N S EQW H HLACEN  KMRKTSLTDE + +QEGF SDD DAGY RSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDIL ASWVSVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GN H LPPVM+YPKDID I K+SLPVFG+ASYKLKGSIWGQNG+N
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           +HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of HG10008430.1 vs. ExPASy TrEMBL
Match: A0A1S3BX10 (uncharacterized protein LOC103494138 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494138 PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 5.3e-202
Identity = 349/397 (87.91%), Postives = 363/397 (91.44%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES SSKVV  T KP ++L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DSN RLA EDSDLDS RDTSSDGSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           Y+ GK  N S EQW H HLACEN  KMRKTSLTDE + +QEGF SDD DAGY RSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDIL ASWVSVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GN H LPPVM+YPKDID I K+SLPVFG+ASYKLKGSIWGQNG+N
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           +HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of HG10008430.1 vs. ExPASy TrEMBL
Match: A0A0A0LS49 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G024240 PE=4 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 1.1e-199
Identity = 342/397 (86.15%), Postives = 362/397 (91.18%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGT LQFGGIKGEDRFY+PVRARKNYNQQK SR PTKTDETES SSKVV  T KP ++L
Sbjct: 1   MLGTTLQFGGIKGEDRFYVPVRARKNYNQQKPSRNPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DS+ RLA EDSDLDS RDTSSDGSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSHVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           ++ GK  NFS EQW H HLACEN LKMRKTSLTDEH+ +QEGF SDD DAGY RS LLFQ
Sbjct: 181 HDLGKSFNFSREQWDHPHLACENMLKMRKTSLTDEHKMVQEGFLSDDGDAGYPRSSLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIFELA+QFPGLKTL SCDIL ASWVSVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLSSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GN H LPP+M+YPKDID I K+SLPVFG+ASYK+KGSIWGQNG++
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPIMVYPKDIDDITKISLPVFGMASYKVKGSIWGQNGIS 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           +HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of HG10008430.1 vs. ExPASy TrEMBL
Match: A0A6J1E577 (uncharacterized protein LOC111430050 OS=Cucurbita moschata OX=3662 GN=LOC111430050 PE=4 SV=1)

HSP 1 Score: 704.1 bits (1816), Expect = 3.2e-199
Identity = 344/397 (86.65%), Postives = 361/397 (90.93%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+VVAST  PSK L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSEVVASTTTPSKPL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS +DTSS+GSID
Sbjct: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSKDTSSEGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           YEFGK CN S EQWVHHHLACE+ + MRKTSL DEH T QEGFSSDD DA Y RSGLLFQ
Sbjct: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V 
Sbjct: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           EHQ ANSLMQAA+KWLR LQV QPDFQFFASH TYWR
Sbjct: 361 EHQMANSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR 397

BLAST of HG10008430.1 vs. ExPASy TrEMBL
Match: A0A6J1JE68 (uncharacterized protein LOC111484983 OS=Cucurbita maxima OX=3661 GN=LOC111484983 PE=4 SV=1)

HSP 1 Score: 698.7 bits (1802), Expect = 1.4e-197
Identity = 343/397 (86.40%), Postives = 359/397 (90.43%), Query Frame = 0

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQL 60
           MLGTALQFGGIKGEDRFYIPVRARK YNQQK SRRPTKTDETE+PSSKVVAST  PSK L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKIYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60

Query: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSID 180
           GVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS RDTSS+GSID
Sbjct: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180

Query: 181 YEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQ 240
           YEFGK CN S EQWVHHHLAC++ L +RKTSL DEH T QEGFSSDD DA Y RSGLLFQ
Sbjct: 181 YEFGKSCNLSREQWVHHHLACDSALTIRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240

Query: 241 FLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL 300
           FLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVN 360
           DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V 
Sbjct: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360

Query: 361 EHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR 396
           E+Q  NSLMQAA+KWLR LQV QPDFQFFASH TYWR
Sbjct: 361 ENQMENSLMQAAEKWLRRLQVNQPDFQFFASHMTYWR 397

BLAST of HG10008430.1 vs. TAIR 10
Match: AT2G01260.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 332.0 bits (850), Expect = 6.4e-91
Identity = 198/391 (50.64%), Postives = 238/391 (60.87%), Query Frame = 0

Query: 1   MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQ 60
           MLG   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS    S  K   +
Sbjct: 1   MLGAGFQLTRGRHGDDPFYTSAKTRR-ANQRIDQLRRAQSDVSNVPSS--APSPHKQQLE 60

Query: 61  LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCD--IEFQPYFILNDLWESFKEWSA 120
            +  S SNL+RFLE+  PSVPAQ+ SKT +R+ R  D   +  PYF+L D+W+SF EWSA
Sbjct: 61  PSDLSSSNLDRFLESVTPSVPAQFLSKTLLRERRADDDYNKLVPYFVLGDIWDSFAEWSA 120

Query: 121 YGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSD 180
           YG GVPLVLN   D V+QYYVP LS IQIY  + AL     SR   + SD D FRD+SSD
Sbjct: 121 YGTGVPLVLNNNKDRVIQYYVPSLSAIQIYAHSHALDSSLKSRRPGDSSDSD-FRDSSSD 180

Query: 181 GSIDYEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSG 240
            S D         S  + V   + C         SL D+H   QE  SSDD +    +  
Sbjct: 181 VSSD---------SDSERVSARVDC--------ISLRDQH---QEDSSSDDGEPLGSQGR 240

Query: 241 LLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPT 300
           L+F++LE DLPY R P ADK+ +LA QFP L TLRSCD+L +SW SVAWYPIYRIPTGPT
Sbjct: 241 LMFEYLERDLPYIREPFADKVLDLAAQFPELMTLRSCDLLRSSWFSVAWYPIYRIPTGPT 300

Query: 301 LKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNG 360
           LKDLDACFLTYHSL T   G G    M   +  +   K+SLPVFGLASYK +GS+W   G
Sbjct: 301 LKDLDACFLTYHSLHTSFGGEGSEQSMSLTQPRES-EKMSLPVFGLASYKFRGSLWTPIG 360

Query: 361 VNEHQTANSLMQAADKWLRSLQVVQPDFQFF 388
            +EHQ  NSL QAADKWL S  V  PDF FF
Sbjct: 361 GSEHQLVNSLFQAADKWLHSCHVSHPDFLFF 366

BLAST of HG10008430.1 vs. TAIR 10
Match: AT1G15030.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 329.3 bits (843), Expect = 4.2e-90
Identity = 180/327 (55.05%), Postives = 218/327 (66.67%), Query Frame = 0

Query: 64  SKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGV 123
           S SN+ERFL++  PSVPA Y SKT +R+    D+E Q PYF+L D+WESF EWSAYG GV
Sbjct: 45  SSSNVERFLDSVTPSVPAHYLSKTIVRERGGSDVESQVPYFLLGDVWESFAEWSAYGIGV 104

Query: 124 PLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDY 183
           PL LN   D V QYYVP LSGIQ+Y +  AL     +R   E+S+ D FRD+SS+GS   
Sbjct: 105 PLTLNNNKDRVFQYYVPSLSGIQVYADVDALTSSLQARRQGEESESD-FRDSSSEGSSSE 164

Query: 184 EFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQF 243
               +C +S EQ          + +M K SL  EH   QE  SSDD +    +  L+F++
Sbjct: 165 SERGLC-YSKEQ---------ISARMDKLSLRKEH---QEDSSSDDGEPLSSQGRLIFEY 224

Query: 244 LEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLD 303
           LE DLPY R P ADK+ +LA +FP LKTLRSCD+L +SW SVAWYPIY+IPTGPTLKDLD
Sbjct: 225 LERDLPYVREPFADKMSDLASRFPELKTLRSCDLLPSSWFSVAWYPIYKIPTGPTLKDLD 284

Query: 304 ACFLTYHSLSTPGNGHGLPP-VMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEH 363
           ACFLTYHSL TP  G G+    M   +  + + K+ LPVFGLASYKL+GS+W   G + H
Sbjct: 285 ACFLTYHSLHTPFQGPGVTTGSMHVVQPRESVEKMELPVFGLASYKLRGSVWTSFGGSGH 344

Query: 364 QTANSLMQAADKWLRSLQVVQPDFQFF 388
           Q ANSL QAAD WLR  QV  PDF FF
Sbjct: 345 QLANSLFQAADNWLRLRQVNHPDFIFF 357

BLAST of HG10008430.1 vs. TAIR 10
Match: AT4G16100.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 304.7 bits (779), Expect = 1.1e-82
Identity = 183/413 (44.31%), Postives = 243/413 (58.84%), Query Frame = 0

Query: 11  IKGEDRFYIPVRARKNYNQQKQSR------------------RPTKTDETE-------SP 70
           I+GE+RFY P   RK   ++++ R                  R  K +E E       S 
Sbjct: 7   IRGENRFYNPPPMRKLQQEREKKRLEAEEIEKEKKKAKEILDRKIKVEEKEIKQPEECST 66

Query: 71  SSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFIL 130
           S   V S +  +   T  + SNL RFL+ T P V  Q+   T+ + WRT + E++PYF+L
Sbjct: 67  SDCSVPSRVSSTTTTTGTTSSNLGRFLDCTTPIVSTQHLPLTSSKGWRTREPEYRPYFLL 126

Query: 131 NDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDS 190
           NDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQ+Y + S  R  +  R   E+S
Sbjct: 127 NDLWDSFEEWSAYGVGVPLLLNGIDSVVQYYVPYLSGIQLYEDPS--RACTTRRRVGEES 186

Query: 191 DLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSS 250
           D DS RD SSDGS D                    C    +    +  +E   I  G SS
Sbjct: 187 DGDSPRDMSSDGSND--------------------CRELSQNLYRASLEEKPCI--GSSS 246

Query: 251 DDADAGYHRSG-LLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVA 310
           D+++A  +  G L+F++LE  +P+ R PL DKI  L+ QFP L+T RSCD+  +SWVSVA
Sbjct: 247 DESEASSNSPGELVFEYLEGAMPFGREPLTDKISNLSSQFPALRTYRSCDLSPSSWVSVA 306

Query: 311 WYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLAS 370
           WYPIYRIP G +L++LDACFLT+HSLSTP  G          K +   AK+ LP FGLAS
Sbjct: 307 WYPIYRIPLGQSLQNLDACFLTFHSLSTPCRGTSNEEGQSSSKSV-ASAKLPLPTFGLAS 366

Query: 371 YKLKGSIWG-QNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASH-GTYWR 396
           YK K S W  ++ V+E+Q   +L++ A++WLR L+V+ PDF+ F SH G+ WR
Sbjct: 367 YKFKLSEWSPESDVDENQRVGTLLRTAEEWLRRLKVILPDFRHFISHSGSAWR 394

BLAST of HG10008430.1 vs. TAIR 10
Match: AT5G49220.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 267.3 bits (682), Expect = 1.9e-71
Identity = 171/436 (39.22%), Postives = 231/436 (52.98%), Query Frame = 0

Query: 3   GTALQFGGIKGEDRFYIPVRARKN------YNQQKQSRRPTKTDETESPSSKVVASTIKP 62
           G ++    I+GE+RFY P   R+         Q ++ +R    DE      +  A+T+ P
Sbjct: 6   GVSIARTAIRGENRFYNPPPMRRMQQEAQLQQQIREKQRRDDEDEVLMDKERRKAATVAP 65

Query: 63  -------------SKQLTPQSK-------------------SNLERFLEATRPSVPAQYF 122
                        S+ +   S+                   SNL+RFLE T P VPA+ F
Sbjct: 66  RTTRKGLGVSESKSRVVVSGSEVCAGSSDSSSGSGRVLSDGSNLDRFLEHTTPVVPARLF 125

Query: 123 SKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPY 182
              +  + +T + +   YF+L DLWESF EWSAYGAGV     PL ++G DS VQYYVPY
Sbjct: 126 PMRSRWELKTRESDCHTYFVLEDLWESFAEWSAYGAGVPLEMHPLEMHGNDSTVQYYVPY 185

Query: 183 LSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHL 242
           LSGIQ+Y     L+   N    +E S   S    S    +D   G++             
Sbjct: 186 LSGIQLY--VDPLKKPRNPVGDNEGSSEGS--SNSRTLPVDLSVGEL------------- 245

Query: 243 ACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFE 302
              N + ++  S+T          SS +A+    +  LLF++LE++ P+ R PLA+KI +
Sbjct: 246 ---NRISLKDQSITG-------SLSSGEAEISNPQGRLLFEYLEYEPPFGREPLANKISD 305

Query: 303 LAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGL 362
           LA + P L T RSCD+L +SWVSV+WYPIYRIP GPTL++LDACFLT+HSLST       
Sbjct: 306 LASRVPELMTYRSCDLLPSSWVSVSWYPIYRIPVGPTLQNLDACFLTFHSLST-----AP 365

Query: 363 PPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQV 396
           P   +   D     K+ LP FGLASYKLK S+W QN + E Q   SL+QAADKWL+ LQV
Sbjct: 366 PQSAMGCSDSQPSTKLPLPTFGLASYKLKVSVWNQNRIQESQKMTSLLQAADKWLKRLQV 409

BLAST of HG10008430.1 vs. TAIR 10
Match: AT2G01260.2 (Protein of unknown function (DUF789) )

HSP 1 Score: 263.1 bits (671), Expect = 3.7e-70
Identity = 160/320 (50.00%), Postives = 194/320 (60.62%), Query Frame = 0

Query: 1   MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQ 60
           MLG   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS    S  K   +
Sbjct: 1   MLGAGFQLTRGRHGDDPFYTSAKTRR-ANQRIDQLRRAQSDVSNVPSS--APSPHKQQLE 60

Query: 61  LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCD--IEFQPYFILNDLWESFKEWSA 120
            +  S SNL+RFLE+  PSVPAQ+ SKT +R+ R  D   +  PYF+L D+W+SF EWSA
Sbjct: 61  PSDLSSSNLDRFLESVTPSVPAQFLSKTLLRERRADDDYNKLVPYFVLGDIWDSFAEWSA 120

Query: 121 YGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSD 180
           YG GVPLVLN   D V+QYYVP LS IQIY  + AL     SR   + SD D FRD+SSD
Sbjct: 121 YGTGVPLVLNNNKDRVIQYYVPSLSAIQIYAHSHALDSSLKSRRPGDSSDSD-FRDSSSD 180

Query: 181 GSIDYEFGKICNFSSEQWVHHHLACENTLKMRKTSLTDEHRTIQEGFSSDDADAGYHRSG 240
            S D         S  + V   + C         SL D+H   QE  SSDD +    +  
Sbjct: 181 VSSD---------SDSERVSARVDC--------ISLRDQH---QEDSSSDDGEPLGSQGR 240

Query: 241 LLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPT 300
           L+F++LE DLPY R P ADK+ +LA QFP L TLRSCD+L +SW SVAWYPIYRIPTGPT
Sbjct: 241 LMFEYLERDLPYIREPFADKVLDLAAQFPELMTLRSCDLLRSSWFSVAWYPIYRIPTGPT 296

Query: 301 LKDLDACFLTYHSLSTPGNG 317
           LKDLDACFLTYHSL T   G
Sbjct: 301 LKDLDACFLTYHSLHTSFGG 296

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038897708.11.2e-21994.43uncharacterized protein LOC120085653 [Benincasa hispida][more]
XP_008453426.11.1e-20187.91PREDICTED: uncharacterized protein LOC103494138 isoform X1 [Cucumis melo] >KAA00... [more]
XP_031736215.12.3e-19986.15uncharacterized protein LOC101215266 [Cucumis sativus] >KGN63839.1 hypothetical ... [more]
XP_023516127.16.7e-19986.90uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo][more]
XP_022921943.16.7e-19986.65uncharacterized protein LOC111430050 [Cucurbita moschata] >KAG7023306.1 hypothet... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5A7USF15.3e-20287.91Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3BX105.3e-20287.91uncharacterized protein LOC103494138 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A0A0LS491.1e-19986.15Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G024240 PE=4 SV=1[more]
A0A6J1E5773.2e-19986.65uncharacterized protein LOC111430050 OS=Cucurbita moschata OX=3662 GN=LOC1114300... [more]
A0A6J1JE681.4e-19786.40uncharacterized protein LOC111484983 OS=Cucurbita maxima OX=3661 GN=LOC111484983... [more]
Match NameE-valueIdentityDescription
AT2G01260.16.4e-9150.64Protein of unknown function (DUF789) [more]
AT1G15030.14.2e-9055.05Protein of unknown function (DUF789) [more]
AT4G16100.11.1e-8244.31Protein of unknown function (DUF789) [more]
AT5G49220.11.9e-7139.22Protein of unknown function (DUF789) [more]
AT2G01260.23.7e-7050.00Protein of unknown function (DUF789) [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008507Protein of unknown function DUF789PFAMPF05623DUF789coord: 66..387
e-value: 5.3E-102
score: 341.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 23..55
NoneNo IPR availablePANTHERPTHR31343T15D22.8coord: 1..395
NoneNo IPR availablePANTHERPTHR31343:SF5DUF789 FAMILY PROTEINcoord: 1..395

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
HG10008430HG10008430gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
HG10008430.1-cdsHG10008430.1-cds-Chr10:23057679..23057926CDS
HG10008430.1-cdsHG10008430.1-cds-Chr10:23059249..23059334CDS
HG10008430.1-cdsHG10008430.1-cds-Chr10:23059409..23059494CDS
HG10008430.1-cdsHG10008430.1-cds-Chr10:23059596..23059887CDS
HG10008430.1-cdsHG10008430.1-cds-Chr10:23060130..23060347CDS
HG10008430.1-cdsHG10008430.1-cds-Chr10:23060604..23060861CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
HG10008430.1HG10008430.1-proteinpolypeptide