HG10014086.1 (mRNA) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10014086.1
Type: mRNA
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pyridoxal phosphate (PLP)-dependent transferases superfamily protein
Location: Chr02: 7444141 .. 7446948 (-)
Sequence length: 2808
RNA-Seq Expression: HG10014086.1
Synteny: HG10014086.1
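
The sequence length follows from the Location coordinates. A minimal Python sketch, assuming the coordinates are 1-based and inclusive at both ends (the reported length of 2808 is consistent with that reading); the function name `feature_length` is illustrative and not part of any database API:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field (Chr02, minus strand).
# The strand affects orientation, not length.
print(feature_length(7444141, 7446948))  # 2808
```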
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACAAAAAAACCAGAAGGCGACACGGTTCTGGTGTAACAGAGAGGCGCAAATCCTCCATTCTCAGGCAATTGCAAGAAAACAAGCTCAGAGAGGCTCTGGAAGAAGCTTCTGAAGATGGGTCTCTCGCAAAATCTAGAGACATTGACTGTGAATCGCCAAATCAGGACAGGAATGTTCGACGGTCGAGATCTCTCGCTCGGCTTCATGCCCAAAAGGAGTTTCTACGCGCCACCGCACTTGCCGCCGACCGGACCTATTCCACGGAGGATTTTATTCCGAATCTCTTCGATGCCTTCACCAAATTCCTCACTATGTACCCAAAATTTCAGACGTCAGAAAAAATCGACCAGTTGAGATCAGAAGAATATGAGCATCTCTCAGAGTCGTTCTCGAAGGTATGTCTTGACTACTGTGGTTTTGGTTTATTTTCCCACATTCAAACACAACAATTTTGGGAGTCTTCGGCGTTTACCCTCTCTGAAATTACTGCCAATTTGAGCAACCACGCGCTATACGGCGGCGCTGAGAAGGGCACGATTGAACACGATATCAAGACTAGAATTATGGATTATCTGAACATTTCTGAAAATGAATATGGGCTTGTTTTTACAGTCAGTAGGGGATCGGCCTTTAAGCTCTTGTCTGAGTCTTACCCTTTTCATACGAATAAGAAGCTGTTGACTATGTTTGATCATGAGAGTCAATCTGTGAGTTGGATGGCTCAGAGTGCTAAAGAGAGGGGTGCAAAGGTTTACAGTGCATGGTTTAAGTGGCCAACATTGAGACTCTGTTCAAGGGAGTTGAGGAAACAGATCACAAACAAGAGGAAGAGGAAGAAGGATTCTGTTGCTGGCCTTTTTGTGTTTCCTGTTCAGTCTAGAGTTACAGGGGCAAAGTATTCTTACCAGTGGATGGCACTTGCCCAGCAGAACAATTGGCATGTATTGCTCGATGCTGGCTCGCTCGGTCCTAAGGACATGGATTCCTTGGGGCTCTCTCTCTTCAAGCCCGATTTTATCATTACATCGTTTTATCGCGTTTTCGGGTCTGATCCAACTGGGTTTGGCTGCCTGTTGATTAAGAAATCTGTTATAGGGAGTTTGCAAAACCAATCTGGGCGGACTGGTACAGGAATGGTGAGGATACTCCCCATTTTTCCACAGTATATTGGTGATTCGATTGATGGTTTGGTGGATGTCTTGGCTGGGATTGAAGATGATGAAATTAATGGTCAGGAGGATTCTGAAACTGAGAAGCATCAGGAGTCACGCATGCCTGCCTTTTCGGGTGTGTTTACGTCGAACCAGGTGAGGGATGTGTTTGAGACTGAGATGGAGCAGGATAATAATAGCTCGGACAGGGATGGGGCTAGTACCATTTTTGAGGAAGCTGAGAGCATCTCAATTGGTGAGGTTATGAAGAGTCCAATCTTCAGTGAGGACGAGTCGTCGGACAATTCATATTGGATTGATTTGGGTCAGAGTCCATTTGGTTCTGATAATTCTGGCCATTTGATCAAGCAAAAAACATGGTCACCCTTACCACCATCTTGGTTTTCTGGAAAAAGGAACACTAGGAAACGTTCACCAAAACCAGCATCTAGGTTGTTGAAAAGTCCAATGTGCGGTGATGATAAGCGGATGAATTCAAGGCACTATGAAGATTCGGTGTTGTCTTTTGATGCAGCTGTATTATCAATGTCACAGGATTTCGGCTGTGTGAAGGGGATTCCTGAAGAAGAACAATCTGGAGAGCAAGACTCTTGCTGTGGAAATGTAGGAAGTTTGAGGGATTCTCATGCTGTTAGTGAGATTCAAGAGGATTCAGAAACTGGAGAAGAATCAGGTAGGTTGAGTGTTGCATCAAATGGAACCCGACCTGCGAATCAGACTTCCGAGTTTCAGGATCTGAAGCGTTCAAATTCCACAACATCTGGAGCCTTCAAAGACCTGAAGGAAAGTGCTATAAGAAGGGAGACAGAAGGGGAATTCAGACT
CTTGGGTAGGAGGGAAAGGAGTAGATTTTCTGAACGTGGGTTCTTTGGTTTAGAAGAGGGAGATAGAGCGATAAGCATGGGTCGCCATGTATCATTTAGTGTAGAATATAATGAAAAAGAAAGTTTGAACGAGATGTTTGAGCTTGGTGAAACATCTAATGCAGCCTTTGGCAACGAGGAATCAACAAGTGATGGAGAATATGTTGATGAGCAAGAATGGGGAAGGAGGGAGTCTGAAATGATCTGTCGGCATCTTGATCATATTGATATGTTGGGCCTCAACAAGACTACCCTCCGACTAAGATACCTCATTAACTGGCTCGTTACTTCGTTACTTCAACTCCGATTACCCGGTCGAGATGATGTAGGGGTCCACCTTGTACAACTATATGGACCAAAGATCAAATACGAAAGAGGTGCTGCGATTGCTTTCAATATAAAAGAGAGCAATGGAAGAGGGCTAATACATCCGGAAGTCGTACAAAAACTGGCTGAAAATAATGGAATATCTCTTGGAGTTGGCATTCTCAGCCATGTGCGAGCGGTAGACGTTCCGAAGCAGAATTCTGGACAGTATGATCTCGAAGACATGGCGTTGTGCAAACCTATGGGTAATGGCCATAACAGGAAGAAACTATTTTTCCGAGTCGAGGTTGTCACAGCTTCCCTTGGATTCCTTACCAACTTTGATGATGTTTATAAAATGTGGGCTTTCATAGCCAAGTTTCTAAATCCATCATTTCTTGAAAACAATACTTTGTCTTCCGTCCCTGAGAGTTCGGAATCCTATCACCGGAGCCTGTTTTAG

mRNA sequence

ATGGACAAAAAAACCAGAAGGCGACACGGTTCTGGTGTAACAGAGAGGCGCAAATCCTCCATTCTCAGGCAATTGCAAGAAAACAAGCTCAGAGAGGCTCTGGAAGAAGCTTCTGAAGATGGGTCTCTCGCAAAATCTAGAGACATTGACTGTGAATCGCCAAATCAGGACAGGAATGTTCGACGGTCGAGATCTCTCGCTCGGCTTCATGCCCAAAAGGAGTTTCTACGCGCCACCGCACTTGCCGCCGACCGGACCTATTCCACGGAGGATTTTATTCCGAATCTCTTCGATGCCTTCACCAAATTCCTCACTATGTACCCAAAATTTCAGACGTCAGAAAAAATCGACCAGTTGAGATCAGAAGAATATGAGCATCTCTCAGAGTCGTTCTCGAAGGTATGTCTTGACTACTGTGGTTTTGGTTTATTTTCCCACATTCAAACACAACAATTTTGGGAGTCTTCGGCGTTTACCCTCTCTGAAATTACTGCCAATTTGAGCAACCACGCGCTATACGGCGGCGCTGAGAAGGGCACGATTGAACACGATATCAAGACTAGAATTATGGATTATCTGAACATTTCTGAAAATGAATATGGGCTTGTTTTTACAGTCAGTAGGGGATCGGCCTTTAAGCTCTTGTCTGAGTCTTACCCTTTTCATACGAATAAGAAGCTGTTGACTATGTTTGATCATGAGAGTCAATCTGTGAGTTGGATGGCTCAGAGTGCTAAAGAGAGGGGTGCAAAGGTTTACAGTGCATGGTTTAAGTGGCCAACATTGAGACTCTGTTCAAGGGAGTTGAGGAAACAGATCACAAACAAGAGGAAGAGGAAGAAGGATTCTGTTGCTGGCCTTTTTGTGTTTCCTGTTCAGTCTAGAGTTACAGGGGCAAAGTATTCTTACCAGTGGATGGCACTTGCCCAGCAGAACAATTGGCATGTATTGCTCGATGCTGGCTCGCTCGGTCCTAAGGACATGGATTCCTTGGGGCTCTCTCTCTTCAAGCCCGATTTTATCATTACATCGTTTTATCGCGTTTTCGGGTCTGATCCAACTGGGTTTGGCTGCCTGTTGATTAAGAAATCTGTTATAGGGAGTTTGCAAAACCAATCTGGGCGGACTGGTACAGGAATGGTGAGGATACTCCCCATTTTTCCACAGTATATTGGTGATTCGATTGATGGTTTGGTGGATGTCTTGGCTGGGATTGAAGATGATGAAATTAATGGTCAGGAGGATTCTGAAACTGAGAAGCATCAGGAGTCACGCATGCCTGCCTTTTCGGGTGTGTTTACGTCGAACCAGGTGAGGGATGTGTTTGAGACTGAGATGGAGCAGGATAATAATAGCTCGGACAGGGATGGGGCTAGTACCATTTTTGAGGAAGCTGAGAGCATCTCAATTGGTGAGGTTATGAAGAGTCCAATCTTCAGTGAGGACGAGTCGTCGGACAATTCATATTGGATTGATTTGGGTCAGAGTCCATTTGGTTCTGATAATTCTGGCCATTTGATCAAGCAAAAAACATGGTCACCCTTACCACCATCTTGGTTTTCTGGAAAAAGGAACACTAGGAAACGTTCACCAAAACCAGCATCTAGGTTGTTGAAAAGTCCAATGTGCGGTGATGATAAGCGGATGAATTCAAGGCACTATGAAGATTCGGTGTTGTCTTTTGATGCAGCTGTATTATCAATGTCACAGGATTTCGGCTGTGTGAAGGGGATTCCTGAAGAAGAACAATCTGGAGAGCAAGACTCTTGCTGTGGAAATGTAGGAAGTTTGAGGGATTCTCATGCTGTTAGTGAGATTCAAGAGGATTCAGAAACTGGAGAAGAATCAGGTAGGTTGAGTGTTGCATCAAATGGAACCCGACCTGCGAATCAGACTTCCGAGTTTCAGGATCTGAAGCGTTCAAATTCCACAACATCTGGAGCCTTCAAAGACCTGAAGGAAAGTGCTATAAGAAGGGAGACAGAAGGGGAATTCAGACT
CTTGGGTAGGAGGGAAAGGAGTAGATTTTCTGAACGTGGGTTCTTTGGTTTAGAAGAGGGAGATAGAGCGATAAGCATGGGTCGCCATGTATCATTTAGTGTAGAATATAATGAAAAAGAAAGTTTGAACGAGATGTTTGAGCTTGGTGAAACATCTAATGCAGCCTTTGGCAACGAGGAATCAACAAGTGATGGAGAATATGTTGATGAGCAAGAATGGGGAAGGAGGGAGTCTGAAATGATCTGTCGGCATCTTGATCATATTGATATGTTGGGCCTCAACAAGACTACCCTCCGACTAAGATACCTCATTAACTGGCTCGTTACTTCGTTACTTCAACTCCGATTACCCGGTCGAGATGATGTAGGGGTCCACCTTGTACAACTATATGGACCAAAGATCAAATACGAAAGAGGTGCTGCGATTGCTTTCAATATAAAAGAGAGCAATGGAAGAGGGCTAATACATCCGGAAGTCGTACAAAAACTGGCTGAAAATAATGGAATATCTCTTGGAGTTGGCATTCTCAGCCATGTGCGAGCGGTAGACGTTCCGAAGCAGAATTCTGGACAGTATGATCTCGAAGACATGGCGTTGTGCAAACCTATGGGTAATGGCCATAACAGGAAGAAACTATTTTTCCGAGTCGAGGTTGTCACAGCTTCCCTTGGATTCCTTACCAACTTTGATGATGTTTATAAAATGTGGGCTTTCATAGCCAAGTTTCTAAATCCATCATTTCTTGAAAACAATACTTTGTCTTCCGTCCCTGAGAGTTCGGAATCCTATCACCGGAGCCTGTTTTAG

Coding sequence (CDS)

ATGGACAAAAAAACCAGAAGGCGACACGGTTCTGGTGTAACAGAGAGGCGCAAATCCTCCATTCTCAGGCAATTGCAAGAAAACAAGCTCAGAGAGGCTCTGGAAGAAGCTTCTGAAGATGGGTCTCTCGCAAAATCTAGAGACATTGACTGTGAATCGCCAAATCAGGACAGGAATGTTCGACGGTCGAGATCTCTCGCTCGGCTTCATGCCCAAAAGGAGTTTCTACGCGCCACCGCACTTGCCGCCGACCGGACCTATTCCACGGAGGATTTTATTCCGAATCTCTTCGATGCCTTCACCAAATTCCTCACTATGTACCCAAAATTTCAGACGTCAGAAAAAATCGACCAGTTGAGATCAGAAGAATATGAGCATCTCTCAGAGTCGTTCTCGAAGGTATGTCTTGACTACTGTGGTTTTGGTTTATTTTCCCACATTCAAACACAACAATTTTGGGAGTCTTCGGCGTTTACCCTCTCTGAAATTACTGCCAATTTGAGCAACCACGCGCTATACGGCGGCGCTGAGAAGGGCACGATTGAACACGATATCAAGACTAGAATTATGGATTATCTGAACATTTCTGAAAATGAATATGGGCTTGTTTTTACAGTCAGTAGGGGATCGGCCTTTAAGCTCTTGTCTGAGTCTTACCCTTTTCATACGAATAAGAAGCTGTTGACTATGTTTGATCATGAGAGTCAATCTGTGAGTTGGATGGCTCAGAGTGCTAAAGAGAGGGGTGCAAAGGTTTACAGTGCATGGTTTAAGTGGCCAACATTGAGACTCTGTTCAAGGGAGTTGAGGAAACAGATCACAAACAAGAGGAAGAGGAAGAAGGATTCTGTTGCTGGCCTTTTTGTGTTTCCTGTTCAGTCTAGAGTTACAGGGGCAAAGTATTCTTACCAGTGGATGGCACTTGCCCAGCAGAACAATTGGCATGTATTGCTCGATGCTGGCTCGCTCGGTCCTAAGGACATGGATTCCTTGGGGCTCTCTCTCTTCAAGCCCGATTTTATCATTACATCGTTTTATCGCGTTTTCGGGTCTGATCCAACTGGGTTTGGCTGCCTGTTGATTAAGAAATCTGTTATAGGGAGTTTGCAAAACCAATCTGGGCGGACTGGTACAGGAATGGTGAGGATACTCCCCATTTTTCCACAGTATATTGGTGATTCGATTGATGGTTTGGTGGATGTCTTGGCTGGGATTGAAGATGATGAAATTAATGGTCAGGAGGATTCTGAAACTGAGAAGCATCAGGAGTCACGCATGCCTGCCTTTTCGGGTGTGTTTACGTCGAACCAGGTGAGGGATGTGTTTGAGACTGAGATGGAGCAGGATAATAATAGCTCGGACAGGGATGGGGCTAGTACCATTTTTGAGGAAGCTGAGAGCATCTCAATTGGTGAGGTTATGAAGAGTCCAATCTTCAGTGAGGACGAGTCGTCGGACAATTCATATTGGATTGATTTGGGTCAGAGTCCATTTGGTTCTGATAATTCTGGCCATTTGATCAAGCAAAAAACATGGTCACCCTTACCACCATCTTGGTTTTCTGGAAAAAGGAACACTAGGAAACGTTCACCAAAACCAGCATCTAGGTTGTTGAAAAGTCCAATGTGCGGTGATGATAAGCGGATGAATTCAAGGCACTATGAAGATTCGGTGTTGTCTTTTGATGCAGCTGTATTATCAATGTCACAGGATTTCGGCTGTGTGAAGGGGATTCCTGAAGAAGAACAATCTGGAGAGCAAGACTCTTGCTGTGGAAATGTAGGAAGTTTGAGGGATTCTCATGCTGTTAGTGAGATTCAAGAGGATTCAGAAACTGGAGAAGAATCAGGTAGGTTGAGTGTTGCATCAAATGGAACCCGACCTGCGAATCAGACTTCCGAGTTTCAGGATCTGAAGCGTTCAAATTCCACAACATCTGGAGCCTTCAAAGACCTGAAGGAAAGTGCTATAAGAAGGGAGACAGAAGGGGAATTCAGACT
CTTGGGTAGGAGGGAAAGGAGTAGATTTTCTGAACGTGGGTTCTTTGGTTTAGAAGAGGGAGATAGAGCGATAAGCATGGGTCGCCATGTATCATTTAGTGTAGAATATAATGAAAAAGAAAGTTTGAACGAGATGTTTGAGCTTGGTGAAACATCTAATGCAGCCTTTGGCAACGAGGAATCAACAAGTGATGGAGAATATGTTGATGAGCAAGAATGGGGAAGGAGGGAGTCTGAAATGATCTGTCGGCATCTTGATCATATTGATATGTTGGGCCTCAACAAGACTACCCTCCGACTAAGATACCTCATTAACTGGCTCGTTACTTCGTTACTTCAACTCCGATTACCCGGTCGAGATGATGTAGGGGTCCACCTTGTACAACTATATGGACCAAAGATCAAATACGAAAGAGGTGCTGCGATTGCTTTCAATATAAAAGAGAGCAATGGAAGAGGGCTAATACATCCGGAAGTCGTACAAAAACTGGCTGAAAATAATGGAATATCTCTTGGAGTTGGCATTCTCAGCCATGTGCGAGCGGTAGACGTTCCGAAGCAGAATTCTGGACAGTATGATCTCGAAGACATGGCGTTGTGCAAACCTATGGGTAATGGCCATAACAGGAAGAAACTATTTTTCCGAGTCGAGGTTGTCACAGCTTCCCTTGGATTCCTTACCAACTTTGATGATGTTTATAAAATGTGGGCTTTCATAGCCAAGTTTCTAAATCCATCATTTCTTGAAAACAATACTTTGTCTTCCGTCCCTGAGAGTTCGGAATCCTATCACCGGAGCCTGTTTTAG

Protein sequence

MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNVRRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLRSEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSWMAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLLIKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEKHQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFSEDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKSPMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDSHAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSNAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQLRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVYKMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF
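
As a consistency check, a 2808-nt coding sequence that ends in a stop codon corresponds to 936 codons, i.e. 935 amino acids. The sketch below is a hedged illustration of that arithmetic, plus a translation of the first 30 nucleotides of the CDS. The helper names and the deliberately partial codon table (only the codons needed for this prefix) are assumptions made for the example, not a general translation tool:

```python
# Partial standard-code table: only the codons that occur in the 30-nt prefix.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GAC": "D", "AAA": "K", "ACC": "T",
    "AGA": "R", "AGG": "R", "CGA": "R", "CAC": "H", "GGT": "G",
}


def protein_length_from_cds(cds_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given nucleotide length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


def translate_prefix(nucleotides: str) -> str:
    """Translate a short in-frame sequence using the partial table above."""
    codons = [nucleotides[i:i + 3] for i in range(0, len(nucleotides), 3)]
    return "".join(PARTIAL_CODON_TABLE[c] for c in codons)


print(protein_length_from_cds(2808))                       # 935
print(translate_prefix("ATGGACAAAAAAACCAGAAGGCGACACGGT"))  # MDKKTRRRHG
```

The translated prefix matches the first ten residues of the protein sequence above.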
Homology
BLAST of HG10014086.1 vs. NCBI nr
Match: XP_038899790.1 (uncharacterized protein LOC120087021 [Benincasa hispida])

HSP 1 Score: 1778.5 bits (4605), Expect = 0.0e+00
Identity = 903/935 (96.58%), Positives = 915/935 (97.86%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPN DRNV
Sbjct: 17  MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNYDRNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRSLARLHAQKEFLRATALAADRTY TED IPNLFDAFTKFLTMYPKFQTSEKIDQLR
Sbjct: 77  RRSRSLARLHAQKEFLRATALAADRTYCTEDLIPNLFDAFTKFLTMYPKFQTSEKIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESS+FTLSEITANLSNHALYGGAEKGT
Sbjct: 137 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSSFTLSEITANLSNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQSAK+RGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK
Sbjct: 257 MAQSAKQRGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD INGQEDSETEK
Sbjct: 377 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDAINGQEDSETEK 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           HQESRMPAFSGVFTSNQVRDVFETE+E DNNSSDRDGASTIFEEAESISIGEVMKSPIFS
Sbjct: 437 HQESRMPAFSGVFTSNQVRDVFETEIEHDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRN R+RSPKPASR LKS
Sbjct: 497 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNNRQRSPKPASRFLKS 556

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
           PMCGDDKR+NSR +EDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSL+DS
Sbjct: 557 PMCGDDKRVNSRQHEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLKDS 616

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
           H VSEIQEDSETGEES RLSVASNG RPAN TSEFQ+LKRSNSTT GAFKDLKE+AIRRE
Sbjct: 617 HVVSEIQEDSETGEESARLSVASNGIRPANDTSEFQELKRSNSTTCGAFKDLKENAIRRE 676

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSN 720
           TEGEFRLLGRRERSRFSERGF GLEEGDRAISMGR VSFSVEYNEKESLNEMFELGE SN
Sbjct: 677 TEGEFRLLGRRERSRFSERGFLGLEEGDRAISMGRRVSFSVEYNEKESLNEMFELGEASN 736

Query: 721 AAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQ 780
           AAFGNEESTSDGEYVDEQEWGRRE EMICRHLDHIDMLGLNKTTLR RYLINWLVTSLLQ
Sbjct: 737 AAFGNEESTSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNKTTLRQRYLINWLVTSLLQ 796

Query: 781 LRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGV 840
           LRLPG+DDVGVHLVQLYGPKIKYERGAAIAFN+KESNGRGLIHPEVVQKLAENNGISLGV
Sbjct: 797 LRLPGQDDVGVHLVQLYGPKIKYERGAAIAFNVKESNGRGLIHPEVVQKLAENNGISLGV 856

Query: 841 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 900
           GILSHVR VDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY
Sbjct: 857 GILSHVRVVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 916

Query: 901 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 936
           KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF
Sbjct: 917 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 950
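
The percentages on the HSP summary line are ratios of matching positions to the denominator reported on that line. A minimal sketch of the arithmetic for this hit, assuming two-decimal rounding as displayed; the function name `percent_of_alignment` is illustrative:

```python
def percent_of_alignment(matches: int, aligned_length: int) -> str:
    """Format matches/aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"


# Values from the summary line of the XP_038899790.1 hit.
print(percent_of_alignment(903, 935))  # 96.58 (identical residues)
print(percent_of_alignment(915, 935))  # 97.86 (identical or conservative substitutions)
```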

BLAST of HG10014086.1 vs. NCBI nr
Match: XP_008454669.1 (PREDICTED: uncharacterized protein LOC103495022 [Cucumis melo] >KAA0057170.1 uncharacterized protein E6C27_scaffold741G00400 [Cucumis melo var. makuwa] >TYK27153.1 uncharacterized protein E5676_scaffold1920G00320 [Cucumis melo var. makuwa])

HSP 1 Score: 1739.2 bits (4503), Expect = 0.0e+00
Identity = 888/935 (94.97%), Positives = 901/935 (96.36%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAK+RDIDCESPNQDRNV
Sbjct: 17  MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKTRDIDCESPNQDRNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRS ARLHAQKEFLRATALAADRTY  ED IPNLFDAFTKFLTMYPKFQTSEKIDQLR
Sbjct: 77  RRSRSFARLHAQKEFLRATALAADRTYCREDSIPNLFDAFTKFLTMYPKFQTSEKIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAF+LSEITANLSNHALYGGAEKGT
Sbjct: 137 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFSLSEITANLSNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK
Sbjct: 257 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+QSGRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD IN  EDSETEK
Sbjct: 377 IKKSVIGSLQSQSGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDVIN--EDSETEK 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           H ESRMPAFSGVFT NQVRDVFETEME DNNSSDRDGASTIFEEAESISIGEVMKSPIFS
Sbjct: 437 HPESRMPAFSGVFTPNQVRDVFETEMEHDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSDNSYWIDLGQSPFGSDNS HLIKQKTWSPLPPSWFSGKRN RKRSPKPASRLLKS
Sbjct: 497 EDESSDNSYWIDLGQSPFGSDNSDHLIKQKTWSPLPPSWFSGKRNNRKRSPKPASRLLKS 556

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
           PMC +DKR N+RH  DSVLSFDAA+LSMSQDF CV+GIPEEEQSGEQDSCCGNVGSLRDS
Sbjct: 557 PMCSNDKRANARHRNDSVLSFDAALLSMSQDFSCVQGIPEEEQSGEQDSCCGNVGSLRDS 616

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
           H VSEIQEDSETGEES RLS ASNG  PAN TSEF DLKRSNSTTSGAF DLKESAIRRE
Sbjct: 617 HVVSEIQEDSETGEESARLSFASNGIHPANHTSEFWDLKRSNSTTSGAFNDLKESAIRRE 676

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSN 720
           TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGR VSF VEYNEKESLNEMFELGE S 
Sbjct: 677 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRRVSFRVEYNEKESLNEMFELGEASC 736

Query: 721 AAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQ 780
            AFGNEESTSDGEYVDEQEWGRRE EMICRHLDHIDMLGLNKTTLR RYLINWLVTSLLQ
Sbjct: 737 TAFGNEESTSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNKTTLRQRYLINWLVTSLLQ 796

Query: 781 LRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGV 840
           LRLPG+DDVGV LVQLYGPKIKYERGAAIAFN+KESNGRGLIHPEVVQKLAENNGI+LGV
Sbjct: 797 LRLPGQDDVGVQLVQLYGPKIKYERGAAIAFNVKESNGRGLIHPEVVQKLAENNGIALGV 856

Query: 841 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 900
           GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY
Sbjct: 857 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 916

Query: 901 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 936
           KMWAF+AKFLNPSFLENNTLSSVPESSESYHRS+F
Sbjct: 917 KMWAFVAKFLNPSFLENNTLSSVPESSESYHRSMF 948
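
When comparing several hits, it can help to pull the counts out of each HSP summary line programmatically. The following is a rough sketch, assuming the summary lines follow the exact layout shown on this page; the regular expression and the `parse_hsp_stats` helper are illustrative and tolerate both the spelling "Positives" and the misspelling "Postives" seen in some exports:

```python
import re

# Matches e.g. "Identity = 888/935 (94.97%), Positives = 901/935 (96.36%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str):
    """Return identity and positive counts from an HSP summary line, or None."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": ident_len,
        "consistent": ident_len == pos_len,  # both ratios should share a denominator
    }


stats = parse_hsp_stats(
    "Identity = 888/935 (94.97%), Positives = 901/935 (96.36%), Query Frame = 0"
)
print(stats["identical"], stats["positive"], stats["aligned_length"])
```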

BLAST of HG10014086.1 vs. NCBI nr
Match: XP_011652392.1 (uncharacterized protein LOC101215138 [Cucumis sativus] >KGN59901.1 hypothetical protein Csa_002100 [Cucumis sativus])

HSP 1 Score: 1735.7 bits (4494), Expect = 0.0e+00
Identity = 885/935 (94.65%), Positives = 901/935 (96.36%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKKTR+RHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAK+RDIDC+SP+QDRNV
Sbjct: 17  MDKKTRKRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKTRDIDCDSPDQDRNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRS ARLHAQKEFLRATALAADRTY TED IPNLFDAFTKFLTMYPKFQTSEKIDQLR
Sbjct: 77  RRSRSFARLHAQKEFLRATALAADRTYCTEDLIPNLFDAFTKFLTMYPKFQTSEKIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT
Sbjct: 137 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK
Sbjct: 257 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+QSGRTGTGMVRILPIFPQYIGDSIDGL DVLAGI+DD IN  EDSETEK
Sbjct: 377 IKKSVIGSLQSQSGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIDDDVIN--EDSETEK 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           H ESRMPAFSGVFT NQVRDVFETEME DNNSSDRDGASTIFEEAESISIGEVMKSPIFS
Sbjct: 437 HLESRMPAFSGVFTPNQVRDVFETEMEHDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSDNSYWIDLGQSPFGSDNS HLIKQKTWSPLPPSWFSGKRN R+RSPKPASRLLKS
Sbjct: 497 EDESSDNSYWIDLGQSPFGSDNSDHLIKQKTWSPLPPSWFSGKRNNRQRSPKPASRLLKS 556

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
           PMCGDDKR N+RH  DSVLSFDAAVLSMSQDF CV+GIPEE+QSGEQDSCCGNVGSLRDS
Sbjct: 557 PMCGDDKRANARHRNDSVLSFDAAVLSMSQDFSCVEGIPEEDQSGEQDSCCGNVGSLRDS 616

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
           H VSEIQEDSETGEES RLS ASNG  P N TSEF+DLKRSNSTTSGAF DLKESAIRRE
Sbjct: 617 HVVSEIQEDSETGEESARLSFASNGIHPVNHTSEFRDLKRSNSTTSGAFNDLKESAIRRE 676

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSN 720
           TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGR VSF VEYNEKESLNEMFELGETS 
Sbjct: 677 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRRVSFRVEYNEKESLNEMFELGETSC 736

Query: 721 AAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQ 780
            AFGNEESTSDGEYVDEQEWGRRE EMICRHLDHIDMLGLNKTTLR RYLINWLVTSLLQ
Sbjct: 737 TAFGNEESTSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNKTTLRQRYLINWLVTSLLQ 796

Query: 781 LRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGV 840
           LRLPG+DDVGVHLVQLYGPKIKYERGAAIAFN+KESNGRGLIHPEVVQKLAENNGI+LGV
Sbjct: 797 LRLPGQDDVGVHLVQLYGPKIKYERGAAIAFNVKESNGRGLIHPEVVQKLAENNGIALGV 856

Query: 841 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 900
           GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY
Sbjct: 857 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 916

Query: 901 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 936
           KMWAFIAKFLNPSFLENNTLS VPES ESY  S+F
Sbjct: 917 KMWAFIAKFLNPSFLENNTLSPVPESLESYRGSMF 948

BLAST of HG10014086.1 vs. NCBI nr
Match: KAG6608507.1 (Molybdenum cofactor sulfurase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1672.9 bits (4331), Expect = 0.0e+00
Identity = 855/933 (91.64%), Positives = 895/933 (95.93%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKK+RRRHGSG+TERR+SSILRQLQENKLREALEEASEDGSLAKSRDIDC+SPN D NV
Sbjct: 17  MDKKSRRRHGSGLTERRQSSILRQLQENKLREALEEASEDGSLAKSRDIDCDSPNHDGNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRSLARLHAQKEFLRATALAADRTYSTED IPNLFDAFTKFLTMYPKFQ+SE+IDQLR
Sbjct: 77  RRSRSLARLHAQKEFLRATALAADRTYSTEDLIPNLFDAFTKFLTMYPKFQSSEQIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           +E+YEHLSESFSKVCLDYCGFGLFS+IQTQQFWESSAFTLSEITANL+NHALYGGAEKGT
Sbjct: 137 TEDYEHLSESFSKVCLDYCGFGLFSYIQTQQFWESSAFTLSEITANLNNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIM+YLNISENEYGLVFTVSRGSAFKLL+ESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMNYLNISENEYGLVFTVSRGSAFKLLAESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQ+AKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSV+GLFVFPVQSRVTGAK
Sbjct: 257 MAQNAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVSGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+Q GRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD INGQEDSETE 
Sbjct: 377 IKKSVIGSLQSQCGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDAINGQEDSETET 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           HQESRMPAFSGVFT+NQVRDVFETE+EQDNNSSDRDGASTIFEE ESIS+GEVMKSPIFS
Sbjct: 437 HQESRMPAFSGVFTTNQVRDVFETEIEQDNNSSDRDGASTIFEEVESISVGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTW-SPLPPSWFSGKRNTRKRSPKPASRLLK 540
           EDESSDNSYWIDLG SPFGSDNSGHLIKQKTW SPLPPSWFSGKRN+R+ SPKPASRLL+
Sbjct: 497 EDESSDNSYWIDLGHSPFGSDNSGHLIKQKTWSSPLPPSWFSGKRNSRQLSPKPASRLLR 556

Query: 541 SPMC-GDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLR 600
           SP+C GDDKR N RH +DSVLSFDAAVLS+SQD   V+GIPEEEQSGEQDSCCGNVGSL+
Sbjct: 557 SPICGGDDKRANPRHRDDSVLSFDAAVLSVSQDLCRVEGIPEEEQSGEQDSCCGNVGSLK 616

Query: 601 DSHAVSEIQEDSETGEES--GRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESA 660
           DSHAV EIQEDSETGEES   RLS ASNG R ANQT E QDLK SNST +GAFKDLKESA
Sbjct: 617 DSHAVGEIQEDSETGEESIPNRLSFASNGNRSANQTFEIQDLKLSNSTAAGAFKDLKESA 676

Query: 661 IRRETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELG 720
           IRRETEGEFRLLGRRERSRFSERGFFGL +G+RA+SMGR VSFSVEYNEKESLNEMFELG
Sbjct: 677 IRRETEGEFRLLGRRERSRFSERGFFGL-DGERALSMGRRVSFSVEYNEKESLNEMFELG 736

Query: 721 ETSNAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVT 780
           E SNAAF NEES SDGEYVDEQEWGRRE EMIC+HLDHIDMLGLN+TTLRLRYLINWLVT
Sbjct: 737 EASNAAFDNEESMSDGEYVDEQEWGRREPEMICQHLDHIDMLGLNRTTLRLRYLINWLVT 796

Query: 781 SLLQLRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGI 840
           SLLQLRLPGRDDVG HLVQLYGPKIKYERGAA+AFN+KESNGRGLIHPEVVQKLAENNGI
Sbjct: 797 SLLQLRLPGRDDVGTHLVQLYGPKIKYERGAAVAFNVKESNGRGLIHPEVVQKLAENNGI 856

Query: 841 SLGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNF 900
           SLGVGILSHVRAVDVPKQNSGQYDL+DMALCKPM NGHNRKKLFFRVEVVTASLGFLTNF
Sbjct: 857 SLGVGILSHVRAVDVPKQNSGQYDLKDMALCKPMANGHNRKKLFFRVEVVTASLGFLTNF 916

Query: 901 DDVYKMWAFIAKFLNPSFLENNTLSSVPESSES 930
           +DVYKMWAF+AKFLNPSFLEN+TLSS PE+SES
Sbjct: 917 EDVYKMWAFVAKFLNPSFLENSTLSSGPETSES 947

BLAST of HG10014086.1 vs. NCBI nr
Match: XP_022941406.1 (uncharacterized protein LOC111446707 [Cucurbita moschata])

HSP 1 Score: 1672.5 bits (4330), Expect = 0.0e+00
Identity = 855/933 (91.64%), Positives = 895/933 (95.93%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKK+RRRHGSG+TERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDC+SPN D NV
Sbjct: 17  MDKKSRRRHGSGLTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCDSPNHDGNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRSLARLHAQKEFLRATALAADRTYSTED IPNLFDAFTKFLTMYPKFQ+SE+IDQLR
Sbjct: 77  RRSRSLARLHAQKEFLRATALAADRTYSTEDLIPNLFDAFTKFLTMYPKFQSSEQIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           +E+YEHLSESFSKVCLDYCGFGLFS+IQTQQFWESSAFTLSEITANL+NHALYGGAEKGT
Sbjct: 137 TEDYEHLSESFSKVCLDYCGFGLFSYIQTQQFWESSAFTLSEITANLNNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIM+YLNISENEYGLVFTVSRGSAFKLL+ESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMNYLNISENEYGLVFTVSRGSAFKLLAESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQ+AKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSV+GLFVFPVQSRVTGAK
Sbjct: 257 MAQNAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVSGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+Q GRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD INGQEDSETE 
Sbjct: 377 IKKSVIGSLQSQCGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDAINGQEDSETET 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           HQESRMPAFSGVFT+NQVRDVFETE+EQDNNSSDRDGASTIFEE ESIS+GEVMKSPIFS
Sbjct: 437 HQESRMPAFSGVFTTNQVRDVFETEIEQDNNSSDRDGASTIFEEVESISVGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTW-SPLPPSWFSGKRNTRKRSPKPASRLLK 540
           EDESSDNSYWIDLG SPFGSDNSGHLIKQKTW SPLPPSWFSGKRN+R+ SPKPASRLL+
Sbjct: 497 EDESSDNSYWIDLGHSPFGSDNSGHLIKQKTWSSPLPPSWFSGKRNSRQLSPKPASRLLR 556

Query: 541 SPMC-GDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLR 600
           SP+C GDDKR N RH +DSVLSFDAAVLS+SQD   V+GIPEEEQSGEQDSCCGNVGSL+
Sbjct: 557 SPICGGDDKRANPRHRDDSVLSFDAAVLSVSQDLCRVEGIPEEEQSGEQDSCCGNVGSLK 616

Query: 601 DSHAVSEIQEDSETGEES--GRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESA 660
           DSHAVSEIQEDSETGEES   RLS ASNG R ANQT E +DLK SNST +GA KDLKESA
Sbjct: 617 DSHAVSEIQEDSETGEESIPNRLSFASNGNRSANQTFEIRDLKLSNSTAAGALKDLKESA 676

Query: 661 IRRETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELG 720
           IRRETEGEFRLLGRRERSRFSERGFFGL +G+RA+SMGR VSFSVEYNEKESLNEMFELG
Sbjct: 677 IRRETEGEFRLLGRRERSRFSERGFFGL-DGERALSMGRRVSFSVEYNEKESLNEMFELG 736

Query: 721 ETSNAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVT 780
           E SNAAF NEES SDGEYVDEQEWGRRE EMICRHLDHIDMLGLN+TTLRLRYLINWLVT
Sbjct: 737 EASNAAFDNEESMSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNRTTLRLRYLINWLVT 796

Query: 781 SLLQLRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGI 840
           SLLQLRLPGRDDVG HLVQLYGPKIKYERGAA+AFN+KESNGRGLIHPEVVQ+LAENNGI
Sbjct: 797 SLLQLRLPGRDDVGTHLVQLYGPKIKYERGAAVAFNVKESNGRGLIHPEVVQRLAENNGI 856

Query: 841 SLGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNF 900
           SLGVGILSHVRAVDVPKQNSGQYDL+DMALCKPM NGHNRKKLFFRVEVVTASLGFLTNF
Sbjct: 857 SLGVGILSHVRAVDVPKQNSGQYDLKDMALCKPMANGHNRKKLFFRVEVVTASLGFLTNF 916

Query: 901 DDVYKMWAFIAKFLNPSFLENNTLSSVPESSES 930
           +DVYKMWAF+AKFLNPSFLEN+TLSS PE+SES
Sbjct: 917 EDVYKMWAFVAKFLNPSFLENSTLSSGPETSES 947

BLAST of HG10014086.1 vs. ExPASy Swiss-Prot
Match: Q8LGM7 (Molybdenum cofactor sulfurase OS=Solanum lycopersicum OX=4081 GN=FLACCA PE=2 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 1.1e-16
Identity = 77/325 (23.69%), Positives = 152/325 (46.77%), Query Frame = 0

Query: 102 KFLTMYPKFQTSEKIDQLRSEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTL- 161
           +F + Y    + + ID++R+ E++ L+++   V LD+ G  L+S  Q +  ++    TL 
Sbjct: 13  EFGSYYGYANSPKNIDEIRATEFKRLNDT---VYLDHAGATLYSESQMEAVFKDLNSTLY 72

Query: 162 ----SEITANLSNHALYGGAEKGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLS 221
               S+ T +L+   + G A         + +++ + N S  EY  +FT    +A KL+ 
Sbjct: 73  GNPHSQSTCSLATEDIVGKA---------RQQVLSFFNASPREYSCIFTSGATAALKLVG 132

Query: 222 ESYPFHTNKKLLTMFDHESQSVSWMAQSAKERGAKVYSAWFK----------WPTLRLCS 281
           E++P+ +N   +   ++ + SV  + + A  +GA  ++   +             L+L  
Sbjct: 133 ETFPWSSNSSFMYSMENHN-SVLGIREYALSKGAAAFAVDIEDTHVGESESPQSNLKLTQ 192

Query: 282 RELRKQITNKRKRKKDSVAG----LFVFPVQSRVTGAKYSYQWMALAQQNN--------- 341
             ++++  N+    K+ + G    LF FP +   +G K+    + + ++ +         
Sbjct: 193 HHIQRR--NEGGVLKEGMTGNTYNLFAFPSECNFSGRKFDPNLIKIIKEGSERILESSQY 252

Query: 342 ----WHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLLIKKSVIGSL 393
               W VL+DA      +  +  LS+FK DF++ SFY++FG  PTG G L+++K     +
Sbjct: 253 SRGCWLVLIDAAKGCATNPPN--LSMFKADFVVFSFYKLFGY-PTGLGALIVRKDAAKLM 312

BLAST of HG10014086.1 vs. ExPASy Swiss-Prot
Match: Q9C5X8 (Molybdenum cofactor sulfurase OS=Arabidopsis thaliana OX=3702 GN=ABA3 PE=1 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 1.6e-15
Identity = 81/320 (25.31%), Positives = 143/320 (44.69%), Query Frame = 0

Query: 98  DAFTKFLTMYPKFQTSEK-IDQLRSEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESS 157
           +AF K    Y  +    K I ++R  E++ L +    V LD+ G  L+S +Q +  ++  
Sbjct: 2   EAFLKEFGDYYGYPDGPKNIQEIRDTEFKRLDKGV--VYLDHAGSTLYSELQMEYIFKD- 61

Query: 158 AFTLSEITANLSNHALYGGAEKGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLS 217
            FT S +  N  + +    A    I  D + ++++Y N S  +Y  +FT    +A KL+ 
Sbjct: 62  -FT-SNVFGNPHSQSDISSATSDLIA-DARHQVLEYFNASPEDYSCLFTSGATAALKLVG 121

Query: 218 ESYPFHTNKKLL-TMFDHES----------QSVSWMAQSAKERGAKVYSAWFKWPTLRLC 277
           E++P+  +   L TM +H S          Q  S  A   +E   +        P++++ 
Sbjct: 122 ETFPWTQDSNFLYTMENHNSVLGIREYALAQGASACAVDIEEAANQPGQLTNSGPSIKVK 181

Query: 278 SRELRKQITNK--RKRKKDSVAGLFVFPVQSRVTGAKYSYQWMALAQQN----------- 337
            R ++ + T+K  ++  + +   LF FP +   +G +++   + L ++N           
Sbjct: 182 HRAVQMRNTSKLQKEESRGNAYNLFAFPSECNFSGLRFNLDLVKLMKENTETVLQGSPFS 241

Query: 338 ---NWHVLLDAG---SLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLLIKKSVI 387
               W VL+DA    +  P D     LS +  DF++ SFY++FG  PTG G LL++    
Sbjct: 242 KSKRWMVLIDAAKGCATLPPD-----LSEYPADFVVLSFYKLFGY-PTGLGALLVRNDAA 301

BLAST of HG10014086.1 vs. ExPASy Swiss-Prot
Match: Q16P90 (Molybdenum cofactor sulfurase 3 OS=Aedes aegypti OX=7159 GN=mal3 PE=3 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 5.6e-13
Identity = 72/271 (26.57%), Positives = 110/271 (40.59%), Query Frame = 0

Query: 122 EEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTL------SEITANLSNHALYGG 181
           +E+  L E   K  LD+ G  L++  Q +   E  A  L      S  T +L +   Y  
Sbjct: 21  KEFSRLKE---KCYLDHAGTTLYADSQIRSVCEGLAQNLYCNPHTSRTTEDLLDQVRY-- 80

Query: 182 AEKGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHES 241
                       R++ + N   +EY L+FT    ++ KLL+ESY F      + + D  +
Sbjct: 81  ------------RVLRHFNTRSSEYSLIFTSGTTASLKLLAESYEFAPEGAFVYLKDSHT 140

Query: 242 QSVSWMAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSR 301
             +        ER   +Y             RE   +  +  +R     + L VFP Q  
Sbjct: 141 SVLGMREIVGTER---IYPV----------EREQLLKELDSSERSDSEHSSLIVFPAQCN 200

Query: 302 VTGAKYSYQWMALAQQN--------NWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYR 361
             G KY  + +   Q+N         + V LDA S        L LS ++PDF+  SFY+
Sbjct: 201 FNGVKYPLELVRKIQRNGISGYGKERFRVCLDAASF--VSTSFLDLSKYQPDFVCLSFYK 258

Query: 362 VFGSDPTGFGCLLIKKSVIGSLQNQSGRTGT 379
           +FG  PTG G LL+  +    L+ +    GT
Sbjct: 261 IFGY-PTGLGALLVHHTAADQLRKKYYGGGT 258

BLAST of HG10014086.1 vs. ExPASy Swiss-Prot
Match: Q16GH0 (Molybdenum cofactor sulfurase 1 OS=Aedes aegypti OX=7159 GN=mal1 PE=3 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 4.7e-12
Identity = 70/270 (25.93%), Positives = 110/270 (40.74%), Query Frame = 0

Query: 123 EYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTL------SEITANLSNHALYGGA 182
           E+  L E   K  LD+ G  L++  Q +   E  A  L      S  T +L +   Y   
Sbjct: 22  EFSRLKE---KCYLDHAGTTLYADSQIRSVCEGLAQNLYCNPHTSRTTEDLLDQVRY--- 81

Query: 183 EKGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQ 242
                      R++ + N   +EY L+FT    ++ KLL+ES+ F      + + D  + 
Sbjct: 82  -----------RVLRHFNTRSSEYSLIFTSGTTASLKLLAESFEFAPEGAFVYLKDSHTS 141

Query: 243 SVSWMAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRV 302
            +        ER   +Y             RE   +  +  +R  +  + L VFP Q   
Sbjct: 142 VLGMREIVGTER---IYPV----------EREQLLKELDSSERSDNEHSSLIVFPAQCNF 201

Query: 303 TGAKYSYQWMALAQQN--------NWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRV 362
            G KY  + +   Q++         + V LDA S        L LS ++PDF+  SFY++
Sbjct: 202 NGVKYPLELVRKIQRDGISGYGKERFRVCLDAASF--VSTSFLDLSKYQPDFVCLSFYKI 258

Query: 363 FGSDPTGFGCLLIKKSVIGSLQNQSGRTGT 379
           FG  PTG G LL+  +    L+ +    GT
Sbjct: 262 FGY-PTGLGALLVHHTAADQLRKKYYGGGT 258

BLAST of HG10014086.1 vs. ExPASy Swiss-Prot
Match: A2VD33 (Molybdenum cofactor sulfurase OS=Danio rerio OX=7955 GN=mocos PE=2 SV=2)

HSP 1 Score: 75.1 bits (183), Expect = 4.7e-12
Identity = 66/256 (25.78%), Positives = 111/256 (43.36%), Query Frame = 0

Query: 136 LDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGTIEHDIKTRIMDYLNI 195
           LD+ G  LF     + F +  +     +  N  +H         T+E  ++ +I+ + N 
Sbjct: 48  LDHAGTTLFPESLIKGFHDDIS---RNVYGNPHSHNSSSRLTHDTVE-SVRYKILAHFNT 107

Query: 196 SENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSWMAQSAKERGAKVYSA 255
           S  +Y ++FT    +A KL+++++P+    K ++  +  SQ            G +  +A
Sbjct: 108 SPEDYSVIFTSGCTAALKLVADTFPW----KPMSNKEPGSQFCYLTDNHTSVVGIRGATA 167

Query: 256 WFKWPTLRLCSRELRKQITNKRK---RKKDSVAGLFVFPVQSRVTGAKYSYQWM------ 315
                T+ +  RE+  +  NK +    ++ S   LF +P QS  +G KYS  ++      
Sbjct: 168 LQGVGTISVSPREVETRARNKTQTNGEEECSTPHLFCYPAQSNFSGRKYSLSYVKGIQSQ 227

Query: 316 ----ALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLLIK 375
               A      W VLLDA          L LS +  DF+  SFY++FG  PTG G LL++
Sbjct: 228 QLYPACEHHGQWFVLLDAACF--VSCSPLDLSQYPADFVPISFYKMFGF-PTGLGALLVR 287

Query: 376 KSVIGSLQNQSGRTGT 379
                 L+      GT
Sbjct: 288 NEAAEVLRKTYFGGGT 292

BLAST of HG10014086.1 vs. ExPASy TrEMBL
Match: A0A5A7UPY9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1920G00320 PE=4 SV=1)

HSP 1 Score: 1739.2 bits (4503), Expect = 0.0e+00
Identity = 888/935 (94.97%), Positives = 901/935 (96.36%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAK+RDIDCESPNQDRNV
Sbjct: 17  MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKTRDIDCESPNQDRNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRS ARLHAQKEFLRATALAADRTY  ED IPNLFDAFTKFLTMYPKFQTSEKIDQLR
Sbjct: 77  RRSRSFARLHAQKEFLRATALAADRTYCREDSIPNLFDAFTKFLTMYPKFQTSEKIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAF+LSEITANLSNHALYGGAEKGT
Sbjct: 137 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFSLSEITANLSNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK
Sbjct: 257 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+QSGRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD IN  EDSETEK
Sbjct: 377 IKKSVIGSLQSQSGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDVIN--EDSETEK 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           H ESRMPAFSGVFT NQVRDVFETEME DNNSSDRDGASTIFEEAESISIGEVMKSPIFS
Sbjct: 437 HPESRMPAFSGVFTPNQVRDVFETEMEHDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSDNSYWIDLGQSPFGSDNS HLIKQKTWSPLPPSWFSGKRN RKRSPKPASRLLKS
Sbjct: 497 EDESSDNSYWIDLGQSPFGSDNSDHLIKQKTWSPLPPSWFSGKRNNRKRSPKPASRLLKS 556

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
           PMC +DKR N+RH  DSVLSFDAA+LSMSQDF CV+GIPEEEQSGEQDSCCGNVGSLRDS
Sbjct: 557 PMCSNDKRANARHRNDSVLSFDAALLSMSQDFSCVQGIPEEEQSGEQDSCCGNVGSLRDS 616

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
           H VSEIQEDSETGEES RLS ASNG  PAN TSEF DLKRSNSTTSGAF DLKESAIRRE
Sbjct: 617 HVVSEIQEDSETGEESARLSFASNGIHPANHTSEFWDLKRSNSTTSGAFNDLKESAIRRE 676

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSN 720
           TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGR VSF VEYNEKESLNEMFELGE S 
Sbjct: 677 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRRVSFRVEYNEKESLNEMFELGEASC 736

Query: 721 AAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQ 780
            AFGNEESTSDGEYVDEQEWGRRE EMICRHLDHIDMLGLNKTTLR RYLINWLVTSLLQ
Sbjct: 737 TAFGNEESTSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNKTTLRQRYLINWLVTSLLQ 796

Query: 781 LRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGV 840
           LRLPG+DDVGV LVQLYGPKIKYERGAAIAFN+KESNGRGLIHPEVVQKLAENNGI+LGV
Sbjct: 797 LRLPGQDDVGVQLVQLYGPKIKYERGAAIAFNVKESNGRGLIHPEVVQKLAENNGIALGV 856

Query: 841 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 900
           GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY
Sbjct: 857 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 916

Query: 901 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 936
           KMWAF+AKFLNPSFLENNTLSSVPESSESYHRS+F
Sbjct: 917 KMWAFVAKFLNPSFLENNTLSSVPESSESYHRSMF 948

BLAST of HG10014086.1 vs. ExPASy TrEMBL
Match: A0A1S3BZ97 (uncharacterized protein LOC103495022 OS=Cucumis melo OX=3656 GN=LOC103495022 PE=4 SV=1)

HSP 1 Score: 1739.2 bits (4503), Expect = 0.0e+00
Identity = 888/935 (94.97%), Positives = 901/935 (96.36%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAK+RDIDCESPNQDRNV
Sbjct: 17  MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKTRDIDCESPNQDRNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRS ARLHAQKEFLRATALAADRTY  ED IPNLFDAFTKFLTMYPKFQTSEKIDQLR
Sbjct: 77  RRSRSFARLHAQKEFLRATALAADRTYCREDSIPNLFDAFTKFLTMYPKFQTSEKIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAF+LSEITANLSNHALYGGAEKGT
Sbjct: 137 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFSLSEITANLSNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK
Sbjct: 257 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+QSGRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD IN  EDSETEK
Sbjct: 377 IKKSVIGSLQSQSGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDVIN--EDSETEK 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           H ESRMPAFSGVFT NQVRDVFETEME DNNSSDRDGASTIFEEAESISIGEVMKSPIFS
Sbjct: 437 HPESRMPAFSGVFTPNQVRDVFETEMEHDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSDNSYWIDLGQSPFGSDNS HLIKQKTWSPLPPSWFSGKRN RKRSPKPASRLLKS
Sbjct: 497 EDESSDNSYWIDLGQSPFGSDNSDHLIKQKTWSPLPPSWFSGKRNNRKRSPKPASRLLKS 556

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
           PMC +DKR N+RH  DSVLSFDAA+LSMSQDF CV+GIPEEEQSGEQDSCCGNVGSLRDS
Sbjct: 557 PMCSNDKRANARHRNDSVLSFDAALLSMSQDFSCVQGIPEEEQSGEQDSCCGNVGSLRDS 616

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
           H VSEIQEDSETGEES RLS ASNG  PAN TSEF DLKRSNSTTSGAF DLKESAIRRE
Sbjct: 617 HVVSEIQEDSETGEESARLSFASNGIHPANHTSEFWDLKRSNSTTSGAFNDLKESAIRRE 676

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSN 720
           TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGR VSF VEYNEKESLNEMFELGE S 
Sbjct: 677 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRRVSFRVEYNEKESLNEMFELGEASC 736

Query: 721 AAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQ 780
            AFGNEESTSDGEYVDEQEWGRRE EMICRHLDHIDMLGLNKTTLR RYLINWLVTSLLQ
Sbjct: 737 TAFGNEESTSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNKTTLRQRYLINWLVTSLLQ 796

Query: 781 LRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGV 840
           LRLPG+DDVGV LVQLYGPKIKYERGAAIAFN+KESNGRGLIHPEVVQKLAENNGI+LGV
Sbjct: 797 LRLPGQDDVGVQLVQLYGPKIKYERGAAIAFNVKESNGRGLIHPEVVQKLAENNGIALGV 856

Query: 841 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 900
           GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY
Sbjct: 857 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 916

Query: 901 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 936
           KMWAF+AKFLNPSFLENNTLSSVPESSESYHRS+F
Sbjct: 917 KMWAFVAKFLNPSFLENNTLSSVPESSESYHRSMF 948

BLAST of HG10014086.1 vs. ExPASy TrEMBL
Match: A0A0A0LIQ1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G852500 PE=4 SV=1)

HSP 1 Score: 1735.7 bits (4494), Expect = 0.0e+00
Identity = 885/935 (94.65%), Positives = 901/935 (96.36%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKKTR+RHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAK+RDIDC+SP+QDRNV
Sbjct: 17  MDKKTRKRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKTRDIDCDSPDQDRNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRS ARLHAQKEFLRATALAADRTY TED IPNLFDAFTKFLTMYPKFQTSEKIDQLR
Sbjct: 77  RRSRSFARLHAQKEFLRATALAADRTYCTEDLIPNLFDAFTKFLTMYPKFQTSEKIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT
Sbjct: 137 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK
Sbjct: 257 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+QSGRTGTGMVRILPIFPQYIGDSIDGL DVLAGI+DD IN  EDSETEK
Sbjct: 377 IKKSVIGSLQSQSGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIDDDVIN--EDSETEK 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           H ESRMPAFSGVFT NQVRDVFETEME DNNSSDRDGASTIFEEAESISIGEVMKSPIFS
Sbjct: 437 HLESRMPAFSGVFTPNQVRDVFETEMEHDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSDNSYWIDLGQSPFGSDNS HLIKQKTWSPLPPSWFSGKRN R+RSPKPASRLLKS
Sbjct: 497 EDESSDNSYWIDLGQSPFGSDNSDHLIKQKTWSPLPPSWFSGKRNNRQRSPKPASRLLKS 556

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
           PMCGDDKR N+RH  DSVLSFDAAVLSMSQDF CV+GIPEE+QSGEQDSCCGNVGSLRDS
Sbjct: 557 PMCGDDKRANARHRNDSVLSFDAAVLSMSQDFSCVEGIPEEDQSGEQDSCCGNVGSLRDS 616

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
           H VSEIQEDSETGEES RLS ASNG  P N TSEF+DLKRSNSTTSGAF DLKESAIRRE
Sbjct: 617 HVVSEIQEDSETGEESARLSFASNGIHPVNHTSEFRDLKRSNSTTSGAFNDLKESAIRRE 676

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETSN 720
           TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGR VSF VEYNEKESLNEMFELGETS 
Sbjct: 677 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRRVSFRVEYNEKESLNEMFELGETSC 736

Query: 721 AAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQ 780
            AFGNEESTSDGEYVDEQEWGRRE EMICRHLDHIDMLGLNKTTLR RYLINWLVTSLLQ
Sbjct: 737 TAFGNEESTSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNKTTLRQRYLINWLVTSLLQ 796

Query: 781 LRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGV 840
           LRLPG+DDVGVHLVQLYGPKIKYERGAAIAFN+KESNGRGLIHPEVVQKLAENNGI+LGV
Sbjct: 797 LRLPGQDDVGVHLVQLYGPKIKYERGAAIAFNVKESNGRGLIHPEVVQKLAENNGIALGV 856

Query: 841 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 900
           GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY
Sbjct: 857 GILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVY 916

Query: 901 KMWAFIAKFLNPSFLENNTLSSVPESSESYHRSLF 936
           KMWAFIAKFLNPSFLENNTLS VPES ESY  S+F
Sbjct: 917 KMWAFIAKFLNPSFLENNTLSPVPESLESYRGSMF 948

BLAST of HG10014086.1 vs. ExPASy TrEMBL
Match: A0A6J1FL08 (uncharacterized protein LOC111446707 OS=Cucurbita moschata OX=3662 GN=LOC111446707 PE=4 SV=1)

HSP 1 Score: 1672.5 bits (4330), Expect = 0.0e+00
Identity = 855/933 (91.64%), Positives = 895/933 (95.93%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKK+RRRHGSG+TERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDC+SPN D NV
Sbjct: 17  MDKKSRRRHGSGLTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCDSPNHDGNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRSLARLHAQKEFLRATALAADRTYSTED IPNLFDAFTKFLTMYPKFQ+SE+IDQLR
Sbjct: 77  RRSRSLARLHAQKEFLRATALAADRTYSTEDLIPNLFDAFTKFLTMYPKFQSSEQIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           +E+YEHLSESFSKVCLDYCGFGLFS+IQTQQFWESSAFTLSEITANL+NHALYGGAEKGT
Sbjct: 137 TEDYEHLSESFSKVCLDYCGFGLFSYIQTQQFWESSAFTLSEITANLNNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIM+YLNISENEYGLVFTVSRGSAFKLL+ESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMNYLNISENEYGLVFTVSRGSAFKLLAESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQ+AKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSV+GLFVFPVQSRVTGAK
Sbjct: 257 MAQNAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVSGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+Q GRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD INGQEDSETE 
Sbjct: 377 IKKSVIGSLQSQCGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDAINGQEDSETET 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           HQESRMPAFSGVFT+NQVRDVFETE+EQDNNSSDRDGASTIFEE ESIS+GEVMKSPIFS
Sbjct: 437 HQESRMPAFSGVFTTNQVRDVFETEIEQDNNSSDRDGASTIFEEVESISVGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTW-SPLPPSWFSGKRNTRKRSPKPASRLLK 540
           EDESSDNSYWIDLG SPFGSDNSGHLIKQKTW SPLPPSWFSGKRN+R+ SPKPASRLL+
Sbjct: 497 EDESSDNSYWIDLGHSPFGSDNSGHLIKQKTWSSPLPPSWFSGKRNSRQLSPKPASRLLR 556

Query: 541 SPMC-GDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLR 600
           SP+C GDDKR N RH +DSVLSFDAAVLS+SQD   V+GIPEEEQSGEQDSCCGNVGSL+
Sbjct: 557 SPICGGDDKRANPRHRDDSVLSFDAAVLSVSQDLCRVEGIPEEEQSGEQDSCCGNVGSLK 616

Query: 601 DSHAVSEIQEDSETGEES--GRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESA 660
           DSHAVSEIQEDSETGEES   RLS ASNG R ANQT E +DLK SNST +GA KDLKESA
Sbjct: 617 DSHAVSEIQEDSETGEESIPNRLSFASNGNRSANQTFEIRDLKLSNSTAAGALKDLKESA 676

Query: 661 IRRETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELG 720
           IRRETEGEFRLLGRRERSRFSERGFFGL +G+RA+SMGR VSFSVEYNEKESLNEMFELG
Sbjct: 677 IRRETEGEFRLLGRRERSRFSERGFFGL-DGERALSMGRRVSFSVEYNEKESLNEMFELG 736

Query: 721 ETSNAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVT 780
           E SNAAF NEES SDGEYVDEQEWGRRE EMICRHLDHIDMLGLN+TTLRLRYLINWLVT
Sbjct: 737 EASNAAFDNEESMSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNRTTLRLRYLINWLVT 796

Query: 781 SLLQLRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGI 840
           SLLQLRLPGRDDVG HLVQLYGPKIKYERGAA+AFN+KESNGRGLIHPEVVQ+LAENNGI
Sbjct: 797 SLLQLRLPGRDDVGTHLVQLYGPKIKYERGAAVAFNVKESNGRGLIHPEVVQRLAENNGI 856

Query: 841 SLGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNF 900
           SLGVGILSHVRAVDVPKQNSGQYDL+DMALCKPM NGHNRKKLFFRVEVVTASLGFLTNF
Sbjct: 857 SLGVGILSHVRAVDVPKQNSGQYDLKDMALCKPMANGHNRKKLFFRVEVVTASLGFLTNF 916

Query: 901 DDVYKMWAFIAKFLNPSFLENNTLSSVPESSES 930
           +DVYKMWAF+AKFLNPSFLEN+TLSS PE+SES
Sbjct: 917 EDVYKMWAFVAKFLNPSFLENSTLSSGPETSES 947

BLAST of HG10014086.1 vs. ExPASy TrEMBL
Match: A0A6J1IYP0 (uncharacterized protein LOC111481100 OS=Cucurbita maxima OX=3661 GN=LOC111481100 PE=4 SV=1)

HSP 1 Score: 1671.8 bits (4328), Expect = 0.0e+00
Identity = 855/933 (91.64%), Positives = 894/933 (95.82%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           MDKK+RRRHGSG+TERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDC+SPN D NV
Sbjct: 17  MDKKSRRRHGSGLTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCDSPNHDGNV 76

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
           RRSRSLARLHAQKEFLRATALAADRTYSTED IPNLFDAFTKFLTMYPKFQ+SE+IDQLR
Sbjct: 77  RRSRSLARLHAQKEFLRATALAADRTYSTEDLIPNLFDAFTKFLTMYPKFQSSEQIDQLR 136

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           +E+YEHLSESFSKVCLDYCGFGLFS+IQTQQFWESSAFTLSEITANL+NHALYGGAEKGT
Sbjct: 137 TEDYEHLSESFSKVCLDYCGFGLFSYIQTQQFWESSAFTLSEITANLNNHALYGGAEKGT 196

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIKTRIM+YLNISENEYGLVFTVSRGSAFKLL+ESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 197 IEHDIKTRIMNYLNISENEYGLVFTVSRGSAFKLLAESYPFHTNKKLLTMFDHESQSVSW 256

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           MAQ+AKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSV+GLFVFPVQSRVTGAK
Sbjct: 257 MAQNAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVSGLFVFPVQSRVTGAK 316

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+PDFIITSFYRVFGSDPTGFGCLL
Sbjct: 317 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGSDPTGFGCLL 376

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVIGSLQ+Q GRTGTGMVRILPIFPQYIGDSIDGL DVLAGIEDD INGQEDSETE 
Sbjct: 377 IKKSVIGSLQSQCGRTGTGMVRILPIFPQYIGDSIDGL-DVLAGIEDDAINGQEDSETET 436

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
           HQESRMPAFSGVFT+NQVRDVFETE+EQDNNSSDRDGASTIFEE ESIS+GEVMKSPIFS
Sbjct: 437 HQESRMPAFSGVFTTNQVRDVFETEIEQDNNSSDRDGASTIFEEVESISVGEVMKSPIFS 496

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTW-SPLPPSWFSGKRNTRKRSPKPASRLLK 540
           EDESSDNSYWIDLG SPFGSDNSGHLIKQKTW SPLPPSWFSGKRN+R+ SPKPASRLL+
Sbjct: 497 EDESSDNSYWIDLGHSPFGSDNSGHLIKQKTWSSPLPPSWFSGKRNSRQLSPKPASRLLR 556

Query: 541 SPMC-GDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLR 600
           SP+C GDDKR N RH +DSVLSFDAAVLS+SQD   V+GIPEEEQSGEQDSCCGNVGSL+
Sbjct: 557 SPICGGDDKRANPRHRDDSVLSFDAAVLSVSQDLCRVEGIPEEEQSGEQDSCCGNVGSLK 616

Query: 601 DSHAVSEIQEDSETGEE--SGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESA 660
           DSHAVSEIQEDSETGEE  S RLS ASNG R ANQT E QDLK SNST +GA KDLKESA
Sbjct: 617 DSHAVSEIQEDSETGEESISNRLSFASNGNRSANQTFEIQDLKLSNSTAAGALKDLKESA 676

Query: 661 IRRETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELG 720
           IRRETEGEFRLLGRRERSRFSERGFFGL +G+RA+SMGR VSFSVEYNEKESLNEMFELG
Sbjct: 677 IRRETEGEFRLLGRRERSRFSERGFFGL-DGERALSMGRRVSFSVEYNEKESLNEMFELG 736

Query: 721 ETSNAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVT 780
           E SNAAF NEES SDGEYVDEQEWGRRE EMICRHLDHIDMLGLN+TTLRLRYLINWLVT
Sbjct: 737 EASNAAFDNEESMSDGEYVDEQEWGRREPEMICRHLDHIDMLGLNRTTLRLRYLINWLVT 796

Query: 781 SLLQLRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGI 840
           SLLQLRLPGRDDVG HLVQLYGPKIKYERGAA+AFN+KESNGRGLIHPEVVQKLAENNGI
Sbjct: 797 SLLQLRLPGRDDVGTHLVQLYGPKIKYERGAAVAFNVKESNGRGLIHPEVVQKLAENNGI 856

Query: 841 SLGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNF 900
           SLGVGILSHVRAVDVPKQNSGQYDL+DMALCKPM NGHNRKKLFFRVEVVT SLGFLTNF
Sbjct: 857 SLGVGILSHVRAVDVPKQNSGQYDLKDMALCKPMANGHNRKKLFFRVEVVTVSLGFLTNF 916

Query: 901 DDVYKMWAFIAKFLNPSFLENNTLSSVPESSES 930
           +DVYKMWAF+AKFLNPSFLE++TLSS PE+SES
Sbjct: 917 EDVYKMWAFVAKFLNPSFLESSTLSSGPETSES 947

BLAST of HG10014086.1 vs. TAIR 10
Match: AT2G23520.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 995.3 bits (2572), Expect = 3.2e-290
Identity = 557/938 (59.38%), Positives = 704/938 (75.05%), Query Frame = 0

Query: 1   MDK-KTRRRHGSG--VTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQD 60
           +DK K+RRR GS   +  RRK+S+LR+L E+KLR+ALEEASE+GSL KS+D+  E+ NQD
Sbjct: 17  LDKSKSRRRDGSDSPIDVRRKASMLRKLYEDKLRDALEEASENGSLFKSQDV--ENENQD 76

Query: 61  RNVRRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKID 120
            ++ RSRSLARLHAQ+EFLRATALAA+R + +ED IP L +AF KFLTMYPKF+TSEK+D
Sbjct: 77  ESLGRSRSLARLHAQREFLRATALAAERAFESEDDIPELLEAFNKFLTMYPKFETSEKVD 136

Query: 121 QLRSEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAE 180
           QLRS+EY HL +  SKVCLDYCGFGLFS++QT  +W+S  F+LSEITANLSNHALYGGAE
Sbjct: 137 QLRSDEYGHLLD--SKVCLDYCGFGLFSYVQTLHYWDSCTFSLSEITANLSNHALYGGAE 196

Query: 181 KGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQS 240
            GT+EHD+KTRIMDYLNI E+EYGLVFT SRGSAF+LL+ESYPFHTNK+LLTMFDHESQS
Sbjct: 197 IGTVEHDLKTRIMDYLNIPESEYGLVFTGSRGSAFRLLAESYPFHTNKRLLTMFDHESQS 256

Query: 241 VSWMAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVT 300
           V+WMAQ+A+E+GAK Y+AWFKWPTL+LCS +L+K++++K+++KKDS  GLFVFP QSRVT
Sbjct: 257 VNWMAQTAREKGAKAYNAWFKWPTLKLCSTDLKKRLSHKKRKKKDSAVGLFVFPAQSRVT 316

Query: 301 GAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFG 360
           G+KYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLF+P+FIITSFY+VFG DPTGFG
Sbjct: 317 GSKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPEFIITSFYKVFGHDPTGFG 376

Query: 361 CLLIKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEI--NGQED 420
           CLLIKKSV+G+LQ+QSG+TG+G+V+I P +P Y+ DSIDGL D L G+ED +I  NG + 
Sbjct: 377 CLLIKKSVMGNLQSQSGKTGSGIVKITPQYPLYLSDSIDGL-DGLVGLEDHDIGTNGDKP 436

Query: 421 SETE-KHQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDG-ASTIFEEAESISIGEV 480
           + T+   + ++MP FSG +TS QVRDVFET++ +D N+SDRDG +STIFEE ES+S+GE+
Sbjct: 437 ATTDAARRGAQMPVFSGAYTSAQVRDVFETDLLED-NASDRDGTSSTIFEENESVSVGEL 496

Query: 481 MKSPIFSEDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKP 540
           MKSP FSEDESSDNS+WIDLGQSP GSD++GHL   K  SPLPP WF+ KR    +SPKP
Sbjct: 497 MKSPAFSEDESSDNSFWIDLGQSPLGSDSAGHLNHHKIASPLPPFWFTSKR----QSPKP 556

Query: 541 ASRLLKSPMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGN 600
            ++   SPM  D K          VLSFDAAV+S++Q+   +   P              
Sbjct: 557 VAKSYSSPMY-DGK---------DVLSFDAAVMSVTQE---INSTPSR------------ 616

Query: 601 VGSLRDSHAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLK 660
             +LR+S+ + +IQE  E  E  G +   +         S F     SN ++S    D+K
Sbjct: 617 --NLRNSNNL-QIQEIQE--ENCGNIVYRAG--------SGF----GSNGSSSKISSDMK 676

Query: 661 ESAIRRETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMF 720
           ++AIRRETEGEFRLLGRR     +     GLE  D   S G  VSF++     + ++   
Sbjct: 677 DNAIRRETEGEFRLLGRRG----TGGRLLGLE--DEQPSRGTRVSFNM-----DRVSHSL 736

Query: 721 ELGETSNAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINW 780
           + GE S A+  +E   SDGE  +E +W RRE E++C H+DH++MLGLNKTT RLR+LINW
Sbjct: 737 DQGEASLASVYDE---SDGENPNEDDWDRREPEIVCSHIDHVNMLGLNKTTSRLRFLINW 796

Query: 781 LVTSLLQLRLPGRDDVG----VHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQK 840
           LV SLLQL++P     G    ++LVQ+YGPKIKYERGAA+AFN+K+ + +G + PE+V K
Sbjct: 797 LVISLLQLKVPEPGSDGSSRYMNLVQIYGPKIKYERGAAVAFNVKDKS-KGFVSPEIVLK 856

Query: 841 LAENNGISLGVGILSHVRAVDVPKQNSGQYDL-EDMALCKPMGNG-HNRKKLFFRVEVVT 900
           LAE  G+SLG+GILSH+R +D+P+ + G   + ED +L      G    K  F R EVVT
Sbjct: 857 LAEREGVSLGIGILSHIRIMDLPRNHRGGARIKEDSSLHLQREAGKRGGKNGFVRFEVVT 887

Query: 901 ASLGFLTNFDDVYKMWAFIAKFLNPSFLENNTLSSVPE 926
           ASL FL+NF+DVYK+WAF+AKFLNP F    +L +V E
Sbjct: 917 ASLSFLSNFEDVYKLWAFVAKFLNPGFSREGSLPTVIE 887

BLAST of HG10014086.1 vs. TAIR 10
Match: AT4G37100.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 977.2 bits (2525), Expect = 9.0e-285
Identity = 550/942 (58.39%), Positives = 682/942 (72.40%), Query Frame = 0

Query: 6   RRRHG--SGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNVRRS 65
           RRR G  S +  ++K++++R+L E+KLREALEEASE+GSL KS+DID    N D ++ RS
Sbjct: 25  RRRDGSDSSLNVKKKAALIRKLYEDKLREALEEASENGSLFKSQDID--QDNGDGSLGRS 84

Query: 66  RSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLRSEE 125
           RSLARLHAQ+EFLRATALAA+R   +ED IP L +A TKFL+MYPK+Q SEKIDQLRS+E
Sbjct: 85  RSLARLHAQREFLRATALAAERIIESEDSIPELREALTKFLSMYPKYQASEKIDQLRSDE 144

Query: 126 YEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGTIEH 185
           Y HLS S SKVCLDYCGFGLFS++QT  +W++  F+LSEITANLSNHALYGGAE GT+EH
Sbjct: 145 YSHLSSSASKVCLDYCGFGLFSYVQTLHYWDTCTFSLSEITANLSNHALYGGAESGTVEH 204

Query: 186 DIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSWMAQ 245
           DIKTRIMDYLNI ENEYGLVFTVSRGSAF+LL+ESYPF +NK+LLTMFDHESQSV+WMAQ
Sbjct: 205 DIKTRIMDYLNIPENEYGLVFTVSRGSAFRLLAESYPFQSNKRLLTMFDHESQSVNWMAQ 264

Query: 246 SAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAKYSY 305
           +A+E+GAK Y+AWFKWPTL+LCS +L+K+++ K+++KKDS  GLFVFP QSRVTG KYSY
Sbjct: 265 TAREKGAKAYNAWFKWPTLKLCSTDLKKRLSYKKRKKKDSAVGLFVFPAQSRVTGTKYSY 324

Query: 306 QWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLLIKK 365
           QWMALAQQN+WHVLLDAGSLGPKDMDSLGLSLF+P+FIITSFYRVFG DPTGFGCLLIKK
Sbjct: 325 QWMALAQQNHWHVLLDAGSLGPKDMDSLGLSLFRPEFIITSFYRVFGHDPTGFGCLLIKK 384

Query: 366 SVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEKHQE 425
           SV+GSLQ+QSG+TG+G+V+I P +P Y+ DS+DGL D L G ED      +D   E H+ 
Sbjct: 385 SVMGSLQSQSGKTGSGIVKITPEYPLYLSDSVDGL-DGLVGFEDH----NDDKTKEAHRP 444

Query: 426 -SRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDG--ASTIFEEAESISIGEVMKSPIFS 485
            ++MPAFSG +TS QVRDVFETE+ +DN SSDRDG  ++TIFEE ES+S+GE+MKSP+FS
Sbjct: 445 GTQMPAFSGAYTSAQVRDVFETELLEDNISSDRDGTTSTTIFEETESVSVGELMKSPVFS 504

Query: 486 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKR-SPKPASRLLK 545
           EDESSDNS+WIDLGQSP GSD        K  SPLPP W + KR  ++R SPKP  +   
Sbjct: 505 EDESSDNSFWIDLGQSPLGSDQ-----HNKIASPLPPIWLTNKRKQKQRQSPKPIPKSYS 564

Query: 546 SPMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRD 605
           SP+          +  + VLSFDAAV+S++             + G   +   N  S  +
Sbjct: 565 SPL----------YDGNDVLSFDAAVMSVT-------------EHGTNSTPSRNRRSSSN 624

Query: 606 SHAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRR 665
              V EIQE     E  G     +NG + +N +SE                 +KESAIRR
Sbjct: 625 HLHVQEIQE-----ENCGH--SFANGLKSSNISSE-----------------IKESAIRR 684

Query: 666 ETEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSFSVEYNEKESLNEMFELGETS 725
           ETEGEFRLLG R+  R       G+E  D   S GR VSF++E       + + E GE S
Sbjct: 685 ETEGEFRLLGGRDGGR---SRLLGVE--DEHPSKGRRVSFNME----RVSHSIVEPGEAS 744

Query: 726 NAAFGNEE--STSDGEYVDEQ----EWGRR--ESEMICRHLDHIDMLGLNKTTLRLRYLI 785
            A+  +E+  +TSD E  D++    EW RR  E+E++CRH+DH++MLGLNKTT RLR+LI
Sbjct: 745 LASVYDEDYINTSDVENGDDEGADDEWDRRDTETEIVCRHIDHVNMLGLNKTTTRLRFLI 804

Query: 786 NWLVTSLLQLRLPGRDDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLA 845
           NWLV SLLQL++P      ++LVQ+YGPKIKYERGAA+AFN+++ + +G + PE+VQ+L 
Sbjct: 805 NWLVISLLQLQVPESGGRHMNLVQIYGPKIKYERGAAVAFNVRDKS-KGFVSPEIVQRLG 864

Query: 846 ENNGISLGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLG 905
           +  G+SLG+GILSH+R VD   +N      ED AL      G N    F R EVVTASL 
Sbjct: 865 DREGVSLGIGILSHIRIVDEKPRNHRARTKEDSALHLQNEAGKNG---FIRFEVVTASLS 894

Query: 906 FLTNFDDVYKMWAFIAKFLNPSFLENNTLSSVPESSESYHRS 934
           FLTNF+DVYK+W F+AKFLNP F    +L +V E  E    S
Sbjct: 925 FLTNFEDVYKLWVFVAKFLNPGFSREGSLPTVEEEEEEAENS 894

BLAST of HG10014086.1 vs. TAIR 10
Match: AT5G66950.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 959.5 bits (2479), Expect = 1.9e-279
Identity = 544/932 (58.37%), Positives = 681/932 (73.07%), Query Frame = 0

Query: 1   MDKKTRRRHGSGVTERRKSSILRQLQENKLREALEEASEDGSLAKSRDIDCESPNQDRNV 60
           +DKK+     S  + R +    R+L E+KLREALE+ASEDG L KS+D++ E  +QD+ +
Sbjct: 18  LDKKS--SGSSSSSSRNRDVTQRKLHESKLREALEQASEDGLLVKSQDMEEEDESQDQIL 77

Query: 61  RRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQLR 120
            RSRSLARL+AQ+EFLRAT+LAA R + +E+ +P L +A T FLTMYPK+Q+SEK+D+LR
Sbjct: 78  GRSRSLARLNAQREFLRATSLAAQRAFESEETLPELEEALTIFLTMYPKYQSSEKVDELR 137

Query: 121 SEEYEHLSESFSKVCLDYCGFGLFSHIQTQQFWESSAFTLSEITANLSNHALYGGAEKGT 180
           ++EY HL  S  KVCLDYCGFGLFS++QT  +W++  F+LSEI+ANLSNHA+YGGAEKG+
Sbjct: 138 NDEYFHL--SLPKVCLDYCGFGLFSYLQTVHYWDTCTFSLSEISANLSNHAIYGGAEKGS 197

Query: 181 IEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLLTMFDHESQSVSW 240
           IEHDIK RIMDYLNI ENEYGLVFTVSRGSAFKLL+ESYPFHTNKKLLTMFDHESQSVSW
Sbjct: 198 IEHDIKIRIMDYLNIPENEYGLVFTVSRGSAFKLLAESYPFHTNKKLLTMFDHESQSVSW 257

Query: 241 MAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLFVFPVQSRVTGAK 300
           M Q AKE+GAKV SAWFKWPTLRLCS +L+K+I +K+KRKKDS  GLFVFPVQSRVTG+K
Sbjct: 258 MGQCAKEKGAKVGSAWFKWPTLRLCSMDLKKEILSKKKRKKDSATGLFVFPVQSRVTGSK 317

Query: 301 YSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRVFGSDPTGFGCLL 360
           YSYQWMALAQQNNWHVLLDAG+LGPKDMDSLGLSLF+PDFIITSFYRVFG DPTGFGCLL
Sbjct: 318 YSYQWMALAQQNNWHVLLDAGALGPKDMDSLGLSLFRPDFIITSFYRVFGYDPTGFGCLL 377

Query: 361 IKKSVIGSLQNQSGRTGTGMVRILPIFPQYIGDSIDGLVDVLAGIEDDEINGQEDSETEK 420
           IKKSVI  LQ+QSG+T +G+V+I P +P Y+ DS+DGL + L GI+D   NG   +   K
Sbjct: 378 IKKSVISCLQSQSGKTSSGIVKITPEYPLYLSDSMDGL-EGLTGIQD---NGIAINGDNK 437

Query: 421 HQESRMPAFSGVFTSNQVRDVFETEMEQDNNSSDRDGASTIFEEAESISIGEVMKSPIFS 480
              +++PAFSG +TS QV+DVFET+M+ +   SDRD  S +FEEAESIS+GE++KSP+FS
Sbjct: 438 ALGTQLPAFSGAYTSAQVQDVFETDMDHE-IGSDRDNTSAVFEEAESISVGELIKSPVFS 497

Query: 481 EDESSDNSYWIDLGQSPFGSDNSGHLIKQKTWSPLPPSWFSGKRNTRKRSPKPASRLLKS 540
           EDESSD+S WIDLGQSP  SDN+GHL KQK  SPL       K + R+ SPKPAS+    
Sbjct: 498 EDESSDSSLWIDLGQSPADSDNAGHLNKQK--SPL----LVRKNHKRRSSPKPASK---- 557

Query: 541 PMCGDDKRMNSRHYEDSVLSFDAAVLSMSQDFGCVKGIPEEEQSGEQDSCCGNVGSLRDS 600
               ++     RH    VLSFDAAVLS+S + G       EE   E++S    + + R  
Sbjct: 558 ---ANNGSNGGRH----VLSFDAAVLSVSHEVG-------EEVIEEENSEMNQIDTSRRL 617

Query: 601 HAVSEIQEDSETGEESGRLSVASNGTRPANQTSEFQDLKRSNSTTSGAFKDLKESAIRRE 660
             V+EI+E+ E G  S +L+  +NG                  ++SG    +K+SAIRRE
Sbjct: 618 R-VTEIEEEEEEG-GSSKLTAHANG------------------SSSG----IKDSAIRRE 677

Query: 661 TEGEFRLLGRRERSRFSERGFFGLEEGDRAISMGRHVSF-SVEYNEKESLNEMFELGETS 720
           TEGEFRLLGRRE+S+++  G   + E +      R VSF SV++            GE S
Sbjct: 678 TEGEFRLLGRREKSQYN-GGRLLVNEDEHPSK--RRVSFRSVDH------------GEAS 737

Query: 721 NAAFGNEESTSDGEYVDEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLL 780
             + G+E+   DG    E +  +RE E++CRH+DH++MLGLNKTT RLRYLINWLVTSLL
Sbjct: 738 VISLGDEDEEEDGSNGVEWDDDQREPEIVCRHIDHVNMLGLNKTTSRLRYLINWLVTSLL 797

Query: 781 QLRLPGRDDVGVH--LVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGIS 840
           QLRLP  D  G H  LVQ+YGPKIKYERG+++AFNI++    G++HPE+VQKLAE  GIS
Sbjct: 798 QLRLPRSDSDGEHKNLVQIYGPKIKYERGSSVAFNIRDLKS-GMVHPEIVQKLAEREGIS 857

Query: 841 LGVGILSHVRAVDVPKQNSGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFD 900
           LG+G LSH++ +D   ++S  +        KP+ +   R   F RVEVVTASLGFLTNF+
Sbjct: 858 LGIGYLSHIKIIDNRSEDSSSW--------KPV-DREGRNNGFIRVEVVTASLGFLTNFE 867

Query: 901 DVYKMWAFIAKFLNPSFLENNTLSSVPESSES 930
           DVY++W F+AKFL+P F +  TL +V E  +S
Sbjct: 918 DVYRLWNFVAKFLSPGFAKQGTLPTVIEEDDS 867
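
The percentages in the HSP summary line of the alignment above follow directly from its residue counts (identical or positive residues divided by the aligned length). The following is a minimal Python sketch, assuming the summary line keeps the `Identity = a/b (p%)` layout shown in this record; the helper name `parse_hsp_summary` is hypothetical and is not part of any BLAST tool.

```python
import re

# Hypothetical helper, not a BLAST API. It assumes a summary line such as
# "Identity = 544/932 (58.37%), Positives = 681/932 (73.07%), Query Frame = 0".
# The pattern accepts the label spelled "Positives" or "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return the counts and recomputed percentages from one HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Recomputed from the counts, rounded like the reported values.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Percentages exactly as printed in the line, for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }

hsp = parse_hsp_summary(
    "Identity = 544/932 (58.37%), Positives = 681/932 (73.07%), Query Frame = 0"
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed values with the `reported` tuple is a quick consistency check when summary lines are copied between tools.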

BLAST of HG10014086.1 vs. TAIR 10
Match: AT5G51920.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein )

HSP 1 Score: 287.0 bits (733), Expect = 5.6e-77
Identity = 147/345 (42.61%), Positives = 220/345 (63.77%), Query Frame = 0

Query: 52  ESPNQDRNVRRSRSLARLHAQKEFLRATALAADRTYSTEDFIPNLFDAFTKFLTMYPKFQ 111
           E P        S +L R  AQ      + +  D  ++  + +P+  ++F+ F+  YP + 
Sbjct: 31  EHPPHSTPTVTSATLRRNFAQ---TTVSTIFPDTEFTDPNSLPSHQESFSDFIQAYPNYS 90

Query: 112 TSEKIDQLRSEEYEHLSESFSKVCLDYCGFGLFSHIQ-----------TQQFWESSAFTL 171
            + KID+LRS+ Y HL  S    CLDY G GL+S+ Q           +    ES  F++
Sbjct: 91  DTYKIDRLRSDHYFHLGLS-HYTCLDYIGIGLYSYSQLLNYDPSTYQISSSLSESPFFSV 150

Query: 172 SEITANLSNHALYGGAEKGTIEHDIKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYP 231
           S    NL    L  G ++   E+ +K RIM +L ISE +Y +VFT +R SAF+L++ESYP
Sbjct: 151 SPKIGNLKEKLLNDGGQETEFEYSMKRRIMGFLKISEEDYSMVFTANRTSAFRLVAESYP 210

Query: 232 FHTNKKLLTMFDHESQSVSWMAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRK 291
           F++ +KLLT++D+ES++VS + + +++RGAKV +A F WP L+LCS +LRK +T  +   
Sbjct: 211 FNSKRKLLTVYDYESEAVSEINRVSEKRGAKVAAAEFSWPRLKLCSSKLRKLVTAGKNGS 270

Query: 292 KDSVAGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDF 351
           K    G++VFP+ SRVTG++Y Y WM++AQ+N WHV++DA  LGPKDMDS GLS++ PDF
Sbjct: 271 KTKKKGIYVFPLHSRVTGSRYPYLWMSVAQENGWHVMIDACGLGPKDMDSFGLSIYNPDF 330

Query: 352 IITSFYRVFGSDPTGFGCLLIKKSVIGSLQNQSGRTGTGMVRILP 386
           ++ SFY+VFG +P+GFGCL +KKS I  L++    TG GM+ ++P
Sbjct: 331 MVCSFYKVFGENPSGFGCLFVKKSTISILESS---TGPGMINLVP 368


HSP 2 Score: 116.7 bits (291), Expect = 1.0e-25
Identity = 72/185 (38.92%), Positives = 104/185 (56.22%), Query Frame = 0

Query: 736 DEQEWGRRESEMICRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQLRLPGRDDVGVHLVQ 795
           D +E     S +  + LDH+D LGL  T  R R LINWLV++L +L    +      LV+
Sbjct: 386 DSEETYSFSSSVEYKGLDHVDSLGLVATGNRSRCLINWLVSALYKL----KHSTTSRLVK 445

Query: 796 LYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGVGILSHVRAVDVPKQN 855
           +YGPK+ + RG A+AFN+    G   I P +VQKLAE + ISLG   L ++         
Sbjct: 446 IYGPKVNFNRGPAVAFNLFNHKGE-KIEPFIVQKLAECSNISLGKSFLKNILF------- 505

Query: 856 SGQYDLEDMALCKPMGNGHNRKKLFFRVEVVTASLGFLTNFDDVYKMWAFIAKFLNPSFL 915
             Q D E +   +      NR     R+ V+TA+LGFL NF+DVYK+W F+A+FL+  F+
Sbjct: 506 --QEDYEGVK-DRVFEKKRNRDVDEPRISVLTAALGFLANFEDVYKLWIFVARFLDSEFV 555

Query: 916 ENNTL 921
           +  ++
Sbjct: 566 DKESV 555

BLAST of HG10014086.1 vs. TAIR 10
Match: AT4G22980.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: Pyridoxal phosphate (PLP)-dependent transferases superfamily protein (TAIR:AT5G51920.1); Has 520 Blast hits to 468 proteins in 130 species: Archae - 5; Bacteria - 23; Metazoa - 99; Fungi - 131; Plants - 231; Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink). )

HSP 1 Score: 220.7 bits (561), Expect = 4.9e-57
Identity = 128/325 (39.38%), Positives = 197/325 (60.62%), Query Frame = 0

Query: 63  SRSLARLHAQKEFLRATA----LAADRTYSTEDFIPNLFDAFTKFLTMYPKFQTSEKIDQ 122
           S S++    + EF   T     L  +  +++++ +P L  +F   +T +P +  + + D 
Sbjct: 24  SHSMSEKPEELEFSVTTTGTSFLTRNTKFTSQESLPRLRTSFYDLITAFPDYLQTNQADH 83

Query: 123 LRSEEYEHLSESFSKVCLDYCGFG----LFSHIQTQQFWESSAFTLSEITANLSNHALYG 182
           LRS EY++LS S S V      FG    LFS+ Q ++  ES +  L+     LS   +  
Sbjct: 84  LRSTEYQNLSSS-SHV------FGQQQPLFSYSQFREISESES-DLNHSLLTLSCKQVSS 143

Query: 183 GAEKGTIEHD------IKTRIMDYLNISENEYGLVFTVSRGSAFKLLSESYPFHTNKKLL 242
           G E  + E +      I+ RI  ++N+ E+EY ++ T  R SAFK+++E Y F TN  LL
Sbjct: 144 GKELLSFEEESRFQSRIRKRITSFMNLEESEYHMILTQDRSSAFKIVAELYSFKTNPNLL 203

Query: 243 TMFDHESQSVSWMAQSAKERGAKVYSAWFKWPTLRLCSRELRKQITNKRKRKKDSVAGLF 302
           T++++E ++V  M + ++++G K  SA F WP+  + S +L+++IT  ++R K    GLF
Sbjct: 204 TVYNYEDEAVEEMIRISEKKGIKPQSAEFSWPSTEILSEKLKRRITRSKRRGK---RGLF 263

Query: 303 VFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFKPDFIITSFYRV 362
           VFP+QS VTGA YSY WM+LA+++ WHVLLD  +LG KDM++LGLSLF+PDF+I SF  V
Sbjct: 264 VFPLQSLVTGASYSYSWMSLARESEWHVLLDTSALGSKDMETLGLSLFQPDFLICSFTEV 323

Query: 363 FG-SDPTGFGCLLIKKSVIGSLQNQ 373
            G  DP+GFGCL +KKS   +L  +
Sbjct: 324 LGQDDPSGFGCLFVKKSSSTALSEE 337


HSP 2 Score: 95.9 bits (237), Expect = 1.8e-19
Identity = 67/192 (34.90%), Positives = 101/192 (52.60%), Query Frame = 0

Query: 728 STSDGEYVDEQEWGRRESEMI-CRHLDHIDMLGLNKTTLRLRYLINWLVTSLLQLRLPGR 787
           STS  E V+ +   +++  MI  + LDH D LGL   + R + L  WL+ +L  L+ PG 
Sbjct: 378 STSSSEIVEIESSVKQDKAMIEFQGLDHADSLGLILISRRSKSLTLWLLRALRTLQHPGY 437

Query: 788 DDVGVHLVQLYGPKIKYERGAAIAFNIKESNGRGLIHPEVVQKLAENNGISLGVGILSHV 847
               + LV+LYGPK K  RG +I+FNI +  G   + P +V++LAE   I L    L   
Sbjct: 438 HQTEMPLVKLYGPKTKPSRGPSISFNIFDWQGE-KVDPLMVERLAEREKIGLRCAYLHKF 497

Query: 848 RAVDVPKQNSGQYDLEDMALCKPMGN-GHNRKKLFFRVEVVTASL-GFLTNFDDVYKMWA 907
           R                      +GN   + + +  R+ VVT  L GF+TNF+DV+K+W 
Sbjct: 498 R----------------------IGNKRRSDEAVSLRLSVVTVRLGGFMTNFEDVFKVWE 546

Query: 908 FIAKFLNPSFLE 917
           F+++FL+  F+E
Sbjct: 558 FVSRFLDADFVE 546

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038899790.1 | 0.0e+00 | 96.58 | uncharacterized protein LOC120087021 [Benincasa hispida]
XP_008454669.1 | 0.0e+00 | 94.97 | PREDICTED: uncharacterized protein LOC103495022 [Cucumis melo] >KAA0057170.1 unc...
XP_011652392.1 | 0.0e+00 | 94.65 | uncharacterized protein LOC101215138 [Cucumis sativus] >KGN59901.1 hypothetical ...
KAG6608507.1 | 0.0e+00 | 91.64 | Molybdenum cofactor sulfurase, partial [Cucurbita argyrosperma subsp. sororia]
XP_022941406.1 | 0.0e+00 | 91.64 | uncharacterized protein LOC111446707 [Cucurbita moschata]

Match Name | E-value | Identity | Description
Q8LGM7 | 1.1e-16 | 23.69 | Molybdenum cofactor sulfurase OS=Solanum lycopersicum OX=4081 GN=FLACCA PE=2 SV=...
Q9C5X8 | 1.6e-15 | 25.31 | Molybdenum cofactor sulfurase OS=Arabidopsis thaliana OX=3702 GN=ABA3 PE=1 SV=1
Q16P90 | 5.6e-13 | 26.57 | Molybdenum cofactor sulfurase 3 OS=Aedes aegypti OX=7159 GN=mal3 PE=3 SV=1
Q16GH0 | 4.7e-12 | 25.93 | Molybdenum cofactor sulfurase 1 OS=Aedes aegypti OX=7159 GN=mal1 PE=3 SV=1
A2VD33 | 4.7e-12 | 25.78 | Molybdenum cofactor sulfurase OS=Danio rerio OX=7955 GN=mocos PE=2 SV=2

Match Name | E-value | Identity | Description
A0A5A7UPY9 | 0.0e+00 | 94.97 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BZ97 | 0.0e+00 | 94.97 | uncharacterized protein LOC103495022 OS=Cucumis melo OX=3656 GN=LOC103495022 PE=...
A0A0A0LIQ1 | 0.0e+00 | 94.65 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G852500 PE=4 SV=1
A0A6J1FL08 | 0.0e+00 | 91.64 | uncharacterized protein LOC111446707 OS=Cucurbita moschata OX=3662 GN=LOC1114467...
A0A6J1IYP0 | 0.0e+00 | 91.64 | uncharacterized protein LOC111481100 OS=Cucurbita maxima OX=3661 GN=LOC111481100...

Match Name | E-value | Identity | Description
AT2G23520.1 | 3.2e-290 | 59.38 | Pyridoxal phosphate (PLP)-dependent transferases superfamily protein
AT4G37100.1 | 9.0e-285 | 58.39 | Pyridoxal phosphate (PLP)-dependent transferases superfamily protein
AT5G66950.1 | 1.9e-279 | 58.37 | Pyridoxal phosphate (PLP)-dependent transferases superfamily protein
AT5G51920.1 | 5.6e-77 | 42.61 | Pyridoxal phosphate (PLP)-dependent transferases superfamily protein
AT4G22980.1 | 4.9e-57 | 39.38 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR015421 | Pyridoxal phosphate-dependent transferase, major domain | GENE3D | 3.40.640.10 | | coord: 154..404; e-value: 2.9E-18; score: 67.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 616..650
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 517..549
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..63
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 601..655
None | No IPR available | PANTHER | PTHR14237 | MOLYBDOPTERIN COFACTOR SULFURASE MOSC | coord: 19..928
None | No IPR available | PANTHER | PTHR14237:SF76 | OS03G0765800 PROTEIN | coord: 19..928
IPR015424 | Pyridoxal phosphate-dependent transferase | SUPERFAMILY | 53383 | PLP-dependent transferases | coord: 134..378
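
Two of the MobiDB-lite disorder predictions listed above overlap (601..655 contains 616..650), so summing the four ranges would double-count residues. The sketch below merges the ranges before counting. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and `merge_intervals` is a hypothetical helper written for illustration.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        # With inclusive coordinates, a range starting at prev_end + 1 is adjacent.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(iv) for iv in merged]

# MobiDB-lite coordinates copied from the InterPro table (assumed 1-based, inclusive).
disorder = [(616, 650), (517, 549), (1, 63), (601, 655)]

merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
print(merged)   # [(1, 63), (517, 549), (601, 655)]
print(covered)  # 151
```

Under these assumptions, 151 residues of the polypeptide fall in predicted disordered regions.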

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
HG10014086 | HG10014086 | gene


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
HG10014086.1-cds | HG10014086.1-cds-Chr02:7444141..7446948 | CDS


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
HG10014086.1 | HG10014086.1-protein | polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003824 | catalytic activity