HG10005098 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10005098
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein SIEVE ELEMENT OCCLUSION B
Location: Chr08: 22860908 .. 22863584 (-)
RNA-Seq Expression: HG10005098
Synteny: HG10005098
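
The Location field packs a chromosome name, a start and end coordinate, and a strand into one string. The sketch below shows one way to split it apart. It assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr08: 22860908 .. 22863584 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("Chr08: 22860908 .. 22863584 (-)")
print(loc)
```

Under that assumption the feature spans 2677 bases on the minus strand of Chr08.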
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTACTGGAGTACGTAAGTTGAGTCTCATCAAACCAGACCGTCAGCTATTCGCAGCAGGTGACGAAAATGCCTTGACAAAGCAAGTTTTGGCCACGCATTCTGAGGAACCCCTCGAGTTTCCTGTCACCCCTTTGCTCAGCCTCGTTGAACAAATTTTCCTTCGAGCTAAACTCAATACTCACCAGGCACATAAATATGTTTTAACAAGTGTTTTCTCCAAATCATCTCTTTGGTTTAAAAGGAAATTCACTTAGCTCATTGATTTGATTTTGTCTCCAGGGAACAACTCGAGCTCAGCTGGAGGCAATTGAAGACAATTCCCCAAGCCCAACAGACTTGCTGGACTTGCTGGATTTTGTATCATTTACTATCAATAAAGTTTCCAATGAGGTTGTTGTTTAGAGTTTCTTCTTTCTTTTGGGTATACAAAAAGTCTGTCTTTTCAGAGATCCATTTTCTTTGCTTGTTTTGAATGATCACAGATACAGTACAAGTGTTCAGGAGCAGGGGATCCCCATACTGTGACTATGGAAGTGTTTAATTTGTTATCAAGCTGGCCATGGGATGCTAAGGTGGTGCTGGCCTTGGCTGCATTTGCCATCAACTATGGAGAGTTTTGGCTATTGGTTCAACAATCCTCAACAGACTTACTCGCCAAAGACATCTCGCTCCTCAAAAAACTCCCAGAAATATTCGAGAGGGTCGACATTGTGAAGCAAAAATTTGAAGCACTTGACAAACTCATCAAGGCACTCGTGGATGTAGCCAAGTGCATTGTTGATTGCAAGATGCTTCCTCCCCATTACATTACTCCAGACACGCCTGAAATGAAGAGTGCAACCACTCTTATTCCAACAGCTATTTATTGGACAATCAGAAGCATTGTCGCCTGTGCTGCACAGAATGCAGGCCTTATTGGAGTTGGCCATGAGTATGCCACCGTCTTCTACTTTTGATAATCTAACTTAGCTAACTACTTAACCATACTTAGTTCCTTCTCTGCAACATTTCATCAGGTATTTAGCATCAGCATCTGAAACATGGGAGCTGTCTAGTTTGGCCCATAAGATCGACAACATCCGCAAGCACCTTGAACAACTGCTTCTTGCTTGTCATCATTACATAAGTGAGGCTTTTAGTGCTCTCTAAACTTCAATATGACCAGGTAGTTGAGACTGAGTTAATATTTTACCCTGTTTATGTTTGTAATTGATACACAGATGAGAAGATGCATCATGAAGCATATATGAACCTGGTCCGCCTTTTCGAGATACCCCACATTGACAACAACAAGATTCTGAGGGCTTTGATTTACTCCAAGGATGATAAGCCACCCCTCATCGATGGTTTAAGCAAGGAAAAGGTCAGCACTTTCCCCTTTTCTAACAAACATTCTACACTATCTACACGTCTCTATAATTCCTCTTGGTTTTAATACCCTCTATCAAACATTACTTGGGTCTTCTTTAATCACATAATAGGCTACCCTCGAAGTTCTAAGAAAGAAAAACGTGCTGCTTCTCATCTCTGACCTGGACATATCGATAGTGGAGCTTTCAATGCTAGACCAAATCTATAGAGAATCAAGACACAACAAAACAAGAGCAGAGAGCGATTACGAGGTGGTGTGGATGCCAATTGTGGAGCCTACATGGACAGAAGAGAAACAAGTGAAATTCGAAGCGTTGTTGGGTTTGATGCCATGGTACTCGGTAGCACATCCTTCACTAATCGAATCCGCCGTCGTTAAGTACGTGAGACAGGTATGGAACTTCATAAAAAAGCCTCTATTGGTGGTTTTGGACCCTCAAGGCAAAGTGGTTAATACCAACGCCGTCCATATGCTCTGGATTTGGGGAAGCTTGGCCTACCCTTTCACAAGCGCTCGAGAGGAATCACTTTGGAAAGAAGAGACTTGGCAACTTGAGCTTTTAGTCGATTCAGTCGAACCTCTCATCTTCCAATGGGTAATAAACCTCAACCCATTTTTCCAATG
CTTGATCTTCTCAGTTTCCTGACTCCTTTTTTTTCTGTAATGTGGTGAATTTTCAGAAGGAAACAGGGAAATACATTTGCATTATTGGAGGGGAAGATTTGGGATGGATAAGAAGCTTCAGCTCAAAGGCAAAATCAGTAGCCAATGATGCAGGGATAGAGCTGGAGATACTGTACGTGGGGAAGAGCAACCCTGGGGAGAAAATAAGGAAGAACATAGCCGCAATCTTAGCAGATAAAATAATTCATACACTGGTAGATCCAACCCTCATTTGGTTCTTCTGGGTGAGGCTAGAAAGCATGTGGTACTCAAAAACACAAAGAGGAAACACAATTGAAGATGATCCAGTAATGCAAGAGACGATGACGATGTTGAGTTTCGACAGTGGAGACCAGGGATGGGCCTTGTTCTGCAAAGGCTCAACCGACATCCTTCGAGCCAAAGCCGAGACCATAACCAATGTGGTGGATGGTTATGAAGAGCGTTGGAAGATCCATGTGAAGGAGGAAGGATTTATACCTGCTATGAGTAAAGACCTGGAACATATCCATACTCCTGAGCATTGCAACCGTCTGATTCTTCCTTCTTCCAATGGCACCATTCCAGAGAAGGTGGTTTGTTCTGAATGTGGTAGTGCCATGGAAAAGTTCATCATGTATCGCTGCTGCAACGACTAA

mRNA sequence

ATGGCTACTGGAGTACGTAAGTTGAGTCTCATCAAACCAGACCGTCAGCTATTCGCAGCAGGTGACGAAAATGCCTTGACAAAGCAAGTTTTGGCCACGCATTCTGAGGAACCCCTCGAGTTTCCTGTCACCCCTTTGCTCAGCCTCGTTGAACAAATTTTCCTTCGAGCTAAACTCAATACTCACCAGGCACATAAATATGGAACAACTCGAGCTCAGCTGGAGGCAATTGAAGACAATTCCCCAAGCCCAACAGACTTGCTGGACTTGCTGGATTTTGTATCATTTACTATCAATAAAGTTTCCAATGAGATACAGTACAAGTGTTCAGGAGCAGGGGATCCCCATACTGTGACTATGGAAGTGTTTAATTTGTTATCAAGCTGGCCATGGGATGCTAAGGTGGTGCTGGCCTTGGCTGCATTTGCCATCAACTATGGAGAGTTTTGGCTATTGGTTCAACAATCCTCAACAGACTTACTCGCCAAAGACATCTCGCTCCTCAAAAAACTCCCAGAAATATTCGAGAGGGTCGACATTGTGAAGCAAAAATTTGAAGCACTTGACAAACTCATCAAGGCACTCGTGGATGTAGCCAAGTGCATTGTTGATTGCAAGATGCTTCCTCCCCATTACATTACTCCAGACACGCCTGAAATGAAGAGTGCAACCACTCTTATTCCAACAGCTATTTATTGGACAATCAGAAGCATTGTCGCCTGTGCTGCACAGAATGCAGGCCTTATTGGAGTTGGCCATGAGTATTTAGCATCAGCATCTGAAACATGGGAGCTGTCTAGTTTGGCCCATAAGATCGACAACATCCGCAAGCACCTTGAACAACTGCTTCTTGCTTGTCATCATTACATAAATGAGAAGATGCATCATGAAGCATATATGAACCTGGTCCGCCTTTTCGAGATACCCCACATTGACAACAACAAGATTCTGAGGGCTTTGATTTACTCCAAGGATGATAAGCCACCCCTCATCGATGGTTTAAGCAAGGAAAAGGCTACCCTCGAAGTTCTAAGAAAGAAAAACGTGCTGCTTCTCATCTCTGACCTGGACATATCGATAGTGGAGCTTTCAATGCTAGACCAAATCTATAGAGAATCAAGACACAACAAAACAAGAGCAGAGAGCGATTACGAGGTGGTGTGGATGCCAATTGTGGAGCCTACATGGACAGAAGAGAAACAAGTGAAATTCGAAGCGTTGTTGGGTTTGATGCCATGGTACTCGGTAGCACATCCTTCACTAATCGAATCCGCCGTCGTTAAGTACGTGAGACAGGTATGGAACTTCATAAAAAAGCCTCTATTGGTGGTTTTGGACCCTCAAGGCAAAGTGGTTAATACCAACGCCGTCCATATGCTCTGGATTTGGGGAAGCTTGGCCTACCCTTTCACAAGCGCTCGAGAGGAATCACTTTGGAAAGAAGAGACTTGGCAACTTGAGCTTTTAGTCGATTCAGTCGAACCTCTCATCTTCCAATGGAAGGAAACAGGGAAATACATTTGCATTATTGGAGGGGAAGATTTGGGATGGATAAGAAGCTTCAGCTCAAAGGCAAAATCAGTAGCCAATGATGCAGGGATAGAGCTGGAGATACTGTACGTGGGGAAGAGCAACCCTGGGGAGAAAATAAGGAAGAACATAGCCGCAATCTTAGCAGATAAAATAATTCATACACTGGTAGATCCAACCCTCATTTGGTTCTTCTGGGTGAGGCTAGAAAGCATGTGGTACTCAAAAACACAAAGAGGAAACACAATTGAAGATGATCCAGTAATGCAAGAGACGATGACGATGTTGAGTTTCGACAGTGGAGACCAGGGATGGGCCTTGTTCTGCAAAGGCTCAACCGACATCCTTCGAGCCAAAGCCGAGACCATAACCAATGTGGTGGATGGTTATGAAGAGCGTTGGAAGATCCATGTGAAGGAGGAAGGATTTATACCTGCTATGAGTAAAGACCTGGAACATATCCATACTCC
TGAGCATTGCAACCGTCTGATTCTTCCTTCTTCCAATGGCACCATTCCAGAGAAGGTGGTTTGTTCTGAATGTGGTAGTGCCATGGAAAAGTTCATCATGTATCGCTGCTGCAACGACTAA

Coding sequence (CDS)

ATGGCTACTGGAGTACGTAAGTTGAGTCTCATCAAACCAGACCGTCAGCTATTCGCAGCAGGTGACGAAAATGCCTTGACAAAGCAAGTTTTGGCCACGCATTCTGAGGAACCCCTCGAGTTTCCTGTCACCCCTTTGCTCAGCCTCGTTGAACAAATTTTCCTTCGAGCTAAACTCAATACTCACCAGGCACATAAATATGGAACAACTCGAGCTCAGCTGGAGGCAATTGAAGACAATTCCCCAAGCCCAACAGACTTGCTGGACTTGCTGGATTTTGTATCATTTACTATCAATAAAGTTTCCAATGAGATACAGTACAAGTGTTCAGGAGCAGGGGATCCCCATACTGTGACTATGGAAGTGTTTAATTTGTTATCAAGCTGGCCATGGGATGCTAAGGTGGTGCTGGCCTTGGCTGCATTTGCCATCAACTATGGAGAGTTTTGGCTATTGGTTCAACAATCCTCAACAGACTTACTCGCCAAAGACATCTCGCTCCTCAAAAAACTCCCAGAAATATTCGAGAGGGTCGACATTGTGAAGCAAAAATTTGAAGCACTTGACAAACTCATCAAGGCACTCGTGGATGTAGCCAAGTGCATTGTTGATTGCAAGATGCTTCCTCCCCATTACATTACTCCAGACACGCCTGAAATGAAGAGTGCAACCACTCTTATTCCAACAGCTATTTATTGGACAATCAGAAGCATTGTCGCCTGTGCTGCACAGAATGCAGGCCTTATTGGAGTTGGCCATGAGTATTTAGCATCAGCATCTGAAACATGGGAGCTGTCTAGTTTGGCCCATAAGATCGACAACATCCGCAAGCACCTTGAACAACTGCTTCTTGCTTGTCATCATTACATAAATGAGAAGATGCATCATGAAGCATATATGAACCTGGTCCGCCTTTTCGAGATACCCCACATTGACAACAACAAGATTCTGAGGGCTTTGATTTACTCCAAGGATGATAAGCCACCCCTCATCGATGGTTTAAGCAAGGAAAAGGCTACCCTCGAAGTTCTAAGAAAGAAAAACGTGCTGCTTCTCATCTCTGACCTGGACATATCGATAGTGGAGCTTTCAATGCTAGACCAAATCTATAGAGAATCAAGACACAACAAAACAAGAGCAGAGAGCGATTACGAGGTGGTGTGGATGCCAATTGTGGAGCCTACATGGACAGAAGAGAAACAAGTGAAATTCGAAGCGTTGTTGGGTTTGATGCCATGGTACTCGGTAGCACATCCTTCACTAATCGAATCCGCCGTCGTTAAGTACGTGAGACAGGTATGGAACTTCATAAAAAAGCCTCTATTGGTGGTTTTGGACCCTCAAGGCAAAGTGGTTAATACCAACGCCGTCCATATGCTCTGGATTTGGGGAAGCTTGGCCTACCCTTTCACAAGCGCTCGAGAGGAATCACTTTGGAAAGAAGAGACTTGGCAACTTGAGCTTTTAGTCGATTCAGTCGAACCTCTCATCTTCCAATGGAAGGAAACAGGGAAATACATTTGCATTATTGGAGGGGAAGATTTGGGATGGATAAGAAGCTTCAGCTCAAAGGCAAAATCAGTAGCCAATGATGCAGGGATAGAGCTGGAGATACTGTACGTGGGGAAGAGCAACCCTGGGGAGAAAATAAGGAAGAACATAGCCGCAATCTTAGCAGATAAAATAATTCATACACTGGTAGATCCAACCCTCATTTGGTTCTTCTGGGTGAGGCTAGAAAGCATGTGGTACTCAAAAACACAAAGAGGAAACACAATTGAAGATGATCCAGTAATGCAAGAGACGATGACGATGTTGAGTTTCGACAGTGGAGACCAGGGATGGGCCTTGTTCTGCAAAGGCTCAACCGACATCCTTCGAGCCAAAGCCGAGACCATAACCAATGTGGTGGATGGTTATGAAGAGCGTTGGAAGATCCATGTGAAGGAGGAAGGATTTATACCTGCTATGAGTAAAGACCTGGAACATATCCATACTCC
TGAGCATTGCAACCGTCTGATTCTTCCTTCTTCCAATGGCACCATTCCAGAGAAGGTGGTTTGTTCTGAATGTGGTAGTGCCATGGAAAAGTTCATCATGTATCGCTGCTGCAACGACTAA

Protein sequence

MATGVRKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAHKYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
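
The protein sequence follows from the CDS by reading it codon by codon with the standard genetic code. The sketch below illustrates this on the first 24 and the last 15 nucleotides of the CDS shown above. It assumes the standard nuclear code applies, and `translate` is an illustrative helper rather than the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCTACTGGAGTACGTAAGTTG"  # first 24 nt of the CDS
cds_end = "TGCTGCAACGACTAA"             # last 15 nt of the CDS

print(translate(cds_start))  # MATGVRKL
print(translate(cds_end))    # CCND*
```

The two fragments reproduce the first eight and the last four residues of the protein sequence, followed by the stop codon.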
Homology
BLAST of HG10005098 vs. NCBI nr
Match: KAA0061050.1 (protein SIEVE ELEMENT OCCLUSION B [Cucumis melo var. makuwa])

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 670/701 (95.58%), Positives = 684/701 (97.57%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           RKLSLIKPDRQLFA GDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQA+
Sbjct: 7   RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAY 66

Query: 66  KYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNL 125
             G TRAQLEAIED SPSPTDLLDLLDFVSFTIN+VSNEIQYKCSGAGDPHTVTMEVFNL
Sbjct: 67  TCGATRAQLEAIEDKSPSPTDLLDLLDFVSFTINRVSNEIQYKCSGAGDPHTVTMEVFNL 126

Query: 126 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 185
           LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF
Sbjct: 127 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 186

Query: 186 EALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 245
           EALDKLIK+LVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN
Sbjct: 187 EALDKLIKSLVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 246

Query: 246 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 305
           AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL
Sbjct: 247 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 306

Query: 306 FEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSM 365
           FEIPHIDNNKILRALIYSKDDKPPL+DGLSKEKATLEVLRKKNVLLLISDLD+SIVELSM
Sbjct: 307 FEIPHIDNNKILRALIYSKDDKPPLVDGLSKEKATLEVLRKKNVLLLISDLDLSIVELSM 366

Query: 366 LDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESA 425
           LDQIYRESR NKTR ESDYEVVWMPIVEP WTEEKQVKFEALLGLMPWYSVAHPSLIESA
Sbjct: 367 LDQIYRESRQNKTRTESDYEVVWMPIVEPPWTEEKQVKFEALLGLMPWYSVAHPSLIESA 426

Query: 426 VVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQ 485
           V+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW+
Sbjct: 427 VIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWR 486

Query: 486 LELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSN 545
           LELLVDSVEPLIFQW ETGKYICI+GGEDL WIR FS+KA  VA DAGI LEILYVGKSN
Sbjct: 487 LELLVDSVEPLIFQWMETGKYICILGGEDLAWIRGFSAKALGVAKDAGINLEILYVGKSN 546

Query: 546 PGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 605
           PGEKI+KNIAAILADK+IHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM
Sbjct: 547 PGEKIKKNIAAILADKVIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 606

Query: 606 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIH 665
           LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWK+HV+EEGFIPAMSKDL+ IH
Sbjct: 607 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKVHVQEEGFIPAMSKDLQDIH 666

Query: 666 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 667 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
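
The percentages on each HSP statistics line are the given counts divided by the alignment length. The sketch below recomputes them from the line itself, using the first hit as input. `hsp_percentages` is a hypothetical helper, and the regular expression assumes the `label = n/m` layout used on this page.

```python
import re

def hsp_percentages(stats_line):
    """Return {label: percent} for every 'label = n/m' pair on the line."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        out[label] = round(100 * int(num) / int(den), 2)
    return out

line = "Identity = 670/701 (95.58%), Positives = 684/701 (97.57%), Query Frame = 0"
print(hsp_percentages(line))
```

The `Query Frame = 0` field has no denominator, so the pattern skips it.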

BLAST of HG10005098 vs. NCBI nr
Match: XP_008444389.1 (PREDICTED: protein SIEVE ELEMENT OCCLUSION B [Cucumis melo])

HSP 1 Score: 1356.3 bits (3509), Expect = 0.0e+00
Identity = 669/701 (95.44%), Positives = 682/701 (97.29%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           RKLSLIKPDRQLFA GDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQ  
Sbjct: 7   RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQ-- 66

Query: 66  KYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNL 125
             G TRAQLEAIED SPSPTDLLDLLDFVSFTIN+VSNEIQYKCSGAGDPHTVTMEVFNL
Sbjct: 67  --GATRAQLEAIEDKSPSPTDLLDLLDFVSFTINRVSNEIQYKCSGAGDPHTVTMEVFNL 126

Query: 126 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 185
           LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF
Sbjct: 127 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 186

Query: 186 EALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 245
           EALDKLIK+LVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN
Sbjct: 187 EALDKLIKSLVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 246

Query: 246 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 305
           AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL
Sbjct: 247 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 306

Query: 306 FEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSM 365
           FEIPHIDNNKILRALIYSKDDKPPL+DGLSKEKATLEVLRKKNVLLLISDLD+SIVELSM
Sbjct: 307 FEIPHIDNNKILRALIYSKDDKPPLVDGLSKEKATLEVLRKKNVLLLISDLDLSIVELSM 366

Query: 366 LDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESA 425
           LDQIYRESR NKTR ESDYEVVWMPIVEP WTEEKQVKFEALLGLMPWYSVAHPSLIESA
Sbjct: 367 LDQIYRESRQNKTRTESDYEVVWMPIVEPPWTEEKQVKFEALLGLMPWYSVAHPSLIESA 426

Query: 426 VVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQ 485
           V+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW+
Sbjct: 427 VIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWR 486

Query: 486 LELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSN 545
           LELLVDSVEPLIFQW ETGKYICI+GGEDL WIR FS+KA  VA DAGI LEILYVGKSN
Sbjct: 487 LELLVDSVEPLIFQWMETGKYICILGGEDLAWIRGFSAKALGVAKDAGINLEILYVGKSN 546

Query: 546 PGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 605
           PGEKI+KNIAAILADK+IHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM
Sbjct: 547 PGEKIKKNIAAILADKVIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 606

Query: 606 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIH 665
           LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWK+HV+EEGFIPAMSKDL+ IH
Sbjct: 607 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKVHVQEEGFIPAMSKDLQDIH 666

Query: 666 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 667 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 703
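
Identity counts like those above can be checked row by row by comparing the Query and Sbjct strings column by column. The sketch below does this for the first row of this hit, which ends in a two-residue gap in the subject. `count_columns` is a hypothetical helper. It does not separate conservative substitutions (the "+" marks on the midline) from other mismatches, because that needs a scoring matrix this page does not name.

```python
def count_columns(query, sbjct):
    """Count identical, mismatched and gapped columns of one alignment row."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identical = mismatched = gaps = 0
    for q, s in zip(query, sbjct):
        if q == "-" or s == "-":
            gaps += 1
        elif q == s:
            identical += 1
        else:
            mismatched += 1  # includes conservative ("+") substitutions
    return identical, mismatched, gaps

query = "RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH"
sbjct = "RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQ--"
print(count_columns(query, sbjct))  # (57, 1, 2)
```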

BLAST of HG10005098 vs. NCBI nr
Match: XP_038884139.1 (protein SIEVE ELEMENT OCCLUSION B-like [Benincasa hispida])

HSP 1 Score: 1347.4 bits (3486), Expect = 0.0e+00
Identity = 669/707 (94.63%), Positives = 684/707 (96.75%), Query Frame = 0

Query: 1   MATGV-RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKL 60
           MAT + RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKL
Sbjct: 1   MATAIPRKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKL 60

Query: 61  NTHQAHKYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVT 120
           N HQ    GTTRAQLEAIEDNSPSP DLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVT
Sbjct: 61  NAHQ----GTTRAQLEAIEDNSPSPADLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVT 120

Query: 121 MEVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVD 180
           MEVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVD
Sbjct: 121 MEVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVD 180

Query: 181 IVKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIV 240
           IVKQKFEALDKLIKALVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIV
Sbjct: 181 IVKQKFEALDKLIKALVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIV 240

Query: 241 ACAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAY 300
           ACAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACH YINEKMHHEAY
Sbjct: 241 ACAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHRYINEKMHHEAY 300

Query: 301 MNLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDIS 360
           MNLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGL KEKATLEVLRKKNVLLLISDLD+S
Sbjct: 301 MNLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGLIKEKATLEVLRKKNVLLLISDLDLS 360

Query: 361 IVELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHP 420
           +VELSMLDQIYRESR NKTR ESDYEVVWMPIV+  WTEEKQVKF+ALLGLMPWYSVAHP
Sbjct: 361 VVELSMLDQIYRESRQNKTRTESDYEVVWMPIVDSPWTEEKQVKFDALLGLMPWYSVAHP 420

Query: 421 SLIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLW 480
           SLIESAV+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLW
Sbjct: 421 SLIESAVIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLW 480

Query: 481 KEETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEIL 540
           KEETW+LELLVDSVEPLIFQW ETGKYICI+GGEDLGWIRSFS+KA  VA DA I LEIL
Sbjct: 481 KEETWRLELLVDSVEPLIFQWMETGKYICILGGEDLGWIRSFSTKALEVAKDAEIALEIL 540

Query: 541 YVGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVM 600
           YVGKSNPGEKI+KNIAAILA+KIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIE+DPVM
Sbjct: 541 YVGKSNPGEKIKKNIAAILAEKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEEDPVM 600

Query: 601 QETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSK 660
           QETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVV GYEERWK+HVK+EGFIPAMSK
Sbjct: 601 QETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVSGYEERWKVHVKDEGFIPAMSK 660

Query: 661 DLEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           DL+ IHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 661 DLQDIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 703

BLAST of HG10005098 vs. NCBI nr
Match: XP_004143056.1 (protein SIEVE ELEMENT OCCLUSION B [Cucumis sativus] >KGN62332.1 hypothetical protein Csa_018749 [Cucumis sativus])

HSP 1 Score: 1342.8 bits (3474), Expect = 0.0e+00
Identity = 664/701 (94.72%), Positives = 679/701 (96.86%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           RKLSLIKPDRQLFA GDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNT Q  
Sbjct: 7   RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTLQ-- 66

Query: 66  KYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNL 125
             GTTRAQLEAIED SPSPTDLLDLLDFVSFTIN+VSNEIQYKCSGAGDPHTVTMEVFNL
Sbjct: 67  --GTTRAQLEAIEDKSPSPTDLLDLLDFVSFTINRVSNEIQYKCSGAGDPHTVTMEVFNL 126

Query: 126 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 185
           LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF
Sbjct: 127 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 186

Query: 186 EALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 245
           EALDKLIK+LVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN
Sbjct: 187 EALDKLIKSLVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 246

Query: 246 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 305
           AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL
Sbjct: 247 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 306

Query: 306 FEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSM 365
           FEIPHIDNNKILRALIYSKDDKPPL+DGLSKEKATLEVLRKKNVLLLISDLD+SIVELSM
Sbjct: 307 FEIPHIDNNKILRALIYSKDDKPPLLDGLSKEKATLEVLRKKNVLLLISDLDLSIVELSM 366

Query: 366 LDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESA 425
           LDQIYRESR NKTR+ESDYEVVWMPIVE  WTE+KQVKFEALLGLMPWYSVAHPSLIESA
Sbjct: 367 LDQIYRESRQNKTRSESDYEVVWMPIVESPWTEDKQVKFEALLGLMPWYSVAHPSLIESA 426

Query: 426 VVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQ 485
           V+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW+
Sbjct: 427 VIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWR 486

Query: 486 LELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSN 545
           LELLVDSVEPLIFQW E GKYICI+GGEDL WIR FS+KA  VA DAGI LEILYVGKSN
Sbjct: 487 LELLVDSVEPLIFQWMEAGKYICILGGEDLAWIRGFSAKALGVAKDAGINLEILYVGKSN 546

Query: 546 PGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 605
           PGEKI+KNIA ILADK+I TLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM
Sbjct: 547 PGEKIKKNIAGILADKMIRTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 606

Query: 606 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIH 665
           LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWK+HVKEEGFIPAM+KDL+ IH
Sbjct: 607 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKVHVKEEGFIPAMTKDLQDIH 666

Query: 666 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 667 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 703

BLAST of HG10005098 vs. NCBI nr
Match: XP_023537424.1 (protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1241.1 bits (3210), Expect = 0.0e+00
Identity = 602/706 (85.27%), Positives = 650/706 (92.07%), Query Frame = 0

Query: 1   MATGVRKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLN 60
           +AT  RK+ L+KPDRQLFA  D+ ALTKQVLATHS+E LEF VTPLL L+EQIFLRAKLN
Sbjct: 3   LATAARKMGLMKPDRQLFAVADDTALTKQVLATHSDETLEFLVTPLLGLIEQIFLRAKLN 62

Query: 61  THQAHKYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTM 120
                K GTT A+LEAIEDNSPSPTDLLDLLDFVSFTI++VSNEIQYKCS AG+PHTVTM
Sbjct: 63  ----DKQGTTGAELEAIEDNSPSPTDLLDLLDFVSFTIHRVSNEIQYKCSRAGEPHTVTM 122

Query: 121 EVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDI 180
           EV NLL++WPWDAK VLALAAF+INYGEFWLLV QSS+DLLAKDISLLKKLPEIFER+DI
Sbjct: 123 EVLNLLTNWPWDAKAVLALAAFSINYGEFWLLVHQSSSDLLAKDISLLKKLPEIFERIDI 182

Query: 181 VKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVA 240
           V+QKF+A+DKLIKAL+ VAKCIVD KMLPPHYITPDTPEMKSATTLIPTA+YW +RSI+A
Sbjct: 183 VRQKFDAIDKLIKALISVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAVYWIVRSIIA 242

Query: 241 CAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYM 300
           CAAQ  GL+GVGHEYLASASETWELSSLAHKIDNIRKHLEQLL ACH YI+EKMHHEAYM
Sbjct: 243 CAAQITGLVGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLQACHQYIHEKMHHEAYM 302

Query: 301 NLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISI 360
           NLVRLFEIPH+DNNKILRALIYSKDDK PLIDG+SKEKATL+VLRKKNVLLLISDLD+S 
Sbjct: 303 NLVRLFEIPHLDNNKILRALIYSKDDKMPLIDGISKEKATLDVLRKKNVLLLISDLDLSA 362

Query: 361 VELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPS 420
           VELSMLDQIYRESR NKTRAESDYEVVWMPIVE  WT+EKQ KFE LL LMPWYSVAHPS
Sbjct: 363 VELSMLDQIYRESRQNKTRAESDYEVVWMPIVESPWTDEKQAKFEGLLNLMPWYSVAHPS 422

Query: 421 LIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK 480
           LIESAV+KY+RQVW+F KKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK
Sbjct: 423 LIESAVIKYIRQVWHFNKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK 482

Query: 481 EETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILY 540
           EETW+LELLVDSVEPLIF W ETGKYICI GGED+ W+RSFS K K VA DAG+E+EILY
Sbjct: 483 EETWRLELLVDSVEPLIFNWMETGKYICICGGEDMEWVRSFSKKVKEVAKDAGVEMEILY 542

Query: 541 VGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQ 600
           VGKSNPGE+IRKNIAAILA+K+IHTL DPTL+WFFWVRLESMWYSKTQRGNTIE+DP+MQ
Sbjct: 543 VGKSNPGERIRKNIAAILAEKMIHTLADPTLVWFFWVRLESMWYSKTQRGNTIEEDPIMQ 602

Query: 601 ETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKD 660
           ETMTMLSFDSGDQGWA+FCKGST I+RAKAE I  V++GYEERWK   KE G IPAMSKD
Sbjct: 603 ETMTMLSFDSGDQGWAVFCKGSTSIIRAKAEMIMKVMEGYEERWKEDAKELGLIPAMSKD 662

Query: 661 LEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           L+ IHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCC D
Sbjct: 663 LQTIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCTD 704

BLAST of HG10005098 vs. ExPASy Swiss-Prot
Match: Q9SS87 (Protein SIEVE ELEMENT OCCLUSION B OS=Arabidopsis thaliana OX=3702 GN=SEOB PE=1 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 6.9e-157
Identity = 305/712 (42.84%), Positives = 430/712 (60.39%), Query Frame = 0

Query: 13  PDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAHKYGTTRA 72
           P   L  + DE+ + K +  THS +  E  V  LLSLVE I  RA L++       T  +
Sbjct: 30  PATGLAMSSDESMMLKLIQQTHSPDAREVQVRGLLSLVEDILDRATLDSED-----TNAS 89

Query: 73  QLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNLLSSWPWD 132
            L    ++    + ++ +LD VS+ I++V+ EI YK     D H +TM VF  LSS+ WD
Sbjct: 90  MLPLPTEDKLMQSSMMSVLDSVSYAIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWD 149

Query: 133 AKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKFEALDKLI 192
            K+VL LAAFA+NYGEFWLLVQ  S + LAK +++LK +P +  RV + +   + L+ LI
Sbjct: 150 GKLVLTLAAFALNYGEFWLLVQFYSKNQLAKSLAMLKLVP-VQNRVTL-ESVSQGLNDLI 209

Query: 193 KALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQNAGLIGVG 252
           + +  V  C+V+   LP  YITPD P++    + IP A+YWTIRS++AC +Q   +  +G
Sbjct: 210 REMKSVTACVVELSELPDRYITPDVPQLSRILSTIPIAVYWTIRSVIACISQINMITAMG 269

Query: 253 HEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRLFEIPHID 312
           HE + +  + WE S LA+K+ NI  HL + L  C+ +I ++   E+   L  LF+  HID
Sbjct: 270 HEMMNTQMDLWETSMLANKLKNIHDHLAETLRLCYRHIEKQRSSESLKVLHSLFDTTHID 329

Query: 313 NNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSMLDQIYRE 372
           N KIL AL++ K    PL DGL+K K  L+VLR+K VLLLISDL+I   ELS+ +QIY E
Sbjct: 330 NMKILTALVHPKPHITPLQDGLTKRKVHLDVLRRKTVLLLISDLNILQDELSIFEQIYTE 389

Query: 373 SRHNKT----RAESDYEVVWMPIVEPTWTEEK----QVKFEALLGLMPWYSVAHPSLIES 432
           SR N      ++   YEVVW+P+V+P    E+    Q KFE L   MPWYSV  P LIE 
Sbjct: 390 SRRNLVGVDGKSHMPYEVVWVPVVDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIER 449

Query: 433 AVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW 492
            VV+++R  W+F+ KP+LVV+DPQG   + NA+HM+WIWG+ A+PFT +REE LW+ ET+
Sbjct: 450 HVVEFMRGRWHFMNKPILVVIDPQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETF 509

Query: 493 QLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKS 552
            L L+VD ++ +IF W +   YI + GG+DL WIR F+  AK+ A D+ + LE+ YVGK 
Sbjct: 510 SLNLIVDGIDSVIFNWIKPDNYIFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKR 569

Query: 553 NPG--EKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQET 612
           N    E+IR+    I ++ + H+  +P L+WFFW RLESM YSK Q G   + D VMQ  
Sbjct: 570 NHSHREQIRRISEVIRSENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADDHDDVMQGI 629

Query: 613 MTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLE 672
             +LS+D    GWAL  KG   ++ A    I   +  Y+  WK HV  +G+  AMS   +
Sbjct: 630 KKILSYDKLG-GWALLSKGPEIVMIAHG-AIERTMSVYDRTWKTHVPTKGYTKAMS---D 689

Query: 673 HIH------TPEHCNR--LILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           H H      T + C      + + +G IPEK+ C EC   MEK++ + CC+D
Sbjct: 690 HHHDEVLRETGKPCGHFDFHITARSGRIPEKMNCFECQRPMEKYMSFSCCHD 729

BLAST of HG10005098 vs. ExPASy Swiss-Prot
Match: Q93XX2 (Protein SIEVE ELEMENT OCCLUSION A OS=Arabidopsis thaliana OX=3702 GN=SEOA PE=1 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 2.2e-123
Identity = 257/730 (35.21%), Positives = 399/730 (54.66%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           +K +  +  R +F+  D+  +  +VL THS + + F VT LLS+V  IF           
Sbjct: 122 KKQAFHRNGRPMFSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVVNDIF----------- 181

Query: 66  KYGTTRAQLEAIEDNSPSPT----DLLDLLDFVSFT--INKVSNEIQYKCSGAGDPH--- 125
                ++ + +I+ ++P P+    D  D   F +F   I+++S EI  KC   G+ H   
Sbjct: 182 -----KSHVPSIDSSAPKPSLVFKDYADHTSFETFADLIDQISCEIDCKCLHGGESHGMM 241

Query: 126 ----------TVTMEVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDIS 185
                     T T  V +L+S + WDAK+VL L+A A+ YG F LL +  +T+ L K ++
Sbjct: 242 TSGLHLDSRNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLA 301

Query: 186 LLKKLPEIFERVDIVKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTL 245
           L+K+LP IF R + + Q+ +    L++ +VD+   I+D   LPP++IT       + T  
Sbjct: 302 LIKQLPSIFSRQNALHQRLDKTRILMQDMVDLTTTIIDIYQLPPNHIT------AAFTDH 361

Query: 246 IPTAIYWTIRSIVACAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLAC 305
           IPTA+YW +R ++ C +  +G  G   + + S  E  E+   + ++  I  +L +     
Sbjct: 362 IPTAVYWIVRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKS 421

Query: 306 HHYINEKMHHEAYMNLVRLF-EIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLR 365
              I E +  E Y  L++ F  I H+D    L  L+   D       G+SK +  + VL 
Sbjct: 422 KMTIEEGIIEEEYQELIQTFTTIIHVDVVPPLLRLLRPIDFLYHGA-GVSKRRVGINVLT 481

Query: 366 KKNVLLLISDLDISIVELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFE 425
           +K+VLLLISDL+    EL +L+ +Y E+       +  +E++W+P V+  WTE    KFE
Sbjct: 482 QKHVLLLISDLENIEKELYILESLYTEA------WQQSFEILWVP-VQDFWTEADDAKFE 541

Query: 426 ALLGLMPWYSVAHPSLIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGS 485
           AL   M WY +  P  +  A +++VR+ W F  +P+LV LDP+G+V++TNA  M+WIW  
Sbjct: 542 ALHMNMRWYVLGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQP 601

Query: 486 LAYPFTSAREESLWKEETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKA 545
            A+PFT+ARE  LW E+ W LE L+D  +P        GKYIC+ GGED+ WI++F+S  
Sbjct: 602 FAHPFTTARERDLWSEQEWNLEFLIDGTDPHSLNQLVDGKYICLYGGEDMQWIKNFTSLW 661

Query: 546 KSVANDAGIELEILYVGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYS 605
           ++VA  A I+LE++YVGK NP   I+  I  I  + + HTL D   IWFFW R+ESMW S
Sbjct: 662 RNVAKAANIQLEMVYVGKRNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWES 721

Query: 606 KTQ--RGNTI---------EDDPVMQETMTMLSFDSGDQGWALFCKGSTDILRAKAETIT 665
           K +  + + I         E D V+QE + ML +     GW L  K S  ++RAK    +
Sbjct: 722 KQRMLKAHGIKGREGFKEEEKDLVLQEVVAMLGYGGEGDGWGLVSKASDMMVRAKGNLFS 781

Query: 666 NVVDGYEERWKIHVKEEGFIPAMSKDLEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSA 705
             +  + E W++++  +GF+ A++  L     P HC R +LP + G IP +V C+EC   
Sbjct: 782 RGLAEFNE-WEVNIPTKGFLTALNDHLLMRLPPHHCTRFMLPETAGIIPNEVECTECRRT 820

BLAST of HG10005098 vs. ExPASy Swiss-Prot
Match: Q9FXE2 (Protein SIEVE ELEMENT OCCLUSION C OS=Arabidopsis thaliana OX=3702 GN=SEOC PE=4 SV=2)

HSP 1 Score: 262.7 bits (670), Expect = 1.2e-68
Identity = 208/715 (29.09%), Positives = 331/715 (46.29%), Query Frame = 0

Query: 15  RQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAHKYGTTRAQL 74
           R+  +A +E+ + +Q+L +H  +        LL  VE I      N              
Sbjct: 4   RRDISALNEDIIVEQLLRSHDPDGRWLDSEMLLQEVETILSFVLQND----------VSR 63

Query: 75  EAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNLLSSWPWDAK 134
             + +N  +  ++ D  + + + I ++S ++   C+G  +    TM +F+LL  + WDAK
Sbjct: 64  PLLTENCITTIEVFDSKETLPYAIFRISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAK 123

Query: 135 VVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKFEALDKLIKA 194
            VL L   A  YG   L V  +  D +A  I+ L +LP   ER    +   E+L+ LIKA
Sbjct: 124 AVLVLGVLAATYGGLLLPVHLAICDPVAASIAKLNQLP--IERTKF-RPWLESLNLLIKA 183

Query: 195 LVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQ---------- 254
           +VDV KCI+  + +P      D   +    + I    Y  ++S + C  Q          
Sbjct: 184 MVDVTKCIIKFEKIPFKQAKLDNNILGETLSNIYLTTYRVVKSALTCMQQIPYFKQTQQA 243

Query: 255 NAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVR 314
                      + S     ELSSL +++ NI   L + +  C   I E+++      L  
Sbjct: 244 KKSRKTAAELSIESRRAAGELSSLGYQLLNIHTRLNKQVEDCSTQIEEEIN----QRLRN 303

Query: 315 LFEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELS 374
           +    H DN  +L  L   +DD P  +   S++ +  EV + K  LLL+S   +  +   
Sbjct: 304 INIETHQDNQDVLHLLFSLQDDLP--LQQYSRQISITEV-QDKVTLLLLSKPPVEPL-FF 363

Query: 375 MLDQIYRESRHNKTRAESDYEVVWMPI-VEPTWTEEKQVKFEALLGLMPWYSVAHPSLIE 434
           +L Q+Y     + T  E +YE++W+PI     WT+E++  F+     +PW SV  P L+ 
Sbjct: 364 LLQQLY--DHPSNTNTEQNYEIIWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMS 423

Query: 435 SAVVKYVRQVWNF-IKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEE 494
           S ++ + +Q W++   + +LVV+D  G+ VN NA+ M+ IWG  AYPF+ +RE+ LWKE 
Sbjct: 424 STILNFFKQEWHYKDNEAMLVVIDSNGRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEH 483

Query: 495 TWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVG 554
            W + LL+D + P        G+ ICI G E+L WI  F S A+ + N  G +LE++Y+ 
Sbjct: 484 GWSINLLLDGIHPTF-----EGREICIFGSENLDWIDEFVSLARKIQN-LGFQLELIYLS 543

Query: 555 KSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQR--GNTIEDDPVMQ 614
                E+  +  +          L  PTL   FW+RLES+  SK +R      + D V +
Sbjct: 544 NQRRDERAMEESS---------ILFSPTLQQLFWLRLESIERSKLKRIVIEPSKPDRVFE 603

Query: 615 ETMTMLSFDSG-DQGWALFCKGSTDILRAKAETITNVVDGYE--------ERWKIHVKEE 674
           E   +L FD G  +GW +   GST      AET    VDG +         RW  + K  
Sbjct: 604 EVRNLLDFDYGKHRGWGIIGNGST------AET----VDGEKMTERMRKIVRWGEYAKGL 663

Query: 675 GFIPAM----SKDLEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYR 703
           GF  A+     K  E  HT       ++P       + V C +C   M++F+ Y+
Sbjct: 664 GFTEAIEIAAEKPCELSHT------AVVPFEEALTMKVVTCEKCKWPMKRFVAYQ 664

BLAST of HG10005098 vs. ExPASy Swiss-Prot
Match: Q7XPE8 (Probable nucleoredoxin 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0608600 PE=3 SV=2)

HSP 1 Score: 50.1 bits (118), Expect = 1.2e-04
Identity = 26/88 (29.55%), Positives = 49/88 (55.68%), Query Frame = 0

Query: 398 EEKQVKFEALLGLMPWYSVAHPSLIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAV 457
           +  + +F+A L  MPW+++ +        V+ + +++     P L++L P GKV  T+  
Sbjct: 249 DRNEEEFQASLSAMPWFAIPY----SDTTVQELSRIFTIKGIPTLLILGPDGKVFKTDGR 308

Query: 458 HMLWIWGSLAYPFTSAR----EESLWKE 482
            ++  +G++A+PFT +R    EE L KE
Sbjct: 309 RIISKYGAMAFPFTESRAYELEEVLKKE 332

BLAST of HG10005098 vs. ExPASy TrEMBL
Match: A0A5A7V141 (Protein SIEVE ELEMENT OCCLUSION B OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold501G001880 PE=4 SV=1)

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 670/701 (95.58%), Positives = 684/701 (97.57%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           RKLSLIKPDRQLFA GDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQA+
Sbjct: 7   RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAY 66

Query: 66  KYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNL 125
             G TRAQLEAIED SPSPTDLLDLLDFVSFTIN+VSNEIQYKCSGAGDPHTVTMEVFNL
Sbjct: 67  TCGATRAQLEAIEDKSPSPTDLLDLLDFVSFTINRVSNEIQYKCSGAGDPHTVTMEVFNL 126

Query: 126 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 185
           LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF
Sbjct: 127 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 186

Query: 186 EALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 245
           EALDKLIK+LVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN
Sbjct: 187 EALDKLIKSLVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 246

Query: 246 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 305
           AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL
Sbjct: 247 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 306

Query: 306 FEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSM 365
           FEIPHIDNNKILRALIYSKDDKPPL+DGLSKEKATLEVLRKKNVLLLISDLD+SIVELSM
Sbjct: 307 FEIPHIDNNKILRALIYSKDDKPPLVDGLSKEKATLEVLRKKNVLLLISDLDLSIVELSM 366

Query: 366 LDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESA 425
           LDQIYRESR NKTR ESDYEVVWMPIVEP WTEEKQVKFEALLGLMPWYSVAHPSLIESA
Sbjct: 367 LDQIYRESRQNKTRTESDYEVVWMPIVEPPWTEEKQVKFEALLGLMPWYSVAHPSLIESA 426

Query: 426 VVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQ 485
           V+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW+
Sbjct: 427 VIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWR 486

Query: 486 LELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSN 545
           LELLVDSVEPLIFQW ETGKYICI+GGEDL WIR FS+KA  VA DAGI LEILYVGKSN
Sbjct: 487 LELLVDSVEPLIFQWMETGKYICILGGEDLAWIRGFSAKALGVAKDAGINLEILYVGKSN 546

Query: 546 PGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 605
           PGEKI+KNIAAILADK+IHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM
Sbjct: 547 PGEKIKKNIAAILADKVIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 606

Query: 606 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIH 665
           LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWK+HV+EEGFIPAMSKDL+ IH
Sbjct: 607 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKVHVQEEGFIPAMSKDLQDIH 666

Query: 666 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 667 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707

BLAST of HG10005098 vs. ExPASy TrEMBL
Match: A0A1S3B9Q6 (protein SIEVE ELEMENT OCCLUSION B OS=Cucumis melo OX=3656 GN=LOC103487729 PE=4 SV=1)

HSP 1 Score: 1356.3 bits (3509), Expect = 0.0e+00
Identity = 669/701 (95.44%), Positives = 682/701 (97.29%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           RKLSLIKPDRQLFA GDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQ  
Sbjct: 7   RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQ-- 66

Query: 66  KYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNL 125
             G TRAQLEAIED SPSPTDLLDLLDFVSFTIN+VSNEIQYKCSGAGDPHTVTMEVFNL
Sbjct: 67  --GATRAQLEAIEDKSPSPTDLLDLLDFVSFTINRVSNEIQYKCSGAGDPHTVTMEVFNL 126

Query: 126 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 185
           LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF
Sbjct: 127 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 186

Query: 186 EALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 245
           EALDKLIK+LVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN
Sbjct: 187 EALDKLIKSLVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 246

Query: 246 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 305
           AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL
Sbjct: 247 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 306

Query: 306 FEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSM 365
           FEIPHIDNNKILRALIYSKDDKPPL+DGLSKEKATLEVLRKKNVLLLISDLD+SIVELSM
Sbjct: 307 FEIPHIDNNKILRALIYSKDDKPPLVDGLSKEKATLEVLRKKNVLLLISDLDLSIVELSM 366

Query: 366 LDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESA 425
           LDQIYRESR NKTR ESDYEVVWMPIVEP WTEEKQVKFEALLGLMPWYSVAHPSLIESA
Sbjct: 367 LDQIYRESRQNKTRTESDYEVVWMPIVEPPWTEEKQVKFEALLGLMPWYSVAHPSLIESA 426

Query: 426 VVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQ 485
           V+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW+
Sbjct: 427 VIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWR 486

Query: 486 LELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSN 545
           LELLVDSVEPLIFQW ETGKYICI+GGEDL WIR FS+KA  VA DAGI LEILYVGKSN
Sbjct: 487 LELLVDSVEPLIFQWMETGKYICILGGEDLAWIRGFSAKALGVAKDAGINLEILYVGKSN 546

Query: 546 PGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 605
           PGEKI+KNIAAILADK+IHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM
Sbjct: 547 PGEKIKKNIAAILADKVIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 606

Query: 606 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIH 665
           LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWK+HV+EEGFIPAMSKDL+ IH
Sbjct: 607 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKVHVQEEGFIPAMSKDLQDIH 666

Query: 666 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 667 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 703

BLAST of HG10005098 vs. ExPASy TrEMBL
Match: A0A0A0LNE1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G349660 PE=4 SV=1)

HSP 1 Score: 1342.8 bits (3474), Expect = 0.0e+00
Identity = 664/701 (94.72%), Positives = 679/701 (96.86%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           RKLSLIKPDRQLFA GDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNT Q  
Sbjct: 7   RKLSLIKPDRQLFAGGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTLQ-- 66

Query: 66  KYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNL 125
             GTTRAQLEAIED SPSPTDLLDLLDFVSFTIN+VSNEIQYKCSGAGDPHTVTMEVFNL
Sbjct: 67  --GTTRAQLEAIEDKSPSPTDLLDLLDFVSFTINRVSNEIQYKCSGAGDPHTVTMEVFNL 126

Query: 126 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 185
           LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF
Sbjct: 127 LSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKF 186

Query: 186 EALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 245
           EALDKLIK+LVDVAKCIVD KMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN
Sbjct: 187 EALDKLIKSLVDVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQN 246

Query: 246 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 305
           AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL
Sbjct: 247 AGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRL 306

Query: 306 FEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSM 365
           FEIPHIDNNKILRALIYSKDDKPPL+DGLSKEKATLEVLRKKNVLLLISDLD+SIVELSM
Sbjct: 307 FEIPHIDNNKILRALIYSKDDKPPLLDGLSKEKATLEVLRKKNVLLLISDLDLSIVELSM 366

Query: 366 LDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESA 425
           LDQIYRESR NKTR+ESDYEVVWMPIVE  WTE+KQVKFEALLGLMPWYSVAHPSLIESA
Sbjct: 367 LDQIYRESRQNKTRSESDYEVVWMPIVESPWTEDKQVKFEALLGLMPWYSVAHPSLIESA 426

Query: 426 VVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQ 485
           V+KYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW+
Sbjct: 427 VIKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWR 486

Query: 486 LELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSN 545
           LELLVDSVEPLIFQW E GKYICI+GGEDL WIR FS+KA  VA DAGI LEILYVGKSN
Sbjct: 487 LELLVDSVEPLIFQWMEAGKYICILGGEDLAWIRGFSAKALGVAKDAGINLEILYVGKSN 546

Query: 546 PGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 605
           PGEKI+KNIA ILADK+I TLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM
Sbjct: 547 PGEKIKKNIAGILADKMIRTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQETMTM 606

Query: 606 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLEHIH 665
           LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWK+HVKEEGFIPAM+KDL+ IH
Sbjct: 607 LSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKVHVKEEGFIPAMTKDLQDIH 666

Query: 666 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND
Sbjct: 667 TPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 703

BLAST of HG10005098 vs. ExPASy TrEMBL
Match: A0A6J1KKF1 (protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita maxima OX=3661 GN=LOC111496078 PE=4 SV=1)

HSP 1 Score: 1239.6 bits (3206), Expect = 0.0e+00
Identity = 602/706 (85.27%), Positives = 649/706 (91.93%), Query Frame = 0

Query: 1   MATGVRKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLN 60
           +AT  RK+ L+KPDRQLFA  D+ ALTKQVLATHS+E LEF VTPLL L+EQIFLRAKLN
Sbjct: 3   LATAARKMGLMKPDRQLFAVADDTALTKQVLATHSDETLEFLVTPLLGLIEQIFLRAKLN 62

Query: 61  THQAHKYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTM 120
                K GTT A+LEAIEDNSPSPTDLLDLLDFVSFTI++VSNEIQYKCS AG+PHTVTM
Sbjct: 63  ----DKQGTTGAELEAIEDNSPSPTDLLDLLDFVSFTIHRVSNEIQYKCSRAGEPHTVTM 122

Query: 121 EVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDI 180
           EV NLL++WPWDAK VLALAAF+INYGEFWLLV QSS+DLLAKDISLLKKLPEIFER+DI
Sbjct: 123 EVLNLLTNWPWDAKAVLALAAFSINYGEFWLLVHQSSSDLLAKDISLLKKLPEIFERIDI 182

Query: 181 VKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVA 240
           V+QKF+A+DKLIKAL+ VAKCIVD KMLPPHYITPDTPEMKSATTLIPTA+YW +RSI+A
Sbjct: 183 VRQKFDAIDKLIKALISVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAVYWIVRSIIA 242

Query: 241 CAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYM 300
           CAAQ  GL+GVGHEYLASASETWELSSLAHKIDNIRKHLEQLL ACH YI+EKMHHEAYM
Sbjct: 243 CAAQITGLVGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLRACHQYIHEKMHHEAYM 302

Query: 301 NLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISI 360
           NLVRLFEIPH+DNNKILRALIYSKDDK PLIDG+SKEKATL+VLRKKNVLLLISDLD+S+
Sbjct: 303 NLVRLFEIPHLDNNKILRALIYSKDDKMPLIDGISKEKATLDVLRKKNVLLLISDLDLSV 362

Query: 361 VELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPS 420
           VELSMLDQIYRESR NKTRAESDYEVVWMPIVE  WT+EK  KFE LL LMPWYSVAHPS
Sbjct: 363 VELSMLDQIYRESRQNKTRAESDYEVVWMPIVESPWTDEKHAKFEGLLNLMPWYSVAHPS 422

Query: 421 LIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK 480
           LIESAV+KY+RQVW+F KKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK
Sbjct: 423 LIESAVIKYIRQVWHFNKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK 482

Query: 481 EETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILY 540
           EETW+LELLVDSVEPLIF W ETGKYICI GGED+ W+RSFS K K VANDA IE+EILY
Sbjct: 483 EETWRLELLVDSVEPLIFNWMETGKYICICGGEDMEWVRSFSKKVKEVANDAKIEMEILY 542

Query: 541 VGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQ 600
           VGKSNPGE+IRKNIAAILA+K IHTL DPTL+WFFWVRLESMWYSKTQRGNTIE+DP+MQ
Sbjct: 543 VGKSNPGERIRKNIAAILAEKTIHTLADPTLVWFFWVRLESMWYSKTQRGNTIEEDPIMQ 602

Query: 601 ETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKD 660
           ETMTMLSFDSGDQGWA+FCKGST I+RAKAE I  V++GYEERWK   KE G IPAMSKD
Sbjct: 603 ETMTMLSFDSGDQGWAVFCKGSTSIIRAKAEMIMKVMEGYEERWKEDAKELGLIPAMSKD 662

Query: 661 LEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           L+ IHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCC D
Sbjct: 663 LQAIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCTD 704

BLAST of HG10005098 vs. ExPASy TrEMBL
Match: A0A6J1GIV3 (protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita moschata OX=3662 GN=LOC111454275 PE=4 SV=1)

HSP 1 Score: 1238.0 bits (3202), Expect = 0.0e+00
Identity = 601/706 (85.13%), Positives = 649/706 (91.93%), Query Frame = 0

Query: 1   MATGVRKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLN 60
           +AT  RK+ L+KPDRQLFA  D+ ALTKQVLATHS+E LEF VTPLL L+EQIFLRAKLN
Sbjct: 3   LATAARKMGLMKPDRQLFAVADDTALTKQVLATHSDETLEFLVTPLLGLIEQIFLRAKLN 62

Query: 61  THQAHKYGTTRAQLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTM 120
                K GTT A+LEAIEDNSPSPTDLLDLLDFVSFTI++VSNEIQYKCS AG+PHTVTM
Sbjct: 63  ----DKQGTTGAELEAIEDNSPSPTDLLDLLDFVSFTIHRVSNEIQYKCSRAGEPHTVTM 122

Query: 121 EVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDI 180
           EV NLL++WPWDAK VLALAAF+INYGEFWLLV QSS+DLLAKDISLLKKLPEIFER+DI
Sbjct: 123 EVLNLLTNWPWDAKAVLALAAFSINYGEFWLLVHQSSSDLLAKDISLLKKLPEIFERIDI 182

Query: 181 VKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVA 240
           V+QKF+A+DKLIKAL+ VAKCIVD KMLPPHYITPDTPEMKSATTLIPTA+YW +RSI+A
Sbjct: 183 VRQKFDAIDKLIKALISVAKCIVDFKMLPPHYITPDTPEMKSATTLIPTAVYWIVRSIIA 242

Query: 241 CAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYM 300
           CAAQ  GL+GVGHEYLASASETWELSSLAHKIDNIRKHLEQLL ACH YI+EKMHHEAYM
Sbjct: 243 CAAQITGLVGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLQACHQYIHEKMHHEAYM 302

Query: 301 NLVRLFEIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISI 360
           NLVRLFEIPH+DNNKILRALIYSKDDK PLIDG+SKEKATL+VLRKKNVLLLISDLD+S 
Sbjct: 303 NLVRLFEIPHLDNNKILRALIYSKDDKMPLIDGISKEKATLDVLRKKNVLLLISDLDLSA 362

Query: 361 VELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPS 420
           VELSMLDQIYRESR NKTRAESDYEVVWMPIVE  WT+EKQ KFE LL LMPWYSVAHPS
Sbjct: 363 VELSMLDQIYRESRQNKTRAESDYEVVWMPIVESPWTDEKQAKFEGLLNLMPWYSVAHPS 422

Query: 421 LIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK 480
           LIESAV+KY+RQVW+F KKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK
Sbjct: 423 LIESAVIKYIRQVWHFNKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWK 482

Query: 481 EETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILY 540
           EETW+LELLVDSVEPLIF W ETGKYICI GGED+ W+RSFS K K VANDA +E+EILY
Sbjct: 483 EETWRLELLVDSVEPLIFNWMETGKYICICGGEDMEWVRSFSKKVKEVANDAEVEMEILY 542

Query: 541 VGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQ 600
           VGKSNPGE+IRKNIAAILA+K IHTL DPTL+WFFWVRLESMWYSKTQRGNTIE+DP+MQ
Sbjct: 543 VGKSNPGERIRKNIAAILAEKTIHTLADPTLVWFFWVRLESMWYSKTQRGNTIEEDPIMQ 602

Query: 601 ETMTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKD 660
           ETMTMLSFDSGDQGWA+FCKGST I+RAKAE I  V++GYE+RWK   KE G IPAMSKD
Sbjct: 603 ETMTMLSFDSGDQGWAVFCKGSTSIIRAKAEMIMKVMEGYEKRWKDDAKELGLIPAMSKD 662

Query: 661 LEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           L+ IHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCC D
Sbjct: 663 LQTIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYRCCTD 704

BLAST of HG10005098 vs. TAIR 10
Match: AT3G01680.1 (CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 555.8 bits (1431), Expect = 4.9e-158
Identity = 305/712 (42.84%), Positives = 430/712 (60.39%), Query Frame = 0

Query: 13  PDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAHKYGTTRA 72
           P   L  + DE+ + K +  THS +  E  V  LLSLVE I  RA L++       T  +
Sbjct: 30  PATGLAMSSDESMMLKLIQQTHSPDAREVQVRGLLSLVEDILDRATLDSED-----TNAS 89

Query: 73  QLEAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNLLSSWPWD 132
            L    ++    + ++ +LD VS+ I++V+ EI YK     D H +TM VF  LSS+ WD
Sbjct: 90  MLPLPTEDKLMQSSMMSVLDSVSYAIDRVACEIAYKSLTGSDSHEITMSVFEHLSSFQWD 149

Query: 133 AKVVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKFEALDKLI 192
            K+VL LAAFA+NYGEFWLLVQ  S + LAK +++LK +P +  RV + +   + L+ LI
Sbjct: 150 GKLVLTLAAFALNYGEFWLLVQFYSKNQLAKSLAMLKLVP-VQNRVTL-ESVSQGLNDLI 209

Query: 193 KALVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQNAGLIGVG 252
           + +  V  C+V+   LP  YITPD P++    + IP A+YWTIRS++AC +Q   +  +G
Sbjct: 210 REMKSVTACVVELSELPDRYITPDVPQLSRILSTIPIAVYWTIRSVIACISQINMITAMG 269

Query: 253 HEYLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRLFEIPHID 312
           HE + +  + WE S LA+K+ NI  HL + L  C+ +I ++   E+   L  LF+  HID
Sbjct: 270 HEMMNTQMDLWETSMLANKLKNIHDHLAETLRLCYRHIEKQRSSESLKVLHSLFDTTHID 329

Query: 313 NNKILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSMLDQIYRE 372
           N KIL AL++ K    PL DGL+K K  L+VLR+K VLLLISDL+I   ELS+ +QIY E
Sbjct: 330 NMKILTALVHPKPHITPLQDGLTKRKVHLDVLRRKTVLLLISDLNILQDELSIFEQIYTE 389

Query: 373 SRHNKT----RAESDYEVVWMPIVEPTWTEEK----QVKFEALLGLMPWYSVAHPSLIES 432
           SR N      ++   YEVVW+P+V+P    E+    Q KFE L   MPWYSV  P LIE 
Sbjct: 390 SRRNLVGVDGKSHMPYEVVWVPVVDPIEDFERSPILQKKFEDLRDPMPWYSVDSPKLIER 449

Query: 433 AVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETW 492
            VV+++R  W+F+ KP+LVV+DPQG   + NA+HM+WIWG+ A+PFT +REE LW+ ET+
Sbjct: 450 HVVEFMRGRWHFMNKPILVVIDPQGNEASLNALHMIWIWGTEAFPFTRSREEELWRRETF 509

Query: 493 QLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKS 552
            L L+VD ++ +IF W +   YI + GG+DL WIR F+  AK+ A D+ + LE+ YVGK 
Sbjct: 510 SLNLIVDGIDSVIFNWIKPDNYIFLYGGDDLDWIRRFTMAAKATAKDSNVNLEMAYVGKR 569

Query: 553 NPG--EKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQRGNTIEDDPVMQET 612
           N    E+IR+    I ++ + H+  +P L+WFFW RLESM YSK Q G   + D VMQ  
Sbjct: 570 NHSHREQIRRISEVIRSENLSHSWAEPALMWFFWTRLESMLYSKIQLGKADDHDDVMQGI 629

Query: 613 MTMLSFDSGDQGWALFCKGSTDILRAKAETITNVVDGYEERWKIHVKEEGFIPAMSKDLE 672
             +LS+D    GWAL  KG   ++ A    I   +  Y+  WK HV  +G+  AMS   +
Sbjct: 630 KKILSYDKLG-GWALLSKGPEIVMIAHG-AIERTMSVYDRTWKTHVPTKGYTKAMS---D 689

Query: 673 HIH------TPEHCNR--LILPSSNGTIPEKVVCSECGSAMEKFIMYRCCND 707
           H H      T + C      + + +G IPEK+ C EC   MEK++ + CC+D
Sbjct: 690 HHHDEVLRETGKPCGHFDFHITARSGRIPEKMNCFECQRPMEKYMSFSCCHD 729

BLAST of HG10005098 vs. TAIR 10
Match: AT3G01670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 444.5 bits (1142), Expect = 1.6e-124
Identity = 257/730 (35.21%), Positives = 399/730 (54.66%), Query Frame = 0

Query: 6   RKLSLIKPDRQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAH 65
           +K +  +  R +F+  D+  +  +VL THS + + F VT LLS+V  IF           
Sbjct: 122 KKQAFHRNGRPMFSLSDDRVMADRVLKTHSPDMIFFDVTSLLSVVNDIF----------- 181

Query: 66  KYGTTRAQLEAIEDNSPSPT----DLLDLLDFVSFT--INKVSNEIQYKCSGAGDPH--- 125
                ++ + +I+ ++P P+    D  D   F +F   I+++S EI  KC   G+ H   
Sbjct: 182 -----KSHVPSIDSSAPKPSLVFKDYADHTSFETFADLIDQISCEIDCKCLHGGESHGMM 241

Query: 126 ----------TVTMEVFNLLSSWPWDAKVVLALAAFAINYGEFWLLVQQSSTDLLAKDIS 185
                     T T  V +L+S + WDAK+VL L+A A+ YG F LL +  +T+ L K ++
Sbjct: 242 TSGLHLDSRNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLA 301

Query: 186 LLKKLPEIFERVDIVKQKFEALDKLIKALVDVAKCIVDCKMLPPHYITPDTPEMKSATTL 245
           L+K+LP IF R + + Q+ +    L++ +VD+   I+D   LPP++IT       + T  
Sbjct: 302 LIKQLPSIFSRQNALHQRLDKTRILMQDMVDLTTTIIDIYQLPPNHIT------AAFTDH 361

Query: 246 IPTAIYWTIRSIVACAAQNAGLIGVGHEYLASASETWELSSLAHKIDNIRKHLEQLLLAC 305
           IPTA+YW +R ++ C +  +G  G   + + S  E  E+   + ++  I  +L +     
Sbjct: 362 IPTAVYWIVRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKS 421

Query: 306 HHYINEKMHHEAYMNLVRLF-EIPHIDNNKILRALIYSKDDKPPLIDGLSKEKATLEVLR 365
              I E +  E Y  L++ F  I H+D    L  L+   D       G+SK +  + VL 
Sbjct: 422 KMTIEEGIIEEEYQELIQTFTTIIHVDVVPPLLRLLRPIDFLYHGA-GVSKRRVGINVLT 481

Query: 366 KKNVLLLISDLDISIVELSMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFE 425
           +K+VLLLISDL+    EL +L+ +Y E+       +  +E++W+P V+  WTE    KFE
Sbjct: 482 QKHVLLLISDLENIEKELYILESLYTEA------WQQSFEILWVP-VQDFWTEADDAKFE 541

Query: 426 ALLGLMPWYSVAHPSLIESAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGS 485
           AL   M WY +  P  +  A +++VR+ W F  +P+LV LDP+G+V++TNA  M+WIW  
Sbjct: 542 ALHMNMRWYVLGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQP 601

Query: 486 LAYPFTSAREESLWKEETWQLELLVDSVEPLIFQWKETGKYICIIGGEDLGWIRSFSSKA 545
            A+PFT+ARE  LW E+ W LE L+D  +P        GKYIC+ GGED+ WI++F+S  
Sbjct: 602 FAHPFTTARERDLWSEQEWNLEFLIDGTDPHSLNQLVDGKYICLYGGEDMQWIKNFTSLW 661

Query: 546 KSVANDAGIELEILYVGKSNPGEKIRKNIAAILADKIIHTLVDPTLIWFFWVRLESMWYS 605
           ++VA  A I+LE++YVGK NP   I+  I  I  + + HTL D   IWFFW R+ESMW S
Sbjct: 662 RNVAKAANIQLEMVYVGKRNPKNGIQPIINTIREENLSHTLPDLFQIWFFWTRVESMWES 721

Query: 606 KTQ--RGNTI---------EDDPVMQETMTMLSFDSGDQGWALFCKGSTDILRAKAETIT 665
           K +  + + I         E D V+QE + ML +     GW L  K S  ++RAK    +
Sbjct: 722 KQRMLKAHGIKGREGFKEEEKDLVLQEVVAMLGYGGEGDGWGLVSKASDMMVRAKGNLFS 781

Query: 666 NVVDGYEERWKIHVKEEGFIPAMSKDLEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSA 705
             +  + E W++++  +GF+ A++  L     P HC R +LP + G IP +V C+EC   
Sbjct: 782 RGLAEFNE-WEVNIPTKGFLTALNDHLLMRLPPHHCTRFMLPETAGIIPNEVECTECRRT 820

BLAST of HG10005098 vs. TAIR 10
Match: AT1G67790.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 222.2 bits (565), Expect = 1.3e-57
Identity = 186/705 (26.38%), Positives = 299/705 (42.41%), Query Frame = 0

Query: 15  RQLFAAGDENALTKQVLATHSEEPLEFPVTPLLSLVEQIFLRAKLNTHQAHKYGTTRAQL 74
           R+  +A +E+ + +Q+L +H  +        LL  VE I      N              
Sbjct: 4   RRDISALNEDIIVEQLLRSHDPDGRWLDSEMLLQEVETILSFVLQND----------VSR 63

Query: 75  EAIEDNSPSPTDLLDLLDFVSFTINKVSNEIQYKCSGAGDPHTVTMEVFNLLSSWPWDAK 134
             + +N  +  ++ D  + + + I ++S ++   C+G  +    TM +F+LL  + WDAK
Sbjct: 64  PLLTENCITTIEVFDSKETLPYAIFRISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAK 123

Query: 135 VVLALAAFAINYGEFWLLVQQSSTDLLAKDISLLKKLPEIFERVDIVKQKFEALDKLIKA 194
            VL L   A  YG   L V  +  D +A  I+ L +LP   ER    +   E+L+ LIKA
Sbjct: 124 AVLVLGVLAATYGGLLLPVHLAICDPVAASIAKLNQLP--IERTKF-RPWLESLNLLIKA 183

Query: 195 LVDVAKCIVDCKMLPPHYITPDTPEMKSATTLIPTAIYWTIRSIVACAAQNAGLIGVGHE 254
           +VDV KCI+  + +P      D   +    + I    Y  ++S + C  Q          
Sbjct: 184 MVDVTKCIIKFEKIPFKQAKLDNNILGETLSNIYLTTYRVVKSALTCMQQ---------- 243

Query: 255 YLASASETWELSSLAHKIDNIRKHLEQLLLACHHYINEKMHHEAYMNLVRLFEIPHIDNN 314
                                                                IP+    
Sbjct: 244 -----------------------------------------------------IPYFKQT 303

Query: 315 KILRALIYSKDDKPPLIDGLSKEKATLEVLRKKNVLLLISDLDISIVELSMLDQIYRESR 374
                                 ++ ++  ++ K  LLL+S   +  +   +L Q+Y    
Sbjct: 304 ----------------------QQISITEVQDKVTLLLLSKPPVEPL-FFLLQQLY--DH 363

Query: 375 HNKTRAESDYEVVWMPI-VEPTWTEEKQVKFEALLGLMPWYSVAHPSLIESAVVKYVRQV 434
            + T  E +YE++W+PI     WT+E++  F+     +PW SV  P L+ S ++ + +Q 
Sbjct: 364 PSNTNTEQNYEIIWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMSSTILNFFKQE 423

Query: 435 WNF-IKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAREESLWKEETWQLELLVDS 494
           W++   + +LVV+D  G+ VN NA+ M+ IWG  AYPF+ +RE+ LWKE  W + LL+D 
Sbjct: 424 WHYKDNEAMLVVIDSNGRFVNMNAMDMVLIWGVKAYPFSVSREDELWKEHGWSINLLLDG 483

Query: 495 VEPLIFQWKETGKYICIIGGEDLGWIRSFSSKAKSVANDAGIELEILYVGKSNPGEKIRK 554
           + P        G+ ICI G E+L WI  F S A+ + N  G +LE++Y+      E+  +
Sbjct: 484 IHPTF-----EGREICIFGSENLDWIDEFVSLARKIQN-LGFQLELIYLSNQRRDERAME 543

Query: 555 NIAAILADKIIHTLVDPTLIWFFWVRLESMWYSKTQR--GNTIEDDPVMQETMTMLSFDS 614
             +          L  PTL   FW+RLES+  SK +R      + D V +E   +L FD 
Sbjct: 544 ESS---------ILFSPTLQQLFWLRLESIERSKLKRIVIEPSKPDRVFEEVRNLLDFDY 576

Query: 615 G-DQGWALFCKGSTDILRAKAETITNVVDGYE--------ERWKIHVKEEGFIPAM---- 674
           G  +GW +   GST      AET    VDG +         RW  + K  GF  A+    
Sbjct: 604 GKHRGWGIIGNGST------AET----VDGEKMTERMRKIVRWGEYAKGLGFTEAIEIAA 576

Query: 675 SKDLEHIHTPEHCNRLILPSSNGTIPEKVVCSECGSAMEKFIMYR 703
            K  E  HT       ++P       + V C +C   M++F+ Y+
Sbjct: 664 EKPCELSHT------AVVPFEEALTMKVVTCEKCKWPMKRFVAYQ 576

BLAST of HG10005098 vs. TAIR 10
Match: AT4G31240.1 (protein kinase C-like zinc finger protein )

HSP 1 Score: 45.8 bits (107), Expect = 1.6e-04
Identity = 30/111 (27.03%), Positives = 55/111 (49.55%), Query Frame = 0

Query: 364 SMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIE 423
           S L  +Y E     T  +  +EV+ +       T+    +F   +  MPW ++ +    E
Sbjct: 223 SQLVDVYNEL---ATTDKGSFEVILIS------TDRDSREFNINMTNMPWLAIPY----E 282

Query: 424 SAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAR 475
               + + +++N    P LV++ P+ K V TNA  M+ ++GS ++PFT +R
Sbjct: 283 DRTRQDLCRIFNVKLIPALVIIGPEEKTVTTNAREMVSLYGSRSFPFTESR 320

BLAST of HG10005098 vs. TAIR 10
Match: AT4G31240.2 (protein kinase C-like zinc finger protein )

HSP 1 Score: 45.8 bits (107), Expect = 1.6e-04
Identity = 30/111 (27.03%), Positives = 55/111 (49.55%), Query Frame = 0

Query: 364 SMLDQIYRESRHNKTRAESDYEVVWMPIVEPTWTEEKQVKFEALLGLMPWYSVAHPSLIE 423
           S L  +Y E     T  +  +EV+ +       T+    +F   +  MPW ++ +    E
Sbjct: 223 SQLVDVYNEL---ATTDKGSFEVILIS------TDRDSREFNINMTNMPWLAIPY----E 282

Query: 424 SAVVKYVRQVWNFIKKPLLVVLDPQGKVVNTNAVHMLWIWGSLAYPFTSAR 475
               + + +++N    P LV++ P+ K V TNA  M+ ++GS ++PFT +R
Sbjct: 283 DRTRQDLCRIFNVKLIPALVIIGPEEKTVTTNAREMVSLYGSRSFPFTESR 320

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KAA0061050.1    0.0e+00   95.58     protein SIEVE ELEMENT OCCLUSION B [Cucumis melo var. makuwa]
XP_008444389.1  0.0e+00   95.44     PREDICTED: protein SIEVE ELEMENT OCCLUSION B [Cucumis melo]
XP_038884139.1  0.0e+00   94.63     protein SIEVE ELEMENT OCCLUSION B-like [Benincasa hispida]
XP_004143056.1  0.0e+00   94.72     protein SIEVE ELEMENT OCCLUSION B [Cucumis sativus] >KGN62332.1 hypothetical pro...
XP_023537424.1  0.0e+00   85.27     protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita pepo subsp. pepo]

Match Name  E-value   Identity  Description
Q9SS87      6.9e-157  42.84     Protein SIEVE ELEMENT OCCLUSION B OS=Arabidopsis thaliana OX=3702 GN=SEOB PE=1 S...
Q93XX2      2.2e-123  35.21     Protein SIEVE ELEMENT OCCLUSION A OS=Arabidopsis thaliana OX=3702 GN=SEOA PE=1 S...
Q9FXE2      1.2e-68   29.09     Protein SIEVE ELEMENT OCCLUSION C OS=Arabidopsis thaliana OX=3702 GN=SEOC PE=4 S...
Q7XPE8      1.2e-04   29.55     Probable nucleoredoxin 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g060860...

Match Name  E-value  Identity  Description
A0A5A7V141  0.0e+00  95.58     Protein SIEVE ELEMENT OCCLUSION B OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A1S3B9Q6  0.0e+00  95.44     protein SIEVE ELEMENT OCCLUSION B OS=Cucumis melo OX=3656 GN=LOC103487729 PE=4 S...
A0A0A0LNE1  0.0e+00  94.72     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G349660 PE=4 SV=1
A0A6J1KKF1  0.0e+00  85.27     protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita maxima OX=3661 GN=LOC1114960...
A0A6J1GIV3  0.0e+00  85.13     protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita moschata OX=3662 GN=LOC11145...

Match Name   E-value   Identity  Description
AT3G01680.1  4.9e-158  42.84     CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640);...
AT3G01670.1  1.6e-124  35.21     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G67790.1  1.3e-57   26.38     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G31240.1  1.6e-04   27.03     protein kinase C-like zinc finger protein
AT4G31240.2  1.6e-04   27.03     protein kinase C-like zinc finger protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                      Source   Source Term     Source Description                      Alignment
IPR027944  Sieve element occlusion, C-terminal  PFAM     PF14577         SEO_C                                   coord: 474..705; e-value: 3.5E-95; score: 317.8
IPR027942  Sieve element occlusion, N-terminal  PFAM     PF14576         SEO_N                                   coord: 22..310; e-value: 5.3E-103; score: 344.2
None       No IPR available                     PANTHER  PTHR33232:SF20  PROTEIN SIEVE ELEMENT OCCLUSION B-LIKE  coord: 7..706
IPR039299  Protein SIEVE ELEMENT OCCLUSION      PANTHER  PTHR33232       PROTEIN SIEVE ELEMENT OCCLUSION B-LIKE  coord: 7..706
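
The `coord` entries above give the start and end residue of each match on the predicted protein. As a minimal sketch, assuming these are 1-based inclusive positions (the usual convention for InterPro match locations, not stated on this page), the span of a match is `end - start + 1`. The helper name `match_span` is hypothetical.

```python
import re


def match_span(alignment: str) -> int:
    """Number of residues covered by a 'coord: START..END' entry.

    Assumes START and END are 1-based and inclusive.
    """
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", alignment)
    if m is None:
        raise ValueError(f"no coordinates found in {alignment!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# Coordinates copied from the InterPro annotation of HG10005098.
for name, coord in [("SEO_N", "coord: 22..310"),
                    ("SEO_C", "coord: 474..705"),
                    ("PTHR33232", "coord: 7..706")]:
    print(name, match_span(coord))
```

Under that assumption the N-terminal and C-terminal Pfam matches cover 289 and 232 residues respectively, and the PANTHER family match covers 700 residues.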

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10005098.1  HG10005098.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0010088      phloem development