HG10022544 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022544
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAlkaline/neutral invertase
LocationChr05: 25313774 .. 25319148 (+)
RNA-Seq ExpressionHG10022544
SyntenyHG10022544
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATGGGTTGGGACTTCGAAATGTGAGCTCTCATTGCTCGATCTCTGAGATGGATGATTATGATCTTTCTCGTCTTCTTGATAAGCCTAAGCTCAATATTGAGAGGCAAAGATCATTTGACGAGAGATCCCTCAGTGAGCTGTCCATCAGCCTTGCTAGGGGAGGCTTGGACAACTTTGAGAGCTCATATTCACCTGGTGGAAGGTCAGGATTTGATACCCCAGCTTCATCTACCAGAAACTCATTTGAGCCCCACCCAATGATCGCTGAAGCATGGGAGGCTTTGCGGAGATCTTTGGTGCATTTTCGGGGCCAACCAGTTGGAACTATTGCAGCATATGACCATGCCTCGGAGGAAGTTTTGAATTATGATCAGGTTGATATCTTTACAACCAATAAAAAAACCGATAATGATTTTGATGTATCATTTGGCATGCTAGAGACCCCAATCATGTGAATCTGGAGACTGCAGTGTTGTAGAATAATTATGAATAGCTAATATATTAGGTGGAAAATGTTTTTATCCTCTGTCGAGAAAGTTCAAATTAAAAGTCAGAGAACTTATCAGGTAAATAGTGGGCACCAAAAATACATCGAAAACTTTAACTCAAAAGTTTGATTTTCTTGCCTCAAAAAAAAAAAAAAAGAAAAAGAAAAAAGAAAAAAGTTTGATTTTCTTCTATATTTACATTCATGGAAAGAACGGCCGTAGACAACCTGAAGTAAGTTAGTTAAATCATAAAACTGGAGCCCCTAACCCCATTTCTTACCACAAAATTGTCATGTATTTGATAAGAAATAATTGTATACTTGCAGTTGTTTCTATAAAATAAAAATAATTGTATACTAGTAACGGCTTTAACAAACTTTTCAATAAAGAAAATACGACTTTAACAAAAGTCTGAATTATGTATTGGTTGTGTTACTTTTCTCCCAGGTTTTTGTTCGGGATTTTGTACCAAGTGCTTTGGCTTTTCTGATGAACGGGGAACCTGAAATAGTTAAGAACTTCCTGTTAAAGACTCTGCAGCTTCAGGGATGGGAAAAAAGAATAGACAGATTCAAGCTTGGGGAAGGTGCAATGCCAGCTAGCTTTAAGGTTCTTCATGATCCTGTTAGAAAAACAGATACCATTGCTGCTGATTTTGGAGAGAGTGCGATAGGAAGAGTTGCTCCTGTTGACTCTGGATTCTGGTGGATCATTCTGCTCCGTGCGTATACAAAGTCAACGGGTGATCTATCTCTGGCTGAAACACCAGAGTGTCAGAAGGGAATGAGACTTATTTTAACTTTATGTCTGTCGGAGGGGTTTGATACCTTCCCGACTCTACTTTGTGCTGATGGATGCTCCATGATTGATCGAAGAATGGTAAAATTTCTTGATATCTTTTGTTCGTTATGAAGTTACTTTTAGCTGTTTTTCATTTCATCATACATAATTATCTTTTTATAATACTTTCAGTTAAATTTATTATGTGATGTCTATGTATATAACAACTGTGAAAGTTTCTAAAAAGAATGAAGTTTTTATAAGTTATTGACATCAGTTGATTTTTAATAGGCCAATCCATAAAAAAGGAAAAATAAAATGAAAAAAATAATAGTTACAGAATGAGGAGGCAGTCACAACAATGTCCTAAATTTTTTTATTCTATAGCTTGTTATATTTCTCTGTAGGACTTCTTTTAATGATCTAACGGTTGACTTCTTGGACTTGGTTTTTATTTACATTAAATTATTTATGTGTGATACAAACAAATGATTTTATTGTGTTATGTTACTGCTGTAGGATGTGTTTTGATGGCTTTTATTGCATGGCTTCTGAGATCGTAAAGTATTAACATTTTTAATGCGTAATAAACATAAATGGTCCCTGCACAAGTGTTTATCAGAACACTATGTCTTTTACTGAAGCATGTATATTTGAAGAGCTAACCTCTTCCACTGTTATTTTCCCATCAGTGGGCTGTAATGTGATGATGCCTATCATCAACAATTAACATGAAGGGTTTAAACTATGATTCCTTCTTTCTTCCTTCAACTTTATTTCCTTGTCTAACTGGGAAGTCAGAAAAGAGATCTTACTGCACGGTGTATTTAAAATTGTATTGTTTGTCACAGTATTTAAAATTCCAGGATTAATTTAATCAGCTTTCCTGAATTTACCTATTTTCAAACCATTATTTTTAGTCCAGCATAAGTTATAATTAACATTTTTAGTATTTCAGTCATGTAGACGTAAAATTTTCTACCCAGAAGTGCTATTTTGCAGCTTTCTTATATTGTTGGGGAGCAGATATTGTACTTGATTATTGCAGTTTGATATAATTATTAATCTTAGGTAATCCTAAATGCGTTTTAGTTTTTTTTTTCCCTTTTTAAATTCGATAGCATTATGCAAGATTTTTTTGTTTTGGATAAGAAACAATAATTTAATAATTTGGTGTCTACTCACAAAAAGTTTTTTTTTCCCTTCTTAAATTCGGTAGCATTATGCAAGTTTTTTTGTTTCGGATAAGAAACAATATTTTAATAATTTGGTGTCTACTCACAAAAGTTTTTTTTTCCCTTCTTAAATTCAATAGCATTATGCAAGATTTTTTCTTTTGGATAAGAAACAATATTTTAATAATTTGGTGTCTACTCACAAAAGCTTTTTTTTTTTTGGTTATGAAATATCGTGTTTCTGATGAAAAATGTTCTGATAAGTATAAGATATCTTCAACAACTAAAGAAAGTTTGGCCCTCGAATATTAATCTTTTTGGAACCTTCTGTGATTTTATTTTCAGATGTTAAAACAACAGATAACAACCTTTTCATTAACTTCTTTGCAGCTATTGTGTTCAAATTTAAAGAATAGAAGATATTTCTGAGAAATATTTACCTTATCCATGTATTTTAACTTAATAATTTTTCTCCAGGGTATATATGGTTATCCTATAGAAATTCAAGCCCTTTTCTTCATGGCCTTGAGATGCGCTCTGGCTATGCTGAAACATGATGCTGAAGGAAAAGAGTGCATAGAGCGTATTGTGAAGCGTTTGCATGCTTTAAGTTATCACATGAGGAGTTATTTCTGGCTTGATTTTCAGCAACTAAATGACATCTACCGTTATAAAACTGAAGAATATTCACATACAGCTGTAAATAAGTTTAATGTCATCCCTGATTCAATCCCAGAATGGGTGTTTGATTTTATGCCCACACGTGGCGGGTACTTTGTTGGCAACGTTAGTCCTGCAAGAATGGACTTTAGATGGTTTGCTTTAGGTAATTGTGTTGCAATTCTATCATCTCTTGCCACCCCTGAGCAATCAATAGCTATTATGGATCTTATTGAATCACGCTGGGAAGAGCTGGTTGGAGAAATGCCTTTGAAAATATCATATCCTGCCATAGAAAGCCATGACTGGCGAATTATTACTGGTTGTGATCCGAAGAATACCAGGTGGAGCTACCATAATGGTGGATCTTGGCCAGGTTTGTTGCTTCACTCATTTCTCTCATCAATTTATAAAAAAGCTTATATTGTTTATAATTAATCCATAAAAGTTTTCTAATAATAGCTTTAGGTATAAAAGTAAACTGTGCCAGCCATTGTTGGCTTGCAATTAGAACGGGCATCAGTGCCTTTCATGCACTTGGATCTGCCAATCATCAATTTTATTGATGATATCACTATTTTAAACCTCAAGAAGACGGTATTAGGAACTGTCTTTTTCTCCCCTCAACGATTTCTTGAAGCATTAAGCTTATGCACTACTAGATTTTAGCTTCTAGCCAGTAGTAGAGGTCTGAGGTGGAATCTAAGTTGATAAAACAGGAAAAGTATCAGATAAGTACTTTTTGTAAAATTTCCACAATGATAATTGCTAAGTGGTGAATTCCTGGGGGCTATATATAACTTTCATCTATAAAGAGTCACTTGTTTTGTACTTAATACATTCCGGTTTTAAGAAGGATGCAGTTAGGCGCCTCCCTTCAAGACAAGGGAAACTTGCTCAGACAAGCATTTAGGTTTGTTCTTCTGAGCTTTTTGAGGCATGAATGAAGCATGTGCTCGCAATCATACTTGTCTTTTAAAAATCCTAGGGCTAACAGAAAAAAAAAGGAAAGGAAAAAGAATGAGCTCACTTGCCTTATCTTTTTTCCTTTAATCTTTCGGCGCTATCTAAAGTAGCTCTTTTTCTTAGTTTGGACCTATTTGGAATGGATTTTCAAGTTTTTAGAAAAGGATTTTAAAGTATCATTCCTAACAGGTTCTTTCTCATTCTTCTCTCTCTTGGTTTTTCTCGCTCTCTCTTTGCCTCCCCAAATCAACTCTCTTGTTCTTTACTCTCAGAATTTTGTCTCTCCCCTTTCTATCTCTCTCAGAACTAAACCTATTTCATGAACTGAAATGGTAACTACGAGAAGTAGCTAGTGATGAGTTTCAATATGATGAAGTTTTTTTTTTCCCCTTCTATTTTGTATTCTTTTTACACAAATGTCAAATTTTCTAGATCTGCAAGACGTGTTGATGCCAGCCACAGTCATTGTCTATTGTCTAACAAAGAACTGAAGTTCCTTTCCATAAAACATTAAATAGATTTATTGGAAGGATTTTAGTGTTATGAATAAGGTACACAGTTGCATCCCAAACTCTAGTACTCCTGCTTGTAGTCTACATGAAAGAAATTTAATTTGTATTACTATAAATCACATTTAGGAGTATCGAGTATGATCACCAAAAAGAGTTAGGATGTTGTTGGAGTACCAAGTATGATTAAATTGAGTATCAAGTTGGAGTCTAATCTAGATACTGTTAGAGATGGATGGCATTTTCCAAACACCAGAACCTTGCTTAAATTTTCTTGTTGGAAAACATCCTTGATATTGGAGGGCAGGTAGCATCCTTTATACAGATGGATATTTTCATTACTCAATGCAGTCGATTCATGAGACAAGGAAAGTTAGGATCATCATTTATGGATACTGCATTGCGTATTGCAGTAAAAGATAGCAGAGTTTGGTTGTTAATCTCATCTTTTGCCTTGTTCTTTTATACTGACATCTTTCTCCTTGCTTTGGATGCAGTGCTACTATGGCTGCTAACAGCTGCTTGCATTAAAACTGGACGACCACAGATTGCTCGAAGAGCCATCGAGCTGGCCGAGAGTCGATTGCTGAAGGATAGTTGGCCCGAATACTATGATGGAAAGTTAGGAAGATATATCGGAAAACAAGCGAGGAAATACCAGACGTGGTCGATAGCAGGATACTTAGTTGCAAAGATGATGTTGGAAGATCCATCACACTTGGGGATGATTTCACTTGAAGAGGACAAGCAAATGAAGCCACTGATCAAGAGATCATCATCTTGGACCTGCTGA

mRNA sequence

ATGGATGGGTTGGGACTTCGAAATGTGAGCTCTCATTGCTCGATCTCTGAGATGGATGATTATGATCTTTCTCGTCTTCTTGATAAGCCTAAGCTCAATATTGAGAGGCAAAGATCATTTGACGAGAGATCCCTCAGTGAGCTGTCCATCAGCCTTGCTAGGGGAGGCTTGGACAACTTTGAGAGCTCATATTCACCTGGTGGAAGGTCAGGATTTGATACCCCAGCTTCATCTACCAGAAACTCATTTGAGCCCCACCCAATGATCGCTGAAGCATGGGAGGCTTTGCGGAGATCTTTGGTGCATTTTCGGGGCCAACCAGTTGGAACTATTGCAGCATATGACCATGCCTCGGAGGAAGTTTTGAATTATGATCAGGTTTTTGTTCGGGATTTTGTACCAAGTGCTTTGGCTTTTCTGATGAACGGGGAACCTGAAATAGTTAAGAACTTCCTGTTAAAGACTCTGCAGCTTCAGGGATGGGAAAAAAGAATAGACAGATTCAAGCTTGGGGAAGGTGCAATGCCAGCTAGCTTTAAGGTTCTTCATGATCCTGTTAGAAAAACAGATACCATTGCTGCTGATTTTGGAGAGAGTGCGATAGGAAGAGTTGCTCCTGTTGACTCTGGATTCTGGTGGATCATTCTGCTCCGTGCGTATACAAAGTCAACGGGTGATCTATCTCTGGCTGAAACACCAGAGTGTCAGAAGGGAATGAGACTTATTTTAACTTTATGTCTGTCGGAGGGGTTTGATACCTTCCCGACTCTACTTTGTGCTGATGGATGCTCCATGATTGATCGAAGAATGGGTATATATGGTTATCCTATAGAAATTCAAGCCCTTTTCTTCATGGCCTTGAGATGCGCTCTGGCTATGCTGAAACATGATGCTGAAGGAAAAGAGTGCATAGAGCGTATTGTGAAGCGTTTGCATGCTTTAAGTTATCACATGAGGAGTTATTTCTGGCTTGATTTTCAGCAACTAAATGACATCTACCGTTATAAAACTGAAGAATATTCACATACAGCTGTAAATAAGTTTAATGTCATCCCTGATTCAATCCCAGAATGGGTGTTTGATTTTATGCCCACACGTGGCGGGTACTTTGTTGGCAACGTTAGTCCTGCAAGAATGGACTTTAGATGGTTTGCTTTAGGTAATTGTGTTGCAATTCTATCATCTCTTGCCACCCCTGAGCAATCAATAGCTATTATGGATCTTATTGAATCACGCTGGGAAGAGCTGGTTGGAGAAATGCCTTTGAAAATATCATATCCTGCCATAGAAAGCCATGACTGGCGAATTATTACTGGTTGTGATCCGAAGAATACCAGGTGGAGCTACCATAATGGTGGATCTTGGCCAGTGCTACTATGGCTGCTAACAGCTGCTTGCATTAAAACTGGACGACCACAGATTGCTCGAAGAGCCATCGAGCTGGCCGAGAGTCGATTGCTGAAGGATAGTTGGCCCGAATACTATGATGGAAAGTTAGGAAGATATATCGGAAAACAAGCGAGGAAATACCAGACGTGGTCGATAGCAGGATACTTAGTTGCAAAGATGATGTTGGAAGATCCATCACACTTGGGGATGATTTCACTTGAAGAGGACAAGCAAATGAAGCCACTGATCAAGAGATCATCATCTTGGACCTGCTGA

Coding sequence (CDS)

ATGGATGGGTTGGGACTTCGAAATGTGAGCTCTCATTGCTCGATCTCTGAGATGGATGATTATGATCTTTCTCGTCTTCTTGATAAGCCTAAGCTCAATATTGAGAGGCAAAGATCATTTGACGAGAGATCCCTCAGTGAGCTGTCCATCAGCCTTGCTAGGGGAGGCTTGGACAACTTTGAGAGCTCATATTCACCTGGTGGAAGGTCAGGATTTGATACCCCAGCTTCATCTACCAGAAACTCATTTGAGCCCCACCCAATGATCGCTGAAGCATGGGAGGCTTTGCGGAGATCTTTGGTGCATTTTCGGGGCCAACCAGTTGGAACTATTGCAGCATATGACCATGCCTCGGAGGAAGTTTTGAATTATGATCAGGTTTTTGTTCGGGATTTTGTACCAAGTGCTTTGGCTTTTCTGATGAACGGGGAACCTGAAATAGTTAAGAACTTCCTGTTAAAGACTCTGCAGCTTCAGGGATGGGAAAAAAGAATAGACAGATTCAAGCTTGGGGAAGGTGCAATGCCAGCTAGCTTTAAGGTTCTTCATGATCCTGTTAGAAAAACAGATACCATTGCTGCTGATTTTGGAGAGAGTGCGATAGGAAGAGTTGCTCCTGTTGACTCTGGATTCTGGTGGATCATTCTGCTCCGTGCGTATACAAAGTCAACGGGTGATCTATCTCTGGCTGAAACACCAGAGTGTCAGAAGGGAATGAGACTTATTTTAACTTTATGTCTGTCGGAGGGGTTTGATACCTTCCCGACTCTACTTTGTGCTGATGGATGCTCCATGATTGATCGAAGAATGGGTATATATGGTTATCCTATAGAAATTCAAGCCCTTTTCTTCATGGCCTTGAGATGCGCTCTGGCTATGCTGAAACATGATGCTGAAGGAAAAGAGTGCATAGAGCGTATTGTGAAGCGTTTGCATGCTTTAAGTTATCACATGAGGAGTTATTTCTGGCTTGATTTTCAGCAACTAAATGACATCTACCGTTATAAAACTGAAGAATATTCACATACAGCTGTAAATAAGTTTAATGTCATCCCTGATTCAATCCCAGAATGGGTGTTTGATTTTATGCCCACACGTGGCGGGTACTTTGTTGGCAACGTTAGTCCTGCAAGAATGGACTTTAGATGGTTTGCTTTAGGTAATTGTGTTGCAATTCTATCATCTCTTGCCACCCCTGAGCAATCAATAGCTATTATGGATCTTATTGAATCACGCTGGGAAGAGCTGGTTGGAGAAATGCCTTTGAAAATATCATATCCTGCCATAGAAAGCCATGACTGGCGAATTATTACTGGTTGTGATCCGAAGAATACCAGGTGGAGCTACCATAATGGTGGATCTTGGCCAGTGCTACTATGGCTGCTAACAGCTGCTTGCATTAAAACTGGACGACCACAGATTGCTCGAAGAGCCATCGAGCTGGCCGAGAGTCGATTGCTGAAGGATAGTTGGCCCGAATACTATGATGGAAAGTTAGGAAGATATATCGGAAAACAAGCGAGGAAATACCAGACGTGGTCGATAGCAGGATACTTAGTTGCAAAGATGATGTTGGAAGATCCATCACACTTGGGGATGATTTCACTTGAAGAGGACAAGCAAATGAAGCCACTGATCAAGAGATCATCATCTTGGACCTGCTGA

Protein sequence

MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNFESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQMKPLIKRSSSWTC
Homology
BLAST of HG10022544 vs. NCBI nr
Match: XP_038897844.1 (probable alkaline/neutral invertase D [Benincasa hispida])

HSP 1 Score: 1141.3 bits (2951), Expect = 0.0e+00
Identity = 551/554 (99.46%), Postives = 554/554 (100.00%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF
Sbjct: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLV+FRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVYFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR
Sbjct: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE
Sbjct: 421 PLKISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. NCBI nr
Match: XP_022143693.1 (probable alkaline/neutral invertase D [Momordica charantia])

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 546/554 (98.56%), Postives = 552/554 (99.64%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDG GLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF
Sbjct: 1   MDGFGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTD+I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR
Sbjct: 181 VLHDPVRKTDSIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKI+YPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIAR+AIE
Sbjct: 421 PLKITYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARKAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. NCBI nr
Match: XP_011659122.1 (probable alkaline/neutral invertase D [Cucumis sativus] >KGN44485.1 hypothetical protein Csa_016461 [Cucumis sativus])

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 542/554 (97.83%), Postives = 550/554 (99.28%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDG GLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSI LARGGLDNF
Sbjct: 1   MDGFGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSIGLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASS+RNSFEPHPMIAEAWEALRRS+V+FRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSSRNSFEPHPMIAEAWEALRRSMVYFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTDT+AADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAET ECQKGMR
Sbjct: 181 VLHDPVRKTDTVAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETSECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEW+F
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWLF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMPTRGGYFVGNVSPARMDFRWFALGNCVAIL SLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILGSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE
Sbjct: 421 PLKISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. NCBI nr
Match: XP_008461922.1 (PREDICTED: probable alkaline/neutral invertase D [Cucumis melo] >KAA0046174.1 putative alkaline/neutral invertase D [Cucumis melo var. makuwa] >TYK00203.1 putative alkaline/neutral invertase D [Cucumis melo var. makuwa])

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 542/552 (98.19%), Postives = 550/552 (99.64%), Query Frame = 0

Query: 3   GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNFES 62
           GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSI LARGGLDNFES
Sbjct: 5   GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSIGLARGGLDNFES 64

Query: 63  SYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVL 122
           SYSPGGRSGFDTPASS+RNSFEPHPMIAEAWEALRRS+V+FRGQPVGTIAAYDHASEEVL
Sbjct: 65  SYSPGGRSGFDTPASSSRNSFEPHPMIAEAWEALRRSMVYFRGQPVGTIAAYDHASEEVL 124

Query: 123 NYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 182
           NYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL
Sbjct: 125 NYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 184

Query: 183 HDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLI 242
           HDPVRKTDT+AADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAET ECQKGMRLI
Sbjct: 185 HDPVRKTDTVAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETSECQKGMRLI 244

Query: 243 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 302
           LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE
Sbjct: 245 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 304

Query: 303 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 362
           CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF
Sbjct: 305 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 364

Query: 363 MPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPL 422
           MPTRGGYFVGNVSPARMDFRWFALGNCVAIL+SLATPEQS+AIMDLIESRWEELVGEMPL
Sbjct: 365 MPTRGGYFVGNVSPARMDFRWFALGNCVAILASLATPEQSMAIMDLIESRWEELVGEMPL 424

Query: 423 KISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 482
           KISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA
Sbjct: 425 KISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 484

Query: 483 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 542
           ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM
Sbjct: 485 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 544

Query: 543 KPLIKRSSSWTC 555
           KPLIKRSSSWTC
Sbjct: 545 KPLIKRSSSWTC 556

BLAST of HG10022544 vs. NCBI nr
Match: XP_022926252.1 (probable alkaline/neutral invertase D [Cucurbita moschata] >XP_022926253.1 probable alkaline/neutral invertase D [Cucurbita moschata] >XP_022926254.1 probable alkaline/neutral invertase D [Cucurbita moschata] >XP_022981442.1 probable alkaline/neutral invertase D [Cucurbita maxima] >XP_022981443.1 probable alkaline/neutral invertase D [Cucurbita maxima] >XP_022981444.1 probable alkaline/neutral invertase D [Cucurbita maxima] >KAG7037246.1 Alkaline/neutral invertase CINV2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1123.2 bits (2904), Expect = 0.0e+00
Identity = 539/554 (97.29%), Postives = 547/554 (98.74%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFD+RSLSELSI LARGGLDNF
Sbjct: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDDRSLSELSIGLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASSTRN+FE HPMI EAWEALRRSLV+FRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSTRNTFETHPMIGEAWEALRRSLVYFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEK+IDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKKIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR
Sbjct: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIP+WVF
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPDWVF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMP RGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPKRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE
Sbjct: 421 PLKISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           L ESRLLKD WPEYYDGK GRY+GKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LTESRLLKDGWPEYYDGKFGRYVGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. ExPASy Swiss-Prot
Match: Q67XD9 (Alkaline/neutral invertase CINV2 OS=Arabidopsis thaliana OX=3702 GN=CINV2 PE=1 SV=1)

HSP 1 Score: 996.9 bits (2576), Expect = 9.1e-290
Identity = 475/552 (86.05%), Postives = 513/552 (92.93%), Query Frame = 0

Query: 4   LGLRNVSSHCSISEMDDYDLSRLLDKPK-LNIERQRSFDERSLSELSISLARGGLDNFES 63
           L LR   SHCS+SEMDD+DL+R L+KP+ L IER+RSFDERS+SELS    R      E 
Sbjct: 9   LVLRVEGSHCSLSEMDDFDLTRALEKPRQLKIERKRSFDERSMSELSTGYVRQD-SILEM 68

Query: 64  SYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVL 123
           ++SPG RS  DTP  S RNSFEPHPM+AEAWEALRRS+V FRGQPVGTIAAYDHASEEVL
Sbjct: 69  AHSPGSRSMVDTPL-SVRNSFEPHPMVAEAWEALRRSMVFFRGQPVGTIAAYDHASEEVL 128

Query: 124 NYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 183
           NYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKR+DRFKLGEG MPASFKVL
Sbjct: 129 NYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRVDRFKLGEGVMPASFKVL 188

Query: 184 HDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLI 243
           HDPVRKTDTI ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDL+L+ETPECQ+GMRLI
Sbjct: 189 HDPVRKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQRGMRLI 248

Query: 244 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 303
           L+LCLSEGFDTFPTLLCADGCSM+DRRMG+YGYPIEIQALFFMALRCAL+MLK D EG++
Sbjct: 249 LSLCLSEGFDTFPTLLCADGCSMVDRRMGVYGYPIEIQALFFMALRCALSMLKPDEEGRD 308

Query: 304 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 363
            IERIVKRLHALS+HMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNV+PDSIP+WVFDF
Sbjct: 309 FIERIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVMPDSIPDWVFDF 368

Query: 364 MPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPL 423
           MP RGGYFVGNVSPARMDFRWF+LGNCV+ILSSLATP+QS+AIMDL+E RWEELVGEMPL
Sbjct: 369 MPLRGGYFVGNVSPARMDFRWFSLGNCVSILSSLATPDQSMAIMDLLEHRWEELVGEMPL 428

Query: 424 KISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 483
           KI YP IESH+WRI+TGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRAI+L 
Sbjct: 429 KICYPCIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWTLTAACIKTGRPQIARRAIDLI 488

Query: 484 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 543
           ESRL +D WPEYYDGK GRY+GKQARKYQTWSIAGYLVAKMMLEDPSH+GMISLEEDKQM
Sbjct: 489 ESRLHRDCWPEYYDGKQGRYVGKQARKYQTWSIAGYLVAKMMLEDPSHIGMISLEEDKQM 548

Query: 544 KPLIKRSSSWTC 555
           KP+IKRS+SWTC
Sbjct: 549 KPVIKRSASWTC 558

BLAST of HG10022544 vs. ExPASy Swiss-Prot
Match: Q69T31 (Cytosolic invertase 1 OS=Oryza sativa subsp. japonica OX=39947 GN=CINV1 PE=1 SV=1)

HSP 1 Score: 976.9 bits (2524), Expect = 9.8e-284
Identity = 464/556 (83.45%), Postives = 510/556 (91.73%), Query Frame = 0

Query: 5   GLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSIS-----LARGGLDN 64
           G+R  +SH S+SE DD+DLSRLL+KP++N+ERQRSFD+RSLS++S S       RGG   
Sbjct: 9   GMRRSASHTSLSESDDFDLSRLLNKPRINVERQRSFDDRSLSDVSYSGGGHGGTRGG--- 68

Query: 65  FESSYSPGG--RSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHA 124
           F+  YSPGG  RS   TPASS  +SFEPHP++ +AWEALRRSLV FRGQP+GTIAA+DHA
Sbjct: 69  FDGMYSPGGGLRSLVGTPASSALHSFEPHPIVGDAWEALRRSLVFFRGQPLGTIAAFDHA 128

Query: 125 SEEVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPA 184
           SEEVLNYDQVFVRDFVPSALAFLMNGEPEIV++FLLKTL LQGWEK++DRFKLGEGAMPA
Sbjct: 129 SEEVLNYDQVFVRDFVPSALAFLMNGEPEIVRHFLLKTLLLQGWEKKVDRFKLGEGAMPA 188

Query: 185 SFKVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQK 244
           SFKVLHD  +  DT+ ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDL+LAETPECQK
Sbjct: 189 SFKVLHDSKKGVDTLHADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLAETPECQK 248

Query: 245 GMRLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHD 304
           GMRLIL+LCLSEGFDTFPTLLCADGC MIDRRMG+YGYPIEIQALFFMALRCAL +LKHD
Sbjct: 249 GMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALQLLKHD 308

Query: 305 AEGKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPE 364
            EGKE +ERI  RLHALSYHMRSY+WLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIP+
Sbjct: 309 NEGKEFVERIATRLHALSYHMRSYYWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPD 368

Query: 365 WVFDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELV 424
           W+FDFMP +GG+F+GNVSPARMDFRWFALGN +AILSSLATPEQS AIMDLIE RWEEL+
Sbjct: 369 WLFDFMPCQGGFFIGNVSPARMDFRWFALGNMIAILSSLATPEQSTAIMDLIEERWEELI 428

Query: 425 GEMPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARR 484
           GEMPLKI YPAIE+H+WRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARR
Sbjct: 429 GEMPLKICYPAIENHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARR 488

Query: 485 AIELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLE 544
           AI+LAE RLLKD WPEYYDGKLGRY+GKQARK+QTWSIAGYLVAKMMLEDPSHLGMISLE
Sbjct: 489 AIDLAERRLLKDGWPEYYDGKLGRYVGKQARKFQTWSIAGYLVAKMMLEDPSHLGMISLE 548

Query: 545 EDKQMKPLIKRSSSWT 554
           EDK MKP++KRS+SWT
Sbjct: 549 EDKAMKPVLKRSASWT 561

BLAST of HG10022544 vs. ExPASy Swiss-Prot
Match: Q9LQF2 (Alkaline/neutral invertase CINV1 OS=Arabidopsis thaliana OX=3702 GN=CINV1 PE=1 SV=1)

HSP 1 Score: 973.0 bits (2514), Expect = 1.4e-282
Identity = 458/553 (82.82%), Postives = 507/553 (91.68%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLAR-GGLDN 60
           M+G+GLR V SHCS+SEMDD DL+R LDKP+L IER+RSFDERS+SELS   +R  G+ +
Sbjct: 1   MEGVGLRAVGSHCSLSEMDDLDLTRALDKPRLKIERKRSFDERSMSELSTGYSRHDGIHD 60

Query: 61  FESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASE 120
                SP GRS  DTP SS RNSFEPHPM+AEAWEALRRS+V FRGQPVGT+AA D+ ++
Sbjct: 61  -----SPRGRSVLDTPLSSARNSFEPHPMMAEAWEALRRSMVFFRGQPVGTLAAVDNTTD 120

Query: 121 EVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASF 180
           EVLNYDQVFVRDFVPSALAFLMNGEP+IVK+FLLKTLQLQGWEKR+DRFKLGEG MPASF
Sbjct: 121 EVLNYDQVFVRDFVPSALAFLMNGEPDIVKHFLLKTLQLQGWEKRVDRFKLGEGVMPASF 180

Query: 181 KVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGM 240
           KVLHDP+R+TD I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDL+L+ETPECQKGM
Sbjct: 181 KVLHDPIRETDNIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQKGM 240

Query: 241 RLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAE 300
           +LIL+LCL+EGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALR AL+MLK D +
Sbjct: 241 KLILSLCLAEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSALSMLKPDGD 300

Query: 301 GKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWV 360
           G+E IERIVKRLHALS+HMR+YFWLD Q LNDIYR+KTEEYSHTAVNKFNV+PDSIPEWV
Sbjct: 301 GREVIERIVKRLHALSFHMRNYFWLDHQNLNDIYRFKTEEYSHTAVNKFNVMPDSIPEWV 360

Query: 361 FDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGE 420
           FDFMP RGGYFVGNV PA MDFRWFALGNCV+ILSSLATP+QS+AIMDL+E RW ELVGE
Sbjct: 361 FDFMPLRGGYFVGNVGPAHMDFRWFALGNCVSILSSLATPDQSMAIMDLLEHRWAELVGE 420

Query: 421 MPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAI 480
           MPLKI YP +E H+WRI+TGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA+
Sbjct: 421 MPLKICYPCLEGHEWRIVTGCDPKNTRWSYHNGGSWPVLLWQLTAACIKTGRPQIARRAV 480

Query: 481 ELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEED 540
           +L ESRL +D WPEYYDGKLGRY+GKQARKYQTWSIAGYLVAKM+LEDPSH+GMISLEED
Sbjct: 481 DLIESRLHRDCWPEYYDGKLGRYVGKQARKYQTWSIAGYLVAKMLLEDPSHIGMISLEED 540

Query: 541 KQMKPLIKRSSSW 553
           K MKP+IKRS+SW
Sbjct: 541 KLMKPVIKRSASW 548

BLAST of HG10022544 vs. ExPASy Swiss-Prot
Match: F4I2X9 (Probable alkaline/neutral invertase D OS=Arabidopsis thaliana OX=3702 GN=INVD PE=2 SV=1)

HSP 1 Score: 972.6 bits (2513), Expect = 1.8e-282
Identity = 464/548 (84.67%), Postives = 510/548 (93.07%), Query Frame = 0

Query: 6   LRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNFESSYS 65
           +  V+S  SIS++D  +L+RLLD+P++NIER+RSFDERS SE+ I         F++  S
Sbjct: 1   MEGVNSSSSISDLD--ELARLLDRPRVNIERKRSFDERSFSEMGI---------FDNVNS 60

Query: 66  PGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVLNYD 125
           PG   G++TP SS RNSFEPHPM+AEAW+ALRRSLV+FRGQPVGTIAAYDHA+EEVLNYD
Sbjct: 61  PG---GWETPVSSARNSFEPHPMVAEAWDALRRSLVYFRGQPVGTIAAYDHATEEVLNYD 120

Query: 126 QVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVLHDP 185
           QVFVRDFVPSALAFLMNGEP+IVKNFLLKT+Q+QG EKRIDRFKLGEGAMPASFKV+HDP
Sbjct: 121 QVFVRDFVPSALAFLMNGEPDIVKNFLLKTIQIQGREKRIDRFKLGEGAMPASFKVIHDP 180

Query: 186 VRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLILTL 245
           +++TD+I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD SLAET ECQKGMRLIL+L
Sbjct: 181 IKETDSINADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDTSLAETSECQKGMRLILSL 240

Query: 246 CLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKECIE 305
           CLSEGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALR A++MLKHDAEGKE +E
Sbjct: 241 CLSEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSAMSMLKHDAEGKEFME 300

Query: 306 RIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPT 365
           RIVKRLHALS+HMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMP 
Sbjct: 301 RIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPL 360

Query: 366 RGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPLKIS 425
           RGGYF+GNVSPARMDFRWFALGNCVAIL+SLATPEQS +IMDLIE RWEELVGEMP+KI 
Sbjct: 361 RGGYFIGNVSPARMDFRWFALGNCVAILASLATPEQSASIMDLIEERWEELVGEMPVKIC 420

Query: 426 YPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELAESR 485
           +PAIESH+WRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAI+LAE+R
Sbjct: 421 HPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIDLAEAR 480

Query: 486 LLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQMKPL 545
           LLKD WPEYYDGK GR+IGKQARK+QTWSIAGYLVAKM+LEDPSHLGMISLEEDKQ KP+
Sbjct: 481 LLKDGWPEYYDGKSGRFIGKQARKFQTWSIAGYLVAKMLLEDPSHLGMISLEEDKQTKPV 534

Query: 546 IKRSSSWT 554
           IKRS SWT
Sbjct: 541 IKRSYSWT 534

BLAST of HG10022544 vs. ExPASy Swiss-Prot
Match: Q9SW48 (Probable alkaline/neutral invertase B OS=Arabidopsis thaliana OX=3702 GN=INVB PE=1 SV=1)

HSP 1 Score: 931.0 bits (2405), Expect = 6.2e-270
Identity = 444/559 (79.43%), Postives = 498/559 (89.09%), Query Frame = 0

Query: 6   LRNVSSHCSISEMDDYDLSRLLDKPK-LNIERQRSFDERSLSELSISLARGGLDN----- 65
           ++NV S  ++ ++DD D ++LL+KP+ LNI+R RS DERSL+EL+ S      DN     
Sbjct: 16  IKNVDSLSTLDDIDDIDFAKLLEKPRPLNIDRLRSLDERSLTELTGSPQLRNADNASRAP 75

Query: 66  ----FESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYD 125
               +  S S G RSGF+TP S  +  FE HPM+ EAW+ALRRS+V+FRGQPVGTIAA D
Sbjct: 76  DHADYVISPSFGRRSGFNTPRS--QPGFESHPMVGEAWDALRRSMVYFRGQPVGTIAAVD 135

Query: 126 HASEEVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAM 185
           + SEE LNYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTL+LQ WEK+IDRF+LGEG M
Sbjct: 136 N-SEEKLNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLRLQSWEKKIDRFQLGEGVM 195

Query: 186 PASFKVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPEC 245
           PASFKV HDPVR  +T+ ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD SLA+ PEC
Sbjct: 196 PASFKVFHDPVRNHETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLADMPEC 255

Query: 246 QKGMRLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLK 305
           QKG+RLIL+LCLSEGFDTFPTLLCADGC MIDRRMG+YGYPIEIQALFFMALRCAL +LK
Sbjct: 256 QKGIRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALLLLK 315

Query: 306 HDAEGKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSI 365
           HD EGKE +E+IVKRLHALSYHMRSYFWLD +QLNDIYRYKTEEYSHTAVNKFNVIPDS+
Sbjct: 316 HDGEGKEMVEQIVKRLHALSYHMRSYFWLDLKQLNDIYRYKTEEYSHTAVNKFNVIPDSL 375

Query: 366 PEWVFDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEE 425
           PEWVFDFMP  GG+F+GNVSPARMDFRWFALGNC+AILSSLATPEQS AIMDLIESRWEE
Sbjct: 376 PEWVFDFMPPHGGFFIGNVSPARMDFRWFALGNCIAILSSLATPEQSTAIMDLIESRWEE 435

Query: 426 LVGEMPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIA 485
           LVGEMPLK+ YPAIESH+WRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIA
Sbjct: 436 LVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIA 495

Query: 486 RRAIELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMIS 545
           RRAIE+AE+RL KD WPEYYDGK+GRY+GKQ+RK QTWS+AGYLVAKMMLEDPSH+GM+ 
Sbjct: 496 RRAIEVAEARLHKDHWPEYYDGKVGRYVGKQSRKNQTWSVAGYLVAKMMLEDPSHVGMVC 555

Query: 546 LEEDKQMKPLIKRSSSWTC 555
           LEEDKQMKP+++RS+SWTC
Sbjct: 556 LEEDKQMKPVMRRSNSWTC 571

BLAST of HG10022544 vs. ExPASy TrEMBL
Match: A0A6J1CRC0 (Alkaline/neutral invertase OS=Momordica charantia OX=3673 GN=LOC111013535 PE=3 SV=1)

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 546/554 (98.56%), Postives = 552/554 (99.64%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDG GLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF
Sbjct: 1   MDGFGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTD+I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR
Sbjct: 181 VLHDPVRKTDSIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKI+YPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIAR+AIE
Sbjct: 421 PLKITYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARKAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. ExPASy TrEMBL
Match: A0A0A0K476 (Alkaline/neutral invertase OS=Cucumis sativus OX=3659 GN=Csa_7G308910 PE=3 SV=1)

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 542/554 (97.83%), Postives = 550/554 (99.28%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDG GLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSI LARGGLDNF
Sbjct: 1   MDGFGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSIGLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASS+RNSFEPHPMIAEAWEALRRS+V+FRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSSRNSFEPHPMIAEAWEALRRSMVYFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTDT+AADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAET ECQKGMR
Sbjct: 181 VLHDPVRKTDTVAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETSECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEW+F
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWLF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMPTRGGYFVGNVSPARMDFRWFALGNCVAIL SLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILGSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE
Sbjct: 421 PLKISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. ExPASy TrEMBL
Match: A0A5A7TSZ5 (Alkaline/neutral invertase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1333G00200 PE=3 SV=1)

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 542/552 (98.19%), Postives = 550/552 (99.64%), Query Frame = 0

Query: 3   GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNFES 62
           GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSI LARGGLDNFES
Sbjct: 5   GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSIGLARGGLDNFES 64

Query: 63  SYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVL 122
           SYSPGGRSGFDTPASS+RNSFEPHPMIAEAWEALRRS+V+FRGQPVGTIAAYDHASEEVL
Sbjct: 65  SYSPGGRSGFDTPASSSRNSFEPHPMIAEAWEALRRSMVYFRGQPVGTIAAYDHASEEVL 124

Query: 123 NYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 182
           NYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL
Sbjct: 125 NYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 184

Query: 183 HDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLI 242
           HDPVRKTDT+AADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAET ECQKGMRLI
Sbjct: 185 HDPVRKTDTVAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETSECQKGMRLI 244

Query: 243 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 302
           LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE
Sbjct: 245 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 304

Query: 303 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 362
           CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF
Sbjct: 305 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 364

Query: 363 MPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPL 422
           MPTRGGYFVGNVSPARMDFRWFALGNCVAIL+SLATPEQS+AIMDLIESRWEELVGEMPL
Sbjct: 365 MPTRGGYFVGNVSPARMDFRWFALGNCVAILASLATPEQSMAIMDLIESRWEELVGEMPL 424

Query: 423 KISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 482
           KISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA
Sbjct: 425 KISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 484

Query: 483 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 542
           ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM
Sbjct: 485 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 544

Query: 543 KPLIKRSSSWTC 555
           KPLIKRSSSWTC
Sbjct: 545 KPLIKRSSSWTC 556

BLAST of HG10022544 vs. ExPASy TrEMBL
Match: A0A1S3CH54 (Alkaline/neutral invertase OS=Cucumis melo OX=3656 GN=LOC103500409 PE=3 SV=1)

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 542/552 (98.19%), Postives = 550/552 (99.64%), Query Frame = 0

Query: 3   GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNFES 62
           GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSI LARGGLDNFES
Sbjct: 5   GLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSIGLARGGLDNFES 64

Query: 63  SYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVL 122
           SYSPGGRSGFDTPASS+RNSFEPHPMIAEAWEALRRS+V+FRGQPVGTIAAYDHASEEVL
Sbjct: 65  SYSPGGRSGFDTPASSSRNSFEPHPMIAEAWEALRRSMVYFRGQPVGTIAAYDHASEEVL 124

Query: 123 NYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 182
           NYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL
Sbjct: 125 NYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 184

Query: 183 HDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLI 242
           HDPVRKTDT+AADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAET ECQKGMRLI
Sbjct: 185 HDPVRKTDTVAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETSECQKGMRLI 244

Query: 243 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 302
           LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE
Sbjct: 245 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 304

Query: 303 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 362
           CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF
Sbjct: 305 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 364

Query: 363 MPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPL 422
           MPTRGGYFVGNVSPARMDFRWFALGNCVAIL+SLATPEQS+AIMDLIESRWEELVGEMPL
Sbjct: 365 MPTRGGYFVGNVSPARMDFRWFALGNCVAILASLATPEQSMAIMDLIESRWEELVGEMPL 424

Query: 423 KISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 482
           KISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA
Sbjct: 425 KISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 484

Query: 483 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 542
           ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM
Sbjct: 485 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 544

Query: 543 KPLIKRSSSWTC 555
           KPLIKRSSSWTC
Sbjct: 545 KPLIKRSSSWTC 556

BLAST of HG10022544 vs. ExPASy TrEMBL
Match: A0A6J1J239 (Alkaline/neutral invertase OS=Cucurbita maxima OX=3661 GN=LOC111480561 PE=3 SV=1)

HSP 1 Score: 1123.2 bits (2904), Expect = 0.0e+00
Identity = 539/554 (97.29%), Postives = 547/554 (98.74%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNF 60
           MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFD+RSLSELSI LARGGLDNF
Sbjct: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDDRSLSELSIGLARGGLDNF 60

Query: 61  ESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEE 120
           ESSYSPGGRSGFDTPASSTRN+FE HPMI EAWEALRRSLV+FRGQPVGTIAAYDHASEE
Sbjct: 61  ESSYSPGGRSGFDTPASSTRNTFETHPMIGEAWEALRRSLVYFRGQPVGTIAAYDHASEE 120

Query: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFK 180
           VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEK+IDRFKLGEGAMPASFK
Sbjct: 121 VLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKKIDRFKLGEGAMPASFK 180

Query: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240
           VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR
Sbjct: 181 VLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMR 240

Query: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300
           LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG
Sbjct: 241 LILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEG 300

Query: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVF 360
           KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIP+WVF
Sbjct: 301 KECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPDWVF 360

Query: 361 DFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEM 420
           DFMP RGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQS+AIMDLIESRWEELVGEM
Sbjct: 361 DFMPKRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSMAIMDLIESRWEELVGEM 420

Query: 421 PLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480
           PLKISYPAIESH+WRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE
Sbjct: 421 PLKISYPAIESHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIE 480

Query: 481 LAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540
           L ESRLLKD WPEYYDGK GRY+GKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK
Sbjct: 481 LTESRLLKDGWPEYYDGKFGRYVGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDK 540

Query: 541 QMKPLIKRSSSWTC 555
           QMKPLIKRSSSWTC
Sbjct: 541 QMKPLIKRSSSWTC 554

BLAST of HG10022544 vs. TAIR 10
Match: AT4G09510.1 (cytosolic invertase 2 )

HSP 1 Score: 996.9 bits (2576), Expect = 6.5e-291
Identity = 475/552 (86.05%), Postives = 513/552 (92.93%), Query Frame = 0

Query: 4   LGLRNVSSHCSISEMDDYDLSRLLDKPK-LNIERQRSFDERSLSELSISLARGGLDNFES 63
           L LR   SHCS+SEMDD+DL+R L+KP+ L IER+RSFDERS+SELS    R      E 
Sbjct: 9   LVLRVEGSHCSLSEMDDFDLTRALEKPRQLKIERKRSFDERSMSELSTGYVRQD-SILEM 68

Query: 64  SYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVL 123
           ++SPG RS  DTP  S RNSFEPHPM+AEAWEALRRS+V FRGQPVGTIAAYDHASEEVL
Sbjct: 69  AHSPGSRSMVDTPL-SVRNSFEPHPMVAEAWEALRRSMVFFRGQPVGTIAAYDHASEEVL 128

Query: 124 NYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVL 183
           NYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTLQLQGWEKR+DRFKLGEG MPASFKVL
Sbjct: 129 NYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRVDRFKLGEGVMPASFKVL 188

Query: 184 HDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLI 243
           HDPVRKTDTI ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDL+L+ETPECQ+GMRLI
Sbjct: 189 HDPVRKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQRGMRLI 248

Query: 244 LTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKE 303
           L+LCLSEGFDTFPTLLCADGCSM+DRRMG+YGYPIEIQALFFMALRCAL+MLK D EG++
Sbjct: 249 LSLCLSEGFDTFPTLLCADGCSMVDRRMGVYGYPIEIQALFFMALRCALSMLKPDEEGRD 308

Query: 304 CIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDF 363
            IERIVKRLHALS+HMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNV+PDSIP+WVFDF
Sbjct: 309 FIERIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVMPDSIPDWVFDF 368

Query: 364 MPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPL 423
           MP RGGYFVGNVSPARMDFRWF+LGNCV+ILSSLATP+QS+AIMDL+E RWEELVGEMPL
Sbjct: 369 MPLRGGYFVGNVSPARMDFRWFSLGNCVSILSSLATPDQSMAIMDLLEHRWEELVGEMPL 428

Query: 424 KISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELA 483
           KI YP IESH+WRI+TGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRAI+L 
Sbjct: 429 KICYPCIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWTLTAACIKTGRPQIARRAIDLI 488

Query: 484 ESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQM 543
           ESRL +D WPEYYDGK GRY+GKQARKYQTWSIAGYLVAKMMLEDPSH+GMISLEEDKQM
Sbjct: 489 ESRLHRDCWPEYYDGKQGRYVGKQARKYQTWSIAGYLVAKMMLEDPSHIGMISLEEDKQM 548

Query: 544 KPLIKRSSSWTC 555
           KP+IKRS+SWTC
Sbjct: 549 KPVIKRSASWTC 558

BLAST of HG10022544 vs. TAIR 10
Match: AT1G35580.1 (cytosolic invertase 1 )

HSP 1 Score: 973.0 bits (2514), Expect = 1.0e-283
Identity = 458/553 (82.82%), Postives = 507/553 (91.68%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLAR-GGLDN 60
           M+G+GLR V SHCS+SEMDD DL+R LDKP+L IER+RSFDERS+SELS   +R  G+ +
Sbjct: 1   MEGVGLRAVGSHCSLSEMDDLDLTRALDKPRLKIERKRSFDERSMSELSTGYSRHDGIHD 60

Query: 61  FESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASE 120
                SP GRS  DTP SS RNSFEPHPM+AEAWEALRRS+V FRGQPVGT+AA D+ ++
Sbjct: 61  -----SPRGRSVLDTPLSSARNSFEPHPMMAEAWEALRRSMVFFRGQPVGTLAAVDNTTD 120

Query: 121 EVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASF 180
           EVLNYDQVFVRDFVPSALAFLMNGEP+IVK+FLLKTLQLQGWEKR+DRFKLGEG MPASF
Sbjct: 121 EVLNYDQVFVRDFVPSALAFLMNGEPDIVKHFLLKTLQLQGWEKRVDRFKLGEGVMPASF 180

Query: 181 KVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGM 240
           KVLHDP+R+TD I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDL+L+ETPECQKGM
Sbjct: 181 KVLHDPIRETDNIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQKGM 240

Query: 241 RLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAE 300
           +LIL+LCL+EGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALR AL+MLK D +
Sbjct: 241 KLILSLCLAEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSALSMLKPDGD 300

Query: 301 GKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWV 360
           G+E IERIVKRLHALS+HMR+YFWLD Q LNDIYR+KTEEYSHTAVNKFNV+PDSIPEWV
Sbjct: 301 GREVIERIVKRLHALSFHMRNYFWLDHQNLNDIYRFKTEEYSHTAVNKFNVMPDSIPEWV 360

Query: 361 FDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGE 420
           FDFMP RGGYFVGNV PA MDFRWFALGNCV+ILSSLATP+QS+AIMDL+E RW ELVGE
Sbjct: 361 FDFMPLRGGYFVGNVGPAHMDFRWFALGNCVSILSSLATPDQSMAIMDLLEHRWAELVGE 420

Query: 421 MPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAI 480
           MPLKI YP +E H+WRI+TGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA+
Sbjct: 421 MPLKICYPCLEGHEWRIVTGCDPKNTRWSYHNGGSWPVLLWQLTAACIKTGRPQIARRAV 480

Query: 481 ELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEED 540
           +L ESRL +D WPEYYDGKLGRY+GKQARKYQTWSIAGYLVAKM+LEDPSH+GMISLEED
Sbjct: 481 DLIESRLHRDCWPEYYDGKLGRYVGKQARKYQTWSIAGYLVAKMLLEDPSHIGMISLEED 540

Query: 541 KQMKPLIKRSSSW 553
           K MKP+IKRS+SW
Sbjct: 541 KLMKPVIKRSASW 548

BLAST of HG10022544 vs. TAIR 10
Match: AT1G35580.2 (cytosolic invertase 1 )

HSP 1 Score: 973.0 bits (2514), Expect = 1.0e-283
Identity = 458/553 (82.82%), Postives = 507/553 (91.68%), Query Frame = 0

Query: 1   MDGLGLRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLAR-GGLDN 60
           M+G+GLR V SHCS+SEMDD DL+R LDKP+L IER+RSFDERS+SELS   +R  G+ +
Sbjct: 1   MEGVGLRAVGSHCSLSEMDDLDLTRALDKPRLKIERKRSFDERSMSELSTGYSRHDGIHD 60

Query: 61  FESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASE 120
                SP GRS  DTP SS RNSFEPHPM+AEAWEALRRS+V FRGQPVGT+AA D+ ++
Sbjct: 61  -----SPRGRSVLDTPLSSARNSFEPHPMMAEAWEALRRSMVFFRGQPVGTLAAVDNTTD 120

Query: 121 EVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASF 180
           EVLNYDQVFVRDFVPSALAFLMNGEP+IVK+FLLKTLQLQGWEKR+DRFKLGEG MPASF
Sbjct: 121 EVLNYDQVFVRDFVPSALAFLMNGEPDIVKHFLLKTLQLQGWEKRVDRFKLGEGVMPASF 180

Query: 181 KVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGM 240
           KVLHDP+R+TD I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDL+L+ETPECQKGM
Sbjct: 181 KVLHDPIRETDNIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQKGM 240

Query: 241 RLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAE 300
           +LIL+LCL+EGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALR AL+MLK D +
Sbjct: 241 KLILSLCLAEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSALSMLKPDGD 300

Query: 301 GKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWV 360
           G+E IERIVKRLHALS+HMR+YFWLD Q LNDIYR+KTEEYSHTAVNKFNV+PDSIPEWV
Sbjct: 301 GREVIERIVKRLHALSFHMRNYFWLDHQNLNDIYRFKTEEYSHTAVNKFNVMPDSIPEWV 360

Query: 361 FDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGE 420
           FDFMP RGGYFVGNV PA MDFRWFALGNCV+ILSSLATP+QS+AIMDL+E RW ELVGE
Sbjct: 361 FDFMPLRGGYFVGNVGPAHMDFRWFALGNCVSILSSLATPDQSMAIMDLLEHRWAELVGE 420

Query: 421 MPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAI 480
           MPLKI YP +E H+WRI+TGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA+
Sbjct: 421 MPLKICYPCLEGHEWRIVTGCDPKNTRWSYHNGGSWPVLLWQLTAACIKTGRPQIARRAV 480

Query: 481 ELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEED 540
           +L ESRL +D WPEYYDGKLGRY+GKQARKYQTWSIAGYLVAKM+LEDPSH+GMISLEED
Sbjct: 481 DLIESRLHRDCWPEYYDGKLGRYVGKQARKYQTWSIAGYLVAKMLLEDPSHIGMISLEED 540

Query: 541 KQMKPLIKRSSSW 553
           K MKP+IKRS+SW
Sbjct: 541 KLMKPVIKRSASW 548

BLAST of HG10022544 vs. TAIR 10
Match: AT1G22650.1 (Plant neutral invertase family protein )

HSP 1 Score: 972.6 bits (2513), Expect = 1.3e-283
Identity = 464/548 (84.67%), Postives = 510/548 (93.07%), Query Frame = 0

Query: 6   LRNVSSHCSISEMDDYDLSRLLDKPKLNIERQRSFDERSLSELSISLARGGLDNFESSYS 65
           +  V+S  SIS++D  +L+RLLD+P++NIER+RSFDERS SE+ I         F++  S
Sbjct: 1   MEGVNSSSSISDLD--ELARLLDRPRVNIERKRSFDERSFSEMGI---------FDNVNS 60

Query: 66  PGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYDHASEEVLNYD 125
           PG   G++TP SS RNSFEPHPM+AEAW+ALRRSLV+FRGQPVGTIAAYDHA+EEVLNYD
Sbjct: 61  PG---GWETPVSSARNSFEPHPMVAEAWDALRRSLVYFRGQPVGTIAAYDHATEEVLNYD 120

Query: 126 QVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVLHDP 185
           QVFVRDFVPSALAFLMNGEP+IVKNFLLKT+Q+QG EKRIDRFKLGEGAMPASFKV+HDP
Sbjct: 121 QVFVRDFVPSALAFLMNGEPDIVKNFLLKTIQIQGREKRIDRFKLGEGAMPASFKVIHDP 180

Query: 186 VRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLILTL 245
           +++TD+I ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD SLAET ECQKGMRLIL+L
Sbjct: 181 IKETDSINADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDTSLAETSECQKGMRLILSL 240

Query: 246 CLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLKHDAEGKECIE 305
           CLSEGFDTFPTLLCADGCSMIDRRMG+YGYPIEIQALFFMALR A++MLKHDAEGKE +E
Sbjct: 241 CLSEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSAMSMLKHDAEGKEFME 300

Query: 306 RIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPT 365
           RIVKRLHALS+HMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMP 
Sbjct: 301 RIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPL 360

Query: 366 RGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEELVGEMPLKIS 425
           RGGYF+GNVSPARMDFRWFALGNCVAIL+SLATPEQS +IMDLIE RWEELVGEMP+KI 
Sbjct: 361 RGGYFIGNVSPARMDFRWFALGNCVAILASLATPEQSASIMDLIEERWEELVGEMPVKIC 420

Query: 426 YPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIELAESR 485
           +PAIESH+WRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAI+LAE+R
Sbjct: 421 HPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIDLAEAR 480

Query: 486 LLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQMKPL 545
           LLKD WPEYYDGK GR+IGKQARK+QTWSIAGYLVAKM+LEDPSHLGMISLEEDKQ KP+
Sbjct: 481 LLKDGWPEYYDGKSGRFIGKQARKFQTWSIAGYLVAKMLLEDPSHLGMISLEEDKQTKPV 534

Query: 546 IKRSSSWT 554
           IKRS SWT
Sbjct: 541 IKRSYSWT 534

BLAST of HG10022544 vs. TAIR 10
Match: AT4G34860.1 (Plant neutral invertase family protein )

HSP 1 Score: 931.0 bits (2405), Expect = 4.4e-271
Identity = 444/559 (79.43%), Postives = 498/559 (89.09%), Query Frame = 0

Query: 6   LRNVSSHCSISEMDDYDLSRLLDKPK-LNIERQRSFDERSLSELSISLARGGLDN----- 65
           ++NV S  ++ ++DD D ++LL+KP+ LNI+R RS DERSL+EL+ S      DN     
Sbjct: 16  IKNVDSLSTLDDIDDIDFAKLLEKPRPLNIDRLRSLDERSLTELTGSPQLRNADNASRAP 75

Query: 66  ----FESSYSPGGRSGFDTPASSTRNSFEPHPMIAEAWEALRRSLVHFRGQPVGTIAAYD 125
               +  S S G RSGF+TP S  +  FE HPM+ EAW+ALRRS+V+FRGQPVGTIAA D
Sbjct: 76  DHADYVISPSFGRRSGFNTPRS--QPGFESHPMVGEAWDALRRSMVYFRGQPVGTIAAVD 135

Query: 126 HASEEVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAM 185
           + SEE LNYDQVFVRDFVPSALAFLMNGEP+IVKNFLLKTL+LQ WEK+IDRF+LGEG M
Sbjct: 136 N-SEEKLNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLRLQSWEKKIDRFQLGEGVM 195

Query: 186 PASFKVLHDPVRKTDTIAADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPEC 245
           PASFKV HDPVR  +T+ ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD SLA+ PEC
Sbjct: 196 PASFKVFHDPVRNHETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLADMPEC 255

Query: 246 QKGMRLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALAMLK 305
           QKG+RLIL+LCLSEGFDTFPTLLCADGC MIDRRMG+YGYPIEIQALFFMALRCAL +LK
Sbjct: 256 QKGIRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALLLLK 315

Query: 306 HDAEGKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSI 365
           HD EGKE +E+IVKRLHALSYHMRSYFWLD +QLNDIYRYKTEEYSHTAVNKFNVIPDS+
Sbjct: 316 HDGEGKEMVEQIVKRLHALSYHMRSYFWLDLKQLNDIYRYKTEEYSHTAVNKFNVIPDSL 375

Query: 366 PEWVFDFMPTRGGYFVGNVSPARMDFRWFALGNCVAILSSLATPEQSIAIMDLIESRWEE 425
           PEWVFDFMP  GG+F+GNVSPARMDFRWFALGNC+AILSSLATPEQS AIMDLIESRWEE
Sbjct: 376 PEWVFDFMPPHGGFFIGNVSPARMDFRWFALGNCIAILSSLATPEQSTAIMDLIESRWEE 435

Query: 426 LVGEMPLKISYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIA 485
           LVGEMPLK+ YPAIESH+WRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIA
Sbjct: 436 LVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIA 495

Query: 486 RRAIELAESRLLKDSWPEYYDGKLGRYIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMIS 545
           RRAIE+AE+RL KD WPEYYDGK+GRY+GKQ+RK QTWS+AGYLVAKMMLEDPSH+GM+ 
Sbjct: 496 RRAIEVAEARLHKDHWPEYYDGKVGRYVGKQSRKNQTWSVAGYLVAKMMLEDPSHVGMVC 555

Query: 546 LEEDKQMKPLIKRSSSWTC 555
           LEEDKQMKP+++RS+SWTC
Sbjct: 556 LEEDKQMKPVMRRSNSWTC 571

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038897844.10.0e+0099.46probable alkaline/neutral invertase D [Benincasa hispida][more]
XP_022143693.10.0e+0098.56probable alkaline/neutral invertase D [Momordica charantia][more]
XP_011659122.10.0e+0097.83probable alkaline/neutral invertase D [Cucumis sativus] >KGN44485.1 hypothetical... [more]
XP_008461922.10.0e+0098.19PREDICTED: probable alkaline/neutral invertase D [Cucumis melo] >KAA0046174.1 pu... [more]
XP_022926252.10.0e+0097.29probable alkaline/neutral invertase D [Cucurbita moschata] >XP_022926253.1 proba... [more]
Match NameE-valueIdentityDescription
Q67XD99.1e-29086.05Alkaline/neutral invertase CINV2 OS=Arabidopsis thaliana OX=3702 GN=CINV2 PE=1 S... [more]
Q69T319.8e-28483.45Cytosolic invertase 1 OS=Oryza sativa subsp. japonica OX=39947 GN=CINV1 PE=1 SV=... [more]
Q9LQF21.4e-28282.82Alkaline/neutral invertase CINV1 OS=Arabidopsis thaliana OX=3702 GN=CINV1 PE=1 S... [more]
F4I2X91.8e-28284.67Probable alkaline/neutral invertase D OS=Arabidopsis thaliana OX=3702 GN=INVD PE... [more]
Q9SW486.2e-27079.43Probable alkaline/neutral invertase B OS=Arabidopsis thaliana OX=3702 GN=INVB PE... [more]
Match NameE-valueIdentityDescription
A0A6J1CRC00.0e+0098.56Alkaline/neutral invertase OS=Momordica charantia OX=3673 GN=LOC111013535 PE=3 S... [more]
A0A0A0K4760.0e+0097.83Alkaline/neutral invertase OS=Cucumis sativus OX=3659 GN=Csa_7G308910 PE=3 SV=1[more]
A0A5A7TSZ50.0e+0098.19Alkaline/neutral invertase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... [more]
A0A1S3CH540.0e+0098.19Alkaline/neutral invertase OS=Cucumis melo OX=3656 GN=LOC103500409 PE=3 SV=1[more]
A0A6J1J2390.0e+0097.29Alkaline/neutral invertase OS=Cucurbita maxima OX=3661 GN=LOC111480561 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G09510.16.5e-29186.05cytosolic invertase 2 [more]
AT1G35580.11.0e-28382.82cytosolic invertase 1 [more]
AT1G35580.21.0e-28382.82cytosolic invertase 1 [more]
AT1G22650.11.3e-28384.67Plant neutral invertase family protein [more]
AT4G34860.14.4e-27179.43Plant neutral invertase family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024746Glycosyl hydrolase family 100PFAMPF12899Glyco_hydro_100coord: 92..528
e-value: 3.3E-213
score: 708.1
IPR024746Glycosyl hydrolase family 100PANTHERPTHR31916FAMILY NOT NAMEDcoord: 1..553
IPR012341Six-hairpin glycosidase-like superfamilyGENE3D1.50.10.10coord: 113..523
e-value: 9.7E-17
score: 62.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 64..83
NoneNo IPR availablePANTHERPTHR31916:SF45ALKALINE/NEUTRAL INVERTASE CINV2coord: 1..553
IPR008928Six-hairpin glycosidase superfamilySUPERFAMILY48208Six-hairpin glycosidasescoord: 98..521

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022544.1HG10022544.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005987 sucrose catabolic process
biological_process GO:0005975 carbohydrate metabolic process
molecular_function GO:0033926 glycopeptide alpha-N-acetylgalactosaminidase activity
molecular_function GO:0004575 sucrose alpha-glucosidase activity