HG10017889 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10017889
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Uncharacterised conserved protein UCP015417, vWA
Location: Chr03: 25315949 .. 25317931 (-)
RNA-Seq Expression: HG10017889
Synteny: HG10017889
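
The location above can be sanity-checked against the protein length. The following is a minimal sketch, assuming the coordinates are inclusive and 1-based (GFF-style) and that the protein has 660 residues, as the query coordinates in the alignments further down suggest. The helper name `span_length` is illustrative and not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end - start + 1


# Coordinates from the Location field (Chr03, minus strand).
gene_len = span_length(25315949, 25317931)

# Assumption: protein length read off the last query coordinate in the alignments.
protein_len = 660

# A coding sequence with no introns spans three bases per residue plus one stop codon.
expected_cds_len = 3 * (protein_len + 1)

print(gene_len, expected_cds_len, gene_len == expected_cds_len)
```

Under these assumptions both lengths come out to 1983 bp, which is consistent with the gene, mRNA and CDS sequences listed for this feature being identical (a single exon with no intron).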
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCCTCCAAGCCTTCTCGGTCCTCCGGAGCTCTACGCCGCCGCCGCCCCTGTTTCCGTTCCACTCCAACCATCTCAAACAGCCTCCGGAGACCCCTTCGTCGATGCACTGGTCGCGAACTTCAACAATATCGATAACCCCGATGACAACCTGCTGCCCATGGGCTTCACGGAGAATATGTCGGTGACGTTCCTCTCCACCGGCAATCCTTGCCTTGATTTCTTCTTCCATGTGGTTCCTGATACGCCCGCCGATTCTTTGATCGAGAGATTGAGTTTGGCTTGGAATTACAATCCGTTGATGACGCTGAAGCTTATCTGTAATTTGCGAGGAGTTCGTGGTACCGGAAAGTCCGATAAAAAGGGATACTACACGGCTGCGCTCTGGCTCCACAAATTTCATCACAAAACCCTAGCAGGTAACATTCCTTCTATTGCTGATTTCGGTTATTTCAAGGATCTGCCGGAGATACTCTACCGGCTTCTTGAGGGTTCCGATGTGAGGCAGAATCAGAAGAAAGAGTGGTTACAGAGGAAACGTGGTAATTCGAGTGGAAAGAGATCGTCGTCGACTCGGAGAGGGAGAGGGGGGCTATCTATCAGGCATGAAAGCTTCAAGCAAGAGAAGCCGAAGACGAGGAAGAAAGAAATTCAATCTTCAACCGGCGGAGAGGCCAATATTTCGAAGGCCATGGAGACATCAAGGATAGAGAAAGAGAAGGCGAGCGCAGAAAGGAAGATAAAGAAGGTTTCGATGGCGAAGAAGGTTATGGAACGTTTTCAATCCGATCCAAATTTCCAACTCTTGTACGAACGAATCTCTGACTTCTTTGCCGATTGCTTGAAATCTGATCTTCAATTCCTGAATTCTGGAGAATTGAGGAAAATCAGTCTCGCTGCGAAATGGTGCCCTTCCGTCGATTCGTCCTTCGATCGATCGACATTACTCTGCGAGAGCATAGCGAGGAAGGTTTTCCCTCGCGAATCGGACCCAGAATACGTAGGGATCGAAGAGGCGCATTATGCGTACAGAGTTCGCGACAGATTGAGGAAGCAAGTTCTGGTGCCGCTCCGGAAGGTGTTGGAGCTGCCGGAGGTTTACATGGGAGCCAATCGGTGGGATTCGATCCCTTACAACAGAGTTGCTTCTGTTGCAATGAAAAATTACAAGGAAAAGTTCATGCAACACGACGGGGAGCGGTTTGGCCAATACTTGCAAGACGTGAAGGATGGTAAGACCAAGATCGCCGCCGGTGCACTGCTTCCTCACGAGATCATAAAGTCATTGGACGACGGTGAGGAAGACTGTGGAGAAGTCGCAGAGCTTCAATGGAAGAGAATGGTGGATGACTTGTTGAAGAAAGGGAAGTTGAGAAACTGCATTTCTGTTTGTGATGTGTCTGGAAGTATGAGCGGAATTCCCATGGATGTTTGTGTTGCTTTAGGTCTTTTGGTTTCTGAATTGAGCGAAGATCCATGGAAGGGGAAAGTGATCACATTCAGTGCGGACCCTCAACTTCATTTGATTCAAGGGGACAGTCTGAAATCAAAGACGGATTTCATTAAGAGGATGGATTGGGGGTATAATACTGATTTTCAGAAGGTTTTTGATCAAATTCTGAAAGTGGCTGTGGATGCAAAGTTGAATGAAGAACAGATGGTAAAGAGATTGTTCGTGTTCAGTGACATGGAGTTCGATCAAGCATCAGCCAACTCGTGGGAAACAGATTACCAAGTTATAGTTAGGAAGTTTACAGAAAAAGGGTATGGATCAGCTGTTCCACAGATTGTGTTTTGGAACTTGAGAAATTCGAGGGCGACGCCAGTGCCGGCCAAGGAGAAGGGGGTGGCCTTGGTCAGTGGATACTCGAAGAACTTGATGAACTTGTTTTTGAATGACAACGGTGACATTCAACCGGAAGCCGTCATGGAGCAGGCTATCTCCGGCAGCGAGTACCAGAAGCTTGTTGTTCTTGATTGA

mRNA sequence

ATGGCTCCTCCAAGCCTTCTCGGTCCTCCGGAGCTCTACGCCGCCGCCGCCCCTGTTTCCGTTCCACTCCAACCATCTCAAACAGCCTCCGGAGACCCCTTCGTCGATGCACTGGTCGCGAACTTCAACAATATCGATAACCCCGATGACAACCTGCTGCCCATGGGCTTCACGGAGAATATGTCGGTGACGTTCCTCTCCACCGGCAATCCTTGCCTTGATTTCTTCTTCCATGTGGTTCCTGATACGCCCGCCGATTCTTTGATCGAGAGATTGAGTTTGGCTTGGAATTACAATCCGTTGATGACGCTGAAGCTTATCTGTAATTTGCGAGGAGTTCGTGGTACCGGAAAGTCCGATAAAAAGGGATACTACACGGCTGCGCTCTGGCTCCACAAATTTCATCACAAAACCCTAGCAGGTAACATTCCTTCTATTGCTGATTTCGGTTATTTCAAGGATCTGCCGGAGATACTCTACCGGCTTCTTGAGGGTTCCGATGTGAGGCAGAATCAGAAGAAAGAGTGGTTACAGAGGAAACGTGGTAATTCGAGTGGAAAGAGATCGTCGTCGACTCGGAGAGGGAGAGGGGGGCTATCTATCAGGCATGAAAGCTTCAAGCAAGAGAAGCCGAAGACGAGGAAGAAAGAAATTCAATCTTCAACCGGCGGAGAGGCCAATATTTCGAAGGCCATGGAGACATCAAGGATAGAGAAAGAGAAGGCGAGCGCAGAAAGGAAGATAAAGAAGGTTTCGATGGCGAAGAAGGTTATGGAACGTTTTCAATCCGATCCAAATTTCCAACTCTTGTACGAACGAATCTCTGACTTCTTTGCCGATTGCTTGAAATCTGATCTTCAATTCCTGAATTCTGGAGAATTGAGGAAAATCAGTCTCGCTGCGAAATGGTGCCCTTCCGTCGATTCGTCCTTCGATCGATCGACATTACTCTGCGAGAGCATAGCGAGGAAGGTTTTCCCTCGCGAATCGGACCCAGAATACGTAGGGATCGAAGAGGCGCATTATGCGTACAGAGTTCGCGACAGATTGAGGAAGCAAGTTCTGGTGCCGCTCCGGAAGGTGTTGGAGCTGCCGGAGGTTTACATGGGAGCCAATCGGTGGGATTCGATCCCTTACAACAGAGTTGCTTCTGTTGCAATGAAAAATTACAAGGAAAAGTTCATGCAACACGACGGGGAGCGGTTTGGCCAATACTTGCAAGACGTGAAGGATGGTAAGACCAAGATCGCCGCCGGTGCACTGCTTCCTCACGAGATCATAAAGTCATTGGACGACGGTGAGGAAGACTGTGGAGAAGTCGCAGAGCTTCAATGGAAGAGAATGGTGGATGACTTGTTGAAGAAAGGGAAGTTGAGAAACTGCATTTCTGTTTGTGATGTGTCTGGAAGTATGAGCGGAATTCCCATGGATGTTTGTGTTGCTTTAGGTCTTTTGGTTTCTGAATTGAGCGAAGATCCATGGAAGGGGAAAGTGATCACATTCAGTGCGGACCCTCAACTTCATTTGATTCAAGGGGACAGTCTGAAATCAAAGACGGATTTCATTAAGAGGATGGATTGGGGGTATAATACTGATTTTCAGAAGGTTTTTGATCAAATTCTGAAAGTGGCTGTGGATGCAAAGTTGAATGAAGAACAGATGGTAAAGAGATTGTTCGTGTTCAGTGACATGGAGTTCGATCAAGCATCAGCCAACTCGTGGGAAACAGATTACCAAGTTATAGTTAGGAAGTTTACAGAAAAAGGGTATGGATCAGCTGTTCCACAGATTGTGTTTTGGAACTTGAGAAATTCGAGGGCGACGCCAGTGCCGGCCAAGGAGAAGGGGGTGGCCTTGGTCAGTGGATACTCGAAGAACTTGATGAACTTGTTTTTGAATGACAACGGTGACATTCAACCGGAAGCCGTCATGGAGCAGGCTATCTCCGGCAGCGAGTACCAGAAGCTTGTTGTTCTTGATTGA

Coding sequence (CDS)

ATGGCTCCTCCAAGCCTTCTCGGTCCTCCGGAGCTCTACGCCGCCGCCGCCCCTGTTTCCGTTCCACTCCAACCATCTCAAACAGCCTCCGGAGACCCCTTCGTCGATGCACTGGTCGCGAACTTCAACAATATCGATAACCCCGATGACAACCTGCTGCCCATGGGCTTCACGGAGAATATGTCGGTGACGTTCCTCTCCACCGGCAATCCTTGCCTTGATTTCTTCTTCCATGTGGTTCCTGATACGCCCGCCGATTCTTTGATCGAGAGATTGAGTTTGGCTTGGAATTACAATCCGTTGATGACGCTGAAGCTTATCTGTAATTTGCGAGGAGTTCGTGGTACCGGAAAGTCCGATAAAAAGGGATACTACACGGCTGCGCTCTGGCTCCACAAATTTCATCACAAAACCCTAGCAGGTAACATTCCTTCTATTGCTGATTTCGGTTATTTCAAGGATCTGCCGGAGATACTCTACCGGCTTCTTGAGGGTTCCGATGTGAGGCAGAATCAGAAGAAAGAGTGGTTACAGAGGAAACGTGGTAATTCGAGTGGAAAGAGATCGTCGTCGACTCGGAGAGGGAGAGGGGGGCTATCTATCAGGCATGAAAGCTTCAAGCAAGAGAAGCCGAAGACGAGGAAGAAAGAAATTCAATCTTCAACCGGCGGAGAGGCCAATATTTCGAAGGCCATGGAGACATCAAGGATAGAGAAAGAGAAGGCGAGCGCAGAAAGGAAGATAAAGAAGGTTTCGATGGCGAAGAAGGTTATGGAACGTTTTCAATCCGATCCAAATTTCCAACTCTTGTACGAACGAATCTCTGACTTCTTTGCCGATTGCTTGAAATCTGATCTTCAATTCCTGAATTCTGGAGAATTGAGGAAAATCAGTCTCGCTGCGAAATGGTGCCCTTCCGTCGATTCGTCCTTCGATCGATCGACATTACTCTGCGAGAGCATAGCGAGGAAGGTTTTCCCTCGCGAATCGGACCCAGAATACGTAGGGATCGAAGAGGCGCATTATGCGTACAGAGTTCGCGACAGATTGAGGAAGCAAGTTCTGGTGCCGCTCCGGAAGGTGTTGGAGCTGCCGGAGGTTTACATGGGAGCCAATCGGTGGGATTCGATCCCTTACAACAGAGTTGCTTCTGTTGCAATGAAAAATTACAAGGAAAAGTTCATGCAACACGACGGGGAGCGGTTTGGCCAATACTTGCAAGACGTGAAGGATGGTAAGACCAAGATCGCCGCCGGTGCACTGCTTCCTCACGAGATCATAAAGTCATTGGACGACGGTGAGGAAGACTGTGGAGAAGTCGCAGAGCTTCAATGGAAGAGAATGGTGGATGACTTGTTGAAGAAAGGGAAGTTGAGAAACTGCATTTCTGTTTGTGATGTGTCTGGAAGTATGAGCGGAATTCCCATGGATGTTTGTGTTGCTTTAGGTCTTTTGGTTTCTGAATTGAGCGAAGATCCATGGAAGGGGAAAGTGATCACATTCAGTGCGGACCCTCAACTTCATTTGATTCAAGGGGACAGTCTGAAATCAAAGACGGATTTCATTAAGAGGATGGATTGGGGGTATAATACTGATTTTCAGAAGGTTTTTGATCAAATTCTGAAAGTGGCTGTGGATGCAAAGTTGAATGAAGAACAGATGGTAAAGAGATTGTTCGTGTTCAGTGACATGGAGTTCGATCAAGCATCAGCCAACTCGTGGGAAACAGATTACCAAGTTATAGTTAGGAAGTTTACAGAAAAAGGGTATGGATCAGCTGTTCCACAGATTGTGTTTTGGAACTTGAGAAATTCGAGGGCGACGCCAGTGCCGGCCAAGGAGAAGGGGGTGGCCTTGGTCAGTGGATACTCGAAGAACTTGATGAACTTGTTTTTGAATGACAACGGTGACATTCAACCGGAAGCCGTCATGGAGCAGGCTATCTCCGGCAGCGAGTACCAGAAGCTTGTTGTTCTTGATTGA

Protein sequence

MAPPSLLGPPELYAAAAPVSVPLQPSQTASGDPFVDALVANFNNIDNPDDNLLPMGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQPEAVMEQAISGSEYQKLVVLD
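
The protein sequence above can be reproduced from the coding sequence by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the CDS listed above.
print(translate("ATGGCTCCTCCAAGCCTTCTCGGT"))  # MAPPSLLG
print(translate("CTTGTTGTTCTTGATTGA"))        # LVVLD
```

The two fragments translate to the first eight and the last five residues of the protein sequence listed above; passing the full CDS to `translate` should yield the complete polypeptide.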
Homology
BLAST of HG10017889 vs. NCBI nr
Match: XP_038881761.1 (uncharacterized protein LOC120073170 [Benincasa hispida])

HSP 1 Score: 1197.2 bits (3096), Expect = 0.0e+00
Identity = 610/666 (91.59%), Positives = 631/666 (94.74%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVSVPLQPSQ------TASGDPFVDALVANFNNIDNPDDNLLP 60
           MAPPSLLGPPELY AAAP  V LQ SQ      TASGDPFVD+LVA FN IDNP DNL P
Sbjct: 1   MAPPSLLGPPELY-AAAPAPVELQSSQPQPAESTASGDPFVDSLVAKFNKIDNPHDNLPP 60

Query: 61  MGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVR 120
           MGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWN++PLMTLKLICNLRGVR
Sbjct: 61  MGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNHDPLMTLKLICNLRGVR 120

Query: 121 GTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKK 180
           GTGKSDK+GYYTAALWLHKFH KTLAGNIPSIADFGYFKDLPEILYRLLEGSDVR+NQK 
Sbjct: 121 GTGKSDKEGYYTAALWLHKFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRENQKN 180

Query: 181 EWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMET 240
           EWL+RKR     KRSS+TRRGR GLSIRH SFKQ KPKTRKKEIQSST  EANISKA+ET
Sbjct: 181 EWLERKRSRKP-KRSSTTRRGRFGLSIRHGSFKQVKPKTRKKEIQSSTDREANISKAIET 240

Query: 241 SRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGEL 300
           SRIEKEKASA+RKIKKVSMAKKV+ERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGEL
Sbjct: 241 SRIEKEKASADRKIKKVSMAKKVVERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGEL 300

Query: 301 RKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQV 360
           RKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEY GIEEAHYAYRVRDRLRKQV
Sbjct: 301 RKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEAHYAYRVRDRLRKQV 360

Query: 361 LVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKT 420
           LVPLRKVLELPEVYMGANRWDSIPYNRVASVAMK YKEKFMQHDGERFGQYL+DVKDGKT
Sbjct: 361 LVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKIYKEKFMQHDGERFGQYLKDVKDGKT 420

Query: 421 KIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGI 480
           KIAAGALLPHEII SL DGEED GEVAELQWKRMVDDLLKKGKLRNCI+VCDVSGSM+GI
Sbjct: 421 KIAAGALLPHEIINSLYDGEEDGGEVAELQWKRMVDDLLKKGKLRNCIAVCDVSGSMAGI 480

Query: 481 PMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQ 540
           PMDVCVALGLLVSELSEDPWKGKVITFSADP+LHLIQGDSLKSKTDFIK M+WGYNTDFQ
Sbjct: 481 PMDVCVALGLLVSELSEDPWKGKVITFSADPKLHLIQGDSLKSKTDFIKEMEWGYNTDFQ 540

Query: 541 KVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQVIVRKFTEKGYGSAV 600
           KVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQAS+NSWETDYQVIVRKFTEKGYGSAV
Sbjct: 541 KVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASSNSWETDYQVIVRKFTEKGYGSAV 600

Query: 601 PQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQPEAVMEQAISGSEYQ 660
            QIVFWNLRNSRATPVPA+EKGVALVSGYSKNLMNLFLN++G IQPEA+MEQA+SGSEYQ
Sbjct: 601 SQIVFWNLRNSRATPVPAREKGVALVSGYSKNLMNLFLNNDGVIQPEAIMEQAVSGSEYQ 660
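
The percentages in an HSP summary line are the matched positions divided by the alignment length, gaps included. As a minimal sketch, the snippet below re-derives them from the counts of the hit above. The regular expression and the helper `recompute` are illustrative assumptions and not part of BLAST itself.

```python
import re

# Matches fields of the form "Label = matches/length (percent%)".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def recompute(line: str):
    """Return (label, recomputed percent, reported percent) for each field found."""
    return [
        (label, round(100 * int(matches) / int(length), 2), float(reported))
        for label, matches, length, reported in HSP_FIELD.findall(line)
    ]


summary = "Identity = 610/666 (91.59%), Positives = 631/666 (94.74%), Query Frame = 0"
for label, computed, reported in recompute(summary):
    print(label, computed, reported)
```

The alignment length (666 here) is longer than the 660-residue query because it counts the gap columns introduced into the query row.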

BLAST of HG10017889 vs. NCBI nr
Match: XP_008442184.1 (PREDICTED: uncharacterized protein LOC103486117 [Cucumis melo] >KAA0041221.1 GPI inositol-deacylase PGAP1-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 1125.9 bits (2911), Expect = 0.0e+00
Identity = 575/680 (84.56%), Positives = 617/680 (90.74%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVS--------VPLQPSQTA-----------SGDPFVDALVAN 60
           MAPPSLLGPPELY AA+PVS        V LQP+++A           SG PFVDA++AN
Sbjct: 1   MAPPSLLGPPELYHAASPVSLQPTESAPVSLQPTESAPVSLQPTESTPSGVPFVDAMLAN 60

Query: 61  FNNIDN-PDDNLLPMGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNP 120
           FNNI+N  DDNL PMGFTENMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NP
Sbjct: 61  FNNINNHSDDNLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120

Query: 121 LMTLKLICNLRGVRGTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILY 180
           LMTLKLICNLRGVRGTGKSDK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILY
Sbjct: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180

Query: 181 RLLEGSDVRQNQKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQS 240
           RLLEGSDVR+NQKKEW +RK    S KR SS R  RGGLS+R+ SFKQEKPKTRKKEIQS
Sbjct: 181 RLLEGSDVRKNQKKEWGERK--GKSRKRLSSPR--RGGLSVRYGSFKQEKPKTRKKEIQS 240

Query: 241 STGGEANISKAMETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFAD 300
           S   EANISKAME SRIEKEKASAERK++KVSMA+KVMERFQSDPNFQLL++RISDFF D
Sbjct: 241 SIDREANISKAMEKSRIEKEKASAERKLRKVSMARKVMERFQSDPNFQLLHDRISDFFTD 300

Query: 301 CLKSDLQFLNSGELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEA 360
           CLKSDLQF+NSG+  +ISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEY GIEEA
Sbjct: 301 CLKSDLQFMNSGDFTRISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEA 360

Query: 361 HYAYRVRDRLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGE 420
           HYAYRVRDRLRK VLVPLRKVLELPEVY+GANRWDSIPYNRVASVAMKNYKEKFM+HDGE
Sbjct: 361 HYAYRVRDRLRKDVLVPLRKVLELPEVYIGANRWDSIPYNRVASVAMKNYKEKFMKHDGE 420

Query: 421 RFGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRN 480
           RF QYL+DVKDGKTKIAAGALLPHEII SL DG+ED GEVAELQWKRMVDDLLKKGKLR+
Sbjct: 421 RFAQYLKDVKDGKTKIAAGALLPHEIIMSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRD 480

Query: 481 CISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTD 540
           CI+VCDVSGSM GIPMDVC+ALGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +
Sbjct: 481 CIAVCDVSGSMEGIPMDVCIALGLLVSELSEDPWKGKVITFSANPELHVIQGDSLKSKAE 540

Query: 541 FIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQV 600
           F+K M WG NTDFQKVFDQILKVAVD KL EEQM+KR+FVFSDMEFDQASA SWETDYQV
Sbjct: 541 FVKTMHWGVNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASATSWETDYQV 600

Query: 601 IVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQP 660
           IVRKFTEKGYGSAVPQIVFWNLR+SRATPVP KEKGVALVSGYSKNLMNLFL+ +G IQP
Sbjct: 601 IVRKFTEKGYGSAVPQIVFWNLRDSRATPVPGKEKGVALVSGYSKNLMNLFLDGDGVIQP 660

BLAST of HG10017889 vs. NCBI nr
Match: TYK16024.1 (GPI inositol-deacylase PGAP1-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 1123.2 bits (2904), Expect = 0.0e+00
Identity = 574/680 (84.41%), Positives = 616/680 (90.59%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVS--------VPLQPSQTA-----------SGDPFVDALVAN 60
           MAPPSLLGPPELY AA+PVS        V LQP+++A           SG PFVDA++AN
Sbjct: 1   MAPPSLLGPPELYHAASPVSLQPTESAPVSLQPTESAPVSLQPTESTPSGVPFVDAMLAN 60

Query: 61  FNNIDN-PDDNLLPMGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNP 120
           FNNI+N  DDNL PMGFTENMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NP
Sbjct: 61  FNNINNHSDDNLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120

Query: 121 LMTLKLICNLRGVRGTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILY 180
           LMTLKLICNLRGVRGTGKSDK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILY
Sbjct: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180

Query: 181 RLLEGSDVRQNQKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQS 240
           RLLEGSDVR+NQKKEW +RK    S KR SS R  RGGLS+R+ SFKQEKPKTRKKEIQS
Sbjct: 181 RLLEGSDVRKNQKKEWGERK--GKSRKRLSSPR--RGGLSVRYGSFKQEKPKTRKKEIQS 240

Query: 241 STGGEANISKAMETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFAD 300
           S   EANISKAME SRIEKEKASAERK++KVSMA+KVMERFQSDPNFQLL++RISDFF D
Sbjct: 241 SIDREANISKAMEKSRIEKEKASAERKLRKVSMARKVMERFQSDPNFQLLHDRISDFFTD 300

Query: 301 CLKSDLQFLNSGELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEA 360
           CLKSDLQF+NSG+  +ISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEY GIEEA
Sbjct: 301 CLKSDLQFMNSGDFTRISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEA 360

Query: 361 HYAYRVRDRLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGE 420
           HYAYRVRDRLRK VLVPLRKVLELPEVY+GANRWDSIPYNRVASVAMKNYKEKFM+HDGE
Sbjct: 361 HYAYRVRDRLRKDVLVPLRKVLELPEVYIGANRWDSIPYNRVASVAMKNYKEKFMKHDGE 420

Query: 421 RFGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRN 480
           RF QYL+DVKDGKTKIAAGALLPHEII SL DG+ED GEVA LQWKRMVDDLLKKGKLR+
Sbjct: 421 RFAQYLKDVKDGKTKIAAGALLPHEIIMSLFDGQEDGGEVAVLQWKRMVDDLLKKGKLRD 480

Query: 481 CISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTD 540
           CI+VCDVSGSM GIPMDVC+ALGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +
Sbjct: 481 CIAVCDVSGSMEGIPMDVCIALGLLVSELSEDPWKGKVITFSANPELHVIQGDSLKSKAE 540

Query: 541 FIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQV 600
           F+K M WG NTDFQKVFDQILKVAVD KL EEQM+KR+FVFSDMEFDQASA SWETDYQV
Sbjct: 541 FVKTMHWGVNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASATSWETDYQV 600

Query: 601 IVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQP 660
           IVRKFTEKGYGSAVPQIVFWNLR+SRATPVP KEKGVALVSGYSKNLMNLFL+ +G IQP
Sbjct: 601 IVRKFTEKGYGSAVPQIVFWNLRDSRATPVPGKEKGVALVSGYSKNLMNLFLDGDGVIQP 660

BLAST of HG10017889 vs. NCBI nr
Match: XP_004144675.2 (uncharacterized protein LOC101205449 [Cucumis sativus] >KGN55197.2 hypothetical protein Csa_012063 [Cucumis sativus])

HSP 1 Score: 1095.1 bits (2831), Expect = 0.0e+00
Identity = 553/661 (83.66%), Positives = 595/661 (90.02%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVSVPLQPSQ-TASGDPFVDALVANFNNIDNPDDNLLPMGFTE 60
           MAPP+LLGPPELY AAAPVS  LQP++ T SGDPFVDA+VANFN     DD+L PMGFTE
Sbjct: 1   MAPPNLLGPPELYHAAAPVS--LQPTESTPSGDPFVDAMVANFN---KTDDSLPPMGFTE 60

Query: 61  NMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKS 120
           NMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NPLMTLKLICNLRGVRGTGKS
Sbjct: 61  NMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKS 120

Query: 121 DKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQR 180
           DK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILYRLLEGSDVR+NQK EW +R
Sbjct: 121 DKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRR 180

Query: 181 KRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEK 240
                             GLS+RH  FKQEKPKTRKKEIQSST  EANISKAME SRIEK
Sbjct: 181 ------------------GLSVRHGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEK 240

Query: 241 EKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISL 300
           EKAS ERK++KVSMA+KVMERFQSD NFQLL++RISDFF DCLKSDLQF+NSG+  KISL
Sbjct: 241 EKASGERKLRKVSMARKVMERFQSDSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISL 300

Query: 301 AAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLR 360
           AAKWCPS+DSSFDRSTLLCESIARK+FPRE +PEY  IEEAHYAYRVRDRLR  VLVPLR
Sbjct: 301 AAKWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLR 360

Query: 361 KVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAG 420
           KVLELPEV++GANRWDSIPYNRVASVAMKNYKEKFM+HDGERF QYL+DVKDGKTKIAAG
Sbjct: 361 KVLELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAG 420

Query: 421 ALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVC 480
           ALLPHEII SL DG+ED GEVAELQWKRMVDDLLKKGKLR CI+VCDVSGSM GIPMDVC
Sbjct: 421 ALLPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVC 480

Query: 481 VALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQ 540
           V LGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +F+K MDWG NTDFQKVFDQ
Sbjct: 481 VGLGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQ 540

Query: 541 ILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQVIVRKFTEKGYGSAVPQIVF 600
           ILKVAVD KL EEQM+KR+FVFSDMEFDQAS  SWETDYQVIVRKFTEKGYGSAVPQIVF
Sbjct: 541 ILKVAVDGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVF 600

Query: 601 WNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQPEAVMEQAISGSEYQKLVVL 660
           WNLR+SRATPVP+ EKGVALVSGYSKNLMNLFL+ +G IQPEAVME+AISG+EYQKLVVL
Sbjct: 601 WNLRDSRATPVPSNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVL 638

BLAST of HG10017889 vs. NCBI nr
Match: XP_022928704.1 (uncharacterized protein LOC111435535 [Cucurbita moschata])

HSP 1 Score: 1092.8 bits (2825), Expect = 0.0e+00
Identity = 551/660 (83.48%), Positives = 595/660 (90.15%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVSVPLQPSQTASGDPFVDALVANFNNIDNPDDNLLPMGFTEN 60
           MAPPSLLGPPELY    P S P QP  T +GDPFVDALVANFN +D  DD L PMGFTEN
Sbjct: 1   MAPPSLLGPPELYTPFQP-SQPTQP--TPTGDPFVDALVANFNKVDTNDDELPPMGFTEN 60

Query: 61  MSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKSD 120
           MSVTFLS+GNPCLDFFFHVVPDTP++SL ERLS+AWN+NPLMTLKLICNLRGVRGTGKSD
Sbjct: 61  MSVTFLSSGNPCLDFFFHVVPDTPSESLTERLSVAWNHNPLMTLKLICNLRGVRGTGKSD 120

Query: 121 KKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQRK 180
           K+GYYTAALWLHKFH KTLAGNIPS+ADFGYFKDLPE+LYRLLEGSDVR+NQK EW+ R+
Sbjct: 121 KEGYYTAALWLHKFHPKTLAGNIPSLADFGYFKDLPELLYRLLEGSDVRKNQKAEWIGRR 180

Query: 181 RGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEKE 240
           +G    KR  S     G  S     FK+EK KTRKKEIQSS   EA I+KAME S I KE
Sbjct: 181 KGRHM-KRRRSLSSESGSRSASDGEFKEEKLKTRKKEIQSSPDVEAKIAKAMERSMILKE 240

Query: 241 KASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISLA 300
           KAS ERKIKKVSMAKK +ER+QSDP+FQ LY+R+SDFFADCLKSDLQFLNSGEL KISLA
Sbjct: 241 KASTERKIKKVSMAKKALERYQSDPHFQRLYDRVSDFFADCLKSDLQFLNSGELNKISLA 300

Query: 301 AKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLRK 360
           AKWCPSVDSSFDRSTLLCESIARK+FPR+SDPEY GIEEAHYAYRVRDRLRKQVLVPLRK
Sbjct: 301 AKWCPSVDSSFDRSTLLCESIARKLFPRQSDPEYEGIEEAHYAYRVRDRLRKQVLVPLRK 360

Query: 361 VLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAGA 420
           VLELPE +MGAN+W++IPYNRVASVAMKNYK+KF++HDGERF QYL+DVK GKTKIAAGA
Sbjct: 361 VLELPESFMGANQWNAIPYNRVASVAMKNYKKKFVEHDGERFAQYLEDVKAGKTKIAAGA 420

Query: 421 LLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVCV 480
           LLPH+II SL+DGEED GEVAELQWKRMVDDLL+KGKLRNCISVCDVSGSM G PM+VCV
Sbjct: 421 LLPHQIIASLNDGEEDGGEVAELQWKRMVDDLLEKGKLRNCISVCDVSGSMGGTPMEVCV 480

Query: 481 ALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQI 540
           ALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKT FI  MDWGYNTDFQKVFDQI
Sbjct: 481 ALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTQFIMSMDWGYNTDFQKVFDQI 540

Query: 541 LKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQVIVRKFTEKGYGSAVPQIVFW 600
           LKVAVDAKL EEQMVKR+FVFSDMEFDQASANSWETDYQVIVRKF+EKGYGS+VPQIVFW
Sbjct: 541 LKVAVDAKLKEEQMVKRVFVFSDMEFDQASANSWETDYQVIVRKFSEKGYGSSVPQIVFW 600

Query: 601 NLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQPEAVMEQAISGSEYQKLVVLD 660
           NLR+SRATPVPA EKGVALVSG+SKNLMNLFLN +G IQP+A+ME A+SGSEYQKLVVLD
Sbjct: 601 NLRDSRATPVPANEKGVALVSGFSKNLMNLFLNGDGVIQPDAIMELAVSGSEYQKLVVLD 656

BLAST of HG10017889 vs. ExPASy Swiss-Prot
Match: Q5UNY4 (Uncharacterized protein L728 OS=Acanthamoeba polyphaga mimivirus OX=212035 GN=MIMI_L728 PE=4 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 3.5e-54
Identity = 159/600 (26.50%), Positives = 262/600 (43.67%), Query Frame = 0

Query: 55  MGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVR 114
           + FTEN    + ++G+ C+DFF  +   +     I     AWN +  + +K++ NLR +R
Sbjct: 4   LSFTENGDKAYNTSGSACIDFFVRITRSSQLTDYISTFGKAWNEDKNIAMKILYNLRDIR 63

Query: 115 GTGKSDKKGYYTAALWLHKFH-HKTLAGNIPS--IADFGYFKDLPEILYRLLEGSDVRQN 174
            TGK +K     A +   KFH +  +  +I +  +  +G +KDL +I+            
Sbjct: 64  -TGKGEKI-IPVAIMTYLKFHLNSDIYNSIVTDFVTMYGCWKDLLKIV------------ 123

Query: 175 QKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKA 234
                               TR            F+   P    K I             
Sbjct: 124 -----------------EIETR------------FRLSTPSVSNKNINPI---------- 183

Query: 235 METSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNS 294
                          +IK                            FAD L+ D   +N+
Sbjct: 184 ---------------EIK---------------------------LFADQLQKDFDTVNN 243

Query: 295 ---GELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRD 354
                   ISL AKW PS    ++++ LL     R           +G+    Y      
Sbjct: 244 NTGSSKVAISLCAKWAPSEKQHYNKAPLLIADSIR---------SQMGLTPRQY------ 303

Query: 355 RLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGER------- 414
              +++L  LR  L++ E+ M  +++D I ++++ SVA+   K  F +    +       
Sbjct: 304 ---RKMLTKLRSHLQVLEMLMSTHQYDKIDFSKLPSVALMKMKNAFNRDTNSQGIKSDFR 363

Query: 415 ------FGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKK 474
                 + +YLQD+  GKTK+    + PHE++        D  ++ E QW  +   +   
Sbjct: 364 VNLHTSYTKYLQDLSKGKTKVNTKGIQPHELVGQY-LSSSDFDQLVESQWDAIKKGVSDS 423

Query: 475 GKLRNCISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSL 534
           G   N  +V DVSGSM G PM V +ALG+LV+E +  P+ G+VITF   P  H + G +L
Sbjct: 424 GTFNNVTAVVDVSGSMHGQPMQVAIALGILVAECTSGPYHGRVITFHEKPSWHHLTGSNL 483

Query: 535 KSKTDFIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWE 594
             K   ++   WG +T+ + VFD +L+ A++AKL   +M+  LF+F+DM+F+Q   +  E
Sbjct: 484 MEKVKCMRDAPWGGSTNMKSVFDLVLQNAINAKLKPHEMIDTLFIFTDMQFNQCDCSGLE 487

Query: 595 TDYQVIVRKFTEKGYGSAVPQIVFWNLR--NSRATPVPAKEKGVALVSGYSKNLMNLFLN 634
           + ++   RKFTE GY    P++V WNLR  NS++ P+   ++G  ++SG+S  L+   +N
Sbjct: 544 STFEYGQRKFTEAGY--TFPKVVCWNLRTSNSKSLPLMKNDEGYVMLSGFSAELLKCIMN 487

BLAST of HG10017889 vs. ExPASy TrEMBL
Match: A0A5A7THS9 (GPI inositol-deacylase PGAP1-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold128G001810 PE=4 SV=1)

HSP 1 Score: 1125.9 bits (2911), Expect = 0.0e+00
Identity = 575/680 (84.56%), Positives = 617/680 (90.74%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVS--------VPLQPSQTA-----------SGDPFVDALVAN 60
           MAPPSLLGPPELY AA+PVS        V LQP+++A           SG PFVDA++AN
Sbjct: 1   MAPPSLLGPPELYHAASPVSLQPTESAPVSLQPTESAPVSLQPTESTPSGVPFVDAMLAN 60

Query: 61  FNNIDN-PDDNLLPMGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNP 120
           FNNI+N  DDNL PMGFTENMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NP
Sbjct: 61  FNNINNHSDDNLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120

Query: 121 LMTLKLICNLRGVRGTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILY 180
           LMTLKLICNLRGVRGTGKSDK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILY
Sbjct: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180

Query: 181 RLLEGSDVRQNQKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQS 240
           RLLEGSDVR+NQKKEW +RK    S KR SS R  RGGLS+R+ SFKQEKPKTRKKEIQS
Sbjct: 181 RLLEGSDVRKNQKKEWGERK--GKSRKRLSSPR--RGGLSVRYGSFKQEKPKTRKKEIQS 240

Query: 241 STGGEANISKAMETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFAD 300
           S   EANISKAME SRIEKEKASAERK++KVSMA+KVMERFQSDPNFQLL++RISDFF D
Sbjct: 241 SIDREANISKAMEKSRIEKEKASAERKLRKVSMARKVMERFQSDPNFQLLHDRISDFFTD 300

Query: 301 CLKSDLQFLNSGELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEA 360
           CLKSDLQF+NSG+  +ISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEY GIEEA
Sbjct: 301 CLKSDLQFMNSGDFTRISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEA 360

Query: 361 HYAYRVRDRLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGE 420
           HYAYRVRDRLRK VLVPLRKVLELPEVY+GANRWDSIPYNRVASVAMKNYKEKFM+HDGE
Sbjct: 361 HYAYRVRDRLRKDVLVPLRKVLELPEVYIGANRWDSIPYNRVASVAMKNYKEKFMKHDGE 420

Query: 421 RFGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRN 480
           RF QYL+DVKDGKTKIAAGALLPHEII SL DG+ED GEVAELQWKRMVDDLLKKGKLR+
Sbjct: 421 RFAQYLKDVKDGKTKIAAGALLPHEIIMSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRD 480

Query: 481 CISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTD 540
           CI+VCDVSGSM GIPMDVC+ALGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +
Sbjct: 481 CIAVCDVSGSMEGIPMDVCIALGLLVSELSEDPWKGKVITFSANPELHVIQGDSLKSKAE 540

Query: 541 FIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQV 600
           F+K M WG NTDFQKVFDQILKVAVD KL EEQM+KR+FVFSDMEFDQASA SWETDYQV
Sbjct: 541 FVKTMHWGVNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASATSWETDYQV 600

Query: 601 IVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQP 660
           IVRKFTEKGYGSAVPQIVFWNLR+SRATPVP KEKGVALVSGYSKNLMNLFL+ +G IQP
Sbjct: 601 IVRKFTEKGYGSAVPQIVFWNLRDSRATPVPGKEKGVALVSGYSKNLMNLFLDGDGVIQP 660

BLAST of HG10017889 vs. ExPASy TrEMBL
Match: A0A1S3B5W1 (uncharacterized protein LOC103486117 OS=Cucumis melo OX=3656 GN=LOC103486117 PE=4 SV=1)

HSP 1 Score: 1125.9 bits (2911), Expect = 0.0e+00
Identity = 575/680 (84.56%), Positives = 617/680 (90.74%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVS--------VPLQPSQTA-----------SGDPFVDALVAN 60
           MAPPSLLGPPELY AA+PVS        V LQP+++A           SG PFVDA++AN
Sbjct: 1   MAPPSLLGPPELYHAASPVSLQPTESAPVSLQPTESAPVSLQPTESTPSGVPFVDAMLAN 60

Query: 61  FNNIDN-PDDNLLPMGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNP 120
           FNNI+N  DDNL PMGFTENMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NP
Sbjct: 61  FNNINNHSDDNLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120

Query: 121 LMTLKLICNLRGVRGTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILY 180
           LMTLKLICNLRGVRGTGKSDK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILY
Sbjct: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180

Query: 181 RLLEGSDVRQNQKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQS 240
           RLLEGSDVR+NQKKEW +RK    S KR SS R  RGGLS+R+ SFKQEKPKTRKKEIQS
Sbjct: 181 RLLEGSDVRKNQKKEWGERK--GKSRKRLSSPR--RGGLSVRYGSFKQEKPKTRKKEIQS 240

Query: 241 STGGEANISKAMETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFAD 300
           S   EANISKAME SRIEKEKASAERK++KVSMA+KVMERFQSDPNFQLL++RISDFF D
Sbjct: 241 SIDREANISKAMEKSRIEKEKASAERKLRKVSMARKVMERFQSDPNFQLLHDRISDFFTD 300

Query: 301 CLKSDLQFLNSGELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEA 360
           CLKSDLQF+NSG+  +ISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEY GIEEA
Sbjct: 301 CLKSDLQFMNSGDFTRISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEA 360

Query: 361 HYAYRVRDRLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGE 420
           HYAYRVRDRLRK VLVPLRKVLELPEVY+GANRWDSIPYNRVASVAMKNYKEKFM+HDGE
Sbjct: 361 HYAYRVRDRLRKDVLVPLRKVLELPEVYIGANRWDSIPYNRVASVAMKNYKEKFMKHDGE 420

Query: 421 RFGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRN 480
           RF QYL+DVKDGKTKIAAGALLPHEII SL DG+ED GEVAELQWKRMVDDLLKKGKLR+
Sbjct: 421 RFAQYLKDVKDGKTKIAAGALLPHEIIMSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRD 480

Query: 481 CISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTD 540
           CI+VCDVSGSM GIPMDVC+ALGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +
Sbjct: 481 CIAVCDVSGSMEGIPMDVCIALGLLVSELSEDPWKGKVITFSANPELHVIQGDSLKSKAE 540

Query: 541 FIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQV 600
           F+K M WG NTDFQKVFDQILKVAVD KL EEQM+KR+FVFSDMEFDQASA SWETDYQV
Sbjct: 541 FVKTMHWGVNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASATSWETDYQV 600

Query: 601 IVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQP 660
           IVRKFTEKGYGSAVPQIVFWNLR+SRATPVP KEKGVALVSGYSKNLMNLFL+ +G IQP
Sbjct: 601 IVRKFTEKGYGSAVPQIVFWNLRDSRATPVPGKEKGVALVSGYSKNLMNLFLDGDGVIQP 660

BLAST of HG10017889 vs. ExPASy TrEMBL
Match: A0A5D3CYJ7 (GPI inositol-deacylase PGAP1-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold32G00180 PE=4 SV=1)

HSP 1 Score: 1123.2 bits (2904), Expect = 0.0e+00
Identity = 574/680 (84.41%), Positives = 616/680 (90.59%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVS--------VPLQPSQTA-----------SGDPFVDALVAN 60
           MAPPSLLGPPELY AA+PVS        V LQP+++A           SG PFVDA++AN
Sbjct: 1   MAPPSLLGPPELYHAASPVSLQPTESAPVSLQPTESAPVSLQPTESTPSGVPFVDAMLAN 60

Query: 61  FNNIDN-PDDNLLPMGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNP 120
           FNNI+N  DDNL PMGFTENMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NP
Sbjct: 61  FNNINNHSDDNLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120

Query: 121 LMTLKLICNLRGVRGTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILY 180
           LMTLKLICNLRGVRGTGKSDK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILY
Sbjct: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180

Query: 181 RLLEGSDVRQNQKKEWLQRKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQS 240
           RLLEGSDVR+NQKKEW +RK    S KR SS R  RGGLS+R+ SFKQEKPKTRKKEIQS
Sbjct: 181 RLLEGSDVRKNQKKEWGERK--GKSRKRLSSPR--RGGLSVRYGSFKQEKPKTRKKEIQS 240

Query: 241 STGGEANISKAMETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFAD 300
           S   EANISKAME SRIEKEKASAERK++KVSMA+KVMERFQSDPNFQLL++RISDFF D
Sbjct: 241 SIDREANISKAMEKSRIEKEKASAERKLRKVSMARKVMERFQSDPNFQLLHDRISDFFTD 300

Query: 301 CLKSDLQFLNSGELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEA 360
           CLKSDLQF+NSG+  +ISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEY GIEEA
Sbjct: 301 CLKSDLQFMNSGDFTRISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEA 360

Query: 361 HYAYRVRDRLRKQVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGE 420
           HYAYRVRDRLRK VLVPLRKVLELPEVY+GANRWDSIPYNRVASVAMKNYKEKFM+HDGE
Sbjct: 361 HYAYRVRDRLRKDVLVPLRKVLELPEVYIGANRWDSIPYNRVASVAMKNYKEKFMKHDGE 420

Query: 421 RFGQYLQDVKDGKTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRN 480
           RF QYL+DVKDGKTKIAAGALLPHEII SL DG+ED GEVA LQWKRMVDDLLKKGKLR+
Sbjct: 421 RFAQYLKDVKDGKTKIAAGALLPHEIIMSLFDGQEDGGEVAVLQWKRMVDDLLKKGKLRD 480

Query: 481 CISVCDVSGSMSGIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTD 540
           CI+VCDVSGSM GIPMDVC+ALGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +
Sbjct: 481 CIAVCDVSGSMEGIPMDVCIALGLLVSELSEDPWKGKVITFSANPELHVIQGDSLKSKAE 540

Query: 541 FIKRMDWGYNTDFQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQV 600
           F+K M WG NTDFQKVFDQILKVAVD KL EEQM+KR+FVFSDMEFDQASA SWETDYQV
Sbjct: 541 FVKTMHWGVNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASATSWETDYQV 600

Query: 601 IVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQP 660
           IVRKFTEKGYGSAVPQIVFWNLR+SRATPVP KEKGVALVSGYSKNLMNLFL+ +G IQP
Sbjct: 601 IVRKFTEKGYGSAVPQIVFWNLRDSRATPVPGKEKGVALVSGYSKNLMNLFLDGDGVIQP 660

BLAST of HG10017889 vs. ExPASy TrEMBL
Match: A0A0A0L2K6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G538590 PE=4 SV=1)

HSP 1 Score: 1094.0 bits (2828), Expect = 0.0e+00
Identity = 552/661 (83.51%), Positives = 595/661 (90.02%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVSVPLQPSQ-TASGDPFVDALVANFNNIDNPDDNLLPMGFTE 60
           MAPP+LLGPPELY AAAPVS  LQP++ T SGDPFVDA+VANFN     DD+L PMGFTE
Sbjct: 1   MAPPNLLGPPELYHAAAPVS--LQPTESTPSGDPFVDAMVANFN---KTDDSLPPMGFTE 60

Query: 61  NMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKS 120
           NMS TFLSTGNPCLDFFFHVVPDTPA+SLI+RLSLAWN+NPLMTLKLICNLRGVRGTGKS
Sbjct: 61  NMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKS 120

Query: 121 DKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQR 180
           DK+GYYTAALWL+ FH KTLAGNIPSIADFGYFKDLPEILYRLLEGSDVR+NQK EW +R
Sbjct: 121 DKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRR 180

Query: 181 KRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEK 240
                             GLS+RH  FKQEKPKTRKKEIQSST  EANISKAME SRIEK
Sbjct: 181 ------------------GLSVRHGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEK 240

Query: 241 EKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISL 300
           EKAS ERK++KVSMA+KVMERFQ+D NFQLL++RISDFF DCLKSDLQF+NSG+  KISL
Sbjct: 241 EKASGERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISL 300

Query: 301 AAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLR 360
           AAKWCPS+DSSFDRSTLLCESIARK+FPRE +PEY  IEEAHYAYRVRDRLR  VLVPLR
Sbjct: 301 AAKWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLR 360

Query: 361 KVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAG 420
           KVLELPEV++GANRWDSIPYNRVASVAMKNYKEKFM+HDGERF QYL+DVKDGKTKIAAG
Sbjct: 361 KVLELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAG 420

Query: 421 ALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVC 480
           ALLPHEII SL DG+ED GEVAELQWKRMVDDLLKKGKLR CI+VCDVSGSM GIPMDVC
Sbjct: 421 ALLPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVC 480

Query: 481 VALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQ 540
           V LGLLVSELSEDPWKGKVITFSA+P+LH+IQGDSLKSK +F+K MDWG NTDFQKVFDQ
Sbjct: 481 VGLGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQ 540

Query: 541 ILKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQVIVRKFTEKGYGSAVPQIVF 600
           ILKVAVD KL EEQM+KR+FVFSDMEFDQAS  SWETDYQVIVRKFTEKGYGSAVPQIVF
Sbjct: 541 ILKVAVDGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVF 600

Query: 601 WNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQPEAVMEQAISGSEYQKLVVL 660
           WNLR+SRATPVP+ EKGVALVSGYSKNLMNLFL+ +G IQPEAVME+AISG+EYQKLVVL
Sbjct: 601 WNLRDSRATPVPSNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVL 638

BLAST of HG10017889 vs. ExPASy TrEMBL
Match: A0A6J1ELM1 (uncharacterized protein LOC111435535 OS=Cucurbita moschata OX=3662 GN=LOC111435535 PE=4 SV=1)

HSP 1 Score: 1092.8 bits (2825), Expect = 0.0e+00
Identity = 551/660 (83.48%), Positives = 595/660 (90.15%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVSVPLQPSQTASGDPFVDALVANFNNIDNPDDNLLPMGFTEN 60
           MAPPSLLGPPELY    P S P QP  T +GDPFVDALVANFN +D  DD L PMGFTEN
Sbjct: 1   MAPPSLLGPPELYTPFQP-SQPTQP--TPTGDPFVDALVANFNKVDTNDDELPPMGFTEN 60

Query: 61  MSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKSD 120
           MSVTFLS+GNPCLDFFFHVVPDTP++SL ERLS+AWN+NPLMTLKLICNLRGVRGTGKSD
Sbjct: 61  MSVTFLSSGNPCLDFFFHVVPDTPSESLTERLSVAWNHNPLMTLKLICNLRGVRGTGKSD 120

Query: 121 KKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQRK 180
           K+GYYTAALWLHKFH KTLAGNIPS+ADFGYFKDLPE+LYRLLEGSDVR+NQK EW+ R+
Sbjct: 121 KEGYYTAALWLHKFHPKTLAGNIPSLADFGYFKDLPELLYRLLEGSDVRKNQKAEWIGRR 180

Query: 181 RGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEKE 240
           +G    KR  S     G  S     FK+EK KTRKKEIQSS   EA I+KAME S I KE
Sbjct: 181 KGRHM-KRRRSLSSESGSRSASDGEFKEEKLKTRKKEIQSSPDVEAKIAKAMERSMILKE 240

Query: 241 KASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISLA 300
           KAS ERKIKKVSMAKK +ER+QSDP+FQ LY+R+SDFFADCLKSDLQFLNSGEL KISLA
Sbjct: 241 KASTERKIKKVSMAKKALERYQSDPHFQRLYDRVSDFFADCLKSDLQFLNSGELNKISLA 300

Query: 301 AKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLRK 360
           AKWCPSVDSSFDRSTLLCESIARK+FPR+SDPEY GIEEAHYAYRVRDRLRKQVLVPLRK
Sbjct: 301 AKWCPSVDSSFDRSTLLCESIARKLFPRQSDPEYEGIEEAHYAYRVRDRLRKQVLVPLRK 360

Query: 361 VLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAGA 420
           VLELPE +MGAN+W++IPYNRVASVAMKNYK+KF++HDGERF QYL+DVK GKTKIAAGA
Sbjct: 361 VLELPESFMGANQWNAIPYNRVASVAMKNYKKKFVEHDGERFAQYLEDVKAGKTKIAAGA 420

Query: 421 LLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVCV 480
           LLPH+II SL+DGEED GEVAELQWKRMVDDLL+KGKLRNCISVCDVSGSM G PM+VCV
Sbjct: 421 LLPHQIIASLNDGEEDGGEVAELQWKRMVDDLLEKGKLRNCISVCDVSGSMGGTPMEVCV 480

Query: 481 ALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQI 540
           ALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKT FI  MDWGYNTDFQKVFDQI
Sbjct: 481 ALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTQFIMSMDWGYNTDFQKVFDQI 540

Query: 541 LKVAVDAKLNEEQMVKRLFVFSDMEFDQASANSWETDYQVIVRKFTEKGYGSAVPQIVFW 600
           LKVAVDAKL EEQMVKR+FVFSDMEFDQASANSWETDYQVIVRKF+EKGYGS+VPQIVFW
Sbjct: 541 LKVAVDAKLKEEQMVKRVFVFSDMEFDQASANSWETDYQVIVRKFSEKGYGSSVPQIVFW 600

Query: 601 NLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQPEAVMEQAISGSEYQKLVVLD 660
           NLR+SRATPVPA EKGVALVSG+SKNLMNLFLN +G IQP+A+ME A+SGSEYQKLVVLD
Sbjct: 601 NLRDSRATPVPANEKGVALVSGFSKNLMNLFLNGDGVIQPDAIMELAVSGSEYQKLVVLD 656

BLAST of HG10017889 vs. TAIR 10
Match: AT5G13210.1 (Uncharacterised conserved protein UCP015417, vWA )

HSP 1 Score: 797.7 bits (2059), Expect = 6.9e-231
Identity = 410/682 (60.12%), Positives = 510/682 (74.78%), Query Frame = 0

Query: 1   MAPPSLLGPPELYAAAAPVSVPLQPSQTAS-GDPFVDALVANFNNIDNPDD-NLLPMGFT 60
           M+P  LLGPPEL     P S+  +P+ T+   DPF+DA+V+NFNN    ++ N  PMG+T
Sbjct: 1   MSPSPLLGPPEL---RDPNSLLPKPTTTSGPSDPFMDAMVSNFNNSARVNNVNSPPMGYT 60

Query: 61  ENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGK 120
           EN S T+LS+GNPCLDFFFHVVP TP  SL + L  AW+++ L TLKLICNLRGVRGTGK
Sbjct: 61  ENKSATYLSSGNPCLDFFFHVVPSTPKHSLEQWLQGAWDHDALTTLKLICNLRGVRGTGK 120

Query: 121 SDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQ 180
           SDK+G+YTAALWLH  H KTLA N+ S++ FGYFKD PE+LYR+L+GS++R+ QK E  +
Sbjct: 121 SDKEGFYTAALWLHGRHPKTLACNLESLSQFGYFKDFPELLYRILQGSEIRKIQKSERFK 180

Query: 181 RKRGNSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIE 240
           RK   +  +R+        G           +P +++K + +       ++ A   ++ E
Sbjct: 181 RK-SEALDRRAPYDGHCYHGRLYGGRGRGSSRPSSKRKPVATRA---LRVANAERKNQAE 240

Query: 241 KEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKIS 300
           K +AS +RK KKVSM K    R+  DP+++ L+ER+SD FA+ LK DL+FL S +  +IS
Sbjct: 241 KARASLDRKKKKVSMGKDAFTRYSCDPDYRYLHERVSDLFANQLKKDLEFLTSDKPNEIS 300

Query: 301 LAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPL 360
           LAAKWCPS+DSSFD++TLLCESIARK+F RES PEY G+ EAHYAYRVRDRLRK VLVPL
Sbjct: 301 LAAKWCPSLDSSFDKATLLCESIARKIFTRESFPEYEGVVEAHYAYRVRDRLRKDVLVPL 360

Query: 361 RKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAA 420
           RK L+LPEVYMGA  WD +PYNRVASVAMK+YKE F++HD ERF QYL D K GKTK+AA
Sbjct: 361 RKTLQLPEVYMGARNWDILPYNRVASVAMKSYKEIFLKHDAERFQQYLDDAKAGKTKVAA 420

Query: 421 GALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDV 480
           GA+LPHEII+ LD G  D G+VAELQWKR VDD+ +KG LRNCI+VCDVSGSM+G PM+V
Sbjct: 421 GAVLPHEIIRELDGG--DGGQVAELQWKRTVDDMKEKGSLRNCIAVCDVSGSMNGEPMEV 480

Query: 481 CVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFD 540
           CVALGLLVSELSE+PWKGK+ITFS +P+LHL++GD L SKT+F+K+M WG NTDFQKVFD
Sbjct: 481 CVALGLLVSELSEEPWKGKLITFSQNPELHLVKGDDLYSKTEFVKKMQWGMNTDFQKVFD 540

Query: 541 QILKVAVDAKLNEEQMVKRLFVFSDMEFDQAS--------------------ANSWETDY 600
            IL VAV  KL  E+M+KR+FVFSDMEFDQA+                    +N WETDY
Sbjct: 541 LILGVAVQEKLKPEEMIKRVFVFSDMEFDQAASSSHYSRPGYAFLRQPPSNPSNGWETDY 600

Query: 601 QVIVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDI 660
           +VIVRK+ + GYG  VP+IVFWNLR+SRATPVP  +KGVALVSG+SKNLM +FL  +G+I
Sbjct: 601 EVIVRKYKQNGYGDVVPEIVFWNLRDSRATPVPGNKKGVALVSGFSKNLMKMFLEHDGEI 660

BLAST of HG10017889 vs. TAIR 10
Match: AT5G43400.1 (Uncharacterised conserved protein UCP015417, vWA )

HSP 1 Score: 740.3 bits (1910), Expect = 1.3e-213
Identity = 384/681 (56.39%), Positives = 486/681 (71.37%), Query Frame = 0

Query: 6   LLGPPELYAAAAPVSVPLQPSQTASGDPFVDALVANFNNIDNPDDNLLPMGFTENMSVTF 65
           LLGPP + A  +P+  P+   +T   D           N++ P     PMG TEN S TF
Sbjct: 8   LLGPPSV-AGNSPIIKPIHSPETHISDENTLISQTATLNLEEPP----PMGLTENFSPTF 67

Query: 66  LSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKSDKKGYY 125
           LS+GNPCLDFFFH+VPDT  D LI+RL+++W+++PL TLKLICNLRGVRGTGKSDK+G+Y
Sbjct: 68  LSSGNPCLDFFFHIVPDTSPDDLIQRLAISWSHDPLTTLKLICNLRGVRGTGKSDKEGFY 127

Query: 126 TAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEWLQRKRGNSS 185
           TAA WL+K H KTLA N+P++ DFGYFKDLPEIL+R+LEG ++ + + + W +R +    
Sbjct: 128 TAAFWLYKNHPKTLALNVPALVDFGYFKDLPEILFRILEGQNMERGKNRVWRKRVQRKFK 187

Query: 186 GKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEKEKASAE 245
           GKR               +S    + + R  E     GG            ++K KA A 
Sbjct: 188 GKREK-------------KSEISGEMEDRILENAEEIGGS-----------VDKVKARAL 247

Query: 246 RKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISLAAKWCP 305
           RK ++   AKK + R+ SD N++LL++RI+D FA  LKSDL++LNS  L KISLA+KWCP
Sbjct: 248 RKQREFEKAKKAVTRYNSDANYRLLFDRIADLFAVLLKSDLKYLNSNGLTKISLASKWCP 307

Query: 306 SVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLRKVLELP 365
           SVDSS+D++TL+CE+IAR++FPRE   EY GIEEAHYAYR+RDRLRK+VLVPL K LE P
Sbjct: 308 SVDSSYDKATLICEAIARRMFPRE---EYEGIEEAHYAYRIRDRLRKEVLVPLHKALEFP 367

Query: 366 EVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAGALLPHE 425
           E++M A  W+ + YNRV SVAMKNYK+ F +HD ERF ++L+DVK GK KIAAGALLPH+
Sbjct: 368 ELFMSAKEWNLLKYNRVPSVAMKNYKKLFEEHDSERFTEFLEDVKSGKKKIAAGALLPHQ 427

Query: 426 IIKSLDD--GEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVCVALG 485
           II  L+D  G E   EVAELQW RMVDDL KKGKL+N ++VCDVSGSMSG PM+VCVALG
Sbjct: 428 IINQLEDDSGSEVGAEVAELQWARMVDDLAKKGKLKNSLAVCDVSGSMSGTPMEVCVALG 487

Query: 486 LLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQILKV 545
           LLVSELSE+PWKGKVITFS +P+LH++ G SL+ KT F++ M+WG NTDFQ VFD+IL+V
Sbjct: 488 LLVSELSEEPWKGKVITFSENPELHIVTGSSLREKTQFVREMEWGMNTDFQIVFDRILEV 547

Query: 546 AVDAKLNEEQMVKRLFVFSDMEFDQASANS------------------------WETDYQ 605
           AV+  L ++QM+KRLFVFSDMEFD A ANS                        WETDY+
Sbjct: 548 AVENNLTDDQMIKRLFVFSDMEFDDAMANSHSEVSYHLSVEDRLKISKERSKEKWETDYE 607

Query: 606 VIVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGDIQ 661
           V+ RK+ EKG+ + VP++VFWNLR+S ATPV A +KGVA+VSG+SKNL+ LFL + G + 
Sbjct: 608 VVQRKYKEKGFQN-VPEMVFWNLRDSSATPVVANQKGVAMVSGFSKNLLTLFLEEGGIVN 655

BLAST of HG10017889 vs. TAIR 10
Match: AT3G24780.1 (Uncharacterised conserved protein UCP015417, vWA )

HSP 1 Score: 721.1 bits (1860), Expect = 8.2e-208
Identity = 366/606 (60.40%), Positives = 452/606 (74.59%), Query Frame = 0

Query: 55  MGFTENMSVTFLSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVR 114
           MG+TEN S T+LS+GNPCLDFFFH+VP TP  SL +RL  AW+++ L TLKLICNLRGVR
Sbjct: 110 MGYTENRSATYLSSGNPCLDFFFHIVPSTPKKSLEQRLEEAWDHDSLTTLKLICNLRGVR 169

Query: 115 GTGKSDKKGYYTAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKK 174
           GTGKSDK+G+YTAALWLH  H KTLA N+ S++ FGYFKD PEILYR+L+G ++R  QK 
Sbjct: 170 GTGKSDKEGFYTAALWLHGRHPKTLACNLESLSKFGYFKDFPEILYRILQGPEIRSIQKT 229

Query: 175 EWLQRKRGNSSGKRSSSTRRGR--GGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAM 234
           +        S  +RS  +R GR  GG   R   F +    TR          E  ++ A 
Sbjct: 230 QRYDTIAAASLRRRSRFSRGGRGFGGGRSRGRHFLKRSAATR----------ELRVANAE 289

Query: 235 ETSRIEKEKASAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSG 294
             ++ EK +AS +RK KKVSMAK    ++ +DPN++ L+ER+S+ FA+ LK DL+FL SG
Sbjct: 290 RKNQEEKARASLKRKQKKVSMAKAASTKYSNDPNYRFLHERVSELFANQLKRDLEFLTSG 349

Query: 295 ELRKISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRK 354
           +  KISLAAKWCPS+DSSFD++TL+CESIARK+FP+ES PEY G+E+AHYAYRVRDRLRK
Sbjct: 350 QPNKISLAAKWCPSLDSSFDKATLICESIARKIFPQESFPEYEGVEDAHYAYRVRDRLRK 409

Query: 355 QVLVPLRKVLELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDG 414
           QVLVPLRK L+LPEVYMGA  W S+PYNRVASVAMK+YKE F+  D +RF QYL D K G
Sbjct: 410 QVLVPLRKTLQLPEVYMGARAWQSLPYNRVASVAMKSYKEVFLYRDEKRFQQYLNDAKTG 469

Query: 415 KTKIAAGALLPHEIIKSLDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMS 474
           KTKIAAGA+LPHEII+ L+ G  D G+VAELQWKRMVDDL +KG L NC+++CDVSGSM+
Sbjct: 470 KTKIAAGAVLPHEIIRELNGG--DGGKVAELQWKRMVDDLKEKGSLTNCMAICDVSGSMN 529

Query: 475 GIPMDVCVALGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTD 534
           G PM+V VALGLLVSELSE+PWKGK+ITF   P+LHL++GD L+SKT+F++ M W  NTD
Sbjct: 530 GEPMEVSVALGLLVSELSEEPWKGKLITFRQSPELHLVKGDDLRSKTEFVESMQWDMNTD 589

Query: 535 FQKVFDQILKVAVDAKLNEEQMVKRLFVFSDMEFDQASA--------------------- 594
           FQKVFD ILKVAV++KL  + M+KR+FVFSDMEFD+AS                      
Sbjct: 590 FQKVFDLILKVAVESKLKPQDMIKRVFVFSDMEFDEASTSTSSFNKWRSSPPTPSNRWDT 649

Query: 595 ------------NSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVAL 626
                       ++W+TDY+VIVRK+ EKGYG AVP+IVFWNLR+SR+TPV   +KGVAL
Sbjct: 650 LSYSEDDEDEENDAWQTDYKVIVRKYREKGYGEAVPEIVFWNLRDSRSTPVLGNKKGVAL 703

BLAST of HG10017889 vs. TAIR 10
Match: AT5G43390.1 (Uncharacterised conserved protein UCP015417, vWA )

HSP 1 Score: 721.1 bits (1860), Expect = 8.2e-208
Identity = 379/683 (55.49%), Positives = 486/683 (71.16%), Query Frame = 0

Query: 6   LLGPPELYAAAAPVSVPLQPSQTASGDPFVDALVANFNNIDNPDDNLLPMGFTENMSVTF 65
           LLGPP + A   PV          S D  V + +A   N++ P      MG TEN S TF
Sbjct: 9   LLGPPSVAAMETPV----------SDDNSVISQIATL-NLEEPQ-----MGLTENFSPTF 68

Query: 66  LSTGNPCLDFFFHVVPDTPADSLIERLSLAWNYNPLMTLKLICNLRGVRGTGKSDKKGYY 125
           L++GNPCLDFFFH+VPDTP+D LI+RL+++W+++PL TLKL+CNLRGVRGTGKSDK+G+Y
Sbjct: 69  LTSGNPCLDFFFHIVPDTPSDDLIQRLAISWSHDPLTTLKLLCNLRGVRGTGKSDKEGFY 128

Query: 126 TAALWLHKFHHKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRQNQKKEW---LQRKRG 185
           TAALWL+K H KTLA NIP++ DFGYFKDLPEIL R+LEG    + + + W   +QRK  
Sbjct: 129 TAALWLYKNHPKTLALNIPTLVDFGYFKDLPEILLRILEGQQTERGKTRVWRKRIQRKFK 188

Query: 186 NSSGKRSSSTRRGRGGLSIRHESFKQEKPKTRKKEIQSSTGGEANISKAMETSRIEKEKA 245
             S K+S+ +    G +            + R  E    TGG            + K KA
Sbjct: 189 GDSEKKSTIS----GDM------------EDRILETAEETGGP-----------VGKVKA 248

Query: 246 SAERKIKKVSMAKKVMERFQSDPNFQLLYERISDFFADCLKSDLQFLNSGELRKISLAAK 305
            A RK ++   AKK ++R+ SD N++LL+++I+D FA+ LKSDL++LN+  L KISLA+K
Sbjct: 249 RALRKQREFEKAKKALDRYNSDANYRLLFDQIADLFAELLKSDLEYLNTDNLNKISLASK 308

Query: 306 WCPSVDSSFDRSTLLCESIARKVFPRESDPEYVGIEEAHYAYRVRDRLRKQVLVPLRKVL 365
           WCPSVDSS+D++TL+CE+IAR++F RE   E  GIEE HYAYR+RDRLRK+VLVPL K L
Sbjct: 309 WCPSVDSSYDKTTLICEAIARRMFLREEYEE--GIEEVHYAYRIRDRLRKEVLVPLHKAL 368

Query: 366 ELPEVYMGANRWDSIPYNRVASVAMKNYKEKFMQHDGERFGQYLQDVKDGKTKIAAGALL 425
           ELPEV M A  W+ + YNRV S+AM+NY  +F +HD ERF ++L+DVK GK K+AAGALL
Sbjct: 369 ELPEVSMSAKEWNLLKYNRVPSIAMQNYSSRFAEHDSERFTEFLEDVKSGKKKMAAGALL 428

Query: 426 PHEIIKS-LDDGEEDCGEVAELQWKRMVDDLLKKGKLRNCISVCDVSGSMSGIPMDVCVA 485
           PH+II   L+D E +  EVAELQW RMVDDL KKGKL+N +++CDVSGSM+G PM+VC+A
Sbjct: 429 PHQIISQLLNDSEGE--EVAELQWARMVDDLAKKGKLKNSLAICDVSGSMAGTPMNVCIA 488

Query: 486 LGLLVSELSEDPWKGKVITFSADPQLHLIQGDSLKSKTDFIKRMDWGYNTDFQKVFDQIL 545
           LGLLVSEL+E+PWKGKVITFS +PQLH++ G SL+ KT F++ MD+G NTDFQKVFD+IL
Sbjct: 489 LGLLVSELNEEPWKGKVITFSENPQLHVVTGSSLREKTKFVREMDFGINTDFQKVFDRIL 548

Query: 546 KVAVDAKLNEEQMVKRLFVFSDMEFDQASANS------------------------WETD 605
           +VAV+  L +EQM+KRLFVFSDMEFD A  +S                        WETD
Sbjct: 549 EVAVENNLTDEQMIKRLFVFSDMEFDDARVDSHSEMSDYASNLESDYESVPESFEKWETD 608

Query: 606 YQVIVRKFTEKGYGSAVPQIVFWNLRNSRATPVPAKEKGVALVSGYSKNLMNLFLNDNGD 661
           Y+V+ RK+ EKG+ + VP+IVFWNLR+S ATPV +K+KGVA+VSG+SKNL+ LFL + G 
Sbjct: 609 YEVVQRKYKEKGFQN-VPEIVFWNLRDSSATPVVSKQKGVAMVSGFSKNLLTLFLEEGGI 643

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038881761.1 | 0.0e+00 | 91.59 | uncharacterized protein LOC120073170 [Benincasa hispida]
XP_008442184.1 | 0.0e+00 | 84.56 | PREDICTED: uncharacterized protein LOC103486117 [Cucumis melo] >KAA0041221.1 GPI...
TYK16024.1 | 0.0e+00 | 84.41 | GPI inositol-deacylase PGAP1-like protein [Cucumis melo var. makuwa]
XP_004144675.2 | 0.0e+00 | 83.66 | uncharacterized protein LOC101205449 [Cucumis sativus] >KGN55197.2 hypothetical ...
XP_022928704.1 | 0.0e+00 | 83.48 | uncharacterized protein LOC111435535 [Cucurbita moschata]

Match Name | E-value | Identity | Description
Q5UNY4 | 3.5e-54 | 26.50 | Uncharacterized protein L728 OS=Acanthamoeba polyphaga mimivirus OX=212035 GN=MI...

Match Name | E-value | Identity | Description
A0A5A7THS9 | 0.0e+00 | 84.56 | GPI inositol-deacylase PGAP1-like protein OS=Cucumis melo var. makuwa OX=1194695...
A0A1S3B5W1 | 0.0e+00 | 84.56 | uncharacterized protein LOC103486117 OS=Cucumis melo OX=3656 GN=LOC103486117 PE=...
A0A5D3CYJ7 | 0.0e+00 | 84.41 | GPI inositol-deacylase PGAP1-like protein OS=Cucumis melo var. makuwa OX=1194695...
A0A0A0L2K6 | 0.0e+00 | 83.51 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G538590 PE=4 SV=1
A0A6J1ELM1 | 0.0e+00 | 83.48 | uncharacterized protein LOC111435535 OS=Cucurbita moschata OX=3662 GN=LOC1114355...

Match Name | E-value | Identity | Description
AT5G13210.1 | 6.9e-231 | 60.12 | Uncharacterised conserved protein UCP015417, vWA
AT5G43400.1 | 1.3e-213 | 56.39 | Uncharacterised conserved protein UCP015417, vWA
AT3G24780.1 | 8.2e-208 | 60.40 | Uncharacterised conserved protein UCP015417, vWA
AT5G43390.1 | 8.2e-208 | 55.49 | Uncharacterised conserved protein UCP015417, vWA
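
The Identity values in these summary tables can be cross-checked against the HSP statistics lines printed above each alignment. The following is a minimal Python sketch, not part of the database's own tooling: the function name `parse_hsp_stats` and the dictionary keys are hypothetical, and the regular expression assumes the line layout seen on this page.

```python
import re

# Matches the statistics line printed under each HSP header. The pattern
# accepts any lowercase letters between "Pos" and "ves" so that spelling
# variants of the "Positives" label are tolerated.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos[a-z]*ves = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one HSP."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_reported": float(ident_pct),
        "positives_reported": float(pos_pct),
        # Percentages are recomputed from the raw counts, rounded to the
        # two decimals used on this page.
        "identity_recomputed": round(100 * int(ident) / int(ident_len), 2),
        "positives_recomputed": round(100 * int(pos) / int(pos_len), 2),
    }


# Statistics line of the Cucumis sativus hit A0A0A0L2K6, copied from this page.
stats = parse_hsp_stats(
    "Identity = 552/661 (83.51%), Positives = 595/661 (90.02%), Query Frame = 0"
)
print(stats["identity_reported"], stats["identity_recomputed"])
```

For this hit the recomputed identity agrees with the 83.51 listed in the table, which suggests the Identity column is the number of identical positions divided by the aligned length including gaps.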
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011205 | Uncharacterised conserved protein UCP015417, vWA | PFAM | PF11443 | DUF2828 | coord: 58..642; e-value: 1.8E-223; score: 743.3
IPR011205 | Uncharacterised conserved protein UCP015417, vWA | PIRSF | PIRSF015417 | T31B5_30_vWA | coord: 569..660; e-value: 6.3E-42; score: 141.5
IPR011205 | Uncharacterised conserved protein UCP015417, vWA | PIRSF | PIRSF015417 | T31B5_30_vWA | coord: 1..573; e-value: 6.6E-241; score: 799.0
IPR011205 | Uncharacterised conserved protein UCP015417, vWA | PANTHER | PTHR31373 | OS06G0652100 PROTEIN | coord: 6..660
IPR036465 | von Willebrand factor A-like domain superfamily | GENE3D | 3.40.50.410 | von Willebrand factor, type A domain | coord: 453..633; e-value: 5.4E-11; score: 44.6
IPR036465 | von Willebrand factor A-like domain superfamily | SUPERFAMILY | 53300 | vWA-like | coord: 462..589
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 173..241
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 197..219
None | No IPR available | CDD | cd00198 | vWFA | coord: 464..563; e-value: 6.7643E-5; score: 41.7826
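
The `coord:` ranges above are residue positions on the predicted protein, so the span of each annotation and the region shared by the three vWA-type predictions (GENE3D, SUPERFAMILY, CDD) follow from simple interval arithmetic. The sketch below is illustrative only: the helper names `parse_coord`, `span_length` and `intersect` are hypothetical, and it assumes the coordinates are 1-based and inclusive.

```python
import re

# Pattern for the "coord: start..end" strings used in the InterPro table.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Length in residues, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def intersect(spans):
    """Region covered by every span, or None if they do not all overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


# vWA-type predictions copied from the table above.
vwa_hits = {
    "GENE3D 3.40.50.410": "coord: 453..633",
    "SUPERFAMILY 53300": "coord: 462..589",
    "CDD cd00198": "coord: 464..563",
}
spans = {name: parse_coord(text) for name, text in vwa_hits.items()}
core = intersect(list(spans.values()))

for name, span in spans.items():
    print(name, span, span_length(span))
print("shared region:", core, span_length(core))
```

Under these assumptions the three predictions share residues 464..563, which coincides with the CDD cd00198 hit because it is the shortest of the three and lies inside the other two.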

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10017889.1 | HG10017889.1 | mRNA