HG10011072 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10011072
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: hyccin
Location: Chr01: 2085780 .. 2087000 (+)
RNA-Seq Expression: HG10011072
Synteny: HG10011072
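
The Location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, it can be split into its parts and the feature span computed as below. The helper name `parse_location` and the regular expression are illustrative and not part of the database, and the coordinates are assumed to be 1-based and inclusive, as in GFF files.

```python
import re

# Hypothetical helper for location strings of the form "Chr01: 2085780 .. 2087000 (+)".
# Assumption: coordinates are 1-based and inclusive (GFF convention).
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Return chromosome, start, end, strand and span length for a location string."""
    m = LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Chr01: 2085780 .. 2087000 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr01 + 1221
```

Under that assumption the feature spans 1221 bp on the forward strand.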
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCCTCCTCTTCCTCCGACGACGACTCCCCCGCCGCCGTAGAACCAATCCCCGCGGAAGAAACCGCCGCCGAAAAAGAACCAACCCCTGCAGAAGAAACTGCTGTCGAAGAAACTGCCGTCGAAGAAACCACCGCCACTACCGAAGAAACCACCCCCACCATCGCCGTTGCCGAAGCACCCGTCATCAAAACCAGCAGCAGAGCGAGCGGAAGTGGCCCGGTGGTGAGATTCGACATCTCACAATCGTCATCCTCAACCACACTCGCACAAACCGCCATCGAATCCCTAAAATCCATCCTCCCAAATAACATCCCCTCATCTCTCCCTTCCGCTCCCAACCCCGCCCTCGCCCTCCTCAACGACCTCGAAACCACCGCTCAAATCACCGCCCTCCTCCGCCGTCCCACCTCGGGCGCCGGCGACGACAACCTCTGCCGTTGGCTCTACGATACTTTCCAATCCAACAACCCCGACCTTAAACTCGTCGTCCTCCGCTTCCTCCCTATCCTCCTCGGCGCCTACCTCTCCCGCGTCGTCTCCCGTCGCAAATCCCTCGCCGGCTTCGAAGCCGTTCTCCTCTCCCTCTACGCCCATGAAACCAACCGCCGCGCCAGCCAACCCCTCGCCGTCAACATCCCCGATTTAACCCACCCTAGCATTTACCACGAATCCAAATCCCCTCTCAAAAACAACGCCACCGCTCTCAATCTCGCCGTAATCTCCCCGTCGTTGGAGCCCCACGGCATGGTCCGCTCCACCAAGAGGGCCCGAATCGTCGGCGTCGCGCTCGAACTCTACTACACGAAAATCGATAAGATTCCAGAAACTTCCAAGATTGAGTTCTGCGAGTTTTGCCGGAAGTGGGCCGGCGGCGGCGATGATGAAAATGGAAGAGTTAAAAAAGAGGAAGCGGCAGAGGAGGAGAGTATTGGGAGAATTCCGTTGCCGTGGGAGATTCTTCAGCCAGTTTTGAGACTGTTAGGGCACTGTCTTTTGGGGTCGAATTTGATTACAAAGTGCAAGAAGAATGAAACGACGCCGTTGTTCGATGCGGCCATTGCTGCGATTCGGAGCTTGTATTTGAGATCGATGCATGATATTAATCCCAAAGCCATTTTGGCGACTGGGAGTTTGGTGAGGTTGGGGAATATGGCCATGGAGTCGACTGATGAAATTGATTACACCGAGATTCCTTATCAAACTGTCATCAACCTCTAG

mRNA sequence

ATGTCCTCCTCTTCCTCCGACGACGACTCCCCCGCCGCCGTAGAACCAATCCCCGCGGAAGAAACCGCCGCCGAAAAAGAACCAACCCCTGCAGAAGAAACTGCTGTCGAAGAAACTGCCGTCGAAGAAACCACCGCCACTACCGAAGAAACCACCCCCACCATCGCCGTTGCCGAAGCACCCGTCATCAAAACCAGCAGCAGAGCGAGCGGAAGTGGCCCGGTGGTGAGATTCGACATCTCACAATCGTCATCCTCAACCACACTCGCACAAACCGCCATCGAATCCCTAAAATCCATCCTCCCAAATAACATCCCCTCATCTCTCCCTTCCGCTCCCAACCCCGCCCTCGCCCTCCTCAACGACCTCGAAACCACCGCTCAAATCACCGCCCTCCTCCGCCGTCCCACCTCGGGCGCCGGCGACGACAACCTCTGCCGTTGGCTCTACGATACTTTCCAATCCAACAACCCCGACCTTAAACTCGTCGTCCTCCGCTTCCTCCCTATCCTCCTCGGCGCCTACCTCTCCCGCGTCGTCTCCCGTCGCAAATCCCTCGCCGGCTTCGAAGCCGTTCTCCTCTCCCTCTACGCCCATGAAACCAACCGCCGCGCCAGCCAACCCCTCGCCGTCAACATCCCCGATTTAACCCACCCTAGCATTTACCACGAATCCAAATCCCCTCTCAAAAACAACGCCACCGCTCTCAATCTCGCCGTAATCTCCCCGTCGTTGGAGCCCCACGGCATGGTCCGCTCCACCAAGAGGGCCCGAATCGTCGGCGTCGCGCTCGAACTCTACTACACGAAAATCGATAAGATTCCAGAAACTTCCAAGATTGAGTTCTGCGAGTTTTGCCGGAAGTGGGCCGGCGGCGGCGATGATGAAAATGGAAGAGTTAAAAAAGAGGAAGCGGCAGAGGAGGAGAGTATTGGGAGAATTCCGTTGCCGTGGGAGATTCTTCAGCCAGTTTTGAGACTGTTAGGGCACTGTCTTTTGGGGTCGAATTTGATTACAAAGTGCAAGAAGAATGAAACGACGCCGTTGTTCGATGCGGCCATTGCTGCGATTCGGAGCTTGTATTTGAGATCGATGCATGATATTAATCCCAAAGCCATTTTGGCGACTGGGAGTTTGGTGAGGTTGGGGAATATGGCCATGGAGTCGACTGATGAAATTGATTACACCGAGATTCCTTATCAAACTGTCATCAACCTCTAG

Coding sequence (CDS)

ATGTCCTCCTCTTCCTCCGACGACGACTCCCCCGCCGCCGTAGAACCAATCCCCGCGGAAGAAACCGCCGCCGAAAAAGAACCAACCCCTGCAGAAGAAACTGCTGTCGAAGAAACTGCCGTCGAAGAAACCACCGCCACTACCGAAGAAACCACCCCCACCATCGCCGTTGCCGAAGCACCCGTCATCAAAACCAGCAGCAGAGCGAGCGGAAGTGGCCCGGTGGTGAGATTCGACATCTCACAATCGTCATCCTCAACCACACTCGCACAAACCGCCATCGAATCCCTAAAATCCATCCTCCCAAATAACATCCCCTCATCTCTCCCTTCCGCTCCCAACCCCGCCCTCGCCCTCCTCAACGACCTCGAAACCACCGCTCAAATCACCGCCCTCCTCCGCCGTCCCACCTCGGGCGCCGGCGACGACAACCTCTGCCGTTGGCTCTACGATACTTTCCAATCCAACAACCCCGACCTTAAACTCGTCGTCCTCCGCTTCCTCCCTATCCTCCTCGGCGCCTACCTCTCCCGCGTCGTCTCCCGTCGCAAATCCCTCGCCGGCTTCGAAGCCGTTCTCCTCTCCCTCTACGCCCATGAAACCAACCGCCGCGCCAGCCAACCCCTCGCCGTCAACATCCCCGATTTAACCCACCCTAGCATTTACCACGAATCCAAATCCCCTCTCAAAAACAACGCCACCGCTCTCAATCTCGCCGTAATCTCCCCGTCGTTGGAGCCCCACGGCATGGTCCGCTCCACCAAGAGGGCCCGAATCGTCGGCGTCGCGCTCGAACTCTACTACACGAAAATCGATAAGATTCCAGAAACTTCCAAGATTGAGTTCTGCGAGTTTTGCCGGAAGTGGGCCGGCGGCGGCGATGATGAAAATGGAAGAGTTAAAAAAGAGGAAGCGGCAGAGGAGGAGAGTATTGGGAGAATTCCGTTGCCGTGGGAGATTCTTCAGCCAGTTTTGAGACTGTTAGGGCACTGTCTTTTGGGGTCGAATTTGATTACAAAGTGCAAGAAGAATGAAACGACGCCGTTGTTCGATGCGGCCATTGCTGCGATTCGGAGCTTGTATTTGAGATCGATGCATGATATTAATCCCAAAGCCATTTTGGCGACTGGGAGTTTGGTGAGGTTGGGGAATATGGCCATGGAGTCGACTGATGAAATTGATTACACCGAGATTCCTTATCAAACTGTCATCAACCTCTAG

Protein sequence

MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEAPVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALLNDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVVSRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAVISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRVKKEEAAEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAIAAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL
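
The protein sequence corresponds to the CDS read codon by codon. The sketch below is a rough illustration that assumes the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only short fragments copied from the start and end of the CDS above are translated.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGTCCTCCTCTTCCTCCGACGACGACTCC"))  # MSSSSSDDDS
print(translate("ATCAACCTCTAG"))                    # INL
```

Both fragments agree with the start (MSSSSSDDDS) and end (INL) of the listed protein sequence.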
Homology
BLAST of HG10011072 vs. NCBI nr
Match: XP_038879407.1 (uncharacterized protein LOC120071285 [Benincasa hispida])

HSP 1 Score: 699.1 bits (1803), Expect = 2.2e-197
Identity = 373/410 (90.98%), Positives = 386/410 (94.15%), Query Frame = 0

Query: 1   MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEA 60
           MSSSSSDDDSPAAVEP PAEE AAE++PTPAEETA++E      TA  EETTPTIA AEA
Sbjct: 1   MSSSSSDDDSPAAVEPTPAEEPAAEEKPTPAEETAIDE------TAAAEETTPTIAAAEA 60

Query: 61  PVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALL 120
           PVIKTSSRASGSGPVVRFDISQSSS TT+A+TAIESLKSILP NIPSSLPSAPNPALALL
Sbjct: 61  PVIKTSSRASGSGPVVRFDISQSSSLTTIAKTAIESLKSILP-NIPSSLPSAPNPALALL 120

Query: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180
           NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLP+LLGAYLSRVV
Sbjct: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPVLLGAYLSRVV 180

Query: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAV 240
           SRRKSLAGFEAVLLSLYAHETNRRASQPL+VNIPDLTHPSIYHESKSPLKNNATALNLAV
Sbjct: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLSVNIPDLTHPSIYHESKSPLKNNATALNLAV 240

Query: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRV 300
           ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA GGDDE G+V
Sbjct: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWASGGDDEKGKV 300

Query: 301 KKEEA----AEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAIAA 360
           K EEA    AEEESIGRIPLPWE+LQP+LR+LGHCLLGSN+  KCKKNETTPLFDAAIAA
Sbjct: 301 KNEEAAAEEAEEESIGRIPLPWEMLQPILRVLGHCLLGSNVTAKCKKNETTPLFDAAIAA 360

Query: 361 IRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
           IRSLYLRSMHDINPKAILATGSLVRLGNMAMES DEIDYTEIPYQTVINL
Sbjct: 361 IRSLYLRSMHDINPKAILATGSLVRLGNMAMESADEIDYTEIPYQTVINL 403
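
The identity percentage in an HSP header is the number of identical positions divided by the alignment length, which includes gap columns. As a small sketch (the function name `identity_from_line` and the regular expression are illustrative, not part of any BLAST tooling), the figure can be recomputed from the header line:

```python
import re

def identity_from_line(line):
    """Extract 'matches/length' from an HSP header line and recompute the percentage."""
    m = re.search(r"Identity = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100.0 * matches / length, 2)

print(identity_from_line("Identity = 373/410 (90.98%)"))  # (373, 410, 90.98)
```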

BLAST of HG10011072 vs. NCBI nr
Match: XP_008456035.1 (PREDICTED: hyccin [Cucumis melo])

HSP 1 Score: 642.9 bits (1657), Expect = 1.9e-180
Identity = 353/412 (85.68%), Positives = 363/412 (88.11%), Query Frame = 0

Query: 2   SSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEAP 61
           SSSSSDDDSPAAVEP PAEETA  KEP    ETA+EE A     A  EETTPTIA  EAP
Sbjct: 3   SSSSSDDDSPAAVEPTPAEETAENKEP----ETAIEEIA---APADAEETTPTIAAVEAP 62

Query: 62  VIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALLN 121
           V KTSSRASGSGPVVRFDISQSSS TT+AQTAIESL  ILPN IPSSL SAPNPALALLN
Sbjct: 63  VTKTSSRASGSGPVVRFDISQSSSLTTIAQTAIESLIPILPNTIPSSLSSAPNPALALLN 122

Query: 122 DLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVVS 181
           DLET AQITALLRRPTSGAGDDNLCRWLYDTFQS+NPDLKLVVLRFLP+LL AYLSRVVS
Sbjct: 123 DLETIAQITALLRRPTSGAGDDNLCRWLYDTFQSSNPDLKLVVLRFLPVLLSAYLSRVVS 182

Query: 182 RRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAVI 241
           RRKSLAGFEAVLLSLYAHETNRRA QPL+VNIPDLTHPSIYHES SP KNNATALNLAVI
Sbjct: 183 RRKSLAGFEAVLLSLYAHETNRRAGQPLSVNIPDLTHPSIYHESISPHKNNATALNLAVI 242

Query: 242 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRVK 301
           SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA  GD EN  VK
Sbjct: 243 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWA--GDVENRGVK 302

Query: 302 KEEA-------AEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAI 361
           KEEA       AEE+ IGRIPLPWEILQP+LR+LGHCLLGSN I KCKK E T LFDAAI
Sbjct: 303 KEEATVAVVEEAEEDGIGRIPLPWEILQPILRVLGHCLLGSNSIVKCKKKERTALFDAAI 362

Query: 362 AAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
            AIRSLYLRSMHDINPKAILATGSLV+LGNMAMES DEIDYTEIPYQT+INL
Sbjct: 363 GAIRSLYLRSMHDINPKAILATGSLVKLGNMAMESADEIDYTEIPYQTIINL 405

BLAST of HG10011072 vs. NCBI nr
Match: KAA0038931.1 (hyccin [Cucumis melo var. makuwa] >TYK11229.1 hyccin [Cucumis melo var. makuwa])

HSP 1 Score: 639.8 bits (1649), Expect = 1.6e-179
Identity = 351/412 (85.19%), Positives = 361/412 (87.62%), Query Frame = 0

Query: 2   SSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEAP 61
           SSSS DDDSPAAVEP PAEETA  KEP    ETA+EE A     A  EETTPTIA  EAP
Sbjct: 3   SSSSDDDDSPAAVEPTPAEETAENKEP----ETAIEEIA---APADAEETTPTIAAVEAP 62

Query: 62  VIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALLN 121
           V KTSSRASGSGPVVRFDISQSSS TT+AQTAIESL  ILPN IPSSL SAPNPALALLN
Sbjct: 63  VTKTSSRASGSGPVVRFDISQSSSLTTIAQTAIESLIPILPNTIPSSLSSAPNPALALLN 122

Query: 122 DLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVVS 181
           DLET AQITALLRRPTSGAGDDNLCRWLYDTFQS+NPDLKLVVLRFLP+LL AYLSRVVS
Sbjct: 123 DLETIAQITALLRRPTSGAGDDNLCRWLYDTFQSSNPDLKLVVLRFLPVLLSAYLSRVVS 182

Query: 182 RRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAVI 241
           RRKSLAGFEAVLLSLYAHETNRRA QPL+VNIPDLTHPSIYHES SP KNNATALNL VI
Sbjct: 183 RRKSLAGFEAVLLSLYAHETNRRAGQPLSVNIPDLTHPSIYHESISPHKNNATALNLVVI 242

Query: 242 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRVK 301
           SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA  GD EN  VK
Sbjct: 243 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWA--GDVENRGVK 302

Query: 302 KEEA-------AEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAI 361
           KEEA       AEE+ IGRIPLPWEILQP+LR+LGHCLLGSN I KCKK E T LFDAAI
Sbjct: 303 KEEATVAVVEEAEEDGIGRIPLPWEILQPILRVLGHCLLGSNSIVKCKKKERTALFDAAI 362

Query: 362 AAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
            AIRSLYLRSMHDINPKAILATGSLV+LGNMAMES DEIDYTEIPYQT+INL
Sbjct: 363 GAIRSLYLRSMHDINPKAILATGSLVKLGNMAMESADEIDYTEIPYQTIINL 405

BLAST of HG10011072 vs. NCBI nr
Match: XP_004146279.1 (uncharacterized protein LOC101210037 [Cucumis sativus] >KGN57615.1 hypothetical protein Csa_011737 [Cucumis sativus])

HSP 1 Score: 633.6 bits (1633), Expect = 1.1e-177
Identity = 349/417 (83.69%), Positives = 360/417 (86.33%), Query Frame = 0

Query: 1   MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEA 60
           MSSSSS DD  AAVEP PAEETA  KEP    ETA+EE A     A  EETTPTIA  EA
Sbjct: 1   MSSSSSSDDDSAAVEPTPAEETAENKEP----ETAIEEIA---APAEAEETTPTIAAVEA 60

Query: 61  PVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALL 120
           PV KTSSRASGSGPVVRFDISQSSS TT+AQTAIESLK ILPN IPSSL SAPNPALALL
Sbjct: 61  PVTKTSSRASGSGPVVRFDISQSSSLTTIAQTAIESLKPILPNTIPSSLSSAPNPALALL 120

Query: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180
           NDLET AQITALLRRPTSGAGDDNLCRWLYDTFQS+NPDLKLVVLRFLP+LL AYLSRVV
Sbjct: 121 NDLETIAQITALLRRPTSGAGDDNLCRWLYDTFQSSNPDLKLVVLRFLPVLLSAYLSRVV 180

Query: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAV 240
           SRRKSLAGFEAVLLSLYAHETNRRASQPL+VNIPDLTHPSIYHES  P KNNATALNLAV
Sbjct: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLSVNIPDLTHPSIYHESIFPHKNNATALNLAV 240

Query: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRV 300
           ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA  GD  N  V
Sbjct: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWA--GDVNNRGV 300

Query: 301 KKEEAA-----------EEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPL 360
           KKEEA            EE+ IGRIPLPWEILQP+LR+LGHCLLGSN I  CKK E T L
Sbjct: 301 KKEEATAAVVEEAEEEEEEDGIGRIPLPWEILQPILRVLGHCLLGSNSIVNCKKKERTAL 360

Query: 361 FDAAIAAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
           FDAAI AIRSLYLRSMHDINPKAILATGSLV+LG+MAMESTDEIDYTEIPYQT+INL
Sbjct: 361 FDAAIGAIRSLYLRSMHDINPKAILATGSLVKLGDMAMESTDEIDYTEIPYQTIINL 408

BLAST of HG10011072 vs. NCBI nr
Match: XP_023552628.1 (uncharacterized protein LOC111810220 [Cucurbita pepo subsp. pepo] >XP_023552646.1 uncharacterized protein LOC111810232 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 573.9 bits (1478), Expect = 1.1e-159
Identity = 319/411 (77.62%), Positives = 346/411 (84.18%), Query Frame = 0

Query: 1   MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEA 60
           MSSS +D+  PAA E   AEE AA +EPTPA               + EETTP    A+ 
Sbjct: 1   MSSSHADNGRPAAQETTTAEE-AAGQEPTPA--------------VSAEETTP--PAAKQ 60

Query: 61  PVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALL 120
           P  + + R SGSG VVRFD+SQ++S T++AQ+AIESLK ILP NI ++L +APNPALALL
Sbjct: 61  PTARHNRRTSGSGLVVRFDVSQTASLTSIAQSAIESLKLILP-NISAALSAAPNPALALL 120

Query: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180
           +D E TAQITALLR  TSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV
Sbjct: 121 HDTEVTAQITALLRSATSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180

Query: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAV 240
           SRRKSLAGFEAVLLSLYAHETNRRASQPL VNIPDL HPSIYHE+KSPLK NATALNLAV
Sbjct: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLTVNIPDLAHPSIYHETKSPLKYNATALNLAV 240

Query: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRV 300
           ISPSLEPHGMVRSTKRARIVGVALELYYTKI+KIPE+SKI+FCEFCR WA GGDDE+G  
Sbjct: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIEKIPESSKIDFCEFCRLWA-GGDDESGGA 300

Query: 301 KK-----EEAAEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAIA 360
           KK     EE  EEE IG IPLPWEILQP+LR+LGHCLLGSNLITK KKNETTPLF AAIA
Sbjct: 301 KKDEREEEEEEEEEDIGIIPLPWEILQPILRVLGHCLLGSNLITKSKKNETTPLFKAAIA 360

Query: 361 AIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
           AIRSLY+RSMHDINPKAILATGSL+RLGNMAMES DE+DYTEIP QTVINL
Sbjct: 361 AIRSLYVRSMHDINPKAILATGSLMRLGNMAMESGDEVDYTEIPAQTVINL 392

BLAST of HG10011072 vs. ExPASy Swiss-Prot
Match: Q9BYI3 (Hyccin OS=Homo sapiens OX=9606 GN=FAM126A PE=1 SV=2)

HSP 1 Score: 58.9 bits (141), Expect = 1.5e-07
Identity = 50/188 (26.60%), Positives = 85/188 (45.21%), Query Frame = 0

Query: 111 SAPNPALALLNDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPI 170
           S PN A  L +     + +  +++ P S   +  +C  L++ ++S    L    L+FLP 
Sbjct: 23  SLPNYATNLKDKSSLVSSLYKVIQEPQSELLEP-VCHQLFEFYRSGEEQLLQFTLQFLPE 82

Query: 171 LLGAYLSRVVSRRKSLAG-FEAVLLSLYAHE--TNRRASQPLAVNIPDLTHPSIYHESKS 230
           L+  YL+   SR    +G  EA+LL +Y  E    +  ++ L+  IP L+ PS+YHE  S
Sbjct: 83  LIWCYLAVSASRNVHSSGCIEALLLGVYNLEIVDKQGHTKVLSFTIPSLSKPSVYHEPSS 142

Query: 231 ----PLKNNATA---LNLAVISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKI 289
                L  +A +   L+  V S       M+ +  R  ++   L  Y   +  +P  S  
Sbjct: 143 IGSMALTESALSQHGLSKVVYSGPHPQREMLTAQNRFEVLTFLLLCYNAALTYMPSVSLQ 202

BLAST of HG10011072 vs. ExPASy Swiss-Prot
Match: Q6P9N1 (Hyccin OS=Mus musculus OX=10090 GN=Fam126a PE=1 SV=3)

HSP 1 Score: 58.5 bits (140), Expect = 2.0e-07
Identity = 51/188 (27.13%), Positives = 83/188 (44.15%), Query Frame = 0

Query: 111 SAPNPALALLNDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPI 170
           S PN A  L +       +  +++ P S   +  +C  L++ ++S    L    L+FLP 
Sbjct: 23  SLPNYATNLKDKSSLVTSLYKVIQEPQSELLEP-VCHQLFEFYRSGEEQLLRFTLQFLPE 82

Query: 171 LLGAYLSRVVSRRKSLAG-FEAVLLSLYAHE--TNRRASQPLAVNIPDLTHPSIYHESKS 230
           L+  YL+   SR    +G  EA+LL +Y  E       S+ L+  IP L+ PS+YHE  S
Sbjct: 83  LMWCYLAVSASRDVHSSGCIEALLLGVYNLEIVDKHGHSKVLSFTIPSLSKPSVYHEPSS 142

Query: 231 ----PLKNNATA---LNLAVISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKI 289
                L  +A +   L+  V S       M+ +  R  ++   L  Y   +  +P  S  
Sbjct: 143 IGSMALTESALSQHGLSKVVYSGPHPQREMLTAQNRFEVLTFLLLCYNAALTYMPSVSLQ 202

BLAST of HG10011072 vs. ExPASy Swiss-Prot
Match: Q5ZM13 (Hyccin OS=Gallus gallus OX=9031 GN=FAM126A PE=2 SV=2)

HSP 1 Score: 57.0 bits (136), Expect = 5.8e-07
Identity = 74/299 (24.75%), Positives = 127/299 (42.47%), Query Frame = 0

Query: 108 SLPSAPNPALALLNDLETTAQITAL---LRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVV 167
           +LP A   + A  N  + TA I++L   ++ P S   +  +C  L++ ++S    L    
Sbjct: 18  TLPEASISSYA-TNLKDKTALISSLYKVIQEPQSELLEP-VCHQLFEFYRSGEEQLLRFT 77

Query: 168 LRFLPILLGAYLSRVVSRRKSLAG-FEAVLLSLYAHE--TNRRASQPLAVNIPDLTHPSI 227
           L+FLP L+  YL+   SR    +G  EA+LL +Y  E       S+ L+  IP L+ PS+
Sbjct: 78  LQFLPELMWCYLAVSASRDLQSSGCIEALLLGVYNLEIVDKEGHSKVLSFTIPSLSKPSV 137

Query: 228 YHESKS----PLKNNATA---LNLAVISPSLEPHGMVRSTKRARIVGVALELYYTKIDKI 287
           YHE  S     L   A +   L+  V S       M+ +  R  ++   L  Y   +  +
Sbjct: 138 YHEPSSIGSMALTEGALSQHGLSRVVYSGPHPQREMLTAQNRFEVLTFLLLCYNAALSYM 197

Query: 288 PETSKIEFCEFCRKWAGGGDDENGRVKKEEAAEEESIGRIPLPWEILQPVLRLLGHCLLG 347
           P  S    C+ C +    G     +V+K +        RIP+  E +  +L  + +    
Sbjct: 198 PAISLQSLCQICSRICVCGYPRQ-QVRKYKGVN----SRIPVSSEFMVQMLTGIYYAFYN 257

Query: 348 SNLITKCKKNETTPLFDAAIAAIRSLYLRSMHDINPKAILATGSL-VRLGNMAMESTDE 393
                          +D A  A+  +  R+  ++ P+ +L   ++   L   AM+S+ E
Sbjct: 258 GE-------------WDLARKAMDDVLYRAQLELYPEPLLVANAIKASLPQGAMKSSKE 296

BLAST of HG10011072 vs. ExPASy Swiss-Prot
Match: Q6P121 (Hyccin OS=Danio rerio OX=7955 GN=fam126a PE=2 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 9.8e-07
Identity = 38/111 (34.23%), Positives = 60/111 (54.05%), Query Frame = 0

Query: 145 LCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVVSRRKSLAG-FEAVLLSLYAHETNR 204
           +C  L++ ++S  P L+   L+FLP L+ +YLS   +R    +G  EA+LL +Y  E   
Sbjct: 56  VCHQLFEFYRSGEPRLQRFTLQFLPELVWSYLSVTAARDPHCSGCIEALLLGIYNLEIVD 115

Query: 205 R--ASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAVISPSLEPHGMVR 253
           +   S+ L+  IP L+ PS+YHE  S         +LA+   +L  HG+ R
Sbjct: 116 KDGQSKVLSFTIPSLSKPSVYHEPSS-------IGSLALTEGALANHGLSR 159

BLAST of HG10011072 vs. ExPASy TrEMBL
Match: A0A1S3C3J2 (hyccin OS=Cucumis melo OX=3656 GN=LOC103496087 PE=3 SV=1)

HSP 1 Score: 642.9 bits (1657), Expect = 9.1e-181
Identity = 353/412 (85.68%), Positives = 363/412 (88.11%), Query Frame = 0

Query: 2   SSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEAP 61
           SSSSSDDDSPAAVEP PAEETA  KEP    ETA+EE A     A  EETTPTIA  EAP
Sbjct: 3   SSSSSDDDSPAAVEPTPAEETAENKEP----ETAIEEIA---APADAEETTPTIAAVEAP 62

Query: 62  VIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALLN 121
           V KTSSRASGSGPVVRFDISQSSS TT+AQTAIESL  ILPN IPSSL SAPNPALALLN
Sbjct: 63  VTKTSSRASGSGPVVRFDISQSSSLTTIAQTAIESLIPILPNTIPSSLSSAPNPALALLN 122

Query: 122 DLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVVS 181
           DLET AQITALLRRPTSGAGDDNLCRWLYDTFQS+NPDLKLVVLRFLP+LL AYLSRVVS
Sbjct: 123 DLETIAQITALLRRPTSGAGDDNLCRWLYDTFQSSNPDLKLVVLRFLPVLLSAYLSRVVS 182

Query: 182 RRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAVI 241
           RRKSLAGFEAVLLSLYAHETNRRA QPL+VNIPDLTHPSIYHES SP KNNATALNLAVI
Sbjct: 183 RRKSLAGFEAVLLSLYAHETNRRAGQPLSVNIPDLTHPSIYHESISPHKNNATALNLAVI 242

Query: 242 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRVK 301
           SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA  GD EN  VK
Sbjct: 243 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWA--GDVENRGVK 302

Query: 302 KEEA-------AEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAI 361
           KEEA       AEE+ IGRIPLPWEILQP+LR+LGHCLLGSN I KCKK E T LFDAAI
Sbjct: 303 KEEATVAVVEEAEEDGIGRIPLPWEILQPILRVLGHCLLGSNSIVKCKKKERTALFDAAI 362

Query: 362 AAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
            AIRSLYLRSMHDINPKAILATGSLV+LGNMAMES DEIDYTEIPYQT+INL
Sbjct: 363 GAIRSLYLRSMHDINPKAILATGSLVKLGNMAMESADEIDYTEIPYQTIINL 405

BLAST of HG10011072 vs. ExPASy TrEMBL
Match: A0A5D3CLY5 (Hyccin OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold227G00740 PE=3 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 7.7e-180
Identity = 351/412 (85.19%), Positives = 361/412 (87.62%), Query Frame = 0

Query: 2   SSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEAP 61
           SSSS DDDSPAAVEP PAEETA  KEP    ETA+EE A     A  EETTPTIA  EAP
Sbjct: 3   SSSSDDDDSPAAVEPTPAEETAENKEP----ETAIEEIA---APADAEETTPTIAAVEAP 62

Query: 62  VIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALLN 121
           V KTSSRASGSGPVVRFDISQSSS TT+AQTAIESL  ILPN IPSSL SAPNPALALLN
Sbjct: 63  VTKTSSRASGSGPVVRFDISQSSSLTTIAQTAIESLIPILPNTIPSSLSSAPNPALALLN 122

Query: 122 DLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVVS 181
           DLET AQITALLRRPTSGAGDDNLCRWLYDTFQS+NPDLKLVVLRFLP+LL AYLSRVVS
Sbjct: 123 DLETIAQITALLRRPTSGAGDDNLCRWLYDTFQSSNPDLKLVVLRFLPVLLSAYLSRVVS 182

Query: 182 RRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAVI 241
           RRKSLAGFEAVLLSLYAHETNRRA QPL+VNIPDLTHPSIYHES SP KNNATALNL VI
Sbjct: 183 RRKSLAGFEAVLLSLYAHETNRRAGQPLSVNIPDLTHPSIYHESISPHKNNATALNLVVI 242

Query: 242 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRVK 301
           SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA  GD EN  VK
Sbjct: 243 SPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWA--GDVENRGVK 302

Query: 302 KEEA-------AEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAI 361
           KEEA       AEE+ IGRIPLPWEILQP+LR+LGHCLLGSN I KCKK E T LFDAAI
Sbjct: 303 KEEATVAVVEEAEEDGIGRIPLPWEILQPILRVLGHCLLGSNSIVKCKKKERTALFDAAI 362

Query: 362 AAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
            AIRSLYLRSMHDINPKAILATGSLV+LGNMAMES DEIDYTEIPYQT+INL
Sbjct: 363 GAIRSLYLRSMHDINPKAILATGSLVKLGNMAMESADEIDYTEIPYQTIINL 405

BLAST of HG10011072 vs. ExPASy TrEMBL
Match: A0A0A0L6M4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G229410 PE=3 SV=1)

HSP 1 Score: 633.6 bits (1633), Expect = 5.5e-178
Identity = 349/417 (83.69%), Positives = 360/417 (86.33%), Query Frame = 0

Query: 1   MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEA 60
           MSSSSS DD  AAVEP PAEETA  KEP    ETA+EE A     A  EETTPTIA  EA
Sbjct: 1   MSSSSSSDDDSAAVEPTPAEETAENKEP----ETAIEEIA---APAEAEETTPTIAAVEA 60

Query: 61  PVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALL 120
           PV KTSSRASGSGPVVRFDISQSSS TT+AQTAIESLK ILPN IPSSL SAPNPALALL
Sbjct: 61  PVTKTSSRASGSGPVVRFDISQSSSLTTIAQTAIESLKPILPNTIPSSLSSAPNPALALL 120

Query: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180
           NDLET AQITALLRRPTSGAGDDNLCRWLYDTFQS+NPDLKLVVLRFLP+LL AYLSRVV
Sbjct: 121 NDLETIAQITALLRRPTSGAGDDNLCRWLYDTFQSSNPDLKLVVLRFLPVLLSAYLSRVV 180

Query: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAV 240
           SRRKSLAGFEAVLLSLYAHETNRRASQPL+VNIPDLTHPSIYHES  P KNNATALNLAV
Sbjct: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLSVNIPDLTHPSIYHESIFPHKNNATALNLAV 240

Query: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRV 300
           ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCR WA  GD  N  V
Sbjct: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRIWA--GDVNNRGV 300

Query: 301 KKEEAA-----------EEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPL 360
           KKEEA            EE+ IGRIPLPWEILQP+LR+LGHCLLGSN I  CKK E T L
Sbjct: 301 KKEEATAAVVEEAEEEEEEDGIGRIPLPWEILQPILRVLGHCLLGSNSIVNCKKKERTAL 360

Query: 361 FDAAIAAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
           FDAAI AIRSLYLRSMHDINPKAILATGSLV+LG+MAMESTDEIDYTEIPYQT+INL
Sbjct: 361 FDAAIGAIRSLYLRSMHDINPKAILATGSLVKLGDMAMESTDEIDYTEIPYQTIINL 408

BLAST of HG10011072 vs. ExPASy TrEMBL
Match: A0A6J1J402 (uncharacterized protein LOC111483205 OS=Cucurbita maxima OX=3661 GN=LOC111483205 PE=3 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 7.5e-159
Identity = 319/411 (77.62%), Positives = 344/411 (83.70%), Query Frame = 0

Query: 1   MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEA 60
           MSSS +D+  PAA E   AEE AA+ EPTPA               + EET P    A+ 
Sbjct: 1   MSSSPADNGRPAAQEITTAEEAAAQ-EPTPA--------------VSAEETNP--PAAKP 60

Query: 61  PVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALL 120
           P  + + R SGSG VVRFDISQ++S T++AQ+AIESLK ILP NI S+L +APNPALALL
Sbjct: 61  PAARHNRRTSGSGLVVRFDISQTASLTSIAQSAIESLKLILP-NISSALSAAPNPALALL 120

Query: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180
           +D E TAQI ALLR  TSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV
Sbjct: 121 HDTEVTAQIIALLRSSTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180

Query: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAV 240
           SRRKSLAGFEAVLLSLYAHETNRRASQPL VNIPDL HPSIYHE+KSPLK NATALNLAV
Sbjct: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLTVNIPDLAHPSIYHETKSPLKYNATALNLAV 240

Query: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRV 300
           ISPSLEPHGMVRSTKRARIVGVALELYYTKI+KIPE+SKI+FCEFCR WA GGDDE+G  
Sbjct: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIEKIPESSKIDFCEFCRLWA-GGDDESGGA 300

Query: 301 KKEEAAEEES-----IGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAIA 360
           KKEE  EEE      IG IPLPWEILQP+LR+LGHCLLGSNLITK KKNET PLF+AAIA
Sbjct: 301 KKEEREEEEEEEEEYIGIIPLPWEILQPILRVLGHCLLGSNLITKSKKNETRPLFNAAIA 360

Query: 361 AIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
           AIRSLY+RSMHDINPKAILATGSL+RLGNMAMES DEIDYTEIP QTVINL
Sbjct: 361 AIRSLYVRSMHDINPKAILATGSLMRLGNMAMESGDEIDYTEIPAQTVINL 392

BLAST of HG10011072 vs. ExPASy TrEMBL
Match: A0A6J1E958 (uncharacterized protein LOC111430539 OS=Cucurbita moschata OX=3662 GN=LOC111430539 PE=3 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 7.5e-159
Identity = 316/411 (76.89%), Positives = 344/411 (83.70%), Query Frame = 0

Query: 1   MSSSSSDDDSPAAVEPIPAEETAAEKEPTPAEETAVEETAVEETTATTEETTPTIAVAEA 60
           MSSS +D+  PA  E   A+E AA+ EPTPA               + EETTP    A+ 
Sbjct: 1   MSSSHADNGRPAGQETTTAQEAAAQ-EPTPA--------------VSAEETTP--PAAKQ 60

Query: 61  PVIKTSSRASGSGPVVRFDISQSSSSTTLAQTAIESLKSILPNNIPSSLPSAPNPALALL 120
           P  + + R SGSG VVRFD+SQ++S T++AQ+AIESLK ILP NI S+L +APNPALALL
Sbjct: 61  PTARHNRRTSGSGLVVRFDVSQTASLTSIAQSAIESLKLILP-NISSALSAAPNPALALL 120

Query: 121 NDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180
           +D E TAQITALLR  TSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV
Sbjct: 121 HDTEVTAQITALLRSSTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSRVV 180

Query: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLKNNATALNLAV 240
           SRRKSLAGFEAVLLSLYAHETNRRASQPL VNIPDL HPSIYHE+KSPLK NATALNLAV
Sbjct: 181 SRRKSLAGFEAVLLSLYAHETNRRASQPLTVNIPDLAHPSIYHETKSPLKYNATALNLAV 240

Query: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRV 300
           ISPSLEPHGMVRSTKRARIVGVALELYYTKI+KIPE+SKI+FCEFCR WA GGDDE+G  
Sbjct: 241 ISPSLEPHGMVRSTKRARIVGVALELYYTKIEKIPESSKIDFCEFCRLWA-GGDDESGGA 300

Query: 301 KK-----EEAAEEESIGRIPLPWEILQPVLRLLGHCLLGSNLITKCKKNETTPLFDAAIA 360
           KK     EE  EEE IG IPLPWEILQP+LR+LGHCLLGSNLITK KKNETTPLF AAI 
Sbjct: 301 KKDEREEEEEEEEEDIGIIPLPWEILQPILRVLGHCLLGSNLITKSKKNETTPLFKAAIG 360

Query: 361 AIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPYQTVINL 407
           AIRSLY+RSMHDINPKAILATGSL+RLGNMA+ES DE+DYTEIP QTVINL
Sbjct: 361 AIRSLYVRSMHDINPKAILATGSLMRLGNMAIESGDEVDYTEIPAQTVINL 392

BLAST of HG10011072 vs. TAIR 10
Match: AT5G21050.1 (LOCATED IN: chloroplast; EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Hyccin (InterPro:IPR018619); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G64090.1); Has 206 Blast hits to 206 proteins in 60 species: Archae - 0; Bacteria - 0; Metazoa - 145; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 342.8 bits (878), Expect = 3.7e-94
Identity = 196/366 (53.55%), Positives = 251/366 (68.58%), Query Frame = 0

Query: 65  TSSRASGSGPVVRFD-----ISQSSSSTTLAQTAIESLKSILPN-NIPSSLPSAPNPALA 124
           +SS  S   P +  D      +  S S T  QTAI+SL +I+ N NIPS++         
Sbjct: 5   SSSHDSPPSPAITGDSETTVTNNESESNTKCQTAIQSLSTIVTNTNIPSTI-------TI 64

Query: 125 LLNDLETTAQITALLRRPTSGAGDDNLCRWLYDTFQSNNPDLKLVVLRFLPILLGAYLSR 184
           LL+D   +  I++LL RP SGAGD+NLCRWLYDTFQS  P L+L+VLRF+P++ G YLSR
Sbjct: 65  LLDDEAVSTAISSLLLRPDSGAGDNNLCRWLYDTFQSAEPSLQLLVLRFVPLIAGLYLSR 124

Query: 185 VVSRRKSLAGFEAVLLSLYAHETNRRASQPLAVNIPDLTHPSIYHESKSPLK-NNATALN 244
           V  R+   AGFEAVLL+LYAHET  RA Q + VNIPDL++PSIYHESK   + NN+T LN
Sbjct: 125 VPLRQPQ-AGFEAVLLALYAHETTSRAGQAITVNIPDLSYPSIYHESKGLTRNNNSTCLN 184

Query: 245 LAVISPSLEPHGMVRSTKRARIVGVALELYYTKIDKIPETSKIEFCEFCRKWAG--GGDD 304
           +AVIS +L+PHG VRST+RARIVGVALELYY+KI K+P  SK+ FCE C KWAG  G  +
Sbjct: 185 IAVISSTLDPHGTVRSTRRARIVGVALELYYSKISKMPRESKLNFCESCEKWAGQNGETE 244

Query: 305 ENGR-----VKKEEAAEEESI----------GRIPLPWEILQPVLRLLGHCLLGSNLITK 364
           ++ R     +  +   EEE++          GRIPLPWE+LQP+LR+LGHCLLG      
Sbjct: 245 QSSRAVIPTLSDDSWREEENVAIGGRSERDSGRIPLPWELLQPILRILGHCLLG------ 304

Query: 365 CKKNETTPLFDAAIAAIRSLYLRSMHDINPKAILATGSLVRLGNMAMESTDEIDYTEIPY 407
             K E   L +AA  A +SLYLRS+HDINPKAILATGSL+RL  MA++  ++ID+TE+  
Sbjct: 305 -LKMEDRELSEAANKACQSLYLRSLHDINPKAILATGSLLRLREMALDPKNQIDHTELSN 355

BLAST of HG10011072 vs. TAIR 10
Match: AT5G64090.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Hyccin (InterPro:IPR018619); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G21050.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 221.9 bits (564), Expect = 9.6e-58
Identity = 157/412 (38.11%), Positives = 219/412 (53.16%), Query Frame = 0

Query: 45  TATTEETTPTIAVAEAPVIKTSSRASGSGPVVRFDISQSSS---STTLAQTAIESLKSIL 104
           +++T  +TP    +      T++  SG  P    D     S   S +  ++ I SL S+L
Sbjct: 15  SSSTSSSTPHRFKSVTTPTATAAAVSGFSPSAAADRDPMHSWWESVSKQRSRILSLSSLL 74

Query: 105 PNNIP---------SSLPSAPNPALALLNDLETTAQITALLRRPTSGAGDDNLCRWLYDT 164
             +           SSL  +  PAL+LL+     + I+  L  P SG+G D LC+WLY+T
Sbjct: 75  SGDSHFEDGDVTPISSLADSDRPALSLLSSRAAYSLISNSLCNPASGSGSDPLCQWLYET 134

Query: 165 FQSNNPDLKLVVLRFLPILLGAYLSRVVSRRK----SLAGFEAVLLSLYAHETNRRASQP 224
           + S++P L+LVVL F P+L+G YLSR+ S       SL+GFEAVLL++YA E   RA +P
Sbjct: 135 YLSSDPPLRLVVLSFFPLLVGMYLSRIHSSDSTSLPSLSGFEAVLLAIYAAEVKARAGKP 194

Query: 225 LAVNIPDLTHPSIYHESKSPL----KNNATALNLAVISPSLEPHGMVRSTKRARIVGVAL 284
           + V+IPDL+ PS+YH  ++ +     +N TA ++ V+SP LEP   V+STKRA IVGV L
Sbjct: 195 ILVHIPDLSQPSLYHTPRNGVDKSRDSNPTA-SVGVLSPQLEPQIAVKSTKRASIVGVGL 254

Query: 285 ELYYTKIDKIPETSKIEFCEFCRKWAGGGDDENGRVKKEE-------------------- 344
           + Y+ +I ++P  SK+EFC+F   WAG   D   ++ ++E                    
Sbjct: 255 QCYFKEISQMPAWSKLEFCKFSASWAGQDCDCKEKIDEDEDKVLALTNGFGDSSSFNGSS 314

Query: 345 ---------------AAEEESIG------------RIPLPWEILQPVLRLLGHCLLGSNL 390
                             EE +             RIPLPWE+ QP LR+LGHCLL S L
Sbjct: 315 GRSLEIEEDFDRLAIRENEEQLSSNGGGGGVGRGVRIPLPWELFQPTLRILGHCLL-SPL 374

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038879407.1	2.2e-197	90.98	uncharacterized protein LOC120071285 [Benincasa hispida]
XP_008456035.1	1.9e-180	85.68	PREDICTED: hyccin [Cucumis melo]
KAA0038931.1	1.6e-179	85.19	hyccin [Cucumis melo var. makuwa] >TYK11229.1 hyccin [Cucumis melo var. makuwa]
XP_004146279.1	1.1e-177	83.69	uncharacterized protein LOC101210037 [Cucumis sativus] >KGN57615.1 hypothetical ...
XP_023552628.1	1.1e-159	77.62	uncharacterized protein LOC111810220 [Cucurbita pepo subsp. pepo] >XP_023552646....

Match Name	E-value	Identity	Description
Q9BYI3	1.5e-07	26.60	Hyccin OS=Homo sapiens OX=9606 GN=FAM126A PE=1 SV=2
Q6P9N1	2.0e-07	27.13	Hyccin OS=Mus musculus OX=10090 GN=Fam126a PE=1 SV=3
Q5ZM13	5.8e-07	24.75	Hyccin OS=Gallus gallus OX=9031 GN=FAM126A PE=2 SV=2
Q6P121	9.8e-07	34.23	Hyccin OS=Danio rerio OX=7955 GN=fam126a PE=2 SV=1

Match Name	E-value	Identity	Description
A0A1S3C3J2	9.1e-181	85.68	hyccin OS=Cucumis melo OX=3656 GN=LOC103496087 PE=3 SV=1
A0A5D3CLY5	7.7e-180	85.19	Hyccin OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold227G00740 PE=3 SV...
A0A0A0L6M4	5.5e-178	83.69	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G229410 PE=3 SV=1
A0A6J1J402	7.5e-159	77.62	uncharacterized protein LOC111483205 OS=Cucurbita maxima OX=3661 GN=LOC111483205...
A0A6J1E958	7.5e-159	76.89	uncharacterized protein LOC111430539 OS=Cucurbita moschata OX=3662 GN=LOC1114305...

Match Name	E-value	Identity	Description
AT5G21050.1	3.7e-94	53.55	LOCATED IN: chloroplast; EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8 ...
AT5G64090.1	9.6e-58	38.11	FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR018619	Hyccin	PFAM	PF09790	Hyccin	coord: 114..382; e-value: 6.4E-53; score: 180.0
IPR018619	Hyccin	PANTHER	PTHR31220	HYCCIN RELATED	coord: 80..405
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..60
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 40..54
None	No IPR available	PANTHER	PTHR31220:SF10	HYCCIN	coord: 80..405

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10011072.1	HG10011072.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0046854	phosphatidylinositol phosphate biosynthetic process
biological_process	GO:0072659	protein localization to plasma membrane
cellular_component	GO:0005829	cytosol
cellular_component	GO:0005886	plasma membrane