HG10021072 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021072
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSPARK domain-containing protein
LocationChr05: 5099003 .. 5103382 (+)
RNA-Seq ExpressionHG10021072
SyntenyHG10021072
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACCGAAAACTCGCACGCTTGGAAGTTGCCACGCGTGAGAGACCAAGAGTGCCCAGTGGAGTTACTCCTCACACTTATCACGCGTGCGAGGTAAAAAATCAAAAAGAAAAATAAAATAAAATAAAATAAAAAATAAAGTTTTGTGGGAAAGGGGGGTTGGGAAAAAGCAGCTGCTGCCATTAATGCAATTTCGTTCTTCTTCTTCTTCTTCCATCTCAAAACTTTTTTTTTTCTTCTCTTCGTCTTGCAGATCTGTCCGTCTGTGGGTCCATCCAGACAGCTATTACTTGAGCTGCTTTCGTCCCCACACACTTTCAAGCTCATTCATGGCTTCTCTACTCTCCACTCTTTTCTCTCTCTTATTCCATCTCTCTATCTCCCCACCTTCTCAATTCCTTCACTCACTCAACCACTACCACTCCTCCAATCCCAATTTTAATGCTTCTTAACCCCTTTTCCCTCTCTTTCCCTCCTATTCTCTTCATTCCCACTTCCAATTTCCCTTCCTCCTCCTTTCTACCTTCTTCTTTCGAAGTCCCAGAAACTGGAAAGTGGAGACTGAACATATAAACATTTTAAAATGGAAGACAGATTGTTTCTTAAGCTTGGTTTCGATGCCTTTCTTCTCCAGGTGTTACTGCTCCTTTGTGAGTCTTCCCAACTTTTTTTTTTGGAACGTTTTTCTATGTGGGGTTCACCTCTTTTTGCTCGATCTGTTGCCTTTTCTCGAGCTTGTTGTTATTCTAGTGCTGAGGTTTAGGAAATTTTGTTCTGCTTGGAGAATGTTGCCTTTATTTCGTACAATGTTTCTTGGGTTTGAGAAAATAATAAGAATGTACTTCGAAAATTTAACCATTATTACTAGCTTCTGTTATGTGGTGGATAACGTGCTTCATCTCTGCACTTCTTTCTCGACATTCTTGATTTGCTTATTTCTTTGTTATTTATTTATTTATTGGATAGGATAGATGAGCTTTCCTTATGGAAAGAAACAGCTTTTGGTACCATATTTAGCACTGTTATTATTAACTTTTTTTTTTCCATCTCTCATTTAATTAACGGCTAGATTAAATGATTAGTGATTCAGCTGTCATTGTTGTGCTAATGGGTAGTATGCATATTCTTTTGAAGGAGCTGATGATACAAGCAAGATCTTATTGGGTTATCTTCTAAATTCATAACTCGAGCATATATGGAAAATCTGGCGTAGTAATTTAACTCTGTACTCTGGAATTTATGTTAACATGTTGGCTTAAAAGAAAATTTCGGAGCCAGGGTCTGGATTATCTATCCTGACCCGGTAATCTTAGCTTTGTTATTCTTGTAGAATACAAAAGTTAATGCAACAGAAAAATGAGTGGGGAACAAAATTTTAGCAAGAAAGGAAATCCAAACCCAAACCCAAAGCATGTATTTTGTATCATTGAGATAATGTTACTCCTAGTGTAAGTGTTTGACTACTGAGAAATTTGTGCTCGATTTAATTGAATGTCACAAGGTTTACAAATGGAACAGTAACTGTGAATGAAAAATATAGAGAGAGCGAAGTTATTGAAAGATATGTGTATAAACATATTGGTAGTATTCAAATAGTTGTGCTGACCAGCTAGCTGTTTCGCCGGAGTAACTTCTTAAAAGAGTGATTTTGTATTCTTTTCACATGTAATTATTTTTAGACAGAGTGATATTGTGTTATCTATTAAAATGGAAAATGAAAGCAGCATGAAGCTGTTAATAGCATGTTTGGATTTGCCTGTACCCTTACCACCAAACTAGGTTCTACAACCACTCCTCTCCCATGATTTTCTTGGAGATGAGTTCTCCAAGGTATTGCTTGAAATTATTGAATTAGCTCCTCCACTTTCTTGTGTCATTGCCAATATATTATCCATGTTTGGTATCTCATCATATGGCAATAAAACCCATCAATGAAATTTTTTGCTAAATCCTCTAAAAACGATTCTGTTCGTCTCTGAATTATAATATTTATGCCAATTTGAACCGTAGAACATCATTGTACAATAAACTAAAAAGGCTTATACTTGTATGTTTAACACGGTATAATATCTTACTAGGTTTTTTTTTTTCCTTTTCGTTGTAGATTTACATGAAACTAGCTGCATCCCATCAACTTATCCAACACGGCATCTGTCAAATGAGAAACCAATGGATGATATGTATCCTGTGATTGCTCCAAGTGGGAATCCAAAGCCATTCATTCCTCTCCTTGCCCCTTCTCCATTGGTACCCTTCACAAATACCACCATTCCAAAGTTATCAGGTTTATCATTTTTTATAATCAGTTTTAATCTATTATATACTTGATGTCAATTGTTTTCTAGAAAGAGGGGTTTTAATGATTTCCCTTGATTATTCTTTTCAATTTAATCTCTCATTTTCCTTACAGGGCAATGTTTGTTGAACTTCTCTGCAACGGAAACTTTGATGAGTATGACTGCTATAGATTGTTGGGCCCCTTTTGCAAAACAGATGGCCAATGTTATCTGTTGTCCACAGTTGGAAGCTACTCTTGCTATCCTTATTGGTCAATCCAGTAAAGATACTAATGTTCTTGCTTTAAATGGGACGCTTGCCAAATATTGCCTCTCAGATATTGAGCAGATTTTGGTGGGGCAGGGTGCAAGTGAAAGTCTCAGACACATATGCACAGTTCATCCAGCCAATCTCACTGAAGGATCTTGTCCAGCCAAAGATATTAGTGAATTTGAGACCACGGTTGATACCTCTAAGTTGCTTGCTGCGTGTAATAAAATTGATCCTGTGAAAGAATGCTGCAATGCAATCTGTCAGAATGCCATTTCAGAAGCCGCTACAAAGATAGCTATGATATCTACAGATTTCTTGGGTATGCCTGGATCTCAAGTCTTGCCTGAACAGTCAACTAGGGTTCGTGATTGTAAAACTATTGTCCTCCGATGGCTAGCAAGCAAACTTCACCCTGCTAATGCCAAGGAGGTTCTTAGAGTACTGTCCAATTGCAATGTTAATAAAGGTGAGCTTGAAAGCTAATTTTTTCAAACAGAGATTTATGAAATTCTTGTGATTTCACTATTATAATTCATCACATCTCTATGTAGTTGTTGATCCTATGCCCATTTTCCTTTTTGGTCAACGCCCACAACATTTTTAATTTTTCAAAAGTAGAGTTTGTTTCTCTTCATATTCGCTCTGGTCTGTTACCACTCACTGCTGATGCTATTATCTTTAGCTGTCACATTATCATATATGCACATACCTTTACAGATTTTCTCTGTGTACTATTCTCAGTTTGTCCGCTGGAATTCCCCGACATGAAATATGTTGCCAATGCTTGTGGGAATGCGATCTCTAACAAGACAGCGTGCTGCCTGGCCATGGAAAGCTATGTCACTCATCTGCAAAAGCAGAGCCTTGTTACCAACCTGCAAGCTTTGGATTGTGCTACAGCATTGGAAATGAAATTACGAAAGTCAAATATTACCAAGGACGTGTATGGCCTATGCCATATAAGCCTCAAGGATTTCTCCCTTCAAGGTTTGTAGCTTTTGTCCTCTTATATTTCCAGATTAAAAGAATTCCTTCTTGAATAATTTCACATGGATTCTAATACTTTTTTCCTTTCTTGCCCATGTTATAACTTGATAATGGATCTACAACTTTGAGCATCTCAGTTGGAAATCAAGGTACTATTTCAAGACATACATTCCTGGCATATGCTAAAAAAAGGATAGTCATCATCTATAAAAAATTACTTATTCCATATTCCTTCAAGCTAACCTTTGATTACATCATTGGCTTTTGTTGGCTTAAAAATTATTACAAGACATACCATTGGGTTGTTTTTATATTTTTTAACCCCAAAAAGATTGATGTCAATGCGTAGATACCAAATGCAGATTTTAGCCAATTTTGAAATTTCAAGTGATTCATTTATGGCCCTCACATTCAATCTCTAACTTATATTATCCTTCTTGCCAGAGTTTGGATGCCTTCTACCTAGCCTGCCTTCAGATGCCATATTTGATCCATCTTCAGGTATCAGCTTTGTTTGTGATCTGAATGACCATATTCCAGCTCCATGGTCTTCTACATCCCAAATGACAGCATCGTCATGCAATAAGAGTAAGACACCTTCTTCTCTCTCTCTCTCTCTTTAATTGTTTTTTTAACTCCCCGTATTGGGTATGGGTATGGGTATCAAATACCCTTAACCAGCTTTCCCACCTTTATAAGAAAATATCAATGTTTTGCCGTCTTAAATCTCTAATTGGTTCACCTTTTTGGATTGCAGCTATCAAAATTCCTGCACTTCCTGCAGCAGCATCTTCTCAAAGCGGTACCTATATTCTTGAACTATCCTTGAATTTCTCTACATAA

mRNA sequence

ATGAACCGAAAACTCGCACGCTTGGAAGTTGCCACGCGTGAGAGACCAAGAGTGCCCAGTGGAGTTACTCCTCACACTTATCACGCGTGCGAGCATGAAGCTGTTAATAGCATGTTTGGATTTGCCTGTACCCTTACCACCAAACTAGATTTACATGAAACTAGCTGCATCCCATCAACTTATCCAACACGGCATCTGTCAAATGAGAAACCAATGGATGATATGTATCCTGTGATTGCTCCAAGTGGGAATCCAAAGCCATTCATTCCTCTCCTTGCCCCTTCTCCATTGGTACCCTTCACAAATACCACCATTCCAAAGTTATCAGGGCAATGTTTGTTGAACTTCTCTGCAACGGAAACTTTGATGAGTATGACTGCTATAGATTGTTGGGCCCCTTTTGCAAAACAGATGGCCAATGTTATCTGTTGTCCACAGTTGGAAGCTACTCTTGCTATCCTTATTGGTCAATCCAGTAAAGATACTAATGTTCTTGCTTTAAATGGGACGCTTGCCAAATATTGCCTCTCAGATATTGAGCAGATTTTGGTGGGGCAGGGTGCAAGTGAAAGTCTCAGACACATATGCACAGTTCATCCAGCCAATCTCACTGAAGGATCTTGTCCAGCCAAAGATATTAGTGAATTTGAGACCACGGTTGATACCTCTAAGTTGCTTGCTGCGTGTAATAAAATTGATCCTGTGAAAGAATGCTGCAATGCAATCTGTCAGAATGCCATTTCAGAAGCCGCTACAAAGATAGCTATGATATCTACAGATTTCTTGGGTATGCCTGGATCTCAAGTCTTGCCTGAACAGTCAACTAGGGTTCGTGATTGTAAAACTATTGTCCTCCGATGGCTAGCAAGCAAACTTCACCCTGCTAATGCCAAGGAGGTTCTTAGAGTACTGTCCAATTGCAATGTTAATAAAGTTTGTCCGCTGGAATTCCCCGACATGAAATATGTTGCCAATGCTTGTGGGAATGCGATCTCTAACAAGACAGCGTGCTGCCTGGCCATGGAAAGCTATGTCACTCATCTGCAAAAGCAGAGCCTTGTTACCAACCTGCAAGCTTTGGATTGTGCTACAGCATTGGAAATGAAATTACGAAAGTCAAATATTACCAAGGACGTGTATGGCCTATGCCATATAAGCCTCAAGGATTTCTCCCTTCAAGAGTTTGGATGCCTTCTACCTAGCCTGCCTTCAGATGCCATATTTGATCCATCTTCAGGTATCAGCTTTGTTTGTGATCTGAATGACCATATTCCAGCTCCATGGTCTTCTACATCCCAAATGACAGCATCGTCATGCAATAAGACTATCAAAATTCCTGCACTTCCTGCAGCAGCATCTTCTCAAAGCGGTACCTATATTCTTGAACTATCCTTGAATTTCTCTACATAA

Coding sequence (CDS)

ATGAACCGAAAACTCGCACGCTTGGAAGTTGCCACGCGTGAGAGACCAAGAGTGCCCAGTGGAGTTACTCCTCACACTTATCACGCGTGCGAGCATGAAGCTGTTAATAGCATGTTTGGATTTGCCTGTACCCTTACCACCAAACTAGATTTACATGAAACTAGCTGCATCCCATCAACTTATCCAACACGGCATCTGTCAAATGAGAAACCAATGGATGATATGTATCCTGTGATTGCTCCAAGTGGGAATCCAAAGCCATTCATTCCTCTCCTTGCCCCTTCTCCATTGGTACCCTTCACAAATACCACCATTCCAAAGTTATCAGGGCAATGTTTGTTGAACTTCTCTGCAACGGAAACTTTGATGAGTATGACTGCTATAGATTGTTGGGCCCCTTTTGCAAAACAGATGGCCAATGTTATCTGTTGTCCACAGTTGGAAGCTACTCTTGCTATCCTTATTGGTCAATCCAGTAAAGATACTAATGTTCTTGCTTTAAATGGGACGCTTGCCAAATATTGCCTCTCAGATATTGAGCAGATTTTGGTGGGGCAGGGTGCAAGTGAAAGTCTCAGACACATATGCACAGTTCATCCAGCCAATCTCACTGAAGGATCTTGTCCAGCCAAAGATATTAGTGAATTTGAGACCACGGTTGATACCTCTAAGTTGCTTGCTGCGTGTAATAAAATTGATCCTGTGAAAGAATGCTGCAATGCAATCTGTCAGAATGCCATTTCAGAAGCCGCTACAAAGATAGCTATGATATCTACAGATTTCTTGGGTATGCCTGGATCTCAAGTCTTGCCTGAACAGTCAACTAGGGTTCGTGATTGTAAAACTATTGTCCTCCGATGGCTAGCAAGCAAACTTCACCCTGCTAATGCCAAGGAGGTTCTTAGAGTACTGTCCAATTGCAATGTTAATAAAGTTTGTCCGCTGGAATTCCCCGACATGAAATATGTTGCCAATGCTTGTGGGAATGCGATCTCTAACAAGACAGCGTGCTGCCTGGCCATGGAAAGCTATGTCACTCATCTGCAAAAGCAGAGCCTTGTTACCAACCTGCAAGCTTTGGATTGTGCTACAGCATTGGAAATGAAATTACGAAAGTCAAATATTACCAAGGACGTGTATGGCCTATGCCATATAAGCCTCAAGGATTTCTCCCTTCAAGAGTTTGGATGCCTTCTACCTAGCCTGCCTTCAGATGCCATATTTGATCCATCTTCAGGTATCAGCTTTGTTTGTGATCTGAATGACCATATTCCAGCTCCATGGTCTTCTACATCCCAAATGACAGCATCGTCATGCAATAAGACTATCAAAATTCCTGCACTTCCTGCAGCAGCATCTTCTCAAAGCGGTACCTATATTCTTGAACTATCCTTGAATTTCTCTACATAA

Protein sequence

MNRKLARLEVATRERPRVPSGVTPHTYHACEHEAVNSMFGFACTLTTKLDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKLSGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALNGTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAACNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHLQKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQEFGCLLPSLPSDAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTYILELSLNFST
Homology
BLAST of HG10021072 vs. NCBI nr
Match: XP_038894855.1 (uncharacterized GPI-anchored protein At1g61900 [Benincasa hispida])

HSP 1 Score: 804.3 bits (2076), Expect = 5.6e-229
Identity = 401/415 (96.63%), Postives = 404/415 (97.35%), Query Frame = 0

Query: 49  LDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKL 108
           L  HE SCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTT+PKL
Sbjct: 23  LYFHEASCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTVPKL 82

Query: 109 SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALN 168
           SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSS DTNVLALN
Sbjct: 83  SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSIDTNVLALN 142

Query: 169 GTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 228
           GTLAKYCLSDIEQILVGQGASE LRHICTVHPANLTEGSCPAKDISEFETT+DTSKLLAA
Sbjct: 143 GTLAKYCLSDIEQILVGQGASERLRHICTVHPANLTEGSCPAKDISEFETTIDTSKLLAA 202

Query: 229 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 288
           CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQS RVRDCKTIVLRWL
Sbjct: 203 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSARVRDCKTIVLRWL 262

Query: 289 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 348
           ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL
Sbjct: 263 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 322

Query: 349 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPS 408
           QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL    QEFGCLLPSLPS
Sbjct: 323 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQVGNQEFGCLLPSLPS 382

Query: 409 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAAS+QSG Y
Sbjct: 383 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASAQSGLY 437

BLAST of HG10021072 vs. NCBI nr
Match: XP_004145108.1 (uncharacterized GPI-anchored protein At1g61900 isoform X2 [Cucumis sativus] >KGN64526.1 hypothetical protein Csa_013461 [Cucumis sativus])

HSP 1 Score: 798.5 bits (2061), Expect = 3.1e-227
Identity = 396/415 (95.42%), Postives = 403/415 (97.11%), Query Frame = 0

Query: 49  LDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKL 108
           L  HETSC+PSTYPT+HLSNEKPMDDMYP IAPSGNPKPF+P LAPSPLVPFTNTT+PKL
Sbjct: 23  LHFHETSCLPSTYPTQHLSNEKPMDDMYPEIAPSGNPKPFLPFLAPSPLVPFTNTTVPKL 82

Query: 109 SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALN 168
           SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDT+VLALN
Sbjct: 83  SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTSVLALN 142

Query: 169 GTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 228
           GTLAKYCLSDIEQILVGQGASE LRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA
Sbjct: 143 GTLAKYCLSDIEQILVGQGASERLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 202

Query: 229 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 288
           CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL
Sbjct: 203 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 262

Query: 289 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 348
           ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKTACCLAME YVTHL
Sbjct: 263 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTACCLAMEGYVTHL 322

Query: 349 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPS 408
           QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL    QEFGCLLPSLPS
Sbjct: 323 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQVGNQEFGCLLPSLPS 382

Query: 409 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAAS Q+G Y
Sbjct: 383 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASGQTGLY 437

BLAST of HG10021072 vs. NCBI nr
Match: XP_031739069.1 (uncharacterized GPI-anchored protein At1g61900 isoform X1 [Cucumis sativus])

HSP 1 Score: 798.1 bits (2060), Expect = 4.0e-227
Identity = 396/414 (95.65%), Postives = 403/414 (97.34%), Query Frame = 0

Query: 49  LDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKL 108
           L  HETSC+PSTYPT+HLSNEKPMDDMYP IAPSGNPKPF+P LAPSPLVPFTNTT+PKL
Sbjct: 23  LHFHETSCLPSTYPTQHLSNEKPMDDMYPEIAPSGNPKPFLPFLAPSPLVPFTNTTVPKL 82

Query: 109 SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALN 168
           SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDT+VLALN
Sbjct: 83  SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTSVLALN 142

Query: 169 GTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 228
           GTLAKYCLSDIEQILVGQGASE LRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA
Sbjct: 143 GTLAKYCLSDIEQILVGQGASERLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 202

Query: 229 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 288
           CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL
Sbjct: 203 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 262

Query: 289 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 348
           ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKTACCLAME YVTHL
Sbjct: 263 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTACCLAMEGYVTHL 322

Query: 349 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPS 408
           QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL    QEFGCLLPSLPS
Sbjct: 323 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQVGNQEFGCLLPSLPS 382

Query: 409 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGT 459
           DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAAS Q+GT
Sbjct: 383 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASGQTGT 436

BLAST of HG10021072 vs. NCBI nr
Match: XP_008441110.1 (PREDICTED: uncharacterized GPI-anchored protein At1g61900 [Cucumis melo])

HSP 1 Score: 797.7 bits (2059), Expect = 5.3e-227
Identity = 397/415 (95.66%), Postives = 403/415 (97.11%), Query Frame = 0

Query: 49  LDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKL 108
           L  HETSC+PSTY TRHLSNEKPMDDMYP IAPSGNPKPF+P LAPSPLVPFTNTT+PKL
Sbjct: 23  LYFHETSCLPSTYSTRHLSNEKPMDDMYPEIAPSGNPKPFLPFLAPSPLVPFTNTTVPKL 82

Query: 109 SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALN 168
           SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSS+DTNVLALN
Sbjct: 83  SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSQDTNVLALN 142

Query: 169 GTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 228
           GTLAKYCLSDIEQILVGQGASE LRHICTVHPANLTEGSCPAKDISEFE TVDTSKLLAA
Sbjct: 143 GTLAKYCLSDIEQILVGQGASERLRHICTVHPANLTEGSCPAKDISEFEATVDTSKLLAA 202

Query: 229 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 288
           CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL
Sbjct: 203 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 262

Query: 289 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 348
           ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKTACCLAMESYVTHL
Sbjct: 263 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTACCLAMESYVTHL 322

Query: 349 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPS 408
           QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL    QEFGCLLPSLPS
Sbjct: 323 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQVGNQEFGCLLPSLPS 382

Query: 409 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAAS+QSG Y
Sbjct: 383 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASAQSGLY 437

BLAST of HG10021072 vs. NCBI nr
Match: XP_023001254.1 (uncharacterized GPI-anchored protein At1g61900 isoform X2 [Cucurbita maxima])

HSP 1 Score: 786.6 bits (2030), Expect = 1.2e-223
Identity = 390/426 (91.55%), Postives = 406/426 (95.31%), Query Frame = 0

Query: 44  TLTTKLDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNT 103
           TL   L  HE SCIPSTYPTRHLSNEKPMDDMYP IAPSGNPKPF+PLLAPSPL PFTNT
Sbjct: 52  TLLLLLYFHEASCIPSTYPTRHLSNEKPMDDMYPEIAPSGNPKPFLPLLAPSPLAPFTNT 111

Query: 104 TIPKLSGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTN 163
           T+P LSGQCLLNFSATETLM +TA+DCWAPFAKQMANVICCPQLEATLAILIGQSSKDT 
Sbjct: 112 TVPNLSGQCLLNFSATETLMGVTAMDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTY 171

Query: 164 VLALNGTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTS 223
           VLALNGTLA+YCLSDIEQILVGQGA+E L+HIC VHPANLTEGSCPAKD+SEFETTVDTS
Sbjct: 172 VLALNGTLAEYCLSDIEQILVGQGANERLKHICRVHPANLTEGSCPAKDVSEFETTVDTS 231

Query: 224 KLLAACNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTI 283
           KLLAACNKIDPVKECCNAICQNAISEAATK+AMISTDFLGMPGSQVLPEQS RVRDCKTI
Sbjct: 232 KLLAACNKIDPVKECCNAICQNAISEAATKLAMISTDFLGMPGSQVLPEQSARVRDCKTI 291

Query: 284 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMES 343
           VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKT CCLAMES
Sbjct: 292 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTGCCLAMES 351

Query: 344 YVTHLQKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLL 403
           YVTHLQKQSLVTNLQALDCAT+LEMKLRKSNITK+VY LCHISLKDFSL    QEFGCLL
Sbjct: 352 YVTHLQKQSLVTNLQALDCATSLEMKLRKSNITKNVYDLCHISLKDFSLQVGNQEFGCLL 411

Query: 404 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 463
           PSLPSDAIFDPSSGISFVCDLNDHIPAPWSS++QMTASSCNKTIKIPALPAAAS+QSGTY
Sbjct: 412 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSSTQMTASSCNKTIKIPALPAAASAQSGTY 471

Query: 464 ILELSL 466
           IL+LSL
Sbjct: 472 ILQLSL 477

BLAST of HG10021072 vs. ExPASy Swiss-Prot
Match: Q8GUI4 (Uncharacterized GPI-anchored protein At1g61900 OS=Arabidopsis thaliana OX=3702 GN=At1g61900 PE=2 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 1.1e-137
Identity = 236/393 (60.05%), Postives = 306/393 (77.86%), Query Frame = 0

Query: 67  SNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKLSGQCLLNFSATETLMSMT 126
           S++KP ++  P I+P  +P+PF+P +APSP+VP+ N+T+PKLSG C LNFSA+E+L+  T
Sbjct: 25  SSQKP-EEFLPEISPDTSPQPFLPFIAPSPMVPYINSTMPKLSGLCSLNFSASESLIQTT 84

Query: 127 AIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALNGTLAKYCLSDIEQILVGQ 186
           + +CW  FA  +ANV+CCPQL+ATL I++G++SK+T +LALN T +K+CLSD+EQILVG+
Sbjct: 85  SHNCWTVFAPLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGK 144

Query: 187 GASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAACNKIDPVKECCNAICQNA 246
           GAS  L  IC++H +NLT  SCP  ++ EFE+TVDT+KLL AC KIDPVKECC   CQNA
Sbjct: 145 GASGQLNKICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNA 204

Query: 247 ISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWLASKLHPANAKEVLRVLSN 306
           I +AAT I+        +  S+ L + S R+ DCK +V RWLA+KL P+  KE LR L+N
Sbjct: 205 ILDAATNIS--------LKASETLTDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLAN 264

Query: 307 CNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHLQKQSLVTNLQALDCATAL 366
           C +N+VCPL FP MK++   C N +SN+T CC AMESYV+HLQKQ+L+TNLQALDCAT+L
Sbjct: 265 CKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSL 324

Query: 367 EMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPSDAIFDPSSGISFVCDLND 426
             KL+K NITK+++ +CHISLKDFSL    QE GCLLPSLPSDAIFD  +GISF CDLND
Sbjct: 325 GTKLQKLNITKNIFSVCHISLKDFSLQVGNQESGCLLPSLPSDAIFDKDTGISFTCDLND 384

Query: 427 HIPAPWSSTSQMTASSCNKTIKIPALPAAASSQ 456
           +IPAPW S+S  +AS+C K ++IPALPAAASSQ
Sbjct: 385 NIPAPWPSSSLSSASTCKKPVRIPALPAAASSQ 408

BLAST of HG10021072 vs. ExPASy TrEMBL
Match: A0A0A0LX36 (SPARK domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G063500 PE=4 SV=1)

HSP 1 Score: 798.5 bits (2061), Expect = 1.5e-227
Identity = 396/415 (95.42%), Postives = 403/415 (97.11%), Query Frame = 0

Query: 49  LDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKL 108
           L  HETSC+PSTYPT+HLSNEKPMDDMYP IAPSGNPKPF+P LAPSPLVPFTNTT+PKL
Sbjct: 23  LHFHETSCLPSTYPTQHLSNEKPMDDMYPEIAPSGNPKPFLPFLAPSPLVPFTNTTVPKL 82

Query: 109 SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALN 168
           SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDT+VLALN
Sbjct: 83  SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTSVLALN 142

Query: 169 GTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 228
           GTLAKYCLSDIEQILVGQGASE LRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA
Sbjct: 143 GTLAKYCLSDIEQILVGQGASERLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 202

Query: 229 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 288
           CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL
Sbjct: 203 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 262

Query: 289 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 348
           ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKTACCLAME YVTHL
Sbjct: 263 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTACCLAMEGYVTHL 322

Query: 349 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPS 408
           QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL    QEFGCLLPSLPS
Sbjct: 323 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQVGNQEFGCLLPSLPS 382

Query: 409 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAAS Q+G Y
Sbjct: 383 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASGQTGLY 437

BLAST of HG10021072 vs. ExPASy TrEMBL
Match: A0A1S3B280 (uncharacterized GPI-anchored protein At1g61900 OS=Cucumis melo OX=3656 GN=LOC103485333 PE=4 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 2.5e-227
Identity = 397/415 (95.66%), Postives = 403/415 (97.11%), Query Frame = 0

Query: 49  LDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKL 108
           L  HETSC+PSTY TRHLSNEKPMDDMYP IAPSGNPKPF+P LAPSPLVPFTNTT+PKL
Sbjct: 23  LYFHETSCLPSTYSTRHLSNEKPMDDMYPEIAPSGNPKPFLPFLAPSPLVPFTNTTVPKL 82

Query: 109 SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALN 168
           SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSS+DTNVLALN
Sbjct: 83  SGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSQDTNVLALN 142

Query: 169 GTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAA 228
           GTLAKYCLSDIEQILVGQGASE LRHICTVHPANLTEGSCPAKDISEFE TVDTSKLLAA
Sbjct: 143 GTLAKYCLSDIEQILVGQGASERLRHICTVHPANLTEGSCPAKDISEFEATVDTSKLLAA 202

Query: 229 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 288
           CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL
Sbjct: 203 CNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWL 262

Query: 289 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHL 348
           ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKTACCLAMESYVTHL
Sbjct: 263 ASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTACCLAMESYVTHL 322

Query: 349 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPS 408
           QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL    QEFGCLLPSLPS
Sbjct: 323 QKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQVGNQEFGCLLPSLPS 382

Query: 409 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAAS+QSG Y
Sbjct: 383 DAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASAQSGLY 437

BLAST of HG10021072 vs. ExPASy TrEMBL
Match: A0A6J1KM81 (uncharacterized GPI-anchored protein At1g61900 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111495441 PE=4 SV=1)

HSP 1 Score: 786.6 bits (2030), Expect = 5.9e-224
Identity = 390/426 (91.55%), Postives = 406/426 (95.31%), Query Frame = 0

Query: 44  TLTTKLDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNT 103
           TL   L  HE SCIPSTYPTRHLSNEKPMDDMYP IAPSGNPKPF+PLLAPSPL PFTNT
Sbjct: 52  TLLLLLYFHEASCIPSTYPTRHLSNEKPMDDMYPEIAPSGNPKPFLPLLAPSPLAPFTNT 111

Query: 104 TIPKLSGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTN 163
           T+P LSGQCLLNFSATETLM +TA+DCWAPFAKQMANVICCPQLEATLAILIGQSSKDT 
Sbjct: 112 TVPNLSGQCLLNFSATETLMGVTAMDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTY 171

Query: 164 VLALNGTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTS 223
           VLALNGTLA+YCLSDIEQILVGQGA+E L+HIC VHPANLTEGSCPAKD+SEFETTVDTS
Sbjct: 172 VLALNGTLAEYCLSDIEQILVGQGANERLKHICRVHPANLTEGSCPAKDVSEFETTVDTS 231

Query: 224 KLLAACNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTI 283
           KLLAACNKIDPVKECCNAICQNAISEAATK+AMISTDFLGMPGSQVLPEQS RVRDCKTI
Sbjct: 232 KLLAACNKIDPVKECCNAICQNAISEAATKLAMISTDFLGMPGSQVLPEQSARVRDCKTI 291

Query: 284 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMES 343
           VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKT CCLAMES
Sbjct: 292 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTGCCLAMES 351

Query: 344 YVTHLQKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLL 403
           YVTHLQKQSLVTNLQALDCAT+LEMKLRKSNITK+VY LCHISLKDFSL    QEFGCLL
Sbjct: 352 YVTHLQKQSLVTNLQALDCATSLEMKLRKSNITKNVYDLCHISLKDFSLQVGNQEFGCLL 411

Query: 404 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 463
           PSLPSDAIFDPSSGISFVCDLNDHIPAPWSS++QMTASSCNKTIKIPALPAAAS+QSGTY
Sbjct: 412 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSSTQMTASSCNKTIKIPALPAAASAQSGTY 471

Query: 464 ILELSL 466
           IL+LSL
Sbjct: 472 ILQLSL 477

BLAST of HG10021072 vs. ExPASy TrEMBL
Match: A0A6J1F438 (uncharacterized GPI-anchored protein At1g61900 OS=Cucurbita moschata OX=3662 GN=LOC111440054 PE=4 SV=1)

HSP 1 Score: 777.7 bits (2007), Expect = 2.7e-221
Identity = 385/420 (91.67%), Postives = 400/420 (95.24%), Query Frame = 0

Query: 44  TLTTKLDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNT 103
           TL   L  HETSCIPST PTRHLSNEKPMDDMYP IAPSGNPKPF+PLLAPSPL PFTNT
Sbjct: 18  TLLLLLYFHETSCIPSTSPTRHLSNEKPMDDMYPEIAPSGNPKPFLPLLAPSPLAPFTNT 77

Query: 104 TIPKLSGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTN 163
           T+P LSGQCLLNFSATETLM +TA+DCWAPFAKQMANVICCPQLEATLAILIGQSSKDTN
Sbjct: 78  TVPNLSGQCLLNFSATETLMGVTAMDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTN 137

Query: 164 VLALNGTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTS 223
           VLALNGTLA+YCLSDIEQILVGQGA+E L+HIC VHPANLTEGSCPAKD+SEFETTVDTS
Sbjct: 138 VLALNGTLAEYCLSDIEQILVGQGANERLKHICRVHPANLTEGSCPAKDVSEFETTVDTS 197

Query: 224 KLLAACNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTI 283
           KLLAACNKIDPVKECCNAICQNAISEAATK+AMISTDFLGMPGSQVLPEQS RVRDCKTI
Sbjct: 198 KLLAACNKIDPVKECCNAICQNAISEAATKLAMISTDFLGMPGSQVLPEQSARVRDCKTI 257

Query: 284 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMES 343
           VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKT CCLAMES
Sbjct: 258 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTGCCLAMES 317

Query: 344 YVTHLQKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLL 403
           YVTHLQKQSLVTNLQALDCAT+LEMKLRKSNITK+VY LCHISLKDFSL    QEFGCLL
Sbjct: 318 YVTHLQKQSLVTNLQALDCATSLEMKLRKSNITKNVYDLCHISLKDFSLQVGNQEFGCLL 377

Query: 404 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           PSLPSDAIFDPSSGISFVCDLNDHIPAPWSS++QMTASSCNKTIKIPALPAAAS+QSG Y
Sbjct: 378 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSSTQMTASSCNKTIKIPALPAAASAQSGLY 437

BLAST of HG10021072 vs. ExPASy TrEMBL
Match: A0A6J1KPZ7 (uncharacterized GPI-anchored protein At1g61900 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495441 PE=4 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 7.9e-221
Identity = 384/420 (91.43%), Postives = 399/420 (95.00%), Query Frame = 0

Query: 44  TLTTKLDLHETSCIPSTYPTRHLSNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNT 103
           TL   L  HE SCIPSTYPTRHLSNEKPMDDMYP IAPSGNPKPF+PLLAPSPL PFTNT
Sbjct: 52  TLLLLLYFHEASCIPSTYPTRHLSNEKPMDDMYPEIAPSGNPKPFLPLLAPSPLAPFTNT 111

Query: 104 TIPKLSGQCLLNFSATETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTN 163
           T+P LSGQCLLNFSATETLM +TA+DCWAPFAKQMANVICCPQLEATLAILIGQSSKDT 
Sbjct: 112 TVPNLSGQCLLNFSATETLMGVTAMDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTY 171

Query: 164 VLALNGTLAKYCLSDIEQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTS 223
           VLALNGTLA+YCLSDIEQILVGQGA+E L+HIC VHPANLTEGSCPAKD+SEFETTVDTS
Sbjct: 172 VLALNGTLAEYCLSDIEQILVGQGANERLKHICRVHPANLTEGSCPAKDVSEFETTVDTS 231

Query: 224 KLLAACNKIDPVKECCNAICQNAISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTI 283
           KLLAACNKIDPVKECCNAICQNAISEAATK+AMISTDFLGMPGSQVLPEQS RVRDCKTI
Sbjct: 232 KLLAACNKIDPVKECCNAICQNAISEAATKLAMISTDFLGMPGSQVLPEQSARVRDCKTI 291

Query: 284 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMES 343
           VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVA+ACGNAISNKT CCLAMES
Sbjct: 292 VLRWLASKLHPANAKEVLRVLSNCNVNKVCPLEFPDMKYVADACGNAISNKTGCCLAMES 351

Query: 344 YVTHLQKQSLVTNLQALDCATALEMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLL 403
           YVTHLQKQSLVTNLQALDCAT+LEMKLRKSNITK+VY LCHISLKDFSL    QEFGCLL
Sbjct: 352 YVTHLQKQSLVTNLQALDCATSLEMKLRKSNITKNVYDLCHISLKDFSLQVGNQEFGCLL 411

Query: 404 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSTSQMTASSCNKTIKIPALPAAASSQSGTY 460
           PSLPSDAIFDPSSGISFVCDLNDHIPAPWSS++QMTASSCNKTIKIPALPAAAS+QSG Y
Sbjct: 412 PSLPSDAIFDPSSGISFVCDLNDHIPAPWSSSTQMTASSCNKTIKIPALPAAASAQSGLY 471

BLAST of HG10021072 vs. TAIR 10
Match: AT1G61900.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G30700.1). )

HSP 1 Score: 497.3 bits (1279), Expect = 1.4e-140
Identity = 236/389 (60.67%), Postives = 306/389 (78.66%), Query Frame = 0

Query: 67  SNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKLSGQCLLNFSATETLMSMT 126
           S++KP ++  P I+P  +P+PF+P +APSP+VP+ N+T+PKLSG C LNFSA+E+L+  T
Sbjct: 25  SSQKP-EEFLPEISPDTSPQPFLPFIAPSPMVPYINSTMPKLSGLCSLNFSASESLIQTT 84

Query: 127 AIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALNGTLAKYCLSDIEQILVGQ 186
           + +CW  FA  +ANV+CCPQL+ATL I++G++SK+T +LALN T +K+CLSD+EQILVG+
Sbjct: 85  SHNCWTVFAPLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGK 144

Query: 187 GASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAACNKIDPVKECCNAICQNA 246
           GAS  L  IC++H +NLT  SCP  ++ EFE+TVDT+KLL AC KIDPVKECC   CQNA
Sbjct: 145 GASGQLNKICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNA 204

Query: 247 ISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWLASKLHPANAKEVLRVLSN 306
           I +AAT I+        +  S+ L + S R+ DCK +V RWLA+KL P+  KE LR L+N
Sbjct: 205 ILDAATNIS--------LKASETLTDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLAN 264

Query: 307 CNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHLQKQSLVTNLQALDCATAL 366
           C +N+VCPL FP MK++   C N +SN+T CC AMESYV+HLQKQ+L+TNLQALDCAT+L
Sbjct: 265 CKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSL 324

Query: 367 EMKLRKSNITKDVYGLCHISLKDFSLQEFGCLLPSLPSDAIFDPSSGISFVCDLNDHIPA 426
             KL+K NITK+++ +CHISLKDFSLQE GCLLPSLPSDAIFD  +GISF CDLND+IPA
Sbjct: 325 GTKLQKLNITKNIFSVCHISLKDFSLQESGCLLPSLPSDAIFDKDTGISFTCDLNDNIPA 384

Query: 427 PWSSTSQMTASSCNKTIKIPALPAAASSQ 456
           PW S+S  +AS+C K ++IPALPAAASSQ
Sbjct: 385 PWPSSSLSSASTCKKPVRIPALPAAASSQ 404

BLAST of HG10021072 vs. TAIR 10
Match: AT1G61900.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored to plasma membrane, plasma membrane, anchored to membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G30700.1); Has 65 Blast hits to 65 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 65; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 491.5 bits (1264), Expect = 7.5e-139
Identity = 236/393 (60.05%), Postives = 306/393 (77.86%), Query Frame = 0

Query: 67  SNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKLSGQCLLNFSATETLMSMT 126
           S++KP ++  P I+P  +P+PF+P +APSP+VP+ N+T+PKLSG C LNFSA+E+L+  T
Sbjct: 25  SSQKP-EEFLPEISPDTSPQPFLPFIAPSPMVPYINSTMPKLSGLCSLNFSASESLIQTT 84

Query: 127 AIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALNGTLAKYCLSDIEQILVGQ 186
           + +CW  FA  +ANV+CCPQL+ATL I++G++SK+T +LALN T +K+CLSD+EQILVG+
Sbjct: 85  SHNCWTVFAPLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGK 144

Query: 187 GASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAACNKIDPVKECCNAICQNA 246
           GAS  L  IC++H +NLT  SCP  ++ EFE+TVDT+KLL AC KIDPVKECC   CQNA
Sbjct: 145 GASGQLNKICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNA 204

Query: 247 ISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWLASKLHPANAKEVLRVLSN 306
           I +AAT I+        +  S+ L + S R+ DCK +V RWLA+KL P+  KE LR L+N
Sbjct: 205 ILDAATNIS--------LKASETLTDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLAN 264

Query: 307 CNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHLQKQSLVTNLQALDCATAL 366
           C +N+VCPL FP MK++   C N +SN+T CC AMESYV+HLQKQ+L+TNLQALDCAT+L
Sbjct: 265 CKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSL 324

Query: 367 EMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPSDAIFDPSSGISFVCDLND 426
             KL+K NITK+++ +CHISLKDFSL    QE GCLLPSLPSDAIFD  +GISF CDLND
Sbjct: 325 GTKLQKLNITKNIFSVCHISLKDFSLQVGNQESGCLLPSLPSDAIFDKDTGISFTCDLND 384

Query: 427 HIPAPWSSTSQMTASSCNKTIKIPALPAAASSQ 456
           +IPAPW S+S  +AS+C K ++IPALPAAASSQ
Sbjct: 385 NIPAPWPSSSLSSASTCKKPVRIPALPAAASSQ 408

BLAST of HG10021072 vs. TAIR 10
Match: AT1G61900.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored to plasma membrane, plasma membrane, anchored to membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G30700.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 471.5 bits (1212), Expect = 8.0e-133
Identity = 225/379 (59.37%), Postives = 293/379 (77.31%), Query Frame = 0

Query: 67  SNEKPMDDMYPVIAPSGNPKPFIPLLAPSPLVPFTNTTIPKLSGQCLLNFSATETLMSMT 126
           S++KP ++  P I+P  +P+PF+P +APSP+VP+ N+T+PKLSG C LNFSA+E+L+  T
Sbjct: 25  SSQKP-EEFLPEISPDTSPQPFLPFIAPSPMVPYINSTMPKLSGLCSLNFSASESLIQTT 84

Query: 127 AIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALNGTLAKYCLSDIEQILVGQ 186
           + +CW  FA  +ANV+CCPQL+ATL I++G++SK+T +LALN T +K+CLSD+EQILVG+
Sbjct: 85  SHNCWTVFAPLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGK 144

Query: 187 GASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAACNKIDPVKECCNAICQNA 246
           GAS  L  IC++H +NLT  SCP  ++ EFE+TVDT+KLL AC KIDPVKECC   CQNA
Sbjct: 145 GASGQLNKICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNA 204

Query: 247 ISEAATKIAMISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWLASKLHPANAKEVLRVLSN 306
           I +AAT I+        +  S+ L + S R+ DCK +V RWLA+KL P+  KE LR L+N
Sbjct: 205 ILDAATNIS--------LKASETLTDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLAN 264

Query: 307 CNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHLQKQSLVTNLQALDCATAL 366
           C +N+VCPL FP MK++   C N +SN+T CC AMESYV+HLQKQ+L+TNLQALDCAT+L
Sbjct: 265 CKINRVCPLVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSL 324

Query: 367 EMKLRKSNITKDVYGLCHISLKDFSL----QEFGCLLPSLPSDAIFDPSSGISFVCDLND 426
             KL+K NITK+++ +CHISLKDFSL    QE GCLLPSLPSDAIFD  +GISF CDLND
Sbjct: 325 GTKLQKLNITKNIFSVCHISLKDFSLQVGNQESGCLLPSLPSDAIFDKDTGISFTCDLND 384

Query: 427 HIPAPWSSTSQMTASSCNK 442
           +IPAPW S+S  +AS+C K
Sbjct: 385 NIPAPWPSSSLSSASTCKK 394

BLAST of HG10021072 vs. TAIR 10
Match: AT2G30700.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G61900.1); Has 68 Blast hits to 67 proteins in 13 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 66; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 298.1 bits (762), Expect = 1.2e-80
Identity = 168/400 (42.00%), Postives = 235/400 (58.75%), Query Frame = 0

Query: 63  TRHLSNEKPMDDMYPV-IAPSGNPKPFIPLLA-PSPLVP-FTNTTIPKLSGQCLLNFSAT 122
           T  L+N   +    P+ ++PS  PK   P L    P+ P F +T  PKL+G+C  +F A 
Sbjct: 53  TSELANPPGIGVSGPIQVSPSVIPKYASPALPWTPPMYPTFPDTYEPKLTGKCPTDFQAI 112

Query: 123 ETLMSMTAIDCWAPFAKQMANVICCPQLEATLAILIGQSSKDTNVLALNGTLAKYCLSDI 182
            +++   A DC  PFA  + NVICCPQ  + L I  GQ +  +N L L   +A  C SDI
Sbjct: 113 SSVIDTAASDCSQPFAALVGNVICCPQFVSLLHIFQGQHNVKSNKLVLPDAVATDCFSDI 172

Query: 183 EQILVGQGASESLRHICTVHPANLTEGSCPAKDISEFETTVDTSKLLAACNKIDPVKECC 242
             ILV + A+ ++  +C+V  +NLT GSCP  D++ FE  V++SKLL AC  +DP+KECC
Sbjct: 173 VSILVSRRANMTIPALCSVTSSNLTGGSCPVTDVTTFEKVVNSSKLLDACRTVDPLKECC 232

Query: 243 NAICQNAISEAATKIA---MISTDFLGMPGSQVLPEQSTRVRDCKTIVLRWLASKLHPAN 302
             ICQ AI EAA  I+   M   D + + GS         + DCK +V  +L+ KL    
Sbjct: 233 RPICQPAIMEAALIISGHQMTVGDKIPLAGS----NNVNAINDCKNVVFSYLSRKLPADK 292

Query: 303 AKEVLRVLSNCNVNKVCPLEFPDMKYVANACGNAISNKTACCLAMESYVTHLQKQSLVTN 362
           A    R+LS+C VNK CPLEF +   V  AC N  +   +CC ++ +Y++ +Q Q L+TN
Sbjct: 293 ANAAFRILSSCKVNKACPLEFKEPTEVIKACRNVAAPSPSCCSSLNAYISGIQNQMLITN 352

Query: 363 LQALDCATALEMKLRKSNITKDVYGLCHISLKDFSLQEF----GCLLPSLPSDAIFDPSS 422
            QA+ CAT +   LRK  +  ++Y LC + LKDFS+Q +    GCLL S P+D IFD +S
Sbjct: 353 KQAIVCATVIGSMLRKGGVMTNIYELCDVDLKDFSVQAYGMQQGCLLRSYPADLIFDNTS 412

Query: 423 GISFVCDLNDHIPAPWSSTSQMTA-SSCNKTIKIPALPAA 452
           G SF CDL D+I APW S+S M++ S C   + +PALP +
Sbjct: 413 GYSFTCDLTDNIAAPWPSSSSMSSLSLCAPEMSLPALPTS 448

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894855.15.6e-22996.63uncharacterized GPI-anchored protein At1g61900 [Benincasa hispida][more]
XP_004145108.13.1e-22795.42uncharacterized GPI-anchored protein At1g61900 isoform X2 [Cucumis sativus] >KGN... [more]
XP_031739069.14.0e-22795.65uncharacterized GPI-anchored protein At1g61900 isoform X1 [Cucumis sativus][more]
XP_008441110.15.3e-22795.66PREDICTED: uncharacterized GPI-anchored protein At1g61900 [Cucumis melo][more]
XP_023001254.11.2e-22391.55uncharacterized GPI-anchored protein At1g61900 isoform X2 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q8GUI41.1e-13760.05Uncharacterized GPI-anchored protein At1g61900 OS=Arabidopsis thaliana OX=3702 G... [more]
Match NameE-valueIdentityDescription
A0A0A0LX361.5e-22795.42SPARK domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G063500 PE=4 ... [more]
A0A1S3B2802.5e-22795.66uncharacterized GPI-anchored protein At1g61900 OS=Cucumis melo OX=3656 GN=LOC103... [more]
A0A6J1KM815.9e-22491.55uncharacterized GPI-anchored protein At1g61900 isoform X2 OS=Cucurbita maxima OX... [more]
A0A6J1F4382.7e-22191.67uncharacterized GPI-anchored protein At1g61900 OS=Cucurbita moschata OX=3662 GN=... [more]
A0A6J1KPZ77.9e-22191.43uncharacterized GPI-anchored protein At1g61900 isoform X1 OS=Cucurbita maxima OX... [more]
Match NameE-valueIdentityDescription
AT1G61900.31.4e-14060.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G61900.17.5e-13960.05unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G61900.28.0e-13359.37unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G30700.11.2e-8042.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR043891SPARK domainPFAMPF19160SPARKcoord: 109..256
e-value: 7.3E-22
score: 78.4
NoneNo IPR availablePANTHERPTHR33831:SF5OS07G0102300 PROTEINcoord: 68..460
IPR040336Uncharacterized GPI-anchored protein At1g61900-lkePANTHERPTHR33831GPI-ANCHORED PROTEINcoord: 68..460

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021072.1HG10021072.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016020 membrane