Sequences
The following sequences are available for this feature:
Gene sequence (with introns)
ATGGCGGCAGACCAAGAAGGCAGAGAGTTGACGTTTAACTCTCAGTTTCAGACTGACCATGGAGATAAGCAGCAAAACCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGGTCTGATTCATTTCTCAGCTTTAGTTCACCGGTGGAATCTGAGATTGGTTCCTATGAGATTGAAAGTGATAGAGATGAGGGCGAGGGCGGCGGTGATGATTACACGGCTGAGTTGAGTCGGCGGATGGCTCAGTACATGCTTCAAGATGATGATAACTCTTCCTCTACAAGCTATCAATCTGAGATTCAGAACAAGGTATTGAATTTATTAAGGCATGGAAGATTTCAGGTCGTTTGTCAAATTCTTTGTTCCTCTGTTTTGTTGATTTTGTGGGTGTGTTCTGTTTCTGGTAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACGCTGTGGTCACCCCTAGGTTCTAGCACTGGGAGCAGCCATGGAAGTCCAGAAGGGCCGTCGAAGGAGCCATCGCCGCCATCGACGCCGGTGGTTGCAGAGCGTGGAGGGCTGGACATTTCACACAACGTTTTCAGCAAATTGGAGAAGAAGAAGAAAGTGAGCACAAATGGTAAAACAATCCAAACAAGCCCCCAAATTGGAGAAACAGGATCTTCCTCTTCCAAAGACCAATCTAGAACTCCCAAAGTGAGACGAGAATCAAAATCCCCCCAACCCAAACCCCCAAATCTCCCCTGTTTCTTATTTTTCTTTTTCTTCCCATATCTCTCTTGCTGATTCAAAACTCAACCATCCTTGTAGAATCAGAAGCGAAGGCAGAACCAACAGCAGCAGCAGTTTATGAAGCAAAAAGGCTCAGTCGCCATACAAGCCAAGCAAACTCAAGGAAGCTCCTCACAAGAAAATTCAGGGGCAAAATTAGGAGGGTCATCGGGGACCGGTGTGTTCCTGCCTCGCCATGTGAACTACAACCGTCCAGCTCCATGTCCTCAGCCACCACAGGCGCCGAAGAAAAAGGGTAATTTCTCCTTTCATTCAATAGATAGCTAATTTTTTTACATCCTTTTCGTAGTGGGTTAACTAGCTAACCCCGTTAGTTGGGATTTTTCCATCAATGAATTGCCATTCAAACCATCCCTCGTGATGCATTTTTGGACTTTTCTTCCTATCCCTCCTTTTCTTCTCTCTTTCTTCGGATTCATAGAGATTCCGTGTCCTCCCATATGCCCTCCTCTGAATCTATCAGACCTTCAACAATTCACGGTTGGCCCAACCAAATTTGTCCGCTACTTTGATGGAATCTTCAATCCATCTCTTTAAACCTGGCCGTTCTGGGGTCAAAAGAAGGTGGCTGAAAACGACACCCAATTGAATCATTAAGCCACTTGCATTGCCCACTACAGTTAACTTCTTGAATGGGTAGATAGATGCGTTGTCTTCCTCTAAAAGGCAAAAACACTAAACAAATTTACCCTTTCCCCCCCTTCCCCATCTCAGGATCCTCCACTGTACTAATACCCGTGAGAGTCTTACAAGCCTTACAACTTCACTACGACAGAATGGACGACAAGACTAGACAAAAAATCACTCGCTTCACAGCTCTTAGAGGTAATTTTTCAATTTCCATCACAAAATTCTCTCTGTGGCATTTTCACAAACAGTTGGTGATTATTCCACATAAAATTGGTTGTTCGCCCCCTATTCTATCTGGGTGATTCTCTTTTTAACCTAACAACAGCAGATTCCTATTTACCTTTATGAACAGAAGCTGCAGCTAATGCAAGAACCACATCACATACCGTTAAGAAAAGTCATTCGGATGCTGCAACGGCGACGGCGACAACGGCGACAAGCCAAATCGATGTGGGCCTTCCTCAAGAATGGACATATTAA
mRNA sequence
ATGGCGGCAGACCAAGAAGGCAGAGAGTTGACGTTTAACTCTCAGTTTCAGACTGACCATGGAGATAAGCAGCAAAACCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGGTCTGATTCATTTCTCAGCTTTAGTTCACCGGTGGAATCTGAGATTGGTTCCTATGAGATTGAAAGTGATAGAGATGAGGGCGAGGGCGGCGGTGATGATTACACGGCTGAGTTGAGTCGGCGGATGGCTCAGTACATGCTTCAAGATGATGATAACTCTTCCTCTACAAGCTATCAATCTGAGATTCAGAACAAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACGCTGTGGTCACCCCTAGGTTCTAGCACTGGGAGCAGCCATGGAAGTCCAGAAGGGCCGTCGAAGGAGCCATCGCCGCCATCGACGCCGGTGGTTGCAGAGCGTGGAGGGCTGGACATTTCACACAACGTTTTCAGCAAATTGGAGAAGAAGAAGAAAGTGAGCACAAATGGTAAAACAATCCAAACAAGCCCCCAAATTGGAGAAACAGGATCTTCCTCTTCCAAAGACCAATCTAGAACTCCCAAAAATCAGAAGCGAAGGCAGAACCAACAGCAGCAGCAGTTTATGAAGCAAAAAGGCTCAGTCGCCATACAAGCCAAGCAAACTCAAGGAAGCTCCTCACAAGAAAATTCAGGGGCAAAATTAGGAGGGTCATCGGGGACCGGTGTGTTCCTGCCTCGCCATGTGAACTACAACCGTCCAGCTCCATGTCCTCAGCCACCACAGGCGCCGAAGAAAAAGGGATCCTCCACTGTACTAATACCCGTGAGAGTCTTACAAGCCTTACAACTTCACTACGACAGAATGGACGACAAGACTAGACAAAAAATCACTCGCTTCACAGCTCTTAGAGAAGCTGCAGCTAATGCAAGAACCACATCACATACCGTTAAGAAAAGTCATTCGGATGCTGCAACGGCGACGGCGACAACGGCGACAAGCCAAATCGATGTGGGCCTTCCTCAAGAATGGACATATTAA
Coding sequence (CDS)
ATGGCGGCAGACCAAGAAGGCAGAGAGTTGACGTTTAACTCTCAGTTTCAGACTGACCATGGAGATAAGCAGCAAAACCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGGTCTGATTCATTTCTCAGCTTTAGTTCACCGGTGGAATCTGAGATTGGTTCCTATGAGATTGAAAGTGATAGAGATGAGGGCGAGGGCGGCGGTGATGATTACACGGCTGAGTTGAGTCGGCGGATGGCTCAGTACATGCTTCAAGATGATGATAACTCTTCCTCTACAAGCTATCAATCTGAGATTCAGAACAAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACGCTGTGGTCACCCCTAGGTTCTAGCACTGGGAGCAGCCATGGAAGTCCAGAAGGGCCGTCGAAGGAGCCATCGCCGCCATCGACGCCGGTGGTTGCAGAGCGTGGAGGGCTGGACATTTCACACAACGTTTTCAGCAAATTGGAGAAGAAGAAGAAAGTGAGCACAAATGGTAAAACAATCCAAACAAGCCCCCAAATTGGAGAAACAGGATCTTCCTCTTCCAAAGACCAATCTAGAACTCCCAAAAATCAGAAGCGAAGGCAGAACCAACAGCAGCAGCAGTTTATGAAGCAAAAAGGCTCAGTCGCCATACAAGCCAAGCAAACTCAAGGAAGCTCCTCACAAGAAAATTCAGGGGCAAAATTAGGAGGGTCATCGGGGACCGGTGTGTTCCTGCCTCGCCATGTGAACTACAACCGTCCAGCTCCATGTCCTCAGCCACCACAGGCGCCGAAGAAAAAGGGATCCTCCACTGTACTAATACCCGTGAGAGTCTTACAAGCCTTACAACTTCACTACGACAGAATGGACGACAAGACTAGACAAAAAATCACTCGCTTCACAGCTCTTAGAGAAGCTGCAGCTAATGCAAGAACCACATCACATACCGTTAAGAAAAGTCATTCGGATGCTGCAACGGCGACGGCGACAACGGCGACAAGCCAAATCGATGTGGGCCTTCCTCAAGAATGGACATATTAA
Protein sequence
MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIESDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQIGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGSSGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKITRFTALREAAANARTTSHTVKKSHSDAATATATTATSQIDVGLPQEWTY
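The protein sequence above is the conceptual translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code (the page does not state which translation table was used). The function name `translate` is illustrative and does not belong to any tool named on this page. It is applied only to the first ten and last five codons of the CDS.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: nuclear translation table 1; the source page does not say.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 10 codons of the CDS listed above.
print(translate("ATGGCGGCAGACCAAGAAGGCAGAGAGTTG"))
# Last 5 codons of the CDS, ending in the TAA stop.
print(translate("GAATGGACATATTAA"))
```

The two printed fragments should correspond to the start (`MAADQEGREL`) and end (`EWTY`) of the protein sequence listed above.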
Homology
BLAST of HG10021882 vs. NCBI nr
Match:
XP_038895136.1 (uncharacterized protein LOC120083444 isoform X2 [Benincasa hispida])
HSP 1 Score: 590.1 bits (1520), Expect = 1.2e-164
Identity = 317/352 (90.06%), Positives = 330/352 (93.75%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEG+EL FNSQFQTDHGDKQQ+PFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1 MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE G DDYTAELSRRMAQYM QDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDIS NVF+KLEK KKVSTNGK+IQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGET SSSSK+QSRT KNQ+RRQNQQQQQF+KQKGS AIQAKQ QGSS Q NSGAK GGS
Sbjct: 181 IGETESSSSKNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGGS 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPC QPPQ PKKKGSSTVLIPVRVLQALQLHYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCSQPPQPPKKKGSSTVLIPVRVLQALQLHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAA----TATATTATSQIDVGLPQEWTY 349
FTALREAAANARTTSHTVKKSHS A+ ATATT+TSQIDVGLPQEWTY
Sbjct: 301 GFTALREAAANARTTSHTVKKSHSGASAAAVAATATTSTSQIDVGLPQEWTY 352
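The Identity and Positives figures in an HSP header can be reproduced from the counts. This is a minimal sketch, assuming each percentage is simply the count divided by the alignment length (gap columns included), which is consistent with the values shown for this hit. The helper names `percent` and `count_identities` are hypothetical.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)


def count_identities(query_row: str, sbjct_row: str) -> int:
    """Count columns where both aligned rows carry the same residue (gaps never match)."""
    return sum(q == s and q != "-" for q, s in zip(query_row, sbjct_row))


# Header values of the first hit (XP_038895136.1).
print(percent(317, 352))  # identity
print(percent(330, 352))  # positives

# First alignment row of the same hit (query 1-60 against subject 1-60).
query = "MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE"
sbjct = "MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE"
print(count_identities(query, sbjct))
```

Summing `count_identities` over every row of an HSP should give the numerator of the Identity field. The Positives count additionally needs a substitution matrix, which this sketch does not model.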
BLAST of HG10021882 vs. NCBI nr
Match:
XP_008457429.1 (PREDICTED: uncharacterized protein LOC103497120 isoform X1 [Cucumis melo] >TYJ97364.1 uncharacterized protein E5676_scaffold194G001750 [Cucumis melo var. makuwa])
HSP 1 Score: 587.8 bits (1514), Expect = 6.1e-164
Identity = 314/348 (90.23%), Positives = 325/348 (93.39%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEGREL FNS+FQ +HGDKQQNPFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE GDDYTAELSRRMAQYMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVV ERGGLDISHNVFSKLEK KKVS +GK+IQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGS AIQ KQ QGSS Q NSGAK GG
Sbjct: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGGP 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPCPQPPQ PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCPQPPQPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAATATATTATSQIDVGLPQEWTY 349
FTALREAAANARTTSHTVKKSH+ A ATATTATSQIDVGLPQEWTY
Sbjct: 301 GFTALREAAANARTTSHTVKKSHTGTAAATATTATSQIDVGLPQEWTY 348
BLAST of HG10021882 vs. NCBI nr
Match:
KAA0031768.1 (uncharacterized protein E6C27_scaffold848G00070 [Cucumis melo var. makuwa])
HSP 1 Score: 587.4 bits (1513), Expect = 8.0e-164
Identity = 314/348 (90.23%), Positives = 324/348 (93.10%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEGREL FNS+FQ +HGDKQQNPFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE GDDYTAELSRRMAQYMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVV ERGGLDISHNVFSKLEK KKVS N K+IQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSINSKSIQTSTQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGS AIQ KQ QGSS Q NSGAK GG
Sbjct: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGGP 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPCPQPPQ PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCPQPPQPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAATATATTATSQIDVGLPQEWTY 349
FTALREAAANARTTSHTVKKSH+ A ATATTATSQIDVGLPQEWTY
Sbjct: 301 GFTALREAAANARTTSHTVKKSHTGTAAATATTATSQIDVGLPQEWTY 348
BLAST of HG10021882 vs. NCBI nr
Match:
XP_038895137.1 (uncharacterized protein LOC120083444 isoform X3 [Benincasa hispida])
HSP 1 Score: 583.6 bits (1503), Expect = 1.2e-162
Identity = 316/352 (89.77%), Positives = 329/352 (93.47%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEG+EL FNSQFQTDHGDKQQ+PFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1 MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE G DDYTAELSRRMAQYM QDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDIS NVF+KLEK KKVSTNGK+IQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGET SSSSK+QSRT KNQ+RRQNQQQQQF+KQKGS AIQAKQ QGSS Q NSGAK GGS
Sbjct: 181 IGETESSSSKNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGGS 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPC QPPQ PKKKGSSTVLIPVRVLQALQLHYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCSQPPQPPKKKGSSTVLIPVRVLQALQLHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAA----TATATTATSQIDVGLPQEWTY 349
FTALR AAANARTTSHTVKKSHS A+ ATATT+TSQIDVGLPQEWTY
Sbjct: 301 GFTALR-AAANARTTSHTVKKSHSGASAAAVAATATTSTSQIDVGLPQEWTY 351
BLAST of HG10021882 vs. NCBI nr
Match:
XP_004145277.2 (uncharacterized protein LOC101214739 [Cucumis sativus] >XP_031741143.1 uncharacterized protein LOC116403745 [Cucumis sativus] >KAE8653272.1 hypothetical protein Csa_023347 [Cucumis sativus] >KGN66184.2 hypothetical protein Csa_019645 [Cucumis sativus])
HSP 1 Score: 536.6 bits (1381), Expect = 1.6e-148
Identity = 288/325 (88.62%), Positives = 299/325 (92.00%), Query Frame = 0
Query: 24 QQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIESDRDEGEGGGDDYTAELSRRMAQ 83
QQNPFEGD+W SYFGRSDSFLSF+SPVESEIGSYEIESDRD+GE GDDYTAELSRRMAQ
Sbjct: 2 QQNPFEGDSWPSYFGRSDSFLSFNSPVESEIGSYEIESDRDDGENDGDDYTAELSRRMAQ 61
Query: 84 YMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPST 143
YMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPST
Sbjct: 62 YMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPST 121
Query: 144 PVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQIGETGSSSSKDQSRTPKNQKRRQ 203
PVV E G LDISHNVFSKLEK KKVS NGK+IQTS QIGETGSSSSKDQSRTPKNQKRRQ
Sbjct: 122 PVVEECGELDISHNVFSKLEKMKKVSINGKSIQTSTQIGETGSSSSKDQSRTPKNQKRRQ 181
Query: 204 NQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGSSGTGVFLPRHVNYNRPAPCPQPP 263
NQQQQQFMKQKGS Q KQ QGSS Q NSGAK G SGTGVFLPRHVNYNRPAPCPQPP
Sbjct: 182 NQQQQQFMKQKGSGTTQVKQAQGSSLQANSGAKSVGPSGTGVFLPRHVNYNRPAPCPQPP 241
Query: 264 QAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKITRFTALREAAANARTTSHTVKKSH 323
Q PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT FTALREAAANARTT++T+KKSH
Sbjct: 242 QPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKITGFTALREAAANARTTTNTIKKSH 301
Query: 324 SDAATATATTATSQIDVGLPQEWTY 349
+ ATAT TTATSQIDVGLPQEWTY
Sbjct: 302 TGTATATVTTATSQIDVGLPQEWTY 326
BLAST of HG10021882 vs. ExPASy TrEMBL
Match:
A0A5D3BEB2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001750 PE=4 SV=1)
HSP 1 Score: 587.8 bits (1514), Expect = 3.0e-164
Identity = 314/348 (90.23%), Positives = 325/348 (93.39%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEGREL FNS+FQ +HGDKQQNPFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE GDDYTAELSRRMAQYMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVV ERGGLDISHNVFSKLEK KKVS +GK+IQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGS AIQ KQ QGSS Q NSGAK GG
Sbjct: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGGP 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPCPQPPQ PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCPQPPQPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAATATATTATSQIDVGLPQEWTY 349
FTALREAAANARTTSHTVKKSH+ A ATATTATSQIDVGLPQEWTY
Sbjct: 301 GFTALREAAANARTTSHTVKKSHTGTAAATATTATSQIDVGLPQEWTY 348
BLAST of HG10021882 vs. ExPASy TrEMBL
Match:
A0A1S3C665 (uncharacterized protein LOC103497120 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497120 PE=4 SV=1)
HSP 1 Score: 587.8 bits (1514), Expect = 3.0e-164
Identity = 314/348 (90.23%), Positives = 325/348 (93.39%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEGREL FNS+FQ +HGDKQQNPFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE GDDYTAELSRRMAQYMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVV ERGGLDISHNVFSKLEK KKVS +GK+IQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGS AIQ KQ QGSS Q NSGAK GG
Sbjct: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGGP 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPCPQPPQ PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCPQPPQPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAATATATTATSQIDVGLPQEWTY 349
FTALREAAANARTTSHTVKKSH+ A ATATTATSQIDVGLPQEWTY
Sbjct: 301 GFTALREAAANARTTSHTVKKSHTGTAAATATTATSQIDVGLPQEWTY 348
BLAST of HG10021882 vs. ExPASy TrEMBL
Match:
A0A5A7SMB3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00070 PE=4 SV=1)
HSP 1 Score: 587.4 bits (1513), Expect = 3.9e-164
Identity = 314/348 (90.23%), Positives = 324/348 (93.10%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAADQEGREL FNS+FQ +HGDKQQNPFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SDRDEGEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPL 120
SDRD+GE GDDYTAELSRRMAQYMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQ 180
GSSTGSSHGSPEGPSKEPSPPSTPVV ERGGLDISHNVFSKLEK KKVS N K+IQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSINSKSIQTSTQ 180
Query: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGS 240
IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGS AIQ KQ QGSS Q NSGAK GG
Sbjct: 181 IGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGGP 240
Query: 241 SGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKIT 300
SGTGVFLPRHVNYNRPAPCPQPPQ PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT
Sbjct: 241 SGTGVFLPRHVNYNRPAPCPQPPQPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKIT 300
Query: 301 RFTALREAAANARTTSHTVKKSHSDAATATATTATSQIDVGLPQEWTY 349
FTALREAAANARTTSHTVKKSH+ A ATATTATSQIDVGLPQEWTY
Sbjct: 301 GFTALREAAANARTTSHTVKKSHTGTAAATATTATSQIDVGLPQEWTY 348
BLAST of HG10021882 vs. ExPASy TrEMBL
Match:
A0A0A0M0L9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G525320 PE=4 SV=1)
HSP 1 Score: 536.6 bits (1381), Expect = 7.9e-149
Identity = 288/325 (88.62%), Positives = 300/325 (92.31%), Query Frame = 0
Query: 24 QQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIESDRDEGEGGGDDYTAELSRRMAQ 83
QQNPFEGD+W SYFGRSDSFLSF+SPVESEIGSYEIESDRD+GE GDDYTAELSRRMAQ
Sbjct: 2 QQNPFEGDSWPSYFGRSDSFLSFNSPVESEIGSYEIESDRDDGENDGDDYTAELSRRMAQ 61
Query: 84 YMLQDDDNSSSTSYQSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPST 143
YMLQDDDNSS+TS+QSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPST
Sbjct: 62 YMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPST 121
Query: 144 PVVAERGGLDISHNVFSKLEKKKKVSTNGKTIQTSPQIGETGSSSSKDQSRTPKNQKRRQ 203
PVV E G LDISHNVFSKLEK KKVS NGK+IQTS QIGETGSSSSKDQSRTPKNQKRRQ
Sbjct: 122 PVVEECGELDISHNVFSKLEKMKKVSINGKSIQTSTQIGETGSSSSKDQSRTPKNQKRRQ 181
Query: 204 NQQQQQFMKQKGSVAIQAKQTQGSSSQENSGAKLGGSSGTGVFLPRHVNYNRPAPCPQPP 263
NQQQQQFMKQKGS IQ KQ QGSS Q NSGAK G SGTGVFLPRHVNY+RPAPCPQPP
Sbjct: 182 NQQQQQFMKQKGSGTIQVKQAQGSSLQANSGAKSVGPSGTGVFLPRHVNYSRPAPCPQPP 241
Query: 264 QAPKKKGSSTVLIPVRVLQALQLHYDRMDDKTRQKITRFTALREAAANARTTSHTVKKSH 323
Q PKKKG STVLIPVRVLQALQ HYDRMDD+TRQKIT FTALREAAANARTT++T+KKSH
Sbjct: 242 QPPKKKGCSTVLIPVRVLQALQHHYDRMDDETRQKITGFTALREAAANARTTTNTIKKSH 301
Query: 324 SDAATATATTATSQIDVGLPQEWTY 349
+ ATAT TTATSQIDVGLPQEWTY
Sbjct: 302 TGTATATVTTATSQIDVGLPQEWTY 326
BLAST of HG10021882 vs. ExPASy TrEMBL
Match:
A0A6J1I483 (putative lysozyme-like protein OS=Cucurbita maxima OX=3661 GN=LOC111469491 PE=4 SV=1)
HSP 1 Score: 494.2 bits (1271), Expect = 4.5e-136
Identity = 285/361 (78.95%), Positives = 300/361 (83.10%), Query Frame = 0
Query: 1 MAADQEGRELTFNSQFQTDHGDKQQNPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EGRE F SQFQ +HG+K QNPFEGDNWSSY+GRSDSFLSFSS VES EIE
Sbjct: 1 MAADPEGREFKFYSQFQAEHGNK-QNPFEGDNWSSYYGRSDSFLSFSSKVES-----EIE 60
Query: 61 SDRDE---------GEGGGDDYTAELSRRMAQYMLQDDDNSSSTSYQSEIQNKSWGLSGS 120
SD+D+ G GGGDDYTAELSRRMAQYMLQDDDNSS S+Q EIQ+K WGLS S
Sbjct: 61 SDKDDGGCGGGSGGGGGGGDDYTAELSRRMAQYMLQDDDNSSIESFQPEIQSKPWGLSSS 120
Query: 121 PISTLWSPLGSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISHNVFSKLEKKKKVSTN 180
PISTLWSPLGSST SS+GSPEGPSKEPSPPSTPVV ERGGLDISHNVFSKLEK KKVSTN
Sbjct: 121 PISTLWSPLGSSTESSYGSPEGPSKEPSPPSTPVVVERGGLDISHNVFSKLEKMKKVSTN 180
Query: 181 GKTIQTSPQIGETGSSSSKDQSRTPKNQKRRQNQQQQQFMKQKGSVAIQAKQTQGSSSQE 240
K+IQTSPQ G T SSSS KNQKRRQNQQQQQFMKQK S A QAKQ QGS+SQ
Sbjct: 181 DKSIQTSPQHGGTRSSSS-------KNQKRRQNQQQQQFMKQKSSAATQAKQAQGSTSQA 240
Query: 241 NSGAKL-GGSSGTGVFLPRHVNYNRPAPCPQPPQAPKKKGSSTVLIPVRVLQALQLHYDR 300
NSGAK GGSSGTGVFLPRHVNYNRPAPCPQPPQ PKKKG STVLIPVRVLQALQLHYDR
Sbjct: 241 NSGAKPGGGSSGTGVFLPRHVNYNRPAPCPQPPQPPKKKGCSTVLIPVRVLQALQLHYDR 300
Query: 301 MDDKTRQKITRFTALREAAANARTTSHTVKKSHSD---AATATATTATSQIDVGLPQEWT 349
MD++TR+KIT FTALREAAA+ART SHT KKSH + AA A T ATSQIDVGLPQEWT
Sbjct: 301 MDNETREKITGFTALREAAASARTKSHTDKKSHLEATAAAEAATTAATSQIDVGLPQEWT 348
BLAST of HG10021882 vs. TAIR 10
Match:
AT5G59050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G54000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 75.5 bits (184), Expect = 9.5e-14
Identity = 100/334 (29.94%), Positives = 144/334 (43.11%), Query Frame = 0
Query: 40 SDSFLSFSSP---------VESEIGSYEIESDRDEGEGGGDDYTAELSRRMAQYMLQDDD 99
S+ F SFS P + + S E +S + + E D+Y EL+R+M YMLQDD
Sbjct: 13 SNPFTSFSEPTFFTPTTSSLRPDFVSDEPDSPKAKNEDEEDEYITELTRQMTNYMLQDD- 72
Query: 100 NSSSTSYQSEIQNKSWGL-SGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPSTPVVAER 159
E KS G SGSP STLWSP S SP GPS+EPSPP TP
Sbjct: 73 ---------EKHQKSCGSGSGSPQSTLWSPFASGL----SSPIGPSREPSPPLTPATVP- 132
Query: 160 GGLDISHNVFSKLEKKK-KVSTNGKTIQTSPQIGETGSSSSKDQSRTPKNQKRRQNQQQQ 219
+ +K++ K + K QI ++ K K +K ++ Q+
Sbjct: 133 -----VEKIMTKIDTKPVTIPFQSKQALIDDQIRSIQANFQK-----IKKEKEKERQRNA 192
Query: 220 QFMKQKGSVAIQAKQTQGSSS------QENSGAKLGGSSGTGVFLPRHVNYNRPAPCPQP 279
+ K Q Q S + SG++ GS GTGVFLPR
Sbjct: 193 DVLGHKARNYHHLHQNQRPRSGVKAVFVDGSGSRT-GSGGTGVFLPRGHG--------TV 252
Query: 280 PQAPKKKGSSTVLIPVRVLQALQLHYDRM--------DDKTRQKITRFTALREAAANART 339
++ KK G STV+IP RV++AL++H+D++ D + + + +
Sbjct: 253 VESRKKSGCSTVIIPARVVEALKVHFDKLGVPSTFSSDIPPFHDALLVSMNNKKIKSNKN 312
Query: 340 TSHTVKKSHSDAATATATTATSQIDVGLPQEWTY 349
TS + +S S + + + LPQEWTY
Sbjct: 313 TSLSRVQSGSPYEMEMSAESHQEPPADLPQEWTY 312
BLAST of HG10021882 vs. TAIR 10
Match:
AT5G59050.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 60.8 bits (146), Expect = 2.4e-09
Identity = 49/115 (42.61%), Positives = 59/115 (51.30%), Query Frame = 0
Query: 40 SDSFLSFSSP---------VESEIGSYEIESDRDEGEGGGDDYTAELSRRMAQYMLQDDD 99
S+ F SFS P + + S E +S + + E D+Y EL+R+M YMLQDD
Sbjct: 13 SNPFTSFSEPTFFTPTTSSLRPDFVSDEPDSPKAKNEDEEDEYITELTRQMTNYMLQDD- 72
Query: 100 NSSSTSYQSEIQNKSWGL-SGSPISTLWSPLGSSTGSSHGSPEGPSKEPSPPSTP 145
E KS G SGSP STLWSP S SP GPS+EPSPP TP
Sbjct: 73 ---------EKHQKSCGSGSGSPQSTLWSPFASGL----SSPIGPSREPSPPLTP 113
The following BLAST results are available for this feature:
vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038895136.1 | 1.2e-164 | 90.06 | uncharacterized protein LOC120083444 isoform X2 [Benincasa hispida]
XP_008457429.1 | 6.1e-164 | 90.23 | PREDICTED: uncharacterized protein LOC103497120 isoform X1 [Cucumis melo] >TYJ97...
KAA0031768.1 | 8.0e-164 | 90.23 | uncharacterized protein E6C27_scaffold848G00070 [Cucumis melo var. makuwa]
XP_038895137.1 | 1.2e-162 | 89.77 | uncharacterized protein LOC120083444 isoform X3 [Benincasa hispida]
XP_004145277.2 | 1.6e-148 | 88.62 | uncharacterized protein LOC101214739 [Cucumis sativus] >XP_031741143.1 uncharact...
vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3BEB2 | 3.0e-164 | 90.23 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3C665 | 3.0e-164 | 90.23 | uncharacterized protein LOC103497120 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7SMB3 | 3.9e-164 | 90.23 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A0A0M0L9 | 7.9e-149 | 88.62 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G525320 PE=4 SV=1
A0A6J1I483 | 4.5e-136 | 78.95 | putative lysozyme-like protein OS=Cucurbita maxima OX=3661 GN=LOC111469491 PE=4 ...
vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G59050.1 | 9.5e-14 | 29.94 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G59050.2 | 2.4e-09 | 42.61 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...