Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
AAAAAATTAAAAAATTAAAAAAATTAAAAAATGGGAAGCTCCACCCTCTTCAAAAGCCTTCCAAAGTTTGGTCATTTTGGGGCAGCAGTCATTGCATGGAGATCACCCACTTCCAACACTTCCCTCCGTTTCGCTTCCAACCCCAAATTTATTCATGTAAGTAATTTGTTCGTACATTGAATCATTGGTAAGGTTCTTACTTCTTTTTGATGTTCGACAACAAGTCAGAGCTTCGAATCTCATATATATTTTTGTGTGTTTTTTTGACAGACAAATCCACCACAAGATGGGTCAGAAGCCAAAGATGCCATAAACCCAGAGGCAAATGAAGGAATGACATCGGGTGAACAAATGATGCAGGACAAGGCTTACTCTACTGCAGAACACGTAATCAAAACTCCCCCAACTTTTTATTTTATTTTTTAAACCAACTGAATCGAAGCTACGAATGAGAGCGATGTTGATGTTGAAGATAGATAAGTAAGGTAAATTTGAGAATGCAGGTGAGTGAGAAGACAAGGGATATGGCAGGGATGCTAAGTGCAAGAGCGCAAGAGGTATCGGCAAAGGCAAAGCAGGCAATGGAAGCAGCAAAGGACTCAGCCCATAGGGCAAAGGACTCAGTGGTTGAGACTACCAAAGACTCCAAGCAATTTGTCAAAGCCAACGCAAAGACCGTTGAGAAGTGCATGAACACCAAGAACCGTTCTTAATTTCCCTTTTTTTCCCTTCTAATAAACCCATATAGACCTATATGGAATTTTTTTCAATCCAAAAAATGGTTTAGAAGTGAGTTATGAGCTAATGTGCTTAATTCTTTGATTTGGACGAG
mRNA sequence
AAAAAATTAAAAAATTAAAAAAATTAAAAAATGGGAAGCTCCACCCTCTTCAAAAGCCTTCCAAAGTTTGGTCATTTTGGGGCAGCAGTCATTGCATGGAGATCACCCACTTCCAACACTTCCCTCCGTTTCGCTTCCAACCCCAAATTTATTCATACAAATCCACCACAAGATGGGTCAGAAGCCAAAGATGCCATAAACCCAGAGGCAAATGAAGGAATGACATCGGGTGAACAAATGATGCAGGACAAGGCTTACTCTACTGCAGAACACGTGAGTGAGAAGACAAGGGATATGGCAGGGATGCTAAGTGCAAGAGCGCAAGAGGTATCGGCAAAGGCAAAGCAGGCAATGGAAGCAGCAAAGGACTCAGCCCATAGGGCAAAGGACTCAGTGGTTGAGACTACCAAAGACTCCAAGCAATTTGTCAAAGCCAACGCAAAGACCGTTGAGAAGTGCATGAACACCAAGAACCGTTCTTAATTTCCCTTTTTTTCCCTTCTAATAAACCCATATAGACCTATATGGAATTTTTTTCAATCCAAAAAATGGTTTAGAAGTGAGTTATGAGCTAATGTGCTTAATTCTTTGATTTGGACGAG
Coding sequence (CDS)
ATGGGAAGCTCCACCCTCTTCAAAAGCCTTCCAAAGTTTGGTCATTTTGGGGCAGCAGTCATTGCATGGAGATCACCCACTTCCAACACTTCCCTCCGTTTCGCTTCCAACCCCAAATTTATTCATACAAATCCACCACAAGATGGGTCAGAAGCCAAAGATGCCATAAACCCAGAGGCAAATGAAGGAATGACATCGGGTGAACAAATGATGCAGGACAAGGCTTACTCTACTGCAGAACACGTGAGTGAGAAGACAAGGGATATGGCAGGGATGCTAAGTGCAAGAGCGCAAGAGGTATCGGCAAAGGCAAAGCAGGCAATGGAAGCAGCAAAGGACTCAGCCCATAGGGCAAAGGACTCAGTGGTTGAGACTACCAAAGACTCCAAGCAATTTGTCAAAGCCAACGCAAAGACCGTTGAGAAGTGCATGAACACCAAGAACCGTTCTTAA
Protein sequence
MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEANEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKDSVVETTKDSKQFVKANAKTVEKCMNTKNRS
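The protein sequence above is the frame-0 translation of the coding sequence. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the `"X"` placeholder for unrecognized codons are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G,
# which matches the order of the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # "X" marks a codon that is not in the table (e.g. contains N).
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGGGAAGCTCCACCCTCTTCAAAAGCCTTCCAAAG"))  # MGSSTLFKSLPK
# Last 3 codons plus the TAA stop codon.
print(translate("AACCGTTCTTAA"))  # NRS
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above.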
Homology
BLAST of CmaCh05G008040 vs. ExPASy TrEMBL
Match:
A0A6J1ICL4 (uncharacterized protein At4g13230 OS=Cucurbita maxima OX=3661 GN=LOC111472556 PE=4 SV=1)
HSP 1 Score: 283.1 bits (723), Expect = 6.7e-73
Identity = 150/150 (100.00%), Positives = 150/150 (100.00%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA
Sbjct: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD
Sbjct: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
Query: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
SVVETTKDSKQFVKANAKTVEKCMNTKNRS
Sbjct: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 150
BLAST of CmaCh05G008040 vs. ExPASy TrEMBL
Match:
A0A6J1F3W7 (uncharacterized protein At4g13230 OS=Cucurbita moschata OX=3662 GN=LOC111439943 PE=4 SV=1)
HSP 1 Score: 265.4 bits (677), Expect = 1.5e-67
Identity = 142/150 (94.67%), Positives = 144/150 (96.00%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
MGSSTLFKSLPKFGHFGA IA RSPTSNTSL FASNPKFIHTNPPQDGSEAKDAINPEA
Sbjct: 1 MGSSTLFKSLPKFGHFGAPAIARRSPTSNTSLLFASNPKFIHTNPPQDGSEAKDAINPEA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
NEGMT GE MMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAK+AMEAAKDSAHRAKD
Sbjct: 61 NEGMTPGEHMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKEAMEAAKDSAHRAKD 120
Query: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
+VVETTKDSKQFVKANAKTVEKCMNTKNRS
Sbjct: 121 TVVETTKDSKQFVKANAKTVEKCMNTKNRS 150
BLAST of CmaCh05G008040 vs. ExPASy TrEMBL
Match:
A0A6J1CSM2 (uncharacterized protein At4g13230-like OS=Momordica charantia OX=3673 GN=LOC111014202 PE=4 SV=1)
HSP 1 Score: 153.7 bits (387), Expect = 6.2e-34
Identity = 97/154 (62.99%), Positives = 110/154 (71.43%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQD-----GSEAKDA 60
M S TL K+LPKF H GAA IA RS SN L SNPKF+HT PQD EAKDA
Sbjct: 1 MASFTLLKTLPKFCHLGAA-IARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDA 60
Query: 61 INPEANEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSA 120
INP ANE M GE MM + AYSTA+HV EK DM GM+ + E+S KAKQ MEAA DSA
Sbjct: 61 INPGANEAMMPGESMMTE-AYSTAQHVREKASDMGGMVXS---EISEKAKQTMEAAWDSA 120
Query: 121 HRAKDSVVETTKDSKQFVKANAKTVEKCMNTKNR 150
RAKD+VVE TK+SK+FVKANA++V+K MNTKNR
Sbjct: 121 QRAKDTVVEATKESKEFVKANAESVKKSMNTKNR 149
BLAST of CmaCh05G008040 vs. ExPASy TrEMBL
Match:
A0A2N9J2L7 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS58716 PE=4 SV=1)
HSP 1 Score: 113.2 bits (282), Expect = 9.2e-22
Identity = 76/153 (49.67%), Positives = 107/153 (69.93%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIH-TNPPQDGSEAKDAINPE 60
M S +L LPKFGH GAA+ R T N L ASNP+ I ++ P+ S A DA+
Sbjct: 1 MASLSLVTCLPKFGHAGAAIA--RRNTWNPRLFVASNPRLIQASSNPEASSVATDALKQG 60
Query: 61 ANEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQ-AMEA---AKDSA 120
AN+ +G+ +++KA+STAEHV++KT+DMAGM+SA AQ+V+ KAKQ A EA AKD+A
Sbjct: 61 ANDAKKTGD-TVKNKAFSTAEHVTQKTKDMAGMMSATAQDVTEKAKQTAQEAWGTAKDTA 120
Query: 121 HRAKDSVVETTKDSKQFVKANAKTVEKCMNTKN 149
+AKD+V+ ++SK+ +K NA+TV++ MNTKN
Sbjct: 121 QKAKDTVLGKAEESKECIKENAETVKESMNTKN 150
BLAST of CmaCh05G008040 vs. ExPASy TrEMBL
Match:
A0A7N2KQQ7 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 107.1 bits (266), Expect = 6.6e-20
Identity = 73/153 (47.71%), Positives = 101/153 (66.01%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIH-TNPPQDGSEAKDAINPE 60
M S TL SLPKFGH G+A++ R T N L A P+ I ++ P+ S A DA+
Sbjct: 1 MASLTLIVSLPKFGHAGSAIV--RRSTWNPRLFAACTPRPIQASSNPEASSPATDALKQG 60
Query: 61 ANEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQ----AMEAAKDSA 120
ANE +GE ++DKA STAEHVS+ T+DMAG +SA AQ+V+ K KQ A +AKD+A
Sbjct: 61 ANEAKKTGE-TVKDKANSTAEHVSQNTKDMAGKMSATAQDVTEKVKQTAQEAWGSAKDTA 120
Query: 121 HRAKDSVVETTKDSKQFVKANAKTVEKCMNTKN 149
+AKD+V+ T++SK+ +K +A+ V+ MNTKN
Sbjct: 121 QKAKDNVLGKTEESKESIKESAEAVKNSMNTKN 150
BLAST of CmaCh05G008040 vs. NCBI nr
Match:
XP_022973925.1 (uncharacterized protein At4g13230 [Cucurbita maxima])
HSP 1 Score: 283.1 bits (723), Expect = 1.4e-72
Identity = 150/150 (100.00%), Positives = 150/150 (100.00%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA
Sbjct: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD
Sbjct: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
Query: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
SVVETTKDSKQFVKANAKTVEKCMNTKNRS
Sbjct: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 150
BLAST of CmaCh05G008040 vs. NCBI nr
Match:
XP_022933143.1 (uncharacterized protein At4g13230 [Cucurbita moschata])
HSP 1 Score: 265.4 bits (677), Expect = 3.0e-67
Identity = 142/150 (94.67%), Positives = 144/150 (96.00%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
MGSSTLFKSLPKFGHFGA IA RSPTSNTSL FASNPKFIHTNPPQDGSEAKDAINPEA
Sbjct: 1 MGSSTLFKSLPKFGHFGAPAIARRSPTSNTSLLFASNPKFIHTNPPQDGSEAKDAINPEA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
NEGMT GE MMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAK+AMEAAKDSAHRAKD
Sbjct: 61 NEGMTPGEHMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKEAMEAAKDSAHRAKD 120
Query: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
+VVETTKDSKQFVKANAKTVEKCMNTKNRS
Sbjct: 121 TVVETTKDSKQFVKANAKTVEKCMNTKNRS 150
BLAST of CmaCh05G008040 vs. NCBI nr
Match:
KAG6598970.1 (hypothetical protein SDJN03_08748, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 260.0 bits (663), Expect = 1.3e-65
Identity = 140/150 (93.33%), Positives = 142/150 (94.67%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
MGS TLFKSLPKFGHFGA IA RSPTSNTSL FASNPK IHTNPPQDGSEAKDAINPEA
Sbjct: 1 MGSYTLFKSLPKFGHFGAPAIARRSPTSNTSLLFASNPKSIHTNPPQDGSEAKDAINPEA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
NEGMT GE MMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAK+AMEAAKDSAHRAKD
Sbjct: 61 NEGMTPGEHMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKEAMEAAKDSAHRAKD 120
Query: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
+VVETTKDSKQFVKANAKTVEKCMNTKNRS
Sbjct: 121 TVVETTKDSKQFVKANAKTVEKCMNTKNRS 150
BLAST of CmaCh05G008040 vs. NCBI nr
Match:
XP_023546410.1 (uncharacterized protein At4g13230 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 257.7 bits (657), Expect = 6.3e-65
Identity = 139/150 (92.67%), Positives = 141/150 (94.00%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
MGSSTLFKSLPKFGHFGA IA RSPTSNTSL FASNPK IHTNPPQDG EAKDAI PEA
Sbjct: 1 MGSSTLFKSLPKFGHFGAPAIARRSPTSNTSLLFASNPKSIHTNPPQDGLEAKDAIKPEA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEAAKDSAHRAKD 120
NEGMT GE MMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAK+AMEAAKDSAHRAKD
Sbjct: 61 NEGMTPGEHMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKEAMEAAKDSAHRAKD 120
Query: 121 SVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
+VVETTKDSKQFVKANAKTVEKCMNTKNRS
Sbjct: 121 TVVETTKDSKQFVKANAKTVEKCMNTKNRS 150
BLAST of CmaCh05G008040 vs. NCBI nr
Match:
XP_038889268.1 (uncharacterized protein At4g13230-like [Benincasa hispida])
HSP 1 Score: 205.7 bits (522), Expect = 2.8e-49
Identity = 118/154 (76.62%), Positives = 129/154 (83.77%), Query Frame = 0
Query: 1 MGSSTLFKSLPKFGHFGAAVIAWRSPTSNTSLRFASNPKFIHTNPPQDGSEAKDAINPEA 60
M SSTLFK+LPKFGHFGAA I RS TSNT+L ASNPKFIHTN QDGSEAKDAINP A
Sbjct: 1 MASSTLFKNLPKFGHFGAA-ITRRSTTSNTTLILASNPKFIHTNLSQDGSEAKDAINPGA 60
Query: 61 NEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAMEA----AKDSAH 120
NEGM GE MM+DKAYSTAEHVSEKT+DMAGM+SA+A VSAKAKQAMEA AKD+A
Sbjct: 61 NEGMMPGENMMKDKAYSTAEHVSEKTKDMAGMVSAKAHAVSAKAKQAMEAAWDSAKDTAQ 120
Query: 121 RAKDSVVETTKDSKQFVKANAKTVEKCMNTKNRS 151
RAKD++V+T DSKQFVKAN K+VEK MNTKN S
Sbjct: 121 RAKDTLVDTANDSKQFVKANVKSVEKSMNTKNHS 153
BLAST of CmaCh05G008040 vs. TAIR 10
Match:
AT4G13230.1 (Late embryogenesis abundant protein (LEA) family protein)
HSP 1 Score: 44.7 bits (104), Expect = 7.8e-05
Identity = 34/99 (34.34%), Positives = 54/99 (54.55%), Query Frame = 0
Query: 50 SEAKDAINPEANEGMTSGEQMMQDKAYSTAEHVSEKTRDMAGMLSARAQEVSAKAKQAME 109
S KD++ +A E Q K A+ ++ D AG L +A+ A++A +
Sbjct: 31 STRKDSVCDKATEA--------QQKVAKKADEGAQTISDAAGNLKDKAKNT---AEEAWD 90
Query: 110 AAKDSAHRAKDSVVETTKDSKQFVKANAKTVEKCMNTKN 149
KD+ + KD+V T+++K+ +KA AKTVE+ MNTKN
Sbjct: 91 KVKDTTEKIKDTVTGKTEETKESIKATAKTVERSMNTKN 118
The following BLAST results are available for this feature:
BLAST of CmaCh05G008040 vs. ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1ICL4 | 6.7e-73 | 100.00 | uncharacterized protein At4g13230 OS=Cucurbita maxima OX=3661 GN=LOC111472556 PE...
A0A6J1F3W7 | 1.5e-67 | 94.67 | uncharacterized protein At4g13230 OS=Cucurbita moschata OX=3662 GN=LOC111439943 ...
A0A6J1CSM2 | 6.2e-34 | 62.99 | uncharacterized protein At4g13230-like OS=Momordica charantia OX=3673 GN=LOC1110...
A0A2N9J2L7 | 9.2e-22 | 49.67 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS58716 PE=4 SV=1
A0A7N2KQQ7 | 6.6e-20 | 47.71 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
BLAST of CmaCh05G008040 vs. NCBI nr
Match Name | E-value | Identity | Description
XP_022973925.1 | 1.4e-72 | 100.00 | uncharacterized protein At4g13230 [Cucurbita maxima]
XP_022933143.1 | 3.0e-67 | 94.67 | uncharacterized protein At4g13230 [Cucurbita moschata]
KAG6598970.1 | 1.3e-65 | 93.33 | hypothetical protein SDJN03_08748, partial [Cucurbita argyrosperma subsp. sorori...
XP_023546410.1 | 6.3e-65 | 92.67 | uncharacterized protein At4g13230 [Cucurbita pepo subsp. pepo]
XP_038889268.1 | 2.8e-49 | 76.62 | uncharacterized protein At4g13230-like [Benincasa hispida]
BLAST of CmaCh05G008040 vs. TAIR 10
Match Name | E-value | Identity | Description
AT4G13230.1 | 7.8e-05 | 34.34 | Late embryogenesis abundant protein (LEA) family protein
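The summary rows above are pipe-delimited, so they can be loaded for filtering or sorting with a few lines of code. This is a minimal sketch: the names `Hit` and `parse_row` are hypothetical, the `1e-10` cut-off is an arbitrary illustration rather than a recommended threshold, and any fields after the description (such as a trailing link column) are ignored.

```python
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as listed in the table
    description: str


def parse_row(line: str) -> Optional[Hit]:
    """Parse one 'Match Name | E-value | Identity | Description' row.

    Returns None for header rows and for lines that do not fit the layout.
    """
    fields = [field.strip() for field in line.split("|")]
    if len(fields) < 4 or fields[0] == "Match Name":
        return None
    try:
        return Hit(fields[0], float(fields[1]), float(fields[2]), fields[3])
    except ValueError:
        return None


rows = [
    "Match Name | E-value | Identity | Description",
    "XP_038889268.1 | 2.8e-49 | 76.62 | uncharacterized protein At4g13230-like [Benincasa hispida]",
    "AT4G13230.1 | 7.8e-05 | 34.34 | Late embryogenesis abundant protein (LEA) family protein",
]
hits = [hit for hit in map(parse_row, rows) if hit is not None]

# Keep only hits below an illustrative E-value cut-off.
strong = [hit.name for hit in hits if hit.evalue < 1e-10]
print(strong)  # ['XP_038889268.1']
```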