CmoCh08G006350 (gene) Cucurbita moschata (Rifu)

Name: CmoCh08G006350
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Late embryogenesis abundant family protein
Location: Cmo_Chr08 : 4138537 .. 4139037 (+)
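
The location string above can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `feature_length` are hypothetical, and the calculation assumes the coordinates are 1-based and inclusive (GFF-style), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr08 : 4138537 .. 4139037 (+)")
length = feature_length(start, end)
print(seqid, strand, length)  # Cmo_Chr08 + 501
```

Under that assumption the gene spans 501 bp on the forward strand of Cmo_Chr08.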
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAGCCAACAAGACATTAGCCACAGCGTCGGCGAGGCCGTCGGCCAAGCTCAGGTAGTTTACCGCAACCAAATCGATATCAAACGCAGCCGTTTAGAACATATTATATATTTCAATTATTGTGTCAGGTGAAGAAGGATGAGGTCATCAACCAAGCAGCTAATTCAGCTCAAGAAGCAAAAGACAGCGCATCTACTACCCTTGATCAATCCTCAGCTGCCCAAACTGCCTCTGACCTCAAGGACCAAGCTGCAAATTTCCTTCAACAGGCAAATTTGTTATTGGAAAAAACTATATTTTAAAAAATTTGCAAATGGGTCCTTACAAAATTGCTTATTTATTTATTTTGATATGGTTTAGACTGGAGAGCAAGTGAAGAACATGGCTCAAGGAGCAGCTGAGGCAGTGAAAAACACTCTTGGGATGAACACTGATAGCAGTTCTAACGCCGCCGCCAACACCAACAACCCTACCAACACTCCATCCCCTACAATCTAA

mRNA sequence

ATGGCAAGCCAACAAGACATTAGCCACAGCGTCGGCGAGGCCGTCGGCCAAGCTCAGGTGAAGAAGGATGAGGTCATCAACCAAGCAGCTAATTCAGCTCAAGAAGCAAAAGACAGCGCATCTACTACCCTTGATCAATCCTCAGCTGCCCAAACTGCCTCTGACCTCAAGGACCAAGCTGCAAATTTCCTTCAACAGACTGGAGAGCAAGTGAAGAACATGGCTCAAGGAGCAGCTGAGGCAGTGAAAAACACTCTTGGGATGAACACTGATAGCAGTTCTAACGCCGCCGCCAACACCAACAACCCTACCAACACTCCATCCCCTACAATCTAA

Coding sequence (CDS)

ATGGCAAGCCAACAAGACATTAGCCACAGCGTCGGCGAGGCCGTCGGCCAAGCTCAGGTGAAGAAGGATGAGGTCATCAACCAAGCAGCTAATTCAGCTCAAGAAGCAAAAGACAGCGCATCTACTACCCTTGATCAATCCTCAGCTGCCCAAACTGCCTCTGACCTCAAGGACCAAGCTGCAAATTTCCTTCAACAGACTGGAGAGCAAGTGAAGAACATGGCTCAAGGAGCAGCTGAGGCAGTGAAAAACACTCTTGGGATGAACACTGATAGCAGTTCTAACGCCGCCGCCAACACCAACAACCCTACCAACACTCCATCCCCTACAATCTAA
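
The protein used as the BLAST query below can be recovered from the CDS by translating it with the standard genetic code. The following is a minimal sketch under that assumption. `translate` and `CODON_TABLE` are illustrative names, not part of the database, and only the first ten and last three codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCAAGCCAACAAGACATTAGCCACAGC"  # first 10 codons of the CDS
cds_end = "ACAATCTAA"                          # last 3 codons, including the stop
print(translate(cds_start))  # MASQQDISHS
print(translate(cds_end))    # TI
```

The two fragments match the start (`MASQQDISHS`) and end (`TI`) of the query sequence shown in the alignments.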
BLAST of CmoCh08G006350 vs. TrEMBL
Match: A0A0A0K8V2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G433240 PE=4 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 6.2e-24
Identity = 75/114 (65.79%), Positives = 85/114 (74.56%), Query Frame = 1

Query: 1   MASQQDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQA 60
           MAS QD+SH   E VGQAQVK+DE++NQ             TT DQSSAAQTA+DLKDQA
Sbjct: 1   MASHQDLSHKADEIVGQAQVKRDEMMNQP------------TTQDQSSAAQTATDLKDQA 60

Query: 61  ANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSN------AAANTNNPTNTPS 109
           A+FLQQTGEQVKNMAQGAAEAVKNTLGMNTD++SN      A  + NNPTN P+
Sbjct: 61  ASFLQQTGEQVKNMAQGAAEAVKNTLGMNTDNTSNTNTHNPANNSANNPTNNPA 102
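
The percentages in the HSP header are the match counts divided by the alignment length, gaps included. As a small sketch (the function name `percent` is hypothetical), the figures reported for this hit can be reproduced as follows:

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * count / aligned_length:.2f}"

identity = percent(75, 114)   # identical residues over alignment columns
positives = percent(85, 114)  # identical or conservatively substituted residues
print(identity, positives)    # 65.79 74.56
```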

BLAST of CmoCh08G006350 vs. TrEMBL
Match: A0A059BSC4_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02549 PE=4 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 9.0e-15
Identity = 59/110 (53.64%), Positives = 74/110 (67.27%), Query Frame = 1

Query: 1   MASQ-QDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQ 60
           MA+Q QDI ++ G+  GQAQ+KK+E ++QA+N  Q   D  S T   S       D+K Q
Sbjct: 1   MANQGQDIGYAAGQMTGQAQLKKEECMDQASNEYQATMDKLSGTTPNSGY-----DIKGQ 60

Query: 61  AANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSNAAANTNNPTNTPSP 110
           A NFLQQTGEQVK+MA GAAEAVK+TLGMN D++ +AAA  N P N   P
Sbjct: 61  ATNFLQQTGEQVKSMAHGAAEAVKSTLGMNNDAAGDAAATVNPPNNPSYP 105

BLAST of CmoCh08G006350 vs. TrEMBL
Match: A0A061GBL6_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_028329 PE=4 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 7.6e-14
Identity = 62/111 (55.86%), Positives = 78/111 (70.27%), Query Frame = 1

Query: 1   MASQ--QDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKD 60
           MAS+  Q+I +  G+  GQAQVKKDE +NQA+          + + DQ+S+      +  
Sbjct: 1   MASRTAQNIGNQGGDITGQAQVKKDETMNQASQGTN------TQSSDQNSS------IAS 60

Query: 61  QAANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSNAAANTNNPTNTPSP 110
           QA NFLQQTGEQVKNMAQGAA+AVKNTLGMN D+SSN A +TN+P+N PSP
Sbjct: 61  QATNFLQQTGEQVKNMAQGAADAVKNTLGMNNDNSSN-APSTNHPSN-PSP 97

BLAST of CmoCh08G006350 vs. TrEMBL
Match: U5CRL2_AMBTC (Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s00039p00184170 PE=4 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 1.3e-13
Identity = 53/98 (54.08%), Positives = 65/98 (66.33%), Query Frame = 1

Query: 1  MASQQDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQS---------SAAQ 60
          M   QDI +  GEA G AQVKKDE++N+A ++AQ  KD AS     +         SA  
Sbjct: 1  MDRSQDIRYKAGEAKGHAQVKKDEMVNKAKDTAQTVKDKASDAAQSAKDSMSSNSQSARC 60

Query: 61 TASDLKDQAANFLQQTGEQVKNMAQGAAEAVKNTLGMN 90
          +A + K++AA FLQQTGEQVKNMAQGA + VKNTLGMN
Sbjct: 61 SAQESKEEAAGFLQQTGEQVKNMAQGAVDTVKNTLGMN 98

BLAST of CmoCh08G006350 vs. TrEMBL
Match: A0A078HTR6_BRANA (BnaC06g05510D protein OS=Brassica napus GN=BnaC06g05510D PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 3.0e-10
Identity = 52/110 (47.27%), Positives = 69/110 (62.73%), Query Frame = 1

Query: 1   MASQQDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQA 60
           M+S Q++SH  GEA GQ Q+KK+E +N+ +++  E  D  S +   +   Q    L  QA
Sbjct: 1   MSSNQELSHKAGEATGQVQLKKEEYLNKVSHAMDENADHHSHS--HAEHDQNNPSLISQA 60

Query: 61  ANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSNAAANTNNPTNTPSPT 111
           +N +QQTG QVKNMAQGAA+AVKNTLGM     S A  N +NP  T  P+
Sbjct: 61  SNVIQQTGGQVKNMAQGAADAVKNTLGM-----SPATNNPSNPAGTTHPS 103

BLAST of CmoCh08G006350 vs. TAIR10
Match: AT1G52680.1 (AT1G52680.1 late embryogenesis abundant protein-related / LEA protein-related)

HSP 1 Score: 65.9 bits (159), Expect = 1.8e-11
Identity = 48/113 (42.48%), Positives = 69/113 (61.06%), Query Frame = 1

Query: 1   MASQQDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQ--SSAAQTASDLKD 60
           M+S Q +SHS GE  GQ Q+KK+E +N  +++  +  D  + +  Q  S   Q    L  
Sbjct: 1   MSSSQQLSHSAGEVTGQVQLKKEEYLNNVSHAMNQNADHHTHSQSQLHSEHDQNNPSLIS 60

Query: 61  QAANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSNAAANTNNPTNTPSPTI 112
           QA++ +QQTG QVKNMAQGAA+AVKNTLGM+  ++S ++      +N P   I
Sbjct: 61  QASSVIQQTGGQVKNMAQGAADAVKNTLGMSPATNSPSSPAGTTRSNKPGSKI 113

BLAST of CmoCh08G006350 vs. TAIR10
Match: AT5G38760.1 (AT5G38760.1 Late embryogenesis abundant protein (LEA) family protein)

HSP 1 Score: 47.4 bits (111), Expect = 6.8e-06
Identity = 32/88 (36.36%), Positives = 48/88 (54.55%), Query Frame = 1

Query: 2  ASQQDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQAA 61
          ++ Q+IS   G+A GQ Q K   ++++A+N+AQ AK+S                      
Sbjct: 3  SNSQNISFQAGQAKGQTQEKASTMMDKASNAAQSAKES---------------------- 62

Query: 62 NFLQQTGEQVKNMAQGAAEAVKNTLGMN 90
            L++TG+Q+K  AQGA E+VKN  GMN
Sbjct: 63 --LKETGQQIKEKAQGATESVKNATGMN 66

BLAST of CmoCh08G006350 vs. NCBI nr
Match: gi|449436603|ref|XP_004136082.1| (PREDICTED: late embryogenesis abundant protein 2 [Cucumis sativus])

HSP 1 Score: 118.2 bits (295), Expect = 8.9e-24
Identity = 75/114 (65.79%), Positives = 85/114 (74.56%), Query Frame = 1

Query: 1   MASQQDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQA 60
           MAS QD+SH   E VGQAQVK+DE++NQ             TT DQSSAAQTA+DLKDQA
Sbjct: 1   MASHQDLSHKADEIVGQAQVKRDEMMNQP------------TTQDQSSAAQTATDLKDQA 60

Query: 61  ANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSN------AAANTNNPTNTPS 109
           A+FLQQTGEQVKNMAQGAAEAVKNTLGMNTD++SN      A  + NNPTN P+
Sbjct: 61  ASFLQQTGEQVKNMAQGAAEAVKNTLGMNTDNTSNTNTHNPANNSANNPTNNPA 102

BLAST of CmoCh08G006350 vs. NCBI nr
Match: gi|659122328|ref|XP_008461082.1| (PREDICTED: late embryogenesis abundant protein 1 [Cucumis melo])

HSP 1 Score: 115.5 bits (288), Expect = 5.8e-23
Identity = 74/109 (67.89%), Positives = 81/109 (74.31%), Query Frame = 1

Query: 1   MASQQD-ISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQ 60
           MAS QD +SH  GE VGQAQVKKDE++NQ              T    SA QTASDLKDQ
Sbjct: 1   MASHQDNLSHKAGETVGQAQVKKDEMMNQP-------------TAHDQSATQTASDLKDQ 60

Query: 61  AANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDSSSNAAANTNNPTNTPS 109
           AANFLQQTGEQVKNMAQGAAEAVKNTLGMNTD++SN   N+NNPTN P+
Sbjct: 61  AANFLQQTGEQVKNMAQGAAEAVKNTLGMNTDNTSN--PNSNNPTNNPT 94

BLAST of CmoCh08G006350 vs. NCBI nr
Match: gi|1009174423|ref|XP_015868339.1| (PREDICTED: late embryogenesis abundant protein 2-like [Ziziphus jujuba])

HSP 1 Score: 91.7 bits (226), Expect = 8.9e-16
Identity = 56/108 (51.85%), Positives = 77/108 (71.30%), Query Frame = 1

Query: 5   QDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQSSAAQTASDLKDQAANFL 64
           QD+++  GE  GQAQ+KKDE +NQ++N +Q              ++Q ASD+K+QA +FL
Sbjct: 7   QDLAYQAGEITGQAQMKKDEFMNQSSNPSQ--------------SSQNASDIKEQATHFL 66

Query: 65  QQTGEQVKNMAQGAAEAVKNTLGMNTDS--SSNAAANTNNPTNTPSPT 111
           QQTGEQ+KNMAQGAAEAVKNTLGMN D+  +S  +  TNNP++  +P+
Sbjct: 67  QQTGEQMKNMAQGAAEAVKNTLGMNADNNPTSRGSGGTNNPSHPSNPS 100

BLAST of CmoCh08G006350 vs. NCBI nr
Match: gi|657961014|ref|XP_008372090.1| (PREDICTED: late embryogenesis abundant protein, group 3-like [Malus domestica])

HSP 1 Score: 89.4 bits (220), Expect = 4.4e-15
Identity = 60/126 (47.62%), Positives = 77/126 (61.11%), Query Frame = 1

Query: 5   QDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQ----------------SS 64
           QD+SH  GE  GQAQVKKDE +NQA+ +AQ A++ AS                     S 
Sbjct: 7   QDLSHKAGELTGQAQVKKDEFLNQASGAAQSAQNQASNLSQSAQNKASDASQSAQNKASD 66

Query: 65  AAQTASDLKDQAANFLQQTGEQVKNMAQGAAEAVKNTLGMNT----DSSSNAAANTNNPT 111
           A+ T  D+KDQA + LQQT EQV+NMAQGAA+AVKNTLGMN+     + +N  +N  NP 
Sbjct: 67  ASHTGQDIKDQATHLLQQTSEQVRNMAQGAADAVKNTLGMNSPNDPSNQTNTPSNHTNPN 126

BLAST of CmoCh08G006350 vs. NCBI nr
Match: gi|694410485|ref|XP_009333653.1| (PREDICTED: late embryogenesis abundant protein 1-like [Pyrus x bretschneideri])

HSP 1 Score: 89.4 bits (220), Expect = 4.4e-15
Identity = 60/126 (47.62%), Positives = 77/126 (61.11%), Query Frame = 1

Query: 5   QDISHSVGEAVGQAQVKKDEVINQAANSAQEAKDSASTTLDQ----------------SS 64
           QD+SH  GE  GQAQVKKDE +NQA+ +AQ A++ AS                     S 
Sbjct: 7   QDLSHKAGELTGQAQVKKDEFLNQASGAAQSAQNQASNLSQSAQNKASDASQSAQNKASD 66

Query: 65  AAQTASDLKDQAANFLQQTGEQVKNMAQGAAEAVKNTLGMNT----DSSSNAAANTNNPT 111
           A+ T  D+KDQA + LQQT EQV+NMAQGAA+AVKNTLGMN+     + +N  +N  NP 
Sbjct: 67  ASHTGQDIKDQATHLLQQTSEQVRNMAQGAADAVKNTLGMNSPNDPSNQTNTPSNHTNPN 126

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name                          E-value   Identity (%)  Description
A0A0A0K8V2_CUCSA                    6.2e-24   65.79         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G433240 PE=4 SV=1
A0A059BSC4_EUCGR                    9.0e-15   53.64         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02549 PE=4 SV=1
A0A061GBL6_THECC                    7.6e-14   55.86         Uncharacterized protein OS=Theobroma cacao GN=TCM_028329 PE=4 SV=1
U5CRL2_AMBTC                        1.3e-13   54.08         Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s00039p00184170 PE=4 SV=...
A0A078HTR6_BRANA                    3.0e-10   47.27         BnaC06g05510D protein OS=Brassica napus GN=BnaC06g05510D PE=4 SV=1

BLAST vs. TAIR10
Match Name                          E-value   Identity (%)  Description
AT1G52680.1                         1.8e-11   42.48         late embryogenesis abundant protein-related / LEA protein-related
AT5G38760.1                         6.8e-06   36.36         Late embryogenesis abundant protein (LEA) family protein

BLAST vs. NCBI nr
Match Name                          E-value   Identity (%)  Description
gi|449436603|ref|XP_004136082.1|    8.9e-24   65.79         PREDICTED: late embryogenesis abundant protein 2 [Cucumis sativus]
gi|659122328|ref|XP_008461082.1|    5.8e-23   67.89         PREDICTED: late embryogenesis abundant protein 1 [Cucumis melo]
gi|1009174423|ref|XP_015868339.1|   8.9e-16   51.85         PREDICTED: late embryogenesis abundant protein 2-like [Ziziphus jujuba]
gi|657961014|ref|XP_008372090.1|    4.4e-15   47.62         PREDICTED: late embryogenesis abundant protein, group 3-like [Malus domestica]
gi|694410485|ref|XP_009333653.1|    4.4e-15   47.62         PREDICTED: late embryogenesis abundant protein 1-like [Pyrus x bretschneideri]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh08G006350.1    CmoCh08G006350.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description   Source   Source Term    Source Description  Alignment
None      No IPR available  PANTHER  PTHR34191      FAMILY NOT NAMED    coord: 1..34, score: 6.3E-27; coord: 50..105, score: 6.3
None      No IPR available  PANTHER  PTHR34191:SF3  F6D8.10-RELATED     coord: 50..105, score: 6.3E-27; coord: 1..34, score: 6.3

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                   Block
CmoCh08G006350  CmoCh17G008150  Cucurbita moschata (Rifu)  cmocmoB312