CmaCh14G015560 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G015560
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Late embryogenesis abundant protein D-29
Location: Cma_Chr14 : 11704826 .. 11706158 (-)
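
The Location field packs the chromosome, the start and end coordinates, and the strand into one string. The sketch below shows one way to split it and derive the span length. It assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location such as 'Cma_Chr14 : 11704826 .. 11706158 (-)'.

    Returns (chromosome, start, end, strand, length). The length assumes
    1-based inclusive coordinates; that is an assumption, not something
    the record itself states.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, strand, end - start + 1

print(parse_location("Cma_Chr14 : 11704826 .. 11706158 (-)"))
```

Under that assumption the feature spans 1333 bp on the minus strand of Cma_Chr14.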
The following sequences are available for this feature:

Gene sequence (with intron)

AATTCCCCATGTCGCCGGAGGATAATTTTCTTCACTCTACTTGACGATTTCACTTATTTACTTTCAATCCATCAACCCAAATGCCTTCGATTCAAGTTCACCGTTGCCGGATTTCTCTACCGGTGGCGGTAGCGGTGGCGTTGCTGGTTTTTACGGTGAGTTTTTGCTGGAGCGAGAACGTGGTGCACGTGCCGTCGACGGAGGAGGCTCGAGATTACCAAGAAATGAAGGTCGAGATGGCGGCGAAAGATGAAAAGGAAAAGGATCAGTCGGCGGAGACTTGGACGGAATGGGCGAAAGAGAAGATATCCGGCGGACTTGGACTGAAAAATGAACGGCAAGAAGATGGCGTAAAGAAAGTTACTGATTTCACTTCCGGTGCCGCTGAAAAGGCCAAGGACAAAATGGGGAATGCCGCCTCAGGTTCTGTTCTTAATAAATTTTTATTTTGAAATTTTTGAACAAATTTCTTCGAAATTGGAAGTTTCTAGAGTATTTTCTGTGAATTTACTAAAAAGATATTTTCCTCTCCGATAGAAGCGGGACATTGCGGTGGTGAGAAGGGCAGAGAAGTGAAGGACACGGCGGCGGAGAAGGCCGGGAATGCCAAGGACAAGGCGGCCGAAATGGCGACCACGGCATCCGGGAAGACAACCGAAGGAGCAGAGAAGGCGAAAGAATACGCATCCAATGCAGCAAAAGCAGCGAAGGAAAAAACCGCCTCCCTGAAGAACAGAGGCGAGGAAGTCGCCGGTGAGGCGGCGGAGAAAACGAGAGAAGCGGAGGAGGCAGCGAAGAAGAAAACAGAGGAAACAACGGAGGCGGCGAAGGAGAAAGCGAAAGAGAAGGGGAAGGAGGCGAAGGAGAGCGCGGAGGCGGCGGCGGGAAGAGCAGAGGAGGCGAAGGAGAAGATGAAGAGCTGGGCGAAGGACGGGTACGAGGCGGCGAAGGAGAAGATAGGGGAGCAATACGAGGCGGCGAAGGAGAAGATGGAAGAGCAATACGAGGCAGCGAAGAAGAAGTCGCAGAGGATCAAAGACGACGTGGTTTGGTCGGAGGCGGAGATCGGGGCGGACGACGAGCTGTGAATAACTCCGGCACAGTGGCGGGAGTGTATGTTCATATTATAATGTAGAGAAGAGAGTTCCGTTTCAAGATTTCAATAAATAGTTTTTAATGGTTTATAATTATTTATTTTTGAACTTTTATTATTATTAATAATGCTTCATTTATTTAATAAATGAGACACGAAATAAATGCGAACAAAATAAAATAAAATAAAATAAAATAAAGATAAAAAAGATAAAAGCACTTTTTAGTAAGTACTTTCCCA

mRNA sequence

AATTCCCCATGTCGCCGGAGGATAATTTTCTTCACTCTACTTGACGATTTCACTTATTTACTTTCAATCCATCAACCCAAATGCCTTCGATTCAAGTTCACCGTTGCCGGATTTCTCTACCGGTGGCGGTAGCGGTGGCGTTGCTGGTTTTTACGGTGAGTTTTTGCTGGAGCGAGAACGTGGTGCACGTGCCGTCGACGGAGGAGGCTCGAGATTACCAAGAAATGAAGGTCGAGATGGCGGCGAAAGATGAAAAGGAAAAGGATCAGTCGGCGGAGACTTGGACGGAATGGGCGAAAGAGAAGATATCCGGCGGACTTGGACTGAAAAATGAACGGCAAGAAGATGGCGTAAAGAAAGTTACTGATTTCACTTCCGGTGCCGCTGAAAAGGCCAAGGACAAAATGGGGAATGCCGCCTCAGAAGCGGGACATTGCGGTGGTGAGAAGGGCAGAGAAGTGAAGGACACGGCGGCGGAGAAGGCCGGGAATGCCAAGGACAAGGCGGCCGAAATGGCGACCACGGCATCCGGGAAGACAACCGAAGGAGCAGAGAAGGCGAAAGAATACGCATCCAATGCAGCAAAAGCAGCGAAGGAAAAAACCGCCTCCCTGAAGAACAGAGGCGAGGAAGTCGCCGGTGAGGCGGCGGAGAAAACGAGAGAAGCGGAGGAGGCAGCGAAGAAGAAAACAGAGGAAACAACGGAGGCGGCGAAGGAGAAAGCGAAAGAGAAGGGGAAGGAGGCGAAGGAGAGCGCGGAGGCGGCGGCGGGAAGAGCAGAGGAGGCGAAGGAGAAGATGAAGAGCTGGGCGAAGGACGGGTACGAGGCGGCGAAGGAGAAGATAGGGGAGCAATACGAGGCGGCGAAGGAGAAGATGGAAGAGCAATACGAGGCAGCGAAGAAGAAGTCGCAGAGGATCAAAGACGACGTGGTTTGGTCGGAGGCGGAGATCGGGGCGGACGACGAGCTGTGAATAACTCCGGCACAGTGGCGGGAGTGTATGTTCATATTATAATGTAGAGAAGAGAGTTCCGTTTCAAGATTTCAATAAATAGTTTTTAATGGTTTATAATTATTTATTTTTGAACTTTTATTATTATTAATAATGCTTCATTTATTTAATAAATGAGACACGAAATAAATGCGAACAAAATAAAATAAAATAAAATAAAATAAAGATAAAAAAGATAAAAGCACTTTTTAGTAAGTACTTTCCCA

Coding sequence (CDS)

ATGCCTTCGATTCAAGTTCACCGTTGCCGGATTTCTCTACCGGTGGCGGTAGCGGTGGCGTTGCTGGTTTTTACGGTGAGTTTTTGCTGGAGCGAGAACGTGGTGCACGTGCCGTCGACGGAGGAGGCTCGAGATTACCAAGAAATGAAGGTCGAGATGGCGGCGAAAGATGAAAAGGAAAAGGATCAGTCGGCGGAGACTTGGACGGAATGGGCGAAAGAGAAGATATCCGGCGGACTTGGACTGAAAAATGAACGGCAAGAAGATGGCGTAAAGAAAGTTACTGATTTCACTTCCGGTGCCGCTGAAAAGGCCAAGGACAAAATGGGGAATGCCGCCTCAGAAGCGGGACATTGCGGTGGTGAGAAGGGCAGAGAAGTGAAGGACACGGCGGCGGAGAAGGCCGGGAATGCCAAGGACAAGGCGGCCGAAATGGCGACCACGGCATCCGGGAAGACAACCGAAGGAGCAGAGAAGGCGAAAGAATACGCATCCAATGCAGCAAAAGCAGCGAAGGAAAAAACCGCCTCCCTGAAGAACAGAGGCGAGGAAGTCGCCGGTGAGGCGGCGGAGAAAACGAGAGAAGCGGAGGAGGCAGCGAAGAAGAAAACAGAGGAAACAACGGAGGCGGCGAAGGAGAAAGCGAAAGAGAAGGGGAAGGAGGCGAAGGAGAGCGCGGAGGCGGCGGCGGGAAGAGCAGAGGAGGCGAAGGAGAAGATGAAGAGCTGGGCGAAGGACGGGTACGAGGCGGCGAAGGAGAAGATAGGGGAGCAATACGAGGCGGCGAAGGAGAAGATGGAAGAGCAATACGAGGCAGCGAAGAAGAAGTCGCAGAGGATCAAAGACGACGTGGTTTGGTCGGAGGCGGAGATCGGGGCGGACGACGAGCTGTGA

Protein sequence

MPSIQVHRCRISLPVAVAVALLVFTVSFCWSENVVHVPSTEEARDYQEMKVEMAAKDEKEKDQSAETWTEWAKEKISGGLGLKNERQEDGVKKVTDFTSGAAEKAKDKMGNAASEAGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNAAKAAKEKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAKEKAKEKGKEAKESAEAAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKSQRIKDDVVWSEAEIGADDEL
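
The protein sequence is the conceptual translation of the CDS listed above it. As a minimal sketch, assuming the standard nuclear genetic code, the relationship can be checked with a small translator. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code in the usual compact TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS shown above.
print(translate("ATGCCTTCGATTCAAGTTCAC"))
print(translate("GACGAGCTGTGA"))
```

The two fragments translate to the start (MPSIQVH) and the end (DEL) of the protein sequence; passing the full CDS string to `translate` is the way to check the whole record.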
BLAST of CmaCh14G015560 vs. Swiss-Prot
Match: LEA29_GOSHI (Late embryogenesis abundant protein D-29 OS=Gossypium hirsutum PE=3 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 5.7e-17
Identity = 107/302 (35.43%), Positives = 146/302 (48.34%), Query Frame = 1

Query: 19  VALLVFTVSFCWSENVVHVPSTEE-ARDYQEMKVE------------MAAKDE------- 78
           +  LV TV+      V H+PST+E ARDY ++K +              AKDE       
Sbjct: 8   IVFLVLTVASVRCTTVDHMPSTDEDARDYSKLKTKTEEATDEHHSRTQQAKDELKSKADH 67

Query: 79  --------------------KEKDQSAETWTEWAKEKISGGLGLKNERQEDG-VKKVTDF 138
                               KE  +  E+WTEWAKEKIS GLG K +    G V+K  D 
Sbjct: 68  AANEVKSNTQQAKDRASEVGKEAKEYTESWTEWAKEKISEGLGFKQDDDPKGSVEKAFDS 127

Query: 139 TSGAAEKAKDKMGNAASEAGHCGGEKGREVKDTAAEKAGN----AKDKAAEMATTASGKT 198
            +  A K KDK+ + AS AG     K +++KDTA +K  +    AK K++EM    + K 
Sbjct: 128 VADTATKTKDKLQDMASGAGEYSAGKAKDMKDTAYKKTDDVKNAAKGKSSEMRQATTEKA 187

Query: 199 TEGAEKAKEYASNAAKAAKEKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAKE 258
            E A+ AKE A+ A  AAKEK         ++A   +E T EA+E   +K EE  E   E
Sbjct: 188 RELADSAKENANTAYIAAKEKV-------RDMADRTSEMTNEAQERGARKAEEAKEVVAE 247

Query: 259 KAKEKGKEAKESAEAAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAA 276
           KA+   +E K+  E      + AKEK    AK GY+AAK K  E  E+AK+ +   YE+ 
Sbjct: 248 KAEGAAEETKKKNEERGESLKWAKEK----AKQGYDAAKSKAEETIESAKDTIASGYESR 298
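
The percentages in the HSP header are ratios over the alignment length. As a small worked check, assuming the usual BLAST convention that the denominator counts every alignment column (gaps included), the figures reported for the Swiss-Prot hit can be reproduced as follows. The helper name `percent` is hypothetical.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to 2 decimals."""
    return round(100.0 * count / aligned_length, 2)

# Counts taken from the HSP header of the LEA29_GOSHI hit.
identity = percent(107, 302)   # identical residues / alignment columns
positives = percent(146, 302)  # positive-scoring residues / alignment columns
print(identity, positives)
```

The same function applies to every other HSP header on this page.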

BLAST of CmaCh14G015560 vs. TrEMBL
Match: A0A0A0LAV6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G171190 PE=4 SV=1)

HSP 1 Score: 290.0 bits (741), Expect = 3.2e-75
Identity = 203/313 (64.86%), Positives = 226/313 (72.20%), Query Frame = 1

Query: 3   SIQVHRCRISLP---VAVAVALLVFTVSFCWSENVVHVPST-EEARDYQEMKVEMAAKDE 62
           SI+V+R RISLP   +AV V LLVF VS CWS +V H+PST EEARDYQEMK +      
Sbjct: 8   SIRVYRRRISLPAVAMAVTVVLLVFMVSICWSGSVGHMPSTTEEARDYQEMKSKA----- 67

Query: 63  KEKDQSAETWTEWAKEKISGGLGLKNERQED---GVKKVTDFTSGAAEKAKDKMGNAASE 122
           +EKDQ+ ETWTEWAKEKI+GGLGLK+ERQED   GVKKVTDFTS +A+KAKDK+ N AS 
Sbjct: 68  EEKDQTGETWTEWAKEKITGGLGLKSERQEDDEGGVKKVTDFTSDSAKKAKDKIQNVASG 127

Query: 123 AGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNAAKAAKEKT 182
            G  G EK  EVK  AAEKAG AKDKAA++ T A  KTTE A+KAKE A NAAK  KEK 
Sbjct: 128 VGQYGAEKAEEVKGMAAEKAGEAKDKAAKLGTVAE-KTTEAADKAKEKAQNAAKGTKEKV 187

Query: 183 ASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAK------------------EKAKE 242
            SLKN+ EE +GEA EKT+EA   A+KKTEET E AK                  EKAK 
Sbjct: 188 TSLKNKAEESSGEATEKTKEAANEARKKTEETAEEAKERASTGAREAEERAGEMKEKAKV 247

Query: 243 KGKEAKESAEAAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKS 291
           KGKEAKE AE  AGRAEE  EK K WAK+G+EAAKEK  E  EAAKEK+ EQYEAAKKKS
Sbjct: 248 KGKEAKERAEEEAGRAEEIAEKGKRWAKEGFEAAKEKAEEVVEAAKEKIGEQYEAAKKKS 307

BLAST of CmaCh14G015560 vs. TrEMBL
Match: E5GC45_CUCME (Late embryogenesis abundant protein D-29 OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 2.2e-60
Identity = 191/345 (55.36%), Positives = 217/345 (62.90%), Query Frame = 1

Query: 3   SIQVHRCRISLP---VAVAVALLVFTVSFCWSENVVHVPSTEE-ARDYQEMKVEMAAKDE 62
           SI+V+R RISLP   +AV   LLVF V+ CWS +V H+PSTEE ARDY EMK ++AAKD+
Sbjct: 5   SIRVYRRRISLPAVAMAVTAVLLVFMVTICWSGSVGHMPSTEEDARDYHEMKSKVAAKDD 64

Query: 63  K-EKDQSAETWTEWAKEKISGGLGLKNERQED-----GVKKVTDFTSGAAEKAKDKMGNA 122
           K EKDQ+AETWTEWAKEKI+GGLGLKNER+ED     GVKKVTDFTS +A KAKDK+ N 
Sbjct: 65  KKEKDQTAETWTEWAKEKITGGLGLKNEREEDQDDDGGVKKVTDFTSDSALKAKDKIQNV 124

Query: 123 ASEAGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNAAKAAK 182
           AS  G  G EK  EVK  AAEKAG AKDKAA++ T    KTTE AEKAKE A NAAK  K
Sbjct: 125 ASGVGQYGTEKAEEVKGMAAEKAGEAKDKAAKVGT----KTTEAAEKAKEKAYNAAKETK 184

Query: 183 EKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAA------------------KEK 242
           +K  SLKN+  E +GEA EK +E    AKKKTEET E A                  KEK
Sbjct: 185 DKVTSLKNKAVETSGEATEKAKEVGNEAKKKTEETAEEAKERASTGAKEVEERAGEMKEK 244

Query: 243 AKEKGKEAKESAEAAA----GRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKE------ 291
           AK K KEAKE A   A     RA E KEK K   K+  E A+E+ G   E A++      
Sbjct: 245 AKVKEKEAKERASTGAKEVEERAGEMKEKAKVKEKEAKERAEEEAGRAEEVAEKGKRWAK 304

BLAST of CmaCh14G015560 vs. TrEMBL
Match: A0A061EQH6_THECC (Late embryogenesis abundant protein D-29, putative OS=Theobroma cacao GN=TCM_021499 PE=4 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 7.4e-24
Identity = 118/307 (38.44%), Positives = 166/307 (54.07%), Query Frame = 1

Query: 15  VAVAVALLVFTVSFCWSENVVHVPSTEEAR-DYQEMKV---------------------- 74
           + +  A+LV T +     +V H+PSTEE   DY ++K                       
Sbjct: 32  ILLGAAVLVLTAASVSCTSVDHMPSTEEEEIDYAKLKSKTQQAKNEMQSKTQQAANEVKG 91

Query: 75  ------EMAAKDEKEKDQSAETWTEWAKEKISGGLGLKNERQEDGVKKVTDFTSGAAEKA 134
                 E A++ EKE  +S E+WTEWAKE+IS GLG K +  +D     +D   G A KA
Sbjct: 92  KTQQAKEKASEMEKEAKESTESWTEWAKERISEGLGFKQDHTKD-----SDSLPGTATKA 151

Query: 135 KDKMGNAASEAGHCGGEKGREVKDTAAEKAGN----AKDKAAEMATTASGKTTEGAEKAK 194
           K+K    AS AG   G+K R++K+TA++KAG+    AK+K  E    A  K  E    AK
Sbjct: 152 KEKAQEVASGAGEYIGDKARDMKNTASKKAGDVTNAAKEKTTETENAAVEKAGELTNAAK 211

Query: 195 EYASNAAKAAKEKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAKEKAKEKGKE 254
           E A  A+ AA+E T S K++  E+AG   E T E ++ A +K EE   AA EKAK+K  E
Sbjct: 212 EKADTASNAAREATTSAKDKVSEMAGTTREMTNEDKDRAAQKGEEARVAAAEKAKKKKAE 271

Query: 255 AKESAEAAAGRAEE----AKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKS 285
            +E+   A  +A+E    AK K K  AK+GY+ AK K  E  ++AK+ +   Y AAK+KS
Sbjct: 272 TEENLSWAKEKAKEGYDAAKSKAKEKAKEGYDTAKSKAEEASKSAKDTIASSYVAAKQKS 331

BLAST of CmaCh14G015560 vs. TrEMBL
Match: M5XEI7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008651mg PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 2.2e-23
Identity = 125/315 (39.68%), Positives = 172/315 (54.60%), Query Frame = 1

Query: 19  VALLVFTVSFCWSENVVHVPSTEEARDYQEMKVE-MAAKDEKEKDQSA--ETWTEWAKEK 78
           VA+++ T+  C    V H  S++E    +  + +  AA+ E+E  +S+  E+W EWAKEK
Sbjct: 16  VAVMLATI--CRGSGVDHTSSSDEGFKVKTRQPQDEAAEKEREGRESSDSESWAEWAKEK 75

Query: 79  ISGGLGLKNERQEDGVKKVTDFTSGAAEKAKDKMGNAASEAGHCGGEKGREVKDTAAEKA 138
           ISGGLGLK +  ++  KK +D     A+  KDK+ + AS  G    EK +++KDTAAEKA
Sbjct: 76  ISGGLGLKQDDDDENNKKASDAAYDTAKNTKDKVQDTASGTGQYTTEKAKDIKDTAAEKA 135

Query: 139 GNAKDKAAEMA---------------TTASGKTTEGAEKAKEYASNAAKAAKEKTASLKN 198
              K+ AAE A                    K +E    AKE A  A KAA+++T   KN
Sbjct: 136 REVKEAAAEKAFEVEKAAKEKAYEATKAVKDKASEATNAAKEKAYEATKAAEDETYETKN 195

Query: 199 RGEEVAGEAAEKTREAEEA-----------AKKKTEETTEAAKEKAKEK----GKEAKES 258
             EE A +AAEK  EA++            A +K EET EA KEKAK+K     KEA E 
Sbjct: 196 AAEETASKAAEKANEAKQKVGQTAEEIKNKAYEKAEETKEA-KEKAKQKAEEVNKEAYEE 255

Query: 259 AE---AAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKSQRIKD 298
           AE    A  + EE K K    AK+GYEAAK+K  E  ++ K+ +   +EAAK+ SQ+IK+
Sbjct: 256 AEETKEAKPKTEEVKNKAAHKAKEGYEAAKKKAEETVKSTKDTVASNFEAAKQTSQKIKE 315

BLAST of CmaCh14G015560 vs. TrEMBL
Match: A0A0D2SC73_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G072700 PE=4 SV=1)

HSP 1 Score: 112.8 bits (281), Expect = 7.0e-22
Identity = 117/316 (37.03%), Positives = 161/316 (50.95%), Query Frame = 1

Query: 19  VALLVFTVSFCWSENVVHVPSTEE-ARDYQEMKVE------------MAAKDE------- 78
           +  LV T +      V H+PST+E ARDY ++K +              AKDE       
Sbjct: 8   IVFLVLTAASVRCTTVDHMPSTDEDARDYSKLKTKTEEATDEHHSRTQQAKDELKSKADH 67

Query: 79  --------------------KEKDQSAETWTEWAKEKISGGLGLKNERQEDG-VKKVTDF 138
                               KE  +  E+WTEWAKEKIS GLG K +    G V+K +D 
Sbjct: 68  AANEVKSNTQQAKDRASEVEKEAKEYTESWTEWAKEKISEGLGFKQDDDPKGSVEKASDS 127

Query: 139 TSGAAEKAKDKMGNAASEAGHCGGEKGREVKDTAAEKAGN----AKDKAAEMATTASGKT 198
            +  A K KDK+ + AS AG     K +++KDTA +K  +    AK K++EM    + K 
Sbjct: 128 VADTATKTKDKLQDMASGAGEYSAGKAKDMKDTAYKKTDDVKNAAKGKSSEMRQATTEKA 187

Query: 199 TEGAEKAKEYASNAAKAAKEKTASLKNRGEEVAGEAAE----KTREAEEAAKKKTEETTE 258
            E A+ AKE A+ A  AAKEK   + +R  E+  EA E    K  EA+E A +K +   E
Sbjct: 188 RELADSAKENANTAYIAAKEKVRDMADRTSEMTNEAQERAARKAEEAKEVAAEKAKGAAE 247

Query: 259 AAKEKAKEKGKEAKESAEAAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQ 286
             K+K +E G+  K + E A    +   EK K  AK GY+AAK K GE  E+AK+ +   
Sbjct: 248 ETKKKNEETGESLKWAKEKAKQGYDATTEKAKETAKQGYDAAKSKAGETIESAKDTIASG 307

BLAST of CmaCh14G015560 vs. TAIR10
Match: AT3G53040.1 (AT3G53040.1 late embryogenesis abundant protein, putative / LEA protein, putative)

HSP 1 Score: 55.8 bits (133), Expect = 5.1e-08
Identity = 89/255 (34.90%), Positives = 122/255 (47.84%), Query Frame = 1

Query: 40  TEEARDYQEMKVEMAAKDEKEKDQSAETWTEWAKEKISGGLGLKNERQEDGVKKVTDFTS 99
           T+E  DY   K   A   +K  D++ ET  ++A EK         +R  D  K+  ++T+
Sbjct: 109 TKETADYTADKAREAK--DKTADKTKET-ADYAAEKAREA----KDRTADKTKETAEYTA 168

Query: 100 GAAEKAKDKMGNAASEAGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEK 159
             A +AKDK  +   E      EK +E KDT AEK G  KD   + A  A  KT E A++
Sbjct: 169 EKAREAKDKTADKLGEYKDYTAEKAKEAKDTTAEKLGEYKDYTVDKAKEAKDKTAEKAKE 228

Query: 160 AKEYASNAAKAAKEKTAS----LKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAKEKA 219
             EY S+ A+  K+KTA      K+   E A E A+K REA++   +K  E  +   EKA
Sbjct: 229 TAEYTSDKARETKDKTAEKVGEYKDYTAEKAKETADKAREAKDKTAEKVGEYRDYTAEKA 288

Query: 220 KE-------KGKEAKESAEAAA--------GRAEEAKEKMKSWAKDGYEAAKEKIGEQYE 276
            E       K  E K+SA   A        G+ EE K+K    A +  + AKEK+ E  E
Sbjct: 289 TETKDAGVSKIGELKDSAVDTAKRAMGFLSGKTEETKQK----AVETKDTAKEKMDEAGE 348

BLAST of CmaCh14G015560 vs. TAIR10
Match: AT5G44310.2 (AT5G44310.2 Late embryogenesis abundant protein (LEA) family protein)

HSP 1 Score: 55.1 bits (131), Expect = 8.7e-08
Identity = 78/257 (30.35%), Positives = 122/257 (47.47%), Query Frame = 1

Query: 51  VEMAAKDEKEKDQSAETWTEWAKEKISGGLGLKNERQEDGVKKVTDF---TSGAAEKAKD 110
           VE A     +    ++ W E + E    G G  ++ +E+   K  D    T   AE+ K+
Sbjct: 54  VEKARDSRADLAYDSKKWREESGEYAEAGKGKAHKTKEEAKDKAYDMKERTKDYAEQTKN 113

Query: 111 KMGNAASEAGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNA 170
           K+   AS A     +K  E K+ A +KA + K+K  + A  A  K  EGA +A + A   
Sbjct: 114 KVNEGASRAA----DKAYETKEKAKDKAYDVKEKTKDYAEEAKDKVNEGASRAADKAYET 173

Query: 171 AKAAKEKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAKEKAKEKGKEAKESAE 230
            + AK+K   +K + ++ A E  EK  E    A  K  +  E  K  A++   +  E A 
Sbjct: 174 KEKAKDKAYDVKEKTKDFAEETKEKVNEGASRAADKAYDVKEKTKNYAEQTKDKVNEGAS 233

Query: 231 AAAGRAEEAKEKMKSWAKDGYEAA-------KEKIGEQYEAAKEKMEEQYEAAKKKSQRI 290
            AA +AEE K+K K +A+D  E A       KEK  +  E   + +++ +E AK  +Q++
Sbjct: 234 RAADKAEETKDKAKDYAEDSKEKAEDMAHGFKEKAQDIGEKTMDTVKDVWETAKSTAQKV 293

Query: 291 KDDVVWS--EAEIGADD 296
            + VV S  EA+   DD
Sbjct: 294 TEAVVGSGEEADKARDD 306

BLAST of CmaCh14G015560 vs. NCBI nr
Match: gi|778679175|ref|XP_004147633.2| (PREDICTED: late embryogenesis abundant protein D-29 [Cucumis sativus])

HSP 1 Score: 290.0 bits (741), Expect = 4.6e-75
Identity = 203/313 (64.86%), Positives = 226/313 (72.20%), Query Frame = 1

Query: 3   SIQVHRCRISLP---VAVAVALLVFTVSFCWSENVVHVPST-EEARDYQEMKVEMAAKDE 62
           SI+V+R RISLP   +AV V LLVF VS CWS +V H+PST EEARDYQEMK +      
Sbjct: 8   SIRVYRRRISLPAVAMAVTVVLLVFMVSICWSGSVGHMPSTTEEARDYQEMKSKA----- 67

Query: 63  KEKDQSAETWTEWAKEKISGGLGLKNERQED---GVKKVTDFTSGAAEKAKDKMGNAASE 122
           +EKDQ+ ETWTEWAKEKI+GGLGLK+ERQED   GVKKVTDFTS +A+KAKDK+ N AS 
Sbjct: 68  EEKDQTGETWTEWAKEKITGGLGLKSERQEDDEGGVKKVTDFTSDSAKKAKDKIQNVASG 127

Query: 123 AGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNAAKAAKEKT 182
            G  G EK  EVK  AAEKAG AKDKAA++ T A  KTTE A+KAKE A NAAK  KEK 
Sbjct: 128 VGQYGAEKAEEVKGMAAEKAGEAKDKAAKLGTVAE-KTTEAADKAKEKAQNAAKGTKEKV 187

Query: 183 ASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAK------------------EKAKE 242
            SLKN+ EE +GEA EKT+EA   A+KKTEET E AK                  EKAK 
Sbjct: 188 TSLKNKAEESSGEATEKTKEAANEARKKTEETAEEAKERASTGAREAEERAGEMKEKAKV 247

Query: 243 KGKEAKESAEAAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKS 291
           KGKEAKE AE  AGRAEE  EK K WAK+G+EAAKEK  E  EAAKEK+ EQYEAAKKKS
Sbjct: 248 KGKEAKERAEEEAGRAEEIAEKGKRWAKEGFEAAKEKAEEVVEAAKEKIGEQYEAAKKKS 307

BLAST of CmaCh14G015560 vs. NCBI nr
Match: gi|659077048|ref|XP_008439004.1| (PREDICTED: LOW QUALITY PROTEIN: late embryogenesis abundant protein D-29 [Cucumis melo])

HSP 1 Score: 250.8 bits (639), Expect = 3.1e-63
Identity = 195/344 (56.69%), Positives = 222/344 (64.53%), Query Frame = 1

Query: 3   SIQVHRCRISLP---VAVAVALLVFTVSFCWSENVVHVPSTEE-ARDYQEMKVEMAAKDE 62
           SI+V+R RISLP   +AV   LLVF V+ CWS +V H+PSTEE ARDY EMK ++AAKD+
Sbjct: 5   SIRVYRRRISLPAVAMAVTAVLLVFMVTICWSGSVGHMPSTEEDARDYHEMKSKVAAKDD 64

Query: 63  K-EKDQSAETWTEWAKEKISGGLGLKNERQED-----GVKKVTDFTSGAAEKAKDKMGNA 122
           K EKDQ+AETWTEWAKEKI+GGLGLKNER+ED     GVKKVTDFTS +A KAKDK+ N 
Sbjct: 65  KKEKDQTAETWTEWAKEKITGGLGLKNEREEDQDDDGGVKKVTDFTSDSALKAKDKIQNV 124

Query: 123 ASEAGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNAAKAAK 182
           AS  G  G EK  EVK  AAEKAG AKDKAA++ T    KTTE AEKAKE A NAAK  K
Sbjct: 125 ASGVGQYGTEKAEEVKGMAAEKAGEAKDKAAKVGT----KTTEAAEKAKEKAYNAAKETK 184

Query: 183 EKTASLKNRG-----------EEVAGEAAEKTREAEEAAK------KKTEETTEAAKEKA 242
           +K  SLKN+            +EV  EA +K ++AEEA +      K+ EE     KEKA
Sbjct: 185 DKVTSLKNKAVETSGEATEKAKEVGNEAKKKPKKAEEAKERASTGAKEVEERAGEMKEKA 244

Query: 243 KEKGKEAKE-----------------------------SAEAAAGRAEEAKEKMKSWAKD 291
           K K KEAKE                              AE  AGRAEE  EK K WAK+
Sbjct: 245 KVKEKEAKERASTGAKEVEERAGEMKEKAKVKEKEAKERAEEEAGRAEEVAEKGKRWAKE 304

BLAST of CmaCh14G015560 vs. NCBI nr
Match: gi|307136205|gb|ADN34043.1| (late embryogenesis abundant protein D-29 [Cucumis melo subsp. melo])

HSP 1 Score: 240.7 bits (613), Expect = 3.2e-60
Identity = 191/345 (55.36%), Positives = 217/345 (62.90%), Query Frame = 1

Query: 3   SIQVHRCRISLP---VAVAVALLVFTVSFCWSENVVHVPSTEE-ARDYQEMKVEMAAKDE 62
           SI+V+R RISLP   +AV   LLVF V+ CWS +V H+PSTEE ARDY EMK ++AAKD+
Sbjct: 5   SIRVYRRRISLPAVAMAVTAVLLVFMVTICWSGSVGHMPSTEEDARDYHEMKSKVAAKDD 64

Query: 63  K-EKDQSAETWTEWAKEKISGGLGLKNERQED-----GVKKVTDFTSGAAEKAKDKMGNA 122
           K EKDQ+AETWTEWAKEKI+GGLGLKNER+ED     GVKKVTDFTS +A KAKDK+ N 
Sbjct: 65  KKEKDQTAETWTEWAKEKITGGLGLKNEREEDQDDDGGVKKVTDFTSDSALKAKDKIQNV 124

Query: 123 ASEAGHCGGEKGREVKDTAAEKAGNAKDKAAEMATTASGKTTEGAEKAKEYASNAAKAAK 182
           AS  G  G EK  EVK  AAEKAG AKDKAA++ T    KTTE AEKAKE A NAAK  K
Sbjct: 125 ASGVGQYGTEKAEEVKGMAAEKAGEAKDKAAKVGT----KTTEAAEKAKEKAYNAAKETK 184

Query: 183 EKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAA------------------KEK 242
           +K  SLKN+  E +GEA EK +E    AKKKTEET E A                  KEK
Sbjct: 185 DKVTSLKNKAVETSGEATEKAKEVGNEAKKKTEETAEEAKERASTGAKEVEERAGEMKEK 244

Query: 243 AKEKGKEAKESAEAAA----GRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKE------ 291
           AK K KEAKE A   A     RA E KEK K   K+  E A+E+ G   E A++      
Sbjct: 245 AKVKEKEAKERASTGAKEVEERAGEMKEKAKVKEKEAKERAEEEAGRAEEVAEKGKRWAK 304

BLAST of CmaCh14G015560 vs. NCBI nr
Match: gi|590662617|ref|XP_007035997.1| (Late embryogenesis abundant protein D-29, putative [Theobroma cacao])

HSP 1 Score: 119.4 bits (298), Expect = 1.1e-23
Identity = 118/307 (38.44%), Positives = 166/307 (54.07%), Query Frame = 1

Query: 15  VAVAVALLVFTVSFCWSENVVHVPSTEEAR-DYQEMKV---------------------- 74
           + +  A+LV T +     +V H+PSTEE   DY ++K                       
Sbjct: 32  ILLGAAVLVLTAASVSCTSVDHMPSTEEEEIDYAKLKSKTQQAKNEMQSKTQQAANEVKG 91

Query: 75  ------EMAAKDEKEKDQSAETWTEWAKEKISGGLGLKNERQEDGVKKVTDFTSGAAEKA 134
                 E A++ EKE  +S E+WTEWAKE+IS GLG K +  +D     +D   G A KA
Sbjct: 92  KTQQAKEKASEMEKEAKESTESWTEWAKERISEGLGFKQDHTKD-----SDSLPGTATKA 151

Query: 135 KDKMGNAASEAGHCGGEKGREVKDTAAEKAGN----AKDKAAEMATTASGKTTEGAEKAK 194
           K+K    AS AG   G+K R++K+TA++KAG+    AK+K  E    A  K  E    AK
Sbjct: 152 KEKAQEVASGAGEYIGDKARDMKNTASKKAGDVTNAAKEKTTETENAAVEKAGELTNAAK 211

Query: 195 EYASNAAKAAKEKTASLKNRGEEVAGEAAEKTREAEEAAKKKTEETTEAAKEKAKEKGKE 254
           E A  A+ AA+E T S K++  E+AG   E T E ++ A +K EE   AA EKAK+K  E
Sbjct: 212 EKADTASNAAREATTSAKDKVSEMAGTTREMTNEDKDRAAQKGEEARVAAAEKAKKKKAE 271

Query: 255 AKESAEAAAGRAEE----AKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKS 285
            +E+   A  +A+E    AK K K  AK+GY+ AK K  E  ++AK+ +   Y AAK+KS
Sbjct: 272 TEENLSWAKEKAKEGYDAAKSKAKEKAKEGYDTAKSKAEEASKSAKDTIASSYVAAKQKS 331

BLAST of CmaCh14G015560 vs. NCBI nr
Match: gi|596147489|ref|XP_007222687.1| (hypothetical protein PRUPE_ppa008651mg [Prunus persica])

HSP 1 Score: 117.9 bits (294), Expect = 3.1e-23
Identity = 125/315 (39.68%), Positives = 172/315 (54.60%), Query Frame = 1

Query: 19  VALLVFTVSFCWSENVVHVPSTEEARDYQEMKVE-MAAKDEKEKDQSA--ETWTEWAKEK 78
           VA+++ T+  C    V H  S++E    +  + +  AA+ E+E  +S+  E+W EWAKEK
Sbjct: 16  VAVMLATI--CRGSGVDHTSSSDEGFKVKTRQPQDEAAEKEREGRESSDSESWAEWAKEK 75

Query: 79  ISGGLGLKNERQEDGVKKVTDFTSGAAEKAKDKMGNAASEAGHCGGEKGREVKDTAAEKA 138
           ISGGLGLK +  ++  KK +D     A+  KDK+ + AS  G    EK +++KDTAAEKA
Sbjct: 76  ISGGLGLKQDDDDENNKKASDAAYDTAKNTKDKVQDTASGTGQYTTEKAKDIKDTAAEKA 135

Query: 139 GNAKDKAAEMA---------------TTASGKTTEGAEKAKEYASNAAKAAKEKTASLKN 198
              K+ AAE A                    K +E    AKE A  A KAA+++T   KN
Sbjct: 136 REVKEAAAEKAFEVEKAAKEKAYEATKAVKDKASEATNAAKEKAYEATKAAEDETYETKN 195

Query: 199 RGEEVAGEAAEKTREAEEA-----------AKKKTEETTEAAKEKAKEK----GKEAKES 258
             EE A +AAEK  EA++            A +K EET EA KEKAK+K     KEA E 
Sbjct: 196 AAEETASKAAEKANEAKQKVGQTAEEIKNKAYEKAEETKEA-KEKAKQKAEEVNKEAYEE 255

Query: 259 AE---AAAGRAEEAKEKMKSWAKDGYEAAKEKIGEQYEAAKEKMEEQYEAAKKKSQRIKD 298
           AE    A  + EE K K    AK+GYEAAK+K  E  ++ K+ +   +EAAK+ SQ+IK+
Sbjct: 256 AEETKEAKPKTEEVKNKAAHKAKEGYEAAKKKAEETVKSTKDTVASNFEAAKQTSQKIKE 315

The following BLAST results are available for this feature:

BLAST vs. Swiss-Prot
Match Name | E-value | Identity (%) | Description
LEA29_GOSHI | 5.7e-17 | 35.43 | Late embryogenesis abundant protein D-29 OS=Gossypium hirsutum PE=3 SV=1

BLAST vs. TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0LAV6_CUCSA | 3.2e-75 | 64.86 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G171190 PE=4 SV=1
E5GC45_CUCME | 2.2e-60 | 55.36 | Late embryogenesis abundant protein D-29 OS=Cucumis melo subsp. melo PE=4 SV=1
A0A061EQH6_THECC | 7.4e-24 | 38.44 | Late embryogenesis abundant protein D-29, putative OS=Theobroma cacao GN=TCM_021...
M5XEI7_PRUPE | 2.2e-23 | 39.68 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008651mg PE=4 SV=1
A0A0D2SC73_GOSRA | 7.0e-22 | 37.03 | Uncharacterized protein OS=Gossypium raimondii GN=B456_005G072700 PE=4 SV=1

BLAST vs. TAIR10
Match Name | E-value | Identity (%) | Description
AT3G53040.1 | 5.1e-08 | 34.90 | late embryogenesis abundant protein, putative / LEA protein, putativ...
AT5G44310.2 | 8.7e-08 | 30.35 | Late embryogenesis abundant protein (LEA) family protein

BLAST vs. NCBI nr
Match Name | E-value | Identity (%) | Description
gi|778679175|ref|XP_004147633.2| | 4.6e-75 | 64.86 | PREDICTED: late embryogenesis abundant protein D-29 [Cucumis sativus]
gi|659077048|ref|XP_008439004.1| | 3.1e-63 | 56.69 | PREDICTED: LOW QUALITY PROTEIN: late embryogenesis abundant protein D-29 [Cucumi...
gi|307136205|gb|ADN34043.1| | 3.2e-60 | 55.36 | late embryogenesis abundant protein D-29 [Cucumis melo subsp. melo]
gi|590662617|ref|XP_007035997.1| | 1.1e-23 | 38.44 | Late embryogenesis abundant protein D-29, putative [Theobroma cacao]
gi|596147489|ref|XP_007222687.1| | 3.1e-23 | 39.68 | hypothetical protein PRUPE_ppa008651mg [Prunus persica]

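
The summary lists E-values in scientific notation, so hits can be filtered and ranked numerically. The sketch below uses the TrEMBL rows from the summary; the function name `rank_hits` and the default cutoff of 1e-5 are illustrative assumptions, not settings used by the database.

```python
# (match name, E-value as printed, percent identity) for the TrEMBL hits.
trembl_hits = [
    ("A0A0A0LAV6_CUCSA", "3.2e-75", 64.86),
    ("E5GC45_CUCME", "2.2e-60", 55.36),
    ("A0A061EQH6_THECC", "7.4e-24", 38.44),
    ("M5XEI7_PRUPE", "2.2e-23", 39.68),
    ("A0A0D2SC73_GOSRA", "7.0e-22", 37.03),
]

def rank_hits(hits, max_evalue=1e-5):
    """Keep hits at or below max_evalue and sort them, best (smallest) first."""
    kept = [(name, float(e), ident) for name, e, ident in hits
            if float(e) <= max_evalue]
    return sorted(kept, key=lambda hit: hit[1])

for name, evalue, ident in rank_hits(trembl_hits):
    print(f"{name}\t{evalue:.1e}\t{ident:.2f}")
```

A lower E-value does not always mean higher identity: M5XEI7_PRUPE has a slightly higher percent identity than A0A061EQH6_THECC but a larger E-value.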
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003674 | molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh14G015560.1 | CmaCh14G015560.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | unknown | Coil | Coil | coord: 186..283; scor
None | No IPR available | PANTHER | PTHR23241 | LATE EMBRYOGENESIS ABUNDANT PLANTS LEA-RELATED | coord: 88..284; score: 1.0