CmaCh15G000020 (gene) Cucurbita maxima (Rimu)

Name: CmaCh15G000020
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Late embryogenesis abundant protein (LEA) family protein
Location: Cma_Chr15 : 3797 .. 5645 (-)
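
The location string can be turned into a span length with simple arithmetic. The sketch below assumes the `chrom : start .. end (strand)` layout shown in this record and assumes the coordinates are 1-based and inclusive (GFF-style); `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chrom : start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive, so add 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr15 : 3797 .. 5645 (-)")
print(loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 5645 - 3797 + 1 bases on the minus strand.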
The following sequences are available for this feature:

Gene sequence (with intron)

TATATGTACTACGTGTCACTTCTGGTTGTGCAGTATCTTCCTGGTTGCTTAATAAATAAATGTCTGCTACTTCTTTTCGACACCTTCAACTTCATCAGCATGCCCAACCTATTCGCTTTATGTCTCGTTATTACTTCTTTAACTGCAGCGGGACTCTGGTCTCCTTCCCCGGCGTCCCGGCTAGATCACGAGCAGGATGTTATTGTTAAAGAAGGTCACCGAGTGGTTGTGGTCGAGTATGGCGACCAAGGTCAACACAATACTAAGGTTTCCATCTCTTCCGAACCCACCAAAGATGCCTCCCCTAGCAACCCACTACATGATAGTTTAAATGTTGGGATCCCCAACGAAGACTCCGAAAGGCACCGCACCAGAGATCTTATTTGCGATGCCTTGGGCAAATGTAAGCATAAGATAGCCAGTGCTGTGGGAAAGGCTAAAGTAATGGTTTCGGAGACGGCGCAGGAGGCCCACGACGTTGGAGAGGCTGTTGCTGGTGCTTTCGATGAAGCCAAAGAGACAGTTTCAGACAAATCTCACCACGTGGGAACGTCGTTCTCAGAGAAAGGGCATCGATTGAGGGAGTCGGTTGAGAAAGCCAGAGAGGATGCCGACGAGTTCCTGGAGAAAACAAAAGAGACGGTTTTGGAGAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGAGTTGAGAGAGGGTGCAATGGAGAAAGGGAGAGAAGCAAGACAAACTGCGGAGAAGATTAAAACTGGTGGAAACAAGGTTAAGGAGAATCTGATCGGTATTCCTGGTGGAGGATTAAAGCTAGTGAATGATTCCTTCAGGTACTTGGGGTCGTTAGAGTCGTGGAAAGCGGCGATGGATGTGTTGAGTCTGTTGGGATTTAGTATGGCTTTGGGAATGGGCGTGTGGACTACCTTCATCTCTAGCTATGTGCTGGCGAGTGCGCTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCCCCTGTATTTTAGGGCCATGGCTTCCAGCATTGGGATGGCCCTATTCGGGCATCTATTCAGCCGCACAAAATGGATGTTTCCAATTCCGAAAAATGCTGAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGACCATTTTTGCCAATTCTCTCTACATGGAGCCTCAAGCCACCAAGGTATCTTCTGACCGCTGCTCCACCTTGAGTTTTTTTAGGTTTGTACTAATTCCCTGTTATTGTGTATTTATTCCTGTGCAGGTAATGTTTGAGAGATTAAAAGTGGAAAAGGAAGAAGGAAAAGGAATTGAAGACATAGCCGCTGAACCTCGGCACGCCAATGATAACCCCCCAGCAATCACAACCAGCACAGCCACACAAGTCGTAGACCGAGAGGCGGTGAAGTCCAGAATCGTGGGGTTGAATAAGAGGCTGAAGAAGCTGAATTCGTATTCATCCTTGTTAAACCTGCTCACTCTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTGTGCATCCCCTGCTAGTATAACAATATTTTTCTTTCCCTGGTTTTGTTTTTGATGTGTGCATATTTTACCTGAAGTAGACCGTGTTTAGGTGCTTGGTACCCAAGTTAAGTTAGTTTCAACTTATTGTAGGTGCTAAGTGGTTATATAATGTATTTTAGATTGTTTATTTCCTGTTTAATGAACTATTTTAGATTGTTTATTTCCTATTTAATGATTAGGTGCTAAGTTCCCCGTGTATCCTGTTTGATTGAGGAGCTCAACAACTAGAAGCCTGAAAGATCCCCACAATTAGGAGAGTAGCAGGGAGGGT

mRNA sequence

TATATGTACTACGTGTCACTTCTGGTTGTGCAGTATCTTCCTGGTTGCTTAATAAATAAATGTCTGCTACTTCTTTTCGACACCTTCAACTTCATCAGCATGCCCAACCTATTCGCTTTATGTCTCGTTATTACTTCTTTAACTGCAGCGGGACTCTGGTCTCCTTCCCCGGCGTCCCGGCTAGATCACGAGCAGGATGTTATTGTTAAAGAAGGTCACCGAGTGGTTGTGGTCGAGTATGGCGACCAAGGTCAACACAATACTAAGGTTTCCATCTCTTCCGAACCCACCAAAGATGCCTCCCCTAGCAACCCACTACATGATAGTTTAAATGTTGGGATCCCCAACGAAGACTCCGAAAGGCACCGCACCAGAGATCTTATTTGCGATGCCTTGGGCAAATGTAAGCATAAGATAGCCAGTGCTGTGGGAAAGGCTAAAGTAATGGTTTCGGAGACGGCGCAGGAGGCCCACGACGTTGGAGAGGCTGTTGCTGGTGCTTTCGATGAAGCCAAAGAGACAGTTTCAGACAAATCTCACCACGTGGGAACGTCGTTCTCAGAGAAAGGGCATCGATTGAGGGAGTCGGTTGAGAAAGCCAGAGAGGATGCCGACGAGTTCCTGGAGAAAACAAAAGAGACGGTTTTGGAGAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGAGTTGAGAGAGGGTGCAATGGAGAAAGGGAGAGAAGCAAGACAAACTGCGGAGAAGATTAAAACTGGTGGAAACAAGGTTAAGGAGAATCTGATCGGTATTCCTGGTGGAGGATTAAAGCTAGTGAATGATTCCTTCAGGTACTTGGGGTCGTTAGAGTCGTGGAAAGCGGCGATGGATGTGTTGAGTCTGTTGGGATTTAGTATGGCTTTGGGAATGGGCGTGTGGACTACCTTCATCTCTAGCTATGTGCTGGCGAGTGCGCTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCCCCTGTATTTTAGGGCCATGGCTTCCAGCATTGGGATGGCCCTATTCGGGCATCTATTCAGCCGCACAAAATGGATGTTTCCAATTCCGAAAAATGCTGAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGACCATTTTTGCCAATTCTCTCTACATGGAGCCTCAAGCCACCAAGGTAATGTTTGAGAGATTAAAAGTGGAAAAGGAAGAAGGAAAAGGAATTGAAGACATAGCCGCTGAACCTCGGCACGCCAATGATAACCCCCCAGCAATCACAACCAGCACAGCCACACAAGTCGTAGACCGAGAGGCGGTGAAGTCCAGAATCGTGGGGTTGAATAAGAGGCTGAAGAAGCTGAATTCGTATTCATCCTTGTTAAACCTGCTCACTCTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTGTGCATCCCCTGCTAGTATAACAATATTTTTCTTTCCCTGGTTTTGTTTTTGATGTGTGCATATTTTACCTGAAGTAGACCGTGTTTAGGTGCTTGGTACCCAAGTTAAGTTAGTTTCAACTTATTGTAGGTGCTAAGTGGTTATATAATGTATTTTAGATTGTTTATTTCCTGTTTAATGAACTATTTTAGATTGTTTATTTCCTATTTAATGATTAGGTGCTAAGTTCCCCGTGTATCCTGTTTGATTGAGGAGCTCAACAACTAGAAGCCTGAAAGATCCCCACAATTAGGAGAGTAGCAGGGAGGGT

Coding sequence (CDS)

ATGTACTACGTGTCACTTCTGGTTGTGCAGTATCTTCCTGGTTGCTTAATAAATAAATGTCTGCTACTTCTTTTCGACACCTTCAACTTCATCAGCATGCCCAACCTATTCGCTTTATGTCTCGTTATTACTTCTTTAACTGCAGCGGGACTCTGGTCTCCTTCCCCGGCGTCCCGGCTAGATCACGAGCAGGATGTTATTGTTAAAGAAGGTCACCGAGTGGTTGTGGTCGAGTATGGCGACCAAGGTCAACACAATACTAAGGTTTCCATCTCTTCCGAACCCACCAAAGATGCCTCCCCTAGCAACCCACTACATGATAGTTTAAATGTTGGGATCCCCAACGAAGACTCCGAAAGGCACCGCACCAGAGATCTTATTTGCGATGCCTTGGGCAAATGTAAGCATAAGATAGCCAGTGCTGTGGGAAAGGCTAAAGTAATGGTTTCGGAGACGGCGCAGGAGGCCCACGACGTTGGAGAGGCTGTTGCTGGTGCTTTCGATGAAGCCAAAGAGACAGTTTCAGACAAATCTCACCACGTGGGAACGTCGTTCTCAGAGAAAGGGCATCGATTGAGGGAGTCGGTTGAGAAAGCCAGAGAGGATGCCGACGAGTTCCTGGAGAAAACAAAAGAGACGGTTTTGGAGAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGAGTTGAGAGAGGGTGCAATGGAGAAAGGGAGAGAAGCAAGACAAACTGCGGAGAAGATTAAAACTGGTGGAAACAAGGTTAAGGAGAATCTGATCGGTATTCCTGGTGGAGGATTAAAGCTAGTGAATGATTCCTTCAGGTACTTGGGGTCGTTAGAGTCGTGGAAAGCGGCGATGGATGTGTTGAGTCTGTTGGGATTTAGTATGGCTTTGGGAATGGGCGTGTGGACTACCTTCATCTCTAGCTATGTGCTGGCGAGTGCGCTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCCCCTGTATTTTAGGGCCATGGCTTCCAGCATTGGGATGGCCCTATTCGGGCATCTATTCAGCCGCACAAAATGGATGTTTCCAATTCCGAAAAATGCTGAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGACCATTTTTGCCAATTCTCTCTACATGGAGCCTCAAGCCACCAAGGTAATGTTTGAGAGATTAAAAGTGGAAAAGGAAGAAGGAAAAGGAATTGAAGACATAGCCGCTGAACCTCGGCACGCCAATGATAACCCCCCAGCAATCACAACCAGCACAGCCACACAAGTCGTAGACCGAGAGGCGGTGAAGTCCAGAATCGTGGGGTTGAATAAGAGGCTGAAGAAGCTGAATTCGTATTCATCCTTGTTAAACCTGCTCACTCTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTGTGCATCCCCTGCTAG
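
One way to see how the CDS sits inside the mRNA is to locate it as a substring and read off the flanking UTR lengths. This is a minimal sketch, assuming the CDS occurs as one contiguous stretch of the mRNA; `utr_lengths` is a hypothetical helper, and the two strings are short excerpts from the start of the mRNA and CDS records, so only the 5' value is meaningful here.

```python
def utr_lengths(mrna, cds):
    """Return (five_prime_utr_len, three_prime_utr_len) by locating the CDS in the mRNA."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS is not a contiguous substring of the mRNA")
    return start, len(mrna) - start - len(cds)

# Excerpts only: the first 30 nt of the mRNA and the first 27 nt of the CDS.
mrna_head = "TATATGTACTACGTGTCACTTCTGGTTGTG"
cds_head = "ATGTACTACGTGTCACTTCTGGTTGTG"

five_prime, three_prime = utr_lengths(mrna_head, cds_head)
print(five_prime)
```

Passing the full mRNA and CDS strings instead of the excerpts would give both UTR lengths.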

Protein sequence

MYYVSLLVVQYLPGCLINKCLLLLFDTFNFISMPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSISSEPTKDASPSNPLHDSLNVGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGGNKVKENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQVVDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC
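
The protein sequence follows from the CDS by reading it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper, and the inputs are short excerpts from the two ends of the CDS above rather than the full sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTACTACGTGTCACTTCTGGTTGTGCAG"  # first 30 nt of the CDS
cds_end = "CTGTGCATCCCCTGCTAG"                # last 18 nt of the CDS
print(translate(cds_start), translate(cds_end))
```

The two excerpts translate to the first ten and last five residues of the protein sequence listed above.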
BLAST of CmaCh15G000020 vs. Swiss-Prot
Match: TM205_MOUSE (Transmembrane protein 205 OS=Mus musculus GN=Tmem205 PE=1 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 2.7e-08
Identity = 43/120 (35.83%), Positives = 62/120 (51.67%), Query Frame = 1

Query: 296 VLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGH 355
           V+ LL  S A GM VW TFIS ++L  +LPR    +VQSK++P+YF     S+G A    
Sbjct: 13  VIHLLVLSGAWGMQVWVTFISGFLLFRSLPRHTFGLVQSKVFPVYFHV---SLGCAFINL 72

Query: 356 LFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIE 415
                +  +      EV Q  +L+ +L     N+ ++E + T VM     +EKE G G E
Sbjct: 73  CILAPQRAWIHLTLWEVSQLSLLLLSLTLATINARWLEARTTAVMRALQSIEKERGLGTE 129
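
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a statistics line; `hsp_percentages` is a hypothetical helper, and the regular expression is written to tolerate spelling variants of the "Positives" label.

```python
import re

def hsp_percentages(stats_line):
    """Recompute (identity %, positives %) from the counts in an HSP statistics line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", stats_line)
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", stats_line)
    if ident is None or posit is None:
        raise ValueError("line does not look like an HSP statistics line")

    def pct(match):
        # matches / alignment length, as a percentage rounded to two decimals
        return round(100 * int(match.group(1)) / int(match.group(2)), 2)

    return pct(ident), pct(posit)

line = "Identity = 43/120 (35.83%), Positives = 62/120 (51.67%), Query Frame = 1"
print(hsp_percentages(line))
```

For the TM205_MOUSE hit this reproduces the reported 35.83% identity and 51.67% positives over 120 aligned positions.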

BLAST of CmaCh15G000020 vs. Swiss-Prot
Match: TM205_BOVIN (Transmembrane protein 205 OS=Bos taurus GN=TMEM205 PE=2 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 7.9e-08
Identity = 42/120 (35.00%), Positives = 61/120 (50.83%), Query Frame = 1

Query: 296 VLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGH 355
           V+ LL  S A GM +W TFIS +VL   LPR    +VQSK++P YF     S+G A    
Sbjct: 13  VVHLLVLSGAWGMQMWVTFISGFVLFRGLPRHTFGLVQSKLFPFYFHI---SMGCAFVNL 72

Query: 356 LFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIE 415
               ++  +      E  Q ++L+ +L     N+ ++E + T  M+    VEKE G G E
Sbjct: 73  CILASQCSWAQLTFWEASQLFLLLLSLTLATINARWLESRTTAAMWALQTVEKERGLGGE 129

BLAST of CmaCh15G000020 vs. TrEMBL
Match: A0A0A0KSX1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G650480 PE=4 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 8.5e-126
Identity = 270/450 (60.00%), Positives = 314/450 (69.78%), Query Frame = 1

Query: 41  LVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSISSEPTKDAS 100
           L++T+ TAAGLWSP P       Q+VIVKEGHR+VVVEY DQGQHNTKVSISSEP +DA 
Sbjct: 3   LIVTTFTAAGLWSPPPPP-----QNVIVKEGHRMVVVEYDDQGQHNTKVSISSEPDQDA- 62

Query: 101 PSNPLHDSLNVGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVG 160
                          ++SERHRT+DLICD  GKCKHK+ASAV KAKVMV+ETAQEAHDVG
Sbjct: 63  ---------------KNSERHRTKDLICDVYGKCKHKVASAVEKAKVMVTETAQEAHDVG 122

Query: 161 EAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARD 220
           E+V  AFD AK+ + + +              +E++E A+   ++ ++  +    E    
Sbjct: 123 ESVTDAFDGAKDKLKEGA--------------KETLEMAKSREEKVVKGAERVAKETGEK 182

Query: 221 LKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGGNKVKENLIGIPGGGLKLVNDS 280
           +K G      E K +E   G +++G                             K+++  
Sbjct: 183 IKTG------ENKLKENLMGLVDRG----------------------------FKVIDYL 242

Query: 281 FRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLY 340
           FR+LG        MD L LLGF+MALGMGVW TFISSYVLAS LPRQQL VVQSKIYP+Y
Sbjct: 243 FRHLGF------GMDALGLLGFTMALGMGVWVTFISSYVLASVLPRQQLGVVQSKIYPVY 302

Query: 341 FRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQATKVM 400
           F+AMAS IGMAL GHLFSRT+W FPIPKN+EVVQGYVLVAALL IFANSLYMEP+ATKVM
Sbjct: 303 FKAMASCIGMALLGHLFSRTEWTFPIPKNSEVVQGYVLVAALLMIFANSLYMEPRATKVM 362

Query: 401 FERLKVEKEEGKGIEDIAAEPR-HANDNPPAITTSTATQVVDREAVKSRIVGLNKRLKKL 460
           FERLK+EKEEG+GIEDIA E   +  DN PAIT+ST TQVVDRE VKSRIVGLNKRLKKL
Sbjct: 363 FERLKIEKEEGRGIEDIAREETGNVIDNSPAITSSTPTQVVDREVVKSRIVGLNKRLKKL 377

Query: 461 NSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
           NSYSSLLNLLTLMALTWHLVYLSQRLC PC
Sbjct: 423 NSYSSLLNLLTLMALTWHLVYLSQRLCNPC 377

BLAST of CmaCh15G000020 vs. TrEMBL
Match: A0A0S3SEX2_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.06G271300 PE=4 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 1.9e-77
Identity = 204/470 (43.40%), Positives = 272/470 (57.87%), Query Frame = 1

Query: 33  MPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 92
           M N+FA+ LVITSL A+ + SP+PA+      + IV+EGHRVVVVEY   G HNTK+SIS
Sbjct: 22  MLNVFAVSLVITSLAASAILSPAPATHQKQGANTIVREGHRVVVVEYDQDGHHNTKISIS 81

Query: 93  SE-PTKDASPSNPLHDSL--------NVGI----PNEDSERHRTRDLICDALGKCKHKIA 152
            E PT      +   D +        NVG     P + +  H  ++L+CDA G+CK +IA
Sbjct: 82  PEQPTHHHQVFDNAKDGIREAASVLPNVGQGISQPEDAAFLHAPKELVCDAYGRCKQRIA 141

Query: 153 SAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKA 212
            A+ K K                     D+A+E +  K   V  +  E   R+ +SV  A
Sbjct: 142 DAMEKTK---------------------DKAQEALQKKKEKVAAN-KEAARRVGDSVADA 201

Query: 213 REDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGG 272
                  L KTKE+V +KARD+ E A+D ++  K     E       EAR +  ++K   
Sbjct: 202 -------LGKTKESVHDKARDVHEYAQDTVETAK-----EHVAHNISEARDSLRRLKHA- 261

Query: 273 NKVKENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYV 332
                            +  SF   GSLES  + M V +LLGF+ A GM VW TFISSYV
Sbjct: 262 -----------------LKSSF---GSLESLNSVMGVANLLGFATAYGMCVWVTFISSYV 321

Query: 333 LASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLV 392
            + A+ R Q AVVQSKIYP+YFRAMA S+G+AL GH+F  T  +  +   +  +Q Y L+
Sbjct: 322 QSRAMARHQFAVVQSKIYPVYFRAMAYSVGVALVGHVFGNTNTL--LSNKSHALQAYNLL 381

Query: 393 AALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQV 452
           A+L T+F NSLY+EP+ATK+MFER+K+EKEEG+G  DI+ E            +S+A   
Sbjct: 382 ASLATLFFNSLYLEPRATKLMFERIKIEKEEGRGRVDISGERGRTEHQRTGEPSSSA--- 430

Query: 453 VDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
            D++AV+SRI+ LN +LKKLNSYSSLLN+L LM+LTWHLVYL+QRL  PC
Sbjct: 442 -DQDAVRSRIIKLNDKLKKLNSYSSLLNILNLMSLTWHLVYLAQRLHTPC 430

BLAST of CmaCh15G000020 vs. TrEMBL
Match: A0A0L9V8B7_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan08g215700 PE=4 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 1.9e-77
Identity = 204/470 (43.40%), Positives = 272/470 (57.87%), Query Frame = 1

Query: 33  MPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 92
           M N+FA+ LVITSL A+ + SP+PA+      + IV+EGHRVVVVEY   G HNTK+SIS
Sbjct: 1   MLNVFAVSLVITSLAASAILSPAPATHQKQGANTIVREGHRVVVVEYDQDGHHNTKISIS 60

Query: 93  SE-PTKDASPSNPLHDSL--------NVGI----PNEDSERHRTRDLICDALGKCKHKIA 152
            E PT      +   D +        NVG     P + +  H  ++L+CDA G+CK +IA
Sbjct: 61  PEQPTHHHQVFDNAKDGIREAASVLPNVGQGISQPEDAAFLHAPKELVCDAYGRCKQRIA 120

Query: 153 SAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKA 212
            A+ K K                     D+A+E +  K   V  +  E   R+ +SV  A
Sbjct: 121 DAMEKTK---------------------DKAQEALQKKKEKVAAN-KEAARRVGDSVADA 180

Query: 213 REDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGG 272
                  L KTKE+V +KARD+ E A+D ++  K     E       EAR +  ++K   
Sbjct: 181 -------LGKTKESVHDKARDVHEYAQDTVETAK-----EHVAHNISEARDSLRRLKHA- 240

Query: 273 NKVKENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYV 332
                            +  SF   GSLES  + M V +LLGF+ A GM VW TFISSYV
Sbjct: 241 -----------------LKSSF---GSLESLNSVMGVANLLGFATAYGMCVWVTFISSYV 300

Query: 333 LASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLV 392
            + A+ R Q AVVQSKIYP+YFRAMA S+G+AL GH+F  T  +  +   +  +Q Y L+
Sbjct: 301 QSRAMARHQFAVVQSKIYPVYFRAMAYSVGVALVGHVFGNTNTL--LSNKSHALQAYNLL 360

Query: 393 AALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQV 452
           A+L T+F NSLY+EP+ATK+MFER+K+EKEEG+G  DI+ E            +S+A   
Sbjct: 361 ASLATLFFNSLYLEPRATKLMFERIKIEKEEGRGRVDISGERGRTEHQRTGEPSSSA--- 409

Query: 453 VDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
            D++AV+SRI+ LN +LKKLNSYSSLLN+L LM+LTWHLVYL+QRL  PC
Sbjct: 421 -DQDAVRSRIIKLNDKLKKLNSYSSLLNILNLMSLTWHLVYLAQRLHTPC 409

BLAST of CmaCh15G000020 vs. TrEMBL
Match: A0A0D2SDT7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G138700 PE=4 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 9.6e-77
Identity = 202/486 (41.56%), Positives = 263/486 (54.12%), Query Frame = 1

Query: 65  DVIVKEGHRVVVVEYGDQGQHNTKVSISS---------------EPTKDASPSNPLHDSL 124
           DVI+KEGHRV+VVEY   G+HNTKVSISS               E  KDA+ + P +   
Sbjct: 25  DVILKEGHRVIVVEYDQDGKHNTKVSISSPSLHQQTDQGEYFGKETMKDAASALP-NVGH 84

Query: 125 NVGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFDE 184
            +      S RH   +LICDA GKC  ++A+A+GKAK  VS+TA EA+ + +A +G   E
Sbjct: 85  GISQGKAGSGRHSPGELICDAFGKCTQRVATALGKAKDKVSDTAHEANKLKQAASGTAHE 144

Query: 185 AKETVSDK----SHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEGA 244
           AKE   DK    +  V    SE  H  R+ V   +    + L K K  V++K +D+KE A
Sbjct: 145 AKEKAKDKAWETAQEVREKVSESAHETRDKVADKKGAIGDALGKAKGAVVQKGQDVKERA 204

Query: 245 KDVLKEGK--------------------ARELREGAMEKG-REARQTAEKIKTGGNKVKE 304
           K+ + + K                      E  E   EK   EA + A K+KT  NK   
Sbjct: 205 KESIDKAKEAATTAKDTAKTMGADIVTNTSEQVENVQEKAMEEAGRAANKVKTSANKYL- 264

Query: 305 NLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLASAL 364
                         D  +Y+ S+E+    M +++LLG + A GM VW TFISSY+LA  L
Sbjct: 265 --------------DGLKYMTSMEALNTVMGIVNLLGLATAYGMSVWVTFISSYILAGQL 324

Query: 365 PRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLT 424
           PRQQ  VVQSKIYP+YFRAMA SIGMAL GHL    K     P   EV Q   L+++L  
Sbjct: 325 PRQQFGVVQSKIYPVYFRAMAYSIGMALLGHLLWHRKRSISSP--PEVFQAINLLSSLFM 384

Query: 425 IFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNP---------------- 484
           +  N LY+EP+ATKVMFER+K+EKE+G+G  D  AE   A ++P                
Sbjct: 385 VLVNGLYLEPKATKVMFERMKMEKEDGRGRHDFVAEGSRATESPSVADPVAKNSRKGPST 444

Query: 485 -----PAITTSTATQVVDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQ 490
                PA   + A    ++E +K  +  LN+RLKKLN+ SS+LN+LTLMALTWHLVYL Q
Sbjct: 445 APAPAPAPAPAVAPTSSEQEVIKRTMGRLNERLKKLNTNSSMLNILTLMALTWHLVYLGQ 492

BLAST of CmaCh15G000020 vs. TrEMBL
Match: W9RNN9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008289 PE=4 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 1.4e-75
Identity = 209/492 (42.48%), Positives = 290/492 (58.94%), Query Frame = 1

Query: 33  MPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 92
           M N+ +LCLV+TSL  AG+ SP+P  + ++  D+IVKEGHRVVVVEY  +GQ  TKVSIS
Sbjct: 1   MMNVVSLCLVVTSLVTAGVLSPTP-KKQNNGDDLIVKEGHRVVVVEYDQEGQPITKVSIS 60

Query: 93  SE-PTKDASPSNPLHDSLNVGIPN-----------------EDSERHRTRDLICDALGKC 152
            E  T+    S+ L ++ +V +PN                  D E    ++LICDA GKC
Sbjct: 61  PEDKTRQRFNSDKLKEAASV-LPNLGQGLSTPKADGGGEGEGDGEWRSPKELICDAYGKC 120

Query: 153 KHKIASAVGKAKVMVSE----TAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGH 212
           KHKI  A+G+ K  VSE     A++  ++ E      ++AKE VS+K+        E GH
Sbjct: 121 KHKIVDAIGRTKEAVSEKAHDVAEKTKEMKEKAEDVVEKAKEAVSEKARGFAEKTRETGH 180

Query: 213 RLRESVEKAREDADEFLEKTKETVLEKARDLKEGA---KDVLKE--GKARE-LREGAMEK 272
             +++ E+    A  F+EKTKE   E     K+ A   ++  +E  GKA+E +R  A E 
Sbjct: 181 EAQDATER---KAHVFIEKTKEAAHEAVEKKKKAAYRMEEAAEESYGKAKEAVRNKAQEV 240

Query: 273 GREARQTAEKI-----------KTGGNKVKENLIGIPGGGLKLVNDSFRYLGSLESWKAA 332
             +AR+ AEK            KT    V  N+        K    +FR L + ++    
Sbjct: 241 EGQARERAEKTWEAAKDAKDVGKTFVKDVASNVTKFAATFRKQAGATFRELVTGKALNPV 300

Query: 333 MDVLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALF 392
           + V+ L+ FS A G  VW TFI SYVLA ALPRQQ  VVQSKIYP+YFR MA  IG A+ 
Sbjct: 301 VGVVYLVTFSTAYGTAVWETFILSYVLAGALPRQQFGVVQSKIYPVYFRTMAWGIGTAVL 360

Query: 393 GHLFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKVEKEEGKG 452
           G L +     F     AE  Q + L+A+L+ +F N LY+EP+ATKVMFER++VEKEEG+G
Sbjct: 361 GLLVTGRGKAF--SSMAEKFQIFNLLASLVLVFVNMLYLEPRATKVMFERMRVEKEEGRG 420

Query: 453 IEDIAAEPRHANDNPPAITTSTATQVVDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMA 486
            E++ AE   A +  PA+++  A +  ++EAV++RI+ LN RLKKLN++SS LN+L+LM+
Sbjct: 421 REELPAEQPSAAE--PAVSSMPA-ETAEQEAVRNRILSLNGRLKKLNTWSSFLNILSLMS 480

BLAST of CmaCh15G000020 vs. TAIR10
Match: AT1G72100.1 (AT1G72100.1 late embryogenesis abundant domain-containing protein / LEA domain-containing protein)

HSP 1 Score: 244.2 bits (622), Expect = 1.7e-64
Identity = 186/512 (36.33%), Positives = 271/512 (52.93%), Query Frame = 1

Query: 33  MPNLFALCLVITSLTAAGLWSPSPA-----SRLDHEQDVIVKEGHRVVVVEY-------- 92
           M NL ALCLV+++L AA +WSPSPA     + +  E +VIVK+GH VVVVEY        
Sbjct: 1   MTNLLALCLVLSTLLAAEVWSPSPAMTTHNTAVASEGEVIVKDGHHVVVVEYDRDGKTNT 60

Query: 93  -----------GDQGQHNTKVSIS-----SEPTKDASPSNPLHDSLNVGIP---NEDSER 152
                      G++ ++  ++  S      E  K+ +   P H    +  P   +E  + 
Sbjct: 61  RVSISPPSADQGEEKENEVEMGTSMFRNVKEKAKETASYLP-HVGQGISQPVMTDEARDH 120

Query: 153 HRTR-DLICDALGKCKHKIASAVGKAKVM----VSETAQE--------AHDVGEAVAGAF 212
           H T  ++ICDA GKC+ KIAS VG+AK      V ETA +        AHDV E V  A 
Sbjct: 121 HATAGEVICDAFGKCRQKIASVVGRAKDRTVDSVGETASDVREAAAHKAHDVKETVTHAA 180

Query: 213 DEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFL----EKTKETVLEKARDLKE 272
            + ++TV+D++ +     +EK H  +E V     DA E +       KE+V +KA D KE
Sbjct: 181 RDVEDTVADQAQYAKGRVTEKAHDPKEGVAHKAHDAKESVADKAHDAKESVAQKAHDAKE 240

Query: 273 GAKDVLKEGKARELREGAMEKGREARQTA-EKIKTGGNKVKENLIGIPGGGLKLVNDSFR 332
             ++     KA +++E   +K  E+++ A ++++    ++KE          + V +  R
Sbjct: 241 KVRE-----KAHDVKETVAQKAHESKERAKDRVREKAQELKETATHKSKNAWERVKNGAR 300

Query: 333 YLGS-----LESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIY 392
             GS     L   K A  ++ L G + A G  VW TF+SSYVLAS L RQQ  VVQSK+Y
Sbjct: 301 EFGSATAATLSPTKVA-SIVGLTGIAAAFGTSVWVTFVSSYVLASVLGRQQFGVVQSKLY 360

Query: 393 PLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQAT 452
           P+YF+A +  I + LFGH+ SR + +  +    E+ QG  L+++   I AN  ++EP+AT
Sbjct: 361 PVYFKATSVGILVGLFGHVLSRRRKL--LTDATEMWQGVNLLSSFFMIEANKSFVEPRAT 420

Query: 453 KVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQVVDREAVKSRIVGLNKRLK 490
           K MFER+K EKEEG+G E                           + ++ ++  L++RL 
Sbjct: 421 KAMFERMKAEKEEGRGGER-----------------------TSEQELRRKLEQLSERLS 480

BLAST of CmaCh15G000020 vs. TAIR10
Match: AT1G22600.1 (AT1G22600.1 Late embryogenesis abundant protein (LEA) family protein)

HSP 1 Score: 148.7 bits (374), Expect = 9.5e-36
Identity = 114/350 (32.57%), Positives = 188/350 (53.71%), Query Frame = 1

Query: 151 ETAQEAHDVGEAVAGAFDEAKETVS------DKSHHVGTSFSEKGHRLRESVEKAREDAD 210
           E  Q++ D GE      +E +ET S      ++ HH     +  G  + +++ K +    
Sbjct: 58  EVDQKSRDEGEVFG---NEKRETASSLPEEEEREHH-----ATPGELICDAIGKCKHKLG 117

Query: 211 EFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREAR-QTAEKIKTGGNKVK 270
             L + K+     A DL +   ++    +A E+ E    K REAR +  E+     ++V+
Sbjct: 118 TVLGRVKDRT---ASDLSDETPEMTVAREALEVEEKVSWKAREARGKVNERATKKAHRVQ 177

Query: 271 ENLIGIPGGGLKLVNDSFRYLGSLESWKAAM----DVLSLLGFSMALGMGVWTTFISSYV 330
           + L        + V  + R +G++ +    +     V+ ++G + A GM VW TF+S YV
Sbjct: 178 KVL--------EKVQIAVRGIGTVVATALGLTKIGSVVGIVGIAAAYGMCVWVTFVSGYV 237

Query: 331 LASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLV 390
           LAS L  QQ  VVQSK+YP+YF+A++  I + L GH+  R + +F      ++ Q   L+
Sbjct: 238 LASVLGEQQFGVVQSKMYPVYFKAVSVGILVGLLGHVIGRRRKVF--TDAVDMWQSVNLL 297

Query: 391 AALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQV 450
           +++L + AN+ ++  +ATK MFE +K EKE+G+G  D + + + +            T+ 
Sbjct: 298 SSILMVEANASFVYTRATKAMFELIKAEKEDGRGF-DTSDQSQSSESAGRTRGKKKVTEK 357

Query: 451 VDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
            D + VK R+  L++R++KLN+YSS LNLLTLM+LTWH VYL  RL + C
Sbjct: 358 TDEDVVKQRLTKLSERMRKLNAYSSRLNLLTLMSLTWHFVYLGYRLSLTC 385

BLAST of CmaCh15G000020 vs. TAIR10
Match: AT3G62580.1 (AT3G62580.1 Late embryogenesis abundant protein (LEA) family protein)

HSP 1 Score: 61.2 bits (147), Expect = 2.0e-09
Identity = 55/193 (28.50%), Positives = 89/193 (46.11%), Query Frame = 1

Query: 299 LLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMAS--SIGMALFGHL 358
           LL F+ A G  +W TFI   ++   LPR Q   +QSK++P YF  + S  +I ++ FG+L
Sbjct: 46  LLSFATAWGAALWATFIGGIIMFKNLPRHQFGNLQSKLFPAYFTLVGSCCAISLSAFGYL 105

Query: 359 FSRTKWMFPIPKNAEVVQGYVLVAALLTIFA----NSLYMEPQATKVMFERLKVEKEEGK 418
                W     K++  V+ Y  +  LL+ FA    N     P    +M +R KVE+E   
Sbjct: 106 HP---W-----KSSSTVEKYQ-IGFLLSAFAFNLTNLFVFTPMTIDMMKQRHKVERENNI 165

Query: 419 GIEDIAAEPRHANDNPPAITTSTATQVVDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLM 478
           G E   ++ R    + P                  ++  +NK+   ++  SSL N+ +  
Sbjct: 166 GDEVGWSKNREKAKSIP------------------KLAAMNKKFGMIHGLSSLANIFSFG 211

Query: 479 ALTWHLVYLSQRL 486
           +L  H  YL+ +L
Sbjct: 226 SLAMHSWYLAGKL 211

BLAST of CmaCh15G000020 vs. NCBI nr
Match: gi|659119380|ref|XP_008459625.1| (PREDICTED: uncharacterized protein LOC103498695 [Cucumis melo])

HSP 1 Score: 530.8 bits (1366), Expect = 2.5e-147
Identity = 309/469 (65.88%), Positives = 356/469 (75.91%), Query Frame = 1

Query: 24  LFDTFNFISMPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQG 83
           LF T +F +M NLFA+ L+IT+LTAAGLWSP P       Q+VIVKEGHRVVVVEY DQG
Sbjct: 9   LFTTLSF-NMTNLFAMFLIITTLTAAGLWSPPPPP-----QNVIVKEGHRVVVVEYDDQG 68

Query: 84  QHNTKVSISSEPTKDASPSNPLHDSLNVGIPNEDSERHRTRDLICDALGKCKHKIASAVG 143
           QHNTKVSISSEP  DA                ++SERHRT+DLICD  GKCKHK+ASAV 
Sbjct: 69  QHNTKVSISSEPDLDA----------------KNSERHRTKDLICDVYGKCKHKVASAVE 128

Query: 144 KAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDA 203
           KAKVMV+ETAQEAHDVGE+VAGAFDEAK+ + + +     +F E   +L+E  + A+E  
Sbjct: 129 KAKVMVTETAQEAHDVGESVAGAFDEAKDKLKEGAKE---TFGEAKDKLKEGAKGAKETF 188

Query: 204 DEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGRE--ARQTAEKIKTGGNK 263
            E  +K           LKEGAK+ L+  K+RE +   + KG E  A++T EKI+TG NK
Sbjct: 189 GEAKDK-----------LKEGAKETLEMAKSREEK---VVKGAERVAKETGEKIQTGENK 248

Query: 264 VKENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLA 323
           +KENL+G+   G K++N  FR+LG        MD L LLGF+MALGMGVW TFISSYVLA
Sbjct: 249 LKENLMGLVDRGFKVMNYLFRHLG------VGMDALGLLGFAMALGMGVWVTFISSYVLA 308

Query: 324 SALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAA 383
           S LPRQQL VVQSKIYP+YF+AMAS IGMAL GHLFSRT+W FPIPKN+EVVQGYVLVAA
Sbjct: 309 SVLPRQQLGVVQSKIYPVYFKAMASCIGMALLGHLFSRTEWKFPIPKNSEVVQGYVLVAA 368

Query: 384 LLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPR-HANDNPPAITTSTATQVV 443
           LL IFANSLYMEP+ATKVMFERLK+EKEEG+GIEDI  E   +  DN PAIT+ST TQ+V
Sbjct: 369 LLMIFANSLYMEPRATKVMFERLKIEKEEGRGIEDIGREETVNVIDNSPAITSSTPTQIV 428

Query: 444 DREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
           DRE VKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLC PC
Sbjct: 429 DREVVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCNPC 432

BLAST of CmaCh15G000020 vs. NCBI nr
Match: gi|778707978|ref|XP_004141640.2| (PREDICTED: uncharacterized protein LOC101208468 [Cucumis sativus])

HSP 1 Score: 469.5 bits (1207), Expect = 6.9e-129
Identity = 279/467 (59.74%), Positives = 325/467 (69.59%), Query Frame = 1

Query: 24  LFDTFNFISMPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQG 83
           LF T  F +M NLFA+ L++T+ TAAGLWSP P       Q+VIVKEGHR+VVVEY DQG
Sbjct: 9   LFSTLCF-NMTNLFAMFLIVTTFTAAGLWSPPPPP-----QNVIVKEGHRMVVVEYDDQG 68

Query: 84  QHNTKVSISSEPTKDASPSNPLHDSLNVGIPNEDSERHRTRDLICDALGKCKHKIASAVG 143
           QHNTKVSISSEP +DA                ++SERHRT+DLICD  GKCKHK+ASAV 
Sbjct: 69  QHNTKVSISSEPDQDA----------------KNSERHRTKDLICDVYGKCKHKVASAVE 128

Query: 144 KAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDA 203
           KAKVMV+ETAQEAHDVGE+V  AFD AK+ + + +              +E++E A+   
Sbjct: 129 KAKVMVTETAQEAHDVGESVTDAFDGAKDKLKEGA--------------KETLEMAKSRE 188

Query: 204 DEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGGNKVK 263
           ++ ++  +    E    +K G      E K +E   G +++G                  
Sbjct: 189 EKVVKGAERVAKETGEKIKTG------ENKLKENLMGLVDRG------------------ 248

Query: 264 ENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLASA 323
                      K+++  FR+LG        MD L LLGF+MALGMGVW TFISSYVLAS 
Sbjct: 249 ----------FKVIDYLFRHLGF------GMDALGLLGFTMALGMGVWVTFISSYVLASV 308

Query: 324 LPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALL 383
           LPRQQL VVQSKIYP+YF+AMAS IGMAL GHLFSRT+W FPIPKN+EVVQGYVLVAALL
Sbjct: 309 LPRQQLGVVQSKIYPVYFKAMASCIGMALLGHLFSRTEWTFPIPKNSEVVQGYVLVAALL 368

Query: 384 TIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPR-HANDNPPAITTSTATQVVDR 443
            IFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA E   +  DN PAIT+ST TQVVDR
Sbjct: 369 MIFANSLYMEPRATKVMFERLKIEKEEGRGIEDIAREETGNVIDNSPAITSSTPTQVVDR 399

Query: 444 EAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
           E VKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLC PC
Sbjct: 429 EVVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCNPC 399

BLAST of CmaCh15G000020 vs. NCBI nr
Match: gi|700197506|gb|KGN52683.1| (hypothetical protein Csa_5G650480 [Cucumis sativus])

HSP 1 Score: 458.8 bits (1179), Expect = 1.2e-125
Identity = 270/450 (60.00%), Positives = 314/450 (69.78%), Query Frame = 1

Query: 41  LVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSISSEPTKDAS 100
           L++T+ TAAGLWSP P       Q+VIVKEGHR+VVVEY DQGQHNTKVSISSEP +DA 
Sbjct: 3   LIVTTFTAAGLWSPPPPP-----QNVIVKEGHRMVVVEYDDQGQHNTKVSISSEPDQDA- 62

Query: 101 PSNPLHDSLNVGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVG 160
                          ++SERHRT+DLICD  GKCKHK+ASAV KAKVMV+ETAQEAHDVG
Sbjct: 63  ---------------KNSERHRTKDLICDVYGKCKHKVASAVEKAKVMVTETAQEAHDVG 122

Query: 161 EAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARD 220
           E+V  AFD AK+ + + +              +E++E A+   ++ ++  +    E    
Sbjct: 123 ESVTDAFDGAKDKLKEGA--------------KETLEMAKSREEKVVKGAERVAKETGEK 182

Query: 221 LKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGGNKVKENLIGIPGGGLKLVNDS 280
           +K G      E K +E   G +++G                             K+++  
Sbjct: 183 IKTG------ENKLKENLMGLVDRG----------------------------FKVIDYL 242

Query: 281 FRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLY 340
           FR+LG        MD L LLGF+MALGMGVW TFISSYVLAS LPRQQL VVQSKIYP+Y
Sbjct: 243 FRHLGF------GMDALGLLGFTMALGMGVWVTFISSYVLASVLPRQQLGVVQSKIYPVY 302

Query: 341 FRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLTIFANSLYMEPQATKVM 400
           F+AMAS IGMAL GHLFSRT+W FPIPKN+EVVQGYVLVAALL IFANSLYMEP+ATKVM
Sbjct: 303 FKAMASCIGMALLGHLFSRTEWTFPIPKNSEVVQGYVLVAALLMIFANSLYMEPRATKVM 362

Query: 401 FERLKVEKEEGKGIEDIAAEPR-HANDNPPAITTSTATQVVDREAVKSRIVGLNKRLKKL 460
           FERLK+EKEEG+GIEDIA E   +  DN PAIT+ST TQVVDRE VKSRIVGLNKRLKKL
Sbjct: 363 FERLKIEKEEGRGIEDIAREETGNVIDNSPAITSSTPTQVVDREVVKSRIVGLNKRLKKL 377

Query: 461 NSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
           NSYSSLLNLLTLMALTWHLVYLSQRLC PC
Sbjct: 423 NSYSSLLNLLTLMALTWHLVYLSQRLCNPC 377

BLAST of CmaCh15G000020 vs. NCBI nr
Match: gi|965604213|dbj|BAT91390.1| (hypothetical protein VIGAN_06271300 [Vigna angularis var. angularis])

HSP 1 Score: 298.1 bits (762), Expect = 2.8e-77
Identity = 204/470 (43.40%), Positives = 272/470 (57.87%), Query Frame = 1

Query: 33  MPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 92
           M N+FA+ LVITSL A+ + SP+PA+      + IV+EGHRVVVVEY   G HNTK+SIS
Sbjct: 22  MLNVFAVSLVITSLAASAILSPAPATHQKQGANTIVREGHRVVVVEYDQDGHHNTKISIS 81

Query: 93  SE-PTKDASPSNPLHDSL--------NVGI----PNEDSERHRTRDLICDALGKCKHKIA 152
            E PT      +   D +        NVG     P + +  H  ++L+CDA G+CK +IA
Sbjct: 82  PEQPTHHHQVFDNAKDGIREAASVLPNVGQGISQPEDAAFLHAPKELVCDAYGRCKQRIA 141

Query: 153 SAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKA 212
            A+ K K                     D+A+E +  K   V  +  E   R+ +SV  A
Sbjct: 142 DAMEKTK---------------------DKAQEALQKKKEKVAAN-KEAARRVGDSVADA 201

Query: 213 REDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGG 272
                  L KTKE+V +KARD+ E A+D ++  K     E       EAR +  ++K   
Sbjct: 202 -------LGKTKESVHDKARDVHEYAQDTVETAK-----EHVAHNISEARDSLRRLKHA- 261

Query: 273 NKVKENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYV 332
                            +  SF   GSLES  + M V +LLGF+ A GM VW TFISSYV
Sbjct: 262 -----------------LKSSF---GSLESLNSVMGVANLLGFATAYGMCVWVTFISSYV 321

Query: 333 LASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLV 392
            + A+ R Q AVVQSKIYP+YFRAMA S+G+AL GH+F  T  +  +   +  +Q Y L+
Sbjct: 322 QSRAMARHQFAVVQSKIYPVYFRAMAYSVGVALVGHVFGNTNTL--LSNKSHALQAYNLL 381

Query: 393 AALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQV 452
           A+L T+F NSLY+EP+ATK+MFER+K+EKEEG+G  DI+ E            +S+A   
Sbjct: 382 ASLATLFFNSLYLEPRATKLMFERIKIEKEEGRGRVDISGERGRTEHQRTGEPSSSA--- 430

Query: 453 VDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
            D++AV+SRI+ LN +LKKLNSYSSLLN+L LM+LTWHLVYL+QRL  PC
Sbjct: 442 -DQDAVRSRIIKLNDKLKKLNSYSSLLNILNLMSLTWHLVYLAQRLHTPC 430

BLAST of CmaCh15G000020 vs. NCBI nr
Match: gi|920709333|gb|KOM51330.1| (hypothetical protein LR48_Vigan08g215700 [Vigna angularis])

HSP 1 Score: 298.1 bits (762), Expect = 2.8e-77
Identity = 204/470 (43.40%), Positives = 272/470 (57.87%), Query Frame = 1

Query: 33  MPNLFALCLVITSLTAAGLWSPSPASRLDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 92
           M N+FA+ LVITSL A+ + SP+PA+      + IV+EGHRVVVVEY   G HNTK+SIS
Sbjct: 1   MLNVFAVSLVITSLAASAILSPAPATHQKQGANTIVREGHRVVVVEYDQDGHHNTKISIS 60

Query: 93  SE-PTKDASPSNPLHDSL--------NVGI----PNEDSERHRTRDLICDALGKCKHKIA 152
            E PT      +   D +        NVG     P + +  H  ++L+CDA G+CK +IA
Sbjct: 61  PEQPTHHHQVFDNAKDGIREAASVLPNVGQGISQPEDAAFLHAPKELVCDAYGRCKQRIA 120

Query: 153 SAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKA 212
            A+ K K                     D+A+E +  K   V  +  E   R+ +SV  A
Sbjct: 121 DAMEKTK---------------------DKAQEALQKKKEKVAAN-KEAARRVGDSVADA 180

Query: 213 REDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARELREGAMEKGREARQTAEKIKTGG 272
                  L KTKE+V +KARD+ E A+D ++  K     E       EAR +  ++K   
Sbjct: 181 -------LGKTKESVHDKARDVHEYAQDTVETAK-----EHVAHNISEARDSLRRLKHA- 240

Query: 273 NKVKENLIGIPGGGLKLVNDSFRYLGSLESWKAAMDVLSLLGFSMALGMGVWTTFISSYV 332
                            +  SF   GSLES  + M V +LLGF+ A GM VW TFISSYV
Sbjct: 241 -----------------LKSSF---GSLESLNSVMGVANLLGFATAYGMCVWVTFISSYV 300

Query: 333 LASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLV 392
            + A+ R Q AVVQSKIYP+YFRAMA S+G+AL GH+F  T  +  +   +  +Q Y L+
Sbjct: 301 QSRAMARHQFAVVQSKIYPVYFRAMAYSVGVALVGHVFGNTNTL--LSNKSHALQAYNLL 360

Query: 393 AALLTIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRHANDNPPAITTSTATQV 452
           A+L T+F NSLY+EP+ATK+MFER+K+EKEEG+G  DI+ E            +S+A   
Sbjct: 361 ASLATLFFNSLYLEPRATKLMFERIKIEKEEGRGRVDISGERGRTEHQRTGEPSSSA--- 409

Query: 453 VDREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 490
            D++AV+SRI+ LN +LKKLNSYSSLLN+L LM+LTWHLVYL+QRL  PC
Sbjct: 421 -DQDAVRSRIIKLNDKLKKLNSYSSLLNILNLMSLTWHLVYLAQRLHTPC 409

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name    E-value   Identity (%)  Description
TM205_MOUSE   2.7e-08   35.83         Transmembrane protein 205 OS=Mus musculus GN=Tmem205 PE=1 SV=1
TM205_BOVIN   7.9e-08   35.00         Transmembrane protein 205 OS=Bos taurus GN=TMEM205 PE=2 SV=1

vs. TrEMBL
Match Name         E-value    Identity (%)  Description
A0A0A0KSX1_CUCSA   8.5e-126   60.00         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G650480 PE=4 SV=1
A0A0S3SEX2_PHAAN   1.9e-77    43.40         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.06G271300 PE=...
A0A0L9V8B7_PHAAN   1.9e-77    43.40         Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan08g215700 PE=4 SV=1
A0A0D2SDT7_GOSRA   9.6e-77    41.56         Uncharacterized protein OS=Gossypium raimondii GN=B456_013G138700 PE=4 SV=1
W9RNN9_9ROSA       1.4e-75    42.48         Uncharacterized protein OS=Morus notabilis GN=L484_008289 PE=4 SV=1

vs. TAIR10
Match Name    E-value   Identity (%)  Description
AT1G72100.1   1.7e-64   36.33         late embryogenesis abundant domain-containing protein / LEA domain-c...
AT1G22600.1   9.5e-36   32.57         Late embryogenesis abundant protein (LEA) family protein
AT3G62580.1   2.0e-09   28.50         Late embryogenesis abundant protein (LEA) family protein

vs. NCBI nr
Match Name                         E-value    Identity (%)  Description
gi|659119380|ref|XP_008459625.1|   2.5e-147   65.88         PREDICTED: uncharacterized protein LOC103498695 [Cucumis melo]
gi|778707978|ref|XP_004141640.2|   6.9e-129   59.74         PREDICTED: uncharacterized protein LOC101208468 [Cucumis sativus]
gi|700197506|gb|KGN52683.1|        1.2e-125   60.00         hypothetical protein Csa_5G650480 [Cucumis sativus]
gi|965604213|dbj|BAT91390.1|       2.8e-77    43.40         hypothetical protein VIGAN_06271300 [Vigna angularis var. angularis]
gi|920709333|gb|KOM51330.1|        2.8e-77    43.40         hypothetical protein LR48_Vigan08g215700 [Vigna angularis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR025423   DUF4149
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh15G000020.1   CmaCh15G000020.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description                      Source    Source Term      Source Description                                              Alignment
IPR025423   Domain of unknown function DUF4149   PFAM      PF13664          DUF4149                                                         coord: 301..403, score: 2.6
None        No IPR available                     unknown   Coil             Coil                                                            coord: 215..235
None        No IPR available                     PANTHER   PTHR23241        LATE EMBRYOGENESIS ABUNDANT PLANTS LEA-RELATED                  coord: 274..489, score: 4.2
None        No IPR available                     PANTHER   PTHR23241:SF50   LATE EMBRYOGENESIS ABUNDANT DOMAIN-CONTAINING PROTEIN-RELATED   coord: 274..489, score: 4.2
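
The `coord` ranges in the InterPro annotations can be used to cut the matching region out of the protein sequence. This is a minimal sketch, assuming the coordinates are 1-based and inclusive residue positions; `slice_by_coord` is a hypothetical helper, and the ten-residue string is only the start of the protein sequence, used here as a small demonstration input.

```python
import re

def slice_by_coord(protein, coord_text):
    """Return the residues covered by a 'coord: start..end' annotation."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", coord_text)
    if m is None:
        raise ValueError(f"no coordinate range found in {coord_text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    # Assumption: 1-based inclusive positions, converted to a Python slice.
    return protein[start - 1:end]

demo = "MYYVSLLVVQ"  # first ten residues of the protein sequence
print(slice_by_coord(demo, "coord: 3..5"))
```

Applied to the full protein sequence, `coord: 301..403` would return the 103 residues assigned to the DUF4149 (PF13664) match.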

The following gene(s) are orthologous to this gene:
Gene             Orthologue        Organism                      Block
CmaCh15G000020   Cla011951         Watermelon (97103) v1         cmawmB285
CmaCh15G000020   ClCG11G008550     Watermelon (Charleston Gray)  cmawcgB250
CmaCh15G000020   CmoCh15G000010    Cucurbita moschata (Rifu)     cmacmoB287
CmaCh15G000020   Cp4.1LG13g11760   Cucurbita pepo (Zucchini)     cmacpeB305
CmaCh15G000020   Carg27297         Silver-seed gourd             carcmaB0431
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None