Cla020714 (gene) Watermelon (97103) v1

Name: Cla020714
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7M3D7_ARALL); contains InterPro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr5 : 27443008 .. 27445287 (+)
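
The location value can be read as a sequence ID, a start, an end, and a strand. The following is a minimal sketch of parsing it and computing the feature length, assuming the coordinates are 1-based and inclusive (the record does not state this explicitly). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(text):
    """Split a value such as 'Chr5 : 27443008 .. 27445287 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Chr5 : 27443008 .. 27445287 (+)")
print(seqid, strand, span_length(start, end))  # Chr5 + 2280
```

Under the inclusive-coordinate assumption the feature spans 2280 bases on the forward strand of Chr5.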
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATGAATTGCGCGCAATAGCTTCCAAAATAAGCAATATAAGTCAGTTAAGACAGCTTCATGGGCATCTTGTTCTCAATTCCCTCCATTCTCACAACTACTGGGTCTCTCTGCTCCTCATTATCTGTACCCGTCTTCACGCTCATCCTGCGTATGCGGCTTCTATTTTTAACTCCTTGCCGTCCCCCGATGCTTCTATTTACAGTTGTATGCTCAAATATTACTCCCGCATGGGTGCGAACAATGAGGTGGTTTCCCTCTTCAGATGTATGCAGTCTTTAGATCTTAGGCCCCAGCCCTTTGTTTACATATATTTGATCAAGTTGGCTGGGAAATATGGCAATTTGTTCCATGCTTATGTCCTGAAGTTGGGTTTTGTTGATGACCACTTCATCCGTAATGCTATCTTGGATATGTATGCAAAATATGGCCAAGTTGATCTTGCGAGGAAGTTGTTTGAGCAAATGGCTGAAAGAACTTTAGCAGACTGGAATTCGATGATTTCTGGCTGTTGGAAGTCAGGAAATGAAACTGATGCGGTCATGCTGTTTAATATGATGCCTGCTAGGAATATTATTACATGGACTGCCATGGTTACTGGGTATGCCAAGATGGGGGACTTGGAGAGTGCTAGAAGGTATTTTGATGAGATGCCAGAGAGAAGTGTAGTCTCATGGAATGCAATGCTCTCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCAAATGCTGAAAGAAGGGATCACACCTGATGATACAACATGGGTTGCTACAATTTCATCATGCTCTTCCATCGGCAATACTACCCTTGCTGATTCAATTCTAAGAAAGATCAACCAAAAGCATAGCATTTTGAATAATTTTGTCAAGACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTTGAATTTGCTAGAACTATCTTTGACGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATATCATGATCTCAGCATATACGAGGGTGGGAAAACTGTCATTAGCTCGAGAGTTGTTTGATAATATGCCAAAAAGAGATGTTGTTTCGTGGAATTCGATGATAGCTGGTTATGCACAAAATGGAGAGGCAGCCAAGTCAATTGAGCTCTTTCAAGAAATGATTTCTTGTACGGACATACAGCCGGATGAGGTTACCGTAGCTAGTGTTTTGTCTGCCTGTGGACATATTGGGGCTCTAAAATTGAGTTACTGGGTTCTAGATATCGTTCAAGAGAAAAACATTAAGTTGGGGATCTCAGGATTCAATTCTTTAATATTCCTGTACTCTAAATGTGGATGTGTGGCAGATGCCCATAGGATATTCCAAACTATGGAGACAAAAGATGTTGTTTCTTTCAATACGTTGATTTCAGGATTTGCTGCTAATGGCCATGGGAAGGAAGCTGTCAAGTTAGCATTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATATACTGGTGTTTTGACTGCATGTAGCCATGCAGGGCTGCTGAAAGAAGGTAAAGACGTCTTTAAGTCAATTAAATCACCTACTGTGGACCATTATGCTTGCATGGTTGATTTATTAGGAAGAGCAGGTGAATTAGATGAAGCCAAAATATTGATTCAATCTATGCCGATGGAACCTCATGCTGGTGTTTATGGCTCTTTGTTAAATGCCAGTCGAATTCACAAGAGAGTTGAGTTAGGAGAACTTGCTGCTAACAAGCTCTTTGAGCTTGAACCGCAAAACCCTGGAAATTATGTTTTACTTTCTAATATATATGCCTCGGCTGGAAGATGGGAAGATGTTAAAAAGGTTAGAGAGAAGATGAGGAACGGAGGTTTGAAGAAATCAGTTGGGATGAGTTGGGTGGAATATAAGGGTCAAGTGCATAAGTTCGTTGTGGGTGATAGATCACATGAACGATCAAAAGATATTTATAGATTATTGGCTGAACTTGAAAGGAAGATGAAGAGGGTTGG
CTTTGTAGCTGATAAAAGCTGTGCACTTCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGAACTCACAGTGAGAAGTTGGCCATTTGTTTTGCTCTCCTTGTCAGTGAAGTGGGGACACCAATTAGAGTGGTAAAGAATTTAAGAATTTGTTTGGATTGCCATACAGCTATTAAAATCATCTCGAAGCTGGAGGAAAGAGAGATTATCGTCCGTGATAATAATAGGTTCCATTGTTTTAGGGACGGGATTTGTTCTTGTCATGATTACTGGTAA

mRNA sequence

ATGTATGAATTGCGCGCAATAGCTTCCAAAATAAGCAATATAAGTCAGTTAAGACAGCTTCATGGGCATCTTGTTCTCAATTCCCTCCATTCTCACAACTACTGGGTCTCTCTGCTCCTCATTATCTGTACCCGTCTTCACGCTCATCCTGCGTATGCGGCTTCTATTTTTAACTCCTTGCCGTCCCCCGATGCTTCTATTTACAGTTGTATGCTCAAATATTACTCCCGCATGGGTGCGAACAATGAGGTGGTTTCCCTCTTCAGATGTATGCAGTCTTTAGATCTTAGGCCCCAGCCCTTTGTTTACATATATTTGATCAAGTTGGCTGGGAAATATGGCAATTTGTTCCATGCTTATGTCCTGAAGTTGGGTTTTGTTGATGACCACTTCATCCGTAATGCTATCTTGGATATGTATGCAAAATATGGCCAAGTTGATCTTGCGAGGAAGTTGTTTGAGCAAATGGCTGAAAGAACTTTAGCAGACTGGAATTCGATGATTTCTGGCTGTTGGAAGTCAGGAAATGAAACTGATGCGGTCATGCTGTTTAATATGATGCCTGCTAGGAATATTATTACATGGACTGCCATGGTTACTGGGTATGCCAAGATGGGGGACTTGGAGAGTGCTAGAAGGTATTTTGATGAGATGCCAGAGAGAAGTGTAGTCTCATGGAATGCAATGCTCTCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCAAATGCTGAAAGAAGGGATCACACCTGATGATACAACATGGGTTGCTACAATTTCATCATGCTCTTCCATCGGCAATACTACCCTTGCTGATTCAATTCTAAGAAAGATCAACCAAAAGCATAGCATTTTGAATAATTTTGTCAAGACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTTGAATTTGCTAGAACTATCTTTGACGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATATCATGATCTCAGCATATACGAGGGTGGGAAAACTGTCATTAGCTCGAGAGTTGTTTGATAATATGCCAAAAAGAGATGTTGTTTCGTGGAATTCGATGATAGCTGGTTATGCACAAAATGGAGAGGCAGCCAAGTCAATTGAGCTCTTTCAAGAAATGATTTCTTGTACGGACATACAGCCGGATGAGGTTACCGTAGCTAGTGTTTTGTCTGCCTGTGGACATATTGGGGCTCTAAAATTGAGTTACTGGGTTCTAGATATCGTTCAAGAGAAAAACATTAAGTTGGGGATCTCAGGATTCAATTCTTTAATATTCCTGTACTCTAAATGTGGATGTGTGGCAGATGCCCATAGGATATTCCAAACTATGGAGACAAAAGATGTTGTTTCTTTCAATACGTTGATTTCAGGATTTGCTGCTAATGGCCATGGGAAGGAAGCTGTCAAGTTAGCATTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATATACTGGTGTTTTGACTGCATGTAGCCATGCAGGGCTGCTGAAAGAAGGTAAAGACGTCTTTAAGTCAATTAAATCACCTACTGTGGACCATTATGCTTGCATGGTTGATTTATTAGGAAGAGCAGGTGAATTAGATGAAGCCAAAATATTGATTCAATCTATGCCGATGGAACCTCATGCTGGTGTTTATGGCTCTTTGTTAAATGCCAGTCGAATTCACAAGAGAGTTGAGTTAGGAGAACTTGCTGCTAACAAGCTCTTTGAGCTTGAACCGCAAAACCCTGGAAATTATGTTTTACTTTCTAATATATATGCCTCGGCTGGAAGATGGGAAGATGTTAAAAAGGTTAGAGAGAAGATGAGGAACGGAGGTTTGAAGAAATCAGTTGGGATGAGTTGGGTGGAATATAAGGGTCAAGTGCATAAGTTCGTTGTGGGTGATAGATCACATGAACGATCAAAAGATATTTATAGATTATTGGCTGAACTTGAAAGGAAGATGAAGAGGGTTGG
CTTTGTAGCTGATAAAAGCTGTGCACTTCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGAACTCACAGTGAGAAGTTGGCCATTTGTTTTGCTCTCCTTGTCAGTGAAGTGGGGACACCAATTAGAGTGGTAAAGAATTTAAGAATTTGTTTGGATTGCCATACAGCTATTAAAATCATCTCGAAGCTGGAGGAAAGAGAGATTATCGTCCGTGATAATAATAGGTTCCATTGTTTTAGGGACGGGATTTGTTCTTGTCATGATTACTGGTAA

Coding sequence (CDS)

ATGTATGAATTGCGCGCAATAGCTTCCAAAATAAGCAATATAAGTCAGTTAAGACAGCTTCATGGGCATCTTGTTCTCAATTCCCTCCATTCTCACAACTACTGGGTCTCTCTGCTCCTCATTATCTGTACCCGTCTTCACGCTCATCCTGCGTATGCGGCTTCTATTTTTAACTCCTTGCCGTCCCCCGATGCTTCTATTTACAGTTGTATGCTCAAATATTACTCCCGCATGGGTGCGAACAATGAGGTGGTTTCCCTCTTCAGATGTATGCAGTCTTTAGATCTTAGGCCCCAGCCCTTTGTTTACATATATTTGATCAAGTTGGCTGGGAAATATGGCAATTTGTTCCATGCTTATGTCCTGAAGTTGGGTTTTGTTGATGACCACTTCATCCGTAATGCTATCTTGGATATGTATGCAAAATATGGCCAAGTTGATCTTGCGAGGAAGTTGTTTGAGCAAATGGCTGAAAGAACTTTAGCAGACTGGAATTCGATGATTTCTGGCTGTTGGAAGTCAGGAAATGAAACTGATGCGGTCATGCTGTTTAATATGATGCCTGCTAGGAATATTATTACATGGACTGCCATGGTTACTGGGTATGCCAAGATGGGGGACTTGGAGAGTGCTAGAAGGTATTTTGATGAGATGCCAGAGAGAAGTGTAGTCTCATGGAATGCAATGCTCTCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCAAATGCTGAAAGAAGGGATCACACCTGATGATACAACATGGGTTGCTACAATTTCATCATGCTCTTCCATCGGCAATACTACCCTTGCTGATTCAATTCTAAGAAAGATCAACCAAAAGCATAGCATTTTGAATAATTTTGTCAAGACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTTGAATTTGCTAGAACTATCTTTGACGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATATCATGATCTCAGCATATACGAGGGTGGGAAAACTGTCATTAGCTCGAGAGTTGTTTGATAATATGCCAAAAAGAGATGTTGTTTCGTGGAATTCGATGATAGCTGGTTATGCACAAAATGGAGAGGCAGCCAAGTCAATTGAGCTCTTTCAAGAAATGATTTCTTGTACGGACATACAGCCGGATGAGGTTACCGTAGCTAGTGTTTTGTCTGCCTGTGGACATATTGGGGCTCTAAAATTGAGTTACTGGGTTCTAGATATCGTTCAAGAGAAAAACATTAAGTTGGGGATCTCAGGATTCAATTCTTTAATATTCCTGTACTCTAAATGTGGATGTGTGGCAGATGCCCATAGGATATTCCAAACTATGGAGACAAAAGATGTTGTTTCTTTCAATACGTTGATTTCAGGATTTGCTGCTAATGGCCATGGGAAGGAAGCTGTCAAGTTAGCATTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATATACTGGTGTTTTGACTGCATGTAGCCATGCAGGGCTGCTGAAAGAAGGTAAAGACGTCTTTAAGTCAATTAAATCACCTACTGTGGACCATTATGCTTGCATGGTTGATTTATTAGGAAGAGCAGGTGAATTAGATGAAGCCAAAATATTGATTCAATCTATGCCGATGGAACCTCATGCTGGTGTTTATGGCTCTTTGTTAAATGCCAGTCGAATTCACAAGAGAGTTGAGTTAGGAGAACTTGCTGCTAACAAGCTCTTTGAGCTTGAACCGCAAAACCCTGGAAATTATGTTTTACTTTCTAATATATATGCCTCGGCTGGAAGATGGGAAGATGTTAAAAAGGTTAGAGAGAAGATGAGGAACGGAGGTTTGAAGAAATCAGTTGGGATGAGTTGGGTGGAATATAAGGGTCAAGTGCATAAGTTCGTTGTGGGTGATAGATCACATGAACGATCAAAAGATATTTATAGATTATTGGCTGAACTTGAAAGGAAGATGAAGAGGGTTGG
CTTTGTAGCTGATAAAAGCTGTGCACTTCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGAACTCACAGTGAGAAGTTGGCCATTTGTTTTGCTCTCCTTGTCAGTGAAGTGGGGACACCAATTAGAGTGGTAAAGAATTTAAGAATTTGTTTGGATTGCCATACAGCTATTAAAATCATCTCGAAGCTGGAGGAAAGAGAGATTATCGTCCGTGATAATAATAGGTTCCATTGTTTTAGGGACGGGATTTGTTCTTGTCATGATTACTGGTAA

Protein sequence

MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSLPSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAYVLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALLDMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW
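
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first and last codons of the CDS, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene. The names `CODON_TABLE` and `translate` are hypothetical and introduced only for this example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTATGAATTGCGCGCAATAGCTTCC"))  # MYELRAIAS
print(translate("CATGATTACTGGTAA"))              # HDYW
```

The two fragments are the first 27 and last 15 bases of the CDS; their translations match the start (`MYELRAIAS`) and end (`HDYW`) of the protein sequence, with the final `TAA` acting as the stop codon.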
BLAST of Cla020714 vs. Swiss-Prot
Match: PPR43_ARATH (Pentatricopeptide repeat-containing protein At1g14470 OS=Arabidopsis thaliana GN=PCMP-A4 PE=2 SV=2)

HSP 1 Score: 572.0 bits (1473), Expect = 9.6e-162
Identity = 283/532 (53.20%), Positives = 383/532 (71.99%), Query Frame = 1

Query: 4   LRAIASKISNISQLRQLHGHLVL-NSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSLPS 63
           L AIAS+     QL Q+H  L++ NSL   +YW S ++  CTRL A   Y   IF+S+  
Sbjct: 9   LAAIASQALTFPQLNQIHAQLIVFNSLPRQSYWASRIISCCTRLRAPSYYTRLIFDSVTF 68

Query: 64  PDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAYVL 123
           P+  + + M KY+S+M   N+V+ L+       + P  F +  +IK AG++G LF A V 
Sbjct: 69  PNVFVVNSMFKYFSKMDMANDVLRLYEQRSRCGIMPDAFSFPVVIKSAGRFGILFQALVE 128

Query: 124 KLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVM 183
           KLGF  D ++RN I+DMY K+  V+ ARK+F+Q+++R  +DWN MISG WK GN+ +A  
Sbjct: 129 KLGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMISGYWKWGNKEEACK 188

Query: 184 LFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAEEA 243
           LF+MMP  ++++WT M+TG+AK+ DLE+AR+YFD MPE+SVVSWNAMLS YAQN   E+A
Sbjct: 189 LFDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDA 248

Query: 244 LKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALLDM 303
           L+LF+ ML+ G+ P++TTWV  IS+CS   + +L  S+++ I++K   LN FVKTALLDM
Sbjct: 249 LRLFNDMLRLGVRPNETTWVIVISACSFRADPSLTRSLVKLIDEKRVRLNCFVKTALLDM 308

Query: 304 HAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSM 363
           HAK  +++ AR IF+ELG QRN VTWN MIS YTR+G +S AR+LFD MPKR+VVSWNS+
Sbjct: 309 HAKCRDIQSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSL 368

Query: 364 IAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQEKN 423
           IAGYA NG+AA +IE F++MI   D +PDEVT+ SVLSACGH+  L+L   ++D +++  
Sbjct: 369 IAGYAHNGQAALAIEFFEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQ 428

Query: 424 IKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKLAL 483
           IKL  SG+ SLIF+Y++ G + +A R+F  M+ +DVVS+NTL + FAANG G E + L  
Sbjct: 429 IKLNDSGYRSLIFMYARGGNLWEAKRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLS 488

Query: 484 TMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLL 535
            M++EGIEPD VTYT VLTAC+ AGLLKEG+ +FKSI++P  DHYACM DLL
Sbjct: 489 KMKDEGIEPDRVTYTSVLTACNRAGLLKEGQRIFKSIRNPLADHYACM-DLL 539
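
Each HSP summary in these reports carries a bit score, a raw score, an E-value, and identity and positive counts over the alignment length. The sketch below parses the two summary lines of the first Swiss-Prot hit and recomputes the percentages from the counts. The names `parse_hsp` and `percent` are hypothetical helpers, and the regular expressions are assumptions fitted to the line layout shown in this record, not a general BLAST parser. The pattern accepts either spelling of the positives label.

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def percent(numerator, denominator):
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100 * numerator / denominator, 2)


def parse_hsp(score_line, identity_line):
    """Extract scores and match counts from the two HSP summary lines."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError(f"unrecognised score line: {score_line!r}")
    hsp = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for label, num, den in FRACTION_RE.findall(identity_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key] = (int(num), int(den))
    return hsp


hsp = parse_hsp(
    "HSP 1 Score: 572.0 bits (1473), Expect = 9.6e-162",
    "Identity = 283/532 (53.20%), Positives = 383/532 (71.99%), Query Frame = 1",
)
print(percent(*hsp["identity"]), percent(*hsp["positives"]))  # 53.2 71.99
```

Recomputing the percentages from the raw counts is a cheap consistency check when report lines have been copied or reformatted by hand.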

BLAST of Cla020714 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 1.3e-145
Identity = 280/761 (36.79%), Positives = 440/761 (57.82%), Query Frame = 1

Query: 14  ISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAH---PAYAASIFNSLPSPDASIYSC 73
           +  LR +H  ++   LH+ NY +S L+  C  L  H     YA S+F ++  P+  I++ 
Sbjct: 46  LQSLRIIHAQMIKIGLHNTNYALSKLIEFCI-LSPHFEGLPYAISVFKTIQEPNLLIWNT 105

Query: 74  MLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKL-----AGKYGNLFHAYVLKLG 133
           M + ++        + L+ CM SL L P  + + +++K      A K G   H +VLKLG
Sbjct: 106 MFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLG 165

Query: 134 FVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFN 193
              D ++  +++ MY + G+++ A K+F++                              
Sbjct: 166 CDLDLYVHTSLISMYVQNGRLEDAHKVFDKS----------------------------- 225

Query: 194 MMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKL 253
             P R+++++TA++ GYA  G +E+A++ FDE+P + VVSWNAM+S YA+    +EAL+L
Sbjct: 226 --PHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 285

Query: 254 FHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALLDMHAK 313
           F  M+K  + PD++T V  +S+C+  G+  L   +   I+      N  +  AL+D+++K
Sbjct: 286 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 345

Query: 314 FGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAG 373
            G LE                                 A  LF+ +P +DV+SWN++I G
Sbjct: 346 CGELE--------------------------------TACGLFERLPYKDVISWNTLIGG 405

Query: 374 YAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQE--KNI 433
           Y       +++ LFQEM+   +  P++VT+ S+L AC H+GA+ +  W+   + +  K +
Sbjct: 406 YTHMNLYKEALLLFQEMLRSGE-TPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGV 465

Query: 434 KLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKLALT 493
               S   SLI +Y+KCG +  AH++F ++  K + S+N +I GFA +G    +  L   
Sbjct: 466 TNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSR 525

Query: 494 MEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSI-----KSPTVDHYACMVDLLGRAG 553
           M + GI+PD +T+ G+L+ACSH+G+L  G+ +F+++      +P ++HY CM+DLLG +G
Sbjct: 526 MRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSG 585

Query: 554 ELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSN 613
              EA+ +I  M MEP   ++ SLL A ++H  VELGE  A  L ++EP+NPG+YVLLSN
Sbjct: 586 LFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSN 645

Query: 614 IYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAE 673
           IYASAGRW +V K R  + + G+KK  G S +E    VH+F++GD+ H R+++IY +L E
Sbjct: 646 IYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEE 705

Query: 674 LERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRI 733
           +E  +++ GFV D S  L+++EEE KE  L  HSEKLAI F L+ ++ GT + +VKNLR+
Sbjct: 706 MEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRV 741

Query: 734 CLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           C +CH A K+ISK+ +REII RD  RFH FRDG+CSC+DYW
Sbjct: 766 CRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Cla020714 vs. Swiss-Prot
Match: PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 3.4e-143
Identity = 282/713 (39.55%), Positives = 424/713 (59.47%), Query Frame = 1

Query: 53  AASIFNSLPSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGK 112
           A  +F  +P   +  Y+ M+  Y R G       LF  M   DL      +  +IK   +
Sbjct: 83  ALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEMPERDL----VSWNVMIKGYVR 142

Query: 113 YGNLFHAYVL-KLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGC 172
             NL  A  L ++    D    N +L  YA+ G VD AR +F++M E+    WN+++S  
Sbjct: 143 NRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEKNDVSWNALLSAY 202

Query: 173 WKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLS 232
            ++    +A MLF       +++W  ++ G+ K   +  AR++FD M  R VVSWN +++
Sbjct: 203 VQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIIT 262

Query: 233 AYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSIL 292
            YAQ+   +EA +LF     E    D  TW A +S          A  +  K+ +++ + 
Sbjct: 263 GYAQSGKIDEARQLFD----ESPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNEVS 322

Query: 293 NNFVKTALLDMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNM 352
            N    A+L  + +   +E A+ +FD +   RN  TWN MI+ Y + GK+S A+ LFD M
Sbjct: 323 WN----AMLAGYVQGERMEMAKELFDVMPC-RNVSTWNTMITGYAQCGKISEAKNLFDKM 382

Query: 353 PKRDVVSWNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLS 412
           PKRD VSW +MIAGY+Q+G + +++ LF +M      + +  + +S LS C  + AL+L 
Sbjct: 383 PKRDPVSWAAMIAGYSQSGHSFEALRLFVQM-EREGGRLNRSSFSSALSTCADVVALELG 442

Query: 413 YWVLDIVQEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAAN 472
             +   + +   + G    N+L+ +Y KCG + +A+ +F+ M  KD+VS+NT+I+G++ +
Sbjct: 443 KQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRH 502

Query: 473 GHGKEAVKLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKS-----PTVDH 532
           G G+ A++   +M+ EG++PD  T   VL+ACSH GL+ +G+  F ++       P   H
Sbjct: 503 GFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQH 562

Query: 533 YACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELE 592
           YACMVDLLGRAG L++A  L+++MP EP A ++G+LL ASR+H   EL E AA+K+F +E
Sbjct: 563 YACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAME 622

Query: 593 PQNPGNYVLLSNIYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSH 652
           P+N G YVLLSN+YAS+GRW DV K+R +MR+ G+KK  G SW+E + + H F VGD  H
Sbjct: 623 PENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFH 682

Query: 653 ERSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEV 712
               +I+  L EL+ +MK+ G+V+  S  L DVEEEEKE M+  HSE+LA+ + ++    
Sbjct: 683 PEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDVEEEEKERMVRYHSERLAVAYGIMRVSS 742

Query: 713 GTPIRVVKNLRICLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           G PIRV+KNLR+C DCH AIK ++++  R II+RDNNRFH F+DG CSC DYW
Sbjct: 743 GRPIRVIKNLRVCEDCHNAIKYMARITGRLIILRDNNRFHHFKDGSCSCGDYW 781


HSP 2 Score: 57.8 bits (138), Expect = 6.1e-07
Identity = 45/196 (22.96%), Positives = 95/196 (48.47%), Query Frame = 1

Query: 355 DVVSWNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWV 414
           D+  WN  I+ Y + G   +++ +F+ M   +      V+   ++S     G  +L+  +
Sbjct: 63  DIKEWNVAISSYMRTGRCNEALRVFKRMPRWSS-----VSYNGMISGYLRNGEFELARKL 122

Query: 415 LDIVQEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHG 474
            D + E+++      +N +I  Y +   +  A  +F+ M  +DV S+NT++SG+A NG  
Sbjct: 123 FDEMPERDLV----SWNVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNG-- 182

Query: 475 KEAVKLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLL 534
              V  A ++ +   E + V++  +L+A      ++E   +FKS ++  +  + C++   
Sbjct: 183 --CVDDARSVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGF 242

Query: 535 GRAGELDEAKILIQSM 551
            +  ++ EA+    SM
Sbjct: 243 VKKKKIVEARQFFDSM 245

BLAST of Cla020714 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 8.8e-139
Identity = 285/812 (35.10%), Positives = 445/812 (54.80%), Query Frame = 1

Query: 5   RAIASKISN---ISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPA--YAASIF-N 64
           +A  S + N   I +L+  H  L    L +    ++ L+     L    +  +A  +F N
Sbjct: 33  KATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAKEVFEN 92

Query: 65  SLPSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKY----- 124
           S       +Y+ +++ Y+  G  NE + LF  M +  + P  + + + +    K      
Sbjct: 93  SESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGN 152

Query: 125 GNLFHAYVLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISG--- 184
           G   H  ++K+G+  D F++N+++  YA+ G++D ARK+F++M+ER +  W SMI G   
Sbjct: 153 GIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYAR 212

Query: 185 ---------------------------------CWK-----SGNETDAVMLFNMMPARNI 244
                                            C K     +G +  A +  + +   ++
Sbjct: 213 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 272

Query: 245 ITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKE 304
           +  +A+V  Y K   ++ A+R FDE    ++   NAM S Y +     EAL +F+ M+  
Sbjct: 273 MV-SALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDS 332

Query: 305 GITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALLDMHAKFGNLEFA 364
           G+ PD  + ++ ISSCS + N     S    + +      + +  AL+DM+ K    + A
Sbjct: 333 GVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTA 392

Query: 365 RTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAGYAQNGEA 424
             IFD +   +  VTWN +++ Y   G++  A E F+ MP++++VSWN++I+G  Q    
Sbjct: 393 FRIFDRMSN-KTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLF 452

Query: 425 AKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQEKNIKLGISGFNS 484
            ++IE+F  M S   +  D VT+ S+ SACGH+GAL L+ W+   +++  I+L +    +
Sbjct: 453 EEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTT 512

Query: 485 LIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKLALTMEEEGIEPD 544
           L+ ++S+CG    A  IF ++  +DV ++   I   A  G+ + A++L   M E+G++PD
Sbjct: 513 LVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPD 572

Query: 545 HVTYTGVLTACSHAGLLKEGKDVFKSIK-----SPTVDHYACMVDLLGRAGELDEAKILI 604
            V + G LTACSH GL+++GK++F S+      SP   HY CMVDLLGRAG L+EA  LI
Sbjct: 573 GVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLI 632

Query: 605 QSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIYASAGRWE 664
           + MPMEP+  ++ SLL A R+   VE+   AA K+  L P+  G+YVLLSN+YASAGRW 
Sbjct: 633 EDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWN 692

Query: 665 DVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELERKMKRVG 724
           D+ KVR  M+  GL+K  G S ++ +G+ H+F  GD SH    +I  +L E+ ++   +G
Sbjct: 693 DMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLG 752

Query: 725 FVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICLDCHTAIK 760
            V D S  L DV+E+EK  ML  HSEKLA+ + L+ S  GT IR+VKNLR+C DCH+  K
Sbjct: 753 HVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAK 812

BLAST of Cla020714 vs. Swiss-Prot
Match: PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 2.2e-134
Identity = 263/772 (34.07%), Positives = 431/772 (55.83%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M  L  + SK       R+L   + L +  S   W ++L     R           F+ L
Sbjct: 52  MNNLMNVYSKTGYALHARKLFDEMPLRTAFS---WNTVLSAYSKR--GDMDSTCEFFDQL 111

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAG-----KYGN 120
           P  D+  ++ M+  Y  +G  ++ + +   M    + P  F    ++         + G 
Sbjct: 112 PQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGK 171

Query: 121 LFHAYVLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSG 180
             H++++KLG   +  + N++L+MYAK G   +A+ +F++M  R ++ WN          
Sbjct: 172 KVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWN---------- 231

Query: 181 NETDAVMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQ 240
                                AM+  + ++G ++ A   F++M ER +V+WN+M+S + Q
Sbjct: 232 ---------------------AMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQ 291

Query: 241 NECAEEALKLFHQMLKEGI-TPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNF 300
                 AL +F +ML++ + +PD  T  + +S+C+++    +   I   I      ++  
Sbjct: 292 RGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGI 351

Query: 301 VKTALLDMHAKFGNLEFARTIFDELGGQRNAVT-WNIMISAYTRVGKLSLARELFDNMPK 360
           V  AL+ M+++ G +E AR + ++ G +   +  +  ++  Y ++G ++ A+ +F ++  
Sbjct: 352 VLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKD 411

Query: 361 RDVVSWNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYW 420
           RDVV+W +MI GY Q+G   ++I LF+ M+     +P+  T+A++LS    + +L     
Sbjct: 412 RDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQ-RPNSYTLAAMLSVASSLASLSHGKQ 471

Query: 421 VLDIVQEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMET-KDVVSFNTLISGFAANG 480
           +     +      +S  N+LI +Y+K G +  A R F  +   +D VS+ ++I   A +G
Sbjct: 472 IHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHG 531

Query: 481 HGKEAVKLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKS-----PTVDHY 540
           H +EA++L  TM  EG+ PDH+TY GV +AC+HAGL+ +G+  F  +K      PT+ HY
Sbjct: 532 HAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHY 591

Query: 541 ACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEP 600
           ACMVDL GRAG L EA+  I+ MP+EP    +GSLL+A R+HK ++LG++AA +L  LEP
Sbjct: 592 ACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEP 651

Query: 601 QNPGNYVLLSNIYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHE 660
           +N G Y  L+N+Y++ G+WE+  K+R+ M++G +KK  G SW+E K +VH F V D +H 
Sbjct: 652 ENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHP 711

Query: 661 RSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVG 720
              +IY  + ++  ++K++G+V D +  L D+EEE KE++L  HSEKLAI F L+ +   
Sbjct: 712 EKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDK 771

Query: 721 TPIRVVKNLRICLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           T +R++KNLR+C DCHTAIK ISKL  REIIVRD  RFH F+DG CSC DYW
Sbjct: 772 TTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of Cla020714 vs. TrEMBL
Match: A0A067KKU3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13364 PE=4 SV=1)

HSP 1 Score: 1041.6 bits (2692), Expect = 4.8e-301
Identity = 508/751 (67.64%), Positives = 610/751 (81.23%), Query Frame = 1

Query: 10  KISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSLPSPDASIYS 69
           ++ N + LRQLH HL+ NSLH HN W SLL+  CT LHA P Y  SIFNS P P   + +
Sbjct: 10  QLINANHLRQLHAHLIQNSLHHHNNWASLLIAQCTLLHAPPHYTRSIFNSTPDPTIHVVT 69

Query: 70  CMLKYYSRMGANNEVVSLFRCMQSLD-LRPQPFVYIYLIKLAGKYGNLFHAYVLKLGFVD 129
            MLKYYS +G  NEV+S F+ MQ    ++   +VY  +IK AGK G +FH ++ KLG+V+
Sbjct: 70  SMLKYYSHLGGQNEVISFFKHMQCCSYVKLGAYVYPLVIKSAGKDGIMFHGHIWKLGYVN 129

Query: 130 DHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMP 189
           D +IRN ILDMYAK G V++ARKLF++M ER+LADWNSMISG WK GNET+A  LF MMP
Sbjct: 130 DPYIRNVILDMYAKRGPVEVARKLFDEMTERSLADWNSMISGYWKGGNETEAFSLFKMMP 189

Query: 190 ARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQ 249
            RN++TWTAMVTG+AK+ DLE AR+YFD MP RSVVSWNAMLS YAQN  AEEALKLF  
Sbjct: 190 QRNVVTWTAMVTGFAKIKDLEKARKYFDYMPMRSVVSWNAMLSGYAQNGFAEEALKLFGD 249

Query: 250 MLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALLDMHAKFGN 309
           M+  G+ P++TTW   +S CS  G+  +A+SI++ ++ K   +N FVKTALLDM+AK GN
Sbjct: 250 MVNSGVQPNETTWATVVSLCSLFGDPCVAESIVKMLDGKRIKMNCFVKTALLDMNAKCGN 309

Query: 310 LEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAGYAQ 369
           LE AR IF+ELG  RN+VTWN MISAYT+VG L  AR+ FD MP+RDVVSWN+MI+GYAQ
Sbjct: 310 LEAARNIFNELGVHRNSVTWNTMISAYTKVGDLDSARDHFDRMPERDVVSWNTMISGYAQ 369

Query: 370 NGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQEKNIKLGIS 429
           NG++AK+IE+F+EMIS  D+QPDEVT+ASV+SACGH+GAL+L  WV++ + E  I L I 
Sbjct: 370 NGQSAKAIEIFKEMISSKDLQPDEVTMASVISACGHLGALELGTWVVNHITEYKINLSIL 429

Query: 430 GFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKLALTMEEEG 489
           G+NSLIF+YSKCG + +AHRIFQ MET+DVVS+NTLI+GFAA+G G EA+KL  TM+EEG
Sbjct: 430 GYNSLIFMYSKCGNMKEAHRIFQEMETRDVVSYNTLIAGFAAHGKGIEAIKLLSTMKEEG 489

Query: 490 IEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGELDEAKILIQ 549
           I PD VTY GVLTACSHAGL++EG  VF+SI+SP VDHYACMVDLLGR G+LDEAK LI 
Sbjct: 490 IHPDRVTYIGVLTACSHAGLMEEGHKVFESIESPDVDHYACMVDLLGRVGKLDEAKKLID 549

Query: 550 SMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIYASAGRWED 609
           +MPMEPHAGVYGSLL+AS+IHKRV+ GELAA  LF+LEPQN GNYVLLSNIYASAGRWE+
Sbjct: 550 NMPMEPHAGVYGSLLHASQIHKRVDFGELAAKMLFQLEPQNSGNYVLLSNIYASAGRWEE 609

Query: 610 VKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELERKMKRVGF 669
           V +VRE M  G +KK+ G SWVEY+G+VHKF+VGDRSHERS DIYRLLAEL  KM+R G+
Sbjct: 610 VNRVREMMSKGEVKKTAGWSWVEYQGKVHKFMVGDRSHERSDDIYRLLAELASKMRRHGY 669

Query: 670 VADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICLDCHTAIKI 729
            AD+SC LRDVEEEEKE M+GTHSEKLAICFALLVS+ G  IRVVKNLR+CLDCHTAIK+
Sbjct: 670 TADRSCVLRDVEEEEKEHMVGTHSEKLAICFALLVSKSGAAIRVVKNLRVCLDCHTAIKL 729

Query: 730 ISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           IS+LE REIIVRDNNRFH F DG+CSC D+W
Sbjct: 730 ISQLEGREIIVRDNNRFHHFNDGLCSCKDHW 760

BLAST of Cla020714 vs. TrEMBL
Match: A5B4C7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013866 PE=4 SV=1)

HSP 1 Score: 1025.0 bits (2649), Expect = 4.6e-296
Identity = 498/761 (65.44%), Positives = 611/761 (80.29%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M EL +IAS++ N S LRQLH  ++ NSLH HNYWV+LL+  CTRL A P Y   +FNS 
Sbjct: 1   MLELGSIASRVGNFSHLRQLHAQIIHNSLHHHNYWVALLINHCTRLRAPPHYTHLLFNST 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
            +P+  +++ ML++YS +  + +VV +F  MQ   +RP  FVY  LIK AG  G  FHA+
Sbjct: 61  LNPNVFVFTSMLRFYSHLQDHAKVVLMFEHMQGCGVRPDAFVYPILIKSAGNGGIGFHAH 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMA--ERTLADWNSMISGCWKSGNET 180
           VLKLG   D F+RNA++DMYA+ G +  ARK+F+++   ER +ADWN+M+SG WK  +E 
Sbjct: 121 VLKLGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEG 180

Query: 181 DAVMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNEC 240
            A  LF++MP RN+ITWTAMVTGYAK+ DLE+ARRYFD MPERSVVSWNAMLS YAQN  
Sbjct: 181 QAQWLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGL 240

Query: 241 AEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTA 300
           AEE L+LF +M+  GI PD+TTWV  IS+CSS G+  LA S++R ++QK   LN FV+TA
Sbjct: 241 AEEVLRLFDEMVNAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKQIQLNCFVRTA 300

Query: 301 LLDMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVS 360
           LLDM+AK G++  AR IFDELG  RN+VTWN MISAYTRVG L  ARELF+ MP R+VV+
Sbjct: 301 LLDMYAKCGSIGAARRIFDELGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVVT 360

Query: 361 WNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIV 420
           WNSMIAGYAQNG++A +IELF+EMI+   + PDEVT+ SV+SACGH+GAL+L  WV+  +
Sbjct: 361 WNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFL 420

Query: 421 QEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAV 480
            E  IKL ISG N++IF+YS+CG + DA R+FQ M T+DVVS+NTLISGFAA+GHG EA+
Sbjct: 421 TENQIKLSISGHNAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAI 480

Query: 481 KLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAG 540
            L  TM+E GIEPD VT+ GVLTACSHAGLL+EG+ VF+SIK P +DHYACMVDLLGR G
Sbjct: 481 NLMSTMKEGGIEPDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVG 540

Query: 541 ELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSN 600
           EL++AK  ++ MPMEPHAGVYGSLLNASRIHK+VELGELAANKLFELEP N GN++LLSN
Sbjct: 541 ELEDAKRTMERMPMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSN 600

Query: 601 IYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAE 660
           IYASAGRW+DV+++RE M+ GG+KK+ G SWVEY G++HKF+V DRSHERS DIY+LL E
Sbjct: 601 IYASAGRWKDVERIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIE 660

Query: 661 LERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRI 720
           L +KM+  G++ADKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+
Sbjct: 661 LRKKMREAGYIADKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRV 720

Query: 721 CLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           C DCHTAIK+ISKLE R IIVRDNNRFHCF DG+CSC DYW
Sbjct: 721 CWDCHTAIKMISKLEGRVIIVRDNNRFHCFNDGLCSCKDYW 761

BLAST of Cla020714 vs. TrEMBL
Match: A0A068TP37_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00015081001 PE=4 SV=1)

HSP 1 Score: 1001.9 bits (2589), Expect = 4.2e-289
Identity = 475/759 (62.58%), Positives = 606/759 (79.84%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M     IA +I + +QL+QLH  L+ NSLH+ ++WV+ L+ +CTRLHA P YA  +F   
Sbjct: 1   MSHFNFIAKEIRSQNQLKQLHAQLIQNSLHNQDFWVAQLIHLCTRLHAPPRYATRVFYLA 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
           P PD  + S MLKY+S+MGA NEV +LF  MQ  +++P  FVY  LIK +GK G  FHA+
Sbjct: 61  PQPDVFVCSNMLKYHSQMGATNEVFALFDQMQEANVKPVAFVYPLLIKSSGKAGIQFHAH 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDA 180
           +LK G   D +I+NA++  Y KYG ++ AR+LF++M ERT+ADWNS+ISG W  GNE +A
Sbjct: 121 LLKRGLNYDKYIQNAVMGFYCKYGAIESARELFDEMPERTIADWNSIISGYWNGGNEVEA 180

Query: 181 VMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAE 240
             LF++MP +N+ITWTAMV+GYAK+ DLESARRYFD+MPE+S VSWNAM+S YA N  AE
Sbjct: 181 QKLFDLMPEKNVITWTAMVSGYAKVNDLESARRYFDKMPEKSTVSWNAMISGYAHNGLAE 240

Query: 241 EALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALL 300
           EA+KLF++M+  G+ PD+TTWVA +SSCS +G+  LA+S++++I +K +  N+FVKTALL
Sbjct: 241 EAIKLFNEMMSFGLKPDETTWVAVVSSCSMLGDPGLAESLVKRIAEKGTCPNHFVKTALL 300

Query: 301 DMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN 360
           DM+AK GNL  AR IFD LG  RN VTWN MI+AYTRVG L+ A ELF  MPK++V+SWN
Sbjct: 301 DMYAKCGNLNMARKIFDGLGECRNLVTWNAMIAAYTRVGDLASAMELFHQMPKKNVISWN 360

Query: 361 SMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQE 420
           S+IAG +QNGE+A +IELF+EMI+  D++PDEVT+ SV+SACGH+GAL+L  WV + + E
Sbjct: 361 SIIAGCSQNGESAMAIELFKEMIASKDLKPDEVTMVSVISACGHLGALELGNWVANYLTE 420

Query: 421 KNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKL 480
             I+L ISG NSLIF+YSKCG +  A +IF+ ME +DV+S+NTLI+GFAA G G EA++L
Sbjct: 421 NQIRLSISGCNSLIFMYSKCGSMRKARKIFEEMENRDVISYNTLITGFAAYGSGAEALEL 480

Query: 481 ALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGEL 540
              M++E ++PD +TY G+LTACSH+GLL+EGK VFKSIK P +DHYACMVDL  R G+L
Sbjct: 481 LSKMKQESMQPDRITYIGILTACSHSGLLEEGKAVFKSIKDPDIDHYACMVDLYSRVGKL 540

Query: 541 DEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIY 600
           DEAK LI +MPM PHAG+YGSLLNASR+HKR++LGE  ANKLFELEP+N GNYVLLSNIY
Sbjct: 541 DEAKRLIDNMPMHPHAGIYGSLLNASRVHKRIDLGEFTANKLFELEPENSGNYVLLSNIY 600

Query: 601 ASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELE 660
           ASAGRWED  ++R  M+ GG+ K+ G SWVE+ G++H+FVVGD SH+ S DIYR+L E++
Sbjct: 601 ASAGRWEDADRIRGLMKAGGVAKATGWSWVEHGGKIHRFVVGDHSHKLSDDIYRVLGEMK 660

Query: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICL 720
           +KM   G++ADKSC LRDVEEEEKEEM+GTHSEKLA+ FALL+SE G  IRVVKNLR+C 
Sbjct: 661 KKMMVAGYMADKSCVLRDVEEEEKEEMVGTHSEKLAVAFALLISEPGAVIRVVKNLRVCR 720

Query: 721 DCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           DCHTAIKIIS LE+REIIVRDNNRFHCFRDG+CSC+D+W
Sbjct: 721 DCHTAIKIISHLEKREIIVRDNNRFHCFRDGLCSCNDHW 759

BLAST of Cla020714 vs. TrEMBL
Match: A0A103Y3C6_CYNCS (Uncharacterized protein OS=Cynara cardunculus var. scolymus GN=Ccrd_019973 PE=4 SV=1)

HSP 1 Score: 986.9 bits (2550), Expect = 1.4e-284
Identity = 466/759 (61.40%), Positives = 600/759 (79.05%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M  L AIA +++ I  L Q H  ++ +SL  H+YWVSLL+  CTRL A P Y   IFNS 
Sbjct: 1   MSHLNAIALRVTQIKHLSQFHAQVIHHSLQHHSYWVSLLISHCTRLRAPPFYTRLIFNSA 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
             P+   ++ MLK+YS +GA+++V++LF CM+   + P  FV+  LIK  GK   +FH +
Sbjct: 61  QQPNVYAFTNMLKFYSHVGAHDDVIALFDCMRVSGVIPDAFVFPVLIKSWGKAAIVFHGH 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDA 180
           VLK+G   D ++RNA++D+YAK+G +  ARKLF++M+ER +ADWNSMISG W+ GN+ +A
Sbjct: 121 VLKMGHQGDRYVRNAVMDVYAKHGPICFARKLFDEMSERMVADWNSMISGYWRWGNKVEA 180

Query: 181 VMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAE 240
             LF +MP RN++TWTAMV+GY+K  DL +A+RYFD+MPE++VVSWNAMLS YAQN  AE
Sbjct: 181 DKLFTLMPERNVVTWTAMVSGYSKTRDLVTAKRYFDQMPEKNVVSWNAMLSGYAQNGFAE 240

Query: 241 EALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALL 300
           EA++LF++M+   + PD+TTWVA ISSCS   +  LA+S+++ +NQK+  LN+FVK ALL
Sbjct: 241 EAIELFNEMVNRRVQPDETTWVAVISSCSDRADPNLANSLVKMLNQKNVRLNSFVKIALL 300

Query: 301 DMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN 360
           DMHAK GNL  A+ +F+ELG  RNAV WN MISAYTRVG LS ARELFD MP+++VVSWN
Sbjct: 301 DMHAKCGNLAAAKKVFEELGAFRNAVAWNAMISAYTRVGDLSSARELFDRMPRKNVVSWN 360

Query: 361 SMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQE 420
           SMIAGYAQNG++  +IE F+EMI   D++PDEVT+ SV+SACGH+G L+L  W L+ V E
Sbjct: 361 SMIAGYAQNGQSTMAIEAFKEMIRSKDMKPDEVTMISVISACGHLGVLELGNWALNFVNE 420

Query: 421 KNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKL 480
             I L ISG+NS+IF+YSKCG + DA R+F+ MET+DV+SFNTLI+GFAA+G G  A+ L
Sbjct: 421 NQISLSISGYNSVIFMYSKCGSMKDAKRVFEEMETRDVISFNTLITGFAAHGDGFSAIDL 480

Query: 481 ALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGEL 540
              M+ +G +PD +TY GVLTACSHAG+L+EG+ VF+SI +P VDHYACM+DLLGR G+L
Sbjct: 481 MRKMKGDGFQPDRITYIGVLTACSHAGMLEEGQRVFESISNPDVDHYACMIDLLGRVGKL 540

Query: 541 DEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIY 600
           DEAK LI+ MPM PHAGVYGSLLNASRI KR++LGE AA +LF++EP+N GNYVLLSN+Y
Sbjct: 541 DEAKRLIKKMPMVPHAGVYGSLLNASRIRKRIDLGEFAARELFKIEPENSGNYVLLSNMY 600

Query: 601 ASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELE 660
           AS GRW DV+++R +MR GG+KK+ G SWVE+ G++HKF+VGD+SHERS DIY+LL EL 
Sbjct: 601 ASMGRWGDVERIRGEMRLGGVKKTTGWSWVEFGGKLHKFIVGDKSHERSNDIYKLLKELR 660

Query: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICL 720
            KM++ G+ ADK   LRDVEEEEKEEM+GTHSEKLA+CF LLVSE GT IR++KNLR+C 
Sbjct: 661 EKMRKAGYTADKDSVLRDVEEEEKEEMVGTHSEKLAVCFGLLVSEAGTSIRIMKNLRVCW 720

Query: 721 DCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           DCH A+K+ISKLE REIIVRDNNRFHCF +G CSC D+W
Sbjct: 721 DCHEAMKMISKLEGREIIVRDNNRFHCFNNGSCSCMDHW 759

BLAST of Cla020714 vs. TrEMBL
Match: K4B1K5_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 971.1 bits (2509), Expect = 7.9e-280
Identity = 462/759 (60.87%), Positives = 602/759 (79.31%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M  L   A K + +  L+Q H  L   SL S NYWV+ L+ +CTRLHA   Y + +F+S+
Sbjct: 1   MSHLHTAALKATKLIHLKQFHAQLFQRSLCSDNYWVAQLIKLCTRLHAPSTYVSRVFDSV 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
             P+  +++ +LK+YS++GA ++V+ LF  MQ  ++ P  FVY  LIK +GK+G +FHA+
Sbjct: 61  HQPNVFVFTNILKFYSQLGAYSDVLYLFDKMQKSNVAPDAFVYPILIKASGKWGIVFHAH 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDA 180
            +K+G   D F+RNAI+D+Y K+G +++AR+LF+++ ER +ADWN+MISGCW  G+E +A
Sbjct: 121 CIKMGHDWDRFVRNAIMDVYGKFGPLEIARELFDEIPERAVADWNAMISGCWNWGDEVEA 180

Query: 181 VMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAE 240
             LF++MP +N++TWTAMVTGY++  DLE+AR+YFD+MPERSVVSWNAMLS YAQN CAE
Sbjct: 181 RSLFDLMPEKNVVTWTAMVTGYSRRKDLENARKYFDQMPERSVVSWNAMLSGYAQNGCAE 240

Query: 241 EALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALL 300
           E +KLF++M+   + PD+TTWV  IS CSS G+ +LA+ +++ IN+K   LN F KTALL
Sbjct: 241 EVIKLFNEMMSCEVCPDETTWVTVISLCSSHGDVSLAEGLVKMINEKGVRLNCFAKTALL 300

Query: 301 DMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN 360
           DM+AK GNL  AR IFDELG  +N VTWN MISAY RVG L+ AR LFD +P+++V+SWN
Sbjct: 301 DMYAKCGNLAMARKIFDELGTYKNLVTWNAMISAYARVGDLASARGLFDKVPEKNVISWN 360

Query: 361 SMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQE 420
           S+IAGYAQNGE+  +I+LF++MI+  D+ PDEVT+ SV+SACGH+GAL+   W ++ +++
Sbjct: 361 SIIAGYAQNGESKVAIDLFKDMIA-KDVLPDEVTMVSVISACGHLGALEFGNWAVNFLEK 420

Query: 421 KNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKL 480
             IKL ISG+N+LIF+YSKCG + DA ++FQ+ME +DV+S+NTLI+G AA G+  EAV+L
Sbjct: 421 HQIKLSISGYNALIFMYSKCGNMKDAEKVFQSMEARDVISYNTLITGVAAYGNAIEAVEL 480

Query: 481 ALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGEL 540
              M++E IEPD +TY GVLTACSH GLLKEG+ +F SIK P  DHYACMVDLLGR G+L
Sbjct: 481 LWKMKKENIEPDRITYIGVLTACSHGGLLKEGQRIFDSIKDPDSDHYACMVDLLGRNGKL 540

Query: 541 DEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIY 600
           DEAK LI SM M PHAGVYGSLL+ASR+HKR++LGE AA+KLFE+EP+N GNYVLLSNIY
Sbjct: 541 DEAKCLIGSMAMHPHAGVYGSLLHASRVHKRIDLGEFAASKLFEIEPENSGNYVLLSNIY 600

Query: 601 ASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELE 660
           ASA RWEDV +VR  M  GG+KK+ G SW+E+KG++HKF+VGDRSHER+ DI+R+L E E
Sbjct: 601 ASARRWEDVDRVRGLMTIGGVKKTTGWSWIEHKGEMHKFIVGDRSHERTADIHRVLFETE 660

Query: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICL 720
           +KMK  G++ADKSC L+DVEEEE EEM+GTHSEK+A+ FALLV+E  + IRVVKNLRIC 
Sbjct: 661 KKMKLAGYMADKSCVLKDVEEEEMEEMVGTHSEKMAVAFALLVTEPHSVIRVVKNLRICR 720

Query: 721 DCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           DCHTAIKIISK+E REIIVRDNNRFHCF +G CSC DYW
Sbjct: 721 DCHTAIKIISKMEGREIIVRDNNRFHCFSEGQCSCKDYW 758

BLAST of Cla020714 vs. NCBI nr
Match: gi|449460189|ref|XP_004147828.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis sativus])

HSP 1 Score: 1380.9 bits (3573), Expect = 0.0e+00
Identity = 679/759 (89.46%), Positives = 717/759 (94.47%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           MYEL A+ASKISNI QLRQ HGHLV NSLHSHNYWVSLLLI CTRLHAHPAY  SIF S 
Sbjct: 1   MYELVALASKISNIRQLRQFHGHLVHNSLHSHNYWVSLLLINCTRLHAHPAYVDSIFTSS 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
           PSPDAS+YSCMLKYYSRMGA+N+VVSLF+C  SL+LRPQPFVYIYLIKLAGK GNLFHAY
Sbjct: 61  PSPDASVYSCMLKYYSRMGAHNQVVSLFKCTHSLNLRPQPFVYIYLIKLAGKSGNLFHAY 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDA 180
           VLKLG +DDHFIRNAILDMYAK GQVDLAR LFEQMAERTLADWNSMISGCWKSGNET+A
Sbjct: 121 VLKLGHIDDHFIRNAILDMYAKNGQVDLARNLFEQMAERTLADWNSMISGCWKSGNETEA 180

Query: 181 VMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAE 240
           V+LFNMMPARNIITWT+MVTGYAKMGDLESARRYFDEMPERSVVSWNAM SAYAQ EC +
Sbjct: 181 VVLFNMMPARNIITWTSMVTGYAKMGDLESARRYFDEMPERSVVSWNAMQSAYAQKECPK 240

Query: 241 EALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALL 300
           EAL LFHQML+EGITPDDTTWV TISSCSSIG+ TLADSILR I+QKH +LN+FVKTALL
Sbjct: 241 EALNLFHQMLEEGITPDDTTWVVTISSCSSIGDPTLADSILRMIDQKHIVLNSFVKTALL 300

Query: 301 DMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN 360
           DMHAKFGNLE AR IFDELG QRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN
Sbjct: 301 DMHAKFGNLEIARNIFDELGSQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN 360

Query: 361 SMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQE 420
           SMIAGYAQNGE+A SIELF+EMISC DIQPDEVT+ASVLSACGHIGALKLSYWVLDIV+E
Sbjct: 361 SMIAGYAQNGESAMSIELFKEMISCMDIQPDEVTIASVLSACGHIGALKLSYWVLDIVRE 420

Query: 421 KNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKL 480
           KNIKLGISGFNSLIF+YSKCG VADAHRIFQTM T+DVVSFNTLISGFAANGHGKEA+KL
Sbjct: 421 KNIKLGISGFNSLIFMYSKCGSVADAHRIFQTMGTRDVVSFNTLISGFAANGHGKEAIKL 480

Query: 481 ALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGEL 540
            LTMEEEGIEPDHVTY GVLTACSHAGLL EGK+VFKSI++PTVDHYACMVDLLGRAGEL
Sbjct: 481 VLTMEEEGIEPDHVTYIGVLTACSHAGLLNEGKNVFKSIQAPTVDHYACMVDLLGRAGEL 540

Query: 541 DEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIY 600
           DEAK+LIQSMPM+PHAGVYGSLLNASRIHKRV LGELAA+KLFELEPQN GNYVLLSNIY
Sbjct: 541 DEAKMLIQSMPMKPHAGVYGSLLNASRIHKRVGLGELAASKLFELEPQNLGNYVLLSNIY 600

Query: 601 ASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELE 660
           AS GRWEDVK+VRE M+ GGLKKSVGMSWVEYKGQVHKF VGDRSHE+SKDIY+LLAELE
Sbjct: 601 ASFGRWEDVKRVREMMKKGGLKKSVGMSWVEYKGQVHKFTVGDRSHEQSKDIYKLLAELE 660

Query: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICL 720
           RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALL+SEVGT IRVVKNLRICL
Sbjct: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTTIRVVKNLRICL 720

Query: 721 DCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           DCHTAIK+ISKLE REI+VRDNNRFHCF +G+CSCHDYW
Sbjct: 721 DCHTAIKMISKLEGREIVVRDNNRFHCFSEGMCSCHDYW 759

BLAST of Cla020714 vs. NCBI nr
Match: gi|659109350|ref|XP_008454670.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis melo])

HSP 1 Score: 1378.2 bits (3566), Expect = 0.0e+00
Identity = 674/759 (88.80%), Positives = 713/759 (93.94%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           MYEL A+ASKISNI QLR  HGHLV NSLHSHNYWVSLLL+IC RLHAHPAY  SIF S 
Sbjct: 1   MYELVALASKISNIRQLRLFHGHLVHNSLHSHNYWVSLLLVICNRLHAHPAYVDSIFTSS 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
           PSPDAS+YSCMLKYYSRMGA+N+VVSLF+CM SLDLRPQPFVYIYLIK AGK GNLFHAY
Sbjct: 61  PSPDASVYSCMLKYYSRMGAHNQVVSLFKCMHSLDLRPQPFVYIYLIKSAGKSGNLFHAY 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDA 180
           VLKLG +DD FIRNAILDMY K GQVDLARKLFEQMAE+TL DWNSMISGCWKSGNET+A
Sbjct: 121 VLKLGHIDDRFIRNAILDMYVKNGQVDLARKLFEQMAEKTLVDWNSMISGCWKSGNETEA 180

Query: 181 VMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAE 240
           VMLFNMMPARNIITWT+MVTGYAK+GDLESARRYFDEMPERSVVSWNAM SAYAQ EC +
Sbjct: 181 VMLFNMMPARNIITWTSMVTGYAKVGDLESARRYFDEMPERSVVSWNAMQSAYAQKECPK 240

Query: 241 EALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALL 300
           EALKLFHQMLKEGITPDDTTW  TISSCSSIG+ TLADSILR INQKH +LN+FV+TALL
Sbjct: 241 EALKLFHQMLKEGITPDDTTWAVTISSCSSIGDPTLADSILRMINQKHIVLNSFVQTALL 300

Query: 301 DMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWN 360
           DMHAKFGNLE AR IFDELG QRN V WN+MISAYTRVGKLSLARELFDNMPKRDVVSWN
Sbjct: 301 DMHAKFGNLEIARNIFDELGSQRNDVAWNVMISAYTRVGKLSLARELFDNMPKRDVVSWN 360

Query: 361 SMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQE 420
           SMIAGYAQNGEAA SIELF+EMISC DIQPDEVT+ASVLSACGHIGALKL YWVLDIV+E
Sbjct: 361 SMIAGYAQNGEAAMSIELFKEMISCADIQPDEVTIASVLSACGHIGALKLGYWVLDIVRE 420

Query: 421 KNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKL 480
           KNIKLGISGFNSLIF+YSKCG VADAHRIFQTMET+DVVSFNTLISGFAANGHGKEA+KL
Sbjct: 421 KNIKLGISGFNSLIFMYSKCGSVADAHRIFQTMETRDVVSFNTLISGFAANGHGKEAIKL 480

Query: 481 ALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGEL 540
            LTMEEEGIEPDHVTY GVLTACSHAGLL EGK+VFKSIK+PTVDHYACMVDLLGRAGEL
Sbjct: 481 VLTMEEEGIEPDHVTYIGVLTACSHAGLLNEGKNVFKSIKAPTVDHYACMVDLLGRAGEL 540

Query: 541 DEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIY 600
           DEAK+LIQSMPM+PH GVYGSLLNASRIHKRV LGELAA+KLFELEPQNPGNYVLLSNIY
Sbjct: 541 DEAKMLIQSMPMKPHGGVYGSLLNASRIHKRVGLGELAASKLFELEPQNPGNYVLLSNIY 600

Query: 601 ASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELE 660
           AS+GRWEDVK+VRE MR  GL+K VGMSWVEYKGQVHKF+VGDRSHE+SKDIY+LLAELE
Sbjct: 601 ASSGRWEDVKRVREMMRKRGLQKLVGMSWVEYKGQVHKFIVGDRSHEQSKDIYKLLAELE 660

Query: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICL 720
           RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALL+SEVGTPIRVVKNLRICL
Sbjct: 661 RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVVKNLRICL 720

Query: 721 DCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           DCHTAIK+ISKLE REI+VRDNNRFHCF DG+CSC+DYW
Sbjct: 721 DCHTAIKMISKLEGREIVVRDNNRFHCFSDGMCSCYDYW 759

BLAST of Cla020714 vs. NCBI nr
Match: gi|802640482|ref|XP_012078834.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Jatropha curcas])

HSP 1 Score: 1041.6 bits (2692), Expect = 6.9e-301
Identity = 508/751 (67.64%), Positives = 610/751 (81.23%), Query Frame = 1

Query: 10  KISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSLPSPDASIYS 69
           ++ N + LRQLH HL+ NSLH HN W SLL+  CT LHA P Y  SIFNS P P   + +
Sbjct: 10  QLINANHLRQLHAHLIQNSLHHHNNWASLLIAQCTLLHAPPHYTRSIFNSTPDPTIHVVT 69

Query: 70  CMLKYYSRMGANNEVVSLFRCMQSLD-LRPQPFVYIYLIKLAGKYGNLFHAYVLKLGFVD 129
            MLKYYS +G  NEV+S F+ MQ    ++   +VY  +IK AGK G +FH ++ KLG+V+
Sbjct: 70  SMLKYYSHLGGQNEVISFFKHMQCCSYVKLGAYVYPLVIKSAGKDGIMFHGHIWKLGYVN 129

Query: 130 DHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMP 189
           D +IRN ILDMYAK G V++ARKLF++M ER+LADWNSMISG WK GNET+A  LF MMP
Sbjct: 130 DPYIRNVILDMYAKRGPVEVARKLFDEMTERSLADWNSMISGYWKGGNETEAFSLFKMMP 189

Query: 190 ARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQ 249
            RN++TWTAMVTG+AK+ DLE AR+YFD MP RSVVSWNAMLS YAQN  AEEALKLF  
Sbjct: 190 QRNVVTWTAMVTGFAKIKDLEKARKYFDYMPMRSVVSWNAMLSGYAQNGFAEEALKLFGD 249

Query: 250 MLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTALLDMHAKFGN 309
           M+  G+ P++TTW   +S CS  G+  +A+SI++ ++ K   +N FVKTALLDM+AK GN
Sbjct: 250 MVNSGVQPNETTWATVVSLCSLFGDPCVAESIVKMLDGKRIKMNCFVKTALLDMNAKCGN 309

Query: 310 LEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVSWNSMIAGYAQ 369
           LE AR IF+ELG  RN+VTWN MISAYT+VG L  AR+ FD MP+RDVVSWN+MI+GYAQ
Sbjct: 310 LEAARNIFNELGVHRNSVTWNTMISAYTKVGDLDSARDHFDRMPERDVVSWNTMISGYAQ 369

Query: 370 NGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIVQEKNIKLGIS 429
           NG++AK+IE+F+EMIS  D+QPDEVT+ASV+SACGH+GAL+L  WV++ + E  I L I 
Sbjct: 370 NGQSAKAIEIFKEMISSKDLQPDEVTMASVISACGHLGALELGTWVVNHITEYKINLSIL 429

Query: 430 GFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAVKLALTMEEEG 489
           G+NSLIF+YSKCG + +AHRIFQ MET+DVVS+NTLI+GFAA+G G EA+KL  TM+EEG
Sbjct: 430 GYNSLIFMYSKCGNMKEAHRIFQEMETRDVVSYNTLIAGFAAHGKGIEAIKLLSTMKEEG 489

Query: 490 IEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAGELDEAKILIQ 549
           I PD VTY GVLTACSHAGL++EG  VF+SI+SP VDHYACMVDLLGR G+LDEAK LI 
Sbjct: 490 IHPDRVTYIGVLTACSHAGLMEEGHKVFESIESPDVDHYACMVDLLGRVGKLDEAKKLID 549

Query: 550 SMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSNIYASAGRWED 609
           +MPMEPHAGVYGSLL+AS+IHKRV+ GELAA  LF+LEPQN GNYVLLSNIYASAGRWE+
Sbjct: 550 NMPMEPHAGVYGSLLHASQIHKRVDFGELAAKMLFQLEPQNSGNYVLLSNIYASAGRWEE 609

Query: 610 VKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAELERKMKRVGF 669
           V +VRE M  G +KK+ G SWVEY+G+VHKF+VGDRSHERS DIYRLLAEL  KM+R G+
Sbjct: 610 VNRVREMMSKGEVKKTAGWSWVEYQGKVHKFMVGDRSHERSDDIYRLLAELASKMRRHGY 669

Query: 670 VADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRICLDCHTAIKI 729
            AD+SC LRDVEEEEKE M+GTHSEKLAICFALLVS+ G  IRVVKNLR+CLDCHTAIK+
Sbjct: 670 TADRSCVLRDVEEEEKEHMVGTHSEKLAICFALLVSKSGAAIRVVKNLRVCLDCHTAIKL 729

Query: 730 ISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           IS+LE REIIVRDNNRFH F DG+CSC D+W
Sbjct: 730 ISQLEGREIIVRDNNRFHHFNDGLCSCKDHW 760

BLAST of Cla020714 vs. NCBI nr
Match: gi|147856457|emb|CAN80769.1| (hypothetical protein VITISV_013866 [Vitis vinifera])

HSP 1 Score: 1025.0 bits (2649), Expect = 6.6e-296
Identity = 498/761 (65.44%), Positives = 611/761 (80.29%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M EL +IAS++ N S LRQLH  ++ NSLH HNYWV+LL+  CTRL A P Y   +FNS 
Sbjct: 1   MLELGSIASRVGNFSHLRQLHAQIIHNSLHHHNYWVALLINHCTRLRAPPHYTHLLFNST 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
            +P+  +++ ML++YS +  + +VV +F  MQ   +RP  FVY  LIK AG  G  FHA+
Sbjct: 61  LNPNVFVFTSMLRFYSHLQDHAKVVLMFEHMQGCGVRPDAFVYPILIKSAGNGGIGFHAH 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMA--ERTLADWNSMISGCWKSGNET 180
           VLKLG   D F+RNA++DMYA+ G +  ARK+F+++   ER +ADWN+M+SG WK  +E 
Sbjct: 121 VLKLGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEG 180

Query: 181 DAVMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNEC 240
            A  LF++MP RN+ITWTAMVTGYAK+ DLE+ARRYFD MPERSVVSWNAMLS YAQN  
Sbjct: 181 QAQWLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGL 240

Query: 241 AEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTA 300
           AEE L+LF +M+  GI PD+TTWV  IS+CSS G+  LA S++R ++QK   LN FV+TA
Sbjct: 241 AEEVLRLFDEMVNAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKQIQLNCFVRTA 300

Query: 301 LLDMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVS 360
           LLDM+AK G++  AR IFDELG  RN+VTWN MISAYTRVG L  ARELF+ MP R+VV+
Sbjct: 301 LLDMYAKCGSIGAARRIFDELGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVVT 360

Query: 361 WNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIV 420
           WNSMIAGYAQNG++A +IELF+EMI+   + PDEVT+ SV+SACGH+GAL+L  WV+  +
Sbjct: 361 WNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFL 420

Query: 421 QEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAV 480
            E  IKL ISG N++IF+YS+CG + DA R+FQ M T+DVVS+NTLISGFAA+GHG EA+
Sbjct: 421 TENQIKLSISGHNAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAI 480

Query: 481 KLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAG 540
            L  TM+E GIEPD VT+ GVLTACSHAGLL+EG+ VF+SIK P +DHYACMVDLLGR G
Sbjct: 481 NLMSTMKEGGIEPDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVG 540

Query: 541 ELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSN 600
           EL++AK  ++ MPMEPHAGVYGSLLNASRIHK+VELGELAANKLFELEP N GN++LLSN
Sbjct: 541 ELEDAKRTMERMPMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSN 600

Query: 601 IYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAE 660
           IYASAGRW+DV+++RE M+ GG+KK+ G SWVEY G++HKF+V DRSHERS DIY+LL E
Sbjct: 601 IYASAGRWKDVERIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIE 660

Query: 661 LERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRI 720
           L +KM+  G++ADKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+
Sbjct: 661 LRKKMREAGYIADKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRV 720

Query: 721 CLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           C DCHTAIK+ISKLE R IIVRDNNRFHCF DG+CSC DYW
Sbjct: 721 CWDCHTAIKMISKLEGRVIIVRDNNRFHCFNDGLCSCKDYW 761

BLAST of Cla020714 vs. NCBI nr
Match: gi|731411247|ref|XP_010657905.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Vitis vinifera])

HSP 1 Score: 1019.6 bits (2635), Expect = 2.8e-294
Identity = 495/761 (65.05%), Positives = 611/761 (80.29%), Query Frame = 1

Query: 1   MYELRAIASKISNISQLRQLHGHLVLNSLHSHNYWVSLLLIICTRLHAHPAYAASIFNSL 60
           M EL +IAS++ N + LRQLH  ++ NSLH HNYWV+LL+  CTRL A P Y   +FNS 
Sbjct: 1   MLELGSIASRVGNFNHLRQLHAQIIHNSLHHHNYWVALLINHCTRLRAPPHYTHLLFNST 60

Query: 61  PSPDASIYSCMLKYYSRMGANNEVVSLFRCMQSLDLRPQPFVYIYLIKLAGKYGNLFHAY 120
            +P+  +++ ML++YS +  + +VV ++  MQ   +RP  FVY  LIK AG  G  FHA+
Sbjct: 61  LNPNVFVFTSMLRFYSHLQDHAKVVLMYEQMQGCGVRPDAFVYPILIKSAGTGGIGFHAH 120

Query: 121 VLKLGFVDDHFIRNAILDMYAKYGQVDLARKLFEQMA--ERTLADWNSMISGCWKSGNET 180
           VLKLG   D F+RNA++DMYA+ G +  ARK+F+++   ER +ADWN+M+SG WK  +E 
Sbjct: 121 VLKLGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEG 180

Query: 181 DAVMLFNMMPARNIITWTAMVTGYAKMGDLESARRYFDEMPERSVVSWNAMLSAYAQNEC 240
            A  LF++MP RN+ITWTAMVTGYAK+ DLE+ARRYFD MPERSVVSWNAMLS YAQN  
Sbjct: 181 QAQWLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGL 240

Query: 241 AEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNTTLADSILRKINQKHSILNNFVKTA 300
           AEEAL+LF +M+  GI PD+TTWV  IS+CSS G+  LA S++R ++QK   LN FV+TA
Sbjct: 241 AEEALRLFDEMVNAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKRIQLNCFVRTA 300

Query: 301 LLDMHAKFGNLEFARTIFDELGGQRNAVTWNIMISAYTRVGKLSLARELFDNMPKRDVVS 360
           LLDM+AK G++  AR IFDELG  RN+VTWN MISAY RVG L  AR+LF+ MP R+VV+
Sbjct: 301 LLDMYAKCGSIGAARRIFDELGAYRNSVTWNAMISAYMRVGDLDSARKLFNTMPGRNVVT 360

Query: 361 WNSMIAGYAQNGEAAKSIELFQEMISCTDIQPDEVTVASVLSACGHIGALKLSYWVLDIV 420
           WNSMIAGYAQNG++A +IELF+EMI+   + PDEVT+ SV+SACGH+GAL+L  WV+  +
Sbjct: 361 WNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFL 420

Query: 421 QEKNIKLGISGFNSLIFLYSKCGCVADAHRIFQTMETKDVVSFNTLISGFAANGHGKEAV 480
            E  IKL ISG N++IF+YS+CG + DA R+FQ M T+DVVS+NTLISGFAA+GHG EA+
Sbjct: 421 TENQIKLSISGHNAMIFMYSRCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAI 480

Query: 481 KLALTMEEEGIEPDHVTYTGVLTACSHAGLLKEGKDVFKSIKSPTVDHYACMVDLLGRAG 540
            L  TM+E GIEPD VT+ GVLTACSHAGLL+EG+ VF+SIK P +DHYACMVDLLGR G
Sbjct: 481 NLMSTMKEGGIEPDRVTFIGVLTACSHAGLLEEGRKVFESIKDPAIDHYACMVDLLGRVG 540

Query: 541 ELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPQNPGNYVLLSN 600
           EL++AK  ++ MPMEPHAGVYGSLLNASRIHK+VELGELAANKLFELEP N GN++LLSN
Sbjct: 541 ELEDAKRTMERMPMEPHAGVYGSLLNASRIHKQVELGELAANKLFELEPDNSGNFILLSN 600

Query: 601 IYASAGRWEDVKKVREKMRNGGLKKSVGMSWVEYKGQVHKFVVGDRSHERSKDIYRLLAE 660
           IYASAGRW+DV+++RE M+ GG+KK+ G SWVEY G++HKF+V DRSHERS DIY+LL E
Sbjct: 601 IYASAGRWKDVERIREAMKKGGVKKTTGWSWVEYGGKLHKFIVADRSHERSDDIYQLLIE 660

Query: 661 LERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLVSEVGTPIRVVKNLRI 720
           L +KM+  G++ADKSC LRDVEEEEKEE++GTHSEKLAIC+ALLVSE G  IRVVKNLR+
Sbjct: 661 LRKKMREAGYIADKSCVLRDVEEEEKEEIVGTHSEKLAICYALLVSEAGAVIRVVKNLRV 720

Query: 721 CLDCHTAIKIISKLEEREIIVRDNNRFHCFRDGICSCHDYW 760
           C DCHTAIK+ISKLE R IIVRDNNRFHCF DG+CSC DYW
Sbjct: 721 CWDCHTAIKMISKLEGRVIIVRDNNRFHCFNDGLCSCKDYW 761
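
The percentages in each HSP summary line above are simply the match counts divided by the alignment length. The following is a minimal sketch, not part of the database's own tooling, that recomputes them from one summary line. The function name `hsp_percentages` and the regular expression are illustrative assumptions; the pattern accepts either spelling of the "Positives" label.

```python
import re

# Hypothetical helper (not from the source page): matches an HSP summary line
# such as "Identity = 495/761 (65.05%), Positives = 611/761 (80.29%), ...".
# "Pos\w+" tolerates the misspelled label "Postives" seen in some BLAST output.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))


line = "Identity = 495/761 (65.05%), Positives = 611/761 (80.29%), Query Frame = 1"
print(hsp_percentages(line))  # (65.05, 80.29)
```

Recomputing from the counts is a quick consistency check on a scraped report: if the recomputed value disagrees with the printed one, the line was probably damaged during extraction.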

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PPR43_ARATH  9.6e-162  53.20     Pentatricopeptide repeat-containing protein At1g14470 OS=Arabidopsis thaliana GN...
PPR21_ARATH  1.3e-145  36.79     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP301_ARATH  3.4e-143  39.55     Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN...
PP249_ARATH  8.8e-139  35.10     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN...
PP168_ARATH  2.2e-134  34.07     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A067KKU3_JATCU  4.8e-301  67.64     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13364 PE=4 SV=1
A5B4C7_VITVI      4.6e-296  65.44     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013866 PE=4 SV=1
A0A068TP37_COFCA  4.2e-289  62.58     Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00015081001 PE=4 SV=1
A0A103Y3C6_CYNCS  1.4e-284  61.40     Uncharacterized protein OS=Cynara cardunculus var. scolymus GN=Ccrd_019973 PE=4 ...
K4B1K5_SOLLC      7.9e-280  60.87     Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1

Match Name                        E-value   Identity  Description
gi|449460189|ref|XP_004147828.1|  0.0e+00   89.46     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis sativu...
gi|659109350|ref|XP_008454670.1|  0.0e+00   88.80     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis melo]
gi|802640482|ref|XP_012078834.1|  6.9e-301  67.64     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Jatropha curca...
gi|147856457|emb|CAN80769.1|      6.6e-296  65.44     hypothetical protein VITISV_013866 [Vitis vinifera]
gi|731411247|ref|XP_010657905.1|  2.8e-294  65.05     PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Vitis vinifera...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                         Sequence type in Unigene
WMU06527      watermelon unigene v2 vs TrEMBL       transcribed_cluster
WMU38525      watermelon unigene v2 vs TrEMBL       transcribed_cluster
WMU68795      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla020714     Cla020714.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU38525      WMU38525     transcribed_cluster
WMU06527      WMU06527     transcribed_cluster
WMU68795      WMU68795     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 527..551  score: 0.42
    coord: 595..620  score: 0.49
    coord: 164..192  score: 2.8E-5
    coord: 134..159  score: 0.0033
    coord: 67..94  score:
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 320..351  score: 2.
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 222..268  score: 5.1E-12
    coord: 355..402  score: 1.3E-9
    coord: 456..504  score: 1.5
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 459..492  score: 7.0E-6
    coord: 224..257  score: 1.0E-9
    coord: 164..192  score: 0.0016
    coord: 193..224  score: 3.8E-7
    coord: 357..392  score: 3.9E-7
    coord: 326..357  score: 4.9E-7
    coord: 134..158  score: 8.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 129..163  score: 9.24
    coord: 457..491  score: 11.641
    coord: 191..221  score: 10.95
    coord: 257..291  score: 6.171
    coord: 492..526  score: 7.936
    coord: 292..322  score: 5.897
    coord: 164..190  score: 5.985
    coord: 324..358  score: 11.926
    coord: 64..98  score: 9.712
    coord: 426..456  score: 7.52
    coord: 391..425  score: 6.982
    coord: 222..256  score: 13.022
    coord: 359..385  score: 6.95
    coord: 589..623  score: 8
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 587..615  score: 5.9E-13
    coord: 322..400  score: 5.9E-13
    coord: 68..253  score: 5.9
IPR011990  Tetratricopeptide-like helical domain  unknown  SSF48452  TPR-like
    coord: 299..384  score: 8.35E-7
    coord: 559..614  score: 8.35E-7
    coord: 190..264  score: 8.3
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 118..630  score:
None  No IPR available  PANTHER  PTHR24015:SF911  SUBFAMILY NOT NAMED
    coord: 118..630  score:
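
Each entry in the Alignment column above pairs a residue range (`coord: start..end`) with a score. As a rough sketch, assuming only that textual pattern, the pairs can be pulled out of the table text with a regular expression. The names `PAIR_RE` and `parse_alignments` are hypothetical and not part of any InterPro tool. A missing score is returned as `None`, and a score that was cut off in the listing is parsed only as far as its visible digits.

```python
import re

# Hypothetical pattern: a "coord: a..b" range followed by an optional score.
# The score may be plain decimal or scientific notation (e.g. 2.8E-5).
PAIR_RE = re.compile(
    r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([0-9.]+(?:[eE][+-]?\d+)?)?"
)


def parse_alignments(text):
    """Return a list of (start, end, score) tuples; score is None if absent."""
    hits = []
    for start, end, score in PAIR_RE.findall(text):
        hits.append((int(start), int(end), float(score) if score else None))
    return hits


sample = (
    "coord: 527..551 score: 0.42\n"
    "coord: 164..192 score: 2.8E-5\n"
    "coord: 67..94 score:\n"
)
print(parse_alignments(sample))
# [(527, 551, 0.42), (164, 192, 2.8e-05), (67, 94, None)]
```

Because scores from different sources (PFAM E-values, PROFILE bit-like scores) are on different scales, the parsed values are only comparable within one source row.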

The following gene(s) are paralogous to this gene:

None