Cla002193 (gene) Watermelon (97103) v1

NameCla002193
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionPentatricopeptide repeat-containing protein (AHRD V1 ***- D7LLV1_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
LocationChr7 : 7106832 .. 7108457 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTCAAAGAAGCCCTCCAGTTCTTGGCTCACCTTCGACGAATCTCCCGCCTTCCCACCCCTTTCACCTGCAACAGGATTCTGCACTCTCTCATCAACTCCGGCTGTGGCGACCTCTCCGCCAAATTGTTTTTCCACTTGCTCTCCAAAGGGTATACTCCTCATTCATCTTCTTTCAATTCGATCATCTCCTTTTTCTGTAGATTAGGGAATGTGAAATTTGCCGAACGGATTTTGAATTCTATGCCCAGGTTTGGGTGCTCCCCTGATATTGTATCCTACAATTCTTTGTTAGATGGTTATTGTGGGAGTTATCAAATTCAGAAAGCTTGTTTTCTTGTGAATAGAGTTCGTGGGTGTGAGTTGATTAGGCCTGATTTGGTTATGTTTAATATACTGTTTAATGGGTTTGCTAAAGTTTATATGAAGAATGAGGCATTTGTATATCTGGGTTTGATGTGGAAATGCTGTTTGCCTAGTGTTGTTACTTATGGTACGTTTGTTGATATGTTCTGTAAGATGGGGGATATGGAGATGGGTAACAGAATGTTTTTAGATATGATGAAGGTTGGGGTTGTGCCTAATTTGGTTGTTTTTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAATTTGGATGTTGCGTTTGGATACTTTGAGAGAATGGAGGAATGTTCCGTTCAGCCAAATGAGTTCACATACTCGACATTGATTGATGGTTGCTGCAAGCATGGGATGTTGGGAAGAGCTGACTCTTTGTTTGAAAAGATGTTGAGTGCCGGTATTCTGCCTAACTGTACTGTTTACACTTCAATAATAGATGGTCATTTTAAGAAGGGAAATGTAGACAATGCGATAAAGTATATAAATGGGATGTTTTATCGAGAGATAAAACTAGATTTAGCAGCATATACGGTAGTTATCTCAGGCTTTCATAAAGTTGGTAGGTTGGATAAATCAATGGAAGCTGCAGCATATGTGGTGAAGAATGGATTACTTCCTGATAGGATTATATTGACAGCTATTATGGATGTGCATTTCAAAGCCGGAAACGTAAAAGAAGCTTTGAATGCATACAAAATATTACTTGCAAGGGGTTTCGAGCCTGATGTCGTGACTCTTTCCGCCCTAATGGATGGCCTATGCAAGCATGGGTATTTGCAGGAGGCTAGACAGTATTTGGATAAAGAAAAGGCCAATGAAATTCTATATACAGTGTTTATAGATGCACTCTGCAAGGAGGGTAATTTAGATGAAGCTGAGAGAACGATTAAGGAGATGTCTGAAGCAGGGTTTGTTCCAGATAAATATGTGTACACTTCTTGGATTGCTGAACTTTGCAAGCAAGGAAATTTGCTGAAGGCTTTCGTGGTCAAGAAAAGGATGGTTCAAGAGCACATTGAACCCGATTTATTAACTTATAGCTCCCTCATTGGTGGTTTGGCTGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATAAAGGAATCACTCCAGATTCTGTTTCTTATGACATCCTTATAAGAGGGTATCATAATCAGGGTAACGGAGCTGCGATTTCAGGTCTACATGACGAAATGAGAAAGAGAGGAATCACTATTGAAGATTAG

mRNA sequence

ATGGTCAAAGAAGCCCTCCAGTTCTTGGCTCACCTTCGACGAATCTCCCGCCTTCCCACCCCTTTCACCTGCAACAGGATTCTGCACTCTCTCATCAACTCCGGCTGTGGCGACCTCTCCGCCAAATTGTTTTTCCACTTGCTCTCCAAAGGGTATACTCCTCATTCATCTTCTTTCAATTCGATCATCTCCTTTTTCTGTAGATTAGGGAATGTGAAATTTGCCGAACGGATTTTGAATTCTATGCCCAGGTTTGGGTGCTCCCCTGATATTGTATCCTACAATTCTTTGTTAGATGGTTATTGTGGGAGTTATCAAATTCAGAAAGCTTGTTTTCTTGTGAATAGAGTTCGTGGGTGTGAGTTGATTAGGCCTGATTTGGTTATGTTTAATATACTGTTTAATGGGTTTGCTAAAGTTTATATGAAGAATGAGGCATTTGTATATCTGGGTTTGATGTGGAAATGCTGTTTGCCTAGTGTTGTTACTTATGGTACGTTTGTTGATATGTTCTGTAAGATGGGGGATATGGAGATGGGTAACAGAATGTTTTTAGATATGATGAAGGTTGGGGTTGTGCCTAATTTGGTTGTTTTTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAATTTGGATGTTGCGTTTGGATACTTTGAGAGAATGGAGGAATGTTCCGTTCAGCCAAATGAGTTCACATACTCGACATTGATTGATGGTTGCTGCAAGCATGGGATGTTGGGAAGAGCTGACTCTTTGTTTGAAAAGATGTTGAGTGCCGGTATTCTGCCTAACTGTACTGTTTACACTTCAATAATAGATGGTCATTTTAAGAAGGGAAATGTAGACAATGCGATAAAGTATATAAATGGGATGTTTTATCGAGAGATAAAACTAGATTTAGCAGCATATACGGTAGTTATCTCAGGCTTTCATAAAGTTGGTAGGTTGGATAAATCAATGGAAGCTGCAGCATATGTGGTGAAGAATGGATTACTTCCTGATAGGATTATATTGACAGCTATTATGGATGTGCATTTCAAAGCCGGAAACGTAAAAGAAGCTTTGAATGCATACAAAATATTACTTGCAAGGGGTTTCGAGCCTGATGTCGTGACTCTTTCCGCCCTAATGGATGGCCTATGCAAGCATGGGTATTTGCAGGAGGCTAGACAGTATTTGGATAAAGAAAAGGCCAATGAAATTCTATATACAGTGTTTATAGATGCACTCTGCAAGGAGGGTAATTTAGATGAAGCTGAGAGAACGATTAAGGAGATGTCTGAAGCAGGGTTTGTTCCAGATAAATATGTGTACACTTCTTGGATTGCTGAACTTTGCAAGCAAGGAAATTTGCTGAAGGCTTTCGTGGTCAAGAAAAGGATGGTTCAAGAGCACATTGAACCCGATTTATTAACTTATAGCTCCCTCATTGGTGGTTTGGCTGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATAAAGGAATCACTCCAGATTCTGTTTCTTATGACATCCTTATAAGAGGGTATCATAATCAGGGTAACGGAGCTGCGATTTCAGGTCTACATGACGAAATGAGAAAGAGAGGAATCACTATTGAAGATTAG

Coding sequence (CDS)

ATGGTCAAAGAAGCCCTCCAGTTCTTGGCTCACCTTCGACGAATCTCCCGCCTTCCCACCCCTTTCACCTGCAACAGGATTCTGCACTCTCTCATCAACTCCGGCTGTGGCGACCTCTCCGCCAAATTGTTTTTCCACTTGCTCTCCAAAGGGTATACTCCTCATTCATCTTCTTTCAATTCGATCATCTCCTTTTTCTGTAGATTAGGGAATGTGAAATTTGCCGAACGGATTTTGAATTCTATGCCCAGGTTTGGGTGCTCCCCTGATATTGTATCCTACAATTCTTTGTTAGATGGTTATTGTGGGAGTTATCAAATTCAGAAAGCTTGTTTTCTTGTGAATAGAGTTCGTGGGTGTGAGTTGATTAGGCCTGATTTGGTTATGTTTAATATACTGTTTAATGGGTTTGCTAAAGTTTATATGAAGAATGAGGCATTTGTATATCTGGGTTTGATGTGGAAATGCTGTTTGCCTAGTGTTGTTACTTATGGTACGTTTGTTGATATGTTCTGTAAGATGGGGGATATGGAGATGGGTAACAGAATGTTTTTAGATATGATGAAGGTTGGGGTTGTGCCTAATTTGGTTGTTTTTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAATTTGGATGTTGCGTTTGGATACTTTGAGAGAATGGAGGAATGTTCCGTTCAGCCAAATGAGTTCACATACTCGACATTGATTGATGGTTGCTGCAAGCATGGGATGTTGGGAAGAGCTGACTCTTTGTTTGAAAAGATGTTGAGTGCCGGTATTCTGCCTAACTGTACTGTTTACACTTCAATAATAGATGGTCATTTTAAGAAGGGAAATGTAGACAATGCGATAAAGTATATAAATGGGATGTTTTATCGAGAGATAAAACTAGATTTAGCAGCATATACGGTAGTTATCTCAGGCTTTCATAAAGTTGGTAGGTTGGATAAATCAATGGAAGCTGCAGCATATGTGGTGAAGAATGGATTACTTCCTGATAGGATTATATTGACAGCTATTATGGATGTGCATTTCAAAGCCGGAAACGTAAAAGAAGCTTTGAATGCATACAAAATATTACTTGCAAGGGGTTTCGAGCCTGATGTCGTGACTCTTTCCGCCCTAATGGATGGCCTATGCAAGCATGGGTATTTGCAGGAGGCTAGACAGTATTTGGATAAAGAAAAGGCCAATGAAATTCTATATACAGTGTTTATAGATGCACTCTGCAAGGAGGGTAATTTAGATGAAGCTGAGAGAACGATTAAGGAGATGTCTGAAGCAGGGTTTGTTCCAGATAAATATGTGTACACTTCTTGGATTGCTGAACTTTGCAAGCAAGGAAATTTGCTGAAGGCTTTCGTGGTCAAGAAAAGGATGGTTCAAGAGCACATTGAACCCGATTTATTAACTTATAGCTCCCTCATTGGTGGTTTGGCTGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATAAAGGAATCACTCCAGATTCTGTTTCTTATGACATCCTTATAAGAGGGTATCATAATCAGGGTAACGGAGCTGCGATTTCAGGTCTACATGACGAAATGAGAAAGAGAGGAATCACTATTGAAGATTAG

Protein sequence

MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIED
BLAST of Cla002193 vs. Swiss-Prot
Match: PP141_ARATH (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 5.4e-167
Identity = 283/539 (52.50%), Postives = 390/539 (72.36%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MV+EALQFL+ LR+ S LP PFTCN+ +H LINS CG LS K   +L+S+GYTPH SSFN
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
           S++SF C+LG VKFAE I++SMPRFGC PD++SYNSL+DG+C +  I+ A  ++  +R  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 121 E--LIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDME 180
              + +PD+V FN LFNGF+K+ M +E FVY+G+M KCC P+VVTY T++D FCK G+++
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKCCSPNVVTYSTWIDTFCKSGELQ 180

Query: 181 MGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLI 240
           +  + F  M +  + PN+V F+ LIDGYCKAG+L+VA   ++ M    +  N  TY+ LI
Sbjct: 181 LALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTALI 240

Query: 241 DGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIK 300
           DG CK G + RA+ ++ +M+   + PN  VYT+IIDG F++G+ DNA+K++  M  + ++
Sbjct: 241 DGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGMR 300

Query: 301 LDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNA 360
           LD+ AY V+ISG    G+L ++ E    + K+ L+PD +I T +M+ +FK+G +K A+N 
Sbjct: 301 LDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVNM 360

Query: 361 YKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLD 420
           Y  L+ RGFEPDVV LS ++DG+ K+G L EA  Y   EKAN+++YTV IDALCKEG+  
Sbjct: 361 YHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIEKANDVMYTVLIDALCKEGDFI 420

Query: 421 EAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLI 480
           E ER   ++SEAG VPDK++YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI
Sbjct: 421 EVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLLAYTTLI 480

Query: 481 GGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGI 538
            GLA KGLM+EA+QVFD+MLN GI+PDS  +D+LIR Y  +GN AA S L  +M++RG+
Sbjct: 481 YGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDMQRRGL 539

BLAST of Cla002193 vs. Swiss-Prot
Match: PP143_ARATH (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 291.2 bits (744), Expect = 2.3e-77
Identity = 170/528 (32.20%), Postives = 278/528 (52.65%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           M++EA+Q  + ++R    P   +CN +LH     G  D   + F  ++  G  P   ++N
Sbjct: 207 MLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYN 266

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKA-CFLVNRVRG 120
            +I   C+ G+V+ A  +   M   G  PD V+YNS++DG+    ++    CF       
Sbjct: 267 IMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDM 326

Query: 121 CELIRPDLVMFNILFNGFAKV-YMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDME 180
           C    PD++ +N L N F K   +      Y  +      P+VV+Y T VD FCK G M+
Sbjct: 327 C--CEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQ 386

Query: 181 MGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLI 240
              + ++DM +VG+VPN   ++SLID  CK GNL  AF     M +  V+ N  TY+ LI
Sbjct: 387 QAIKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALI 446

Query: 241 DGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIK 300
           DG C    +  A+ LF KM +AG++PN   Y ++I G  K  N+D A++ +N +  R IK
Sbjct: 447 DGLCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIK 506

Query: 301 LDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNA 360
            DL  Y   I G   + +++ +      + + G+  + +I T +MD +FK+GN  E L+ 
Sbjct: 507 PDLLLYGTFIWGLCSLEKIEAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHL 566

Query: 361 YKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDK------EKANEILYTVFIDALC 420
              +     E  VVT   L+DGLCK+  + +A  Y ++       +AN  ++T  ID LC
Sbjct: 567 LDEMKELDIEVTVVTFCVLIDGLCKNKLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLC 626

Query: 421 KEGNLDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLL 480
           K+  ++ A    ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL
Sbjct: 627 KDNQVEAATTLFEQMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLL 686

Query: 481 TYSSLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQG 521
            Y+SL+ GL+    + +A+   ++M+ +GI PD V    +++ ++  G
Sbjct: 687 AYTSLVWGLSHCNQLQKARSFLEEMIGEGIHPDEVLCISVLKKHYELG 732


HSP 2 Score: 78.6 bits (192), Expect = 2.4e-13
Identity = 51/214 (23.83%), Postives = 92/214 (42.99%), Query Frame = 1

Query: 329 KNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFEPDVVTLSALMDGLCKHGYLQ 388
           +N  +P   +  A+  V    G ++EA+  +  +      P   + + L+    K G   
Sbjct: 185 RNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTD 244

Query: 389 EARQYLDK-----EKANEILYTVFIDALCKEGNLDEAERTIKEMSEAGFVPDKYVYTSWI 448
           + +++         +     Y + ID +CKEG+++ A    +EM   G VPD   Y S I
Sbjct: 245 DVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTYNSMI 304

Query: 449 AELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKGIT 508
               K G L       + M     EPD++TY++LI    + G +    + + +M   G+ 
Sbjct: 305 DGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLK 364

Query: 509 PDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGI 538
           P+ VSY  L+  +  +G        + +MR+ G+
Sbjct: 365 PNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGL 398

BLAST of Cla002193 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 5.1e-64
Identity = 157/556 (28.24%), Postives = 268/556 (48.20%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCG-DLSAKLFFHLLSKGYTPHSSSF 60
           ++ +AL  +   +    +P   + N +L + I S      +  +F  +L    +P+  ++
Sbjct: 149 LIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTY 208

Query: 61  NSIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRG 120
           N +I  FC  GN+  A  + + M   GC P++V+YN+L+DGYC   +I    F + R   
Sbjct: 209 NILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDG-FKLLRSMA 268

Query: 121 CELIRPDLVMFNILFNGFAKV-YMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDME 180
            + + P+L+ +N++ NG  +   MK  +FV   +  +      VTY T +  +CK G+  
Sbjct: 269 LKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFH 328

Query: 181 MGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLI 240
               M  +M++ G+ P+++ ++SLI   CKAGN++ A  + ++M    + PNE TY+TL+
Sbjct: 329 QALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLV 388

Query: 241 DGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIK 300
           DG  + G +  A  +  +M   G  P+   Y ++I+GH   G +++AI  +  M  + + 
Sbjct: 389 DGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLS 448

Query: 301 LDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNA 360
            D+ +Y+ V+SGF +   +D+++     +V+ G+ PD I  ++++    +    KEA + 
Sbjct: 449 PDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDL 508

Query: 361 YKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLD 420
           Y+ +L  G  PD                              E  YT  I+A C EG+L+
Sbjct: 509 YEEMLRVGLPPD------------------------------EFTYTALINAYCMEGDLE 568

Query: 421 EAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYS--- 480
           +A +   EM E G +PD   Y+  I  L KQ    +A  +  ++  E   P  +TY    
Sbjct: 569 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLI 628

Query: 481 ------------SLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGA 540
                       SLI G   KG+M EA QVF+ ML K   PD  +Y+I+I G+   G+  
Sbjct: 629 ENCSNIEFKSVVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIR 673


HSP 2 Score: 221.5 bits (563), Expect = 2.3e-56
Identity = 145/542 (26.75%), Postives = 268/542 (49.45%), Query Frame = 1

Query: 19  PTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISFFCRLGNVKFAERI 78
           P  FT N ++     +G  D++  LF  + +KG  P+  ++N++I  +C+L  +    ++
Sbjct: 203 PNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKL 262

Query: 79  LNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRV--RGCELIRPDLVMFNILFNG 138
           L SM   G  P+++SYN +++G C   ++++  F++  +  RG  L   D V +N L  G
Sbjct: 263 LRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSL---DEVTYNTLIKG 322

Query: 139 FAKVYMKNEAFVYLGLMWKCCL-PSVVTYGTFVDMFCKMGDMEMGNRMFLDMMKV-GVVP 198
           + K    ++A V    M +  L PSV+TY + +   CK G+M      FLD M+V G+ P
Sbjct: 323 YCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAME-FLDQMRVRGLCP 382

Query: 199 NLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHGMLGRADSLF 258
           N   +++L+DG+ + G ++ A+     M +    P+  TY+ LI+G C  G +  A ++ 
Sbjct: 383 NERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVL 442

Query: 259 EKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYTVVISGFHKV 318
           E M   G+ P+   Y++++ G  +  +VD A++    M  + IK D   Y+ +I GF + 
Sbjct: 443 EDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQ 502

Query: 319 GRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFEPDVVTL 378
            R  ++ +    +++ GL PD    TA+++ +   G++++AL  +  ++ +G  PDVVT 
Sbjct: 503 RRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTY 562

Query: 379 SALMDGLCKHGYLQEARQYL-----DKEKANEILYTVFID---------------ALCKE 438
           S L++GL K    +EA++ L     ++   +++ Y   I+                 C +
Sbjct: 563 SVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVSLIKGFCMK 622

Query: 439 GNLDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTY 498
           G + EA++  + M      PD   Y   I   C+ G++ KA+ + K MV+       +T 
Sbjct: 623 GMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTV 682

Query: 499 SSLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRK 537
            +L+  L ++G + E   V   +L      ++    +L+   H +GN   +  +  EM K
Sbjct: 683 IALVKALHKEGKVNELNSVIVHVLRSCELSEAEQAKVLVEINHREGNMDVVLDVLAEMAK 740

BLAST of Cla002193 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 2.5e-63
Identity = 153/526 (29.09%), Postives = 263/526 (50.00%), Query Frame = 1

Query: 24  CNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISFFCRLGNVKFAERILNSMP 83
           CN ++ SL+  G  +L+  ++  +   G   +  + N +++  C+ G ++     L+ + 
Sbjct: 203 CNALIGSLVRIGWVELAWGVYQEISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQ 262

Query: 84  RFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCELIRPDLVMFNILFNGFAKVYMK 143
             G  PDIV+YN+L+  Y     +++A  L+N + G +   P +  +N + NG  K    
Sbjct: 263 EKGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPG-KGFSPGVYTYNTVINGLCKHGKY 322

Query: 144 NEAFVYLGLMWKCCL-PSVVTYGTFVDMFCKMGDMEMGNRMFLDMMKVGVVPNLVVFSSL 203
             A      M +  L P   TY + +   CK GD+    ++F DM    VVP+LV FSS+
Sbjct: 323 ERAKEVFAEMLRSGLSPDSTTYRSLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSM 382

Query: 204 IDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHGMLGRADSLFEKMLSAGI 263
           +  + ++GNLD A  YF  ++E  + P+   Y+ LI G C+ GM+  A +L  +ML  G 
Sbjct: 383 MSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGC 442

Query: 264 LPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYTVVISGFHKVGRLDKSME 323
             +   Y +I+ G  K+  +  A K  N M  R +  D    T++I G  K+G L  +ME
Sbjct: 443 AMDVVTYNTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAME 502

Query: 324 AAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFEPDVVTLSALMDGLC 383
               + +  +  D +    ++D   K G++  A   +  ++++   P  ++ S L++ LC
Sbjct: 503 LFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALC 562

Query: 384 KHGYLQEARQYLDKEKANEILYTV-----FIDALCKEGNLDEAERTIKEMSEAGFVPDKY 443
             G+L EA +  D+  +  I  TV      I   C+ GN  + E  +++M   GFVPD  
Sbjct: 563 SKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCI 622

Query: 444 VYTSWIAELCKQGNLLKAFVVKKRMVQEH--IEPDLLTYSSLIGGLAEKGLMIEAKQVFD 503
            Y + I    ++ N+ KAF + K+M +E   + PD+ TY+S++ G   +  M EA+ V  
Sbjct: 623 SYNTLIYGFVREENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLR 682

Query: 504 DMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIED 542
            M+ +G+ PD  +Y  +I G+ +Q N      +HDEM +RG + +D
Sbjct: 683 KMIERGVNPDRSTYTCMINGFVSQDNLTEAFRIHDEMLQRGFSPDD 727


HSP 2 Score: 124.4 bits (311), Expect = 3.8e-27
Identity = 94/463 (20.30%), Postives = 202/463 (43.63%), Query Frame = 1

Query: 104 SYQIQKACFLV----NRVRGCELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLP 163
           S+ ++K CF +    N VR   +    L +  +L+     + +       LG  +     
Sbjct: 52  SFLVEKICFSLKQGNNNVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPNFKH 111

Query: 164 SVVTYGTFVDMFCKMGDMEMGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYF 223
           + ++    + +  + G +       L M++   V  L + +SL   +   G+ D  F   
Sbjct: 112 TSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVFDLL 171

Query: 224 ERMEECSVQPNE----FT------YSTLIDGC-------CKHGMLGRADSLFEKMLSAGI 283
            R    + +  E    FT      ++  ID C        + G +  A  +++++  +G+
Sbjct: 172 IRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISRSGV 231

Query: 284 LPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYTVVISGFHKVGRLDKSME 343
             N      +++   K G ++    +++ +  + +  D+  Y  +IS +   G ++++ E
Sbjct: 232 GINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAFE 291

Query: 344 AAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFEPDVVTLSALMDGLC 403
               +   G  P       +++   K G  + A   +  +L  G  PD  T  +L+   C
Sbjct: 292 LMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLLMEAC 351

Query: 404 KHGYLQEARQYLDKEKANEIL-----YTVFIDALCKEGNLDEAERTIKEMSEAGFVPDKY 463
           K G + E  +     ++ +++     ++  +    + GNLD+A      + EAG +PD  
Sbjct: 352 KKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDNV 411

Query: 464 VYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDM 523
           +YT  I   C++G +  A  ++  M+Q+    D++TY++++ GL ++ ++ EA ++F++M
Sbjct: 412 IYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKLFNEM 471

Query: 524 LNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIE 541
             + + PDS +  ILI G+   GN      L  +M+++ I ++
Sbjct: 472 TERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLD 514

BLAST of Cla002193 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 237.3 bits (604), Expect = 4.0e-61
Identity = 152/543 (27.99%), Postives = 269/543 (49.54%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFF-HLLSKGYTPHSSSF 60
           M++++L+    +      P+ +TCN IL S++ SG  D+S   F   +L +   P  ++F
Sbjct: 138 MIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSG-EDVSVWSFLKEMLKRKICPDVATF 197

Query: 61  NSIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRG 120
           N +I+  C  G+ + +  ++  M + G +P IV+YN++L  YC   + + A  L++ ++ 
Sbjct: 198 NILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKS 257

Query: 121 CELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCL-PSVVTYGTFVDMFCKMGDME 180
            + +  D+  +N+L +   +     + ++ L  M K  + P+ VTY T ++ F   G + 
Sbjct: 258 -KGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVL 317

Query: 181 MGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLI 240
           + +++  +M+  G+ PN V F++LIDG+   GN   A   F  ME   + P+E +Y  L+
Sbjct: 318 IASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLL 377

Query: 241 DGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIK 300
           DG CK+     A   + +M   G+      YT +IDG  K G +D A+  +N M    I 
Sbjct: 378 DGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGID 437

Query: 301 LDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNA 360
            D+  Y+ +I+GF KVGR   + E    + + GL P+ II + ++    + G +KEA+  
Sbjct: 438 PDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRI 497

Query: 361 YKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYL-----DKEKANEILYTVFIDALCK 420
           Y+ ++  G   D  T + L+  LCK G + EA +++     D    N + +   I+    
Sbjct: 498 YEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGN 557

Query: 421 EGNLDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLT 480
            G   +A     EM++ G  P  + Y S +  LCK G+L +A    K +       D + 
Sbjct: 558 SGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVM 617

Query: 481 YSSLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMR 537
           Y++L+  + + G + +A  +F +M+ + I PDS +Y  LI G   +G          E  
Sbjct: 618 YNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAE 677


HSP 2 Score: 224.9 bits (572), Expect = 2.1e-57
Identity = 143/521 (27.45%), Postives = 248/521 (47.60%), Query Frame = 1

Query: 19  PTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISFFCRLGNVKFAERI 78
           PT  T N +LH     G    + +L  H+ SKG      ++N +I   CR   +     +
Sbjct: 226 PTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLL 285

Query: 79  LNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCELIRPDLVMFNILFNGFA 138
           L  M +    P+ V+YN+L++G+    ++  A  L+N +    L  P+ V FN L +G  
Sbjct: 286 LRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGL-SPNHVTFNALIDGHI 345

Query: 139 KVYMKNEAFVYLGLMW-KCCLPSVVTYGTFVDMFCKMGDMEMGNRMFLDMMKVGVVPNLV 198
                 EA     +M  K   PS V+YG  +D  CK  + ++    ++ M + GV    +
Sbjct: 346 SEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRI 405

Query: 199 VFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHGMLGRADSLFEKM 258
            ++ +IDG CK G LD A      M +  + P+  TYS LI+G CK G    A  +  ++
Sbjct: 406 TYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRI 465

Query: 259 LSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYTVVISGFHKVGRL 318
              G+ PN  +Y+++I    + G +  AI+    M       D   + V+++   K G++
Sbjct: 466 YRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKV 525

Query: 319 DKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFEPDVVTLSAL 378
            ++ E    +  +G+LP+ +    +++ +  +G   +A + +  +   G  P   T  +L
Sbjct: 526 AEAEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSL 585

Query: 379 MDGLCKHGYLQEARQYLDKEKA-----NEILYTVFIDALCKEGNLDEAERTIKEMSEAGF 438
           + GLCK G+L+EA ++L    A     + ++Y   + A+CK GNL +A     EM +   
Sbjct: 586 LKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSI 645

Query: 439 VPDKYVYTSWIAELCKQGNLLKAFV-VKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAK 498
           +PD Y YTS I+ LC++G  + A +  K+   + ++ P+ + Y+  + G+ + G      
Sbjct: 646 LPDSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGI 705

Query: 499 QVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEM 533
              + M N G TPD V+ + +I GY   G     + L  EM
Sbjct: 706 YFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEM 745


HSP 3 Score: 177.9 bits (450), Expect = 2.9e-43
Identity = 131/519 (25.24%), Postives = 238/519 (45.86%), Query Frame = 1

Query: 9   LAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISFFCR 68
           L  +R+    P   T N +++   N G   ++++L   +LS G +P+  +FN++I     
Sbjct: 286 LRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHIS 345

Query: 69  LGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYC--GSYQIQKACFLVNRVRGCELIRPD 128
            GN K A ++   M   G +P  VSY  LLDG C    + + +  ++  +  G  + R  
Sbjct: 346 EGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGR-- 405

Query: 129 LVMFNILFNGFAKVYMKNEAFVYLGLMWKCCL-PSVVTYGTFVDMFCKMGDMEMGNRMFL 188
            + +  + +G  K    +EA V L  M K  + P +VTY   ++ FCK+G  +    +  
Sbjct: 406 -ITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVC 465

Query: 189 DMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHG 248
            + +VG+ PN +++S+LI   C+ G L  A   +E M       + FT++ L+   CK G
Sbjct: 466 RIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAG 525

Query: 249 MLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYT 308
            +  A+     M S GILPN   +  +I+G+   G    A    + M           Y 
Sbjct: 526 KVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYG 585

Query: 309 VVISGFHKVGRLDKSMEAAAYVVKNGLLP---DRIILTAIMDVHFKAGNVKEALNAYKIL 368
            ++ G  K G L    EA  ++     +P   D ++   ++    K+GN+ +A++ +  +
Sbjct: 586 SLLKGLCKGGHL---REAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEM 645

Query: 369 LARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKA------NEILYTVFIDALCKEGN 428
           + R   PD  T ++L+ GLC+ G    A  +  + +A      N+++YT F+D + K G 
Sbjct: 646 VQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQ 705

Query: 429 LDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSS 488
                   ++M   G  PD     + I    + G + K   +   M  ++  P+L TY+ 
Sbjct: 706 WKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNI 765

Query: 489 LIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRG 516
           L+ G +++  +  +  ++  ++  GI PD ++   L+ G
Sbjct: 766 LLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLG 798


HSP 4 Score: 175.6 bits (444), Expect = 1.4e-42
Identity = 128/544 (23.53%), Postives = 233/544 (42.83%), Query Frame = 1

Query: 3   KEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSI 62
           KEAL+    +      P+  +   +L  L  +   DL+   +  +   G      ++  +
Sbjct: 350 KEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGM 409

Query: 63  ISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCEL 122
           I   C+ G +  A  +LN M + G  PDIV+Y++L++G+C   + + A  +V R+    L
Sbjct: 410 IDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGL 469

Query: 123 IRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMGNR 182
               ++   +++N      +K    +Y  ++ +       T+   V   CK G +     
Sbjct: 470 SPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEE 529

Query: 183 MFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCC 242
               M   G++PN V F  LI+GY  +G    AF  F+ M +    P  FTY +L+ G C
Sbjct: 530 FMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLC 589

Query: 243 KHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLA 302
           K G L  A+   + + +     +  +Y +++    K GN+  A+     M  R I  D  
Sbjct: 590 KGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSY 649

Query: 303 AYTVVISGFHKVGR-LDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKI 362
            YT +ISG  + G+ +   + A     +  +LP++++ T  +D  FKAG  K  +   + 
Sbjct: 650 TYTSLISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQ 709

Query: 363 LLARGFEPDVVTLSALMDGLCKHGYLQEARQYL-----DKEKANEILYTVFIDALCKEGN 422
           +   G  PD+VT +A++DG  + G +++    L          N   Y + +    K  +
Sbjct: 710 MDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKD 769

Query: 423 LDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSS 482
           +  +    + +   G +PDK    S +  +C+   L     + K  +   +E D  T++ 
Sbjct: 770 VSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGVEVDRYTFNM 829

Query: 483 LIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRG 541
           LI      G +  A  +   M + GI+ D  + D ++   +          +  EM K+G
Sbjct: 830 LISKCCANGEINWAFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESRMVLHEMSKQG 889


HSP 5 Score: 163.7 bits (413), Expect = 5.6e-39
Identity = 126/524 (24.05%), Postives = 242/524 (46.18%), Query Frame = 1

Query: 2    VKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNS 61
            V EA +F+  +     LP   + + +++   NSG G  +  +F  +   G+ P   ++ S
Sbjct: 524  VAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGS 583

Query: 62   IISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCE 121
            ++   C+ G+++ AE+ L S+     + D V YN+LL   C S  + KA  L   +    
Sbjct: 584  LLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRS 643

Query: 122  LIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKC-----CLPSVVTYGTFVDMFCKMGD 181
            ++ PD   +  L +G  +   K +  + +    +       LP+ V Y  FVD   K G 
Sbjct: 644  IL-PDSYTYTSLISGLCR---KGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQ 703

Query: 182  MEMGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYST 241
             + G      M  +G  P++V  +++IDGY + G ++        M   +  PN  TY+ 
Sbjct: 704  WKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNI 763

Query: 242  LIDGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYRE 301
            L+ G  K   +  +  L+  ++  GILP+     S++ G  +   ++  +K +     R 
Sbjct: 764  LLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRG 823

Query: 302  IKLDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEAL 361
            +++D   + ++IS     G ++ + +    +   G+  D+    A++ V  +    +E+ 
Sbjct: 824  VEVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESR 883

Query: 362  NAYKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEI-----LYTVFIDAL 421
                 +  +G  P+      L++GLC+ G ++ A    ++  A++I       +  + AL
Sbjct: 884  MVLHEMSKQGISPESRKYIGLINGLCRVGDIKTAFVVKEEMIAHKICPPNVAESAMVRAL 943

Query: 422  CKEGNLDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDL 481
             K G  DEA   ++ M +   VP    +T+ +   CK GN+++A  ++  M    ++ DL
Sbjct: 944  AKCGKADEATLLLRFMLKMKLVPTIASFTTLMHLCCKNGNVIEALELRVVMSNCGLKLDL 1003

Query: 482  LTYSSLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRG 516
            ++Y+ LI GL  KG M  A +++++M   G   ++ +Y  LIRG
Sbjct: 1004 VSYNVLITGLCAKGDMALAFELYEEMKGDGFLANATTYKALIRG 1043


HSP 6 Score: 87.4 bits (215), Expect = 5.1e-16
Identity = 62/301 (20.60%), Postives = 132/301 (43.85%), Query Frame = 1

Query: 246 MLGRADSLFEKMLSAGILPNC--TVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAA 305
           M G++  +F  +++   L N   +VY  +I  + ++G + ++++    M        +  
Sbjct: 101 MSGKSSFVFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYT 160

Query: 306 YTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILL 365
              ++    K G           ++K  + PD      +++V    G+ +++    + + 
Sbjct: 161 CNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKME 220

Query: 366 ARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEI-----LYTVFIDALCKEGNLD 425
             G+ P +VT + ++   CK G  + A + LD  K+  +      Y + I  LC+   + 
Sbjct: 221 KSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIA 280

Query: 426 EAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLI 485
           +    +++M +    P++  Y + I     +G +L A  +   M+   + P+ +T+++LI
Sbjct: 281 KGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALI 340

Query: 486 GGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGIT 540
            G   +G   EA ++F  M  KG+TP  VSY +L+ G           G +  M++ G+ 
Sbjct: 341 DGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVC 400

BLAST of Cla002193 vs. TrEMBL
Match: W9R0T6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005539 PE=4 SV=1)

HSP 1 Score: 711.8 bits (1836), Expect = 6.2e-202
Identity = 337/539 (62.52%), Postives = 424/539 (78.66%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MV+E LQF AHLRR SR PTPFT N++LH L ++ CGDLS K+  H L+K Y PH SSFN
Sbjct: 1   MVRETLQFFAHLRRTSRFPTPFTFNKLLHHLTSANCGDLSLKILSHFLTKRYVPHPSSFN 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
           S++SF C+ G ++FA  +++SMP+FG SPD+V+YN L+DG+C +  +++ACF+V+++R  
Sbjct: 61  SVLSFLCKSGQLRFARNVVDSMPKFGFSPDVVTYNCLVDGFCKNLDVEEACFVVSKMRMG 120

Query: 121 ELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMG 180
           +   PDLV FN LFNGF+K  M+ EAFVY+GLMWKCCLP+VVTY TFVDMFCK+G+ ++G
Sbjct: 121 KC-GPDLVTFNTLFNGFSKTRMEREAFVYMGLMWKCCLPNVVTYSTFVDMFCKVGNFDLG 180

Query: 181 NRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDG 240
            ++F DM+  GV+PN VVF++L+DGYCKAGNLD+AF  F  M+  SV PN  TY+ L+DG
Sbjct: 181 YKVFRDMVNAGVLPNSVVFTALLDGYCKAGNLDIAFELFVEMKRSSVSPNVVTYAALVDG 240

Query: 241 CCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLD 300
            CK G L RA+SLF KML  G+ PN  VYTSIIDGHF KGNVD+A+KY+  M  + ++LD
Sbjct: 241 FCKRGALERAESLFSKMLEDGVEPNSVVYTSIIDGHFVKGNVDDAVKYMTKMCDQGLRLD 300

Query: 301 LAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYK 360
           + AY VVI GF K GRLDK+ME    + ++GL PD+I+LT +MD HFK+G++K AL  Y+
Sbjct: 301 MTAYEVVIRGFCKNGRLDKAMEVMRSMTESGLFPDKIMLTTVMDAHFKSGDLKRALEVYR 360

Query: 361 ILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEA 420
            +L RGFEPD+VTLS++MDGL K G+LQEAR YL +EKANEI YTV ID +CKEG+  E 
Sbjct: 361 EILFRGFEPDIVTLSSIMDGLSKKGHLQEARGYLCREKANEISYTVLIDGMCKEGHFGEV 420

Query: 421 ERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGG 480
           E   +EMSEAGFVPDKY YTSWIA LCKQG L++AFV+K RM QE IEPDLLTYSSLI G
Sbjct: 421 EMVFREMSEAGFVPDKYAYTSWIAGLCKQGKLVEAFVLKNRMAQEGIEPDLLTYSSLIFG 480

Query: 481 LAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITI 540
           LA KGLM+EAKQVFDDML +GI+PDS  YDILIRGY  +GN  A+SG+HDEMRKRGI +
Sbjct: 481 LANKGLMVEAKQVFDDMLKRGISPDSAVYDILIRGYLKEGNEVAVSGMHDEMRKRGIKV 538

BLAST of Cla002193 vs. TrEMBL
Match: A0A0D2Q7W0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G171900 PE=4 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 1.3e-175
Identity = 303/533 (56.85%), Postives = 389/533 (72.98%), Query Frame = 1

Query: 6   LQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISF 65
           LQF + L++ S+ P P++ N++LH L  S CG LS KL    LSKGYTPH SSFNS ISF
Sbjct: 40  LQFFSDLKKTSKYPDPYSFNKLLHKLTASDCGALSLKLLSFFLSKGYTPHPSSFNSTISF 99

Query: 66  FCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCELIRP 125
           FC+LG   +A++++N MP +GC PDI +YNSL+DGY    ++ KAC +VN +R  +  +P
Sbjct: 100 FCKLGQSSYAQKLVNLMPLYGCEPDIATYNSLIDGYFKCGEVVKACLIVNEIR-VDKCKP 159

Query: 126 DLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMGNRMFL 185
           DLV FN+LFNGF K+  K EAFVY+GLMWKCCLP+VVTY T++DMFCK+GD+ MG ++F 
Sbjct: 160 DLVTFNVLFNGFCKMRKKKEAFVYMGLMWKCCLPNVVTYSTWIDMFCKVGDLNMGVKVFR 219

Query: 186 DMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHG 245
           DM K  V+ N +VF+ LIDGYCK G+ + AF   + M+   +  N  TY+ LIDG CK G
Sbjct: 220 DMKKDKVLLNSIVFTCLIDGYCKVGDFEAAFELCKEMKLVKLAVNVVTYTALIDGLCKKG 279

Query: 246 MLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYT 305
           ML RA+ LF +ML   + PN  VYTSIID HFKK NV +A+KY+  M+ + ++ D+AAY 
Sbjct: 280 MLERAECLFFRMLKDKVKPNSVVYTSIIDAHFKKSNVTDALKYLGKMYVQGLEFDMAAYG 339

Query: 306 VVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLAR 365
           V+I+G    G  DK+      +VK+GL PD+++LT IMD HFKAGNVK ALN Y  +LAR
Sbjct: 340 VIIAGLCNTGMFDKASIYMENMVKSGLRPDKLMLTTIMDAHFKAGNVKAALNVYGEILAR 399

Query: 366 GFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEAERTIK 425
           GF+PDV+ L++LMDGLCKHG L EA  Y  + KAN+I YTV I+ L K+G+  E  R  +
Sbjct: 400 GFDPDVIVLTSLMDGLCKHGCLNEAESYFCRGKANKISYTVLINGLAKKGDFTELNRVFR 459

Query: 426 EMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGGLAEKG 485
           EM EAGF  DKYVYTSWIA LC+QGNL++AF VK RMVQE  +PDLLTYSSLI GLA KG
Sbjct: 460 EMLEAGFTADKYVYTSWIAGLCEQGNLIEAFRVKNRMVQEGFQPDLLTYSSLIFGLANKG 519

Query: 486 LMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGIT 539
           LMIEAKQ+F+DML + ITPD+  Y+I+IRGY  Q N AA++GL +EM KRG +
Sbjct: 520 LMIEAKQIFEDMLKRQITPDAAVYEIMIRGYLRQDNEAAVTGLLEEMEKRGFS 571

BLAST of Cla002193 vs. TrEMBL
Match: A0A061EFM6_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_011056 PE=4 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 4.9e-175
Identity = 307/530 (57.92%), Postives = 384/530 (72.45%), Query Frame = 1

Query: 6   LQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFNSIISF 65
           LQF + L++ S+ P PF  N++LH L  S CG LS KL    LSKGYTPH SSFNS ISF
Sbjct: 43  LQFCSQLKKTSKYPDPFFFNKLLHRLTASNCGTLSLKLLSFFLSKGYTPHPSSFNSSISF 102

Query: 66  FCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGCELIRP 125
            C+LG   +A++++NSMP +GC PDI +YNSL+DGY     + KAC + + +R  +  +P
Sbjct: 103 LCKLGRSDYAQKLVNSMPFYGCEPDIATYNSLIDGYFKCGDVVKACLVFDDIRAGKC-KP 162

Query: 126 DLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMGNRMFL 185
           DLV FN LFNGF K+    E FVY+G MWKCCLP+V+TY T++DMFCK+GD++MG ++F 
Sbjct: 163 DLVTFNALFNGFCKMRRNKEVFVYMGYMWKCCLPNVITYSTWIDMFCKLGDLKMGFKVFR 222

Query: 186 DMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHG 245
           DM K GV  N +VF+ LIDG CK G+ ++AF  +  M++  +  N  TY+ LIDG CK G
Sbjct: 223 DMKKDGVSLNSIVFTCLIDGCCKVGDFELAFELYWEMKQTKLALNVVTYTALIDGLCKKG 282

Query: 246 MLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYT 305
           ML RA+ LF +ML   + PN  VYTSIIDGHFKK NV +A+KY+  M  + IK D+A Y 
Sbjct: 283 MLERAECLFLRMLKDKVQPNSVVYTSIIDGHFKKRNVSDALKYLAKMCVQGIKFDMALYG 342

Query: 306 VVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLAR 365
           V+ISG    GR DK+ +    +VK+GLLPD+++LT +MD HFKAGNVK AL+ Y  LLAR
Sbjct: 343 VIISGLSNCGRFDKASKFMENMVKSGLLPDKLLLTTMMDAHFKAGNVKAALDVYGELLAR 402

Query: 366 GFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEAERTIK 425
           GF+PDVV LS+LMDGLCK G L EA  Y  +EKANEI YTV ID L K+G+  E  R  +
Sbjct: 403 GFDPDVVVLSSLMDGLCKRGCLHEAESYFCREKANEISYTVLIDGLAKKGDFTEVNRVFR 462

Query: 426 EMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGGLAEKG 485
           EM EAGF PDKYVYTSWIA LC+QGNL++AF +K RMVQE  +PDLLTYSSLI GLA KG
Sbjct: 463 EMLEAGFTPDKYVYTSWIAGLCEQGNLIEAFRLKNRMVQEGFQPDLLTYSSLIFGLANKG 522

Query: 486 LMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKR 536
           LMIEAKQ+F DML + ITPD+  YDI+IRGY  Q N AA+S L +EMRKR
Sbjct: 523 LMIEAKQIFQDMLKRKITPDAAVYDIMIRGYLQQNNEAAVSELLEEMRKR 571

BLAST of Cla002193 vs. TrEMBL
Match: V4KT02_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10003932mg PE=4 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 7.6e-168
Identity = 290/539 (53.80%), Postives = 391/539 (72.54%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MV+EALQF++ LR+ S LP P TCN+ +H LINS CG LS K   +LLS+GYTPH SSFN
Sbjct: 1   MVREALQFISRLRKSSNLPDPITCNKYIHQLINSNCGVLSLKFLAYLLSRGYTPHRSSFN 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
           S+ SF C+LG VKFAE I++SMPRFGC PD+VSYNSL+DG+C + +I+ A  ++ R+R  
Sbjct: 61  SVASFVCKLGQVKFAEYIVHSMPRFGCLPDVVSYNSLIDGHCRNGEIRSASLVLKRLRAS 120

Query: 121 E--LIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDME 180
              + RPD+V FN LFNGF+K+ M  E FVY+G+M KCC P+VVTY T++D FCK G+++
Sbjct: 121 HGFMCRPDIVSFNSLFNGFSKMKMLKEVFVYMGVMLKCCSPNVVTYSTWIDTFCKSGELQ 180

Query: 181 MGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLI 240
           +  + F  M K  + PN+V F+ LIDGYCKAG+L+VA   +E M    +  N  TY+ L+
Sbjct: 181 LALKSFNCMKKDALSPNVVTFTCLIDGYCKAGDLEVAVSLYEDMRRVQMSLNVVTYTALL 240

Query: 241 DGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIK 300
           DG CK G + RA+ L+ +M    + PN  VYT+IIDG+F KG+ DNA+K++  M  + ++
Sbjct: 241 DGFCKRGEMERAEGLYSRMHEDKVEPNSLVYTTIIDGYFHKGDADNAMKFLAKMLNQGMR 300

Query: 301 LDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNA 360
           LD+AAY V+ISG    G+L ++ E    + K GL+PD++ILT +MD +FK+G +K ALN 
Sbjct: 301 LDIAAYGVIISGLCGNGKLKEATEVVEDMEKGGLVPDKMILTTMMDAYFKSGLMKAALNV 360

Query: 361 YKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLD 420
           Y+  + RGFEPDVV L+ L+DGL K+G L EA  Y  KEKAN+++YTV IDALCKEG+  
Sbjct: 361 YREFIERGFEPDVVALTTLIDGLAKNGQLHEAIAYFCKEKANDVMYTVLIDALCKEGDFI 420

Query: 421 EAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLI 480
           E ER   ++ EAG VPDK++YTSWIA LCKQGNL+ AF +K +MVQE ++ DLLTY++LI
Sbjct: 421 EVERFFSKILEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTKMVQEGLKLDLLTYTTLI 480

Query: 481 GGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGI 538
            GLA KGLM+EA+QVFD+ML  G +PDS  +D+LIR Y  +GN  A S L  +M+ RG+
Sbjct: 481 NGLASKGLMVEARQVFDEMLRSGTSPDSAVFDLLIRAYEKEGNMTAASDLFLDMQTRGL 539

BLAST of Cla002193 vs. TrEMBL
Match: A0A068V733_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00018059001 PE=4 SV=1)

HSP 1 Score: 584.7 bits (1506), Expect = 1.1e-163
Identity = 294/546 (53.85%), Postives = 381/546 (69.78%), Query Frame = 1

Query: 1   MVKEALQFLAH-LRRIS---RLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHS 60
           M KEAL+F A+ L+R     + P     N+ LHSL  S CGDLS KL    +SKGY PH 
Sbjct: 1   MTKEALKFFAYGLQRFKASRKFPDRHDFNKSLHSLTRSNCGDLSLKLLVAFISKGYFPHV 60

Query: 61  SSFNSIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNR 120
           S+FNS+IS+FC LG    A++++N MP+ GC PDIVSYNSL+DGY  +  +  ACF++ +
Sbjct: 61  SAFNSVISYFCNLGFPGSAQKLINLMPKLGCLPDIVSYNSLIDGYLRNADVGDACFMMRK 120

Query: 121 V-RGCELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMG 180
           V  G   +RPDLV FN +FNGF KV       VY+ LMWK C+P+VVTYG  VDM+CKM 
Sbjct: 121 VCSGLMNVRPDLVTFNAMFNGFCKVGKVEGLLVYMSLMWKVCVPNVVTYGILVDMYCKMN 180

Query: 181 DMEMGNRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYS 240
           ++ M  R+F DM   GV PNL +F+SLIDGYCKAG L+VA G +  M   SV PN  TYS
Sbjct: 181 NVNMAYRVFKDMKSSGVFPNLQIFTSLIDGYCKAGELEVALGLYLDMHRNSVFPNVVTYS 240

Query: 241 TLIDGCCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYR 300
            LIDG CK GML +A  LF +ML  G+ PN  VYTS+IDG FKK N+DNA+KY++ M  +
Sbjct: 241 ALIDGFCKRGMLEKAAYLFSRMLEDGVEPNIVVYTSMIDGEFKKKNIDNALKYLSRMHDQ 300

Query: 301 EIKLDLAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEA 360
            I+ D+ AY  ++SG   + RLD ++E    ++  G+ PD +I   +M  +F+AGNV+ A
Sbjct: 301 GIRHDVTAYGAIVSGLCNMNRLDSAVEVKKAMMDYGVAPDGVIFATLMHAYFEAGNVEAA 360

Query: 361 LNAYKILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEG 420
           +N Y+  LARGF+PDVV L++L+DGLCK+G L EA+ +  KE A+EI Y V I  +CKEG
Sbjct: 361 MNEYEESLARGFQPDVVALTSLIDGLCKNGRLYEAKLHFSKENADEISYNVLIHGMCKEG 420

Query: 421 NLDEAERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYS 480
            L + E   +EM++AGFVP+KY YT+WIA LCKQGNL++AF ++K+M++E I P+L  YS
Sbjct: 421 ELSQVEMVYREMTDAGFVPNKYFYTTWIAGLCKQGNLVEAFRLQKQMIKEGIPPNLFAYS 480

Query: 481 SLIGGLAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKR 540
           SLI GL  KG MIEAKQVFDDML KGI+P+ V YDILIRGY  +GN  AIS L DEMR R
Sbjct: 481 SLIFGLTNKGFMIEAKQVFDDMLKKGISPNHVVYDILIRGYAKEGNQPAISCLIDEMRMR 540

Query: 541 GITIED 542
           G+  +D
Sbjct: 541 GLLSDD 546

BLAST of Cla002193 vs. NCBI nr
Match: gi|778698035|ref|XP_011654465.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X1 [Cucumis sativus])

HSP 1 Score: 990.3 bits (2559), Expect = 1.3e-285
Identity = 485/541 (89.65%), Postives = 508/541 (93.90%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MVKEALQ+LAHLRR  R PTPFTCN++LHSLINSGCG LSAKL FH LSKGYTPH SSFN
Sbjct: 1   MVKEALQYLAHLRRTFRFPTPFTCNKLLHSLINSGCGHLSAKLLFHFLSKGYTPHPSSFN 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
           SIISFFCR GNVKFAE I  SM RFGCSPDIVSYNSLLDGYC SYQIQKACFLVNRVRGC
Sbjct: 61  SIISFFCRSGNVKFAEHIFISMSRFGCSPDIVSYNSLLDGYCSSYQIQKACFLVNRVRGC 120

Query: 121 ELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMG 180
           EL RPDLVMFNILFNGFAKVYMKNEAF+Y GLMWK CLPS+VTYGTFVDMFCKMGDM+MG
Sbjct: 121 ELNRPDLVMFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMG 180

Query: 181 NRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDG 240
           NRMFLDMMKVG+VPNLVVFSSLIDGYCKAG+LDVAF YFERM+ECSV+PNEFTYSTLIDG
Sbjct: 181 NRMFLDMMKVGIVPNLVVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDG 240

Query: 241 CCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLD 300
           C KHGML RADSLFEKMLSA ILPNCTVYTSIIDGHFKKGNVD+AIKYIN MF R+IKLD
Sbjct: 241 CSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLD 300

Query: 301 LAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYK 360
           L AYTV+ISGFH+VGR DKSMEAA YV KNGLLPDRIILTAIMDVHFKAGN+KEALNAYK
Sbjct: 301 LTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDVHFKAGNIKEALNAYK 360

Query: 361 ILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEA 420
           ILLA+GFE DVVTLSALMDGL KHGYLQEAR+YL KE ANEILYTVFIDALCKEGNLD+A
Sbjct: 361 ILLAKGFEADVVTLSALMDGLSKHGYLQEARRYLVKENANEILYTVFIDALCKEGNLDDA 420

Query: 421 ERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGG 480
           E+ IKEMSEAGFVPDK+VYTSWIAELCKQGNLLKAF+VKKRMVQEH+EPDLLTYSSLIGG
Sbjct: 421 EKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHVEPDLLTYSSLIGG 480

Query: 481 LAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIE 540
           LAEKGLMIEAKQVFDDMLNKGITPD VSYDILIRGYHNQGNGAAISGLHDEMRKRGI +E
Sbjct: 481 LAEKGLMIEAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNGAAISGLHDEMRKRGIIVE 540

Query: 541 D 542
           D
Sbjct: 541 D 541

BLAST of Cla002193 vs. NCBI nr
Match: gi|659102971|ref|XP_008452410.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X4 [Cucumis melo])

HSP 1 Score: 972.2 bits (2512), Expect = 3.6e-280
Identity = 480/541 (88.72%), Postives = 505/541 (93.35%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MVKEALQFLAHLRRISR P+PFTCN++LHSLINSGCG LSAKL  HLLSKGYTPH SSFN
Sbjct: 1   MVKEALQFLAHLRRISRFPSPFTCNKLLHSLINSGCGHLSAKLLIHLLSKGYTPHPSSFN 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
           SIISFFCR GNVKFAE+I  SM RFGCSPDIVSYNSLLDGYC S QIQKACFLVNRVRGC
Sbjct: 61  SIISFFCRSGNVKFAEQIFISMSRFGCSPDIVSYNSLLDGYCSSCQIQKACFLVNRVRGC 120

Query: 121 ELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMG 180
           EL RPDLVMFNILF GFAKVYMKNEAF+YLGLMWK  LPS+VTYGTFVDMFCKMGDMEMG
Sbjct: 121 ELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYYLPSIVTYGTFVDMFCKMGDMEMG 180

Query: 181 NRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDG 240
           NRMFLDMMKVG+VPNL+VFSSLIDGYCKAG+LDVAF YFERM+ECSV+PNEFTYSTLIDG
Sbjct: 181 NRMFLDMMKVGIVPNLIVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDG 240

Query: 241 CCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLD 300
           C K GML RADSLFEKMLSA ILPNCTVYTSIIDGHFKKGNVD+AIKYIN MF ++IKLD
Sbjct: 241 CSKRGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDQDIKLD 300

Query: 301 LAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYK 360
           L AYTV+ISGFH+VGR DKSMEAA YV K GLLPDRIILTAIMDVHFKAGN+KEALNAYK
Sbjct: 301 LTAYTVIISGFHRVGRFDKSMEAAEYVAKKGLLPDRIILTAIMDVHFKAGNIKEALNAYK 360

Query: 361 ILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEA 420
           ILLA+GFE DV TLSALMDGL KHGYLQ+AR+Y  KEKANEILYTVFIDALCKEGNLDEA
Sbjct: 361 ILLAKGFEADVATLSALMDGLSKHGYLQKARRYFVKEKANEILYTVFIDALCKEGNLDEA 420

Query: 421 ERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGG 480
           E+ IKEMSEAGFVPDK+VYTS IAELCKQGNLLKAF+VKKRMVQEHIEPDLLTYSSLI G
Sbjct: 421 EKMIKEMSEAGFVPDKFVYTSLIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLISG 480

Query: 481 LAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIE 540
           LAEKGLMIEAKQVFDDMLNKGITPD V+YDILIRGYHNQGNGAAISGLHDEMRKRGIT+E
Sbjct: 481 LAEKGLMIEAKQVFDDMLNKGITPDFVAYDILIRGYHNQGNGAAISGLHDEMRKRGITVE 540

Query: 541 D 542
           D
Sbjct: 541 D 541

BLAST of Cla002193 vs. NCBI nr
Match: gi|778698039|ref|XP_011654466.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X2 [Cucumis sativus])

HSP 1 Score: 890.2 bits (2299), Expect = 1.8e-255
Identity = 449/541 (82.99%), Postives = 475/541 (87.80%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MVKEALQ+LAHLRR  R PTPFTCN++LHSLINS           HL +K          
Sbjct: 1   MVKEALQYLAHLRRTFRFPTPFTCNKLLHSLINS--------GCGHLSAK---------- 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
            ++  F   G              FGCSPDIVSYNSLLDGYC SYQIQKACFLVNRVRGC
Sbjct: 61  -LLFHFLSKG--------------FGCSPDIVSYNSLLDGYCSSYQIQKACFLVNRVRGC 120

Query: 121 ELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMG 180
           EL RPDLVMFNILFNGFAKVYMKNEAF+Y GLMWK CLPS+VTYGTFVDMFCKMGDM+MG
Sbjct: 121 ELNRPDLVMFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMG 180

Query: 181 NRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDG 240
           NRMFLDMMKVG+VPNLVVFSSLIDGYCKAG+LDVAF YFERM+ECSV+PNEFTYSTLIDG
Sbjct: 181 NRMFLDMMKVGIVPNLVVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDG 240

Query: 241 CCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLD 300
           C KHGML RADSLFEKMLSA ILPNCTVYTSIIDGHFKKGNVD+AIKYIN MF R+IKLD
Sbjct: 241 CSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLD 300

Query: 301 LAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYK 360
           L AYTV+ISGFH+VGR DKSMEAA YV KNGLLPDRIILTAIMDVHFKAGN+KEALNAYK
Sbjct: 301 LTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDVHFKAGNIKEALNAYK 360

Query: 361 ILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEA 420
           ILLA+GFE DVVTLSALMDGL KHGYLQEAR+YL KE ANEILYTVFIDALCKEGNLD+A
Sbjct: 361 ILLAKGFEADVVTLSALMDGLSKHGYLQEARRYLVKENANEILYTVFIDALCKEGNLDDA 420

Query: 421 ERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGG 480
           E+ IKEMSEAGFVPDK+VYTSWIAELCKQGNLLKAF+VKKRMVQEH+EPDLLTYSSLIGG
Sbjct: 421 EKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHVEPDLLTYSSLIGG 480

Query: 481 LAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIE 540
           LAEKGLMIEAKQVFDDMLNKGITPD VSYDILIRGYHNQGNGAAISGLHDEMRKRGI +E
Sbjct: 481 LAEKGLMIEAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNGAAISGLHDEMRKRGIIVE 508

Query: 541 D 542
           D
Sbjct: 541 D 508

BLAST of Cla002193 vs. NCBI nr
Match: gi|700194419|gb|KGN49596.1| (hypothetical protein Csa_5G021290 [Cucumis sativus])

HSP 1 Score: 764.2 bits (1972), Expect = 1.5e-217
Identity = 374/413 (90.56%), Postives = 394/413 (95.40%), Query Frame = 1

Query: 129 MFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMGNRMFLDMM 188
           MFNILFNGFAKVYMKNEAF+Y GLMWK CLPS+VTYGTFVDMFCKMGDM+MGNRMFLDMM
Sbjct: 1   MFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMGNRMFLDMM 60

Query: 189 KVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDGCCKHGMLG 248
           KVG+VPNLVVFSSLIDGYCKAG+LDVAF YFERM+ECSV+PNEFTYSTLIDGC KHGML 
Sbjct: 61  KVGIVPNLVVFSSLIDGYCKAGSLDVAFEYFERMKECSVRPNEFTYSTLIDGCSKHGMLA 120

Query: 249 RADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLDLAAYTVVI 308
           RADSLFEKMLSA ILPNCTVYTSIIDGHFKKGNVD+AIKYIN MF R+IKLDL AYTV+I
Sbjct: 121 RADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVII 180

Query: 309 SGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYKILLARGFE 368
           SGFH+VGR DKSMEAA YV KNGLLPDRIILTAIMDVHFKAGN+KEALNAYKILLA+GFE
Sbjct: 181 SGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDVHFKAGNIKEALNAYKILLAKGFE 240

Query: 369 PDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEAERTIKEMS 428
            DVVTLSALMDGL KHGYLQEAR+YL KE ANEILYTVFIDALCKEGNLD+AE+ IKEMS
Sbjct: 241 ADVVTLSALMDGLSKHGYLQEARRYLVKENANEILYTVFIDALCKEGNLDDAEKMIKEMS 300

Query: 429 EAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMI 488
           EAGFVPDK+VYTSWIAELCKQGNLLKAF+VKKRMVQEH+EPDLLTYSSLIGGLAEKGLMI
Sbjct: 301 EAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHVEPDLLTYSSLIGGLAEKGLMI 360

Query: 489 EAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITIED 542
           EAKQVFDDMLNKGITPD VSYDILIRGYHNQGNGAAISGLHDEMRKRGI +ED
Sbjct: 361 EAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNGAAISGLHDEMRKRGIIVED 413

BLAST of Cla002193 vs. NCBI nr
Match: gi|700194419|gb|KGN49596.1| (hypothetical protein Csa_5G021290 [Cucumis sativus])

HSP 1 Score: 47.8 bits (112), Expect = 7.2e-02
Identity = 35/136 (25.74%), Postives = 63/136 (46.32%), Query Frame = 1

Query: 59  FNSIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVR 118
           +   I   C+ GN+  AE+++  M   G  PD   Y S +   C    + KA F+V +  
Sbjct: 276 YTVFIDALCKEGNLDDAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKA-FMVKKRM 335

Query: 119 GCELIRPDLVMFNILFNGFAKVYMKNEA-FVYLGLMWKCCLPSVVTYGTFVDMFCKMGDM 178
             E + PDL+ ++ L  G A+  +  EA  V+  ++ K   P  V+Y   +  +   G+ 
Sbjct: 336 VQEHVEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKGITPDFVSYDILIRGYHNQGNG 395

Query: 179 EMGNRMFLDMMKVGVV 194
              + +  +M K G++
Sbjct: 396 AAISGLHDEMRKRGII 410


HSP 2 Score: 725.3 bits (1871), Expect = 7.7e-206
Identity = 344/539 (63.82%), Postives = 427/539 (79.22%), Query Frame = 1

Query: 1   MVKEALQFLAHLRRISRLPTPFTCNRILHSLINSGCGDLSAKLFFHLLSKGYTPHSSSFN 60
           MV E LQF A LRR S+ P+PFT N++LHSL++S CG+LS K     LSKGY PH SSFN
Sbjct: 1   MVSETLQFFAQLRRASKFPSPFTFNKLLHSLVDSNCGELSLKFLSFFLSKGYVPHPSSFN 60

Query: 61  SIISFFCRLGNVKFAERILNSMPRFGCSPDIVSYNSLLDGYCGSYQIQKACFLVNRVRGC 120
           S++SF C+LGN  FAE++++SMP FGC PD V+YNSL+DGYC ++ I++ C ++ ++R  
Sbjct: 61  SVLSFLCKLGNFCFAEKVVDSMPGFGCVPDAVTYNSLIDGYCKNFDIERGCLVLKKIRVG 120

Query: 121 ELIRPDLVMFNILFNGFAKVYMKNEAFVYLGLMWKCCLPSVVTYGTFVDMFCKMGDMEMG 180
               PD+V FNIL NGF+K+ MK EAFVY+GLMWKCCLP+VVTY TF+DMFCKMGD++MG
Sbjct: 121 HC-NPDVVTFNILLNGFSKLKMKKEAFVYMGLMWKCCLPNVVTYSTFIDMFCKMGDLDMG 180

Query: 181 NRMFLDMMKVGVVPNLVVFSSLIDGYCKAGNLDVAFGYFERMEECSVQPNEFTYSTLIDG 240
            ++F DMMK GV PNL+ F+SLIDGYCKAGN+++AF  FE+M++ S+ PN  TY+ LIDG
Sbjct: 181 YKVFSDMMKNGVFPNLIAFTSLIDGYCKAGNVEMAFELFEKMKQSSLSPNVVTYTALIDG 240

Query: 241 CCKHGMLGRADSLFEKMLSAGILPNCTVYTSIIDGHFKKGNVDNAIKYINGMFYREIKLD 300
            CKHGML  A+SLF KML  G+ PN  VYTS+IDG+F KGNVDNAIKY+  M  + I+ D
Sbjct: 241 LCKHGMLEGAESLFSKMLEDGVEPNSAVYTSMIDGNFVKGNVDNAIKYVIKMREQNIRFD 300

Query: 301 LAAYTVVISGFHKVGRLDKSMEAAAYVVKNGLLPDRIILTAIMDVHFKAGNVKEALNAYK 360
           L  + V+I GF K GRLDK+ME    VV +GL PD+IILT IMD HFK+ N+K ALN Y+
Sbjct: 301 LTTFGVIIWGFCKTGRLDKAMEVMGIVVASGLAPDKIILTTIMDAHFKSRNLKAALNLYR 360

Query: 361 ILLARGFEPDVVTLSALMDGLCKHGYLQEARQYLDKEKANEILYTVFIDALCKEGNLDEA 420
            LL RGFEPDV+TLS L++GLCKHG+LQEAR+Y  +EKAN+I YTV ID +CKEG  +E 
Sbjct: 361 ELLVRGFEPDVITLSTLLNGLCKHGHLQEARRYFCREKANQISYTVLIDGICKEGQFNEV 420

Query: 421 ERTIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFVVKKRMVQEHIEPDLLTYSSLIGG 480
           E  +KEMSE GFVPDKYVYTSWIA LCKQGNL++AF +K +MV+E +EPDLLTYSSLI G
Sbjct: 421 EMVLKEMSEVGFVPDKYVYTSWIAGLCKQGNLVEAFRLKNKMVKEGVEPDLLTYSSLISG 480

Query: 481 LAEKGLMIEAKQVFDDMLNKGITPDSVSYDILIRGYHNQGNGAAISGLHDEMRKRGITI 540
           LA KGLMIEAKQVFD+ML  GITPDS  YDIL+RGY  +G+ AA+ GLHDEM KRG+++
Sbjct: 481 LASKGLMIEAKQVFDNMLKMGITPDSAVYDILVRGYLKEGDEAAVLGLHDEMIKRGLSV 538

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP141_ARATH5.4e-16752.50Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN... [more]
PP143_ARATH2.3e-7732.20Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th... [more]
PP407_ARATH5.1e-6428.24Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
PP360_ARATH2.5e-6329.09Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN... [more]
PP432_ARATH4.0e-6127.99Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
W9R0T6_9ROSA6.2e-20262.52Uncharacterized protein OS=Morus notabilis GN=L484_005539 PE=4 SV=1[more]
A0A0D2Q7W0_GOSRA1.3e-17556.85Uncharacterized protein OS=Gossypium raimondii GN=B456_002G171900 PE=4 SV=1[more]
A0A061EFM6_THECC4.9e-17557.92Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=T... [more]
V4KT02_EUTSA7.6e-16853.80Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10003932mg PE=4 SV=1[more]
A0A068V733_COFCA1.1e-16353.85Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00018059001 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|778698035|ref|XP_011654465.1|1.3e-28589.65PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X1 [Cuc... [more]
gi|659102971|ref|XP_008452410.1|3.6e-28088.72PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X4 [Cuc... [more]
gi|778698039|ref|XP_011654466.1|1.8e-25582.99PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X2 [Cuc... [more]
gi|700194419|gb|KGN49596.1|1.5e-21790.56hypothetical protein Csa_5G021290 [Cucumis sativus][more]
gi|700194419|gb|KGN49596.1|7.2e-0225.74hypothetical protein Csa_5G021290 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla002193Cla002193.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 162..192
score: 0.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 194..243
score: 1.7E-17coord: 400..448
score: 7.5E-10coord: 334..383
score: 2.4E-9coord: 55..102
score: 7.3E-12coord: 474..516
score: 5.4E-11coord: 264..313
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 437..470
score: 3.4E-5coord: 473..506
score: 1.9E-7coord: 232..266
score: 5.1E-11coord: 58..91
score: 7.2E-8coord: 197..231
score: 3.7E-9coord: 403..435
score: 1.2E-9coord: 162..195
score: 1.0E-6coord: 339..371
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 400..434
score: 12.781coord: 470..504
score: 12.364coord: 20..54
score: 8.166coord: 505..539
score: 9.536coord: 370..396
score: 7.004coord: 90..124
score: 7.925coord: 265..299
score: 8.21coord: 435..469
score: 9.471coord: 300..334
score: 9.109coord: 55..89
score: 11.093coord: 230..264
score: 13.581coord: 195..229
score: 12.299coord: 126..156
score: 5.788coord: 160..194
score: 11.159coord: 335..369
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 198..284
score: 3.8E-7coord: 338..447
score: 3.
NoneNo IPR availableunknownCoilCoilcoord: 410..430
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..540
score: 1.6E