Cla004832 (gene) Watermelon (97103) v1

NameCla004832
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionPentatricopeptide repeat-containing protein (AHRD V1 ***- Q7XJ94_RAPSA); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
LocationChr10 : 11292930 .. 11295164 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTATTTGCCCATGTTCGAAAATATGTTTATCTGAGTTGTTTTTACTCCACAATCAAGTTAACTGCCATCTCTTTTCGAGAAGAGCGTGCTATTCTTCCTTGTCGGTGTTGTTGGTGGAAGACCACGTGTTTGATGAAAGTCCAGTAGCTCAAATTAAAGTTCTTCCTGAAATTAACGTCAGAAGAAACTGTTCAATTCTTAGGAGGAAGGGGAAGCTCTTTCCTTTAGTTGTTCGGGTTCTAAAATCTTTGAACTGGAGGGTTGCGAGGCAGATAAGATTCTCAACAGCCGTGAAACGTTATGGGCATTTGAACTCTCTATGTGCATTTAGAATTATGGTTCATGTATTTGCATTAGCGGAAATGCGGATGGAAGTCTACTCTCTTCTTAAAGATATCGTTTGCCACACTAGGAAGGCTAAGCAAGATTTATCTGGATTGTTCCCATTTCTTATGGACTCGAATTATGATGTGGAGAAGTCAAGTATCGTGCTTGACCTGCTTATAAAGGTTTTCTCTGGAAATTCAATGGTTGAGAATGCTGTGGAAGTGTTTATTCAAGCCAAGAAAATGGGAATCATGCCAGATATTTCGTCATGCAATTTTTTGCTCAAGTGCTTGGGTGAAGCAAACAGAGAGGATTTTGTTAGATGTTTGTTTGAGGATTTGAAGACTTCTGGCCCAAATCCTACTGTATATACCTACACAATTATGATGAACTTCTTTTGCCAAGGATTAAATGGGCGTGATGTCAACATTAAAGAAGCAACCTTTCTTCTAGAGGAATTAATGAGGAGTGGTGAGAGACCCACTGTTGTAACGTACAGCATTTACATCCGTGCACTTTGTAAAGTTGTTTCTGGTCAGTCTGCTTTGGACTTCATTCAAGATCTGAGACTCAAGAACCAACCTCTTAATGCTTATTGTTACAATCCAATTCTTCATAAATTTTGTCAAGAGGGTAAGACGGAGGAAGCTATGAAGGTTTTGCAGGAAATGATGGGTTATGGAATACTACCAGATGCATGTAGTTATAGTATTTTGGTTGATGGGTTCTGCAAGAGAGGGGAAAATGATAAAGGTCTCAATCTGATTGAGGAAATGGAACTTTGTCAACTAAAACCATCCCTAGTCACCTATTCATCTCTGATTTATGGTCTTTGTAAGAGAGGATTGATGGATCTCTCCCTAGATATTTTTCGTAAACTTCAGAAACTTGGTTACAAATATGACTTGGTTGTTTATGACATTTTGATCAGAGGATTTTTTTCTCACGATGATATGGAGTATGCTAAAAAACTTTTGGATGAGATGGTAAATGATTTATGTCCTGATGCTTTTAATTTTAATGCAATGATTCGTTGGCTTTGCAAGATGGGACACTTTGATAAAGCCATAGGACTTCTCAATCTCATGCTAAAATATTGTGGTTTGGCTGATACGATGACTTGCAATTTTATTGTAGATGGGTACTGCAGAGAAGGGAACTTGGAGGAGGCCCTGAAATTAATGATTTATATGAAAGATCATGGTATCATGCCTAATTCGTACACATTCAATATAATCATGAAGAGATTGTGTGAGGAAGGAAGACTGGAAAAAGTGTGGGAGCTTTTCCCTGCAATGCTAAAGTTGAATATACTTCCAGGGTTGGTACATTATAGTACTCTTATTGATGGTTTTGCAAAGCATTCTAATATGAAAAAGGCATTAATGCTGTATGAAAGAATGGAAAGGCTAGGAGTTCCACCAGACACTGTCGCTTCTACTATCATTATTAATATGTTATGCCAGAGGAATGAAACATATCGGGCATATAAATTATTTAAGGAATTGATTGTAAAAGGTATGAATCTAGATAAGATTCTGTACACTTCTATGATAGCTGGATTTAGTAGAACGGGAGACATGAAGAAGGCATGTGCTTTGTTTAATAAAATGTCAAATGAAGGATGTTCACCTACTGTTGTTACTTACACTTGTTTAATTGATGGATTCTTCAAGTTAAGACGCTTGGATCTTGCAAGCTTTTTGGTTGATGATATGAAGAGAAATAATCTTACACCAGATGTGATAGTTTACACGGTTCTTATTGTTGGGCTGCTTAGACTTGGAAAGATTGAAAAAGCACGTGAACTGGTTGATGAAGTGAGAGCGGGTGGTATAATTTCAGAAGATGCTACTTTTCAAATGCTGGCTTCTAGCATTGTTAATCAGAAGTTGGAGTGCTAA

mRNA sequence

ATGGGTATTTGCCCATGTTCGAAAATATGTTTATCTGAGTTGTTTTTACTCCACAATCAAGTTAACTGCCATCTCTTTTCGAGAAGAGCGTGCTATTCTTCCTTGTCGGTGTTGTTGGTGGAAGACCACGTGTTTGATGAAAGTCCAGTAGCTCAAATTAAAGTTCTTCCTGAAATTAACGTCAGAAGAAACTGTTCAATTCTTAGGAGGAAGGGGAAGCTCTTTCCTTTAGTTGTTCGGGTTCTAAAATCTTTGAACTGGAGGGTTGCGAGGCAGATAAGATTCTCAACAGCCGTGAAACGTTATGGGCATTTGAACTCTCTATGTGCATTTAGAATTATGGTTCATGTATTTGCATTAGCGGAAATGCGGATGGAAGTCTACTCTCTTCTTAAAGATATCGTTTGCCACACTAGGAAGGCTAAGCAAGATTTATCTGGATTGTTCCCATTTCTTATGGACTCGAATTATGATGTGGAGAAGTCAAGTATCGTGCTTGACCTGCTTATAAAGGTTTTCTCTGGAAATTCAATGGTTGAGAATGCTGTGGAAGTGTTTATTCAAGCCAAGAAAATGGGAATCATGCCAGATATTTCGTCATGCAATTTTTTGCTCAAGTGCTTGGGTGAAGCAAACAGAGAGGATTTTGTTAGATGTTTGTTTGAGGATTTGAAGACTTCTGGCCCAAATCCTACTGTATATACCTACACAATTATGATGAACTTCTTTTGCCAAGGATTAAATGGGCGTGATGTCAACATTAAAGAAGCAACCTTTCTTCTAGAGGAATTAATGAGGAGTGGTGAGAGACCCACTGTTGTAACGTACAGCATTTACATCCGTGCACTTTGTAAAGTTGTTTCTGGTCAGTCTGCTTTGGACTTCATTCAAGATCTGAGACTCAAGAACCAACCTCTTAATGCTTATTGTTACAATCCAATTCTTCATAAATTTTGTCAAGAGGGTAAGACGGAGGAAGCTATGAAGGTTTTGCAGGAAATGATGGGTTATGGAATACTACCAGATGCATGTAGTTATAGTATTTTGGTTGATGGGTTCTGCAAGAGAGGGGAAAATGATAAAGGTCTCAATCTGATTGAGGAAATGGAACTTTGTCAACTAAAACCATCCCTAGTCACCTATTCATCTCTGATTTATGGTCTTTGTAAGAGAGGATTGATGGATCTCTCCCTAGATATTTTTCGTAAACTTCAGAAACTTGGTTACAAATATGACTTGGTTGTTTATGACATTTTGATCAGAGGATTTTTTTCTCACGATGATATGGAGTATGCTAAAAAACTTTTGGATGAGATGGTAAATGATTTATGTCCTGATGCTTTTAATTTTAATGCAATGATTCGTTGGCTTTGCAAGATGGGACACTTTGATAAAGCCATAGGACTTCTCAATCTCATGCTAAAATATTGTGGTTTGGCTGATACGATGACTTGCAATTTTATTGTAGATGGGTACTGCAGAGAAGGGAACTTGGAGGAGGCCCTGAAATTAATGATTTATATGAAAGATCATGGTATCATGCCTAATTCGTACACATTCAATATAATCATGAAGAGATTGTGTGAGGAAGGAAGACTGGAAAAAGTGTGGGAGCTTTTCCCTGCAATGCTAAAGTTGAATATACTTCCAGGGTTGGTACATTATAGTACTCTTATTGATGGTTTTGCAAAGCATTCTAATATGAAAAAGGCATTAATGCTGTATGAAAGAATGGAAAGGCTAGGAGTTCCACCAGACACTGTCGCTTCTACTATCATTATTAATATGTTATGCCAGAGGAATGAAACATATCGGGCATATAAATTATTTAAGGAATTGATTGTAAAAGGTATGAATCTAGATAAGATTCTGTACACTTCTATGATAGCTGGATTTAGTAGAACGGGAGACATGAAGAAGGCATGTGCTTTGTTTAATAAAATGTCAAATGAAGGATGTTCACCTACTGTTGTTACTTACACTTGTTTAATTGATGGATTCTTCAAGTTAAGACGCTTGGATCTTGCAAGCTTTTTGGTTGATGATATGAAGAGAAATAATCTTACACCAGATGTGATAGTTTACACGGTTCTTATTGTTGGGCTGCTTAGACTTGGAAAGATTGAAAAAGCACGTGAACTGGTTGATGAAGTGAGAGCGGGTGGTATAATTTCAGAAGATGCTACTTTTCAAATGCTGGCTTCTAGCATTGTTAATCAGAAGTTGGAGTGCTAA

Coding sequence (CDS)

ATGGGTATTTGCCCATGTTCGAAAATATGTTTATCTGAGTTGTTTTTACTCCACAATCAAGTTAACTGCCATCTCTTTTCGAGAAGAGCGTGCTATTCTTCCTTGTCGGTGTTGTTGGTGGAAGACCACGTGTTTGATGAAAGTCCAGTAGCTCAAATTAAAGTTCTTCCTGAAATTAACGTCAGAAGAAACTGTTCAATTCTTAGGAGGAAGGGGAAGCTCTTTCCTTTAGTTGTTCGGGTTCTAAAATCTTTGAACTGGAGGGTTGCGAGGCAGATAAGATTCTCAACAGCCGTGAAACGTTATGGGCATTTGAACTCTCTATGTGCATTTAGAATTATGGTTCATGTATTTGCATTAGCGGAAATGCGGATGGAAGTCTACTCTCTTCTTAAAGATATCGTTTGCCACACTAGGAAGGCTAAGCAAGATTTATCTGGATTGTTCCCATTTCTTATGGACTCGAATTATGATGTGGAGAAGTCAAGTATCGTGCTTGACCTGCTTATAAAGGTTTTCTCTGGAAATTCAATGGTTGAGAATGCTGTGGAAGTGTTTATTCAAGCCAAGAAAATGGGAATCATGCCAGATATTTCGTCATGCAATTTTTTGCTCAAGTGCTTGGGTGAAGCAAACAGAGAGGATTTTGTTAGATGTTTGTTTGAGGATTTGAAGACTTCTGGCCCAAATCCTACTGTATATACCTACACAATTATGATGAACTTCTTTTGCCAAGGATTAAATGGGCGTGATGTCAACATTAAAGAAGCAACCTTTCTTCTAGAGGAATTAATGAGGAGTGGTGAGAGACCCACTGTTGTAACGTACAGCATTTACATCCGTGCACTTTGTAAAGTTGTTTCTGGTCAGTCTGCTTTGGACTTCATTCAAGATCTGAGACTCAAGAACCAACCTCTTAATGCTTATTGTTACAATCCAATTCTTCATAAATTTTGTCAAGAGGGTAAGACGGAGGAAGCTATGAAGGTTTTGCAGGAAATGATGGGTTATGGAATACTACCAGATGCATGTAGTTATAGTATTTTGGTTGATGGGTTCTGCAAGAGAGGGGAAAATGATAAAGGTCTCAATCTGATTGAGGAAATGGAACTTTGTCAACTAAAACCATCCCTAGTCACCTATTCATCTCTGATTTATGGTCTTTGTAAGAGAGGATTGATGGATCTCTCCCTAGATATTTTTCGTAAACTTCAGAAACTTGGTTACAAATATGACTTGGTTGTTTATGACATTTTGATCAGAGGATTTTTTTCTCACGATGATATGGAGTATGCTAAAAAACTTTTGGATGAGATGGTAAATGATTTATGTCCTGATGCTTTTAATTTTAATGCAATGATTCGTTGGCTTTGCAAGATGGGACACTTTGATAAAGCCATAGGACTTCTCAATCTCATGCTAAAATATTGTGGTTTGGCTGATACGATGACTTGCAATTTTATTGTAGATGGGTACTGCAGAGAAGGGAACTTGGAGGAGGCCCTGAAATTAATGATTTATATGAAAGATCATGGTATCATGCCTAATTCGTACACATTCAATATAATCATGAAGAGATTGTGTGAGGAAGGAAGACTGGAAAAAGTGTGGGAGCTTTTCCCTGCAATGCTAAAGTTGAATATACTTCCAGGGTTGGTACATTATAGTACTCTTATTGATGGTTTTGCAAAGCATTCTAATATGAAAAAGGCATTAATGCTGTATGAAAGAATGGAAAGGCTAGGAGTTCCACCAGACACTGTCGCTTCTACTATCATTATTAATATGTTATGCCAGAGGAATGAAACATATCGGGCATATAAATTATTTAAGGAATTGATTGTAAAAGGTATGAATCTAGATAAGATTCTGTACACTTCTATGATAGCTGGATTTAGTAGAACGGGAGACATGAAGAAGGCATGTGCTTTGTTTAATAAAATGTCAAATGAAGGATGTTCACCTACTGTTGTTACTTACACTTGTTTAATTGATGGATTCTTCAAGTTAAGACGCTTGGATCTTGCAAGCTTTTTGGTTGATGATATGAAGAGAAATAATCTTACACCAGATGTGATAGTTTACACGGTTCTTATTGTTGGGCTGCTTAGACTTGGAAAGATTGAAAAAGCACGTGAACTGGTTGATGAAGTGAGAGCGGGTGGTATAATTTCAGAAGATGCTACTTTTCAAATGCTGGCTTCTAGCATTGTTAATCAGAAGTTGGAGTGCTAA

Protein sequence

MGICPCSKICLSELFLLHNQVNCHLFSRRACYSSLSVLLVEDHVFDESPVAQIKVLPEINVRRNCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVHVFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEMVNDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINMLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDEVRAGGIISEDATFQMLASSIVNQKLEC
BLAST of Cla004832 vs. Swiss-Prot
Match: PPR39_ARATH (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 3.0e-67
Identity = 157/576 (27.26%), Postives = 276/576 (47.92%), Query Frame = 1

Query: 146 SGLFPFLMDSNYDVEKSSI-------VLDL--LIKVFSGNSMVENAVEVFIQAKKMGIMP 205
           SGL     D   D+ +  I       V+D   L    +     E  + +  Q +  GI  
Sbjct: 62  SGLVGIKADDAVDLFRDMIQSRPLPTVIDFNRLFSAIAKTKQYELVLALCKQMESKGIAH 121

Query: 206 DISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNFFCQGLNGRDVNIKE 265
            I + + ++ C     +  +       +   G  P    +  ++N  C      +  + E
Sbjct: 122 SIYTLSIMINCFCRCRKLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLCL-----ECRVSE 181

Query: 266 ATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLKNQPLNAYCYNPILH 325
           A  L++ ++  G +PT++T +  +  LC       A+  I  +       N   Y P+L+
Sbjct: 182 ALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLN 241

Query: 326 KFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKGLNLIEEMELCQLKP 385
             C+ G+T  AM++L++M    I  DA  YSI++DG CK G  D   NL  EME+   K 
Sbjct: 242 VMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKA 301

Query: 386 SLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLL 445
            ++TY++LI G C  G  D    + R + K     ++V + +LI  F     +  A +LL
Sbjct: 302 DIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLL 361

Query: 446 DEMVN-DLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYCGLADTMTCNFIVDGYCRE 505
            EM+   + P+   +N++I   CK    ++AI +++LM+      D MT N +++GYC+ 
Sbjct: 362 KEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYCKA 421

Query: 506 GNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVWELFPAMLKLNILPGLVHY 565
             +++ L+L   M   G++ N+ T+N +++  C+ G+LE   +LF  M+   + P +V Y
Sbjct: 422 NRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSY 481

Query: 566 STLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINMLCQRNETYRAYKLFKELIV 625
             L+DG   +  ++KAL ++ ++E+  +  D     III+ +C  ++   A+ LF  L +
Sbjct: 482 KILLDGLCDNGELEKALEIFGKIEKSKMELDIGIYMIIIHGMCNASKVDDAWDLFCSLPL 541

Query: 626 KGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTVVTYTCLIDGFFKLRRLDL 685
           KG+ LD   Y  MI+   R   + KA  LF KM+ EG +P  +TY  LI           
Sbjct: 542 KGVKLDARAYNIMISELCRKDSLSKADILFRKMTEEGHAPDELTYNILIRAHLGDDDATT 601

Query: 686 ASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKA 712
           A+ L+++MK +    DV     +++ +L  G+++K+
Sbjct: 602 AAELIEEMKSSGFPADVST-VKMVINMLSSGELDKS 631

BLAST of Cla004832 vs. Swiss-Prot
Match: PPR94_ARATH (Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana GN=At1g62910 PE=2 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 2.0e-66
Identity = 159/566 (28.09%), Postives = 286/566 (50.53%), Query Frame = 1

Query: 179 VENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTI 238
           V++AV++F    K    P I   N LL  + + N+ + V  L E ++T G +  +YTY+I
Sbjct: 64  VDDAVDLFGDMVKSRPFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSI 123

Query: 239 MMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQD 298
            +N FC     R   +  A  +L ++M+ G  P +VT S  +   C       A+  +  
Sbjct: 124 FINCFC-----RRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQ 183

Query: 299 LRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGE 358
           +       + + +  ++H      K  EA+ ++ +M+  G  PD  +Y  +V+G CKRG+
Sbjct: 184 MVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGD 243

Query: 359 NDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDI 418
            D  L+L+++ME  +++  +V Y+++I GLCK   MD +L++F ++   G + D+  Y  
Sbjct: 244 IDLALSLLKKMEKGKIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPDVFTYSS 303

Query: 419 LIRGFFSHDDMEYAKKLLDEMVN-DLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYC 478
           LI    ++     A +LL +M+   + P+   F+A+I    K G   +A  L + M+K  
Sbjct: 304 LISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRS 363

Query: 479 GLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVW 538
              D  T + +++G+C    L+EA  +   M      PN  T++ ++K  C+  R+E+  
Sbjct: 364 IDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKRVEEGM 423

Query: 539 ELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINML 598
           ELF  M +  ++   V Y+TLI GF +  +   A M++++M  +GV P+ +   I+++ L
Sbjct: 424 ELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYNILLDGL 483

Query: 599 CQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTV 658
           C+  +  +A  +F+ L    M  D   Y  MI G  + G ++    LF  +S +G SP V
Sbjct: 484 CKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLKGVSPNV 543

Query: 659 VTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDE 718
           + Y  +I GF +    + A  L+  MK +   P+   Y  LI   LR G  E + EL+ E
Sbjct: 544 IAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDREASAELIKE 603

Query: 719 VRAGGIISEDATFQMLASSIVNQKLE 744
           +R+ G   + +T  ++ + + + +L+
Sbjct: 604 MRSCGFAGDASTIGLVTNMLHDGRLD 624


HSP 2 Score: 173.7 bits (439), Expect = 7.5e-42
Identity = 137/552 (24.82%), Postives = 241/552 (43.66%), Query Frame = 1

Query: 71  KGKLFPLVVRVLKSLNWRVARQIRFSTAV------KRYGHLNSLCAFRIMVHVFA----- 130
           K + FP +V   K L+  VA+  +F   +      +  G  + L  + I ++ F      
Sbjct: 76  KSRPFPSIVEFNKLLS-AVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQL 135

Query: 131 ------LAEMRMEVY-------SLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVL 190
                 LA+M    Y       S L +  CH+++   D   L   +++  Y  +  +   
Sbjct: 136 SLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRIS-DAVALVDQMVEMGY--KPDTFTF 195

Query: 191 DLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKT 250
             LI     ++    AV +  Q  + G  PD+ +   ++  L +    D    L + ++ 
Sbjct: 196 TTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALSLLKKMEK 255

Query: 251 SGPNPTVYTYTIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKV 310
                 V  Y  +++  C     +  ++ +A  L  E+   G RP V TYS  I  LC  
Sbjct: 256 GKIEADVVIYNTIIDGLC-----KYKHMDDALNLFTEMDNKGIRPDVFTYSSLISCLCNY 315

Query: 311 VSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSY 370
                A   + D+  +    N   ++ ++  F +EGK  EA K+  EM+   I PD  +Y
Sbjct: 316 GRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTY 375

Query: 371 SILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQK 430
           S L++GFC     D+  ++ E M      P++VTYS+LI G CK   ++  +++FR++ +
Sbjct: 376 SSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKRVEEGMELFREMSQ 435

Query: 431 LGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEMVN-DLCPDAFNFNAMIRWLCKMGHFDK 490
            G   + V Y  LI GFF   D + A+ +  +MV+  + P+   +N ++  LCK G   K
Sbjct: 436 RGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYNILLDGLCKNGKLAK 495

Query: 491 AIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMK 550
           A+ +   + +     D  T N +++G C+ G +E+  +L   +   G+ PN   +N ++ 
Sbjct: 496 AMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLKGVSPNVIAYNTMIS 555

Query: 551 RLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPP 598
             C +G  E+   L   M +   LP    Y+TLI    +  + + +  L + M   G   
Sbjct: 556 GFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDREASAELIKEMRSCGFAG 615

BLAST of Cla004832 vs. Swiss-Prot
Match: PP247_ARATH (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana GN=At3g22470 PE=2 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 2.6e-66
Identity = 156/566 (27.56%), Postives = 277/566 (48.94%), Query Frame = 1

Query: 179 VENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTI 238
           V +A+++F    +   +P     N L   +    + D V    + ++ +G    +YT TI
Sbjct: 51  VNDAIDLFESMIQSRPLPTPIDFNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMTI 110

Query: 239 MMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQD 298
           M+N +C     R   +  A  +L    + G  P  +T+S  +   C       A+  +  
Sbjct: 111 MINCYC-----RKKKLLFAFSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALVDR 170

Query: 299 LRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGE 358
           +    Q  +    + +++  C +G+  EA+ ++  M+ YG  PD  +Y  +++  CK G 
Sbjct: 171 MVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGN 230

Query: 359 NDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDI 418
           +   L+L  +ME   +K S+V YS +I  LCK G  D +L +F +++  G K D+V Y  
Sbjct: 231 SALALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSS 290

Query: 419 LIRGFFSHDDMEYAKKLLDEMVN-DLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYC 478
           LI G  +    +   K+L EM+  ++ PD   F+A+I    K G   +A  L N M+   
Sbjct: 291 LIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRG 350

Query: 479 GLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVW 538
              DT+T N ++DG+C+E  L EA ++   M   G  P+  T++I++   C+  R++   
Sbjct: 351 IAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGM 410

Query: 539 ELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINML 598
            LF  +    ++P  + Y+TL+ GF +   +  A  L++ M   GVPP  V   I+++ L
Sbjct: 411 RLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGL 470

Query: 599 CQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTV 658
           C   E  +A ++F+++    M L   +Y  +I G      +  A +LF  +S++G  P V
Sbjct: 471 CDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDV 530

Query: 659 VTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDE 718
           VTY  +I G  K   L  A  L   MK +  TPD   Y +LI   L    +  + EL++E
Sbjct: 531 VTYNVMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEE 590

Query: 719 VRAGGIISEDATFQMLASSIVNQKLE 744
           ++  G  ++ +T +M+   + +++L+
Sbjct: 591 MKVCGFSADSSTIKMVIDMLSDRRLD 611


HSP 2 Score: 185.3 bits (469), Expect = 2.5e-45
Identity = 129/500 (25.80%), Postives = 226/500 (45.20%), Query Frame = 1

Query: 126 EVYSLLKDIVCHTRKAKQDLSGLFPFLMDSN---YDVEKSSIVLDLLIKVFSGNSMVENA 185
           ++Y++   I C+ RK K     LF F +         E  +I    L+  F     V  A
Sbjct: 104 DMYTMTIMINCYCRKKKL----LFAFSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEA 163

Query: 186 VEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNF 245
           V +  +  +M   PD+ + + L+  L    R      L + +   G  P   TY  ++N 
Sbjct: 164 VALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNR 223

Query: 246 FCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLK 305
            C+  N        A  L  ++     + +VV YSI I +LCK  S   AL    ++ +K
Sbjct: 224 LCKSGNSA-----LALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMK 283

Query: 306 NQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKG 365
               +   Y+ ++   C +GK ++  K+L+EM+G  I+PD  ++S L+D F K G+  + 
Sbjct: 284 GIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEA 343

Query: 366 LNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDILIRG 425
             L  EM    + P  +TY+SLI G CK   +  +  +F  +   G + D+V Y ILI  
Sbjct: 344 KELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINS 403

Query: 426 FFSHDDMEYAKKLLDEMVN-DLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYCGLAD 485
           +     ++   +L  E+ +  L P+   +N ++   C+ G  + A  L   M+       
Sbjct: 404 YCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPS 463

Query: 486 TMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVWELFP 545
            +T   ++DG C  G L +AL++   M+   +      +NII+  +C   +++  W LF 
Sbjct: 464 VVTYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFC 523

Query: 546 AMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINMLCQRN 605
           ++    + P +V Y+ +I G  K  ++ +A ML+ +M+  G  PD     I+I      +
Sbjct: 524 SLSDKGVKPDVVTYNVMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGS 583

Query: 606 ETYRAYKLFKELIVKGMNLD 622
               + +L +E+ V G + D
Sbjct: 584 GLISSVELIEEMKVCGFSAD 594

BLAST of Cla004832 vs. Swiss-Prot
Match: PPR91_ARATH (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 254.6 bits (649), Expect = 3.3e-66
Identity = 157/566 (27.74%), Postives = 282/566 (49.82%), Query Frame = 1

Query: 179 VENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTI 238
           +++AV +F +  K    P I   + LL  + + N+ D V  L E ++  G     YTY+I
Sbjct: 62  LDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSI 121

Query: 239 MMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQD 298
           ++N FC     R   +  A  +L ++M+ G  P +VT S  +   C       A+  +  
Sbjct: 122 LINCFC-----RRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQ 181

Query: 299 LRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGE 358
           + +     N   +N ++H      K  EAM ++  M+  G  PD  +Y ++V+G CKRG+
Sbjct: 182 MFVTGYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGD 241

Query: 359 NDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDI 418
            D   NL+ +ME  +L+P ++ Y+++I GLCK   MD +L++F++++  G + ++V Y  
Sbjct: 242 TDLAFNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSS 301

Query: 419 LIRGFFSHDDMEYAKKLLDEMV-NDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYC 478
           LI    ++     A +LL +M+   + PD F F+A+I    K G   +A  L + M+K  
Sbjct: 302 LISCLCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRS 361

Query: 479 GLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVW 538
                +T + +++G+C    L+EA ++  +M      P+  T+N ++K  C+  R+E+  
Sbjct: 362 IDPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGM 421

Query: 539 ELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINML 598
           E+F  M +  ++   V Y+ LI G  +  +   A  +++ M   GVPP+ +    +++ L
Sbjct: 422 EVFREMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGL 481

Query: 599 CQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTV 658
           C+  +  +A  +F+ L    M      Y  MI G  + G ++    LF  +S +G  P V
Sbjct: 482 CKNGKLEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDV 541

Query: 659 VTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDE 718
           V Y  +I GF +    + A  L  +MK +   P+   Y  LI   LR G  E + EL+ E
Sbjct: 542 VAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIKE 601

Query: 719 VRAGGIISEDATFQMLASSIVNQKLE 744
           +R+ G   + +T  ++ + + + +L+
Sbjct: 602 MRSCGFAGDASTIGLVTNMLHDGRLD 622


HSP 2 Score: 110.2 bits (274), Expect = 1.0e-22
Identity = 87/345 (25.22%), Postives = 166/345 (48.12%), Query Frame = 1

Query: 105 LNSLCAFRIM---VHVFALAEMR-----MEVYSLLKDIVCHTRKAKQDLSGLFPFLMDS- 164
           ++ LC ++ M   +++F   E +     +  YS L   +C+  +   D S L   +++  
Sbjct: 263 IDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWS-DASRLLSDMIERK 322

Query: 165 -NYDVEKSSIVLDLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANRE 224
            N DV   S ++D  +K      +VE A +++ +  K  I P I + + L+      +R 
Sbjct: 323 INPDVFTFSALIDAFVKE---GKLVE-AEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRL 382

Query: 225 DFVRCLFEDLKTSGPNPTVYTYTIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVV 284
           D  + +FE + +    P V TY  ++  FC     +   ++E   +  E+ + G     V
Sbjct: 383 DEAKQMFEFMVSKHCFPDVVTYNTLIKGFC-----KYKRVEEGMEVFREMSQRGLVGNTV 442

Query: 285 TYSIYIRALCKVVSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEM 344
           TY+I I+ L +      A +  +++     P N   YN +L   C+ GK E+AM V + +
Sbjct: 443 TYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYL 502

Query: 345 MGYGILPDACSYSILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLM 404
               + P   +Y+I+++G CK G+ + G +L   + L  +KP +V Y+++I G C++G  
Sbjct: 503 QRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSK 562

Query: 405 DLSLDIFRKLQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEM 440
           + +  +F+++++ G   +   Y+ LIR      D E + +L+ EM
Sbjct: 563 EEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIKEM 597


HSP 3 Score: 108.2 bits (269), Expect = 3.9e-22
Identity = 75/271 (27.68%), Postives = 131/271 (48.34%), Query Frame = 1

Query: 129 SLLKDIVCHTR--KAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFSGNSMVENAVEVF 188
           SL+     H R  +AKQ    +F F++  +   +   +  + LIK F     VE  +EVF
Sbjct: 366 SLINGFCMHDRLDEAKQ----MFEFMVSKHCFPDV--VTYNTLIKGFCKYKRVEEGMEVF 425

Query: 189 IQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNFFCQG 248
            +  + G++ +  + N L++ L +A   D  + +F+++ + G  P + TY  +++  C+ 
Sbjct: 426 REMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCK- 485

Query: 249 LNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLKNQPL 308
            NG+   +++A  + E L RS   PT+ TY+I I  +CK    +   D   +L LK    
Sbjct: 486 -NGK---LEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKP 545

Query: 309 NAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKGLNLI 368
           +   YN ++  FC++G  EEA  + +EM   G LP++  Y+ L+    + G+ +    LI
Sbjct: 546 DVVAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELI 605

Query: 369 EEMELCQLKPSLVTYSSLIYGLCKRGLMDLS 398
           +EM  C       T   L+  +   G +D S
Sbjct: 606 KEMRSCGFAGDASTI-GLVTNMLHDGRLDKS 624

BLAST of Cla004832 vs. Swiss-Prot
Match: PP100_ARATH (Pentatricopeptide repeat-containing protein At1g63150 OS=Arabidopsis thaliana GN=At1g63150 PE=2 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 5.7e-66
Identity = 159/566 (28.09%), Postives = 285/566 (50.35%), Query Frame = 1

Query: 179 VENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTI 238
           V++AV++F    K    P I   N LL  + + N+ + V  L E ++T G +  +YTY+I
Sbjct: 64  VDDAVDLFGDMVKSRPFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSI 123

Query: 239 MMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQD 298
            +N FC     R   +  A  +L ++M+ G  P +VT S  +   C       A+  +  
Sbjct: 124 FINCFC-----RRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQ 183

Query: 299 LRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGE 358
           +       + + +  ++H      K  EA+ ++ +M+  G  PD  +Y  +V+G CKRG+
Sbjct: 184 MVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGD 243

Query: 359 NDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDI 418
            D  LNL+ +ME  ++K ++V ++++I  LCK   +++++D+F +++  G + ++V Y+ 
Sbjct: 244 IDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNS 303

Query: 419 LIRGFFSHDDMEYAKKLLDEMVND-LCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYC 478
           LI    ++     A +LL  M+   + P+   FNA+I    K G   +A  L   M++  
Sbjct: 304 LINCLCNYGRWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRS 363

Query: 479 GLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVW 538
              DT+T N +++G+C    L+EA ++  +M     +PN  T+N ++   C+  R+E   
Sbjct: 364 IDPDTITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQTYNTLINGFCKCKRVEDGV 423

Query: 539 ELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINML 598
           ELF  M +  ++   V Y+T+I GF +  +   A M++++M    VP D +  +I+++ L
Sbjct: 424 ELFREMSQRGLVGNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGL 483

Query: 599 CQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTV 658
           C   +   A  +FK L    M L+  +Y +MI G  + G + +A  LF  +S     P V
Sbjct: 484 CSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFCSLS---IKPDV 543

Query: 659 VTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDE 718
           VTY  +I G    R L  A  L   MK +   P+   Y  LI   LR      + EL+ E
Sbjct: 544 VTYNTMISGLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRANLRDCDRAASAELIKE 603

Query: 719 VRAGGIISEDATFQMLASSIVNQKLE 744
           +R+ G + + +T  ++ + + + +L+
Sbjct: 604 MRSSGFVGDASTISLVTNMLHDGRLD 621


HSP 2 Score: 163.3 bits (412), Expect = 1.0e-38
Identity = 132/552 (23.91%), Postives = 243/552 (44.02%), Query Frame = 1

Query: 71  KGKLFPLVVRVLKSLNWRVARQIRFSTAV------KRYGHLNSLCAFRIMVHVFA----- 130
           K + FP +V   K L+  VA+  +F   +      +  G  + L  + I ++ F      
Sbjct: 76  KSRPFPSIVEFNKLLS-AVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQL 135

Query: 131 ------LAEMRMEVY-------SLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVL 190
                 LA+M    Y       S L +  CH+++   D   L   +++  Y  +  +   
Sbjct: 136 SLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRIS-DAVALVDQMVEMGY--KPDTFTF 195

Query: 191 DLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKT 250
             LI     ++    AV +  Q  + G  PD+ +   ++  L +    D    L   ++ 
Sbjct: 196 TTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALNLLNKMEA 255

Query: 251 SGPNPTVYTYTIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKV 310
           +     V  +  +++  C+  +     ++ A  L  E+   G RP VVTY+  I  LC  
Sbjct: 256 ARIKANVVIFNTIIDSLCKYRH-----VEVAVDLFTEMETKGIRPNVVTYNSLINCLCNY 315

Query: 311 VSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSY 370
                A   + ++  K    N   +N ++  F +EGK  EA K+ +EM+   I PD  +Y
Sbjct: 316 GRWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITY 375

Query: 371 SILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQK 430
           ++L++GFC     D+   + + M      P++ TY++LI G CK   ++  +++FR++ +
Sbjct: 376 NLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQTYNTLINGFCKCKRVEDGVELFREMSQ 435

Query: 431 LGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEMVNDLCP-DAFNFNAMIRWLCKMGHFDK 490
            G   + V Y  +I+GFF   D + A+ +  +MV++  P D   ++ ++  LC  G  D 
Sbjct: 436 RGLVGNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLCSYGKLDT 495

Query: 491 AIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMK 550
           A+ +   + K     +    N +++G C+ G + EA  L   +    I P+  T+N ++ 
Sbjct: 496 ALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFCSL---SIKPDVVTYNTMIS 555

Query: 551 RLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPP 598
            LC +  L++  +LF  M +   LP    Y+TLI    +  +   +  L + M   G   
Sbjct: 556 GLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRANLRDCDRAASAELIKEMRSSGFVG 615

BLAST of Cla004832 vs. TrEMBL
Match: D7TTT9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0012g01760 PE=4 SV=1)

HSP 1 Score: 751.5 bits (1939), Expect = 9.7e-214
Identity = 390/761 (51.25%), Postives = 523/761 (68.73%), Query Frame = 1

Query: 3   ICPCSKICLSELFLLHNQVNCHLFSRRACY--SSLSVLLVEDHVFDESPVAQIKVLPEIN 62
           +C  ++ CL E  LL  +V  +L      Y  +S ++LL +DHVFD SP   +  L EIN
Sbjct: 1   MCSIARFCLVEPVLLQIRVVSNLKCLFRVYRTASSALLLDDDHVFDTSPGVPVDDLVEIN 60

Query: 63  VRR----------------NCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGH 122
           +++                  S + RK  L P+VV+V KSLNW VAR I+FST +K+YG 
Sbjct: 61  IQKCQLRVGRKRNEIRVMKRRSRIHRKHVLSPVVVKVFKSLNWEVARHIKFSTTMKKYGF 120

Query: 123 LNSLCAFRIMVHVFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSI 182
             S+ AFR +V+V ALA M MEVY+LL+DIVC+  K   D   LFP L++S  D  +S I
Sbjct: 121 SRSIDAFRTVVNVLALAGMHMEVYALLRDIVCYYNKVNLDAFELFPILLESPKDAARSVI 180

Query: 183 VLDLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDL 242
           V DLLIKVF+ NSM+ENAV+VF+QAKK G+     SCNFLLKCL EANR +F+R LFE++
Sbjct: 181 VFDLLIKVFAANSMLENAVDVFLQAKKTGLELSTRSCNFLLKCLAEANRREFLRSLFEEM 240

Query: 243 KTSGPNPTVYTYTIMMNFFCQGLNGR-DVNIKEATFLLEELMRSGERPTVVTYSIYIRAL 302
           K++GP P V+TYTIMMNF+C+G  G  D++ ++AT +LEE+ R+GE PTVVTYS YI  L
Sbjct: 241 KSTGPPPNVFTYTIMMNFYCKGNFGEADIDTRQATEILEEMERNGESPTVVTYSTYIYGL 300

Query: 303 CKVVSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDA 362
           C+V   +SALDF++ L   N  +N YCYN I+H  C++G+ +EA+KVL+EM   GI PD 
Sbjct: 301 CRVGYVESALDFVRSLISANGLVNVYCYNAIIHGLCKKGELDEALKVLEEMKSCGISPDV 360

Query: 363 CSYSILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRK 422
            +YSIL+ GFCK+G+ +KGL LIEEM+   ++PSLV+YSSL +GLCK+ L D+SLDIFR 
Sbjct: 361 YTYSILIHGFCKQGDVEKGLYLIEEMKYSNMEPSLVSYSSLFHGLCKKRLSDISLDIFRD 420

Query: 423 LQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEMV-NDLCPDAFNFNAMIRWLCKMGH 482
           L   GYKYD   Y ILI+GF    D++ A KL++EMV N+L PD  NF +++   CKMG 
Sbjct: 421 LGAAGYKYDQTAYSILIKGFCMQGDLDSAHKLMEEMVRNNLAPDPSNFESLVHGFCKMGL 480

Query: 483 FDKAIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNI 542
           +  A+   N+ML+   L    TCN I+D +CREG +EEAL LM  M+  GI PN +T+N 
Sbjct: 481 WVNALEFFNMMLEGGILPSIATCNVIIDAHCREGRVEEALNLMNEMQTQGIFPNLFTYNA 540

Query: 543 IMKRLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLG 602
           ++ RLC+E + E+  ELFP MLK N+LP +V YSTLIDGFAK SN +KALMLY RM ++G
Sbjct: 541 VINRLCKERKSERALELFPLMLKRNVLPSVVVYSTLIDGFAKQSNSQKALMLYARMLKIG 600

Query: 603 VPPDTVASTIIINMLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKAC 662
           V PD VA TI+IN+LC R+    AY LFK++   GM  DKI YTS+IAGF R GDM+KA 
Sbjct: 601 VTPDMVAYTILINILCHRSRMCEAYNLFKKMTENGMTPDKISYTSVIAGFCRIGDMRKAW 660

Query: 663 ALFNKMSNEGCSPTVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGL 722
           ALFN+M   G  PTVVTYT L+DG+ K+ R+D+A  L+D+MKR  +TPDV+ Y VLI   
Sbjct: 661 ALFNEMLQRGHLPTVVTYTSLVDGYCKMNRIDIADMLIDEMKRKGITPDVVTYNVLIAAH 720

Query: 723 LRLGKIEKARELVDEVRAGGIISEDATFQMLASSIVNQKLE 744
            R G ++KA E+++E++  G++ +  T+ ML   +  +KL+
Sbjct: 721 RRRGNLDKALEMLNEMKENGVLPDHMTYMMLEWLLKAKKLK 761

BLAST of Cla004832 vs. TrEMBL
Match: A0A061GDS6_THECC (Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao GN=TCM_016561 PE=4 SV=1)

HSP 1 Score: 745.3 bits (1923), Expect = 6.9e-212
Identity = 390/741 (52.63%), Postives = 520/741 (70.18%), Query Frame = 1

Query: 1   MGICPCSKICLSELFLLHNQV----NCHLFSRRACYSSLSVLLVEDHVFDESP--VAQIK 60
           MGI   SKICL +  LL + V     C L   R  YS+ S LL+EDHVFD SP  V+   
Sbjct: 1   MGIWSYSKICLVQSGLLRDVVVHYKKCLL---RVYYSASSALLLEDHVFDCSPEVVSVNN 60

Query: 61  VLPEINVRRNCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIM 120
            + E+ V R      R  +L P VVRV KSLNW +AR+IRF+ A K YG  +S+ AFRI+
Sbjct: 61  EVEELQVPRKTFEFCRNPRLTPFVVRVFKSLNWDIAREIRFNMAAKMYGFDHSMYAFRII 120

Query: 121 VHVFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFS 180
           +H+FA+A M+ME ++LL+DIVC+ ++ K D+  L  +L+DS   V +S+ V ++LIKVF+
Sbjct: 121 IHIFAMAGMQMEAHALLRDIVCYYKEVKTDMFELLLYLLDSPEHVHRSADVFNVLIKVFA 180

Query: 181 GNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVY 240
            NSM+EN ++VF+QAKK+G+ P+I SCNFLLKCL EANR +FVR LFED+K SGP+P VY
Sbjct: 181 SNSMLENGIDVFVQAKKIGLEPNIMSCNFLLKCLVEANRGEFVRSLFEDMKNSGPSPNVY 240

Query: 241 TYTIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALD 300
           TYTIMMNF+C G  GRDV++ +A  LLE++ R G+ P+VVTYS YI  LC+V   + ALD
Sbjct: 241 TYTIMMNFYCNGYCGRDVDVGQANNLLEDMERGGKNPSVVTYSTYIGGLCRVGCVELALD 300

Query: 301 FIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFC 360
           FI+ L   NQP+N++CYN I++ FCQ+G+  E +KVL+EM   GI PD  SYSIL+DGFC
Sbjct: 301 FIRKLCFGNQPINSFCYNAIIYGFCQKGEPYEGLKVLEEMKHCGISPDVHSYSILIDGFC 360

Query: 361 KRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLV 420
           K+G+ +KG+NLI+EM +  +KPSLVTY+SL +GLCK GL D+SL++FR L   GY+YDL 
Sbjct: 361 KKGDCEKGINLIDEMIVNGMKPSLVTYTSLFHGLCKSGLADVSLNLFRNLANDGYEYDLA 420

Query: 421 VYDILIRGFFSHDDMEYAKKLLDEMV-NDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLM 480
            Y +L++GF    D++ A +L + M  N L P   +FN +I   CKMG  DKA+ L N+M
Sbjct: 421 AYSVLLKGFCLQGDVDSAMELFEGMFSNSLIPTTNSFNRLIHGFCKMGLLDKALELFNIM 480

Query: 481 LKYCGLADTM-TCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGR 540
           L+  G++ T+ TCN I DGYC+ G+LEEALKL+  M + GI PNSYT+N I+KRLC +  
Sbjct: 481 LQ-SGVSPTIFTCNVIADGYCKAGHLEEALKLINEMHEFGIFPNSYTYNGIIKRLCMQSY 540

Query: 541 LEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTI 600
             K WEL P M+K NIL   VH + L++GFA+ S  KKALMLY RM +LG    T+  TI
Sbjct: 541 SGKAWELLPQMIKKNILHN-VHCNILMNGFAEQSKPKKALMLYARMLKLGFTRTTITHTI 600

Query: 601 IINMLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEG 660
           +IN+  QR + Y AY LFK++I KG+  D I YTS+IAGF R  DMKKA AL+ +M   G
Sbjct: 601 LINIFSQRCKMYEAYSLFKDMIAKGLIPDTISYTSVIAGFCRVRDMKKAWALYTEMLRRG 660

Query: 661 CSPTVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKAR 720
            SP VVTYTCLIDGF  + R+D+A+ L+D+MKR  + PDV+ YT LI G  RLG I++A 
Sbjct: 661 YSPNVVTYTCLIDGFCHIHRMDMANLLIDEMKRREINPDVVTYTALISGYRRLGDIDRAH 720

Query: 721 ELVDEVRAGGIISEDATFQML 734
           EL  E+++ GI+ +DA +  L
Sbjct: 721 ELFAEMKSKGIVPDDAAYSAL 736

BLAST of Cla004832 vs. TrEMBL
Match: A0A0D2R076_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G065800 PE=4 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 1.2e-195
Identity = 364/736 (49.46%), Postives = 493/736 (66.98%), Query Frame = 1

Query: 1   MGICPCSKICLSELFLLHNQVNCHLFSR--RACYSSLSVLLVEDHVFDESPVAQI--KVL 60
           MGI   SK+C  +  LL  Q+  H   R  R  Y+S S LL+EDH FD  P  +     +
Sbjct: 1   MGIWSYSKLCFLQSGLLR-QIVVHHKKRLLRVYYASSSALLMEDHDFDCIPKVESDNNEV 60

Query: 61  PEINVRRNCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVH 120
            E+ V        R   LFP+VVRV KSLNW  AR+I F  AVK YG  +S+ AFRI++H
Sbjct: 61  GEVQVPEKRFKFCRNPSLFPIVVRVFKSLNWCAARKISFHNAVKMYGFDHSIYAFRIIIH 120

Query: 121 VFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFSGN 180
           +FA+  M+ME ++LL+DIVC+    K D+  L P+L+DS   V +S+ V ++LIKVF+ N
Sbjct: 121 IFAMTGMQMEAHALLRDIVCYCEGVKIDVFELLPYLLDSPEHVHRSTSVFNVLIKVFASN 180

Query: 181 SMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTY 240
            M+ NAV+VF++ K +GI   I SCNFLLKCL EANR D +R +FE++K SGP+P VYTY
Sbjct: 181 LMLGNAVDVFLEVKTIGIELSIMSCNFLLKCLLEANRGDIMRMMFEEMKNSGPSPNVYTY 240

Query: 241 TIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFI 300
           TIMMNF+C+G  GR  +I++AT L EE+   G  P+VVTYS YI  +C+V   + ALD I
Sbjct: 241 TIMMNFYCKGYYGRGADIEQATKLKEEMEIDGINPSVVTYSTYICGICRVGHVEFALDVI 300

Query: 301 QDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKR 360
           +DLR  N+P+N++CYN +++ FCQ+G+  EA KVL+EM   GILPD  SYSIL+DGFCKR
Sbjct: 301 RDLRSGNKPINSFCYNAVIYGFCQKGEPYEASKVLEEMRSCGILPDVHSYSILIDGFCKR 360

Query: 361 GENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVY 420
           G+  K  + I+EM+   +KPS+VTY+SL  GLCK G  D+SL +FR     GY++DLV Y
Sbjct: 361 GDFVKVFHFIDEMKHNDMKPSVVTYTSLFDGLCKSGRADVSLKLFRNFCTSGYEFDLVAY 420

Query: 421 DILIRGFFSHDDMEYAKKLLDEMVND-LCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLK 480
            +L++G     D++ A +L +EM+N+ L P A +FN +I   CKMG  DKA  L N+ML+
Sbjct: 421 SVLLKGLCLQGDLDSAMELFNEMINNGLIPTANSFNRLIHGFCKMGPLDKAWELFNIMLQ 480

Query: 481 YCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEK 540
              L    T N IVDGYC  G+LEEALKL+  M + GI PNSYT+N I+KRLC++  +EK
Sbjct: 481 RGVLPTVFTFNVIVDGYCNAGHLEEALKLINEMHELGIFPNSYTYNGIIKRLCKQSSVEK 540

Query: 541 VWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIIN 600
            WEL P MLK +I+     Y  ++DGFAK  N KKA+MLY RM +LGV P T   TI+IN
Sbjct: 541 AWELLPQMLKKDIIHD-SPYDIILDGFAKQLNPKKAMMLYTRMLKLGVTPTTYTYTILIN 600

Query: 601 MLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSP 660
           + CQ    Y A+KLF +L+ +G+  D I YT++I GF R GDM+KA ALF +M  +GCSP
Sbjct: 601 LFCQSGNMYEAWKLFMDLLGRGLIPDTIFYTTIIDGFCRVGDMRKAWALFREMPQKGCSP 660

Query: 661 TVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELV 720
            VVTYTCLI+GF  + R+D+ + L+ +MK+ ++ PDV+ YT LI G  RLG  ++A EL 
Sbjct: 661 NVVTYTCLINGFCNVHRMDVVNSLISEMKKRDINPDVVTYTALIAGYRRLGNADRALELF 720

Query: 721 DEVRAGGIISEDATFQ 732
            E+    I+ + A ++
Sbjct: 721 TEMIRNDILPDYAAYR 734

BLAST of Cla004832 vs. TrEMBL
Match: B9SNU2_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1278650 PE=4 SV=1)

HSP 1 Score: 672.5 bits (1734), Expect = 5.7e-190
Identity = 343/737 (46.54%), Postives = 493/737 (66.89%), Query Frame = 1

Query: 7   SKICLSE-LFLLHNQ------VNCHLFSRRACYSSLSVLLVEDHVFDESPVAQIKVLPEI 66
           ++ CLS  ++LL +       ++   +   A YS +S LL+ED V+D  PV       + 
Sbjct: 16  NRFCLSSTIYLLKSSLLRSIIISRERYRLEAHYSGVSALLLEDQVYDYIPVTDSNRFEKA 75

Query: 67  NVRRNCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVHVFA 126
               +C    RK  LFP V+ V K+LNW++A    F  AV  +G  +S+ AF+I++HV A
Sbjct: 76  RKPNSCP---RKRGLFPFVLTVFKTLNWKLATHTNFFKAVSFHGFSHSIYAFKIIIHVLA 135

Query: 127 LAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEK--SSIVLDLLIKVFSGNS 186
            A ++MEV   L+DI+ + ++   D+S LF  L+DS  D     S IV ++LIKVF+ N+
Sbjct: 136 SAGLQMEVQIFLRDIISYYKEVNLDVSELFSTLLDSPQDAHMGGSIIVANVLIKVFAENN 195

Query: 187 MVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYT 246
           M+ +A +VF+QA++ G+  +I SCNFLL C  EAN+ +F+R LFE+LK SGP+P V+TYT
Sbjct: 196 MLVDAADVFVQARRFGLELNILSCNFLLNCFAEANQTEFIRSLFEELKDSGPSPNVFTYT 255

Query: 247 IMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQ 306
           IMMN++C+G  G++++I +AT +LEE+  +GE PTVVTY  YI  LC+    + AL  I+
Sbjct: 256 IMMNYYCKGSFGKNIDIVKATEVLEEMEMNGESPTVVTYGAYIHGLCRAGCVEFALRLIR 315

Query: 307 DLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRG 366
           DLR++NQPLN+YCYN ++H+FC+ G+  EA ++L++M  +GI P A SYSIL+DG CK+G
Sbjct: 316 DLRIRNQPLNSYCYNAVIHEFCRNGELHEAFELLEDMRSHGISPTAYSYSILIDGLCKKG 375

Query: 367 ENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYD 426
           + +K L+LIEEM    +KPSLVTYSSL  GLCK GL ++SL +F  L   GYK+D++ Y+
Sbjct: 376 QVEKALDLIEEMVQSNVKPSLVTYSSLFDGLCKSGLTEISLSMFHNLGAEGYKHDVISYN 435

Query: 427 ILIRGFFSHDDMEYAKKLLDEM-VNDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKY 486
            LI GF    DM  A KL+ EM +N   P++F FN +I   CK    DKA+ +  +MLK 
Sbjct: 436 TLINGFVLQRDMGSACKLVHEMRMNGSVPNSFTFNRLIHGFCKRQRLDKALEVFTIMLKV 495

Query: 487 CGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKV 546
               +  TCN + D + REG+  EALKL+  ++D GI+PNSYT+NI++K LC+E + EK 
Sbjct: 496 GVQLNIFTCNIMADEFNREGHFWEALKLINEVQDLGIVPNSYTYNIVIKWLCKEQKTEKA 555

Query: 547 WELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINM 606
           WE+ P MLK N+ P  +HY+TLIDG+AK SN  KAL+LY +M ++G+PP  V  T++INM
Sbjct: 556 WEVLPVMLKNNVFPCAIHYNTLIDGYAKQSNPTKALLLYAKMLKVGIPPSIVTYTMLINM 615

Query: 607 LCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPT 666
              R++   AY LFKE+I KG+  D+I++T +IAGF + GDMK A AL+ +MS  G SP 
Sbjct: 616 FSNRSKMQEAYYLFKEMIKKGLVPDEIIFTCIIAGFCKVGDMKSAWALYEEMSQWGKSPN 675

Query: 667 VVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVD 726
           VVTYTCLIDG+FK++R+D A FL + MKR+N+TPD + YT LI G   LG  ++ RE+ +
Sbjct: 676 VVTYTCLIDGYFKIKRMDKADFLFNKMKRDNVTPDGLTYTALIFGYQSLGYSDRVREMFN 735

Query: 727 EVRAGGIISEDATFQML 734
           E++  G+      +  L
Sbjct: 736 EMKENGVFPNYTAYATL 749

BLAST of Cla004832 vs. TrEMBL
Match: A0A151RA84_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_039226 PE=4 SV=1)

HSP 1 Score: 656.0 bits (1691), Expect = 5.5e-185
Identity = 339/713 (47.55%), Postives = 489/713 (68.58%), Query Frame = 1

Query: 27  SRRACY-----SSLSVLLVEDHVFDESPVAQIKVLPEINVRRNCSILRRKGKLFPLVVRV 86
           S+R C+     S  S L++EDHVFDESP +       +       +   + +LFPLV RV
Sbjct: 14  SQRQCFFRFHSSVSSALMLEDHVFDESPKSYANFFINMPAPH---VPTTRRELFPLVARV 73

Query: 87  LKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVHVFALAEMRMEVYSLLKDIVCHTRKA 146
            KSL+W VA +IRF + V+ +G  +S+  FRI++HVFALA MR+EV++LL+D+V    +A
Sbjct: 74  FKSLSWTVASEIRFGSWVESHGFSHSVNCFRIIIHVFALAGMRLEVFALLRDVVGFCSEA 133

Query: 147 KQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSC 206
           K D   LF  L+DS + VE+S++V D+LIKVF+ NSM+ENA++VF+ AK +G+ PDI  C
Sbjct: 134 KYDTFELFSTLLDSPHHVERSAVVFDVLIKVFASNSMLENALDVFVNAKHVGLEPDIRVC 193

Query: 207 NFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNFFCQGLNGRDVNIKEATFLL 266
           NFLLKCL EANR +FV   FE L   GP P +YTYTIMMNF+   + G D  ++ A  +L
Sbjct: 194 NFLLKCLVEANRLEFVGWFFEKLTAFGPLPNIYTYTIMMNFYYNDV-GCDAGMRRAAVML 253

Query: 267 EELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQE 326
            ++  SGE+PTVVTYS YI  LCKV   ++AL  I++L  +NQPLN++ +N +++ +C+ 
Sbjct: 254 GKIYLSGEKPTVVTYSTYIHGLCKVGCVEAALMLIRNLHYENQPLNSHSFNAVIYGYCKR 313

Query: 327 GKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTY 386
           G+  EA++VL EM   GILPD  SYSIL++ FC +G+  K L+L+EEMEL Q+KPS+V+Y
Sbjct: 314 GEVCEALQVLDEMKSSGILPDVYSYSILINAFCMKGDVVKCLDLMEEMELSQIKPSIVSY 373

Query: 387 SSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEMV- 446
           +SLI+GLCK+ LM  ++DIF  +     KYD  VY+ LI GF    D++ A KLL EM+ 
Sbjct: 374 TSLIHGLCKKNLMQNAVDIFHSIDASSCKYDQTVYETLIDGFCIQGDVDSAIKLLKEMIS 433

Query: 447 NDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEE 506
           N+L P AF+  ++IR  CK+G F +A+ + N ML+     DT+TCN+I+DG CREG+ +E
Sbjct: 434 NNLVPTAFSCRSLIRGFCKLGLFHQALEVFNTMLQDGIWPDTITCNYILDGSCREGHFKE 493

Query: 507 ALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLID 566
           AL L+   ++HG   N +++N I+ +LC+E   E+  EL P MLK N++P +V+YSTLI 
Sbjct: 494 ALTLLEDFQEHGFNLNPHSYNAIIYKLCKESYPERALELLPRMLKRNVIPAVVNYSTLIS 553

Query: 567 GFAKHSNMKKALMLYERMERLGVPPDTVASTIIINMLCQRNETYRAYKLFKELIVKGMNL 626
           GFAK SN KKA++L+ +M + G+  +    TI+I++  +  + + AY +FKE+  +G+  
Sbjct: 554 GFAKQSNFKKAVILFTKMVKDGISFNAATYTILISIFSRNRKMHEAYGIFKEMRERGLRP 613

Query: 627 DKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTVVTYTCLIDGFFKLRRLDLASFLV 686
           D+I YT++IAGF    +MKKA ALF +MS EGC P V+TYTCLIDGF K  R+DLA++L 
Sbjct: 614 DQISYTTLIAGFCNNREMKKAWALFEEMSREGCLPNVITYTCLIDGFCKSNRIDLATWLF 673

Query: 687 DDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDEVRAGGIISEDATFQML 734
           D M R+++ PDV+ YTVLI    + G I++A +L DE++A G++ +D T  +L
Sbjct: 674 DKMNRDSVIPDVVTYTVLIAWYHKHGYIDQAYKLYDEMKAKGVLPDDITHMVL 722

BLAST of Cla004832 vs. NCBI nr
Match: gi|359474464|ref|XP_003631475.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g63330 [Vitis vinifera])

HSP 1 Score: 751.5 bits (1939), Expect = 1.4e-213
Identity = 390/761 (51.25%), Postives = 523/761 (68.73%), Query Frame = 1

Query: 3   ICPCSKICLSELFLLHNQVNCHLFSRRACY--SSLSVLLVEDHVFDESPVAQIKVLPEIN 62
           +C  ++ CL E  LL  +V  +L      Y  +S ++LL +DHVFD SP   +  L EIN
Sbjct: 1   MCSIARFCLVEPVLLQIRVVSNLKCLFRVYRTASSALLLDDDHVFDTSPGVPVDDLVEIN 60

Query: 63  VRR----------------NCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGH 122
           +++                  S + RK  L P+VV+V KSLNW VAR I+FST +K+YG 
Sbjct: 61  IQKCQLRVGRKRNEIRVMKRRSRIHRKHVLSPVVVKVFKSLNWEVARHIKFSTTMKKYGF 120

Query: 123 LNSLCAFRIMVHVFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSI 182
             S+ AFR +V+V ALA M MEVY+LL+DIVC+  K   D   LFP L++S  D  +S I
Sbjct: 121 SRSIDAFRTVVNVLALAGMHMEVYALLRDIVCYYNKVNLDAFELFPILLESPKDAARSVI 180

Query: 183 VLDLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDL 242
           V DLLIKVF+ NSM+ENAV+VF+QAKK G+     SCNFLLKCL EANR +F+R LFE++
Sbjct: 181 VFDLLIKVFAANSMLENAVDVFLQAKKTGLELSTRSCNFLLKCLAEANRREFLRSLFEEM 240

Query: 243 KTSGPNPTVYTYTIMMNFFCQGLNGR-DVNIKEATFLLEELMRSGERPTVVTYSIYIRAL 302
           K++GP P V+TYTIMMNF+C+G  G  D++ ++AT +LEE+ R+GE PTVVTYS YI  L
Sbjct: 241 KSTGPPPNVFTYTIMMNFYCKGNFGEADIDTRQATEILEEMERNGESPTVVTYSTYIYGL 300

Query: 303 CKVVSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDA 362
           C+V   +SALDF++ L   N  +N YCYN I+H  C++G+ +EA+KVL+EM   GI PD 
Sbjct: 301 CRVGYVESALDFVRSLISANGLVNVYCYNAIIHGLCKKGELDEALKVLEEMKSCGISPDV 360

Query: 363 CSYSILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRK 422
            +YSIL+ GFCK+G+ +KGL LIEEM+   ++PSLV+YSSL +GLCK+ L D+SLDIFR 
Sbjct: 361 YTYSILIHGFCKQGDVEKGLYLIEEMKYSNMEPSLVSYSSLFHGLCKKRLSDISLDIFRD 420

Query: 423 LQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEMV-NDLCPDAFNFNAMIRWLCKMGH 482
           L   GYKYD   Y ILI+GF    D++ A KL++EMV N+L PD  NF +++   CKMG 
Sbjct: 421 LGAAGYKYDQTAYSILIKGFCMQGDLDSAHKLMEEMVRNNLAPDPSNFESLVHGFCKMGL 480

Query: 483 FDKAIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNI 542
           +  A+   N+ML+   L    TCN I+D +CREG +EEAL LM  M+  GI PN +T+N 
Sbjct: 481 WVNALEFFNMMLEGGILPSIATCNVIIDAHCREGRVEEALNLMNEMQTQGIFPNLFTYNA 540

Query: 543 IMKRLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLG 602
           ++ RLC+E + E+  ELFP MLK N+LP +V YSTLIDGFAK SN +KALMLY RM ++G
Sbjct: 541 VINRLCKERKSERALELFPLMLKRNVLPSVVVYSTLIDGFAKQSNSQKALMLYARMLKIG 600

Query: 603 VPPDTVASTIIINMLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKAC 662
           V PD VA TI+IN+LC R+    AY LFK++   GM  DKI YTS+IAGF R GDM+KA 
Sbjct: 601 VTPDMVAYTILINILCHRSRMCEAYNLFKKMTENGMTPDKISYTSVIAGFCRIGDMRKAW 660

Query: 663 ALFNKMSNEGCSPTVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGL 722
           ALFN+M   G  PTVVTYT L+DG+ K+ R+D+A  L+D+MKR  +TPDV+ Y VLI   
Sbjct: 661 ALFNEMLQRGHLPTVVTYTSLVDGYCKMNRIDIADMLIDEMKRKGITPDVVTYNVLIAAH 720

Query: 723 LRLGKIEKARELVDEVRAGGIISEDATFQMLASSIVNQKLE 744
            R G ++KA E+++E++  G++ +  T+ ML   +  +KL+
Sbjct: 721 RRRGNLDKALEMLNEMKENGVLPDHMTYMMLEWLLKAKKLK 761

BLAST of Cla004832 vs. NCBI nr
Match: gi|590679692|ref|XP_007040653.1| (Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao])

HSP 1 Score: 745.3 bits (1923), Expect = 1.0e-211
Identity = 390/741 (52.63%), Postives = 520/741 (70.18%), Query Frame = 1

Query: 1   MGICPCSKICLSELFLLHNQV----NCHLFSRRACYSSLSVLLVEDHVFDESP--VAQIK 60
           MGI   SKICL +  LL + V     C L   R  YS+ S LL+EDHVFD SP  V+   
Sbjct: 1   MGIWSYSKICLVQSGLLRDVVVHYKKCLL---RVYYSASSALLLEDHVFDCSPEVVSVNN 60

Query: 61  VLPEINVRRNCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIM 120
            + E+ V R      R  +L P VVRV KSLNW +AR+IRF+ A K YG  +S+ AFRI+
Sbjct: 61  EVEELQVPRKTFEFCRNPRLTPFVVRVFKSLNWDIAREIRFNMAAKMYGFDHSMYAFRII 120

Query: 121 VHVFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFS 180
           +H+FA+A M+ME ++LL+DIVC+ ++ K D+  L  +L+DS   V +S+ V ++LIKVF+
Sbjct: 121 IHIFAMAGMQMEAHALLRDIVCYYKEVKTDMFELLLYLLDSPEHVHRSADVFNVLIKVFA 180

Query: 181 GNSMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVY 240
            NSM+EN ++VF+QAKK+G+ P+I SCNFLLKCL EANR +FVR LFED+K SGP+P VY
Sbjct: 181 SNSMLENGIDVFVQAKKIGLEPNIMSCNFLLKCLVEANRGEFVRSLFEDMKNSGPSPNVY 240

Query: 241 TYTIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALD 300
           TYTIMMNF+C G  GRDV++ +A  LLE++ R G+ P+VVTYS YI  LC+V   + ALD
Sbjct: 241 TYTIMMNFYCNGYCGRDVDVGQANNLLEDMERGGKNPSVVTYSTYIGGLCRVGCVELALD 300

Query: 301 FIQDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFC 360
           FI+ L   NQP+N++CYN I++ FCQ+G+  E +KVL+EM   GI PD  SYSIL+DGFC
Sbjct: 301 FIRKLCFGNQPINSFCYNAIIYGFCQKGEPYEGLKVLEEMKHCGISPDVHSYSILIDGFC 360

Query: 361 KRGENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLV 420
           K+G+ +KG+NLI+EM +  +KPSLVTY+SL +GLCK GL D+SL++FR L   GY+YDL 
Sbjct: 361 KKGDCEKGINLIDEMIVNGMKPSLVTYTSLFHGLCKSGLADVSLNLFRNLANDGYEYDLA 420

Query: 421 VYDILIRGFFSHDDMEYAKKLLDEMV-NDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLM 480
            Y +L++GF    D++ A +L + M  N L P   +FN +I   CKMG  DKA+ L N+M
Sbjct: 421 AYSVLLKGFCLQGDVDSAMELFEGMFSNSLIPTTNSFNRLIHGFCKMGLLDKALELFNIM 480

Query: 481 LKYCGLADTM-TCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGR 540
           L+  G++ T+ TCN I DGYC+ G+LEEALKL+  M + GI PNSYT+N I+KRLC +  
Sbjct: 481 LQ-SGVSPTIFTCNVIADGYCKAGHLEEALKLINEMHEFGIFPNSYTYNGIIKRLCMQSY 540

Query: 541 LEKVWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTI 600
             K WEL P M+K NIL   VH + L++GFA+ S  KKALMLY RM +LG    T+  TI
Sbjct: 541 SGKAWELLPQMIKKNILHN-VHCNILMNGFAEQSKPKKALMLYARMLKLGFTRTTITHTI 600

Query: 601 IINMLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEG 660
           +IN+  QR + Y AY LFK++I KG+  D I YTS+IAGF R  DMKKA AL+ +M   G
Sbjct: 601 LINIFSQRCKMYEAYSLFKDMIAKGLIPDTISYTSVIAGFCRVRDMKKAWALYTEMLRRG 660

Query: 661 CSPTVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKAR 720
            SP VVTYTCLIDGF  + R+D+A+ L+D+MKR  + PDV+ YT LI G  RLG I++A 
Sbjct: 661 YSPNVVTYTCLIDGFCHIHRMDMANLLIDEMKRREINPDVVTYTALISGYRRLGDIDRAH 720

Query: 721 ELVDEVRAGGIISEDATFQML 734
           EL  E+++ GI+ +DA +  L
Sbjct: 721 ELFAEMKSKGIVPDDAAYSAL 736

BLAST of Cla004832 vs. NCBI nr
Match: gi|823147550|ref|XP_012473689.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like isoform X1 [Gossypium raimondii])

HSP 1 Score: 691.4 bits (1783), Expect = 1.7e-195
Identity = 364/736 (49.46%), Postives = 493/736 (66.98%), Query Frame = 1

Query: 1   MGICPCSKICLSELFLLHNQVNCHLFSR--RACYSSLSVLLVEDHVFDESPVAQI--KVL 60
           MGI   SK+C  +  LL  Q+  H   R  R  Y+S S LL+EDH FD  P  +     +
Sbjct: 1   MGIWSYSKLCFLQSGLLR-QIVVHHKKRLLRVYYASSSALLMEDHDFDCIPKVESDNNEV 60

Query: 61  PEINVRRNCSILRRKGKLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVH 120
            E+ V        R   LFP+VVRV KSLNW  AR+I F  AVK YG  +S+ AFRI++H
Sbjct: 61  GEVQVPEKRFKFCRNPSLFPIVVRVFKSLNWCAARKISFHNAVKMYGFDHSIYAFRIIIH 120

Query: 121 VFALAEMRMEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFSGN 180
           +FA+  M+ME ++LL+DIVC+    K D+  L P+L+DS   V +S+ V ++LIKVF+ N
Sbjct: 121 IFAMTGMQMEAHALLRDIVCYCEGVKIDVFELLPYLLDSPEHVHRSTSVFNVLIKVFASN 180

Query: 181 SMVENAVEVFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTY 240
            M+ NAV+VF++ K +GI   I SCNFLLKCL EANR D +R +FE++K SGP+P VYTY
Sbjct: 181 LMLGNAVDVFLEVKTIGIELSIMSCNFLLKCLLEANRGDIMRMMFEEMKNSGPSPNVYTY 240

Query: 241 TIMMNFFCQGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFI 300
           TIMMNF+C+G  GR  +I++AT L EE+   G  P+VVTYS YI  +C+V   + ALD I
Sbjct: 241 TIMMNFYCKGYYGRGADIEQATKLKEEMEIDGINPSVVTYSTYICGICRVGHVEFALDVI 300

Query: 301 QDLRLKNQPLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKR 360
           +DLR  N+P+N++CYN +++ FCQ+G+  EA KVL+EM   GILPD  SYSIL+DGFCKR
Sbjct: 301 RDLRSGNKPINSFCYNAVIYGFCQKGEPYEASKVLEEMRSCGILPDVHSYSILIDGFCKR 360

Query: 361 GENDKGLNLIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVY 420
           G+  K  + I+EM+   +KPS+VTY+SL  GLCK G  D+SL +FR     GY++DLV Y
Sbjct: 361 GDFVKVFHFIDEMKHNDMKPSVVTYTSLFDGLCKSGRADVSLKLFRNFCTSGYEFDLVAY 420

Query: 421 DILIRGFFSHDDMEYAKKLLDEMVND-LCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLK 480
            +L++G     D++ A +L +EM+N+ L P A +FN +I   CKMG  DKA  L N+ML+
Sbjct: 421 SVLLKGLCLQGDLDSAMELFNEMINNGLIPTANSFNRLIHGFCKMGPLDKAWELFNIMLQ 480

Query: 481 YCGLADTMTCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEK 540
              L    T N IVDGYC  G+LEEALKL+  M + GI PNSYT+N I+KRLC++  +EK
Sbjct: 481 RGVLPTVFTFNVIVDGYCNAGHLEEALKLINEMHELGIFPNSYTYNGIIKRLCKQSSVEK 540

Query: 541 VWELFPAMLKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIIN 600
            WEL P MLK +I+     Y  ++DGFAK  N KKA+MLY RM +LGV P T   TI+IN
Sbjct: 541 AWELLPQMLKKDIIHD-SPYDIILDGFAKQLNPKKAMMLYTRMLKLGVTPTTYTYTILIN 600

Query: 601 MLCQRNETYRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSP 660
           + CQ    Y A+KLF +L+ +G+  D I YT++I GF R GDM+KA ALF +M  +GCSP
Sbjct: 601 LFCQSGNMYEAWKLFMDLLGRGLIPDTIFYTTIIDGFCRVGDMRKAWALFREMPQKGCSP 660

Query: 661 TVVTYTCLIDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELV 720
            VVTYTCLI+GF  + R+D+ + L+ +MK+ ++ PDV+ YT LI G  RLG  ++A EL 
Sbjct: 661 NVVTYTCLINGFCNVHRMDVVNSLISEMKKRDINPDVVTYTALIAGYRRLGNADRALELF 720

Query: 721 DEVRAGGIISEDATFQ 732
            E+    I+ + A ++
Sbjct: 721 TEMIRNDILPDYAAYR 734

BLAST of Cla004832 vs. NCBI nr
Match: gi|802615152|ref|XP_012075034.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Jatropha curcas])

HSP 1 Score: 683.3 bits (1762), Expect = 4.6e-193
Identity = 350/713 (49.09%), Postives = 485/713 (68.02%), Query Frame = 1

Query: 32  YSSLSVLLVEDHVFDESPVAQIKVL--------PEINVRRNCSILRRKGKLFPLVVRVLK 91
           YS++S  ++EDHVFD S V + K++        P+ + +       RK  LFP+V  + K
Sbjct: 28  YSAVSACMLEDHVFDSSAVDKGKIVGEALRCCNPKEHGKSKIGRNTRKSGLFPIVATIFK 87

Query: 92  SLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVHVFALAEMRMEVYSLLKDIVCHTRKAKQ 151
           +LNW +A   RF   V  +G  +S+ AFR++VH FA A ++MEV+ L+++I+ + +K   
Sbjct: 88  TLNWELATDKRFFRIVSDHGLSHSINAFRVIVHAFASAGLQMEVHFLIREIISYYKKVNL 147

Query: 152 DLSGLFPFLMDSNYDVEK--SSIVLDLLIKVFSGNSMVENAVEVFIQAKKMGIMPDISSC 211
           D+  LF  L+D   D     SS +++ LIKVF+ N M ENA++VF+QAKK G+ P I SC
Sbjct: 148 DVPDLFSTLLDLPADPHAGISSSIINALIKVFAENKMFENALDVFVQAKKFGLEPTILSC 207

Query: 212 NFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNFFCQGLNGRDVNIKEATFLL 271
           NFLLKC  EAN+ +FVR LFE+LK  GP+P VYTYTIMM+++C+G  G++++IKEA+ +L
Sbjct: 208 NFLLKCCIEANQVEFVRSLFEELKDFGPSPNVYTYTIMMDYYCKGHLGQNIDIKEASKVL 267

Query: 272 EELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLKNQPLNAYCYNPILHKFCQE 331
           EE+ ++G  PTVVTYS+YI  LC+     SA   ++ LR +N+PLN YCYN ++H FCQ+
Sbjct: 268 EEMEKTGRSPTVVTYSVYIHGLCRAGCVDSASKLLEFLRTENKPLNCYCYNAVIHGFCQK 327

Query: 332 GKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKGLNLIEEMELCQLKPSLVTY 391
           G   EA+++ ++M   GI PD  SYSIL+DGFCK G+    ++LIE+M+ C +KPSLVTY
Sbjct: 328 GDLFEALRLFEDMKNNGISPDIYSYSILIDGFCKNGDVKYAIDLIEKMDDCDVKPSLVTY 387

Query: 392 SSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDILIRGFFSHDDMEYAKKLLDEM-V 451
           SSL   LCK G  D SLD+FRKL   GYK D++ Y+IL+ GF    +ME A +L+DEM +
Sbjct: 388 SSLFNSLCKSGQTDDSLDVFRKLGASGYKLDVISYNILMNGFLLQGNMESAYELMDEMTL 447

Query: 452 NDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYCGLADTMTCNFIVDGYCREGNLEE 511
           N L P+ F FN +I   CK    DKA+ + NLML+      T TCN I++ YCR+G L++
Sbjct: 448 NGLIPNTFCFNRLIHEFCKRALLDKALEVFNLMLQVGVPPTTFTCNVIINEYCRQGFLKD 507

Query: 512 ALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVWELFPAMLKLNILPGLVHYSTLID 571
           ALK M  M+D GI+ NSYT+N ++K LC+E + EK WE+FP M+K N+ P  VH STL+D
Sbjct: 508 ALKFMNEMQDFGIVANSYTYNTVIKWLCKEQKSEKAWEVFPVMIKNNVFPSDVHCSTLMD 567

Query: 572 GFAKHSNMKKALMLYERMERLGVPPDTVASTIIINMLCQRNETYRAYKLFKELIVKGMNL 631
           GF K SN  KAL LY +M ++G+ P  V  TI+IN+  +RNE   AY LFKE+  K +  
Sbjct: 568 GFDKQSNAAKALQLYAKMLKVGILPSMVTYTILINIFSRRNEMREAYNLFKEMPRKDLLA 627

Query: 632 DKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTVVTYTCLIDGFFKLRRLDLASFLV 691
           DKI Y+ +IAGF R G+MKKA AL+ KM  +G SP VVTYTCLI GF KL  +D+ASFL+
Sbjct: 628 DKITYSCIIAGFCRDGNMKKAWALYKKMLQQGQSPNVVTYTCLIHGFCKLNCMDMASFLM 687

Query: 692 DDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDEVRAGGIISEDATFQML 734
           ++M+RNN+TPDV+ YT LI G  RLG ++KA  L DE++  GI+ +D  +  L
Sbjct: 688 EEMQRNNVTPDVVTYTSLIYGYQRLGYVDKASALFDEMKEKGILLDDIAYARL 740

BLAST of Cla004832 vs. NCBI nr
Match: gi|1012104533|ref|XP_015958293.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Arachis duranensis])

HSP 1 Score: 677.9 bits (1748), Expect = 2.0e-191
Identity = 351/724 (48.48%), Postives = 493/724 (68.09%), Query Frame = 1

Query: 7   SKICLSELFLLHNQVNCHLFSRRACYSSLS-VLLVEDHVFDESPVAQIKVLPEINVRRNC 66
           S  CL+    L ++++         Y+S S  L++ED VFDESP  +   + +  V R+ 
Sbjct: 2   SLTCLNGCIFLRSKLHPGQRCLSRLYNSASGALMLEDQVFDESPKFEPNAVVDFGVTRSV 61

Query: 67  SILRRKG-KLFPLVVRVLKSLNWRVARQIRFSTAVKRYGHLNSLCAFRIMVHVFALAEMR 126
             +   G +LFPLV +V KSLN+RVAR+ RF + V+ +G  +S+  FRI++H+FALA MR
Sbjct: 62  PRVPTTGSELFPLVGKVFKSLNFRVAREKRFGSWVESHGFSHSINCFRIIIHIFALAGMR 121

Query: 127 MEVYSLLKDIVCHTRKAKQDLSGLFPFLMDSNYDVEKSSIVLDLLIKVFSGNSMVENAVE 186
            EV++LL+DIV    +++ D   LF  L+DS + +E+S+IV D+L+KVF+ NSM+ENA  
Sbjct: 122 QEVFTLLRDIVEFCNESEYDTFELFSALLDSPHHLERSAIVFDVLMKVFASNSMLENAFN 181

Query: 187 VFIQAKKMGIMPDISSCNFLLKCLGEANREDFVRCLFEDLKTSGPNPTVYTYTIMMNFFC 246
           VF+ AK +G+  DI SCNFLLKCL EANR DFVR  FE+LK SGP+P +YTYTIMMNF+C
Sbjct: 182 VFVNAKHVGLESDIMSCNFLLKCLVEANRVDFVRPFFEELKDSGPSPNIYTYTIMMNFYC 241

Query: 247 QGLNGRDVNIKEATFLLEELMRSGERPTVVTYSIYIRALCKVVSGQSALDFIQDLRLKNQ 306
           +G  G DVNI+ AT +L ++  SG+RPTVVTY  YI  LCKV S   A   I +L  +N+
Sbjct: 242 RGGPGWDVNIRRATEILGKIYSSGQRPTVVTYRSYIHGLCKVGSLDVAFKLICNLSYRNK 301

Query: 307 PLNAYCYNPILHKFCQEGKTEEAMKVLQEMMGYGILPDACSYSILVDGFCKRGENDKGLN 366
           PLN++C+N I+  FC  G  ++A +VL++M   G++PDA SYSIL+D  CK G+ +K L+
Sbjct: 302 PLNSHCFNAIIRGFCIRGAVQKAFEVLEKMKISGVVPDAYSYSILIDALCKEGDVEKSLD 361

Query: 367 LIEEMELCQLKPSLVTYSSLIYGLCKRGLMDLSLDIFRKLQKLGYKYDLVVYDILIRGFF 426
           L+EEME  Q+KPS+V Y+SLI+GLCKRGLM+ ++D+F ++   G +YD  VY+ LI GF 
Sbjct: 362 LMEEMERYQVKPSVVIYTSLIHGLCKRGLMECAIDVFNRIGASGCEYDQTVYETLIDGFC 421

Query: 427 SHDDMEYAKKLLDEM-VNDLCPDAFNFNAMIRWLCKMGHFDKAIGLLNLMLKYCGLADTM 486
              D   A KLL EM +N+L    F F  +IR   K+GH+DKA  + N+ML+     DT+
Sbjct: 422 MDGDKNSANKLLQEMIINNLVHTTFGFRFLIRGFYKLGHYDKAFEVFNIMLRDGISPDTI 481

Query: 487 TCNFIVDGYCREGNLEEALKLMIYMKDHGIMPNSYTFNIIMKRLCEEGRLEKVWELFPAM 546
            CN++++GYCR+G LEEALKL+  + DHGI  NS+++N IM RLC+E   E+  EL P M
Sbjct: 482 ACNYVLNGYCRDGRLEEALKLLEELSDHGITLNSHSYNAIMYRLCKESYPERALELLPRM 541

Query: 547 LKLNILPGLVHYSTLIDGFAKHSNMKKALMLYERMERLGVPPDTVASTIIINMLCQRNET 606
           LK N +PG+V+YSTLI GF K  N +K++ML+ RM ++G+  + +  TI+IN+ C+  + 
Sbjct: 542 LKRNTVPGVVNYSTLISGFFKQLNFRKSVMLFTRMVKVGIAFNNITYTILINIFCRSGKM 601

Query: 607 YRAYKLFKELIVKGMNLDKILYTSMIAGFSRTGDMKKACALFNKMSNEGCSPTVVTYTCL 666
           + AY +F E+  +G+ LD I YTS+IAGFS  G++KKA A+F +MS EGC P V+TYTCL
Sbjct: 602 HGAYAIFNEMKERGLCLDVITYTSLIAGFSDAGELKKAWAIFEEMSREGCWPNVITYTCL 661

Query: 667 IDGFFKLRRLDLASFLVDDMKRNNLTPDVIVYTVLIVGLLRLGKIEKARELVDEVRAGGI 726
           IDG  K  R+DLAS+L D M ++ + PDV+ YTVLI      G  ++A  L  E+   GI
Sbjct: 662 IDGCCKSNRIDLASWLFDKMNKDAVKPDVVTYTVLIAWYRMHGLTDQAHNLYREMMTKGI 721

Query: 727 ISED 728
             +D
Sbjct: 722 FPDD 725

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR39_ARATH3.0e-6727.26Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop... [more]
PPR94_ARATH2.0e-6628.09Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana GN... [more]
PP247_ARATH2.6e-6627.56Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
PPR91_ARATH3.3e-6627.74Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
PP100_ARATH5.7e-6628.09Pentatricopeptide repeat-containing protein At1g63150 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
D7TTT9_VITVI9.7e-21451.25Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0012g01760 PE=4 SV=... [more]
A0A061GDS6_THECC6.9e-21252.63Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao G... [more]
A0A0D2R076_GOSRA1.2e-19549.46Uncharacterized protein OS=Gossypium raimondii GN=B456_004G065800 PE=4 SV=1[more]
B9SNU2_RICCO5.7e-19046.54Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A151RA84_CAJCA5.5e-18547.55Uncharacterized protein OS=Cajanus cajan GN=KK1_039226 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|359474464|ref|XP_003631475.1|1.4e-21351.25PREDICTED: pentatricopeptide repeat-containing protein At1g63330 [Vitis vinifera... [more]
gi|590679692|ref|XP_007040653.1|1.0e-21152.63Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao][more]
gi|823147550|ref|XP_012473689.1|1.7e-19549.46PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like isoform X1... [more]
gi|802615152|ref|XP_012075034.1|4.6e-19349.09PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Jatropha ... [more]
gi|1012104533|ref|XP_015958293.1|2.0e-19148.48PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Arachis d... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla004832Cla004832.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 442..473
score: 3.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 550..599
score: 9.3E-12coord: 376..424
score: 9.6E-8coord: 481..528
score: 1.3E-16coord: 623..669
score: 1.4E-14coord: 307..355
score: 7.0
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 188..243
score: 0.0023coord: 679..724
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 309..343
score: 1.6E-8coord: 345..377
score: 7.4E-4coord: 693..723
score: 8.3E-4coord: 414..441
score: 4.4E-4coord: 379..412
score: 4.9E-6coord: 450..476
score: 3.6E-6coord: 624..657
score: 2.2E-9coord: 518..550
score: 3.8E-5coord: 554..587
score: 1.6E-8coord: 484..516
score: 6.5E-7coord: 658..692
score: 7.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 162..196
score: 8.166coord: 377..411
score: 10.578coord: 232..271
score: 8.287coord: 621..655
score: 12.342coord: 307..341
score: 12.299coord: 412..442
score: 8.594coord: 481..515
score: 12.934coord: 197..231
score: 8.331coord: 446..476
score: 9.372coord: 516..550
score: 11.553coord: 691..725
score: 10.073coord: 586..620
score: 8.309coord: 656..690
score: 10.742coord: 272..306
score: 6.697coord: 551..585
score: 11.586coord: 342..376
score: 10
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 41..58
score: 1.8E-259coord: 176..732
score: 1.8E
NoneNo IPR availablePANTHERPTHR24015:SF486SUBFAMILY NOT NAMEDcoord: 41..58
score: 1.8E-259coord: 176..732
score: 1.8E
NoneNo IPR availableunknownSSF81901HCP-likecoord: 427..653
score: 7.3