Cla005843 (gene) Watermelon (97103) v1

Name: Cla005843
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7LLV1_ARALL); contains InterPro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr1 : 7526279 .. 7528642 (-)
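The location value can be read mechanically. The following is a minimal sketch, assuming the value always has the form `chromosome : start .. end (strand)` with 1-based inclusive coordinates; the function name `parse_location` is hypothetical and not part of the database.

```python
import re

# Assumed layout of the location value: "<chrom> : <start> .. <end> (<strand>)"
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(value):
    """Split a location value into its parts and compute the span length."""
    m = LOCATION.fullmatch(value)
    if m is None:
        raise ValueError(f"unrecognised location: {value!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    # Inclusive coordinates, so the span covers end - start + 1 bases.
    return chrom, start, end, strand, end - start + 1

print(parse_location("Chr1 : 7526279 .. 7528642 (-)"))
```

For this feature the span works out to 2364 bases on the minus strand.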
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGCTTCCCATTGAGCTTCTTGCCTTTGCTTCCCTTCTTTCAGCGATGTTACTCTTCTTTCGCACTCTTTTCCACGTTAGTCGCAGAGCTTCTTACCAAGTAATCTCTCTATCTTCTAATTCTTCGCATCCCGATTGCCTTTCTTTCAATGTATTTAATCCCTCATCATCTCTAACATCAATAAATGCCTATTGCATTTCTCGTCATTTTTTCTGGTTCACTAGCTTTCTTCGTATATTTCGGCTCCCTTTTGTTAGTTACTCGGGTACAAATAATTCATTTGAATTTTTAGACATTGGTACCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTATTTTATTGGATTCAGCACTAGCGCCCATCTGGGTCTCTAAGGTTTTAGTTGAACTGAAAGAAGATCCGAATTTAGCTCTTAAGTTCTTCAAATGGGCTGGAAGCCGGATTGGCTTCCGTCATACCACCGAGTCTTACTGCATTGTAGCTCACATGCTGTTTCGTGCGAGAATGTATACAAATGCTCATGATATTATTAAAGAAGTGATTGTGAAGAGCCGAATTGATGTGGGTTTTCCAGTTTTTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGCGTCAGGAACAGGAGTCTTTGACGTTTTATTTAGTGTTTTCGTAGAATTGGGTTTGCTCGACGAAGCTAACAAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTTCCGAAAGCACGTTCTTGCAATTTTCTTTTGCACAGATTATCAAAATCAGGTAATGGGCAGTTGGTGAGGAAGTTTTTCAATGACATGATTGGGGCTGGGATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGGGATTTGGAAAACGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTACTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAAGCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGGGTTAATCAATTGTTTTTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTAAGATGAAGAACGATGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGGGTTGGTCTTTTACCTAATGAATTCACTTACACTTCTCTGATTGACGCCCATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAACGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACTTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAAAATGATGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAAATGGAATATCTCCCAACCAGCAGGTTTACACTGCATTGGTTCATGGCTATATTAAGGCAGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGCACCATTATTTGGGGTCTCTGTAGTCAAAGCAAACTTGAAGAAACTAAACTTATTATTAAAGAAATGGAAAGTCAGGGTATTAGTGCAAATCCTGTTATATACACGACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCTGATGCAATGAATCTTCTTCAGGAAATGCAGGATGCTGGTGTTGAGGCTACTGTTGTAACCTACTGTGTACTAATTGATGGTTTGTGCAAAGCAGGTATGGTTGAACTTGCAGTTGATTATTTTGGTAGAATGTCTGATCTTGGTTTACAACCTAATGTTGCAGTTTATACGGCGCTAATTGATGGTCTTTGTAAAACTAATTGTGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGG
TATGACCCCGGATATAACTGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTCGGATTTGATTAGCAGAATGACAGAATCAGCTACCGAGTTTGATTTGCATGCTTATACTTCCTTGGTTTCGGGATTTTCTGAATATGGCGAGCTGCACCAAGCAAGGAAGTTTTTTAGTGAGATGATTGAGAAGGGCATACTTCCTGAGGAGATCTTATGTATATCTCTACTGAGAGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTTTAATTAGTGAAAATTGCAGCCTTGCAGTTCCCTGTCTAAAAACTTGA

mRNA sequence

ATGAAGCTTCCCATTGAGCTTCTTGCCTTTGCTTCCCTTCTTTCAGCGATGTTACTCTTCTTTCGCACTCTTTTCCACGTTAGTCGCAGAGCTTCTTACCAAGTAATCTCTCTATCTTCTAATTCTTCGCATCCCGATTGCCTTTCTTTCAATGTATTTAATCCCTCATCATCTCTAACATCAATAAATGCCTATTGCATTTCTCGTCATTTTTTCTGGTTCACTAGCTTTCTTCGTATATTTCGGCTCCCTTTTGTTAGTTACTCGGGTACAAATAATTCATTTGAATTTTTAGACATTGGTACCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTATTTTATTGGATTCAGCACTAGCGCCCATCTGGGTCTCTAAGGTTTTAGTTGAACTGAAAGAAGATCCGAATTTAGCTCTTAAGTTCTTCAAATGGGCTGGAAGCCGGATTGGCTTCCGTCATACCACCGAGTCTTACTGCATTGTAGCTCACATGCTGTTTCGTGCGAGAATGTATACAAATGCTCATGATATTATTAAAGAAGTGATTGTGAAGAGCCGAATTGATGTGGGTTTTCCAGTTTTTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGCGTCAGGAACAGGAGTCTTTGACGTTTTATTTAGTGTTTTCGTAGAATTGGGTTTGCTCGACGAAGCTAACAAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTTCCGAAAGCACGTTCTTGCAATTTTCTTTTGCACAGATTATCAAAATCAGGTAATGGGCAGTTGGTGAGGAAGTTTTTCAATGACATGATTGGGGCTGGGATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGGGATTTGGAAAACGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTACTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAAGCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGGGTTAATCAATTGTTTTTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTAAGATGAAGAACGATGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGGGTTGGTCTTTTACCTAATGAATTCACTTACACTTCTCTGATTGACGCCCATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAACGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACTTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAAAATGATGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAAATGGAATATCTCCCAACCAGCAGGTTTACACTGCATTGGTTCATGGCTATATTAAGGCAGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGCACCATTATTTGGGGTCTCTGTAGTCAAAGCAAACTTGAAGAAACTAAACTTATTATTAAAGAAATGGAAAGTCAGGGTATTAGTGCAAATCCTGTTATATACACGACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCTGATGCAATGAATCTTCTTCAGGAAATGCAGGATGCTGGTGTTGAGGCTACTGTTGTAACCTACTGTGTACTAATTGATGGTTTGTGCAAAGCAGGTATGGTTGAACTTGCAGTTGATTATTTTGGTAGAATGTCTGATCTTGGTTTACAACCTAATGTTGCAGTTTATACGGCGCTAATTGATGGTCTTTGTAAAACTAATTGTGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGG
TATGACCCCGGATATAACTGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTCGGATTTGATTAGCAGAATGACAGAATCAGCTACCGAGTTTGATTTGCATGCTTATACTTCCTTGGTTTCGGGATTTTCTGAATATGGCGAGCTGCACCAAGCAAGGAAGTTTTTTAGTGAGATGATTGAGAAGGGCATACTTCCTGAGGAGATCTTATGTATATCTCTACTGAGAGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTTTAATTAGTGAAAATTGCAGCCTTGCAGTTCCCTGTCTAAAAACTTGA

Coding sequence (CDS)

ATGAAGCTTCCCATTGAGCTTCTTGCCTTTGCTTCCCTTCTTTCAGCGATGTTACTCTTCTTTCGCACTCTTTTCCACGTTAGTCGCAGAGCTTCTTACCAAGTAATCTCTCTATCTTCTAATTCTTCGCATCCCGATTGCCTTTCTTTCAATGTATTTAATCCCTCATCATCTCTAACATCAATAAATGCCTATTGCATTTCTCGTCATTTTTTCTGGTTCACTAGCTTTCTTCGTATATTTCGGCTCCCTTTTGTTAGTTACTCGGGTACAAATAATTCATTTGAATTTTTAGACATTGGTACCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTATTTTATTGGATTCAGCACTAGCGCCCATCTGGGTCTCTAAGGTTTTAGTTGAACTGAAAGAAGATCCGAATTTAGCTCTTAAGTTCTTCAAATGGGCTGGAAGCCGGATTGGCTTCCGTCATACCACCGAGTCTTACTGCATTGTAGCTCACATGCTGTTTCGTGCGAGAATGTATACAAATGCTCATGATATTATTAAAGAAGTGATTGTGAAGAGCCGAATTGATGTGGGTTTTCCAGTTTTTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGCGTCAGGAACAGGAGTCTTTGACGTTTTATTTAGTGTTTTCGTAGAATTGGGTTTGCTCGACGAAGCTAACAAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTTCCGAAAGCACGTTCTTGCAATTTTCTTTTGCACAGATTATCAAAATCAGGTAATGGGCAGTTGGTGAGGAAGTTTTTCAATGACATGATTGGGGCTGGGATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGGGATTTGGAAAACGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTACTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAAGCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGGGTTAATCAATTGTTTTTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTAAGATGAAGAACGATGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGGGTTGGTCTTTTACCTAATGAATTCACTTACACTTCTCTGATTGACGCCCATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAACGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACTTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAAAATGATGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAAATGGAATATCTCCCAACCAGCAGGTTTACACTGCATTGGTTCATGGCTATATTAAGGCAGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGCACCATTATTTGGGGTCTCTGTAGTCAAAGCAAACTTGAAGAAACTAAACTTATTATTAAAGAAATGGAAAGTCAGGGTATTAGTGCAAATCCTGTTATATACACGACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCTGATGCAATGAATCTTCTTCAGGAAATGCAGGATGCTGGTGTTGAGGCTACTGTTGTAACCTACTGTGTACTAATTGATGGTTTGTGCAAAGCAGGTATGGTTGAACTTGCAGTTGATTATTTTGGTAGAATGTCTGATCTTGGTTTACAACCTAATGTTGCAGTTTATACGGCGCTAATTGATGGTCTTTGTAAAACTAATTGTGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGG
TATGACCCCGGATATAACTGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTCGGATTTGATTAGCAGAATGACAGAATCAGCTACCGAGTTTGATTTGCATGCTTATACTTCCTTGGTTTCGGGATTTTCTGAATATGGCGAGCTGCACCAAGCAAGGAAGTTTTTTAGTGAGATGATTGAGAAGGGCATACTTCCTGAGGAGATCTTATGTATATCTCTACTGAGAGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTTTAATTAGTGAAAATTGCAGCCTTGCAGTTCCCTGTCTAAAAACTTGA

Protein sequence

MKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCLKT
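The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame; `translate` is a hypothetical helper, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS shown above.
print(translate("ATGAAGCTTCCCATTGAG"))  # MKLPIE
```

Applied to the start of the CDS, the result matches the first residues of the protein sequence (MKLPIE), and the final codons CTA AAA ACT TGA give the C-terminal LKT followed by a stop.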
BLAST of Cla005843 vs. Swiss-Prot
Match: PP143_ARATH (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 2.0e-247
Identity = 426/769 (55.40%), Positives = 560/769 (72.82%), Query Frame = 1

Query: 17  MLLFFRTLFHVSRRASYQVISLSSNSSHPDC-LSFNVFNPSSSLTSINAYCISRHFFWFT 76
           M    R   HV+RR    V   SS+ S     L F + +PS S +S     IS  F WFT
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPLSSPSPSQSSF----ISCPFVWFT 60

Query: 77  SFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLV 136
           SFL I R PFV+ SGT+   E  D   +RK++  DLW+DP +  L D  LAPIWV +VLV
Sbjct: 61  SFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRVLV 120

Query: 137 ELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDV 196
           ELKEDP LA KFFKW+ +R GF+H+ ESYCIVAH+LF ARMY +A+ ++KE+++ S+ D 
Sbjct: 121 ELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKADC 180

Query: 197 GFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSC 256
                ++FD+LWSTRN+C  G GVFD LFSV ++LG+L+EA +CFS+M++FR  PK RSC
Sbjct: 181 -----DVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSC 240

Query: 257 NFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQ 316
           N LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+ 
Sbjct: 241 NGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKF 300

Query: 317 MGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQ 376
            G  PD VTYNS+IDG+GKVG L++ V  F EMKD+ C PDVITYN LINCFCKF K+P 
Sbjct: 301 RGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPI 360

Query: 377 AFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLID 436
             E+  +MK +GLKPNVV+YSTL+DAFCKEGMMQ AIK +VDMRRVGL+PNE+TYTSLID
Sbjct: 361 GLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLID 420

Query: 437 AHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISP 496
           A+CK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  +M EAEE+F  M   G+ P
Sbjct: 421 ANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIP 480

Query: 497 NQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLII 556
           N   Y AL+HG++KA+ M+ A+E+L ++    IKPDL+LYGT IWGLCS  K+E  K+++
Sbjct: 481 NLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVM 540

Query: 557 KEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKA 616
            EM+  GI AN +IYTT++DAYFK+G  ++ ++LL EM++  +E TVVT+CVLIDGLCK 
Sbjct: 541 NEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKN 600

Query: 617 GMVELAVDYFGRMS-DLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITA 676
            +V  AVDYF R+S D GLQ N A++TA+IDGLCK N VE+A  LF++M  +G+ PD TA
Sbjct: 601 KLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTA 660

Query: 677 FTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMI 736
           +T+L+DGN K GN+ EA  L  +M E   + DL AYTSLV G S   +L +AR F  EMI
Sbjct: 661 YTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMI 720

Query: 737 EKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVP 784
            +GI P+E+LCIS+L+++Y+LG +DEA+EL++ + +  L++ +   A+P
Sbjct: 721 GEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALP 759
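The percentages in an HSP summary line follow directly from its match counts. Here is a minimal sketch, assuming the summary line keeps the `Identity = a/n ..., Positives = b/n` layout used in this report; `hsp_percentages` is a hypothetical helper name.

```python
import re

# Match counts on the HSP summary line. Some exports spell the second
# label "Postives", so the pattern accepts both spellings.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

print(hsp_percentages("Identity = 426/769 (55.40%), Positives = 560/769 (72.82%)"))
```

For the alignment above, 426 of 769 aligned positions are identical (55.40%) and 560 of 769 are identical or conservative substitutions (72.82%).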

BLAST of Cla005843 vs. Swiss-Prot
Match: PP141_ARATH (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 2.6e-101
Identity = 207/564 (36.70%), Positives = 319/564 (56.56%), Query Frame = 1

Query: 232 LLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYN 291
           ++ EA +  SR+RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 292 VMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDV 351
            ++ ++CK G ++ A  +   M + G  PDV++YNSLIDG+ + G +  A  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 352 G---CVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMM 411
               C PD++++N L N F K + + + F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKC-CSPNVVTYSTWIDTFCKSGEL 180

Query: 412 QGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 471
           Q A+K F  M+R  L PN  T+T LID +CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 472 MDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNI 531
           +DG C+ G+M  AEE++  M+++ + PN  VYT ++ G+ +    ++AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 532 KPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMN 591
           + D+  YG II GLC   KL+E   I+++ME   +  + VI+TT+++AYFK+G+   A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 592 LLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLC 651
           +  ++ + G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYF-----CIEKANDVMYTVLIDALC 420

Query: 652 KTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLH 711
           K       ++LF ++   G+ PD   +T+ I G  K GNL +A  L +RM +     DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 712 AYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEM 771
           AYT+L+ G +  G + +AR+ F EM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 772 QRRGLI--------SENCSLAVPC 785
           QRRGL+        S+ C   V C
Sbjct: 541 QRRGLVTAVSDADCSKQCGNEVNC 558

BLAST of Cla005843 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 368.2 bits (944), Expect = 2.2e-100
Identity = 209/650 (32.15%), Positives = 344/650 (52.92%), Query Frame = 1

Query: 128 IWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEV 187
           IWV   L+++K D  L L FF WA SR       ES CIV H+   ++    A  +I   
Sbjct: 91  IWV---LMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 188 IVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFR 247
             + +++V       FD+L  T     S   VFDV F V V+ GLL EA + F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 248 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGIAPSVFTYNVMIDYLCKEGDLENA 307
            +    SCN  L RLSK           F +    G+  +V +YN++I ++C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 308 RRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINC 367
             L + M   G+TPDV++Y+++++GY + G L++   L   MK  G  P+   Y  +I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 368 FCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 427
            C+  K+ +A E  S+M   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 428 EFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFR 487
             TYT++I   C+ G++ EA KL ++M   G+  + VT+T L++G C+ G M +A  V  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 488 AMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQS 547
            M++ G SPN   YT L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC   
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 548 KLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYC 607
            +EE   ++ E E+ G++A+ V YTT++DAY K+G+   A  +L+EM   G++ T+VT+ 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 608 VLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCR 667
           VL++G C  GM+E        M   G+ PN   + +L+   C  N +++A  ++ +M  R
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 668 GMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQA 727
           G+ PD   +  L+ G+ K  N++EA  L   M        +  Y+ L+ GF +  +  +A
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 728 RKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISE 777
           R+ F +M  +G+  ++ +        YK  + D  ++  +E+    L+ E
Sbjct: 691 REVFDQMRREGLAADKEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDE 735

BLAST of Cla005843 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 341.7 bits (875), Expect = 2.2e-92
Identity = 204/637 (32.03%), Positives = 325/637 (51.02%), Query Frame = 1

Query: 143 LALKFFKWAGSRIGFR--HTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVGFPVF 202
           LALKF KW   + G    H  +  CI  H+L RARMY  A  I+KE+ + S    G   F
Sbjct: 52  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS----GKSSF 111

Query: 203 NIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLH 262
            +F  L +T  +C S   V+D+L  V++  G++ ++ + F  M  +   P   +CN +L 
Sbjct: 112 -VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 171

Query: 263 RLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTP 322
            + KSG    V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M + G+ P
Sbjct: 172 SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 231

Query: 323 DVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYL 382
            +VTYN+++  Y K G  + A+ L + MK  G   DV TYN LI+  C+  ++ + +  L
Sbjct: 232 TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 291

Query: 383 SKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKA 442
             M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID H   
Sbjct: 292 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 351

Query: 443 GNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVY 502
           GN  EA K+   M   G+  + V+Y  L+DGLC++ +   A   +  M +NG+   +  Y
Sbjct: 352 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 411

Query: 503 TALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMES 562
           T ++ G  K   +++A+ +L +M++  I PD++ Y  +I G C   + +  K I+  +  
Sbjct: 412 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 471

Query: 563 QGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVEL 622
            G+S N +IY+T+I    + G   +A+ + + M   G      T+ VL+  LCKAG V  
Sbjct: 472 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 531

Query: 623 AVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALID 682
           A ++   M+  G+ PN   +  LI+G   +     A  +FDEM   G  P    + +L+ 
Sbjct: 532 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 591

Query: 683 GNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILP 742
           G  K G+L+EA   +  +       D   Y +L++   + G L +A   F EM+++ ILP
Sbjct: 592 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 651

Query: 743 EEILCISLLREYYKLGQLDEAIELKNEMQRRGLISEN 778
           +     SL+    + G+   AI    E + RG +  N
Sbjct: 652 DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPN 683


HSP 2 Score: 190.7 bits (483), Expect = 6.3e-47
Identity = 139/532 (26.13%), Positives = 237/532 (44.55%), Query Frame = 1

Query: 220  FDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMI 279
            F+VL +   + G + EA +    M     LP   S + L++    SG G      F++M 
Sbjct: 511  FNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMT 570

Query: 280  GAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLE 339
              G  P+ FTY  ++  LCK G L  A +    +  +    D V YN+L+    K G L 
Sbjct: 571  KVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLA 630

Query: 340  EAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDG-LKPNVVTYSTL 399
            +AV LF EM     +PD  TY  LI+  C+  K   A  +  + +  G + PN V Y+  
Sbjct: 631  KAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCF 690

Query: 400  IDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGV 459
            +D   K G  +  I     M  +G  P+  T  ++ID + + G + +   L  +M     
Sbjct: 691  VDGMFKAGQWKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNG 750

Query: 460  NLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAME 519
              N+ TY  L+ G  +   +  +  ++R+++ NGI P++    +LV G  ++  +E  ++
Sbjct: 751  GPNLTTYNILLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLK 810

Query: 520  ILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYF 579
            ILK      ++ D   +  +I   C+  ++     ++K M S GIS +      ++    
Sbjct: 811  ILKAFICRGVEVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGISLDKDTCDAMVSVLN 870

Query: 580  KAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVA 639
            +  +  ++  +L EM   G+      Y  LI+GLC+ G ++ A      M    + P   
Sbjct: 871  RNHRFQESRMVLHEMSKQGISPESRKYIGLINGLCRVGDIKTAFVVKEEMIAHKICPPNV 930

Query: 640  VYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRM 699
              +A++  L K    + A  L   M    + P I +FT L+    K GN+ EA +L   M
Sbjct: 931  AESAMVRALAKCGKADEATLLLRFMLKMKLVPTIASFTTLMHLCCKNGNVIEALELRVVM 990

Query: 700  TESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLR 751
            +    + DL +Y  L++G    G++  A + + EM   G L       +L+R
Sbjct: 991  SNCGLKLDLVSYNVLITGLCAKGDMALAFELYEEMKGDGFLANATTYKALIR 1042

BLAST of Cla005843 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 6.6e-89
Identity = 200/655 (30.53%), Positives = 328/655 (50.08%), Query Frame = 1

Query: 121 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 180
           L +   P   S +L++ + D  L LKF  WA     F  T    CI  H+L + ++Y  A
Sbjct: 42  LSANFTPEAASNLLLKSQNDQALILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTA 101

Query: 181 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 240
             + ++V  K+  D    +  +F  L  T ++C S + VFD++   +  L L+D+A    
Sbjct: 102 QILAEDVAAKTLDDEYASL--VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIV 161

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSG-NGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCK 300
              +    +P   S N +L    +S  N       F +M+ + ++P+VFTYN++I   C 
Sbjct: 162 HLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCF 221

Query: 301 EGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVIT 360
            G+++ A  LF +M   G  P+VVTYN+LIDGY K+  +++   L   M   G  P++I+
Sbjct: 222 AGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLIS 281

Query: 361 YNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR 420
           YN +IN  C+  +M +    L++M   G   + VTY+TLI  +CKEG    A+ +  +M 
Sbjct: 282 YNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEML 341

Query: 421 RVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMM 480
           R GL P+  TYTSLI + CKAGN+  A +  + M   G+  N  TYT L+DG  + G M 
Sbjct: 342 RHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMN 401

Query: 481 EAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTII 540
           EA  V R M  NG SP+   Y AL++G+    +MEDA+ +L+ M E  + PD++ Y T++
Sbjct: 402 EAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVL 461

Query: 541 WGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVE 600
            G C    ++E   + +EM  +GI  + + Y+++I  + +  ++ +A +L +EM   G+ 
Sbjct: 462 SGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLP 521

Query: 601 ATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKL 660
               TY  LI+  C  G +E A+     M + G+ P+V  Y+ LI+GL K +    AK+L
Sbjct: 522 PDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 581

Query: 661 FDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSE 720
             ++      P    +  LI                    E+ +  +  +  SL+ GF  
Sbjct: 582 LLKLFYEESVPSDVTYHTLI--------------------ENCSNIEFKSVVSLIKGFCM 641

Query: 721 YGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLI 775
            G + +A + F  M+ K   P+      ++  + + G + +A  L  EM + G +
Sbjct: 642 KGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 672


HSP 2 Score: 272.3 bits (695), Expect = 1.6e-71
Identity = 168/561 (29.95%), Positives = 279/561 (49.73%), Query Frame = 1

Query: 236 ANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMID 295
           A   F  M + +  P   + N L+     +GN  +    F+ M   G  P+V TYN +ID
Sbjct: 189 AENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLID 248

Query: 296 YLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVP 355
             CK   +++  +L   M   G  P++++YN +I+G  + G ++E  ++  EM   G   
Sbjct: 249 GYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSL 308

Query: 356 DVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLF 415
           D +TYN LI  +CK     QA    ++M   GL P+V+TY++LI + CK G M  A++  
Sbjct: 309 DEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFL 368

Query: 416 VDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCED 475
             MR  GL PNE TYT+L+D   + G + EA+++  +M   G + ++VTY AL++G C  
Sbjct: 369 DQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVT 428

Query: 476 GKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILY 535
           GKM +A  V   M + G+SP+   Y+ ++ G+ ++  +++A+ + ++M E  IKPD I Y
Sbjct: 429 GKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITY 488

Query: 536 GTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQD 595
            ++I G C Q + +E   + +EM   G+  +   YT +I+AY   G    A+ L  EM +
Sbjct: 489 SSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVE 548

Query: 596 AGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVY--------------- 655
            GV   VVTY VLI+GL K      A     ++      P+   Y               
Sbjct: 549 KGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSV 608

Query: 656 TALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTE 715
            +LI G C    +  A ++F+ M  +   PD TA+  +I G+ + G++++A  L   M +
Sbjct: 609 VSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVK 668

Query: 716 SATEFDLHAYT--SLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQL 775
           S   F LH  T  +LV    + G++++       ++    L E      L+   ++ G +
Sbjct: 669 SG--FLLHTVTVIALVKALHKEGKVNELNSVIVHVLRSCELSEAEQAKVLVEINHREGNM 728

Query: 776 DEAIELKNEMQRRGLISENCS 780
           D  +++  EM + G +    S
Sbjct: 729 DVVLDVLAEMAKDGFLPNGIS 747

BLAST of Cla005843 vs. TrEMBL
Match: W9SE38_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000446 PE=4 SV=1)

HSP 1 Score: 964.1 bits (2491), Expect = 1.0e-277
Identity = 473/767 (61.67%), Positives = 599/767 (78.10%), Query Frame = 1

Query: 17  MLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTS 76
           MLLF R LFH SRRAS +V   S +  +P          S    S N+  ++    WFTS
Sbjct: 22  MLLFLRNLFHTSRRASTRVSPFSPSIPYPHNCDLLPSLRSVYGKSSNSCIVACPLAWFTS 81

Query: 77  FLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVE 136
           FL + R PF S S  + S E LD   LR+I++QD W+DPKIV L DSA+API VS+ LVE
Sbjct: 82  FLFLVRFPFYSKSSASFSLEVLDREQLRRIVEQDQWHDPKIVNLFDSAIAPILVSRFLVE 141

Query: 137 LKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVG 196
           LKE P LALK FKW  +R GFRHT ESYCI+ H+LF ARM+ +A+ +++E++  +R+   
Sbjct: 142 LKEYPFLALKLFKWVRNRTGFRHTAESYCILVHILFYARMFFDANGVLRELVSSNRV--- 201

Query: 197 FPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCN 256
            P  ++FD+LWSTRN+C  G GVFD LFSV VELG+L+EAN+CF +MRKF  LPK RSCN
Sbjct: 202 LPGCDVFDVLWSTRNVCVPGFGVFDALFSVLVELGMLEEANQCFLKMRKFHVLPKPRSCN 261

Query: 257 FLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQM 316
             LHRLSK G   + RKFF DM+ AGIAPSVFTYN+MI+YLCKEGD++ AR LF +M+  
Sbjct: 262 AFLHRLSKLGKVDMSRKFFKDMVAAGIAPSVFTYNIMINYLCKEGDMDEARSLFEEMKHR 321

Query: 317 GFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQA 376
           G  PD+VTYNSLIDG+GKVG ++EA+ +F +MKDVGC PD+IT+N LINCF K +++P+A
Sbjct: 322 GLIPDIVTYNSLIDGFGKVGNMDEAICIFEKMKDVGCEPDIITFNALINCFGKSQRLPRA 381

Query: 377 FEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 436
            E+L +++N GLKPNVVTYSTLIDAFCKEGMM+ A+K FVDMRRVGL PNE+TYTSL+DA
Sbjct: 382 LEFLHELRNHGLKPNVVTYSTLIDAFCKEGMMREALKFFVDMRRVGLFPNEYTYTSLVDA 441

Query: 437 HCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPN 496
           +CKAGNLTEA KL+N+MLQAG+NLNIV Y+AL++ LCEDG+M EAE+VF  MLK G++PN
Sbjct: 442 NCKAGNLTEALKLTNEMLQAGINLNIVGYSALLNCLCEDGRMKEAEKVFMEMLKAGVTPN 501

Query: 497 QQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIK 556
            QVY++LVHGY+KA++ E A + LK+M E  IKPDL+LYGTIIWGLCSQ+KLEE++L++ 
Sbjct: 502 LQVYSSLVHGYVKAKKTEKAFQTLKEMEEKKIKPDLLLYGTIIWGLCSQNKLEESELVVN 561

Query: 557 EMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAG 616
           EM S+G++AN  IYTT++DAYFKAGK+++A+ LLQEM   G+E  VVTYC LIDGLCK G
Sbjct: 562 EMRSRGLNANHFIYTTLMDAYFKAGKTTEALLLLQEMHYYGIEVNVVTYCALIDGLCKRG 621

Query: 617 MVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFT 676
           +VE A DYF RM  +GLQPNVAVYTALIDGLCK N +E+AKKLFDEM  +G++PD TA+T
Sbjct: 622 LVEEATDYFDRMVSIGLQPNVAVYTALIDGLCKNNRIEAAKKLFDEMLEKGISPDRTAYT 681

Query: 677 ALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEK 736
            LIDGNLK G+LQEA  L +RM E   E DL+AYTSL+ GFS++G++ QA+ +  EMI K
Sbjct: 682 TLIDGNLKHGHLQEALTLKNRMIEMGMELDLYAYTSLIWGFSQFGQVQQAKTWLDEMIGK 741

Query: 737 GILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVP 784
           GILP+EILC+ LLR+YY+LG + EA EL++E+ +RGLI   C+ AVP
Sbjct: 742 GILPDEILCVCLLRKYYELGNVVEADELRDELVKRGLIKGACTYAVP 785

BLAST of Cla005843 vs. TrEMBL
Match: W9S012_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000854 PE=4 SV=1)

HSP 1 Score: 959.9 bits (2480), Expect = 1.9e-276
Identity = 472/767 (61.54%), Positives = 598/767 (77.97%), Query Frame = 1

Query: 17  MLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTS 76
           MLLF R LF  SRRAS +V   S +  +P          S    S N+  ++    WFTS
Sbjct: 22  MLLFLRNLFLTSRRASTRVSPFSPSIPYPHNCDLLPSLRSVYGKSSNSCIVACPLAWFTS 81

Query: 77  FLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVE 136
           FL + R PF S S  + S E LD   LR+I++QD W+DPKIV L DSA+API VS+ LVE
Sbjct: 82  FLFLVRFPFYSKSSASFSLEVLDREQLRRIVEQDQWHDPKIVNLFDSAIAPILVSRFLVE 141

Query: 137 LKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVG 196
           LKE P LALK FKW  +R GFRHT ESYCI+ H+LF ARM+ +A+ +++E++  +R+   
Sbjct: 142 LKEYPFLALKLFKWVRNRTGFRHTAESYCILVHILFYARMFFDANGVLRELVSSNRV--- 201

Query: 197 FPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCN 256
            P  ++FD+LWSTRN+C  G GVFD LFSV VELG+L+EAN+CF +MRKF  LPK RSCN
Sbjct: 202 LPGCDVFDVLWSTRNVCVPGFGVFDALFSVLVELGMLEEANQCFLKMRKFHVLPKPRSCN 261

Query: 257 FLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQM 316
             LHRLSK G   + RKFF DM+ AGIAPSVFTYN+MI+YLCKEGD++ AR LF +M+  
Sbjct: 262 AFLHRLSKLGKVDMSRKFFKDMVAAGIAPSVFTYNIMINYLCKEGDMDEARSLFEEMKHR 321

Query: 317 GFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQA 376
           G  PD+VTYNSLIDG+GKVG ++EA+ +F +MKDVGC PD+IT+N LINCF K +++P+A
Sbjct: 322 GLIPDIVTYNSLIDGFGKVGNMDEAICIFEKMKDVGCEPDIITFNALINCFGKSQRLPRA 381

Query: 377 FEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 436
            E+L +++N GLKPNVVTYSTLIDAFCKEGMM+ A+K FVDMRRVGL PNE+TYTSL+DA
Sbjct: 382 LEFLHELRNHGLKPNVVTYSTLIDAFCKEGMMREALKFFVDMRRVGLFPNEYTYTSLVDA 441

Query: 437 HCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPN 496
           +CKAGNLTEA KL+N+MLQAG+NLNIV Y+AL++ LCEDG+M EAE+VF  MLK G++PN
Sbjct: 442 NCKAGNLTEALKLTNEMLQAGINLNIVGYSALLNCLCEDGRMKEAEKVFMEMLKAGVTPN 501

Query: 497 QQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIK 556
            QVY++LVHGY+KA++ E A + LK+M E  IKPDL+LYGTIIWGLCSQ+KLEE++L++ 
Sbjct: 502 LQVYSSLVHGYVKAKKTEKAFQTLKEMEEKKIKPDLLLYGTIIWGLCSQNKLEESELVVN 561

Query: 557 EMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAG 616
           EM S+G++AN  IYTT++DAYFKAGK+++A+ LLQEM   G+E  VVTYC LIDGLCK G
Sbjct: 562 EMRSRGLNANHFIYTTLMDAYFKAGKTTEALLLLQEMHYYGIEVNVVTYCALIDGLCKRG 621

Query: 617 MVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFT 676
           +VE A DYF RM  +GLQPNVAVYTALIDGLCK N +E+AKKLFDEM  +G++PD TA+T
Sbjct: 622 LVEEATDYFDRMVSIGLQPNVAVYTALIDGLCKNNRIEAAKKLFDEMLEKGISPDRTAYT 681

Query: 677 ALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEK 736
            LIDGNLK G+LQEA  L +RM E   E DL+AYTSL+ GFS++G++ QA+ +  EMI K
Sbjct: 682 TLIDGNLKHGHLQEALTLKNRMIEMGMELDLYAYTSLIWGFSQFGQVQQAKTWLDEMIGK 741

Query: 737 GILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVP 784
           GILP+EILC+ LLR+YY+LG + EA EL++E+ +RGLI   C+ AVP
Sbjct: 742 GILPDEILCVCLLRKYYELGNVVEADELRDELVKRGLIKGACTYAVP 785

BLAST of Cla005843 vs. TrEMBL
Match: A0A061E9Z5_THECC (Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_011095 PE=4 SV=1)

HSP 1 Score: 958.7 bits (2477), Expect = 4.2e-276
Identity = 474/780 (60.77%), Positives = 594/780 (76.15%), Query Frame = 1

Query: 9   AFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCIS 68
           A  S  + ML+  R+LFH++RR     I +    SHP  L F    P +     N   I 
Sbjct: 6   AALSFFTKMLVSLRSLFHINRR-----IPVCVRVSHPFPL-FQNSRPLNFFPPSNNSIIV 65

Query: 69  RHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIG--TLRKIIQQDLWNDPKIVILLDSALA 128
             F   TSF  + + PF +   +N      D    ++ KIIQQD WNDPKIV L DS+LA
Sbjct: 66  CPFILLTSFFYMMKFPFGTKCNSNTHIFLDDFNRESICKIIQQDQWNDPKIVTLFDSSLA 125

Query: 129 PIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKE 188
           PIWVSK+LV LK++P LALKFFKWA +  GF HT+ESYCI+ H+LF  RMY++A  I+KE
Sbjct: 126 PIWVSKILVGLKQEPKLALKFFKWAKTHKGFGHTSESYCILVHILFYGRMYSDASAILKE 185

Query: 189 VIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKF 248
            I+  R  V  P  + FD+LWSTRN+C  G GVFD LFSV V+LG+L+EA++CFS+M+++
Sbjct: 186 FILL-RQRVVLPGCDFFDVLWSTRNVCRYGFGVFDALFSVLVDLGMLEEASQCFSKMKRY 245

Query: 249 RTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENA 308
           R LPK RSCN LLHRLSK+G     R+FF +MIG G+APSVFTYN++IDY+CKEG+L+ A
Sbjct: 246 RVLPKVRSCNALLHRLSKTGRRDQSRRFFAEMIGVGVAPSVFTYNILIDYMCKEGELDTA 305

Query: 309 RRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINC 368
           R LF QM+Q+G TPD+VTYNSLIDGYGKVGLL+E ++LF EMK V C PD+ITYN LINC
Sbjct: 306 RMLFGQMKQIGLTPDIVTYNSLIDGYGKVGLLDEVIFLFEEMKSVECAPDIITYNALINC 365

Query: 369 FCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 428
           FCKF++MPQAFE+  +M+N GLKPNVVTYSTLIDAFCKEGMMQ  IK  VDMRRVGLLPN
Sbjct: 366 FCKFQRMPQAFEFFREMRNKGLKPNVVTYSTLIDAFCKEGMMQQGIKFLVDMRRVGLLPN 425

Query: 429 EFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFR 488
            FTYTSLIDA CKAG+LTEA KL+N+MLQ  V+LNIVTYT ++DGLCE G+  EAEE+FR
Sbjct: 426 VFTYTSLIDATCKAGSLTEALKLANEMLQENVDLNIVTYTTIIDGLCEAGRTKEAEEIFR 485

Query: 489 AMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQS 548
           AMLK  + PN  +YTAL HGY+K ++ME A+ +LK+M E +IKPDL+LYGTIIWGLC+Q 
Sbjct: 486 AMLKAALKPNVHIYTALAHGYMKVKKMEHALNLLKEMKEKSIKPDLLLYGTIIWGLCNQD 545

Query: 549 KLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYC 608
           K+EETK+++ EM+   +S+NPVIYTT++D+YFKAGK+++A+NLL+EM D G+E TVVT+C
Sbjct: 546 KIEETKVVMSEMKESRLSSNPVIYTTVMDSYFKAGKTAEALNLLEEMSDLGIEVTVVTFC 605

Query: 609 VLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCR 668
           VL+DGLCK G+V  A++YF RMS+  LQPNVA YT LIDGLCK N +++AK +FDEM  +
Sbjct: 606 VLVDGLCKTGLVLEAINYFNRMSEFNLQPNVAAYTVLIDGLCKNNFIQAAKNMFDEMLSK 665

Query: 669 GMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQA 728
            + PD TA+TALIDGNLK GN QEA +L + M E   E DL AYTSLV GF + G+L QA
Sbjct: 666 NLVPDKTAYTALIDGNLKHGNFQEALNLQNEMIEMGIELDLPAYTSLVWGFCQCGQLQQA 725

Query: 729 RKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCLK 787
           RKF  EMI K ILP+EILCI +LR+YY+LG +DEAIEL+NEM +RGLI+     AVP ++
Sbjct: 726 RKFLDEMIRKHILPDEILCIGVLRKYYELGHVDEAIELQNEMAKRGLITSPIHYAVPSVQ 778

BLAST of Cla005843 vs. TrEMBL
Match: B9RY36_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0814140 PE=4 SV=1)

HSP 1 Score: 921.4 bits (2380), Expect = 7.5e-265
Identity = 461/738 (62.47%), Positives = 576/738 (78.05%), Query Frame = 1

Query: 42  SSHPDCLSFNVFNPSSSLTSINA--YCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLD 101
           SS+P+     VF+  S + S  +  YC        T FL I R PF++ S        LD
Sbjct: 14  SSNPNAHLPFVFSSPSLVPSHGSLSYC---PLMLLTGFLCILRFPFITQSSFLGQ---LD 73

Query: 102 IGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRH 161
             ++ KIIQQD WNDPK V  +DS+L PIWVS+VLVELK+DP LALKFF+WA ++ GF  
Sbjct: 74  KASIIKIIQQDQWNDPKFVRFIDSSLGPIWVSRVLVELKQDPKLALKFFRWAKTKFGFCL 133

Query: 162 TTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGV 221
           TTESYC++ H+LF ARMY +A+  +KE+I   RI    P F++F++LWSTRN+C  G GV
Sbjct: 134 TTESYCLLVHILFYARMYFDANFFLKELISSRRI---LPGFDVFEVLWSTRNVCVPGFGV 193

Query: 222 FDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMI 281
           FD LFSVF+ELG+L+EA +CFSRM +FR  PKARSCN  L+RL+K+G G L  KFF DM+
Sbjct: 194 FDALFSVFIELGMLEEAGQCFSRMTRFRVFPKARSCNAFLYRLAKTGKGDLSNKFFRDMV 253

Query: 282 GAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLE 341
           GAGIA SVFTYN+MI Y+CKEGD+  A+ LF QM+QMG TPD+VTYNSLIDGYGK+GLL+
Sbjct: 254 GAGIAQSVFTYNIMIGYMCKEGDMVTAKSLFHQMKQMGLTPDIVTYNSLIDGYGKLGLLD 313

Query: 342 EAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLI 401
           E+  LF EMKDVGC PDVITYN LINCFCK+E+MP+AF +L +MKN GLKPNVVTYSTLI
Sbjct: 314 ESFCLFEEMKDVGCEPDVITYNALINCFCKYEQMPKAFHFLHEMKNSGLKPNVVTYSTLI 373

Query: 402 DAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVN 461
           DA CKE M+Q AIK  +DMRRVGL PNEFTYTSLIDA+CKAG L++A KL+++MLQ  V 
Sbjct: 374 DALCKEHMLQQAIKFLLDMRRVGLSPNEFTYTSLIDANCKAGYLSDALKLADEMLQVQVG 433

Query: 462 LNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEI 521
            N+VTYT L+DGLC++G+MMEAE++FRAM+K G++PN + YTALVHG+IK +R+E+A+E+
Sbjct: 434 FNVVTYTTLLDGLCKEGRMMEAEDLFRAMIKAGVTPNLKTYTALVHGHIKNKRVENALEL 493

Query: 522 LKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFK 581
           LK++ E  IKPDL+LYGTIIWGLCSQ+KLEE + ++ EM++ GI AN VIYT  +DAYFK
Sbjct: 494 LKEIKEKKIKPDLLLYGTIIWGLCSQNKLEECEFVMSEMKACGIRANSVIYTIRMDAYFK 553

Query: 582 AGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQP-NVA 641
            GK+ +A+NLLQEM D GVE T+VT+CVLIDGLCK G+VE A+DYF RM+D  LQP NVA
Sbjct: 554 TGKTVEALNLLQEMCDLGVEVTIVTFCVLIDGLCKKGLVEEAIDYFARMADFNLQPNNVA 613

Query: 642 VYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRM 701
           V TALIDGLCK N +E+AKKLFDEMQ + M PD  A+TALIDGNLK  + QEA ++ SRM
Sbjct: 614 VCTALIDGLCKNNYIEAAKKLFDEMQDKNMVPDKIAYTALIDGNLKHKDFQEALNIRSRM 673

Query: 702 TESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQL 761
           +E   E DLHAYTSLV G S+   + QAR F +EMI KGI+P+EILCI LLR+YY+LG +
Sbjct: 674 SELGMELDLHAYTSLVWGLSQGNLVQQARMFLNEMIGKGIVPDEILCIRLLRKYYELGSI 733

Query: 762 DEAIELKNEMQRRGLISE 777
           DEAIEL +E+ ++  + E
Sbjct: 734 DEAIELHDELLKKVPLDE 742

BLAST of Cla005843 vs. TrEMBL
Match: K7L5N5_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G090700 PE=4 SV=1)

HSP 1 Score: 885.9 bits (2288), Expect = 3.5e-254
Identity = 459/771 (59.53%), Positives = 571/771 (74.06%), Query Frame = 1

Query: 17  MLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTS 76
           MLLF R   ++  RAS +V   SS  S P    F +F   SSL+S N+   +R   WFTS
Sbjct: 1   MLLFAR---NIGGRASLRV---SSFHSSPLQNPFPLFLTPSSLSSQNSI-FARPVIWFTS 60

Query: 77  FLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVE 136
           FL + R PFVS      SF+ +   ++R  +QQD    P    L DSALAPIWVSK LV+
Sbjct: 61  FLCVIRYPFVS----KPSFDDIASESMRSFLQQD---GPH---LSDSALAPIWVSKALVK 120

Query: 137 LKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVG 196
           LK DP  ALKFFK AG+R GFRH  ESYC++AH+LF    Y +A  +IKE I+  R    
Sbjct: 121 LKGDPKSALKFFKEAGARAGFRHAAESYCVLAHILFCGMFYLDARSVIKEWILLGR---E 180

Query: 197 FPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCN 256
           FP  + FDMLWSTRN+C  G GVFD LF+V V+LG+L+EA +CF +M KFR LPK RSCN
Sbjct: 181 FPGCDFFDMLWSTRNVCRPGFGVFDTLFNVLVDLGMLEEARQCFWKMNKFRVLPKVRSCN 240

Query: 257 FLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQM 316
            LLHRLSKS  G L   FF DM+ AG++PSVFTYN++I  L +EGDLE AR LF +M+  
Sbjct: 241 ELLHRLSKSSKGGLALSFFKDMVVAGLSPSVFTYNMVIGCLAREGDLEAARSLFEEMKAK 300

Query: 317 GFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQA 376
           G  PD+VTYNSLIDGYGKVG+L  AV +F EMKD GC PDVITYN LINCFCKFE++PQA
Sbjct: 301 GLRPDIVTYNSLIDGYGKVGMLTGAVSVFEEMKDAGCEPDVITYNSLINCFCKFERIPQA 360

Query: 377 FEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 436
           FEYL  MK  GL+PNVVTYSTLIDAFCK GM+  A K FVDM RVGL PNEFTYTSLIDA
Sbjct: 361 FEYLHGMKQRGLQPNVVTYSTLIDAFCKAGMLLEANKFFVDMIRVGLQPNEFTYTSLIDA 420

Query: 437 HCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPN 496
           +CK G+L EA+KL ++M QAGVNLNIVTYTAL+DGLCEDG+M EAEE+F A+LK G + N
Sbjct: 421 NCKIGDLNEAFKLESEMQQAGVNLNIVTYTALLDGLCEDGRMREAEELFGALLKAGWTLN 480

Query: 497 QQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIK 556
           QQ+YT+L HGYIKA+ ME AM+IL++M + N+KPDL+LYGT IWGLC Q+++E++  +I+
Sbjct: 481 QQIYTSLFHGYIKAKMMEKAMDILEEMNKKNLKPDLLLYGTKIWGLCRQNEIEDSMAVIR 540

Query: 557 EMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAG 616
           EM   G++AN  IYTT+IDAYFK GK+++A+NLLQEMQD G++ TVVTY VLIDGLCK G
Sbjct: 541 EMMDCGLTANSYIYTTLIDAYFKVGKTTEAVNLLQEMQDLGIKITVVTYGVLIDGLCKIG 600

Query: 617 MVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFT 676
           +V+ AV YF  M+  GLQPN+ +YTALIDGLCK +C+E AK LF+EM  +G++PD   +T
Sbjct: 601 LVQQAVRYFDHMTRNGLQPNIMIYTALIDGLCKNDCLEEAKNLFNEMLDKGISPDKLVYT 660

Query: 677 ALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEK 736
           +LIDGN+K GN  EA  L +RM E   E DL AYTSL+ GFS YG++  A+    EM+ K
Sbjct: 661 SLIDGNMKHGNPGEALSLRNRMVEIGMELDLCAYTSLIWGFSRYGQVQLAKSLLDEMLRK 720

Query: 737 GILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCLKT 788
           GI+P+++LCI LLR+YY+LG ++EA+ L ++M RRGLIS    + VP + T
Sbjct: 721 GIIPDQVLCICLLRKYYELGDINEALALHDDMARRGLISGTIDITVPSVHT 751

BLAST of Cla005843 vs. NCBI nr
Match: gi|449463537|ref|XP_004149490.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucumis sativus])

HSP 1 Score: 1376.7 bits (3562), Expect = 0.0e+00
Identity = 678/784 (86.48%), Positives = 734/784 (93.62%), Query Frame = 1

Query: 1   MKLPIELL-AFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSL 60
           MKL  ELL AFASLLSAMLLFFRTLFHVSRRAS++VISLSSNSSHPD LSFNVFNPSSSL
Sbjct: 2   MKLSAELLLAFASLLSAMLLFFRTLFHVSRRASFRVISLSSNSSHPDSLSFNVFNPSSSL 61

Query: 61  TSINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVI 120
           TSINAYCISR FFWFTSFL IFRLPFVSYS  NNSF++LDIG+LRKIIQQDLWNDPKIV+
Sbjct: 62  TSINAYCISRPFFWFTSFLCIFRLPFVSYSNANNSFQYLDIGSLRKIIQQDLWNDPKIVV 121

Query: 121 LLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTN 180
           L DSALAPIWVSK+L+ L+EDP LALKFFKWAGS++GFRHTTESYCI+ H++FRARMYT+
Sbjct: 122 LFDSALAPIWVSKILLGLREDPKLALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYTD 181

Query: 181 AHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKC 240
           AHD +KEVI+ SR+D+GFPV NIFDMLWSTRNIC SG+GVFDVLFSVFVELGLL+EAN+C
Sbjct: 182 AHDTVKEVIMNSRMDMGFPVCNIFDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANEC 241

Query: 241 FSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCK 300
           FSRMR FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCK
Sbjct: 242 FSRMRNFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCK 301

Query: 301 EGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVIT 360
           EGDLEN+RRLFVQMR+MG +PDVVTYNSLIDGYGKVG LEE   LFNEMKDVGCVPD+IT
Sbjct: 302 EGDLENSRRLFVQMREMGLSPDVVTYNSLIDGYGKVGSLEEVASLFNEMKDVGCVPDIIT 361

Query: 361 YNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR 420
           YNGLINC+CKFEKMP+AFEY S+MKN+GLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR
Sbjct: 362 YNGLINCYCKFEKMPRAFEYFSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR 421

Query: 421 RVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMM 480
           R GLLPNEFTYTSLIDA+CKAGNLTEAWKL NDMLQAGV LNIVTYTAL+DGLC+ G+M+
Sbjct: 422 RTGLLPNEFTYTSLIDANCKAGNLTEAWKLLNDMLQAGVKLNIVTYTALLDGLCKAGRMI 481

Query: 481 EAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTII 540
           EAEEVFR+MLK+GISPNQQVYTALVHGYIKAERMEDAM+ILKQMTECNIKPDLILYG+II
Sbjct: 482 EAEEVFRSMLKDGISPNQQVYTALVHGYIKAERMEDAMKILKQMTECNIKPDLILYGSII 541

Query: 541 WGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVE 600
           WG CSQ KLEETKLI++EM+S+GISANPVI TTIIDAYFKAGKSSDA+N  QEMQD GVE
Sbjct: 542 WGHCSQRKLEETKLILEEMKSRGISANPVISTTIIDAYFKAGKSSDALNFFQEMQDVGVE 601

Query: 601 ATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKL 660
           AT+VTYCVLIDGLCKAG+VELAVDYF RM  LGLQPNVAVYT+LIDGLCK NC+ESAKKL
Sbjct: 602 ATIVTYCVLIDGLCKAGIVELAVDYFCRMLSLGLQPNVAVYTSLIDGLCKNNCIESAKKL 661

Query: 661 FDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSE 720
           FDEMQCRGMTPDITAFTALIDGNLK GNLQEA  LISRMTE A EFDLH YTSLVSGFS+
Sbjct: 662 FDEMQCRGMTPDITAFTALIDGNLKHGNLQEALVLISRMTELAIEFDLHVYTSLVSGFSQ 721

Query: 721 YGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCS 780
            GELHQARKFF+EMIEKGILPEE+LCI LLREYYK GQLDEAIELKNEM+R GLI+E+ +
Sbjct: 722 CGELHQARKFFNEMIEKGILPEEVLCICLLREYYKRGQLDEAIELKNEMERMGLITESAT 781

Query: 781 LAVP 784
           +  P
Sbjct: 782 MQFP 785

BLAST of Cla005843 vs. NCBI nr
Match: gi|659072656|ref|XP_008466646.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucumis melo])

HSP 1 Score: 1369.4 bits (3543), Expect = 0.0e+00
Identity = 678/785 (86.37%), Positives = 733/785 (93.38%), Query Frame = 1

Query: 1   MKLPIELL--AFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSS 60
           MKL +ELL  AF SLLSAMLLFFRTLFHVSRRAS++VISLSSNSSHPD LSFNVFNPSSS
Sbjct: 2   MKLSVELLLLAFPSLLSAMLLFFRTLFHVSRRASFRVISLSSNSSHPDSLSFNVFNPSSS 61

Query: 61  LTSINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIV 120
           LTSINAY ISR FFWFTSFL IFRLPFVSYS  NNS EFLDIG+LRKIIQQDLWNDPKIV
Sbjct: 62  LTSINAYRISRPFFWFTSFLCIFRLPFVSYSNANNSIEFLDIGSLRKIIQQDLWNDPKIV 121

Query: 121 ILLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYT 180
           +L DSALAPIWVS++LV LKEDP LALKFFKWAGS++GFRHTTESYCI+ H++FRARMYT
Sbjct: 122 VLFDSALAPIWVSRILVGLKEDPKLALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYT 181

Query: 181 NAHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANK 240
           +AHD +KEVI+K+RID+GFPV NIFDMLWSTRNIC SG+GVFDVLFSVFVELGLL+EAN+
Sbjct: 182 DAHDTVKEVIMKNRIDMGFPVCNIFDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANE 241

Query: 241 CFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 300
           CFSRMR FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC
Sbjct: 242 CFSRMRNFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 301

Query: 301 KEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVI 360
           KEGDLENARRLFVQMR+MG +PDVVTYNSLIDGYGKVG LEEAV  FNEMKDVGCVPD+I
Sbjct: 302 KEGDLENARRLFVQMREMGLSPDVVTYNSLIDGYGKVGSLEEAVSFFNEMKDVGCVPDII 361

Query: 361 TYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDM 420
           TYNGLINC+CKFEKMP+AFEY S+MKN+GLKPNVVTYSTLIDAFCKEGMMQGA+KLFVDM
Sbjct: 362 TYNGLINCYCKFEKMPRAFEYFSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAVKLFVDM 421

Query: 421 RRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKM 480
           +R GLLPNEFTYTSLIDA+CKAGNLTEAWKL NDMLQAGV LNIVTYTAL+DGLCEDG+M
Sbjct: 422 KRAGLLPNEFTYTSLIDANCKAGNLTEAWKLLNDMLQAGVKLNIVTYTALVDGLCEDGRM 481

Query: 481 MEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTI 540
           +EAEEVFR+MLK+GISPNQQVYTALVHGYIKAERMEDAM+ILKQM ECNIKPDLILYG++
Sbjct: 482 IEAEEVFRSMLKDGISPNQQVYTALVHGYIKAERMEDAMKILKQMKECNIKPDLILYGSV 541

Query: 541 IWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGV 600
           IWGLCSQSKLEETKLI+KEM+S+GISANPVIYTTIIDAYFKAGKSSDA+NL QEMQD GV
Sbjct: 542 IWGLCSQSKLEETKLILKEMKSRGISANPVIYTTIIDAYFKAGKSSDAINLFQEMQDVGV 601

Query: 601 EATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKK 660
           EATVVTYCVLIDGLCKAG+VELAVDYF RM  LGLQPNVAVYT+LIDGL KTNC++SA K
Sbjct: 602 EATVVTYCVLIDGLCKAGIVELAVDYFCRMFSLGLQPNVAVYTSLIDGLSKTNCIKSANK 661

Query: 661 LFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFS 720
           LFDEMQCRGMTPDITAFTALIDGNLK GNLQEA   ISRMTE A EFDLH YTSLV+GFS
Sbjct: 662 LFDEMQCRGMTPDITAFTALIDGNLKHGNLQEALVFISRMTELAIEFDLHFYTSLVAGFS 721

Query: 721 EYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENC 780
           + GEL QARKFF+EMI+KGILPEE+LCI LLREY K GQLDEAIELKNEMQ  GLI+E+ 
Sbjct: 722 KCGELRQARKFFNEMIKKGILPEEVLCICLLREYCKRGQLDEAIELKNEMQGMGLITESA 781

Query: 781 SLAVP 784
           ++  P
Sbjct: 782 AMQFP 786

BLAST of Cla005843 vs. NCBI nr
Match: gi|657995268|ref|XP_008389961.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Malus domestica])

HSP 1 Score: 986.9 bits (2550), Expect = 2.1e-284
Identity = 492/781 (63.00%), Positives = 607/781 (77.72%), Query Frame = 1

Query: 7   LLAFASLLSAMLLFFRTLFHVSRRASYQVIS-LSSNSSHPDCLSFNVFNPSSSLTSINAY 66
           L  F S  S MLLF R LF    RAS    S +S  SS P   S   F   SSLTS +++
Sbjct: 15  LFFFISFFSEMLLFLRNLFRTGCRASSSASSRVSXLSSIPQYPSNCRFINLSSLTSSSSH 74

Query: 67  C---ISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLD 126
               I+  F WFT FL IFR PFV+ S  ++  E L+  +L +I+Q D W+DP+IV L D
Sbjct: 75  ATSLIACPFVWFTGFLCIFRFPFVTKSQPSSFPESLNTDSLSRIVQHDYWDDPRIVNLFD 134

Query: 127 SALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHD 186
           SALAPIWVS+ LVELK DP LALK FKWA ++IGFRHTTESYCI+ H+LF ARMY +AH+
Sbjct: 135 SALAPIWVSRFLVELKGDPKLALKLFKWAKTQIGFRHTTESYCILVHILFFARMYVDAHE 194

Query: 187 IIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSR 246
           +++E+++ SR     P  ++FD+LW TRN+C  G GVFD LF V VE+G+L+EA++CF R
Sbjct: 195 VLRELVLLSR---ALPGCDVFDVLWWTRNVCRVGFGVFDALFGVLVEVGMLEEASECFLR 254

Query: 247 MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGD 306
           M+KFR LPK RSCN LLHRLSK G G L RKFF DM+GAGI PSVFTYN+MI Y+CKEGD
Sbjct: 255 MKKFRVLPKVRSCNALLHRLSKPGKGNLSRKFFKDMLGAGINPSVFTYNIMIGYMCKEGD 314

Query: 307 LENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNG 366
           L+ A  LF QM++MG TPDVVTYNSLIDGYGKVGLL+++V +F EMKD  C PD IT+N 
Sbjct: 315 LDTASCLFAQMKRMGLTPDVVTYNSLIDGYGKVGLLDDSVCIFEEMKDADCEPDTITFNS 374

Query: 367 LINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVG 426
           LINC CKF++MPQA  +L +M N+GLKPNV+TYSTLIDAFCKEGMMQ A+K+F+DM+RVG
Sbjct: 375 LINCCCKFDRMPQALNFLREMNNNGLKPNVITYSTLIDAFCKEGMMQEAVKIFMDMKRVG 434

Query: 427 LLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAE 486
           LLPNEFTYTSLIDA+CK GNL+EA KL ++MLQAG++ NIVTYTAL+DGLCEDG+M EAE
Sbjct: 435 LLPNEFTYTSLIDANCKXGNLSEALKLKSEMLQAGISWNIVTYTALLDGLCEDGRMDEAE 494

Query: 487 EVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGL 546
           EVFR + K+GI PNQQ+ TAL+HGYIKA+++E+AMEI  ++     KPDL+LYGTIIWGL
Sbjct: 495 EVFREVQKSGIIPNQQICTALLHGYIKAKKIENAMEIWNEIKGKGFKPDLLLYGTIIWGL 554

Query: 547 CSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATV 606
           CSQ+KLEE++L++KEM   G++AN  IYTT++DAY+KAGK+  A+NL+QEM+D G E TV
Sbjct: 555 CSQNKLEESELVLKEMXGYGLTANHFIYTTLMDAYYKAGKTEAALNLVQEMRDNGXELTV 614

Query: 607 VTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDE 666
           VTYC LIDGLCK G+ + A  +F  M DLGLQPNVAV+TALIDGLCK NC+E+AK+LF E
Sbjct: 615 VTYCALIDGLCKKGLFQEATSHFRTMPDLGLQPNVAVFTALIDGLCKNNCIEAAKELFXE 674

Query: 667 MQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGE 726
           M  +G+ PD  A+T L+DGNLK GNL+EA  + +RM E   E DL+AYTSL+ G SE+G+
Sbjct: 675 MXDKGLIPDKAAYTTLMDGNLKHGNLEEALSIQNRMREIGMELDLYAYTSLIWGLSEFGQ 734

Query: 727 LHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAV 784
           + QA+    EMI KGILP+EILCISLLR+YYKLG LDEAIEL+ EM  RGLIS  C   +
Sbjct: 735 VKQAKMLLDEMIGKGILPDEILCISLLRKYYKLGNLDEAIELQIEMVNRGLISGTCDHVI 792

BLAST of Cla005843 vs. NCBI nr
Match: gi|645229248|ref|XP_008221377.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Prunus mume])

HSP 1 Score: 976.9 bits (2524), Expect = 2.1e-281
Identity = 484/773 (62.61%), Positives = 601/773 (77.75%), Query Frame = 1

Query: 17  MLLFFRTLFHVSRRASYQVIS-LSSNSSHP-DCLSFNVFNPSSSLTSINAYCISRHFFWF 76
           ML+F R L  +  RAS+  +S LSS   H  +CL  NV   S SL+S +   I+    WF
Sbjct: 1   MLIFLRNLLQMGCRASFHRVSPLSSIPQHSSNCLFINV--SSLSLSSSHGSLIACPLVWF 60

Query: 77  TSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVL 136
           TSFL I R PFV+ S  N+  + L+  +LR IIQ D W+DP+IV L  SALAPIW SK L
Sbjct: 61  TSFLCITRFPFVTKSNPNSFRDNLNTESLRIIIQHDYWDDPRIVNLFGSALAPIWASKFL 120

Query: 137 VELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRID 196
           VEL+ DP LALK F+W+ +RIGF HTTESYCI+ H+LF ARMY +AH+I+KE++   R+ 
Sbjct: 121 VELRGDPKLALKLFRWSKTRIGFCHTTESYCILVHILFYARMYFDAHEILKELVSLRRVS 180

Query: 197 VGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARS 256
           +G    ++FD+LWSTRN+C  G GVFD LFSV VE G+L++A++CF RM+KFR LPK RS
Sbjct: 181 LGC---DVFDVLWSTRNVCRLGFGVFDALFSVLVEFGMLEKASECFLRMKKFRVLPKVRS 240

Query: 257 CNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMR 316
           CN LL RLSKSG G   RKFF DM+GAGI PSVFTYN+MI YLCKEGDL+ A  LF QM+
Sbjct: 241 CNALLQRLSKSGKGNFSRKFFKDMLGAGITPSVFTYNIMIGYLCKEGDLDTASCLFAQMK 300

Query: 317 QMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMP 376
           +MG TPD+VTYNSLIDGYGKVG+L+ +  +F EMKD GC PDVIT+N LINC CKF+KMP
Sbjct: 301 RMGLTPDIVTYNSLIDGYGKVGILDNSFCIFEEMKDAGCEPDVITFNSLINCCCKFDKMP 360

Query: 377 QAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLI 436
           +A  +L +M N GLKPNV+TYSTLIDAFCKEGMMQ A+K+F+DM+RVGL PNEFTYTSLI
Sbjct: 361 EALNFLREMNNKGLKPNVITYSTLIDAFCKEGMMQEAVKIFMDMKRVGLSPNEFTYTSLI 420

Query: 437 DAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGIS 496
           DA+CKAGNL+EA KL  +M Q G++LNIVTYTAL+DGLC+DG+M +AEEVFR +L+ GIS
Sbjct: 421 DANCKAGNLSEALKLKKEMFQEGISLNIVTYTALLDGLCQDGRMEDAEEVFREVLETGIS 480

Query: 497 PNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLI 556
           PNQQ+ TALVHGYIKA+RME+AMEI K++     KPDL+LYGTIIWGLCSQ+KLEE++L+
Sbjct: 481 PNQQICTALVHGYIKAKRMENAMEIWKEIKGKGFKPDLLLYGTIIWGLCSQNKLEESELV 540

Query: 557 IKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCK 616
             EM+  G + N  IYTT++DAYFKAGK+ +A+NLLQEM D G+E TVVTYC LIDGLCK
Sbjct: 541 FSEMKGCGSTPNHFIYTTLMDAYFKAGKTKEALNLLQEMLDNGIEFTVVTYCALIDGLCK 600

Query: 617 AGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITA 676
            G+++ A++YF RM D+GL+PNVAV+TALIDG CK NC+E+AK+LF+EM  +GM PD  A
Sbjct: 601 KGLLQEAINYFRRMPDIGLEPNVAVFTALIDGHCKNNCIEAAKELFNEMLDKGMIPDKAA 660

Query: 677 FTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMI 736
           ++ LIDGNLK GNLQEA  +  RM E   E DL+AYTSL+ G S +G++ QA+    EMI
Sbjct: 661 YSTLIDGNLKHGNLQEALSVEKRMREMGMELDLYAYTSLIWGLSHFGQVQQAKILLDEMI 720

Query: 737 EKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCLKT 788
            KGILP+EILCI LL++YY+LG LDEA EL+ EM  +GLI+  C  AVP  +T
Sbjct: 721 GKGILPDEILCICLLKKYYELGYLDEAFELQTEMVNKGLITGTCDYAVPNART 768

BLAST of Cla005843 vs. NCBI nr
Match: gi|703110107|ref|XP_010099493.1| (hypothetical protein L484_000446 [Morus notabilis])

HSP 1 Score: 964.1 bits (2491), Expect = 1.4e-277
Identity = 473/767 (61.67%), Positives = 599/767 (78.10%), Query Frame = 1

Query: 17  MLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTS 76
           MLLF R LFH SRRAS +V   S +  +P          S    S N+  ++    WFTS
Sbjct: 22  MLLFLRNLFHTSRRASTRVSPFSPSIPYPHNCDLLPSLRSVYGKSSNSCIVACPLAWFTS 81

Query: 77  FLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVE 136
           FL + R PF S S  + S E LD   LR+I++QD W+DPKIV L DSA+API VS+ LVE
Sbjct: 82  FLFLVRFPFYSKSSASFSLEVLDREQLRRIVEQDQWHDPKIVNLFDSAIAPILVSRFLVE 141

Query: 137 LKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVG 196
           LKE P LALK FKW  +R GFRHT ESYCI+ H+LF ARM+ +A+ +++E++  +R+   
Sbjct: 142 LKEYPFLALKLFKWVRNRTGFRHTAESYCILVHILFYARMFFDANGVLRELVSSNRV--- 201

Query: 197 FPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCN 256
            P  ++FD+LWSTRN+C  G GVFD LFSV VELG+L+EAN+CF +MRKF  LPK RSCN
Sbjct: 202 LPGCDVFDVLWSTRNVCVPGFGVFDALFSVLVELGMLEEANQCFLKMRKFHVLPKPRSCN 261

Query: 257 FLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQM 316
             LHRLSK G   + RKFF DM+ AGIAPSVFTYN+MI+YLCKEGD++ AR LF +M+  
Sbjct: 262 AFLHRLSKLGKVDMSRKFFKDMVAAGIAPSVFTYNIMINYLCKEGDMDEARSLFEEMKHR 321

Query: 317 GFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQA 376
           G  PD+VTYNSLIDG+GKVG ++EA+ +F +MKDVGC PD+IT+N LINCF K +++P+A
Sbjct: 322 GLIPDIVTYNSLIDGFGKVGNMDEAICIFEKMKDVGCEPDIITFNALINCFGKSQRLPRA 381

Query: 377 FEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 436
            E+L +++N GLKPNVVTYSTLIDAFCKEGMM+ A+K FVDMRRVGL PNE+TYTSL+DA
Sbjct: 382 LEFLHELRNHGLKPNVVTYSTLIDAFCKEGMMREALKFFVDMRRVGLFPNEYTYTSLVDA 441

Query: 437 HCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPN 496
           +CKAGNLTEA KL+N+MLQAG+NLNIV Y+AL++ LCEDG+M EAE+VF  MLK G++PN
Sbjct: 442 NCKAGNLTEALKLTNEMLQAGINLNIVGYSALLNCLCEDGRMKEAEKVFMEMLKAGVTPN 501

Query: 497 QQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIK 556
            QVY++LVHGY+KA++ E A + LK+M E  IKPDL+LYGTIIWGLCSQ+KLEE++L++ 
Sbjct: 502 LQVYSSLVHGYVKAKKTEKAFQTLKEMEEKKIKPDLLLYGTIIWGLCSQNKLEESELVVN 561

Query: 557 EMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAG 616
           EM S+G++AN  IYTT++DAYFKAGK+++A+ LLQEM   G+E  VVTYC LIDGLCK G
Sbjct: 562 EMRSRGLNANHFIYTTLMDAYFKAGKTTEALLLLQEMHYYGIEVNVVTYCALIDGLCKRG 621

Query: 617 MVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFT 676
           +VE A DYF RM  +GLQPNVAVYTALIDGLCK N +E+AKKLFDEM  +G++PD TA+T
Sbjct: 622 LVEEATDYFDRMVSIGLQPNVAVYTALIDGLCKNNRIEAAKKLFDEMLEKGISPDRTAYT 681

Query: 677 ALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEK 736
            LIDGNLK G+LQEA  L +RM E   E DL+AYTSL+ GFS++G++ QA+ +  EMI K
Sbjct: 682 TLIDGNLKHGHLQEALTLKNRMIEMGMELDLYAYTSLIWGFSQFGQVQQAKTWLDEMIGK 741

Query: 737 GILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVP 784
           GILP+EILC+ LLR+YY+LG + EA EL++E+ +RGLI   C+ AVP
Sbjct: 742 GILPDEILCVCLLRKYYELGNVVEADELRDELVKRGLIKGACTYAVP 785
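
The percentages in each HSP statistics line above are the identical and positive-scoring column counts divided by the alignment length. A minimal Python sketch for re-deriving them from such a line is shown here. The helper name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool, and the pattern accepts both "Positives" and the "Postives" spelling that some report pages emit.

```python
import re

# Hypothetical pattern for the HSP statistics lines on this page.
# "Posi?tives" matches both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 473/767 (61.67%), Postives = 599/767 (78.10%), Query Frame = 1"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported fields is a quick consistency check when alignment blocks are copied between reports.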

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP143_ARATH  2.0e-247  55.40     Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
PP141_ARATH  2.6e-101  36.70     Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN...
PPR12_ARATH  2.2e-100  32.15     Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
PP432_ARATH  2.2e-92   32.03     Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
PP407_ARATH  6.6e-89   30.53     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
Match Name        E-value   Identity  Description
W9SE38_9ROSA      1.0e-277  61.67     Uncharacterized protein OS=Morus notabilis GN=L484_000446 PE=4 SV=1
W9S012_9ROSA      1.9e-276  61.54     Uncharacterized protein OS=Morus notabilis GN=L484_000854 PE=4 SV=1
A0A061E9Z5_THECC  4.2e-276  60.77     Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cac...
B9RY36_RICCO      7.5e-265  62.47     Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
K7L5N5_SOYBN      3.5e-254  59.53     Uncharacterized protein OS=Glycine max GN=GLYMA_08G090700 PE=4 SV=1
Match Name                        E-value   Identity  Description
gi|449463537|ref|XP_004149490.1|  0.0e+00   86.48     PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucum...
gi|659072656|ref|XP_008466646.1|  0.0e+00   86.37     PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucum...
gi|657995268|ref|XP_008389961.1|  2.1e-284  63.00     PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Malus...
gi|645229248|ref|XP_008221377.1|  2.1e-281  62.61     PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Prunu...
gi|703110107|ref|XP_010099493.1|  1.4e-277  61.67     hypothetical protein L484_000446 [Morus notabilis]
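
When comparing hits from the summary tables above, it can help to treat the E-value as a number, not a string. The following Python sketch ranks the five NCBI nr hits by E-value and selects the high-identity ones. The tuple layout, the function names `rank_hits` and `strong_hits`, and the 80% identity cutoff are illustrative assumptions, not part of the database or of BLAST.

```python
# Hits copied from the NCBI nr summary table: (accession, E-value string, % identity).
NR_HITS = [
    ("XP_004149490.1", "0.0e+00", 86.48),
    ("XP_008466646.1", "0.0e+00", 86.37),
    ("XP_008389961.1", "2.1e-284", 63.00),
    ("XP_008221377.1", "2.1e-281", 62.61),
    ("XP_010099493.1", "1.4e-277", 61.67),
]


def rank_hits(hits):
    """Sort by E-value (ascending); ties are broken by identity (descending)."""
    # float() handles these exponents; they are still within double-precision range.
    return sorted(hits, key=lambda h: (float(h[1]), -h[2]))


def strong_hits(hits, min_identity=80.0):
    """Return accessions at or above an assumed identity cutoff (hypothetical 80%)."""
    return [acc for acc, _evalue, identity in hits if identity >= min_identity]


if __name__ == "__main__":
    for acc, evalue, identity in rank_hits(NR_HITS):
        print(acc, evalue, identity)
    print(strong_hits(NR_HITS))
```

With the table values, the two hits above the assumed cutoff are the Cucumis sativus and Cucumis melo entries, both reported with an E-value of 0.0e+00.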
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU48293      watermelon EST collection version 2.0  transcribed_cluster
WMU65993      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla005843     Cla005843.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU48293      WMU48293     transcribed_cluster
WMU65993      WMU65993     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 221..246  score: 0.11
    coord: 709..738  score: 5.2E-5
    coord: 747..773  score: 0.0016
    coord: 534..563  score: 0

IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 282..314  score: 1.4

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 635..681  score: 2.8E-14
    coord: 461..503  score: 2.0E-12
    coord: 566..614  score: 7.8E-16
    coord: 320..369  score: 5.4E-20
    coord: 390..439  score: 1.0

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 254..286  score: 0.0024
    coord: 358..392  score: 6.8E-7
    coord: 603..637  score: 7.0E-10
    coord: 499..531  score: 8.6E-8
    coord: 428..461  score: 7.6E-8
    coord: 463..496  score: 4.4E-10
    coord: 746..773  score: 0.0032
    coord: 288..322  score: 3.8E-10
    coord: 709..740  score: 3.3E-6
    coord: 568..601  score: 1.3E-7
    coord: 639..672  score: 7.2E-10
    coord: 323..357  score: 1.1E-12
    coord: 393..427  score: 1.

IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 321..355  score: 14.425
    coord: 461..495  score: 13.713
    coord: 671..705  score: 8.396
    coord: 741..775  score: 9.328
    coord: 391..425  score: 13.23
    coord: 426..460  score: 11.904
    coord: 286..320  score: 13.68
    coord: 531..565  score: 9.339
    coord: 356..390  score: 12.189
    coord: 496..530  score: 11.762
    coord: 251..285  score: 8.714
    coord: 706..740  score: 12.189
    coord: 636..670  score: 12.452
    coord: 216..250  score: 7.991
    coord: 160..195  score: 5.327
    coord: 566..600  score: 11.729
    coord: 601..635  score: 12

IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 563..581  score: 3.6E-5
    coord: 439..526  score: 3.6E-5
    coord: 226..413  score: 8.6E-4
    coord: 582..764  score: 3.

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 230..782  score: 2.6E-302
    coord: 108..139  score: 2.6E-302
    coord: 26..46  score: 2.6E

None  No IPR available  PANTHER  PTHR24015:SF329  SUBFAMILY NOT NAMED
    coord: 230..782  score: 2.6E-302
    coord: 108..139  score: 2.6E-302
    coord: 26..46  score: 2.6E

None  No IPR available  unknown  SSF81901  HCP-like
    coord: 425..536  score: 4.97E-8
    coord: 289..388  score: 4.9