Cp4.1LG01g04190 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g04190
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG01 : 1404289 .. 1407242 (-)
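
As a rough illustration, the location string can be split into its parts and the genomic span computed. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG01 : 1404289 .. 1407242 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 2954 bases on the minus strand of Cp4.1LG01.
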
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
TGTCGAATGCATTCCAACGATGATGAATGCATCTTCTCTAATCTCTCTTCTCGACAAATGCAAATCAATGTCCGAATTAAGAAGAATCCATGCTCTGCTTTTCACTTTCGGTGTCTCTCAAGATGACGCCGTCGTATCGAAACTTCTCTCCTTCTCTGCACTCTCCCCCGCCGGCGATCTCGACTACTCATATAAACTGTTATTGAATCTTCCCAATCCCACCACCTTCAAATGGAACACTCTCATAAGGGGTTTTTCAAATTCCAGAAACCCAAATCGTTCAATTACGGTTTTCGTTAGGATGCTGCGGAATGGGGTCTCCCCTGATTATATGACGTACCCTTTCCTGGGGAAGGCGATAGCGAAGNATCACACAGTTCCCAAAGCAGCTCACAACAGACCGAATTGAACACAGAACAACAAGAAACAGAAAAGAAACCACAAATTGAACAGGAACAAGAACAAAGTAGGAACAGTGACATCAAAAGAGGGATAGAGAGAACCTGTTCTTGAAGGGTACGTGCGAGGGCAAGGTCAGCATCGACCTGGTTGAGGTTAGTGAAGGGAGTTCTAGCGGATTGGCGAGAAGTGGAGTCGGATCCAGTAGTCTCATTGTGGGGGTCTGCAGACGATGGCGATGAATTGGAATTAGGGTTGTGTTCAAGGTCGGAGGAGGCGGTGGTGGCGGTGGTGGGCGGTGGCTTGACTCCATCAAAACCAAGTTTGGTGGCGTTAGTGGTGGCTTCGTTCTCCATAAAAGAAGGAGATTGAGGTTTGAGAGAAAAGGGAGATTTGGATTTGACAAATGGAAGATGGAAAAGAGATGATATGAAGAATTAGGAGAAGGAGAATCAAAGATTTCAGAGAAGAAAAAAATTAATAGTAACAGAAGAATCTAAGAACACGACGAAGATTATATAATTTTTGAGAGCCGAGCAAGAACAAACAAACAGACGCACCTTTTTCTTTTTTCTTTTTTCTTTTTTCTATTTTTATTATTATTTCAAAATAAACAAATATAGAAAATAAGAATTTGTTTTTCCTTTTTCTTTTTTCGATAAATAGTTTGACAAAAACAAAACTACATCTAATAATATAATTTTTTTTAAACAACTATCTTAAAATAAATTATAAATTAAATATTAATATAAAAATAATTATATATAAATCACAATAATTTAGTATTAATTTACCAAACATACTAATTATTTTAAAAGTTATTACAAAAGGCAATTCTCAATTTTAATTTTTTTTAAAATAATAAATTTTATTTAAATTTAATATTTAATATTATTTTATTTTAAAAAAAAACTCAACTTTAATCCGTTGGCTGCAGTGTCGAATGCATTCCAACGATGATGAATGCATCTTCTCTAATCTCTCTTCTCGACAAATGCAAATCAATGTCCGAATTAAGAAGAATCCATGCTCTGCTTTTCACTTTCGGTGTCTCTCAAGATGACGCCGTCGTATCGAAACTTCTCTCCTTCTCTGCACTCTCCCCCGCCGGCGATCTCGACTACTCATATAAACTGTTATTGAATCTTCCCAATCCCACCACCTTCAAATGGAACACTCTCATAAGGGGTTTTTCAAATTCCAGAAACCCAAATCGTTCAATTACGGTTTTCGTTAGGATGCTGCGGAATGGGGTCTCCCCTGATTATATGACGTACCCTTTCCTGGGGAAGGCGATAGCGAAGTTGTTGAATCAGAAGCTTGGAATGGCGGTGCATGTTCATGTTGCCAAAACTGGGCATGAGGTTGATAGGTTCGTAATGAATTCATTGATTCATATGTATGCTTCTTGCGGAGATATCGCGTTTGCGCGTAAGGTGTTCGACGAAATGCCAACGAAGAATTTGGTGTCTTGGAACGCTATGTTGGATGGGTATGCCAAATGTGGGGACGTGAATACTGCTAGGGAGGTGTTTGATTTAATGCATGAGAGGGATGTTGTGTCGTGGAGCTCTTTGATCGATGGGTATGTTAAGTGCGGG
GAATATGGTGAGGCGATGGCTCTGTTTGAGCGCATGCGCTCTGCTGGGCCCATGGCGAATGAGGTGACTTTGGTGAGTGTTCTGTGTGCCTGTGCCCATTTGGGTGCACTTGAACAGGGGAGAATGATGCACGGTTATATAGTTGAGAATGAGTTGCCATTGACTATTGTGCTACTGACATCTTTGGTGGACATGTATGCCAAATGTGGCGCCATACATGAAGCTTTGGCTGCGTTTCGTGCATGTCCACTGCAACGGACTGATGTTCTAATCTGGAATGCTATAATTGGAGGTTTGGCAACACATGGGCTGATAAAAGAGTCAATGGACTTGTTTAGTGAGATGCAAATGGTAGGGATTGCGCCTGATGAGATCACATACTTGTGCTTGTTAAGCTGTTGTGCTCATGGAGGATTAGTAAATGAGGCTTGGTATTTCTTTGATTGCCTTCATAAACATGGTATGACTCCAAAGGATGAGCATTATGCTTGTATGGTAGATGCCTTATCCCGGGCCGGCCAAGTATCTGAGGCGTATCAATTCTTATGTCAAATGACCGTCCAACCAACGTCGTCGATGTTAGGTGCTCTCCTGAGTGGCTGCATGAAACATGGTAAACTAGACCTTGCAGAAGTGGTAGGAAGGAGGCTTGTTGAGTTAGATCCAGATCATGATGGTAGATATGTTGGCTTATCAAATATATATGCAGTAGACAAGCGTTGGAATGATGCCAGAAATATCAGAGAAGCCATGGAAAAGAGGGGAGTGAAGAAATCTCCTGGTTTTAGTTTTGTTGAAGTATTTGGAATCCTTCATAGATTCATAGCTCATGATAAGACACATGGTGATTCCGAGCGGATTTACGTAATGCTGAACTTAATTATAGATCAAATGAAACCGATCGAAGATTCAGAAAATCAGGAGTACTGTTTGTATGACCTCATTGGTGTCTCT

mRNA sequence

TGTCGAATGCATTCCAACGATGATGAATGCATCTTCTCTAATCTCTCTTCTCGACAAATGCAAATCAATGTCCGAATTAAGAAGAATCCATGCTCTGCTTTTCACTTTCGGTGTCTCTCAAGATGACGCCGTCGTATCGAAACTTCTCTCCTTCTCTGCACTCTCCCCCGCCGGCGATCTCGACTACTCATATAAACTGTTATTGAATCTTCCCAATCCCACCACCTTCAAATGGAACACTCTCATAAGGGGTTTTTCAAATTCCAGAAACCCAAATCGTTCAATTACGGTTTTCGTTAGGATGCTGCGGAATGGGGTCTCCCCTGATTATATGACAATCCATGCTCTGCTTTTCACTTTCGGTGTCTCTCAAGATGACGCCGTCGTATCGAAACTTCTCTCCTTCTCTGCACTCTCCCCCGCCGGCGATCTCGACTACTCATATAAACTGTTATTGAATCTTCCCAATCCCACCACCTTCAAATGGAACACTCTCATAAGGGGTTTTTCAAATTCCAGAAACCCAAATCGTTCAATTACGGTTTTCGTTAGGATGCTGCGGAATGGGGTCTCCCCTGATTATATGACGTACCCTTTCCTGGGGAAGGCGATAGCGAAGTTGTTGAATCAGAAGCTTGGAATGGCGGTGCATGTTCATGTTGCCAAAACTGGGCATGAGGTTGATAGGTTCGTAATGAATTCATTGATTCATATGTATGCTTCTTGCGGAGATATCGCGTTTGCGCGTAAGGTGTTCGACGAAATGCCAACGAAGAATTTGGTGTCTTGGAACGCTATGTTGGATGGGTATGCCAAATGTGGGGACGTGAATACTGCTAGGGAGGTGTTTGATTTAATGCATGAGAGGGATGTTGTGTCGTGGAGCTCTTTGATCGATGGGTATGTTAAGTGCGGGGAATATGGTGAGGCGATGGCTCTGTTTGAGCGCATGCGCTCTGCTGGGCCCATGGCGAATGAGGTGACTTTGGTGAGTGTTCTGTGTGCCTGTGCCCATTTGGGTGCACTTGAACAGGGGAGAATGATGCACGGTTATATAGTTGAGAATGAGTTGCCATTGACTATTGTGCTACTGACATCTTTGGTGGACATGTATGCCAAATGTGGCGCCATACATGAAGCTTTGGCTGCGTTTCGTGCATGTCCACTGCAACGGACTGATGTTCTAATCTGGAATGCTATAATTGGAGGTTTGGCAACACATGGGCTGATAAAAGAGTCAATGGACTTGTTTAGTGAGATGCAAATGGTAGGGATTGCGCCTGATGAGATCACATACTTGTGCTTGTTAAGCTGTTGTGCTCATGGAGGATTAGTAAATGAGGCTTGGTATTTCTTTGATTGCCTTCATAAACATGGTATGACTCCAAAGGATGAGCATTATGCTTGTATGGTAGATGCCTTATCCCGGGCCGGCCAAGTATCTGAGGCGTATCAATTCTTATGTCAAATGACCGTCCAACCAACGTCGTCGATGTTAGGTGCTCTCCTGAGTGGCTGCATGAAACATGGTAAACTAGACCTTGCAGAAGTGGTAGGAAGGAGGCTTGTTGAGTTAGATCCAGATCATGATGGTAGATATGTTGGCTTATCAAATATATATGCAGTAGACAAGCGTTGGAATGATGCCAGAAATATCAGAGAAGCCATGGAAAAGAGGGGAGTGAAGAAATCTCCTGGTTTTAGTTTTGTTGAAGTATTTGGAATCCTTCATAGATTCATAGCTCATGATAAGACACATGGTGATTCCGAGCGGATTTACGTAATGCTGAACTTAATTATAGATCAAATGAAACCGATCGAAGATTCAGAAAATCAGGAGTACTGTTTGTATGACCTCATTGGTGTCTCT

Coding sequence (CDS)

ATGATGAATGCATCTTCTCTAATCTCTCTTCTCGACAAATGCAAATCAATGTCCGAATTAAGAAGAATCCATGCTCTGCTTTTCACTTTCGGTGTCTCTCAAGATGACGCCGTCGTATCGAAACTTCTCTCCTTCTCTGCACTCTCCCCCGCCGGCGATCTCGACTACTCATATAAACTGTTATTGAATCTTCCCAATCCCACCACCTTCAAATGGAACACTCTCATAAGGGGTTTTTCAAATTCCAGAAACCCAAATCGTTCAATTACGGTTTTCGTTAGGATGCTGCGGAATGGGGTCTCCCCTGATTATATGACAATCCATGCTCTGCTTTTCACTTTCGGTGTCTCTCAAGATGACGCCGTCGTATCGAAACTTCTCTCCTTCTCTGCACTCTCCCCCGCCGGCGATCTCGACTACTCATATAAACTGTTATTGAATCTTCCCAATCCCACCACCTTCAAATGGAACACTCTCATAAGGGGTTTTTCAAATTCCAGAAACCCAAATCGTTCAATTACGGTTTTCGTTAGGATGCTGCGGAATGGGGTCTCCCCTGATTATATGACGTACCCTTTCCTGGGGAAGGCGATAGCGAAGTTGTTGAATCAGAAGCTTGGAATGGCGGTGCATGTTCATGTTGCCAAAACTGGGCATGAGGTTGATAGGTTCGTAATGAATTCATTGATTCATATGTATGCTTCTTGCGGAGATATCGCGTTTGCGCGTAAGGTGTTCGACGAAATGCCAACGAAGAATTTGGTGTCTTGGAACGCTATGTTGGATGGGTATGCCAAATGTGGGGACGTGAATACTGCTAGGGAGGTGTTTGATTTAATGCATGAGAGGGATGTTGTGTCGTGGAGCTCTTTGATCGATGGGTATGTTAAGTGCGGGGAATATGGTGAGGCGATGGCTCTGTTTGAGCGCATGCGCTCTGCTGGGCCCATGGCGAATGAGGTGACTTTGGTGAGTGTTCTGTGTGCCTGTGCCCATTTGGGTGCACTTGAACAGGGGAGAATGATGCACGGTTATATAGTTGAGAATGAGTTGCCATTGACTATTGTGCTACTGACATCTTTGGTGGACATGTATGCCAAATGTGGCGCCATACATGAAGCTTTGGCTGCGTTTCGTGCATGTCCACTGCAACGGACTGATGTTCTAATCTGGAATGCTATAATTGGAGGTTTGGCAACACATGGGCTGATAAAAGAGTCAATGGACTTGTTTAGTGAGATGCAAATGGTAGGGATTGCGCCTGATGAGATCACATACTTGTGCTTGTTAAGCTGTTGTGCTCATGGAGGATTAGTAAATGAGGCTTGGTATTTCTTTGATTGCCTTCATAAACATGGTATGACTCCAAAGGATGAGCATTATGCTTGTATGGTAGATGCCTTATCCCGGGCCGGCCAAGTATCTGAGGCGTATCAATTCTTATGTCAAATGACCGTCCAACCAACGTCGTCGATGTTAGGTGCTCTCCTGAGTGGCTGCATGAAACATGGTAAACTAGACCTTGCAGAAGTGGTAGGAAGGAGGCTTGTTGAGTTAGATCCAGATCATGATGGTAGATATGTTGGCTTATCAAATATATATGCAGTAGACAAGCGTTGGAATGATGCCAGAAATATCAGAGAAGCCATGGAAAAGAGGGGAGTGAAGAAATCTCCTGGTTTTAGTTTTGTTGAAGTATTTGGAATCCTTCATAGATTCATAGCTCATGATAAGACACATGGTGATTCCGAGCGGATTTACGTAATGCTGAACTTAATTATAGATCAAATGAAACCGATCGAAGATTCAGAAAATCAGGAGTACTGTTTGTATGACCTCATTGGTGTCTCT

Protein sequence

MMNASSLISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVSPDYMTIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVMNSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVVSWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYIVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVDALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGRYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERIYVMLNLIIDQMKPIEDSENQEYCLYDLIGVS
BLAST of Cp4.1LG01g04190 vs. Swiss-Prot
Match: PP369_ARATH (Pentatricopeptide repeat-containing protein At5g08305 OS=Arabidopsis thaliana GN=PCMP-E105 PE=2 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 2.0e-173
Identity = 292/507 (57.59%), Positives = 388/507 (76.53%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IH LL T G+S+++  VS+ LSFSALS +GD+DY+YK L  L +P  + WN +IRGFSNS
Sbjct: 27  IHTLLITLGLSEEEPFVSQTLSFSALSSSGDVDYAYKFLSKLSDPPNYGWNFVIRGFSNS 86

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           RNP +SI+V+++MLR G+ PD+MTYPFL K+ ++L N+KLG ++H  V K+G E D F+ 
Sbjct: 87  RNPEKSISVYIQMLRFGLLPDHMTYPFLMKSSSRLSNRKLGGSLHCSVVKSGLEWDLFIC 146

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           N+LIHMY S  D A ARK+FDEMP KNLV+WN++LD YAK GDV +AR VFD M ERDVV
Sbjct: 147 NTLIHMYGSFRDQASARKLFDEMPHKNLVTWNSILDAYAKSGDVVSARLVFDEMSERDVV 206

Query: 287 SWSSLIDGYVKCGEYGEAMALFERM-RSAGPMANEVTLVSVLCACAHLGALEQGRMMHGY 346
           +WSS+IDGYVK GEY +A+ +F++M R     ANEVT+VSV+CACAHLGAL +G+ +H Y
Sbjct: 207 TWSSMIDGYVKRGEYNKALEIFDQMMRMGSSKANEVTMVSVICACAHLGALNRGKTVHRY 266

Query: 347 IVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIK 406
           I++  LPLT++L TSL+DMYAKCG+I +A + F    ++ TD L+WNAIIGGLA+HG I+
Sbjct: 267 ILDVHLPLTVILQTSLIDMYAKCGSIGDAWSVFYRASVKETDALMWNAIIGGLASHGFIR 326

Query: 407 ESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMV 466
           ES+ LF +M+   I PDEIT+LCLL+ C+HGGLV EAW+FF  L + G  PK EHYACMV
Sbjct: 327 ESLQLFHKMRESKIDPDEITFLCLLAACSHGGLVKEAWHFFKSLKESGAEPKSEHYACMV 386

Query: 467 DALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDG 526
           D LSRAG V +A+ F+ +M ++PT SMLGALL+GC+ HG L+LAE VG++L+EL P +DG
Sbjct: 387 DVLSRAGLVKDAHDFISEMPIKPTGSMLGALLNGCINHGNLELAETVGKKLIELQPHNDG 446

Query: 527 RYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSER 586
           RYVGL+N+YA++K++  AR++REAMEK+GVKK  G S +++ G  HRFIAHDKTH  S++
Sbjct: 447 RYVGLANVYAINKQFRAARSMREAMEKKGVKKIAGHSILDLDGTRHRFIAHDKTHFHSDK 506

Query: 587 IYVMLNLIIDQMK---PIEDSENQEYC 610
           IY +L L    M      +D +N  +C
Sbjct: 507 IYAVLQLTGAWMNLDVDYDDQDNHCFC 533
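
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the counts reported for the PP369_ARATH hit; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the summary line has exactly the layout shown on this page.

```python
import re


def parse_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    # 'Posi?tives' tolerates a missing 'i' in the label.
    m = re.search(
        r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)", line
    )
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    # Percent = matching columns / aligned columns * 100, rounded to 2 places.
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


summary = "Identity = 292/507 (57.59%), Positives = 388/507 (76.53%), Query Frame = 1"
print(parse_hsp_stats(summary))
```

Here 292 identical and 388 conservative columns out of 507 aligned columns give the 57.59% and 76.53% reported for this hit.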

BLAST of Cp4.1LG01g04190 vs. Swiss-Prot
Match: PP169_ARATH (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 6.1e-114
Identity = 229/613 (37.36%), Positives = 347/613 (56.61%), Query Frame = 1

Query: 7   LISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPN 66
           L+SLL+KCK +  L++I A +   G+  D    S+L++F ALS +  LDYS K+L  + N
Sbjct: 56  LLSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILKGIEN 115

Query: 67  PTTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVS---PDYMTIHALLFTFGVSQDDAVV 126
           P  F WN  IRGFS S NP  S  ++ +MLR+G     PD+ T   L       +  ++ 
Sbjct: 116 PNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLG 175

Query: 127 SKLL-----------------SFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 186
             +L                 S    +  GD++ + K+    P      WN LI G+   
Sbjct: 176 HMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKI 235

Query: 187 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 246
               ++I V+  M   GV PD +T   L  + + L +   G   + +V + G  +   ++
Sbjct: 236 GEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPLV 295

Query: 247 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 306
           N+L+ M++ CGDI  AR++FD +  + +VSW  M+ GYA+CG ++ +R++FD M E+DVV
Sbjct: 296 NALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDVV 355

Query: 307 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 366
            W+++I G V+     +A+ALF+ M+++    +E+T++  L AC+ LGAL+ G  +H YI
Sbjct: 356 LWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRYI 415

Query: 367 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 426
            +  L L + L TSLVDMYAKCG I EAL+ F    +Q  + L + AIIGGLA HG    
Sbjct: 416 EKYSLSLNVALGTSLVDMYAKCGNISEALSVFHG--IQTRNSLTYTAIIGGLALHGDAST 475

Query: 427 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAW-YFFDCLHKHGMTPKDEHYACMV 486
           ++  F+EM   GIAPDEIT++ LLS C HGG++     YF     +  + P+ +HY+ MV
Sbjct: 476 AISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMV 535

Query: 487 DALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDG 546
           D L RAG + EA + +  M ++  +++ GALL GC  HG ++L E   ++L+ELDP   G
Sbjct: 536 DLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSG 595

Query: 547 RYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSER 599
            YV L  +Y     W DA+  R  M +RGV+K PG S +EV GI+  FI  DK+  +SE+
Sbjct: 596 IYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEK 655

BLAST of Cp4.1LG01g04190 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.6e-109
Identity = 208/613 (33.93%), Positives = 352/613 (57.42%), Query Frame = 1

Query: 8   ISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNP 67
           ISL+++C S+ +L++ H  +   G   D    SKL + +ALS    L+Y+ K+   +P P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 68  TTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVS-PDYMT-------------------I 127
            +F WNTLIR +++  +P  SI  F+ M+      P+  T                   +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 128 HALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNSR 187
           H +     V  D  V + L+        GDLD + K+   +       WN++I GF    
Sbjct: 154 HGMAVKSAVGSDVFVANSLIH--CYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKG 213

Query: 188 NPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVMN 247
           +P++++ +F +M    V   ++T   +  A AK+ N + G  V  ++ +    V+  + N
Sbjct: 214 SPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLAN 273

Query: 248 SLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVVS 307
           +++ MY  CG I  A+++FD M  K+ V+W  MLDGYA   D   AREV + M ++D+V+
Sbjct: 274 AMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVA 333

Query: 308 WSSLIDGYVKCGEYGEAMALFERMRSAGPMA-NEVTLVSVLCACAHLGALEQGRMMHGYI 367
           W++LI  Y + G+  EA+ +F  ++    M  N++TLVS L ACA +GALE GR +H YI
Sbjct: 334 WNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYI 393

Query: 368 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 427
            ++ + +   + ++L+ MY+KCG + ++   F +  +++ DV +W+A+IGGLA HG   E
Sbjct: 394 KKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNS--VEKRDVFVWSAMIGGLAMHGCGNE 453

Query: 428 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHK-HGMTPKDEHYACMV 487
           ++D+F +MQ   + P+ +T+  +   C+H GLV+EA   F  +   +G+ P+++HYAC+V
Sbjct: 454 AVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIV 513

Query: 488 DALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDG 547
           D L R+G + +A +F+  M + P++S+ GALL  C  H  L+LAE+   RL+EL+P +DG
Sbjct: 514 DVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDG 573

Query: 548 RYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSER 599
            +V LSNIYA   +W +   +R+ M   G+KK PG S +E+ G++H F++ D  H  SE+
Sbjct: 574 AHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEK 633

BLAST of Cp4.1LG01g04190 vs. Swiss-Prot
Match: PP235_ARATH (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 397.5 bits (1020), Expect = 2.6e-109
Identity = 214/607 (35.26%), Positives = 340/607 (56.01%), Query Frame = 1

Query: 5   SSLISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNL 64
           S  IS+L  CK+  + +++H+   T GV+ +     KL  F      G + Y+YKL + +
Sbjct: 35  SRFISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGGHVSYAYKLFVKI 94

Query: 65  PNPTTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVSPDYMT------------------ 124
           P P    WN +I+G+S        + +++ ML+ GV+PD  T                  
Sbjct: 95  PEPDVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLNGLKRDGGALACG 154

Query: 125 --IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFS 184
             +H  +  FG+  +  V + L+   +L   G +D +  +         F WN +I G++
Sbjct: 155 KKLHCHVVKFGLGSNLYVQNALVKMYSL--CGLMDMARGVFDRRCKEDVFSWNLMISGYN 214

Query: 185 NSRNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRF 244
             +    SI + V M RN VSP  +T   +  A +K+ ++ L   VH +V++   E    
Sbjct: 215 RMKEYEESIELLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECKTEPSLR 274

Query: 245 VMNSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERD 304
           + N+L++ YA+CG++  A ++F  M  ++++SW +++ GY + G++  AR  FD M  RD
Sbjct: 275 LENALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRD 334

Query: 305 VVSWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHG 364
            +SW+ +IDGY++ G + E++ +F  M+SAG + +E T+VSVL ACAHLG+LE G  +  
Sbjct: 335 RISWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLGSLEIGEWIKT 394

Query: 365 YIVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLI 424
           YI +N++   +V+  +L+DMY KCG   +A   F    + + D   W A++ GLA +G  
Sbjct: 395 YIDKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFH--DMDQRDKFTWTAMVVGLANNGQG 454

Query: 425 KESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHK-HGMTPKDEHYAC 484
           +E++ +F +MQ + I PD+ITYL +LS C H G+V++A  FF  +   H + P   HY C
Sbjct: 455 QEAIKVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIEPSLVHYGC 514

Query: 485 MVDALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDH 544
           MVD L RAG V EAY+ L +M + P S + GALL     H    +AE+  ++++EL+PD+
Sbjct: 515 MVDMLGRAGLVKEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKKILELEPDN 574

Query: 545 DGRYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDS 591
              Y  L NIYA  KRW D R +R  +    +KK+PGFS +EV G  H F+A DK+H  S
Sbjct: 575 GAVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQS 634

BLAST of Cp4.1LG01g04190 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 383.3 bits (983), Expect = 5.2e-105
Identity = 197/518 (38.03%), Positives = 316/518 (61.00%), Query Frame = 1

Query: 103 DYMTIHALLFTFGVSQDDAVVSKLLSFSA-----LSPAGDLDYSYKLLLNLPNPTTFKWN 162
           D   IH  L    +  D  V S+LL+          P   L Y+Y +   + NP  F +N
Sbjct: 27  DLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQIQNPNLFVFN 86

Query: 163 TLIRGFSNSRNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKT 222
            LIR FS    P+++   + +ML++ + PD +T+PFL KA +++    +G   H  + + 
Sbjct: 87  LLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQTHSQIVRF 146

Query: 223 GHEVDRFVMNSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVF 282
           G + D +V NSL+HMYA+CG IA A ++F +M  +++VSW +M+ GY KCG V  ARE+F
Sbjct: 147 GFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGMVENAREMF 206

Query: 283 DLMHERDVVSWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALE 342
           D M  R++ +WS +I+GY K   + +A+ LFE M+  G +ANE  +VSV+ +CAHLGALE
Sbjct: 207 DEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALE 266

Query: 343 QGRMMHGYIVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGG 402
            G   + Y+V++ + + ++L T+LVDM+ +CG I +A+  F   P   TD L W++II G
Sbjct: 267 FGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLP--ETDSLSWSSIIKG 326

Query: 403 LATHGLIKESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHK-HGMTP 462
           LA HG   ++M  FS+M  +G  P ++T+  +LS C+HGGLV +    ++ + K HG+ P
Sbjct: 327 LAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEP 386

Query: 463 KDEHYACMVDALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRL 522
           + EHY C+VD L RAG+++EA  F+ +M V+P + +LGALL  C  +   ++AE VG  L
Sbjct: 387 RLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNML 446

Query: 523 VELDPDHDGRYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRF-IA 582
           +++ P+H G YV LSNIYA   +W+   ++R+ M+++ VKK PG+S +E+ G +++F + 
Sbjct: 447 IKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKINKFTMG 506

Query: 583 HDKTHGDSERIYVMLNLIIDQMKPIEDSENQEYCLYDL 614
            D+ H +  +I      I+ +++ I    N     +D+
Sbjct: 507 DDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDV 542

BLAST of Cp4.1LG01g04190 vs. TrEMBL
Match: A0A0A0KMZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G387410 PE=4 SV=1)

HSP 1 Score: 889.4 bits (2297), Expect = 2.5e-255
Identity = 422/511 (82.58%), Positives = 465/511 (91.00%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHALLFT G+SQD+ + SKLL FSALSPA DLDYSYKL+LN+PNPTTF WNTLIR FSN+
Sbjct: 32  IHALLFTLGISQDETIKSKLLLFSALSPARDLDYSYKLILNVPNPTTFNWNTLIRAFSNT 91

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPN SITVF++ML+NGVSPDY+TYPFL KA +KLLNQ+LGMAVHVH+ K+GHE+D+F+ 
Sbjct: 92  KNPNPSITVFIKMLQNGVSPDYLTYPFLVKATSKLLNQELGMAVHVHIVKSGHEIDKFIQ 151

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYASC DIA ARKVFDEMP KNLV+WNAMLDGYAKCGD+N AREVF+LM E+DVV
Sbjct: 152 NSLIHMYASCRDIASARKVFDEMPRKNLVTWNAMLDGYAKCGDLNMAREVFNLMPEKDVV 211

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK   YGEAMALFERM   GPMANEVTLVS LCACAHLGALE GRMMH YI
Sbjct: 212 SWSSLIDGYVKGRVYGEAMALFERMSFDGPMANEVTLVSALCACAHLGALEHGRMMHRYI 271

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           VENELPLTIVL TSLVDMYAKCGAIHEAL  FRAC LQ  DVLIWNAIIGGLATHGLIKE
Sbjct: 272 VENELPLTIVLQTSLVDMYAKCGAIHEALTVFRACSLQEADVLIWNAIIGGLATHGLIKE 331

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           +M+LF EM+MVGI PDEITYLCLLSCCAHGGLV EAWYFFDCL KHGM PK EHYACMVD
Sbjct: 332 AMNLFCEMKMVGIVPDEITYLCLLSCCAHGGLVEEAWYFFDCLRKHGMIPKVEHYACMVD 391

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
           ALSRAGQVSEAYQFLCQM VQPTSSMLGALLSGCMKHGKLD+A+VVGRRLVELDP+HDGR
Sbjct: 392 ALSRAGQVSEAYQFLCQMPVQPTSSMLGALLSGCMKHGKLDIAKVVGRRLVELDPNHDGR 451

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSNIYA DKRW+DA+NIREAME++GVKKSPGFSF+EV+G+LHRF+AHDKTHGD E+I
Sbjct: 452 YVGLSNIYAADKRWDDAKNIREAMERKGVKKSPGFSFIEVYGVLHRFMAHDKTHGDCEQI 511

Query: 587 YVMLNLIIDQMKPIEDSENQEYCLYDLIGVS 618
           ++MLNLI+DQMKPIED  +QE C YD++ VS
Sbjct: 512 FMMLNLIVDQMKPIEDYVHQECCFYDIMNVS 542

BLAST of Cp4.1LG01g04190 vs. TrEMBL
Match: W9SLB8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007552 PE=4 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 5.2e-213
Identity = 350/502 (69.72%), Positives = 425/502 (84.66%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHALL   G+SQ+D+  SK+LSFSALS +G++DYS++ L  L  P TF WNT+IRG+S S
Sbjct: 7   IHALLLACGLSQEDSFASKILSFSALSDSGNVDYSFRFLSQLSCPATFYWNTVIRGYSKS 66

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           RNPN SI VF+RMLRNGVSPDY+TYPFL KA A L+ ++LG+A+H  ++K G+E DRFV 
Sbjct: 67  RNPNSSILVFIRMLRNGVSPDYLTYPFLAKAAACLMKRELGVAIHACISKHGYESDRFVQ 126

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NS+IHMYASCGDIA+ARK+FD M  +N VSWN+M+DGYAKCGDVN+AREVF+LM  +DVV
Sbjct: 127 NSMIHMYASCGDIAYARKIFDSMSHRNSVSWNSMVDGYAKCGDVNSAREVFELMPVKDVV 186

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWS LIDGYVK G+Y EA+A+FE+M++AG  AN+VT+VSVLCAC HLGALEQG  MH YI
Sbjct: 187 SWSCLIDGYVKAGKYMEALAIFEQMQTAGGKANDVTMVSVLCACTHLGALEQGSRMHRYI 246

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           +EN LPLT+VL TSLVDMYAKCG I EAL  FR+  +++TDVL+WNAIIGGLATHGL+KE
Sbjct: 247 LENGLPLTLVLKTSLVDMYAKCGEIEEALGLFRSSSMRKTDVLLWNAIIGGLATHGLVKE 306

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           S+DLF EMQ + I PDEITYLCLLS CAHGGLV +AWYFF+CL KHGMTPK EHYACMVD
Sbjct: 307 SLDLFDEMQRIRILPDEITYLCLLSACAHGGLVKKAWYFFECLAKHGMTPKSEHYACMVD 366

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            L+RAGQV +A++F+CQM ++PT+SMLGALLSGCM HGKL+L E+VGR+L+E++PDHDGR
Sbjct: 367 VLARAGQVEDAFRFVCQMPIEPTASMLGALLSGCMNHGKLELGELVGRKLIEIEPDHDGR 426

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSN+YAV KRW++ARN+REAME+RGVKK PGFSFVEV G+L+RFIAHDK H  SE I
Sbjct: 427 YVGLSNVYAVFKRWDEARNMREAMERRGVKKFPGFSFVEVSGMLNRFIAHDKGHPKSEEI 486

Query: 587 YVMLNLIIDQMKPIEDSENQEY 609
           Y +L  + +Q+K   D EN EY
Sbjct: 487 YAILIFVTNQIKLDADYENLEY 508

BLAST of Cp4.1LG01g04190 vs. TrEMBL
Match: M5W4Z9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020734mg PE=4 SV=1)

HSP 1 Score: 743.0 bits (1917), Expect = 2.9e-211
Identity = 343/490 (70.00%), Positives = 423/490 (86.33%), Query Frame = 1

Query: 121 AVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNSRNPNRSITVFVRML 180
           ++ SK+LSFSALS  G++DYSY++L  LPNPT F WNT+IRG+S S+NPNRSI+VFV+ML
Sbjct: 252 SLTSKILSFSALSDLGNIDYSYRVLSQLPNPTIFDWNTVIRGYSKSKNPNRSISVFVKML 311

Query: 181 RNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVMNSLIHMYASCGDIA 240
           R+GVSPDY+TYPFL KA A+LL ++LG+AVH H+AK G E DRF+ NSLIHMYA+CGDI 
Sbjct: 312 RDGVSPDYLTYPFLAKASARLLKRELGVAVHAHIAKNGFEFDRFISNSLIHMYAACGDIT 371

Query: 241 FARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVVSWSSLIDGYVKCGE 300
           +A KVFD +  KN VSWN+MLDGYAKCGDV +AREVFDLM +RDVVSWSSLIDGYVK G+
Sbjct: 372 YACKVFDGILVKNSVSWNSMLDGYAKCGDVISAREVFDLMPKRDVVSWSSLIDGYVKVGD 431

Query: 301 YGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYIVENELPLTIVLLTS 360
           Y EA+ +FERMR AGP ANEVT+VSVLCAC HLGALEQG++MH Y+V+N+LPLT+VL TS
Sbjct: 432 YREALVVFERMRVAGPKANEVTMVSVLCACTHLGALEQGKVMHRYMVDNKLPLTLVLQTS 491

Query: 361 LVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKESMDLFSEMQMVGIA 420
           LVDMYAKCGAI +AL  FR   L R+DVL+WNA+IGGLA HGL+++++++F+EMQ++GI 
Sbjct: 492 LVDMYAKCGAIEDALGVFRGGSLHRSDVLMWNAMIGGLAIHGLVQQALEIFAEMQIIGIV 551

Query: 421 PDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVDALSRAGQVSEAYQF 480
           PDEITYLCLLS CAHGGLV EAW+ F+CL KHGM PK EHYACMVD L+R GQV+EAYQF
Sbjct: 552 PDEITYLCLLSACAHGGLVKEAWHLFECLGKHGMKPKCEHYACMVDVLARGGQVTEAYQF 611

Query: 481 LCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGRYVGLSNIYAVDKRW 540
           +CQM  +PT+SMLGALLSGC+ HGKLDLAE VGR+L+E++P HDGRYVGLSN+YA+ KRW
Sbjct: 612 ICQMPREPTASMLGALLSGCVNHGKLDLAENVGRKLIEIEPGHDGRYVGLSNVYALSKRW 671

Query: 541 NDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERIYVMLNLIIDQMKPI 600
           +DAR++REAME+RGVKKSPGFS VE+FG LH+FIAHDK++ +SE IY+ L+ I++++K  
Sbjct: 672 DDARSLREAMERRGVKKSPGFSIVEIFGTLHKFIAHDKSYPESEEIYMTLSYIVNEIKFD 731

Query: 601 EDSENQEYCL 611
            D  NQ+Y L
Sbjct: 732 MDYRNQDYFL 741

BLAST of Cp4.1LG01g04190 vs. TrEMBL
Match: K7N2R3_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G104500 PE=4 SV=1)

HSP 1 Score: 736.9 bits (1901), Expect = 2.0e-209
Identity = 340/506 (67.19%), Positives = 426/506 (84.19%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           +HA++ + G+SQDD  +SK+L FSALS +GD++YSY++   L +PT F WNT+IRG+SNS
Sbjct: 33  LHAVVISCGLSQDDPFISKILCFSALSNSGDINYSYRVFSQLSSPTIFSWNTIIRGYSNS 92

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NP +S+++F++MLR GV+PDY+TYPFL KA A+LLNQ+ G++VH H+ KTGHE DRF+ 
Sbjct: 93  KNPIQSLSIFLKMLRLGVAPDYLTYPFLVKASARLLNQETGVSVHAHIIKTGHESDRFIQ 152

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYA+CG+  +A+KVFD +  KN+VSWN+MLDGYAKCG++  A++ F+ M E+DV 
Sbjct: 153 NSLIHMYAACGNSMWAQKVFDSIQQKNVVSWNSMLDGYAKCGEMVMAQKAFESMSEKDVR 212

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK GEY EAMA+FE+M+SAGP ANEVT+VSV CACAH+GALE+GRM++ YI
Sbjct: 213 SWSSLIDGYVKAGEYSEAMAIFEKMQSAGPKANEVTMVSVSCACAHMGALEKGRMIYKYI 272

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           V+N LPLT+VL TSLVDMYAKCGAI EAL  FR     +TDVLIWNA+IGGLATHGL++E
Sbjct: 273 VDNGLPLTLVLQTSLVDMYAKCGAIEEALLIFRRVSKSQTDVLIWNAVIGGLATHGLVEE 332

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           S+ LF EMQ+VGI PDE+TYLCLL+ CAHGGLV EAW+FF+ L K GMTP  EHYACMVD
Sbjct: 333 SLKLFKEMQIVGICPDEVTYLCLLAACAHGGLVKEAWFFFESLSKCGMTPTSEHYACMVD 392

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            L+RAGQ++ AYQF+CQM  +PT+SMLGALLSGC+ H  L LAE+VGR+L+EL+P+HDGR
Sbjct: 393 VLARAGQLTTAYQFICQMPTEPTASMLGALLSGCINHRNLALAEIVGRKLIELEPNHDGR 452

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           Y+GLSN+YAVDKRW+DAR++REAME+RGVKKSPGFSFVE+ G+LHRFIAHDKTH DSE  
Sbjct: 453 YIGLSNMYAVDKRWDDARSMREAMERRGVKKSPGFSFVEISGVLHRFIAHDKTHPDSEET 512

Query: 587 YVMLNLIIDQMKPIEDSENQEYCLYD 613
           Y MLN ++ QMK     +NQE  L D
Sbjct: 513 YFMLNFVVYQMKLSCHEDNQERSLND 538

BLAST of Cp4.1LG01g04190 vs. TrEMBL
Match: A0A061E2B0_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_007712 PE=4 SV=1)

HSP 1 Score: 729.9 bits (1883), Expect = 2.5e-207
Identity = 336/507 (66.27%), Positives = 418/507 (82.45%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHAL+ TFG+S  D + +KLLSF+A S  G++DY+Y++   LP P  F WN++IRG+SNS
Sbjct: 21  IHALVITFGLSHHDPISTKLLSFAAFSDTGNVDYAYRVFSRLPTPRVFNWNSIIRGYSNS 80

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPN+SI+ F+ MLR GV PD++TYPFL K  A+LL  +LG A+H H  K G E+D+F+ 
Sbjct: 81  KNPNKSISAFINMLRAGVFPDHLTYPFLVKTSARLLKPELGGAIHCHALKNGFELDKFIN 140

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYASC DI +AR+VFDE+P KN+VSWNAMLDGYAKCGD+  AR+VFD M +RDVV
Sbjct: 141 NSLIHMYASCHDIVYARRVFDELPMKNIVSWNAMLDGYAKCGDMALARQVFDWMPQRDVV 200

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWS LIDGY K G+Y EA+A+FE MR  GP ANEVT+VSVLCACAHLGAL  GR+MH Y+
Sbjct: 201 SWSCLIDGYAKSGDYKEALAVFEGMRVWGPKANEVTMVSVLCACAHLGALHLGRLMHCYV 260

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           ++N LP+T+VL TSLVDMYAKCGAI EAL  FR     ++DVL+WNA+IGGLATHGL+KE
Sbjct: 261 MDNGLPMTLVLRTSLVDMYAKCGAIEEALDVFRGVSNCKSDVLLWNAMIGGLATHGLVKE 320

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           S++LF+EMQ+VGI PDEITYLCLLS CAHGG V EAWYFF+CL KHGMTPK EHYACMVD
Sbjct: 321 SLELFAEMQVVGIVPDEITYLCLLSACAHGGSVKEAWYFFECLGKHGMTPKSEHYACMVD 380

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            L+RAGQV+EAYQFLC+M ++PT+S+LGALL+GC+ +GK DLAE+VGR+L+ELDPDHDGR
Sbjct: 381 VLARAGQVAEAYQFLCKMPMEPTASLLGALLNGCLIYGKSDLAEIVGRKLIELDPDHDGR 440

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           Y+GLSN+YA  ++WN+AR +RE+ME+RG+KKS GFS VE+ G LH F+AHD+TH +SE I
Sbjct: 441 YIGLSNVYAAVQQWNEARRMRESMERRGLKKSAGFSCVEMPGALHSFVAHDETHPNSEDI 500

Query: 587 YVMLNLIIDQMKPIEDSENQEYCLYDL 614
           Y ML  I+ QMK     +NQEY LY++
Sbjct: 501 YTMLKFIVSQMKLDVHKDNQEYLLYEM 527

BLAST of Cp4.1LG01g04190 vs. TAIR10
Match: AT5G08305.1 (AT5G08305.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 610.5 bits (1573), Expect = 1.1e-174
Identity = 292/507 (57.59%), Positives = 388/507 (76.53%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IH LL T G+S+++  VS+ LSFSALS +GD+DY+YK L  L +P  + WN +IRGFSNS
Sbjct: 27  IHTLLITLGLSEEEPFVSQTLSFSALSSSGDVDYAYKFLSKLSDPPNYGWNFVIRGFSNS 86

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           RNP +SI+V+++MLR G+ PD+MTYPFL K+ ++L N+KLG ++H  V K+G E D F+ 
Sbjct: 87  RNPEKSISVYIQMLRFGLLPDHMTYPFLMKSSSRLSNRKLGGSLHCSVVKSGLEWDLFIC 146

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           N+LIHMY S  D A ARK+FDEMP KNLV+WN++LD YAK GDV +AR VFD M ERDVV
Sbjct: 147 NTLIHMYGSFRDQASARKLFDEMPHKNLVTWNSILDAYAKSGDVVSARLVFDEMSERDVV 206

Query: 287 SWSSLIDGYVKCGEYGEAMALFERM-RSAGPMANEVTLVSVLCACAHLGALEQGRMMHGY 346
           +WSS+IDGYVK GEY +A+ +F++M R     ANEVT+VSV+CACAHLGAL +G+ +H Y
Sbjct: 207 TWSSMIDGYVKRGEYNKALEIFDQMMRMGSSKANEVTMVSVICACAHLGALNRGKTVHRY 266

Query: 347 IVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIK 406
           I++  LPLT++L TSL+DMYAKCG+I +A + F    ++ TD L+WNAIIGGLA+HG I+
Sbjct: 267 ILDVHLPLTVILQTSLIDMYAKCGSIGDAWSVFYRASVKETDALMWNAIIGGLASHGFIR 326

Query: 407 ESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMV 466
           ES+ LF +M+   I PDEIT+LCLL+ C+HGGLV EAW+FF  L + G  PK EHYACMV
Sbjct: 327 ESLQLFHKMRESKIDPDEITFLCLLAACSHGGLVKEAWHFFKSLKESGAEPKSEHYACMV 386

Query: 467 DALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDG 526
           D LSRAG V +A+ F+ +M ++PT SMLGALL+GC+ HG L+LAE VG++L+EL P +DG
Sbjct: 387 DVLSRAGLVKDAHDFISEMPIKPTGSMLGALLNGCINHGNLELAETVGKKLIELQPHNDG 446

Query: 527 RYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSER 586
           RYVGL+N+YA++K++  AR++REAMEK+GVKK  G S +++ G  HRFIAHDKTH  S++
Sbjct: 447 RYVGLANVYAINKQFRAARSMREAMEKKGVKKIAGHSILDLDGTRHRFIAHDKTHFHSDK 506

Query: 587 IYVMLNLIIDQMK---PIEDSENQEYC 610
           IY +L L    M      +D +N  +C
Sbjct: 507 IYAVLQLTGAWMNLDVDYDDQDNHCFC 533
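
The percentages in each HSP summary line follow directly from the counts: identical (or positive-scoring) positions divided by the alignment length. A minimal Python sketch that parses such a line and recomputes both values is shown here; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration, and the pattern assumes the line layout used in this report.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# "Pos[a-z]*" also accepts misspelled variants of "Positives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Pos[a-z]*\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return the counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "align_len": ident_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identical / ident_len, 2),
        "positives_pct": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    # Summary line of the AT5G08305.1 hit: 292 of 507 aligned positions are identical.
    stats = parse_hsp_stats(
        "Identity = 292/507 (57.59%), Positives = 388/507 (76.53%), Query Frame = 1"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

For the AT5G08305.1 hit this gives 292/507 = 57.59% identity and 388/507 = 76.53% positives, matching the reported values.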

BLAST of Cp4.1LG01g04190 vs. TAIR10
Match: AT2G22410.1 (AT2G22410.1 SLOW GROWTH 1)

HSP 1 Score: 412.9 bits (1060), Expect = 3.4e-115
Identity = 229/613 (37.36%), Positives = 347/613 (56.61%), Query Frame = 1

Query: 7   LISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPN 66
           L+SLL+KCK +  L++I A +   G+  D    S+L++F ALS +  LDYS K+L  + N
Sbjct: 56  LLSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILKGIEN 115

Query: 67  PTTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVS---PDYMTIHALLFTFGVSQDDAVV 126
           P  F WN  IRGFS S NP  S  ++ +MLR+G     PD+ T   L       +  ++ 
Sbjct: 116 PNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLG 175

Query: 127 SKLL-----------------SFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 186
             +L                 S    +  GD++ + K+    P      WN LI G+   
Sbjct: 176 HMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKI 235

Query: 187 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 246
               ++I V+  M   GV PD +T   L  + + L +   G   + +V + G  +   ++
Sbjct: 236 GEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPLV 295

Query: 247 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 306
           N+L+ M++ CGDI  AR++FD +  + +VSW  M+ GYA+CG ++ +R++FD M E+DVV
Sbjct: 296 NALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDVV 355

Query: 307 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 366
            W+++I G V+     +A+ALF+ M+++    +E+T++  L AC+ LGAL+ G  +H YI
Sbjct: 356 LWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRYI 415

Query: 367 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 426
            +  L L + L TSLVDMYAKCG I EAL+ F    +Q  + L + AIIGGLA HG    
Sbjct: 416 EKYSLSLNVALGTSLVDMYAKCGNISEALSVFHG--IQTRNSLTYTAIIGGLALHGDAST 475

Query: 427 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAW-YFFDCLHKHGMTPKDEHYACMV 486
           ++  F+EM   GIAPDEIT++ LLS C HGG++     YF     +  + P+ +HY+ MV
Sbjct: 476 AISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMV 535

Query: 487 DALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDG 546
           D L RAG + EA + +  M ++  +++ GALL GC  HG ++L E   ++L+ELDP   G
Sbjct: 536 DLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSG 595

Query: 547 RYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSER 599
            YV L  +Y     W DA+  R  M +RGV+K PG S +EV GI+  FI  DK+  +SE+
Sbjct: 596 IYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEK 655

BLAST of Cp4.1LG01g04190 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 397.5 bits (1020), Expect = 1.5e-110
Identity = 208/613 (33.93%), Positives = 352/613 (57.42%), Query Frame = 1

Query: 8   ISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNP 67
           ISL+++C S+ +L++ H  +   G   D    SKL + +ALS    L+Y+ K+   +P P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 68  TTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVS-PDYMT-------------------I 127
            +F WNTLIR +++  +P  SI  F+ M+      P+  T                   +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 128 HALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNSR 187
           H +     V  D  V + L+        GDLD + K+   +       WN++I GF    
Sbjct: 154 HGMAVKSAVGSDVFVANSLIH--CYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKG 213

Query: 188 NPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVMN 247
           +P++++ +F +M    V   ++T   +  A AK+ N + G  V  ++ +    V+  + N
Sbjct: 214 SPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLAN 273

Query: 248 SLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVVS 307
           +++ MY  CG I  A+++FD M  K+ V+W  MLDGYA   D   AREV + M ++D+V+
Sbjct: 274 AMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVA 333

Query: 308 WSSLIDGYVKCGEYGEAMALFERMRSAGPMA-NEVTLVSVLCACAHLGALEQGRMMHGYI 367
           W++LI  Y + G+  EA+ +F  ++    M  N++TLVS L ACA +GALE GR +H YI
Sbjct: 334 WNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYI 393

Query: 368 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 427
            ++ + +   + ++L+ MY+KCG + ++   F +  +++ DV +W+A+IGGLA HG   E
Sbjct: 394 KKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNS--VEKRDVFVWSAMIGGLAMHGCGNE 453

Query: 428 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHK-HGMTPKDEHYACMV 487
           ++D+F +MQ   + P+ +T+  +   C+H GLV+EA   F  +   +G+ P+++HYAC+V
Sbjct: 454 AVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIV 513

Query: 488 DALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDG 547
           D L R+G + +A +F+  M + P++S+ GALL  C  H  L+LAE+   RL+EL+P +DG
Sbjct: 514 DVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDG 573

Query: 548 RYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSER 599
            +V LSNIYA   +W +   +R+ M   G+KK PG S +E+ G++H F++ D  H  SE+
Sbjct: 574 AHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEK 633

BLAST of Cp4.1LG01g04190 vs. TAIR10
Match: AT3G15930.1 (AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 397.5 bits (1020), Expect = 1.5e-110
Identity = 214/607 (35.26%), Positives = 340/607 (56.01%), Query Frame = 1

Query: 5   SSLISLLDKCKSMSELRRIHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNL 64
           S  IS+L  CK+  + +++H+   T GV+ +     KL  F      G + Y+YKL + +
Sbjct: 35  SRFISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGGHVSYAYKLFVKI 94

Query: 65  PNPTTFKWNTLIRGFSNSRNPNRSITVFVRMLRNGVSPDYMT------------------ 124
           P P    WN +I+G+S        + +++ ML+ GV+PD  T                  
Sbjct: 95  PEPDVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLNGLKRDGGALACG 154

Query: 125 --IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFS 184
             +H  +  FG+  +  V + L+   +L   G +D +  +         F WN +I G++
Sbjct: 155 KKLHCHVVKFGLGSNLYVQNALVKMYSL--CGLMDMARGVFDRRCKEDVFSWNLMISGYN 214

Query: 185 NSRNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRF 244
             +    SI + V M RN VSP  +T   +  A +K+ ++ L   VH +V++   E    
Sbjct: 215 RMKEYEESIELLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECKTEPSLR 274

Query: 245 VMNSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERD 304
           + N+L++ YA+CG++  A ++F  M  ++++SW +++ GY + G++  AR  FD M  RD
Sbjct: 275 LENALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRD 334

Query: 305 VVSWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHG 364
            +SW+ +IDGY++ G + E++ +F  M+SAG + +E T+VSVL ACAHLG+LE G  +  
Sbjct: 335 RISWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLGSLEIGEWIKT 394

Query: 365 YIVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLI 424
           YI +N++   +V+  +L+DMY KCG   +A   F    + + D   W A++ GLA +G  
Sbjct: 395 YIDKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFH--DMDQRDKFTWTAMVVGLANNGQG 454

Query: 425 KESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHK-HGMTPKDEHYAC 484
           +E++ +F +MQ + I PD+ITYL +LS C H G+V++A  FF  +   H + P   HY C
Sbjct: 455 QEAIKVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIEPSLVHYGC 514

Query: 485 MVDALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDH 544
           MVD L RAG V EAY+ L +M + P S + GALL     H    +AE+  ++++EL+PD+
Sbjct: 515 MVDMLGRAGLVKEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKKILELEPDN 574

Query: 545 DGRYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDS 591
              Y  L NIYA  KRW D R +R  +    +KK+PGFS +EV G  H F+A DK+H  S
Sbjct: 575 GAVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQS 634

BLAST of Cp4.1LG01g04190 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 383.3 bits (983), Expect = 2.9e-106
Identity = 197/518 (38.03%), Positives = 316/518 (61.00%), Query Frame = 1

Query: 103 DYMTIHALLFTFGVSQDDAVVSKLLSFSA-----LSPAGDLDYSYKLLLNLPNPTTFKWN 162
           D   IH  L    +  D  V S+LL+          P   L Y+Y +   + NP  F +N
Sbjct: 27  DLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQIQNPNLFVFN 86

Query: 163 TLIRGFSNSRNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKT 222
            LIR FS    P+++   + +ML++ + PD +T+PFL KA +++    +G   H  + + 
Sbjct: 87  LLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQTHSQIVRF 146

Query: 223 GHEVDRFVMNSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVF 282
           G + D +V NSL+HMYA+CG IA A ++F +M  +++VSW +M+ GY KCG V  ARE+F
Sbjct: 147 GFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGMVENAREMF 206

Query: 283 DLMHERDVVSWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALE 342
           D M  R++ +WS +I+GY K   + +A+ LFE M+  G +ANE  +VSV+ +CAHLGALE
Sbjct: 207 DEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALE 266

Query: 343 QGRMMHGYIVENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGG 402
            G   + Y+V++ + + ++L T+LVDM+ +CG I +A+  F   P   TD L W++II G
Sbjct: 267 FGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLP--ETDSLSWSSIIKG 326

Query: 403 LATHGLIKESMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHK-HGMTP 462
           LA HG   ++M  FS+M  +G  P ++T+  +LS C+HGGLV +    ++ + K HG+ P
Sbjct: 327 LAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEP 386

Query: 463 KDEHYACMVDALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRL 522
           + EHY C+VD L RAG+++EA  F+ +M V+P + +LGALL  C  +   ++AE VG  L
Sbjct: 387 RLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNML 446

Query: 523 VELDPDHDGRYVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRF-IA 582
           +++ P+H G YV LSNIYA   +W+   ++R+ M+++ VKK PG+S +E+ G +++F + 
Sbjct: 447 IKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKINKFTMG 506

Query: 583 HDKTHGDSERIYVMLNLIIDQMKPIEDSENQEYCLYDL 614
            D+ H +  +I      I+ +++ I    N     +D+
Sbjct: 507 DDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDV 542

BLAST of Cp4.1LG01g04190 vs. NCBI nr
Match: gi|659121006|ref|XP_008460457.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08305 [Cucumis melo])

HSP 1 Score: 891.0 bits (2301), Expect = 1.2e-255
Identity = 422/511 (82.58%), Positives = 467/511 (91.39%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHALLFT G+SQD+ + SKLL FSALSPA DLDYSYKL+LNLPNPTTF WNTLIRGFSN+
Sbjct: 33  IHALLFTLGISQDEIIKSKLLLFSALSPARDLDYSYKLILNLPNPTTFNWNTLIRGFSNT 92

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPN SITVF++ML+NGVSP+Y+TYPFL KA +KLLNQ+LGMA+HVH+ K+GHE+D+F+ 
Sbjct: 93  KNPNPSITVFIKMLQNGVSPNYLTYPFLVKATSKLLNQELGMALHVHIVKSGHEIDKFIQ 152

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYASC DIA ARKVFDEM TKNLV+WNAMLDGYAKCG++N AREVF LM ERDVV
Sbjct: 153 NSLIHMYASCRDIASARKVFDEMATKNLVTWNAMLDGYAKCGNLNMAREVFSLMPERDVV 212

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK G YGEAMALFERM   GPMANEVTLVSVLCACAHLGALE+GRMMH YI
Sbjct: 213 SWSSLIDGYVKGGVYGEAMALFERMSFVGPMANEVTLVSVLCACAHLGALERGRMMHRYI 272

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           VENELPLTIVL TSLVDMYAKCGAIHEAL  FRAC LQ  DVLIWNAIIGGLATHGLIKE
Sbjct: 273 VENELPLTIVLQTSLVDMYAKCGAIHEALTVFRACSLQEADVLIWNAIIGGLATHGLIKE 332

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           S+DLF +MQMVGI PDEITYLCLLSCCAHGGLV EAWYFFDCL KHGM PK EHYACMVD
Sbjct: 333 SLDLFCKMQMVGIVPDEITYLCLLSCCAHGGLVEEAWYFFDCLRKHGMFPKVEHYACMVD 392

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            LSRAGQVSEAYQFLCQM VQPTSSMLGALLSGCMKHGKLD+A+VVGRRLVELDP+HDGR
Sbjct: 393 VLSRAGQVSEAYQFLCQMPVQPTSSMLGALLSGCMKHGKLDIAKVVGRRLVELDPNHDGR 452

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSNIYA DKRW+DA+N+REAME++G+KKSPGFSF+EV+GILHRF+AHDKTHGDS++I
Sbjct: 453 YVGLSNIYAADKRWDDAKNMREAMERKGMKKSPGFSFIEVYGILHRFMAHDKTHGDSKQI 512

Query: 587 YVMLNLIIDQMKPIEDSENQEYCLYDLIGVS 618
           ++MLN I+DQMKPIED  +QE C YD+I +S
Sbjct: 513 FMMLNFIVDQMKPIEDYVHQECCFYDIINIS 543

BLAST of Cp4.1LG01g04190 vs. NCBI nr
Match: gi|778702400|ref|XP_011655189.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08305 isoform X1 [Cucumis sativus])

HSP 1 Score: 889.4 bits (2297), Expect = 3.5e-255
Identity = 422/511 (82.58%), Positives = 465/511 (91.00%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHALLFT G+SQD+ + SKLL FSALSPA DLDYSYKL+LN+PNPTTF WNTLIR FSN+
Sbjct: 32  IHALLFTLGISQDETIKSKLLLFSALSPARDLDYSYKLILNVPNPTTFNWNTLIRAFSNT 91

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPN SITVF++ML+NGVSPDY+TYPFL KA +KLLNQ+LGMAVHVH+ K+GHE+D+F+ 
Sbjct: 92  KNPNPSITVFIKMLQNGVSPDYLTYPFLVKATSKLLNQELGMAVHVHIVKSGHEIDKFIQ 151

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYASC DIA ARKVFDEMP KNLV+WNAMLDGYAKCGD+N AREVF+LM E+DVV
Sbjct: 152 NSLIHMYASCRDIASARKVFDEMPRKNLVTWNAMLDGYAKCGDLNMAREVFNLMPEKDVV 211

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK   YGEAMALFERM   GPMANEVTLVS LCACAHLGALE GRMMH YI
Sbjct: 212 SWSSLIDGYVKGRVYGEAMALFERMSFDGPMANEVTLVSALCACAHLGALEHGRMMHRYI 271

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           VENELPLTIVL TSLVDMYAKCGAIHEAL  FRAC LQ  DVLIWNAIIGGLATHGLIKE
Sbjct: 272 VENELPLTIVLQTSLVDMYAKCGAIHEALTVFRACSLQEADVLIWNAIIGGLATHGLIKE 331

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           +M+LF EM+MVGI PDEITYLCLLSCCAHGGLV EAWYFFDCL KHGM PK EHYACMVD
Sbjct: 332 AMNLFCEMKMVGIVPDEITYLCLLSCCAHGGLVEEAWYFFDCLRKHGMIPKVEHYACMVD 391

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
           ALSRAGQVSEAYQFLCQM VQPTSSMLGALLSGCMKHGKLD+A+VVGRRLVELDP+HDGR
Sbjct: 392 ALSRAGQVSEAYQFLCQMPVQPTSSMLGALLSGCMKHGKLDIAKVVGRRLVELDPNHDGR 451

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSNIYA DKRW+DA+NIREAME++GVKKSPGFSF+EV+G+LHRF+AHDKTHGD E+I
Sbjct: 452 YVGLSNIYAADKRWDDAKNIREAMERKGVKKSPGFSFIEVYGVLHRFMAHDKTHGDCEQI 511

Query: 587 YVMLNLIIDQMKPIEDSENQEYCLYDLIGVS 618
           ++MLNLI+DQMKPIED  +QE C YD++ VS
Sbjct: 512 FMMLNLIVDQMKPIEDYVHQECCFYDIMNVS 542

BLAST of Cp4.1LG01g04190 vs. NCBI nr
Match: gi|645218187|ref|XP_008229311.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08305 [Prunus mume])

HSP 1 Score: 765.8 bits (1976), Expect = 5.9e-218
Identity = 353/505 (69.90%), Positives = 436/505 (86.34%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           +HALL T G+SQ+ ++ SK+LSFSALS  G++DYSY++L  LPNPT F WNT+IRG+S S
Sbjct: 34  MHALLLTCGLSQEQSLTSKILSFSALSDLGNIDYSYRVLSQLPNPTIFYWNTVIRGYSKS 93

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPNRSI+VFV+MLR+GVSPDY+TYPFL KA A+LL ++LG+AVH H+AK G E DRF+ 
Sbjct: 94  KNPNRSISVFVKMLRDGVSPDYLTYPFLAKASARLLKRELGVAVHAHIAKNGFEFDRFIS 153

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYA+CGDI +ARKVFD +  KN VSWN+MLDGYAKCGDV +AREVFDLM +RDVV
Sbjct: 154 NSLIHMYAACGDITYARKVFDGIFVKNSVSWNSMLDGYAKCGDVISAREVFDLMPKRDVV 213

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK G+Y EA+ +FERMR  GP ANEVT+VSVLCAC HLGALE+G++MH Y+
Sbjct: 214 SWSSLIDGYVKAGDYREALVVFERMRVVGPKANEVTMVSVLCACTHLGALEEGKVMHRYM 273

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           V+N+LPLT+VL TSLVDMYAKCGAI +AL  FR   L R+DVLIWNA+IGGLA HGL+++
Sbjct: 274 VDNKLPLTLVLQTSLVDMYAKCGAIEDALGVFRGGSLHRSDVLIWNAMIGGLAIHGLVQQ 333

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           +++ FSEMQ++GI PDEITYLCLLS CAHGGLV EAW+FF+CL KHGM PK EHYACMVD
Sbjct: 334 ALEFFSEMQIIGIVPDEITYLCLLSACAHGGLVKEAWHFFECLGKHGMKPKCEHYACMVD 393

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            L+R GQV+EA+QF+CQM  +PT+SMLGALLSGC+ HGKLDLAE+VGR+L+E++P HDGR
Sbjct: 394 VLARGGQVTEAHQFICQMPREPTASMLGALLSGCVNHGKLDLAEIVGRKLIEIEPGHDGR 453

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSN+YA+ KRW+DAR++REAME+RGVKKSPGFS VE+FG LH+FIAHDK++ +SE I
Sbjct: 454 YVGLSNVYALSKRWDDARSLREAMERRGVKKSPGFSIVEIFGTLHKFIAHDKSYPESEEI 513

Query: 587 YVMLNLIIDQMKPIEDSENQEYCLY 612
           Y+ L+ I++++K   D  NQ+Y LY
Sbjct: 514 YMTLSSIVNEIKFDMDYRNQDYLLY 538

BLAST of Cp4.1LG01g04190 vs. NCBI nr
Match: gi|694422149|ref|XP_009338904.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08305, partial [Pyrus x bretschneideri])

HSP 1 Score: 761.1 bits (1964), Expect = 1.5e-216
Identity = 352/502 (70.12%), Positives = 432/502 (86.06%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHALL T G+SQ   + SK+LSFSALS  G+++YSY++   L +PT F WNT+IRG+SNS
Sbjct: 39  IHALLLTLGLSQHHLLTSKILSFSALSNLGNIEYSYRVFSQLSHPTIFYWNTVIRGYSNS 98

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPNRS++VFV+MLR+GVSPDY+TYPFL KA A+LL ++LGMAVH H+AK G E DRF+ 
Sbjct: 99  KNPNRSLSVFVKMLRHGVSPDYLTYPFLVKASARLLKRELGMAVHAHIAKDGFESDRFIS 158

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYA+C DI +ARKVFD +P +N VSWN+MLDGYAKCGDVN+AREVF+LM E +VV
Sbjct: 159 NSLIHMYATCRDIMYARKVFDGIPVRNSVSWNSMLDGYAKCGDVNSAREVFELMPEHNVV 218

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK GE+ EA+A+FERM   GP ANEVT+VSVL AC HLGALEQG++MH Y+
Sbjct: 219 SWSSLIDGYVKAGEFSEALAVFERMCVVGPKANEVTMVSVLSACTHLGALEQGKVMHRYM 278

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
            ENELPLT+ L TSLVDMYAKCGAI EAL  FR   L ++D+LIWNA+IGGLA HGL+++
Sbjct: 279 AENELPLTLALQTSLVDMYAKCGAIEEALCVFRGGSLHQSDLLIWNAMIGGLAMHGLVQQ 338

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           ++ +FSEMQ++GIAPDEITYLCLLS CAHGGLV EAW+FF+C+ KHGMTPK EHYACMVD
Sbjct: 339 ALKIFSEMQIIGIAPDEITYLCLLSACAHGGLVKEAWHFFECIGKHGMTPKCEHYACMVD 398

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            L+R GQV EAYQF+CQM+ +PT+SMLGALLSGCM HGKLDLAE+VG++L+E+ PDHDGR
Sbjct: 399 VLARGGQVVEAYQFICQMSKEPTASMLGALLSGCMNHGKLDLAEIVGKKLIEIQPDHDGR 458

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSN+YAV +RW+DAR++REAME+RGVKKSPGFSFVE+FG LH+FIAHDK++ +SE I
Sbjct: 459 YVGLSNVYAVFRRWDDARSLREAMERRGVKKSPGFSFVEIFGTLHKFIAHDKSYPESEEI 518

Query: 587 YVMLNLIIDQMKPIEDSENQEY 609
           Y MLN I++Q+K  ++  NQ+Y
Sbjct: 519 YTMLNFIVNQIKFDKEYRNQDY 540

BLAST of Cp4.1LG01g04190 vs. NCBI nr
Match: gi|658000275|ref|XP_008392585.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08305 [Malus domestica])

HSP 1 Score: 760.4 bits (1962), Expect = 2.5e-216
Identity = 352/502 (70.12%), Positives = 433/502 (86.25%), Query Frame = 1

Query: 107 IHALLFTFGVSQDDAVVSKLLSFSALSPAGDLDYSYKLLLNLPNPTTFKWNTLIRGFSNS 166
           IHALL T G+SQ  ++ SK+LSFSALS  G+++YSY++   LP+PT F WNT+IRG+SNS
Sbjct: 7   IHALLLTLGLSQHHSLTSKILSFSALSNLGNIEYSYRVFSQLPHPTIFYWNTVIRGYSNS 66

Query: 167 RNPNRSITVFVRMLRNGVSPDYMTYPFLGKAIAKLLNQKLGMAVHVHVAKTGHEVDRFVM 226
           +NPNRS++VFV+MLR+GVSPDY+TYPFL KA A+LL ++LGMAVH H+AK G E DRF+ 
Sbjct: 67  KNPNRSLSVFVKMLRDGVSPDYLTYPFLVKASARLLKRELGMAVHAHIAKNGFESDRFIS 126

Query: 227 NSLIHMYASCGDIAFARKVFDEMPTKNLVSWNAMLDGYAKCGDVNTAREVFDLMHERDVV 286
           NSLIHMYA+C DI +A KVFD +P +N VSWN+MLDGYAKCGDVN+A+EVF+LM E +VV
Sbjct: 127 NSLIHMYATCRDIMYAHKVFDGIPVRNSVSWNSMLDGYAKCGDVNSAQEVFELMPEHNVV 186

Query: 287 SWSSLIDGYVKCGEYGEAMALFERMRSAGPMANEVTLVSVLCACAHLGALEQGRMMHGYI 346
           SWSSLIDGYVK G++ EA+A+FERM   GP ANEVT+VSVL AC HLGALEQG++MH Y+
Sbjct: 187 SWSSLIDGYVKAGKFSEALAVFERMCVVGPKANEVTMVSVLSACTHLGALEQGKVMHRYM 246

Query: 347 VENELPLTIVLLTSLVDMYAKCGAIHEALAAFRACPLQRTDVLIWNAIIGGLATHGLIKE 406
           VENELPLT+ L TSLVDMYAKCGAI EAL  FR   L ++DVLIWNA+IGGLA HGL+++
Sbjct: 247 VENELPLTLALQTSLVDMYAKCGAIEEALGVFRGGSLHQSDVLIWNAMIGGLAMHGLVQQ 306

Query: 407 SMDLFSEMQMVGIAPDEITYLCLLSCCAHGGLVNEAWYFFDCLHKHGMTPKDEHYACMVD 466
           ++++FSEMQ++GIAPDEITYLCLLS CAH GLV EAW+FF+CL KHGMTPK EHYACMVD
Sbjct: 307 ALEIFSEMQIIGIAPDEITYLCLLSACAHRGLVKEAWHFFECLGKHGMTPKCEHYACMVD 366

Query: 467 ALSRAGQVSEAYQFLCQMTVQPTSSMLGALLSGCMKHGKLDLAEVVGRRLVELDPDHDGR 526
            L+R GQV EAYQF+CQM+ +PT SMLGALLSGCM HGKLDLAE+VG++L+E+ PDHDGR
Sbjct: 367 VLARGGQVVEAYQFICQMSKEPTXSMLGALLSGCMNHGKLDLAEIVGKKLIEIQPDHDGR 426

Query: 527 YVGLSNIYAVDKRWNDARNIREAMEKRGVKKSPGFSFVEVFGILHRFIAHDKTHGDSERI 586
           YVGLSN+YAV +RW+DAR++REAME+RGVKKSPGFSFVE+FG LH+FIAHDK++ +SE I
Sbjct: 427 YVGLSNVYAVSRRWDDARSLREAMERRGVKKSPGFSFVEIFGTLHKFIAHDKSYPESEDI 486

Query: 587 YVMLNLIIDQMKPIEDSENQEY 609
           Y MLN I++Q+K  ++  NQ+Y
Sbjct: 487 YRMLNFIVNQIKFDKEYRNQDY 508

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
PP369_ARATH       2.0e-173  57.59         Pentatricopeptide repeat-containing protein At5g08305 OS=Arabidopsis thaliana GN...
PP169_ARATH       6.1e-114  37.36         Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...
PP175_ARATH       2.6e-109  33.93         Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP235_ARATH       2.6e-109  35.26         Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th...
PP367_ARATH       5.2e-105  38.03         Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity (%)  Description
A0A0A0KMZ3_CUCSA  2.5e-255  82.58         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G387410 PE=4 SV=1
W9SLB8_9ROSA      5.2e-213  69.72         Uncharacterized protein OS=Morus notabilis GN=L484_007552 PE=4 SV=1
M5W4Z9_PRUPE      2.9e-211  70.00         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020734mg PE=4 SV=1
K7N2R3_SOYBN      2.0e-209  67.19         Uncharacterized protein OS=Glycine max GN=GLYMA_20G104500 PE=4 SV=1
A0A061E2B0_THECC  2.5e-207  66.27         Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_007712 PE...

Match Name        E-value   Identity (%)  Description
AT5G08305.1       1.1e-174  57.59         Pentatricopeptide repeat (PPR) superfamily protein
AT2G22410.1       3.4e-115  37.36         SLOW GROWTH 1
AT2G29760.1       1.5e-110  33.93         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15930.1       1.5e-110  35.26         Pentatricopeptide repeat (PPR) superfamily protein
AT5G06540.1       2.9e-106  38.03         Pentatricopeptide repeat (PPR) superfamily protein

Match Name                       E-value   Identity (%)  Description
gi|659121006|ref|XP_008460457.1|  1.2e-255  82.58         PREDICTED: pentatricopeptide repeat-containing protein At5g08305 [Cucumis melo]
gi|778702400|ref|XP_011655189.1|  3.5e-255  82.58         PREDICTED: pentatricopeptide repeat-containing protein At5g08305 isoform X1 [Cuc...
gi|645218187|ref|XP_008229311.1|  5.9e-218  69.90         PREDICTED: pentatricopeptide repeat-containing protein At5g08305 [Prunus mume]
gi|694422149|ref|XP_009338904.1|  1.5e-216  70.12         PREDICTED: pentatricopeptide repeat-containing protein At5g08305, partial [Pyrus...
gi|658000275|ref|XP_008392585.1|  2.5e-216  70.12         PREDICTED: pentatricopeptide repeat-containing protein At5g08305 [Malus domestic...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0000049 tRNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g04190.1  Cp4.1LG01g04190.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                        Source    Source Term       Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 72..100    score: 0.001
    coord: 156..184   score: 0.001
    coord: 461..484   score: 0.021
    coord: 494..518   score:
IPR002885  Pentatricopeptide repeat               PFAM      PF12854           PPR_1
    coord: 253..280   score: 5.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 284..331   score: 6.2E-9
    coord: 387..434   score: 4.4
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 156..187   score: 3.6E-6
    coord: 286..315   score: 2.6E-7
    coord: 226..254   score: 0.0014
    coord: 72..103    score: 3.6E-6
    coord: 461..484   score: 0.0011
    coord: 424..456   score: 7.1E-4
    coord: 255..286   score: 4.8E-9
    coord: 390..423   score: 2.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 354..384   score: 5.733
    coord: 319..353   score: 6.39
    coord: 222..252   score: 9.01
    coord: 68..102    score: 10.698
    coord: 457..491   score: 7.509
    coord: 253..283   score: 11.082
    coord: 387..421   score: 11.17
    coord: 523..557   score: 6.336
    coord: 422..456   score: 10.293
    coord: 284..318   score: 12.2
    coord: 152..186   score: 10
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 252..430   score: 5.8E-9
    coord: 465..545   score: 5.
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like
    coord: 223..320   score: 5.69E-5
    coord: 295..548   score: 5.81
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 115..564   score: 8.2E
None       No IPR available                       PANTHER   PTHR24015:SF132   SUBFAMILY NOT NAMED
    coord: 115..564   score: 8.2E
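
Each hit in the Alignment column is given as a residue range of the form `coord: start..end`. As a rough sketch, assuming the ranges are 1-based and inclusive (the page does not state this), the following Python snippet extracts the ranges from a fragment of the annotation text and computes their lengths. The helper names `hit_spans` and `span_length` are hypothetical and used only for illustration.

```python
import re

# Matches "coord: 72..100" and captures the start and end positions.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def hit_spans(text):
    """Return every (start, end) range found in the annotation text, in order."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(span):
    """Length of a range, assuming both end points are included."""
    start, end = span
    return end - start + 1


if __name__ == "__main__":
    # First two PF01535 hits from the InterPro annotations.
    fragment = "coord: 72..100 score: 0.001 coord: 156..184 score: 0.001"
    for span in hit_spans(fragment):
        print(span, span_length(span))
```

Under the inclusive assumption, the PF01535 hit at 72..100 spans 29 residues.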

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                   Block
Cp4.1LG01g04190  CmaCh04G002840  Cucurbita maxima (Rimu)    cmacpeB720
Cp4.1LG01g04190  CmoCh04G002920  Cucurbita moschata (Rifu)  cmocpeB673
Cp4.1LG01g04190  Carg25685       Silver-seed gourd          carcpeB1186
The following gene(s) are paralogous to this gene:

None