Cp4.1LG12g04710.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG12g04710.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG12 : 4503648 .. 4506017 (+)
Sequence length2370
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAATCAAGGGCAACGCCGACTCTGTCTCGATTGGCCGACCTCCTCCTTGTTGCTTCCATCACCAAAACCCTATCGGAATCAGGTACTCGAACCCTTCAACACCAATCACTTTCAATATCGGAGCCTCTCCTCCTCCAAATTCTCCGTAGCAGATCTGTTCATCCTTCGAATAAGCTCGATTTCTTCAAATGGTGCTCTCTCAGCCCGAATTTCAGCCATTCAGCCTCCACATATTCTCAAATCTTCCGTACCCTCTGTCGCTCCGGATACCTCCATGAGGTTCCCCTTGTACTCTCCTCGATGAAGCGAGACGGTGTTGATGTTGATTCTCACACTTTCAAGGTCCTTCTCGATGCGTTTATCAGGTCTGGTAAATTCGATGCTGCTCTTGAGATTTTAGACCATATGGAAGAGTTAGGAACTAGCTTGGAACTTAACACGTACAACTCTGTTCTCGTCGCTCTCGTCAGGAAAAACCAGGTGGGTTTGGCCTTGTCAATTTTCTTTAAGCTCTTTGATGCTTTTAGTACTGGAGGGCAAGAAGGTAGTGCTGTACCTAGTTTTTCCTTCTTGCCTAATGCACTTGCTTGTAATGAATTGTTGGTTGCTCTTAGGAAATCAGACATGAGGGTTGAGTTCAAAAAGGTTTTTGACAAGCTTAGAACAATTAGAAGCTTTGAGTTTAATGTCTGCGGTTATAATATATGCATTCATGCCTTTGGATGCTGGGGTTATCTGGATACTTCTCTTGCCCTGTTCAAAGAAATGAAGCAAAGGAGCTTAGTTTCGGTGTCTTTTGGTCCAGATTTGTGTACATATAATAGCCTTATTCATGTTCTCTGTTTGGTAGGGAAGGTAAACGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACATACCGTCTTATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATGCTACCGCGATTTTTAATGAAATGGAGTACAATGGATTTGTCCCAGATACCATTGTATATAATTCTCTCCTTGATGGGTTATTTAAGGCTCGGAGAGTTATTGAAGCATGTCAATTTTTTGATAAAATGGTGCAAGAAGGTGTTAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTATTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTTGATGGTGTTACTTACAGCATCATTATACTGCAACTGTGTAAAGAGGGACTGCTTGAGGAAGCACTACAATTGGTTGAAGAAATGGAAGCGAGAGGCTTTGTTATCGATCTTGTTACTGTAACATCTTTGTTGATTGCAATGCACAAGCAAGGGCAGTGGGAAGGGTTAGAGAGGCTCATGAAGCACATTAGAGAAGGTGATTTGGTCCCCAATGTGCTGAAATGGAAGGCCAATATGGAAGATTCAGTGAAGTATCAGAAAAATAAAAGGAAAAACTACTCATCTCTGTTTTCTCCAAAGGAGGATCTGAGTGAGATTATAAGTTCAAGAGCTTCTTCCGTTGCTAAAGTTAATGTTGGTGATATTTCCGAAAACACAGAAGAAAAAGATGATGACAATTGGTCATCATCCCCACATGTAGATCTCTTGGCTAATCTTGCTAAGTCTACAGGTGATTCATTGCAACCGTTCTCTCTTAGTCCAGGGCAACGGGTTGAAGCAAAAGGGGACAACTCATTCGATATCGATATGGTCAATACATTTTTGTCTATTTTTCTAGCAAAGGGAAAATTGAGCTTAGCTTGTAAGTTGTTTGAGATCTTCAGCGATATGGGCGTGAACCCAGTGAGGTACACCTACAATTCAATGTTGAGTGCATTTGTGAAGAAGGGATACTTTCATCAGGCATGGGGTATATTTAACGAAATGGGCGAGAAGGTATGTCCAGCTGATATAGCCACGTATAATTTGATAATTCAAGGACTCGGGAAGATGGGTAGAGCAGATCTTGCAAGTTCGGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATCGTAATGTACAACACGTTGATGAATGCGCTGGGGAAGGCAGGTCGAATGGATGATGTAAATAAGCTTTTTGAGCAAATGAGGAGCAGTGGGATAAACCCAGATGTTGTCAGTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGACGCTTACAAATTTTTGAAGATGATGCTGGATTCGGGCTGTTCCCCGAACCATGTCACGGATACAATTTTGGATTTCCTAGGGAGAGAGATTGAGAAAGCGAGGTATGAAAAAGCTTCAATCATCCGTGACAAGAACAGTTCTTGA

mRNA sequence

ATGGAATCAAGGGCAACGCCGACTCTGTCTCGATTGGCCGACCTCCTCCTTGTTGCTTCCATCACCAAAACCCTATCGGAATCAGGTACTCGAACCCTTCAACACCAATCACTTTCAATATCGGAGCCTCTCCTCCTCCAAATTCTCCGTAGCAGATCTGTTCATCCTTCGAATAAGCTCGATTTCTTCAAATGGTGCTCTCTCAGCCCGAATTTCAGCCATTCAGCCTCCACATATTCTCAAATCTTCCGTACCCTCTGTCGCTCCGGATACCTCCATGAGGTTCCCCTTGTACTCTCCTCGATGAAGCGAGACGGTGTTGATGTTGATTCTCACACTTTCAAGGTCCTTCTCGATGCGTTTATCAGGTCTGGTAAATTCGATGCTGCTCTTGAGATTTTAGACCATATGGAAGAGTTAGGAACTAGCTTGGAACTTAACACGTACAACTCTGTTCTCGTCGCTCTCGTCAGGAAAAACCAGGTGGGTTTGGCCTTGTCAATTTTCTTTAAGCTCTTTGATGCTTTTAGTACTGGAGGGCAAGAAGGTAGTGCTGTACCTAGTTTTTCCTTCTTGCCTAATGCACTTGCTTGTAATGAATTGTTGGTTGCTCTTAGGAAATCAGACATGAGGGTTGAGTTCAAAAAGGTTTTTGACAAGCTTAGAACAATTAGAAGCTTTGAGTTTAATGTCTGCGGTTATAATATATGCATTCATGCCTTTGGATGCTGGGGTTATCTGGATACTTCTCTTGCCCTGTTCAAAGAAATGAAGCAAAGGAGCTTAGTTTCGGTGTCTTTTGGTCCAGATTTGTGTACATATAATAGCCTTATTCATGTTCTCTGTTTGGTAGGGAAGGTAAACGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACATACCGTCTTATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATGCTACCGCGATTTTTAATGAAATGGAGTACAATGGATTTGTCCCAGATACCATTGTATATAATTCTCTCCTTGATGGGTTATTTAAGGCTCGGAGAGTTATTGAAGCATGTCAATTTTTTGATAAAATGGTGCAAGAAGGTGTTAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTATTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTTGATGGTGTTACTTACAGCATCATTATACTGCAACTGTGTAAAGAGGGACTGCTTGAGGAAGCACTACAATTGGTTGAAGAAATGGAAGCGAGAGGCTTTGTTATCGATCTTGTTACTGTAACATCTTTGTTGATTGCAATGCACAAGCAAGGGCAGTGGGAAGGGTTAGAGAGGCTCATGAAGCACATTAGAGAAGGTGATTTGGTCCCCAATGTGCTGAAATGGAAGGCCAATATGGAAGATTCAGTGAAGTATCAGAAAAATAAAAGGAAAAACTACTCATCTCTGTTTTCTCCAAAGGAGGATCTGAGTGAGATTATAAGTTCAAGAGCTTCTTCCGTTGCTAAAGTTAATGTTGGTGATATTTCCGAAAACACAGAAGAAAAAGATGATGACAATTGGTCATCATCCCCACATGTAGATCTCTTGGCTAATCTTGCTAAGTCTACAGGTGATTCATTGCAACCGTTCTCTCTTAGTCCAGGGCAACGGGTTGAAGCAAAAGGGGACAACTCATTCGATATCGATATGGTCAATACATTTTTGTCTATTTTTCTAGCAAAGGGAAAATTGAGCTTAGCTTGTAAGTTGTTTGAGATCTTCAGCGATATGGGCGTGAACCCAGTGAGGTACACCTACAATTCAATGTTGAGTGCATTTGTGAAGAAGGGATACTTTCATCAGGCATGGGGTATATTTAACGAAATGGGCGAGAAGGTATGTCCAGCTGATATAGCCACGTATAATTTGATAATTCAAGGACTCGGGAAGATGGGTAGAGCAGATCTTGCAAGTTCGGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATCGTAATGTACAACACGTTGATGAATGCGCTGGGGAAGGCAGGTCGAATGGATGATGTAAATAAGCTTTTTGAGCAAATGAGGAGCAGTGGGATAAACCCAGATGTTGTCAGTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGACGCTTACAAATTTTTGAAGATGATGCTGGATTCGGGCTGTTCCCCGAACCATGTCACGGATACAATTTTGGATTTCCTAGGGAGAGAGATTGAGAAAGCGAGGTATGAAAAAGCTTCAATCATCCGTGACAAGAACAGTTCTTGA

Coding sequence (CDS)

ATGGAATCAAGGGCAACGCCGACTCTGTCTCGATTGGCCGACCTCCTCCTTGTTGCTTCCATCACCAAAACCCTATCGGAATCAGGTACTCGAACCCTTCAACACCAATCACTTTCAATATCGGAGCCTCTCCTCCTCCAAATTCTCCGTAGCAGATCTGTTCATCCTTCGAATAAGCTCGATTTCTTCAAATGGTGCTCTCTCAGCCCGAATTTCAGCCATTCAGCCTCCACATATTCTCAAATCTTCCGTACCCTCTGTCGCTCCGGATACCTCCATGAGGTTCCCCTTGTACTCTCCTCGATGAAGCGAGACGGTGTTGATGTTGATTCTCACACTTTCAAGGTCCTTCTCGATGCGTTTATCAGGTCTGGTAAATTCGATGCTGCTCTTGAGATTTTAGACCATATGGAAGAGTTAGGAACTAGCTTGGAACTTAACACGTACAACTCTGTTCTCGTCGCTCTCGTCAGGAAAAACCAGGTGGGTTTGGCCTTGTCAATTTTCTTTAAGCTCTTTGATGCTTTTAGTACTGGAGGGCAAGAAGGTAGTGCTGTACCTAGTTTTTCCTTCTTGCCTAATGCACTTGCTTGTAATGAATTGTTGGTTGCTCTTAGGAAATCAGACATGAGGGTTGAGTTCAAAAAGGTTTTTGACAAGCTTAGAACAATTAGAAGCTTTGAGTTTAATGTCTGCGGTTATAATATATGCATTCATGCCTTTGGATGCTGGGGTTATCTGGATACTTCTCTTGCCCTGTTCAAAGAAATGAAGCAAAGGAGCTTAGTTTCGGTGTCTTTTGGTCCAGATTTGTGTACATATAATAGCCTTATTCATGTTCTCTGTTTGGTAGGGAAGGTAAACGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACATACCGTCTTATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATGCTACCGCGATTTTTAATGAAATGGAGTACAATGGATTTGTCCCAGATACCATTGTATATAATTCTCTCCTTGATGGGTTATTTAAGGCTCGGAGAGTTATTGAAGCATGTCAATTTTTTGATAAAATGGTGCAAGAAGGTGTTAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTATTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTTGATGGTGTTACTTACAGCATCATTATACTGCAACTGTGTAAAGAGGGACTGCTTGAGGAAGCACTACAATTGGTTGAAGAAATGGAAGCGAGAGGCTTTGTTATCGATCTTGTTACTGTAACATCTTTGTTGATTGCAATGCACAAGCAAGGGCAGTGGGAAGGGTTAGAGAGGCTCATGAAGCACATTAGAGAAGGTGATTTGGTCCCCAATGTGCTGAAATGGAAGGCCAATATGGAAGATTCAGTGAAGTATCAGAAAAATAAAAGGAAAAACTACTCATCTCTGTTTTCTCCAAAGGAGGATCTGAGTGAGATTATAAGTTCAAGAGCTTCTTCCGTTGCTAAAGTTAATGTTGGTGATATTTCCGAAAACACAGAAGAAAAAGATGATGACAATTGGTCATCATCCCCACATGTAGATCTCTTGGCTAATCTTGCTAAGTCTACAGGTGATTCATTGCAACCGTTCTCTCTTAGTCCAGGGCAACGGGTTGAAGCAAAAGGGGACAACTCATTCGATATCGATATGGTCAATACATTTTTGTCTATTTTTCTAGCAAAGGGAAAATTGAGCTTAGCTTGTAAGTTGTTTGAGATCTTCAGCGATATGGGCGTGAACCCAGTGAGGTACACCTACAATTCAATGTTGAGTGCATTTGTGAAGAAGGGATACTTTCATCAGGCATGGGGTATATTTAACGAAATGGGCGAGAAGGTATGTCCAGCTGATATAGCCACGTATAATTTGATAATTCAAGGACTCGGGAAGATGGGTAGAGCAGATCTTGCAAGTTCGGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATCGTAATGTACAACACGTTGATGAATGCGCTGGGGAAGGCAGGTCGAATGGATGATGTAAATAAGCTTTTTGAGCAAATGAGGAGCAGTGGGATAAACCCAGATGTTGTCAGTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGACGCTTACAAATTTTTGAAGATGATGCTGGATTCGGGCTGTTCCCCGAACCATGTCACGGATACAATTTTGGATTTCCTAGGGAGAGAGATTGAGAAAGCGAGGTATGAAAAAGCTTCAATCATCCGTGACAAGAACAGTTCTTGA

Protein sequence

MESRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWCSLSPNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASIIRDKNSS
BLAST of Cp4.1LG12g04710.1 vs. Swiss-Prot
Match: PP299_ARATH (Pentatricopeptide repeat-containing protein At4g01570 OS=Arabidopsis thaliana GN=At4g01570 PE=2 SV=1)

HSP 1 Score: 991.5 bits (2562), Expect = 5.3e-288
Identity = 499/783 (63.73%), Postives = 625/783 (79.82%), Query Frame = 1

Query: 11  RLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWC-SLS 70
           +L ++LLVAS++KTLS+SGTR+L   S+ ISEP++LQILR  S+ PS KLDFF+WC SL 
Sbjct: 26  QLCNVLLVASLSKTLSQSGTRSLDANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85

Query: 71  PNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDA 130
           P + HSA+ YSQIFRT+CR+G L EVP +L SMK DGV++D    K+LLD+ IRSGKF++
Sbjct: 86  PGYKHSATAYSQIFRTVCRTGLLGEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFES 145

Query: 131 ALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSA-VPS 190
           AL +LD+MEELG  L  + Y+SVL+ALV+K+++ LALSI FKL +A      + +  V  
Sbjct: 146 ALGVLDYMEELGDCLNPSVYDSVLIALVKKHELRLALSILFKLLEASDNHSDDDTGRVII 205

Query: 191 FSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLD 250
            S+LP  +A NELLV LR++DMR EFK+VF+KL+ ++ F+F+   YNICIH FGCWG LD
Sbjct: 206 VSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKGMKRFKFDTWSYNICIHGFGCWGDLD 265

Query: 251 TSLALFKEMKQRSLV-SVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAF 310
            +L+LFKEMK+RS V   SFGPD+CTYNSLIHVLCL GK  DALIVW+ELK SGHEPD  
Sbjct: 266 AALSLFKEMKERSSVYGSSFGPDICTYNSLIHVLCLFGKAKDALIVWDELKVSGHEPDNS 325

Query: 311 TYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKM 370
           TYR++IQGCCKSYRMDDA  I+ EM+YNGFVPDTIVYN LLDG  KAR+V EACQ F+KM
Sbjct: 326 TYRILIQGCCKSYRMDDAMRIYGEMQYNGFVPDTIVYNCLLDGTLKARKVTEACQLFEKM 385

Query: 371 VQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLL 430
           VQEGVRAS WTYNILIDGLFRNGRAEA ++LFCDLKKKGQFVD +T+SI+ LQLC+EG L
Sbjct: 386 VQEGVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQFVDAITFSIVGLQLCREGKL 445

Query: 431 EEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKAN 490
           E A++LVEEME RGF +DLVT++SLLI  HKQG+W+  E+LMKHIREG+LVPNVL+W A 
Sbjct: 446 EGAVKLVEEMETRGFSVDLVTISSLLIGFHKQGRWDWKEKLMKHIREGNLVPNVLRWNAG 505

Query: 491 MEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSP 550
           +E S+K  ++K K+Y+ +F  K    +I+S     V   + G  +E     +DD WSSSP
Sbjct: 506 VEASLKRPQSKDKDYTPMFPSKGSFLDIMSM----VGSEDDGASAEEVSPMEDDPWSSSP 565

Query: 551 HVDLLANLAKSTGDSLQP---FSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLAC 610
           ++D LA+         QP   F L+ GQRVEAK D SFD+DM+NTFLSI+L+KG LSLAC
Sbjct: 566 YMDQLAHQRN------QPKPLFGLARGQRVEAKPD-SFDVDMMNTFLSIYLSKGDLSLAC 625

Query: 611 KLFEIFSDMGVNPV-RYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQG 670
           KLFEIF+ MGV  +  YTYNSM+S+FVKKGYF  A G+ ++M E  C ADIATYN+IIQG
Sbjct: 626 KLFEIFNGMGVTDLTSYTYNSMMSSFVKKGYFQTARGVLDQMFENFCAADIATYNVIIQG 685

Query: 671 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPD 730
           LGKMGRADLAS+VL++L +QGGYLDIVMYNTL+NALGKA R+D+  +LF+ M+S+GINPD
Sbjct: 686 LGKMGRADLASAVLDRLTKQGGYLDIVMYNTLINALGKATRLDEATQLFDHMKSNGINPD 745

Query: 731 VVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASII 787
           VVS+NT+IEV+SKAG+ K+AYK+LK MLD+GC PNHVTDTILD+LG+E+EKAR++KAS +
Sbjct: 746 VVSYNTMIEVNSKAGKLKEAYKYLKAMLDAGCLPNHVTDTILDYLGKEMEKARFKKASFV 797

BLAST of Cp4.1LG12g04710.1 vs. Swiss-Prot
Match: PP217_ARATH (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 9.0e-54
Identity = 169/687 (24.60%), Postives = 304/687 (44.25%), Query Frame = 1

Query: 79  YSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAALEILDHME 138
           ++ + R   + G +     +L  MK   +D D   + V +D+F + GK D A +    +E
Sbjct: 206 FTTLIRGFAKEGRVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIE 265

Query: 139 ELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSFLPNALAC 198
             G   +  TY S++  L + N++  A+ +F  L        ++   VP         A 
Sbjct: 266 ANGLKPDEVTYTSMIGVLCKANRLDEAVEMFEHL--------EKNRRVPC------TYAY 325

Query: 199 NELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSLALFKEMK 258
           N +++    +    E   + ++ R   S   +V  YN  +      G +D +L +F+EMK
Sbjct: 326 NTMIMGYGSAGKFDEAYSLLERQRAKGSIP-SVIAYNCILTCLRKMGKVDEALKVFEEMK 385

Query: 259 QRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCCK 318
           + +       P+L TYN LI +LC  GK++ A  + + ++ +G  P+  T  +++   CK
Sbjct: 386 KDA------APNLSTYNILIDMLCRAGKLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCK 445

Query: 319 SYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPWT 378
           S ++D+A A+F EM+Y    PD I + SL+DGL K  RV +A + ++KM+    R +   
Sbjct: 446 SQKLDEACAMFEEMDYKVCTPDEITFCSLIDGLGKVGRVDDAYKVYEKMLDSDCRTNSIV 505

Query: 379 YNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEME 438
           Y  LI   F +GR E  + ++ D+  +    D    +  +  + K G  E+   + EE++
Sbjct: 506 YTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIK 565

Query: 439 ARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSVKYQKNK 498
           AR FV D  + + L+  + K G       L   ++E   V +   +   ++   K  K  
Sbjct: 566 ARRFVPDARSYSILIHGLIKAGFANETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGK-V 625

Query: 499 RKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSPHVDLLANLAKS 558
            K Y         L E + ++      V  G + +   + D           +L   AKS
Sbjct: 626 NKAY--------QLLEEMKTKGFEPTVVTYGSVIDGLAKID-----RLDEAYMLFEEAKS 685

Query: 559 TGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV 618
               L                    + + ++ +  F   G++  A  + E     G+ P 
Sbjct: 686 KRIELN-------------------VVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPN 745

Query: 619 RYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRADLASSVLE 678
            YT+NS+L A VK    ++A   F  M E  C  +  TY ++I GL K+ + + A    +
Sbjct: 746 LYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQ 805

Query: 679 KLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTLIEVHSKAG 738
           ++ +QG     + Y T+++ L KAG + +   LF++ +++G  PD   +N +IE  S   
Sbjct: 806 EMQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGN 838

Query: 739 RFKDAYKFLKMMLDSGCSPNHVTDTIL 766
           R  DA+   +     G   ++ T  +L
Sbjct: 866 RAMDAFSLFEETRRRGLPIHNKTCVVL 838

BLAST of Cp4.1LG12g04710.1 vs. Swiss-Prot
Match: PP344_ARATH (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana GN=PGR3 PE=1 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 3.6e-50
Identity = 181/703 (25.75%), Postives = 316/703 (44.95%), Query Frame = 1

Query: 78  TYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAALEILDHM 137
           TY+ +   LC +  L     V   MK      D  T+  LLD F  +   D+  +    M
Sbjct: 295 TYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEM 354

Query: 138 EELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSFLPNALA 197
           E+ G   ++ T+  ++ AL +    G A       FD       +G        LPN   
Sbjct: 355 EKDGHVPDVVTFTILVDALCKAGNFGEA-------FDTLDVMRDQG-------ILPNLHT 414

Query: 198 CNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSLALFKEM 257
            N L+  L +     +  ++F  + ++   +     Y + I  +G  G   ++L  F++M
Sbjct: 415 YNTLICGLLRVHRLDDALELFGNMESL-GVKPTAYTYIVFIDYYGKSGDSVSALETFEKM 474

Query: 258 KQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCC 317
           K + +      P++   N+ ++ L   G+  +A  ++  LK  G  PD+ TY ++++   
Sbjct: 475 KTKGIA-----PNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYS 534

Query: 318 KSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPW 377
           K   +D+A  + +EM  NG  PD IV NSL++ L+KA RV EA + F +M +  ++ +  
Sbjct: 535 KVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVV 594

Query: 378 TYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEM 437
           TYN L+ GL +NG+ + +  LF  + +KG   + +T++ +   LCK   +  AL+++ +M
Sbjct: 595 TYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKM 654

Query: 438 EARGFVIDLVTVTSLLIAMHKQGQ-------WEGLERLM--KHIREGDLVPNVLKWKANM 497
              G V D+ T  +++  + K GQ       +  +++L+    +    L+P V+K  + +
Sbjct: 655 MDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQMKKLVYPDFVTLCTLLPGVVK-ASLI 714

Query: 498 EDSVKYQKNKRKNY----SSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWS 557
           ED+ K   N   N     ++LF       ++I S  +     N    SE           
Sbjct: 715 EDAYKIITNFLYNCADQPANLF-----WEDLIGSILAEAGIDNAVSFSER---------- 774

Query: 558 SSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLAC 617
                 L+AN     GDS+    L P  R   K +N                   +S A 
Sbjct: 775 ------LVANGICRDGDSI----LVPIIRYSCKHNN-------------------VSGAR 834

Query: 618 KLFEIFS-DMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQG 677
            LFE F+ D+GV P   TYN ++   ++      A  +F ++    C  D+ATYN ++  
Sbjct: 835 TLFEKFTKDLGVQPKLPTYNLLIGGLLEADMIEIAQDVFLQVKSTGCIPDVATYNFLLDA 894

Query: 678 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKL-FEQMRSSGINP 737
            GK G+ D    + +++       + + +N +++ L KAG +DD   L ++ M     +P
Sbjct: 895 YGKSGKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDDALDLYYDLMSDRDFSP 932

Query: 738 DVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTIL 766
              ++  LI+  SK+GR  +A +  + MLD GC PN     IL
Sbjct: 955 TACTYGPLIDGLSKSGRLYEAKQLFEGMLDYGCRPNCAIYNIL 932

BLAST of Cp4.1LG12g04710.1 vs. Swiss-Prot
Match: PP247_ARATH (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana GN=At3g22470 PE=2 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 3.3e-48
Identity = 130/498 (26.10%), Postives = 235/498 (47.19%), Query Frame = 1

Query: 270 DLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCCKSYRMDDATAIF 329
           D+ T   +I+  C   K+  A  V       G+EPD  T+  ++ G C   R+ +A A+ 
Sbjct: 104 DMYTMTIMINCYCRKKKLLFAFSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALV 163

Query: 330 NEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPWTYNILIDGLFRN 389
           + M      PD +  ++L++GL    RV EA    D+MV+ G +    TY  +++ L ++
Sbjct: 164 DRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKS 223

Query: 390 GRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEMEARGFVIDLVTV 449
           G +  +  LF  ++++      V YSI+I  LCK+G  ++AL L  EME +G   D+VT 
Sbjct: 224 GNSALALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTY 283

Query: 450 TSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSVKYQKNKRKNYSSLFSPK 509
           +SL+  +   G+W+   ++++ +   +++P+V+ + A ++  VK  K        L   K
Sbjct: 284 SSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGK--------LLEAK 343

Query: 510 EDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSPHVDLLANLAKST--GDSLQPFS 569
           E  +E+I+   +                 D   ++S     L+    K     ++ Q F 
Sbjct: 344 ELYNEMITRGIA----------------PDTITYNS-----LIDGFCKENCLHEANQMFD 403

Query: 570 LSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLS 629
           L   +  E       DI   +  ++ +    ++    +LF   S  G+ P   TYN+++ 
Sbjct: 404 LMVSKGCEP------DIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVL 463

Query: 630 AFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRADLASSVLEKLMEQGGYL 689
            F + G  + A  +F EM  +  P  + TY +++ GL   G  + A  + EK+ +    L
Sbjct: 464 GFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTL 523

Query: 690 DIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTLIEVHSKAGRFKDAYKFL 749
            I +YN +++ +  A ++DD   LF  +   G+ PDVV++N +I    K G   +A    
Sbjct: 524 GIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEADMLF 566

Query: 750 KMMLDSGCSPNHVTDTIL 766
           + M + GC+P+  T  IL
Sbjct: 584 RKMKEDGCTPDDFTYNIL 566

BLAST of Cp4.1LG12g04710.1 vs. Swiss-Prot
Match: PP156_ARATH (Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana GN=At2g16880 PE=2 SV=1)

HSP 1 Score: 190.3 bits (482), Expect = 8.2e-47
Identity = 181/755 (23.97%), Postives = 324/755 (42.91%), Query Frame = 1

Query: 17  LVASITKTLSESGTR---TLQHQSLSISEPLLLQILRSRSV--HPSNKLDFFKWCSLSPN 76
           L+ ++T  L+   T    TL      I++PLL  +L S S+   P   + FF+W      
Sbjct: 11  LLKTLTSILTSEKTHFLETLNPYIPQITQPLLTSLLSSPSLAKKPETLVSFFQWA----- 70

Query: 77  FSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTF---KVLLDAFIRSGKFD 136
                       +T     +  + PL L S+ R  +    H F   K LL ++IR+   D
Sbjct: 71  ------------QTSIPEAFPSDSPLPLISVVRSLLS--HHKFADAKSLLVSYIRTS--D 130

Query: 137 AALEILDHMEELGTSLELNT------YNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQE 196
           A+L + + +  L  +L L+       ++  L A + + +  +AL IF K+          
Sbjct: 131 ASLSLCNSL--LHPNLHLSPPPSKALFDIALSAYLHEGKPHVALQIFQKMI--------- 190

Query: 197 GSAVPSFSFLPNALACNELLVALRKSDMRVEF---KKVFDKLRTIRSFEFNVCGYNICIH 256
                     PN L CN LL+ L +          ++VFD +  I     NV  +N+ ++
Sbjct: 191 -----RLKLKPNLLTCNTLLIGLVRYPSSFSISSAREVFDDMVKI-GVSLNVQTFNVLVN 250

Query: 257 AFGCWGYLDTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKG 316
            +   G L+ +L + + M     V+    PD  TYN+++  +   G+++D   +  ++K 
Sbjct: 251 GYCLEGKLEDALGMLERMVSEFKVN----PDNVTYNTILKAMSKKGRLSDLKELLLDMKK 310

Query: 317 SGHEPDAFTYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIE 376
           +G  P+  TY  ++ G CK   + +A  I   M+    +PD   YN L++GL  A  + E
Sbjct: 311 NGLVPNRVTYNNLVYGYCKLGSLKEAFQIVELMKQTNVLPDLCTYNILINGLCNAGSMRE 370

Query: 377 ACQFFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIIL 436
             +  D M    ++    TYN LIDG F  G +  +  L   ++  G   + VT++I + 
Sbjct: 371 GLELMDAMKSLKLQPDVVTYNTLIDGCFELGLSLEARKLMEQMENDGVKANQVTHNISLK 430

Query: 437 QLCKEGLLEEALQLVEEM-EARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLV 496
            LCKE   E   + V+E+ +  GF  D+VT  +L+ A  K G   G   +M+ + +  + 
Sbjct: 431 WLCKEEKREAVTRKVKELVDMHGFSPDIVTYHTLIKAYLKVGDLSGALEMMREMGQKGIK 490

Query: 497 PNVLKWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDI-----SE 556
            N +     ++   K +K              +L      R   V +V  G +      E
Sbjct: 491 MNTITLNTILDALCKERK---------LDEAHNLLNSAHKRGFIVDEVTYGTLIMGFFRE 550

Query: 557 NTEEKDDDNWSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSI 616
              EK  + W                 D ++   ++P             +   N+ +  
Sbjct: 551 EKVEKALEMW-----------------DEMKKVKITP------------TVSTFNSLIGG 610

Query: 617 FLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPAD 676
               GK  LA + F+  ++ G+ P   T+NS++  + K+G   +A+  +NE  +     D
Sbjct: 611 LCHHGKTELAMEKFDELAESGLLPDDSTFNSIILGYCKEGRVEKAFEFYNESIKHSFKPD 670

Query: 677 IATYNLIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFE 736
             T N+++ GL K G  + A +    L+E+   +D V YNT+++A  K  ++ +   L  
Sbjct: 671 NYTCNILLNGLCKEGMTEKALNFFNTLIEE-REVDTVTYNTMISAFCKDKKLKEAYDLLS 684

Query: 737 QMRSSGINPDVVSFNTLIEVHSKAGRFKDAYKFLK 749
           +M   G+ PD  ++N+ I +  + G+  +  + LK
Sbjct: 731 EMEEKGLEPDRFTYNSFISLLMEDGKLSETDELLK 684

BLAST of Cp4.1LG12g04710.1 vs. TrEMBL
Match: A0A0A0KFG9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G101450 PE=4 SV=1)

HSP 1 Score: 1360.5 bits (3520), Expect = 0.0e+00
Identity = 675/787 (85.77%), Postives = 728/787 (92.50%), Query Frame = 1

Query: 3   SRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDF 62
           SR   TLS L+ LLL+ASITKTLSESGTRTLQH SL IS PLLLQIL SRS++PS+KLDF
Sbjct: 17  SRTASTLSHLSHLLLLASITKTLSESGTRTLQHHSLPISHPLLLQILHSRSLNPSHKLDF 76

Query: 63  FKWCSLSPNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFI 122
           FKWCSL+PNF+HS STYSQIF  LCRSGYLHEVP +L SMKRDGV VDSHTFKVLLDAFI
Sbjct: 77  FKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDGVSVDSHTFKVLLDAFI 136

Query: 123 RSGKFDAALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQE 182
           RSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKL D F+ GGQ 
Sbjct: 137 RSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLLDGFNNGGQV 196

Query: 183 GSAVPSFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFG 242
            SA  +F FLPN+LACNELLVALRK DMRVEFKKVFDKLR I SFEF+V GYNICI+AFG
Sbjct: 197 DSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAIESFEFSVYGYNICIYAFG 256

Query: 243 CWGYLDTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGH 302
           CWGYLDT+L+LFKEMK++SLVS SF PDLCTYNS+IHVLCLVGKV DALIVWEELKGSGH
Sbjct: 257 CWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSIIHVLCLVGKVKDALIVWEELKGSGH 316

Query: 303 EPDAFTYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQ 362
           EPDAFTYR+IIQGCCKS RMDDAT IFNEMEYNG +PDTIVYNSLL+GLFKAR+V EACQ
Sbjct: 317 EPDAFTYRIIIQGCCKSCRMDDATMIFNEMEYNGLIPDTIVYNSLLNGLFKARKVTEACQ 376

Query: 363 FFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLC 422
            FDKMVQE VRASPWTYNILIDGLFRNGRAEA Y+LFCDLKKKGQ VD VTYSIIILQLC
Sbjct: 377 LFDKMVQEDVRASPWTYNILIDGLFRNGRAEAGYTLFCDLKKKGQIVDAVTYSIIILQLC 436

Query: 423 KEGLLEEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVL 482
           KE LLEEALQLVEEMEARGFV+DL+T+TSLLIAMHKQGQW+GLERLMKHIREGDLVPNVL
Sbjct: 437 KERLLEEALQLVEEMEARGFVVDLITITSLLIAMHKQGQWDGLERLMKHIREGDLVPNVL 496

Query: 483 KWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDN 542
           KWK NME S+KYQKNKRK++SSLFSPKEDLSE+ISSRASS AKVN+ +  ENTEE+D D+
Sbjct: 497 KWKINMEYSIKYQKNKRKDFSSLFSPKEDLSEVISSRASSAAKVNIDNSFENTEERDMDS 556

Query: 543 WSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSL 602
           WSSSP+V+ LANLA ST D LQPFS+  G+R++ K DNSFDI+MVNTFLSIFLAKGKL+L
Sbjct: 557 WSSSPYVNRLANLANSTSDILQPFSIRQGRRIQEKQDNSFDINMVNTFLSIFLAKGKLNL 616

Query: 603 ACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQ 662
           ACKLFEIFSDMGVNPV+YTYNSMLS+FVKKGYFHQAWGIFNEMGE VCPADIATYN+IIQ
Sbjct: 617 ACKLFEIFSDMGVNPVKYTYNSMLSSFVKKGYFHQAWGIFNEMGENVCPADIATYNVIIQ 676

Query: 663 GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINP 722
           GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTL+NALGKAGRMDDVNKLF QMR+SGINP
Sbjct: 677 GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFGQMRNSGINP 736

Query: 723 DVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASI 782
           DVV+FNTLIEVHSKAGR KDAYKFLKMMLDSGCSPNHVTDT LDFLGRE+EKARYEKASI
Sbjct: 737 DVVTFNTLIEVHSKAGRLKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREMEKARYEKASI 796

Query: 783 IRDKNSS 790
           IRDKNSS
Sbjct: 797 IRDKNSS 803

BLAST of Cp4.1LG12g04710.1 vs. TrEMBL
Match: A0A061DRT6_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_005001 PE=4 SV=1)

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 561/784 (71.56%), Postives = 658/784 (83.93%), Query Frame = 1

Query: 12  LADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWC-SLSP 71
           L ++LL+AS+TKTLSESGTR L   S+ ISEPL++QILR  S+ PS KLDFF WC S+ P
Sbjct: 23  LGNILLIASLTKTLSESGTRNLDPNSIPISEPLVIQILRKHSLEPSKKLDFFNWCRSVKP 82

Query: 72  NFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAA 131
           NF HSA TYS IFRTLCRSG++ EVP +L +MK DGV VDS TFK LLDAFIRSGKFD+A
Sbjct: 83  NFKHSAVTYSHIFRTLCRSGFVEEVPNLLFAMKEDGVLVDSDTFKFLLDAFIRSGKFDSA 142

Query: 132 LEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFS 191
           LEILD MEELG  L L  Y+SVLVAL+RK+QVGLALS+FFKL +A + G  +G++V S  
Sbjct: 143 LEILDFMEELGAGLNLRVYDSVLVALIRKDQVGLALSLFFKLLEACN-GNDDGNSVDSS- 202

Query: 192 FLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTS 251
            LP ++A NELLVALRK+ MR EFK+VFD LR  R FEF+ CGYNICIH+FGCWG L  S
Sbjct: 203 -LPGSIAINELLVALRKAHMRREFKQVFDILREKREFEFDTCGYNICIHSFGCWGDLGAS 262

Query: 252 LALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYR 311
           L LFKEMK++     SFGPDLCTYNSLI VLCLVGKV DAL+VWEELK SGHEPDAFTYR
Sbjct: 263 LKLFKEMKEKEKSFGSFGPDLCTYNSLIDVLCLVGKVKDALVVWEELKVSGHEPDAFTYR 322

Query: 312 LIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQE 371
           ++IQGC KSYRMDDAT IF+EM+YNGF  DT+VYNSLL+GLFKAR+V+EACQFF+KMVQ+
Sbjct: 323 ILIQGCSKSYRMDDATKIFSEMQYNGFAMDTVVYNSLLNGLFKARKVMEACQFFEKMVQD 382

Query: 372 GVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEA 431
           GVRAS WTYNILIDGLFRNGRAEA+Y+LFCDLKKKGQFVDG+TYSI++LQLC+EG LE A
Sbjct: 383 GVRASCWTYNILIDGLFRNGRAEAAYTLFCDLKKKGQFVDGITYSIVVLQLCREGQLEGA 442

Query: 432 LQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMED 491
           L+LVEEMEARGF++DLVT+TSLLI  HKQG+W+  ERLMKHIR+G+LVPNVLKWKANME 
Sbjct: 443 LRLVEEMEARGFIVDLVTITSLLIGFHKQGRWDWTERLMKHIRDGNLVPNVLKWKANMEA 502

Query: 492 SVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDD-------DNW 551
           S+K     RK+Y+ LF  K D  EI++   S    +     SE+ +EKD        D W
Sbjct: 503 SMKNPPKNRKDYTPLFPSKGDFREIMNLLGSVGQAMGTNLDSEDCDEKDQEKPSIDTDQW 562

Query: 552 SSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLA 611
           SSSP++D LAN  KST  S Q FSL  GQRV+ KG  SFD+DMVNTFLSIFLAKGKLSLA
Sbjct: 563 SSSPYMDQLANQGKSTERSSQLFSLIRGQRVQEKGIGSFDVDMVNTFLSIFLAKGKLSLA 622

Query: 612 CKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQG 671
           CKLFE+F+DMGV+PV YTYNS++S+FVKKGYF++AWG+ NEM EKVCPADIATYNLIIQG
Sbjct: 623 CKLFEVFTDMGVDPVSYTYNSIMSSFVKKGYFNEAWGVLNEMDEKVCPADIATYNLIIQG 682

Query: 672 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPD 731
           LGKMGRAD+ASSVL+KLM+QGGYLD+VMYNTL+NALGKAGR+D+ +KLFEQMR+SGINPD
Sbjct: 683 LGKMGRADIASSVLDKLMKQGGYLDVVMYNTLVNALGKAGRVDEASKLFEQMRTSGINPD 742

Query: 732 VVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASII 788
           V+++NTLIEVH+KAG+ +DAYKFLKMMLD+GCSPNHVTDTILD LG+EIEK R +KAS++
Sbjct: 743 VITYNTLIEVHTKAGQLQDAYKFLKMMLDAGCSPNHVTDTILDNLGKEIEKMRLQKASMV 802

BLAST of Cp4.1LG12g04710.1 vs. TrEMBL
Match: B9GRT7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s18390g PE=4 SV=2)

HSP 1 Score: 1085.5 bits (2806), Expect = 0.0e+00
Identity = 542/779 (69.58%), Postives = 648/779 (83.18%), Query Frame = 1

Query: 12  LADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWCSLSPN 71
           + ++LLVA +TKTLSESGTR+L   S+ +SE L+LQILR  S+  S K++FFKWCS+   
Sbjct: 1   MGNILLVAYLTKTLSESGTRSLDPDSIPLSESLVLQILRRNSLDSSKKMEFFKWCSVRHI 60

Query: 72  FSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAAL 131
           + HS STYSQ+F TLCRSGYL EVP +L+SMK DGV V S TFK+LLDAFIRSGKFD+AL
Sbjct: 61  YKHSVSTYSQMFSTLCRSGYLDEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKFDSAL 120

Query: 132 EILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSF 191
           +ILDHMEELG++   + Y+S++VAL +KNQVGLALSI FKL +A S G +E +   S   
Sbjct: 121 DILDHMEELGSNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEA-SDGNEENAVGVS--- 180

Query: 192 LPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSL 251
           LP ++ACN LLVALR  +M+VEFK VF KLR    FE N  GYNICIHAFGCWG L TSL
Sbjct: 181 LPGSVACNALLVALRNGEMKVEFKTVFAKLRGKGGFELNTWGYNICIHAFGCWGDLTTSL 240

Query: 252 ALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRL 311
            LFKEMK++SL S S  PDLCTYNSLIHVLCL GKV DA+IV+EELK SGHEPDAFTYR+
Sbjct: 241 RLFKEMKEKSLASGSLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELKVSGHEPDAFTYRI 300

Query: 312 IIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEG 371
           +IQGCCKSY+M+DAT IF+EM+YNGF+PDT+VYNSLLDG+FKAR+V+EACQ F+KMVQ+G
Sbjct: 301 LIQGCCKSYQMEDATKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLFEKMVQDG 360

Query: 372 VRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEAL 431
           VRAS WTYNILIDGL +NGRAEA Y+LFC LKKKGQFVD VTYSI++L LC++G LEEAL
Sbjct: 361 VRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQFVDAVTYSIVVLLLCRKGHLEEAL 420

Query: 432 QLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDS 491
            LVEEME RGFV+DL+T+TSLLIA HKQG+W+  ERLMKHIR+ +L+PNVLKW+A+ME S
Sbjct: 421 HLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHIRDVNLLPNVLKWRADMEAS 480

Query: 492 VKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDI-SENTEEKDDDNWSSSPHVD 551
           +K     R++Y+ +F     L EI+SS +S  ++ + G    E +   D D WSSSP++D
Sbjct: 481 LKNPPRSREDYTPMFPSTGGLQEIMSSISSPKSRSDDGATEDEKSSSADTDQWSSSPYMD 540

Query: 552 LLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIF 611
            LAN AKST  S Q FSL+ GQRV+AKG  SFDIDMVNTFLSIFLAKGKLSLACKLFEIF
Sbjct: 541 HLANQAKSTDLSSQLFSLARGQRVQAKGAGSFDIDMVNTFLSIFLAKGKLSLACKLFEIF 600

Query: 612 SDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRA 671
           +DMGV+PV YTYNS++S+FVKKGYF++AW +FNEMGEKVCP DIATYNL+IQGLGKMGRA
Sbjct: 601 TDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEKVCPPDIATYNLVIQGLGKMGRA 660

Query: 672 DLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTL 731
           DLASSVL+KLM+QGGYLDIVMYNTL++ALGKAGR+D+ N LFEQM+ SG+NPDVV++N +
Sbjct: 661 DLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGLNPDVVTYNIM 720

Query: 732 IEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASIIRDKNSS 790
           IEVHSK GR KDAYKFLKMMLD+GC PNHVTDT LDFL +EIEK RY+KASI+R K+ S
Sbjct: 721 IEVHSKTGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKLRYQKASIMRQKDDS 775

BLAST of Cp4.1LG12g04710.1 vs. TrEMBL
Match: F6HG95_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g03410 PE=4 SV=1)

HSP 1 Score: 1085.1 bits (2805), Expect = 0.0e+00
Identity = 541/777 (69.63%), Postives = 651/777 (83.78%), Query Frame = 1

Query: 11  RLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWCSLSP 70
           +L D+LLVASI+KTLSE GTR+   +S+ ISE L++QIL   S+    K++FF+WCS   
Sbjct: 18  KLGDMLLVASISKTLSERGTRSPDLESIPISESLVVQILGRNSIDVFRKVEFFRWCSFRH 77

Query: 71  NFSHSASTYSQIFRTLCRSG--YLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFD 130
           N+ HS   YS IFR +CR+G  +L +VPL++SSMK DGV V   TFK+LLD+ IR+GKFD
Sbjct: 78  NYKHSVGAYSHIFRIVCRAGAEFLDQVPLLMSSMKDDGVVVGQETFKLLLDSLIRAGKFD 137

Query: 131 AALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSA-VP 190
           +ALEILDH+EELGT L    Y+SVLVAL+RKNQ+GLAL +FFKL      GG EG   VP
Sbjct: 138 SALEILDHIEELGTGLNSYVYDSVLVALIRKNQLGLALPLFFKLL-----GGDEGQGGVP 197

Query: 191 SFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYL 250
               +P + ACN+LLVALRK+DM++EF+ VF+KLR  + F+ +  GYNICIHAFGCWG L
Sbjct: 198 ----VPESNACNQLLVALRKADMKIEFRNVFEKLRAKKDFDLDTQGYNICIHAFGCWGDL 257

Query: 251 DTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAF 310
            T+L LFKEMK +SL S SFGPDLCTYNSLI VLCLVGKV DALIVWEELKGSGHEPDAF
Sbjct: 258 GTALNLFKEMKDKSLNSSSFGPDLCTYNSLIRVLCLVGKVKDALIVWEELKGSGHEPDAF 317

Query: 311 TYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKM 370
           TYR++IQGC KSYRMDDA  IFNEM+YNGF PDTIVYN+LLDGLFKAR+V+EACQ F+KM
Sbjct: 318 TYRILIQGCSKSYRMDDAMRIFNEMQYNGFCPDTIVYNTLLDGLFKARKVMEACQVFEKM 377

Query: 371 VQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLL 430
           V++GVRAS WT+NI+I GLFRNGRA A Y+LFCDLKKKG+FVDG+TYSI++LQLC+EG L
Sbjct: 378 VEDGVRASCWTHNIVICGLFRNGRAAAGYTLFCDLKKKGKFVDGITYSIVVLQLCREGQL 437

Query: 431 EEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKAN 490
           EEALQLVEEMEARGFV+DLVT+TSLLI  HKQG+W+  ERLMKHIR+G+LVPNVL WKAN
Sbjct: 438 EEALQLVEEMEARGFVVDLVTITSLLIGFHKQGRWDWTERLMKHIRDGNLVPNVLNWKAN 497

Query: 491 MEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSP 550
           ME  +K  +++RK+Y+ +F  + +LSEI+S  +S+  +++    SE    + +D WSSSP
Sbjct: 498 MEAYMKAPQSRRKDYTPMFPSEGNLSEIMSLISSADTEMDGSPGSEEDVAQHEDQWSSSP 557

Query: 551 HVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLF 610
           ++D LA+  KS   S Q  SLS GQRV+AKG +SFDIDMVNT+LSIFLAKGKLSLACKLF
Sbjct: 558 YMDQLASQLKSIDVSSQLLSLSRGQRVQAKGIDSFDIDMVNTYLSIFLAKGKLSLACKLF 617

Query: 611 EIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKM 670
           EIFS+MGV+PV YTYNSM++AFVKKGYF++AWG+F+EMGEKVCP DIATYN+IIQGLGKM
Sbjct: 618 EIFSNMGVDPVIYTYNSMMTAFVKKGYFNEAWGVFHEMGEKVCPPDIATYNVIIQGLGKM 677

Query: 671 GRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSF 730
           GRADLAS+VL+ LM+QGGYLDIVMYNTL+NALGKAGR+D+  KLFEQMRSSGINPDVV+F
Sbjct: 678 GRADLASAVLDMLMKQGGYLDIVMYNTLINALGKAGRIDEATKLFEQMRSSGINPDVVTF 737

Query: 731 NTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASIIR 785
           NTLIE+H+KAG+ K AYKFLK+MLD+GCSPNHVTDT LDFLG+EIEK RY+KASIIR
Sbjct: 738 NTLIEIHAKAGQLKAAYKFLKLMLDAGCSPNHVTDTTLDFLGKEIEKLRYKKASIIR 785

BLAST of Cp4.1LG12g04710.1 vs. TrEMBL
Match: U5GLR6_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0002s18370g PE=4 SV=1)

HSP 1 Score: 1083.2 bits (2800), Expect = 0.0e+00
Identity = 541/779 (69.45%), Postives = 648/779 (83.18%), Query Frame = 1

Query: 12  LADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWCSLSPN 71
           + ++LLVA +TKTLSESGTR+L   S+ +SE L+LQILR  S+  S K++FFKWCS+   
Sbjct: 1   MGNILLVAYLTKTLSESGTRSLDPDSIPLSEYLVLQILRRNSLDSSKKMEFFKWCSVRHI 60

Query: 72  FSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAAL 131
           + HS STYSQ+F TLCRSGYL EVP +L+SMK DGV V S TFK+LLDAFIRSGKFD+AL
Sbjct: 61  YKHSVSTYSQMFSTLCRSGYLEEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKFDSAL 120

Query: 132 EILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSF 191
           +ILDHMEELG++   + Y+S++VAL +KNQVGLALSI FKL +A S G +E +   S   
Sbjct: 121 DILDHMEELGSNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEA-SDGNEENAVRVS--- 180

Query: 192 LPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSL 251
           LP ++ACN LLVALR  +M+VEFK VF KLR    F+ N  GYNICIHAFGCWG L TSL
Sbjct: 181 LPGSVACNALLVALRNGEMKVEFKTVFAKLRGKVGFKLNTWGYNICIHAFGCWGDLTTSL 240

Query: 252 ALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRL 311
            LFKEMK++SL S S  PDLCTYNSLIHVLCL GKV DA+IV+EELK SGHEPDAFTYR+
Sbjct: 241 RLFKEMKEKSLASGSLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELKVSGHEPDAFTYRI 300

Query: 312 IIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEG 371
           +IQGCCKSY+M+DAT IF+EM+YNGF+PDT+VYNSLLDG+FKAR+V+EACQ F+KMVQ+G
Sbjct: 301 LIQGCCKSYQMEDATKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLFEKMVQDG 360

Query: 372 VRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEAL 431
           VRAS WTYNILIDGL +NGRAEA Y+LFC LKKKGQFVD VTYSI++L LC++G LEEAL
Sbjct: 361 VRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQFVDAVTYSIVVLLLCRKGHLEEAL 420

Query: 432 QLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDS 491
            LVEEME RGFV+DL+T+TSLLIA HKQG+W+  ERLMKHIR+ +L+PNVLKW+A+ME S
Sbjct: 421 HLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHIRDVNLLPNVLKWRADMEAS 480

Query: 492 VKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDI-SENTEEKDDDNWSSSPHVD 551
           +K     R++Y+ +F     L EI+SS +S  ++ + G    E +   D D WSSSP++D
Sbjct: 481 LKNPPRSREDYTPMFPSTGGLQEIMSSISSPKSRSDDGATEDEKSSSADTDQWSSSPYMD 540

Query: 552 LLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIF 611
            LAN AKST  S Q FSL+ GQRV+AKG  SFDIDMVNTFLSIFLAKGKLSLACKLFEIF
Sbjct: 541 HLANQAKSTDLSSQLFSLARGQRVQAKGAGSFDIDMVNTFLSIFLAKGKLSLACKLFEIF 600

Query: 612 SDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRA 671
           +DMGV+PV YTYNS++S+FVKKGYF++AW +FNEMGEKVCP DIATYNL+IQGLGKMGRA
Sbjct: 601 TDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEKVCPPDIATYNLVIQGLGKMGRA 660

Query: 672 DLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTL 731
           DLASSVL+KLM+QGGYLDIVMYNTL++ALGKAGR+D+ N LFEQM+ SG+NPDVV++N +
Sbjct: 661 DLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGLNPDVVTYNIM 720

Query: 732 IEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASIIRDKNSS 790
           IEVHSK GR KDAYKFLKMMLD+GC PNHVTDT LDFL +EIEK RY+KASI+R K+ S
Sbjct: 721 IEVHSKTGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKLRYQKASIMRQKDDS 775

BLAST of Cp4.1LG12g04710.1 vs. TAIR10
Match: AT4G01570.1 (AT4G01570.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 991.5 bits (2562), Expect = 3.0e-289
Identity = 499/783 (63.73%), Postives = 625/783 (79.82%), Query Frame = 1

Query: 11  RLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWC-SLS 70
           +L ++LLVAS++KTLS+SGTR+L   S+ ISEP++LQILR  S+ PS KLDFF+WC SL 
Sbjct: 26  QLCNVLLVASLSKTLSQSGTRSLDANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85

Query: 71  PNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDA 130
           P + HSA+ YSQIFRT+CR+G L EVP +L SMK DGV++D    K+LLD+ IRSGKF++
Sbjct: 86  PGYKHSATAYSQIFRTVCRTGLLGEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFES 145

Query: 131 ALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSA-VPS 190
           AL +LD+MEELG  L  + Y+SVL+ALV+K+++ LALSI FKL +A      + +  V  
Sbjct: 146 ALGVLDYMEELGDCLNPSVYDSVLIALVKKHELRLALSILFKLLEASDNHSDDDTGRVII 205

Query: 191 FSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLD 250
            S+LP  +A NELLV LR++DMR EFK+VF+KL+ ++ F+F+   YNICIH FGCWG LD
Sbjct: 206 VSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKGMKRFKFDTWSYNICIHGFGCWGDLD 265

Query: 251 TSLALFKEMKQRSLV-SVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAF 310
            +L+LFKEMK+RS V   SFGPD+CTYNSLIHVLCL GK  DALIVW+ELK SGHEPD  
Sbjct: 266 AALSLFKEMKERSSVYGSSFGPDICTYNSLIHVLCLFGKAKDALIVWDELKVSGHEPDNS 325

Query: 311 TYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKM 370
           TYR++IQGCCKSYRMDDA  I+ EM+YNGFVPDTIVYN LLDG  KAR+V EACQ F+KM
Sbjct: 326 TYRILIQGCCKSYRMDDAMRIYGEMQYNGFVPDTIVYNCLLDGTLKARKVTEACQLFEKM 385

Query: 371 VQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLL 430
           VQEGVRAS WTYNILIDGLFRNGRAEA ++LFCDLKKKGQFVD +T+SI+ LQLC+EG L
Sbjct: 386 VQEGVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQFVDAITFSIVGLQLCREGKL 445

Query: 431 EEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKAN 490
           E A++LVEEME RGF +DLVT++SLLI  HKQG+W+  E+LMKHIREG+LVPNVL+W A 
Sbjct: 446 EGAVKLVEEMETRGFSVDLVTISSLLIGFHKQGRWDWKEKLMKHIREGNLVPNVLRWNAG 505

Query: 491 MEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSP 550
           +E S+K  ++K K+Y+ +F  K    +I+S     V   + G  +E     +DD WSSSP
Sbjct: 506 VEASLKRPQSKDKDYTPMFPSKGSFLDIMSM----VGSEDDGASAEEVSPMEDDPWSSSP 565

Query: 551 HVDLLANLAKSTGDSLQP---FSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLAC 610
           ++D LA+         QP   F L+ GQRVEAK D SFD+DM+NTFLSI+L+KG LSLAC
Sbjct: 566 YMDQLAHQRN------QPKPLFGLARGQRVEAKPD-SFDVDMMNTFLSIYLSKGDLSLAC 625

Query: 611 KLFEIFSDMGVNPV-RYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQG 670
           KLFEIF+ MGV  +  YTYNSM+S+FVKKGYF  A G+ ++M E  C ADIATYN+IIQG
Sbjct: 626 KLFEIFNGMGVTDLTSYTYNSMMSSFVKKGYFQTARGVLDQMFENFCAADIATYNVIIQG 685

Query: 671 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPD 730
           LGKMGRADLAS+VL++L +QGGYLDIVMYNTL+NALGKA R+D+  +LF+ M+S+GINPD
Sbjct: 686 LGKMGRADLASAVLDRLTKQGGYLDIVMYNTLINALGKATRLDEATQLFDHMKSNGINPD 745

Query: 731 VVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASII 787
           VVS+NT+IEV+SKAG+ K+AYK+LK MLD+GC PNHVTDTILD+LG+E+EKAR++KAS +
Sbjct: 746 VVSYNTMIEVNSKAGKLKEAYKYLKAMLDAGCLPNHVTDTILDYLGKEMEKARFKKASFV 797

BLAST of Cp4.1LG12g04710.1 vs. TAIR10
Match: AT3G06920.1 (AT3G06920.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 213.4 bits (542), Expect = 5.1e-55
Identity = 169/687 (24.60%), Postives = 304/687 (44.25%), Query Frame = 1

Query: 79  YSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAALEILDHME 138
           ++ + R   + G +     +L  MK   +D D   + V +D+F + GK D A +    +E
Sbjct: 206 FTTLIRGFAKEGRVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIE 265

Query: 139 ELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSFLPNALAC 198
             G   +  TY S++  L + N++  A+ +F  L        ++   VP         A 
Sbjct: 266 ANGLKPDEVTYTSMIGVLCKANRLDEAVEMFEHL--------EKNRRVPC------TYAY 325

Query: 199 NELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSLALFKEMK 258
           N +++    +    E   + ++ R   S   +V  YN  +      G +D +L +F+EMK
Sbjct: 326 NTMIMGYGSAGKFDEAYSLLERQRAKGSIP-SVIAYNCILTCLRKMGKVDEALKVFEEMK 385

Query: 259 QRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCCK 318
           + +       P+L TYN LI +LC  GK++ A  + + ++ +G  P+  T  +++   CK
Sbjct: 386 KDA------APNLSTYNILIDMLCRAGKLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCK 445

Query: 319 SYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPWT 378
           S ++D+A A+F EM+Y    PD I + SL+DGL K  RV +A + ++KM+    R +   
Sbjct: 446 SQKLDEACAMFEEMDYKVCTPDEITFCSLIDGLGKVGRVDDAYKVYEKMLDSDCRTNSIV 505

Query: 379 YNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEME 438
           Y  LI   F +GR E  + ++ D+  +    D    +  +  + K G  E+   + EE++
Sbjct: 506 YTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIK 565

Query: 439 ARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSVKYQKNK 498
           AR FV D  + + L+  + K G       L   ++E   V +   +   ++   K  K  
Sbjct: 566 ARRFVPDARSYSILIHGLIKAGFANETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGK-V 625

Query: 499 RKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSPHVDLLANLAKS 558
            K Y         L E + ++      V  G + +   + D           +L   AKS
Sbjct: 626 NKAY--------QLLEEMKTKGFEPTVVTYGSVIDGLAKID-----RLDEAYMLFEEAKS 685

Query: 559 TGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV 618
               L                    + + ++ +  F   G++  A  + E     G+ P 
Sbjct: 686 KRIELN-------------------VVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPN 745

Query: 619 RYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRADLASSVLE 678
            YT+NS+L A VK    ++A   F  M E  C  +  TY ++I GL K+ + + A    +
Sbjct: 746 LYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQ 805

Query: 679 KLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTLIEVHSKAG 738
           ++ +QG     + Y T+++ L KAG + +   LF++ +++G  PD   +N +IE  S   
Sbjct: 806 EMQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGN 838

Query: 739 RFKDAYKFLKMMLDSGCSPNHVTDTIL 766
           R  DA+   +     G   ++ T  +L
Sbjct: 866 RAMDAFSLFEETRRRGLPIHNKTCVVL 838

BLAST of Cp4.1LG12g04710.1 vs. TAIR10
Match: AT4G31850.1 (AT4G31850.1 proton gradient regulation 3)

HSP 1 Score: 201.4 bits (511), Expect = 2.0e-51
Identity = 181/703 (25.75%), Postives = 316/703 (44.95%), Query Frame = 1

Query: 78  TYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAALEILDHM 137
           TY+ +   LC +  L     V   MK      D  T+  LLD F  +   D+  +    M
Sbjct: 295 TYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEM 354

Query: 138 EELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFSFLPNALA 197
           E+ G   ++ T+  ++ AL +    G A       FD       +G        LPN   
Sbjct: 355 EKDGHVPDVVTFTILVDALCKAGNFGEA-------FDTLDVMRDQG-------ILPNLHT 414

Query: 198 CNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTSLALFKEM 257
            N L+  L +     +  ++F  + ++   +     Y + I  +G  G   ++L  F++M
Sbjct: 415 YNTLICGLLRVHRLDDALELFGNMESL-GVKPTAYTYIVFIDYYGKSGDSVSALETFEKM 474

Query: 258 KQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCC 317
           K + +      P++   N+ ++ L   G+  +A  ++  LK  G  PD+ TY ++++   
Sbjct: 475 KTKGIA-----PNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYS 534

Query: 318 KSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPW 377
           K   +D+A  + +EM  NG  PD IV NSL++ L+KA RV EA + F +M +  ++ +  
Sbjct: 535 KVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVV 594

Query: 378 TYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEM 437
           TYN L+ GL +NG+ + +  LF  + +KG   + +T++ +   LCK   +  AL+++ +M
Sbjct: 595 TYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKM 654

Query: 438 EARGFVIDLVTVTSLLIAMHKQGQ-------WEGLERLM--KHIREGDLVPNVLKWKANM 497
              G V D+ T  +++  + K GQ       +  +++L+    +    L+P V+K  + +
Sbjct: 655 MDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQMKKLVYPDFVTLCTLLPGVVK-ASLI 714

Query: 498 EDSVKYQKNKRKNY----SSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWS 557
           ED+ K   N   N     ++LF       ++I S  +     N    SE           
Sbjct: 715 EDAYKIITNFLYNCADQPANLF-----WEDLIGSILAEAGIDNAVSFSER---------- 774

Query: 558 SSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLAC 617
                 L+AN     GDS+    L P  R   K +N                   +S A 
Sbjct: 775 ------LVANGICRDGDSI----LVPIIRYSCKHNN-------------------VSGAR 834

Query: 618 KLFEIFS-DMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQG 677
            LFE F+ D+GV P   TYN ++   ++      A  +F ++    C  D+ATYN ++  
Sbjct: 835 TLFEKFTKDLGVQPKLPTYNLLIGGLLEADMIEIAQDVFLQVKSTGCIPDVATYNFLLDA 894

Query: 678 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKL-FEQMRSSGINP 737
            GK G+ D    + +++       + + +N +++ L KAG +DD   L ++ M     +P
Sbjct: 895 YGKSGKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDDALDLYYDLMSDRDFSP 932

Query: 738 DVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTIL 766
              ++  LI+  SK+GR  +A +  + MLD GC PN     IL
Sbjct: 955 TACTYGPLIDGLSKSGRLYEAKQLFEGMLDYGCRPNCAIYNIL 932

BLAST of Cp4.1LG12g04710.1 vs. TAIR10
Match: AT3G22470.1 (AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 194.9 bits (494), Expect = 1.9e-49
Identity = 130/498 (26.10%), Postives = 235/498 (47.19%), Query Frame = 1

Query: 270 DLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYRLIIQGCCKSYRMDDATAIF 329
           D+ T   +I+  C   K+  A  V       G+EPD  T+  ++ G C   R+ +A A+ 
Sbjct: 104 DMYTMTIMINCYCRKKKLLFAFSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALV 163

Query: 330 NEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQEGVRASPWTYNILIDGLFRN 389
           + M      PD +  ++L++GL    RV EA    D+MV+ G +    TY  +++ L ++
Sbjct: 164 DRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKS 223

Query: 390 GRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEALQLVEEMEARGFVIDLVTV 449
           G +  +  LF  ++++      V YSI+I  LCK+G  ++AL L  EME +G   D+VT 
Sbjct: 224 GNSALALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTY 283

Query: 450 TSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSVKYQKNKRKNYSSLFSPK 509
           +SL+  +   G+W+   ++++ +   +++P+V+ + A ++  VK  K        L   K
Sbjct: 284 SSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGK--------LLEAK 343

Query: 510 EDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSPHVDLLANLAKST--GDSLQPFS 569
           E  +E+I+   +                 D   ++S     L+    K     ++ Q F 
Sbjct: 344 ELYNEMITRGIA----------------PDTITYNS-----LIDGFCKENCLHEANQMFD 403

Query: 570 LSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLS 629
           L   +  E       DI   +  ++ +    ++    +LF   S  G+ P   TYN+++ 
Sbjct: 404 LMVSKGCEP------DIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVL 463

Query: 630 AFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGRADLASSVLEKLMEQGGYL 689
            F + G  + A  +F EM  +  P  + TY +++ GL   G  + A  + EK+ +    L
Sbjct: 464 GFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTL 523

Query: 690 DIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNTLIEVHSKAGRFKDAYKFL 749
            I +YN +++ +  A ++DD   LF  +   G+ PDVV++N +I    K G   +A    
Sbjct: 524 GIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEADMLF 566

Query: 750 KMMLDSGCSPNHVTDTIL 766
           + M + GC+P+  T  IL
Sbjct: 584 RKMKEDGCTPDDFTYNIL 566

BLAST of Cp4.1LG12g04710.1 vs. TAIR10
Match: AT2G16880.1 (AT2G16880.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 190.3 bits (482), Expect = 4.6e-48
Identity = 181/755 (23.97%), Postives = 324/755 (42.91%), Query Frame = 1

Query: 17  LVASITKTLSESGTR---TLQHQSLSISEPLLLQILRSRSV--HPSNKLDFFKWCSLSPN 76
           L+ ++T  L+   T    TL      I++PLL  +L S S+   P   + FF+W      
Sbjct: 11  LLKTLTSILTSEKTHFLETLNPYIPQITQPLLTSLLSSPSLAKKPETLVSFFQWA----- 70

Query: 77  FSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTF---KVLLDAFIRSGKFD 136
                       +T     +  + PL L S+ R  +    H F   K LL ++IR+   D
Sbjct: 71  ------------QTSIPEAFPSDSPLPLISVVRSLLS--HHKFADAKSLLVSYIRTS--D 130

Query: 137 AALEILDHMEELGTSLELNT------YNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQE 196
           A+L + + +  L  +L L+       ++  L A + + +  +AL IF K+          
Sbjct: 131 ASLSLCNSL--LHPNLHLSPPPSKALFDIALSAYLHEGKPHVALQIFQKMI--------- 190

Query: 197 GSAVPSFSFLPNALACNELLVALRKSDMRVEF---KKVFDKLRTIRSFEFNVCGYNICIH 256
                     PN L CN LL+ L +          ++VFD +  I     NV  +N+ ++
Sbjct: 191 -----RLKLKPNLLTCNTLLIGLVRYPSSFSISSAREVFDDMVKI-GVSLNVQTFNVLVN 250

Query: 257 AFGCWGYLDTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKG 316
            +   G L+ +L + + M     V+    PD  TYN+++  +   G+++D   +  ++K 
Sbjct: 251 GYCLEGKLEDALGMLERMVSEFKVN----PDNVTYNTILKAMSKKGRLSDLKELLLDMKK 310

Query: 317 SGHEPDAFTYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIE 376
           +G  P+  TY  ++ G CK   + +A  I   M+    +PD   YN L++GL  A  + E
Sbjct: 311 NGLVPNRVTYNNLVYGYCKLGSLKEAFQIVELMKQTNVLPDLCTYNILINGLCNAGSMRE 370

Query: 377 ACQFFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIIL 436
             +  D M    ++    TYN LIDG F  G +  +  L   ++  G   + VT++I + 
Sbjct: 371 GLELMDAMKSLKLQPDVVTYNTLIDGCFELGLSLEARKLMEQMENDGVKANQVTHNISLK 430

Query: 437 QLCKEGLLEEALQLVEEM-EARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLV 496
            LCKE   E   + V+E+ +  GF  D+VT  +L+ A  K G   G   +M+ + +  + 
Sbjct: 431 WLCKEEKREAVTRKVKELVDMHGFSPDIVTYHTLIKAYLKVGDLSGALEMMREMGQKGIK 490

Query: 497 PNVLKWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDI-----SE 556
            N +     ++   K +K              +L      R   V +V  G +      E
Sbjct: 491 MNTITLNTILDALCKERK---------LDEAHNLLNSAHKRGFIVDEVTYGTLIMGFFRE 550

Query: 557 NTEEKDDDNWSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSI 616
              EK  + W                 D ++   ++P             +   N+ +  
Sbjct: 551 EKVEKALEMW-----------------DEMKKVKITP------------TVSTFNSLIGG 610

Query: 617 FLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPAD 676
               GK  LA + F+  ++ G+ P   T+NS++  + K+G   +A+  +NE  +     D
Sbjct: 611 LCHHGKTELAMEKFDELAESGLLPDDSTFNSIILGYCKEGRVEKAFEFYNESIKHSFKPD 670

Query: 677 IATYNLIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFE 736
             T N+++ GL K G  + A +    L+E+   +D V YNT+++A  K  ++ +   L  
Sbjct: 671 NYTCNILLNGLCKEGMTEKALNFFNTLIEE-REVDTVTYNTMISAFCKDKKLKEAYDLLS 684

Query: 737 QMRSSGINPDVVSFNTLIEVHSKAGRFKDAYKFLK 749
           +M   G+ PD  ++N+ I +  + G+  +  + LK
Sbjct: 731 EMEEKGLEPDRFTYNSFISLLMEDGKLSETDELLK 684

BLAST of Cp4.1LG12g04710.1 vs. NCBI nr
Match: gi|659119716|ref|XP_008459805.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis melo])

HSP 1 Score: 1384.8 bits (3583), Expect = 0.0e+00
Identity = 684/787 (86.91%), Postives = 735/787 (93.39%), Query Frame = 1

Query: 3   SRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDF 62
           SR   TLS+L+DLLLVASITKTLSESGTRTLQH SL IS PLLLQIL SRS++PS+KLDF
Sbjct: 17  SRTVSTLSQLSDLLLVASITKTLSESGTRTLQHHSLPISHPLLLQILHSRSLNPSHKLDF 76

Query: 63  FKWCSLSPNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFI 122
           FKWCSL+PNF+HS STYSQIF  LCRSGYLHEVP +L SMKRDGV VDSHTFKVLLDAFI
Sbjct: 77  FKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDGVSVDSHTFKVLLDAFI 136

Query: 123 RSGKFDAALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQE 182
           RSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFD  + GGQ+
Sbjct: 137 RSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDGLNNGGQD 196

Query: 183 GSAVPSFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFG 242
            SA  SF FLPN+LACNELLVALRK DMRVEF+KVFDKLR I +FEFNVCGYNICI+AFG
Sbjct: 197 DSAATSFHFLPNSLACNELLVALRKLDMRVEFRKVFDKLRAIEAFEFNVCGYNICIYAFG 256

Query: 243 CWGYLDTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGH 302
           CWGYLDT+L+LFKEMK++SLV  SFGPDLCTYNS+I VLCLVGKV DALIVWEELKGSGH
Sbjct: 257 CWGYLDTALSLFKEMKEKSLVLGSFGPDLCTYNSIIRVLCLVGKVKDALIVWEELKGSGH 316

Query: 303 EPDAFTYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQ 362
           EPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNG +PD IVYNSLL+GLFKAR+V EACQ
Sbjct: 317 EPDAFTYRIIIQGCCKSYRMDDATMIFNEMEYNGLIPDIIVYNSLLNGLFKARKVTEACQ 376

Query: 363 FFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLC 422
            FDKMVQE VRASPWTYNILIDGLFRNGRAEA Y+LFCDLKKKGQFVDGVTYSIIILQLC
Sbjct: 377 LFDKMVQEDVRASPWTYNILIDGLFRNGRAEAGYTLFCDLKKKGQFVDGVTYSIIILQLC 436

Query: 423 KEGLLEEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVL 482
           KEGLLEEALQLVEEMEARGFV+DL+T+TSLLIAMHKQGQWEGLERLMKHIREGDLVPNVL
Sbjct: 437 KEGLLEEALQLVEEMEARGFVVDLITITSLLIAMHKQGQWEGLERLMKHIREGDLVPNVL 496

Query: 483 KWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDN 542
           KWK NMEDS+KYQKNKR+++SSLFSPKEDL E+ISSRASS A+VN+ +  ENTEE D D 
Sbjct: 497 KWKINMEDSIKYQKNKREDFSSLFSPKEDLIEVISSRASSAAEVNIDNSVENTEEMDTDG 556

Query: 543 WSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSL 602
           WSSSPHVD LANLA ST D LQPFSL  G+R++ KG+NSFDI+MVNTFLSIFLAKGKL+L
Sbjct: 557 WSSSPHVDGLANLANSTTDILQPFSLRQGRRIQEKGNNSFDINMVNTFLSIFLAKGKLNL 616

Query: 603 ACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQ 662
           ACKLFEIFSDMGVNPV+YTYNSMLS+FVKKGYFHQAWGIFNEMGE VCPADIATYN+IIQ
Sbjct: 617 ACKLFEIFSDMGVNPVKYTYNSMLSSFVKKGYFHQAWGIFNEMGENVCPADIATYNVIIQ 676

Query: 663 GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINP 722
           GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTL+NALGKAGRMDDVNKLF+QMR+SGINP
Sbjct: 677 GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMRNSGINP 736

Query: 723 DVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASI 782
           DVV+FNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDT LDFLGREIEKARYEKASI
Sbjct: 737 DVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKARYEKASI 796

Query: 783 IRDKNSS 790
           IRDKNSS
Sbjct: 797 IRDKNSS 803

BLAST of Cp4.1LG12g04710.1 vs. NCBI nr
Match: gi|449445529|ref|XP_004140525.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis sativus])

HSP 1 Score: 1360.5 bits (3520), Expect = 0.0e+00
Identity = 675/787 (85.77%), Postives = 728/787 (92.50%), Query Frame = 1

Query: 3   SRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDF 62
           SR   TLS L+ LLL+ASITKTLSESGTRTLQH SL IS PLLLQIL SRS++PS+KLDF
Sbjct: 17  SRTASTLSHLSHLLLLASITKTLSESGTRTLQHHSLPISHPLLLQILHSRSLNPSHKLDF 76

Query: 63  FKWCSLSPNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFI 122
           FKWCSL+PNF+HS STYSQIF  LCRSGYLHEVP +L SMKRDGV VDSHTFKVLLDAFI
Sbjct: 77  FKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDGVSVDSHTFKVLLDAFI 136

Query: 123 RSGKFDAALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQE 182
           RSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKL D F+ GGQ 
Sbjct: 137 RSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLLDGFNNGGQV 196

Query: 183 GSAVPSFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFG 242
            SA  +F FLPN+LACNELLVALRK DMRVEFKKVFDKLR I SFEF+V GYNICI+AFG
Sbjct: 197 DSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAIESFEFSVYGYNICIYAFG 256

Query: 243 CWGYLDTSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGH 302
           CWGYLDT+L+LFKEMK++SLVS SF PDLCTYNS+IHVLCLVGKV DALIVWEELKGSGH
Sbjct: 257 CWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSIIHVLCLVGKVKDALIVWEELKGSGH 316

Query: 303 EPDAFTYRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQ 362
           EPDAFTYR+IIQGCCKS RMDDAT IFNEMEYNG +PDTIVYNSLL+GLFKAR+V EACQ
Sbjct: 317 EPDAFTYRIIIQGCCKSCRMDDATMIFNEMEYNGLIPDTIVYNSLLNGLFKARKVTEACQ 376

Query: 363 FFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLC 422
            FDKMVQE VRASPWTYNILIDGLFRNGRAEA Y+LFCDLKKKGQ VD VTYSIIILQLC
Sbjct: 377 LFDKMVQEDVRASPWTYNILIDGLFRNGRAEAGYTLFCDLKKKGQIVDAVTYSIIILQLC 436

Query: 423 KEGLLEEALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVL 482
           KE LLEEALQLVEEMEARGFV+DL+T+TSLLIAMHKQGQW+GLERLMKHIREGDLVPNVL
Sbjct: 437 KERLLEEALQLVEEMEARGFVVDLITITSLLIAMHKQGQWDGLERLMKHIREGDLVPNVL 496

Query: 483 KWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDN 542
           KWK NME S+KYQKNKRK++SSLFSPKEDLSE+ISSRASS AKVN+ +  ENTEE+D D+
Sbjct: 497 KWKINMEYSIKYQKNKRKDFSSLFSPKEDLSEVISSRASSAAKVNIDNSFENTEERDMDS 556

Query: 543 WSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSL 602
           WSSSP+V+ LANLA ST D LQPFS+  G+R++ K DNSFDI+MVNTFLSIFLAKGKL+L
Sbjct: 557 WSSSPYVNRLANLANSTSDILQPFSIRQGRRIQEKQDNSFDINMVNTFLSIFLAKGKLNL 616

Query: 603 ACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQ 662
           ACKLFEIFSDMGVNPV+YTYNSMLS+FVKKGYFHQAWGIFNEMGE VCPADIATYN+IIQ
Sbjct: 617 ACKLFEIFSDMGVNPVKYTYNSMLSSFVKKGYFHQAWGIFNEMGENVCPADIATYNVIIQ 676

Query: 663 GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINP 722
           GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTL+NALGKAGRMDDVNKLF QMR+SGINP
Sbjct: 677 GLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFGQMRNSGINP 736

Query: 723 DVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASI 782
           DVV+FNTLIEVHSKAGR KDAYKFLKMMLDSGCSPNHVTDT LDFLGRE+EKARYEKASI
Sbjct: 737 DVVTFNTLIEVHSKAGRLKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREMEKARYEKASI 796

Query: 783 IRDKNSS 790
           IRDKNSS
Sbjct: 797 IRDKNSS 803

BLAST of Cp4.1LG12g04710.1 vs. NCBI nr
Match: gi|590720575|ref|XP_007051367.1| (Pentatricopeptide repeat-containing protein, putative [Theobroma cacao])

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 561/784 (71.56%), Postives = 658/784 (83.93%), Query Frame = 1

Query: 12  LADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILRSRSVHPSNKLDFFKWC-SLSP 71
           L ++LL+AS+TKTLSESGTR L   S+ ISEPL++QILR  S+ PS KLDFF WC S+ P
Sbjct: 23  LGNILLIASLTKTLSESGTRNLDPNSIPISEPLVIQILRKHSLEPSKKLDFFNWCRSVKP 82

Query: 72  NFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDAA 131
           NF HSA TYS IFRTLCRSG++ EVP +L +MK DGV VDS TFK LLDAFIRSGKFD+A
Sbjct: 83  NFKHSAVTYSHIFRTLCRSGFVEEVPNLLFAMKEDGVLVDSDTFKFLLDAFIRSGKFDSA 142

Query: 132 LEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSFS 191
           LEILD MEELG  L L  Y+SVLVAL+RK+QVGLALS+FFKL +A + G  +G++V S  
Sbjct: 143 LEILDFMEELGAGLNLRVYDSVLVALIRKDQVGLALSLFFKLLEACN-GNDDGNSVDSS- 202

Query: 192 FLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDTS 251
            LP ++A NELLVALRK+ MR EFK+VFD LR  R FEF+ CGYNICIH+FGCWG L  S
Sbjct: 203 -LPGSIAINELLVALRKAHMRREFKQVFDILREKREFEFDTCGYNICIHSFGCWGDLGAS 262

Query: 252 LALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTYR 311
           L LFKEMK++     SFGPDLCTYNSLI VLCLVGKV DAL+VWEELK SGHEPDAFTYR
Sbjct: 263 LKLFKEMKEKEKSFGSFGPDLCTYNSLIDVLCLVGKVKDALVVWEELKVSGHEPDAFTYR 322

Query: 312 LIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQE 371
           ++IQGC KSYRMDDAT IF+EM+YNGF  DT+VYNSLL+GLFKAR+V+EACQFF+KMVQ+
Sbjct: 323 ILIQGCSKSYRMDDATKIFSEMQYNGFAMDTVVYNSLLNGLFKARKVMEACQFFEKMVQD 382

Query: 372 GVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEEA 431
           GVRAS WTYNILIDGLFRNGRAEA+Y+LFCDLKKKGQFVDG+TYSI++LQLC+EG LE A
Sbjct: 383 GVRASCWTYNILIDGLFRNGRAEAAYTLFCDLKKKGQFVDGITYSIVVLQLCREGQLEGA 442

Query: 432 LQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANMED 491
           L+LVEEMEARGF++DLVT+TSLLI  HKQG+W+  ERLMKHIR+G+LVPNVLKWKANME 
Sbjct: 443 LRLVEEMEARGFIVDLVTITSLLIGFHKQGRWDWTERLMKHIRDGNLVPNVLKWKANMEA 502

Query: 492 SVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDD-------DNW 551
           S+K     RK+Y+ LF  K D  EI++   S    +     SE+ +EKD        D W
Sbjct: 503 SMKNPPKNRKDYTPLFPSKGDFREIMNLLGSVGQAMGTNLDSEDCDEKDQEKPSIDTDQW 562

Query: 552 SSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLA 611
           SSSP++D LAN  KST  S Q FSL  GQRV+ KG  SFD+DMVNTFLSIFLAKGKLSLA
Sbjct: 563 SSSPYMDQLANQGKSTERSSQLFSLIRGQRVQEKGIGSFDVDMVNTFLSIFLAKGKLSLA 622

Query: 612 CKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQG 671
           CKLFE+F+DMGV+PV YTYNS++S+FVKKGYF++AWG+ NEM EKVCPADIATYNLIIQG
Sbjct: 623 CKLFEVFTDMGVDPVSYTYNSIMSSFVKKGYFNEAWGVLNEMDEKVCPADIATYNLIIQG 682

Query: 672 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPD 731
           LGKMGRAD+ASSVL+KLM+QGGYLD+VMYNTL+NALGKAGR+D+ +KLFEQMR+SGINPD
Sbjct: 683 LGKMGRADIASSVLDKLMKQGGYLDVVMYNTLVNALGKAGRVDEASKLFEQMRTSGINPD 742

Query: 732 VVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASII 788
           V+++NTLIEVH+KAG+ +DAYKFLKMMLD+GCSPNHVTDTILD LG+EIEK R +KAS++
Sbjct: 743 VITYNTLIEVHTKAGQLQDAYKFLKMMLDAGCSPNHVTDTILDNLGKEIEKMRLQKASMV 802

BLAST of Cp4.1LG12g04710.1 vs. NCBI nr
Match: gi|658045553|ref|XP_008358457.1| (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g01570-like [Malus domestica])

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 557/787 (70.78%), Postives = 658/787 (83.61%), Query Frame = 1

Query: 10  SRLADLLLVASITKTLSESGTRTLQH-QSLSISEPLLLQILRSRSVHPSNKLDFFKWCSL 69
           S+L D+LLVA+ITKTLS SGTR L    +L +SEPLL QILR++S+HPS KLDFFKWCSL
Sbjct: 18  SQLGDILLVAAITKTLSTSGTRNLPDPHTLPLSEPLLFQILRAQSLHPSKKLDFFKWCSL 77

Query: 70  SPNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFD 129
           + N  HSA  YS I RT  R+G+LHEVP +L SMK DGV VDS TFK LLDAFIRSGKFD
Sbjct: 78  THNIKHSARAYSHILRTASRAGFLHEVPQLLXSMKEDGVVVDSQTFKALLDAFIRSGKFD 137

Query: 130 AALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPS 189
            ALEILD ME++G  L  + YN VLVALVRKNQVGLA++I FKL +A  +          
Sbjct: 138 YALEILDIMEDVGAGLNTDMYNLVLVALVRKNQVGLAMAILFKLLEAGDS---------- 197

Query: 190 FSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLD 249
            + +PN++ACNELLVALRKSDMRV FK+VF+KLR    FE +  GYNICIHAFGCWG L 
Sbjct: 198 -TQVPNSIACNELLVALRKSDMRVGFKQVFNKLRESEGFEKDTWGYNICIHAFGCWGDLG 257

Query: 250 TSLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFT 309
           TSL+LF+EMK  +L +V  GPDL TYNSLIHVLCLVGK+NDALIVWEELKGSGHEPDA T
Sbjct: 258 TSLSLFREMKDSNLDNV--GPDLSTYNSLIHVLCLVGKMNDALIVWEELKGSGHEPDAIT 317

Query: 310 YRLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMV 369
           YR++IQGCC+ YR+DDAT IF+EM+ NG++PDTIVYNSLLDGLFKAR+V + CQ F+KMV
Sbjct: 318 YRILIQGCCRCYRIDDATKIFSEMQLNGYIPDTIVYNSLLDGLFKARKVNDGCQLFEKMV 377

Query: 370 QEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLE 429
           Q GVRAS WTYNILIDGLFRNGRAEA+Y+LFCDLKKKGQFVDGVTYSI++LQLCKEGLLE
Sbjct: 378 QNGVRASTWTYNILIDGLFRNGRAEAAYTLFCDLKKKGQFVDGVTYSIVVLQLCKEGLLE 437

Query: 430 EALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANM 489
           EAL LVEEME RGF +DLVT++SL+I ++K+G+W+  ++LMKHIR+G+LVP+VLKWK +M
Sbjct: 438 EALGLVEEMERRGFTVDLVTISSLVIGLYKEGRWDWTDKLMKHIRDGNLVPSVLKWKVDM 497

Query: 490 EDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDN------ 549
           E S+K  +  RK+Y+ LF  K DLSEI+S   S+ + ++    SE    K+DD       
Sbjct: 498 EASLKNPQRNRKDYTPLFPSKGDLSEIMSLIKSAESTMDADLDSEAARVKEDDKNLSTDT 557

Query: 550 --WSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKL 609
             WSSSPH+D LAN  KST  S Q FSLS GQRV+AKG+N+FDIDMVNTFLS+FLAKGKL
Sbjct: 558 GQWSSSPHMDQLANQLKSTDHSSQLFSLSRGQRVQAKGENTFDIDMVNTFLSLFLAKGKL 617

Query: 610 SLACKLFEIFSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLI 669
           S+ACKLFEIFSD+G NPV YTYNS +S+FVKKGYF++AWG+ NEMGE+VCP DIATYN+I
Sbjct: 618 SIACKLFEIFSDLGENPVSYTYNSXMSSFVKKGYFNEAWGVLNEMGERVCPTDIATYNVI 677

Query: 670 IQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGI 729
           IQGLGKMGRADLASSVL+KL+EQGGYLD+VMYNTL+NALGKA R+D+VNKLFEQM+SSGI
Sbjct: 678 IQGLGKMGRADLASSVLDKLIEQGGYLDVVMYNTLINALGKASRIDEVNKLFEQMKSSGI 737

Query: 730 NPDVVSFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKA 788
           NPDVV+FNTLIEVHSKAGR KDAYKFLKMMLD+GC+PNHVTDT LDFLG+EIEK RY+KA
Sbjct: 738 NPDVVTFNTLIEVHSKAGRLKDAYKFLKMMLDAGCTPNHVTDTTLDFLGKEIEKMRYQKA 791

BLAST of Cp4.1LG12g04710.1 vs. NCBI nr
Match: gi|1009142863|ref|XP_015888951.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Ziziphus jujuba])

HSP 1 Score: 1095.9 bits (2833), Expect = 0.0e+00
Identity = 553/780 (70.90%), Postives = 648/780 (83.08%), Query Frame = 1

Query: 11  RLADLLLVASITKTLSESGTRTLQH-QSLSISEPLLLQILRSRSVHPSNKLDFFKWCSLS 70
           +L D LLVAS+TKTLS+ GT  L    S+ +SE LLLQILR++++HPS KL FF+WCSL 
Sbjct: 17  QLGDHLLVASLTKTLSDCGTHNLPDPHSIPLSESLLLQILRTKTLHPSKKLAFFRWCSLV 76

Query: 71  PNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVDSHTFKVLLDAFIRSGKFDA 130
           P+F HSA +YS IFRT+CR+GYLHEVP +L+SMK+DGV V S TFK LLD+FI S KFD 
Sbjct: 77  PDFKHSALSYSHIFRTVCRAGYLHEVPDLLNSMKQDGVVVHSETFKALLDSFILSSKFDY 136

Query: 131 ALEILDHMEELGTSLELNTYNSVLVALVRKNQVGLALSIFFKLFDAFSTGGQEGSAVPSF 190
           ALEIL  MEELGTSL  + YNSVLVALVRKNQVGLALSIFFK+    S            
Sbjct: 137 ALEILHVMEELGTSLNTHIYNSVLVALVRKNQVGLALSIFFKILQGNSQ----------- 196

Query: 191 SFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFNVCGYNICIHAFGCWGYLDT 250
             L +++ACN LLVALRK+DMR+EFK+VFDKLR    FEF+  GYNICIHAFGCWG L +
Sbjct: 197 --LLSSIACNMLLVALRKADMRLEFKQVFDKLRDGSGFEFDTWGYNICIHAFGCWGDLGS 256

Query: 251 SLALFKEMKQRSLVSVSFGPDLCTYNSLIHVLCLVGKVNDALIVWEELKGSGHEPDAFTY 310
           SL+LF+EMK+      + GPDLCTYNSLI VLC VGKV DAL+VWEELKGSGHEPDAFTY
Sbjct: 257 SLSLFREMKE------TVGPDLCTYNSLILVLCFVGKVKDALVVWEELKGSGHEPDAFTY 316

Query: 311 RLIIQGCCKSYRMDDATAIFNEMEYNGFVPDTIVYNSLLDGLFKARRVIEACQFFDKMVQ 370
           R++IQGC KSYRMDDA  IFNEM++NG  PDTIVYN+LLDGLFKAR+V EACQ F+KMVQ
Sbjct: 317 RILIQGCSKSYRMDDALKIFNEMQHNGIFPDTIVYNALLDGLFKARKVNEACQLFEKMVQ 376

Query: 371 EGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIILQLCKEGLLEE 430
           +GVRAS WT+NILIDGLFRNGRAEA Y+LFCDLKKKGQFVD +TYSI++LQLCKEGLL+E
Sbjct: 377 DGVRASSWTHNILIDGLFRNGRAEAGYTLFCDLKKKGQFVDNITYSIVVLQLCKEGLLDE 436

Query: 431 ALQLVEEMEARGFVIDLVTVTSLLIAMHKQGQWEGLERLMKHIREGDLVPNVLKWKANME 490
           AL+ VEEME RGFV+DLVTVTSLLI M+KQG+W+  +RLMKHIR G+LVPNVL+WKA+ME
Sbjct: 437 ALRSVEEMEDRGFVVDLVTVTSLLIGMYKQGRWDWSDRLMKHIRNGNLVPNVLRWKADME 496

Query: 491 DSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGDISENTEEKDDDNWSSSPHV 550
            S+K  + KR++ +  F  + D SEI++    + + V+ G   E     D D WSSSP++
Sbjct: 497 ASMKSSQTKREDLTPFFPSRGDFSEIMNLIRYAESTVD-GVKDEENSSADTDQWSSSPYM 556

Query: 551 DLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTFLSIFLAKGKLSLACKLFEI 610
           D LAN   ST    Q FSLS GQRV+AKG +SFDIDMVNTFLSIFLAKGKLSLACKLFEI
Sbjct: 557 DQLANQVNSTDHPSQLFSLSRGQRVQAKGVDSFDIDMVNTFLSIFLAKGKLSLACKLFEI 616

Query: 611 FSDMGVNPVRYTYNSMLSAFVKKGYFHQAWGIFNEMGEKVCPADIATYNLIIQGLGKMGR 670
           FSDMG NPV YTYNSM+S+FVKKGYF++AWG+ NEMGE VCPADIATYN+IIQGLGKMGR
Sbjct: 617 FSDMGANPVSYTYNSMMSSFVKKGYFNEAWGVLNEMGENVCPADIATYNVIIQGLGKMGR 676

Query: 671 ADLASSVLEKLMEQGGYLDIVMYNTLMNALGKAGRMDDVNKLFEQMRSSGINPDVVSFNT 730
           ADLASSVL+KLM+QGGYLDIVMYNTL+NALGKAGRMD+VNKLFEQMR+SGINPD+V+FNT
Sbjct: 677 ADLASSVLDKLMKQGGYLDIVMYNTLINALGKAGRMDEVNKLFEQMRTSGINPDIVTFNT 736

Query: 731 LIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTILDFLGREIEKARYEKASIIRDKNSS 790
           LIEVHSKAGR K+AYKFLKMMLD+GC+PNH+TDT LDFLG+EI+K RY+KAS++ +K+ S
Sbjct: 737 LIEVHSKAGRLKEAYKFLKMMLDAGCTPNHITDTTLDFLGKEIDKLRYQKASMMPNKDDS 776

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP299_ARATH5.3e-28863.73Pentatricopeptide repeat-containing protein At4g01570 OS=Arabidopsis thaliana GN... [more]
PP217_ARATH9.0e-5424.60Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana GN... [more]
PP344_ARATH3.6e-5025.75Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... [more]
PP247_ARATH3.3e-4826.10Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
PP156_ARATH8.2e-4723.97Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0KFG9_CUCSA0.0e+0085.77Uncharacterized protein OS=Cucumis sativus GN=Csa_6G101450 PE=4 SV=1[more]
A0A061DRT6_THECC0.0e+0071.56Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_... [more]
B9GRT7_POPTR0.0e+0069.58Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s18390g PE=4 SV=2[more]
F6HG95_VITVI0.0e+0069.63Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g03410 PE=4 SV=... [more]
U5GLR6_POPTR0.0e+0069.45Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT4G01570.13.0e-28963.73 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G06920.15.1e-5524.60 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G31850.12.0e-5125.75 proton gradient regulation 3[more]
AT3G22470.11.9e-4926.10 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G16880.14.6e-4823.97 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659119716|ref|XP_008459805.1|0.0e+0086.91PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis melo][more]
gi|449445529|ref|XP_004140525.1|0.0e+0085.77PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis sativu... [more]
gi|590720575|ref|XP_007051367.1|0.0e+0071.56Pentatricopeptide repeat-containing protein, putative [Theobroma cacao][more]
gi|658045553|ref|XP_008358457.1|0.0e+0070.78PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g... [more]
gi|1009142863|ref|XP_015888951.1|0.0e+0070.90PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Ziziphus jujub... [more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG12g04710Cp4.1LG12g04710gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG12g04710.1:cds:001Cp4.1LG12g04710.1:cds:001CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG12g04710.1Cp4.1LG12g04710.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 234..260
score: 6.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 620..665
score: 1.3E-12coord: 688..731
score: 2.0E-12coord: 339..387
score: 3.1E-14coord: 269..318
score: 4.6E-14coord: 412..455
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 98..156
score: 8.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 378..406
score: 1.4E-5coord: 112..141
score: 2.4E-4coord: 725..758
score: 5.6E-9coord: 272..306
score: 3.0E-6coord: 307..341
score: 2.3E-6coord: 234..262
score: 5.4E-4coord: 690..724
score: 5.7E-11coord: 656..688
score: 1.1E-4coord: 342..373
score: 2.4E-7coord: 412..445
score: 1.2E-7coord: 620..653
score: 3.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 410..444
score: 11.794coord: 688..722
score: 13.833coord: 230..264
score: 8.21coord: 445..479
score: 8.188coord: 375..409
score: 10.019coord: 583..617
score: 8.057coord: 340..374
score: 11.707coord: 305..339
score: 12.266coord: 618..652
score: 9.997coord: 270..304
score: 11.641coord: 145..175
score: 6.38coord: 723..757
score: 12.682coord: 653..687
score: 9.964coord: 75..109
score: 8.89coord: 110..144
score: 1
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 621..753
score: 1.6E-7coord: 119..172
score: 1.6E-7coord: 461..462
score: 1.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 22..143
score: 0.0coord: 620..765
score: 0.0coord: 229..481
score:
NoneNo IPR availablePANTHERPTHR24015:SF479SUBFAMILY NOT NAMEDcoord: 22..143
score: 0.0coord: 620..765
score: 0.0coord: 229..481
score: