Cp4.1LG09g05200 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g05200
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing family protein
Location: Cp4.1LG09 : 3491946 .. 3494555 (-)
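The location field packs the sequence ID, start, end and strand into one string. As a minimal sketch, the following splits such a string and derives the span length. It assumes the coordinates are 1-based and inclusive (as in GFF3), which the page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text: str) -> dict:
    """Split 'SEQID : START .. END (STRAND)' into its parts.

    Assumption: START and END are 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Cp4.1LG09 : 3491946 .. 3494555 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG09 - 2610
```

The `(-)` strand means the gene is read from the reverse strand of the reference, so the sequences listed for the feature are the reverse complement of the reference interval.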
The following sequences are available for this feature:

Gene sequence (with intron)

TATAGCCCAATGGGCTTTAGATTTAGGCCCAATTAATCGGCCCAAATTAGGAGCACCACCCCCCTCGCTTTCTGGAACGTTAGGGTTTTGATTAAACTCAGGATCTGTAGTAAAACCATTTCGTCTCTGGCGGAGGGAAGTGCCGGAAAATGGTGTTACTTCTACGGCGGCTCAGTCGCACCAAGAATGTGGCAAAGAGGTCGACGAAGAAGTATCTGGAGGAACCACTGTATGTGAGGCTTTTTAAAGATGGTAGCTCAGAGAAGAGCGTTCGGCTACAGTTGAATGTTTTCCTCAAGAGTCGCAAGCGAGTTTTCAAATGGGAGGTTGGAGATACGCTCAAAAAGCTTCGCGATAGGAAGCTGTATTATCCTGCTCTCAAGGTTCGCTAAACTAATTTCTTTTAATTCTGCAGTATGGATGCGAAATTTGTGGATCAATGTTGTTGTTATGACTCTGTTCTAGTTGTCTTAGCTTGGCATGATTTCAGTTTTAGGCACTGTTTCAACTCCTCTTGTTCGATCAGTGTTAAACCACCGAAACCTAAACAGCTCGAAAATAGCTTCATAATCAATTTTGATTTCTCATCTTTTCCTCCTAGCTTGTTTTCGATGAATTTCCTTAACTGCCGTGATCTCGAATCTTGATAGATACCTCATGCCCTTTGATCTAGCTCCATTTCCATCATTCTTCTTGGTGAAAGCATATTATAGTTTACATTTTGTAAGGATTTAGTCCTGTGATTTTAAAAATCTTACAATTAAGTTCTTAGAACAAAGAAAAACATTACAAAAACATTAACGTTTTTTCTGGAGAAGTATTTTATGGTTGAATGGAGCCAAATCAGCAGATTCACCTTCAGTGGTCAAAAACATTTTGTATGTTGATCATATCATTATTTGCTCTAGGAACTTCTAGGAACTTCATTGTTACATTTCTTTACTTGTGCGCAGTGCAAAAATGACTGAATTGAAGCTTCTAATAGCATCTCTACATTTCTGATACAAACGATTATTTTCTTCCATTTGTTTCTCTGTTATTATGGCAAATGATCTTATCTTTCTTGCTTATATTGCTGGTGTAAAACGATTATTCTGAAAAGAAACTGTGATATAGCTTTCAGAAACTATGGCCAAAAGGAGCATGAACAAAACAGTAAGTGATCAAGCAATACATCTTGATTTATTAGCCAAGGCTCGAGGAATTGCTGCCGCCGAGAGCTTCTTTGTTAGTCTTCCCGAATCATCAAAGAATCATCTTTGCTATGGTTCTCTTCTCAACTGTTACTGCAAGGAATTAATGACTGAAAAAGCTGAAGCTATCTCGGAGAAAATGAAGGAACTGAACCTACCTGTGACCTCCATGCCGTACAATAGCCTTATGACACTATACTCAAAAACTGGGCACCCAGAAAAAGTTAGTGCAATCATACAGGAAATGAAGGCGGCCAAAGTAATGTTTGATACCTATACATATAATGTGTGGATGAGGGCACTTGCTGCTTTAAATGACATCTCTGGTGTGGAAAGGGTTATTGATGAGATGAAGGACGGCAGAGCTGTGGGAGATTGGACAACATACAGCAATTTAGCCTCAATTTATGTTGATGCTCACATGTTCGACAAGGCAGGCAACGCGCTGAAGGAATTGGAGAAGAGAAACGCTCGTCGAGATCTTTCTGCGTTCCAGTTCCTGATTACGTTGCATGGACAAATGGGTAACCTTCTCGAAGTCTATAGAGTTTGGCGCTCATTAAGGTTAGCCTTTCCAAAAACCGCAAATATAAGCTATCTCAACATGATCCAAACTCTGATAAAATTGAAAGACTTACCTGGCGCAGAGAAATGTTTCAAGGAATGGCAATCAGGATGCTCAACTTATGATATTAGGATTGCAAATGCTCTTATAGGAGCTTATGCCAAGGAGGGTTTGCTAGAGAAGGCTATCGAACTCAAGGTTCGAGCCCGACAAAGAGGGGCTAAACCTAATGCGAAAACTTG
GGAAATTTTTATGGATTATTATCTCAAAAATGGAGAATTTAAACTGGCAGCTGATTGTGCTGCCAAAGCAGTATCCAAAGGTAGACTAGATGGGGGGAAATGGGTGCCATCGCCTGAGGTTATTAGAACATTCATGAGCCATTACGAGCAAGAAAAAGATGTTGATGGAGCAGAGAGCTTTGTTGAAACGGTAAAGAAAAGTGTAGACAGTTTAGAATCAGAGGTCTTTGAATCATTGATAAGAACATATTCTGCAGCAGGAAGGAGAAGTTGTATGATGAGTCGTAGGTTGAAGATGGAGAAAGTGGAGGTCAGTGAGGCCTGCAAGAAGCTGCTCGACGAAATATCGATTGAATGAGCGTTTGTTGAACAACAAAGTTTTATGATTTAGAGAATTCCAAGGTTTGAGGAATTGAATTTTTCAGTGGTTCTTTGATTGTAAGGACTTCCTCTTCTATTTCAATTTGAATTTTTACCATTTAGAGAATTCCAAGCTCTGTTTCTGTAATAAGATTTACAATAAATTATTATGAACAGAGAGGGCGAGATGGAGGGCTGTGTTCCCAGTTCCAATACAAACACTCAAAAAAACTTGTCTTTTCTGCGTCAA

mRNA sequence

TATAGCCCAATGGGCTTTAGATTTAGGCCCAATTAATCGGCCCAAATTAGGAGCACCACCCCCCTCGCTTTCTGGAACGTTAGGGTTTTGATTAAACTCAGGATCTGTAGTAAAACCATTTCGTCTCTGGCGGAGGGAAGTGCCGGAAAATGGTGTTACTTCTACGGCGGCTCAGTCGCACCAAGAATGTGGCAAAGAGGTCGACGAAGAAGTATCTGGAGGAACCACTGTATGTGAGGCTTTTTAAAGATGGTAGCTCAGAGAAGAGCGTTCGGCTACAGTTGAATGTTTTCCTCAAGAGTCGCAAGCGAGTTTTCAAATGGGAGGTTGGAGATACGCTCAAAAAGCTTCGCGATAGGAAGCTGTATTATCCTGCTCTCAAGCTTTCAGAAACTATGGCCAAAAGGAGCATGAACAAAACAGTAAGTGATCAAGCAATACATCTTGATTTATTAGCCAAGGCTCGAGGAATTGCTGCCGCCGAGAGCTTCTTTGTTAGTCTTCCCGAATCATCAAAGAATCATCTTTGCTATGGTTCTCTTCTCAACTGTTACTGCAAGGAATTAATGACTGAAAAAGCTGAAGCTATCTCGGAGAAAATGAAGGAACTGAACCTACCTGTGACCTCCATGCCGTACAATAGCCTTATGACACTATACTCAAAAACTGGGCACCCAGAAAAAGTTAGTGCAATCATACAGGAAATGAAGGCGGCCAAAGTAATGTTTGATACCTATACATATAATGTGTGGATGAGGGCACTTGCTGCTTTAAATGACATCTCTGGTGTGGAAAGGGTTATTGATGAGATGAAGGACGGCAGAGCTGTGGGAGATTGGACAACATACAGCAATTTAGCCTCAATTTATGTTGATGCTCACATGTTCGACAAGGCAGGCAACGCGCTGAAGGAATTGGAGAAGAGAAACGCTCGTCGAGATCTTTCTGCGTTCCAGTTCCTGATTACGTTGCATGGACAAATGGGTAACCTTCTCGAAGTCTATAGAGTTTGGCGCTCATTAAGGTTAGCCTTTCCAAAAACCGCAAATATAAGCTATCTCAACATGATCCAAACTCTGATAAAATTGAAAGACTTACCTGGCGCAGAGAAATGTTTCAAGGAATGGCAATCAGGATGCTCAACTTATGATATTAGGATTGCAAATGCTCTTATAGGAGCTTATGCCAAGGAGGGTTTGCTAGAGAAGGCTATCGAACTCAAGGTTCGAGCCCGACAAAGAGGGGCTAAACCTAATGCGAAAACTTGGGAAATTTTTATGGATTATTATCTCAAAAATGGAGAATTTAAACTGGCAGCTGATTGTGCTGCCAAAGCAGTATCCAAAGGTAGACTAGATGGGGGGAAATGGGTGCCATCGCCTGAGGTTATTAGAACATTCATGAGCCATTACGAGCAAGAAAAAGATGTTGATGGAGCAGAGAGCTTTGTTGAAACGGTAAAGAAAAGTGTAGACAGTTTAGAATCAGAGGTCTTTGAATCATTGATAAGAACATATTCTGCAGCAGGAAGGAGAAGTTGTATGATGAGTCGTAGGTTGAAGATGGAGAAAGTGGAGGTCAGTGAGGCCTGCAAGAAGCTGCTCGACGAAATATCGATTGAATGAGCGTTTGTTGAACAACAAAGTTTTATGATTTAGAGAATTCCAAGGTTTGAGGAATTGAATTTTTCAGTGGTTCTTTGATTGTAAGGACTTCCTCTTCTATTTCAATTTGAATTTTTACCATTTAGAGAATTCCAAGCTCTGTTTCTGTAATAAGATTTACAATAAATTATTATGAACAGAGAGGGCGAGATGGAGGGCTGTGTTCCCAGTTCCAATACAAACACTCAAAAAAACTTGTCTTTTCTGCGTCAA

Coding sequence (CDS)

ATGGTGTTACTTCTACGGCGGCTCAGTCGCACCAAGAATGTGGCAAAGAGGTCGACGAAGAAGTATCTGGAGGAACCACTGTATGTGAGGCTTTTTAAAGATGGTAGCTCAGAGAAGAGCGTTCGGCTACAGTTGAATGTTTTCCTCAAGAGTCGCAAGCGAGTTTTCAAATGGGAGGTTGGAGATACGCTCAAAAAGCTTCGCGATAGGAAGCTGTATTATCCTGCTCTCAAGCTTTCAGAAACTATGGCCAAAAGGAGCATGAACAAAACAGTAAGTGATCAAGCAATACATCTTGATTTATTAGCCAAGGCTCGAGGAATTGCTGCCGCCGAGAGCTTCTTTGTTAGTCTTCCCGAATCATCAAAGAATCATCTTTGCTATGGTTCTCTTCTCAACTGTTACTGCAAGGAATTAATGACTGAAAAAGCTGAAGCTATCTCGGAGAAAATGAAGGAACTGAACCTACCTGTGACCTCCATGCCGTACAATAGCCTTATGACACTATACTCAAAAACTGGGCACCCAGAAAAAGTTAGTGCAATCATACAGGAAATGAAGGCGGCCAAAGTAATGTTTGATACCTATACATATAATGTGTGGATGAGGGCACTTGCTGCTTTAAATGACATCTCTGGTGTGGAAAGGGTTATTGATGAGATGAAGGACGGCAGAGCTGTGGGAGATTGGACAACATACAGCAATTTAGCCTCAATTTATGTTGATGCTCACATGTTCGACAAGGCAGGCAACGCGCTGAAGGAATTGGAGAAGAGAAACGCTCGTCGAGATCTTTCTGCGTTCCAGTTCCTGATTACGTTGCATGGACAAATGGGTAACCTTCTCGAAGTCTATAGAGTTTGGCGCTCATTAAGGTTAGCCTTTCCAAAAACCGCAAATATAAGCTATCTCAACATGATCCAAACTCTGATAAAATTGAAAGACTTACCTGGCGCAGAGAAATGTTTCAAGGAATGGCAATCAGGATGCTCAACTTATGATATTAGGATTGCAAATGCTCTTATAGGAGCTTATGCCAAGGAGGGTTTGCTAGAGAAGGCTATCGAACTCAAGGTTCGAGCCCGACAAAGAGGGGCTAAACCTAATGCGAAAACTTGGGAAATTTTTATGGATTATTATCTCAAAAATGGAGAATTTAAACTGGCAGCTGATTGTGCTGCCAAAGCAGTATCCAAAGGTAGACTAGATGGGGGGAAATGGGTGCCATCGCCTGAGGTTATTAGAACATTCATGAGCCATTACGAGCAAGAAAAAGATGTTGATGGAGCAGAGAGCTTTGTTGAAACGGTAAAGAAAAGTGTAGACAGTTTAGAATCAGAGGTCTTTGAATCATTGATAAGAACATATTCTGCAGCAGGAAGGAGAAGTTGTATGATGAGTCGTAGGTTGAAGATGGAGAAAGTGGAGGTCAGTGAGGCCTGCAAGAAGCTGCTCGACGAAATATCGATTGAATGA

Protein sequence

MVLLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMKDGRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKLLDEISIE
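
The protein sequence above is the conceptual translation of the coding sequence. The sketch below shows that relationship on the first and last few codons of the CDS. It assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and it handles only unambiguous A/C/G/T codons.

```python
# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGGTGTTACTTCTACGGCGGCTC"))  # first 8 codons -> MVLLLRRL
print(translate("ATATCGATTGAATGA"))           # last 5 codons  -> ISIE
```

Passing the full coding sequence to the same function should reproduce the complete protein sequence, provided the CDS is pasted as one uninterrupted string.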
BLAST of Cp4.1LG09g05200 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 669.5 bits (1726), Expect = 2.9e-191
Identity = 333/488 (68.24%), Positives = 401/488 (82.17%), Query Frame = 1

Query: 3   LLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGD 62
           + +R LSR+++V KRSTKKY+EEPLY RLFKDG +E  VR QLN FLK  K VFKWEVGD
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 63  TLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESS 122
           T+KKLR+R LYYPALKLSE M +R MNKTVSDQAIHLDL+AKAR I A E++FV LPE+S
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETS 120

Query: 123 KNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAI 182
           K  L YGSLLNCYCKEL+TEKAE +  KMKELN+  +SM YNSLMTLY+KTG  EKV A+
Sbjct: 121 KTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAM 180

Query: 183 IQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEM-KDGRAVGDWTTYSNLASIYV 242
           IQE+KA  VM D+YTYNVWMRALAA NDISGVERVI+EM +DGR   DWTTYSN+ASIYV
Sbjct: 181 IQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYV 240

Query: 243 DAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANI 302
           DA +  KA  AL+ELE +N +RD +A+QFLITL+G++G L EVYR+WRSLRLA PKT+N+
Sbjct: 241 DAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNV 300

Query: 303 SYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRA 362
           +YLNMIQ L+KL DLPGAE  FKEWQ+ CSTYDIRI N LIGAYA+EGL++KA ELK +A
Sbjct: 301 AYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKA 360

Query: 363 RQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHY 422
            +RG K NAKTWEIFMDYY+K+G+   A +C +KAVS G+ DGGKW+PSPE +R  MS++
Sbjct: 361 PRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYF 420

Query: 423 EQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEAC 482
           EQ+KDV+GAE+ +E +K   D++ +E+FE LIRTY+AAG+    M RRLKME VEV+EA 
Sbjct: 421 EQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEAT 480

Query: 483 KKLLDEIS 490
           KKLLDE+S
Sbjct: 481 KKLLDEVS 488
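
The identity and positives percentages reported for a hit are simply the match counts divided by the alignment length. A minimal sketch that recomputes them from the fractions of the alignment above follows; `percent_from_fraction` is a hypothetical helper, not a BLAST function.

```python
def percent_from_fraction(fraction: str) -> float:
    """Convert a 'matches/aligned' string such as '333/488' to a percentage."""
    matches, aligned = (int(part) for part in fraction.split("/"))
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned, 2)

print(percent_from_fraction("333/488"))  # identical residues -> 68.24
print(percent_from_fraction("401/488"))  # identical or conservative -> 82.17
```

Positives count identical residues plus substitutions that score above zero in the scoring matrix, so the positives percentage is never lower than the identity percentage.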

BLAST of Cp4.1LG09g05200 vs. Swiss-Prot
Match: PPR4_ARATH (Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidopsis thaliana GN=At1g02370 PE=2 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 2.6e-83
Identity = 180/467 (38.54%), Positives = 275/467 (58.89%), Query Frame = 1

Query: 24  EEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETM 83
           +  LY +L     +  +V   LN F+     V K ++    K LR  +    A ++ + M
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 84  AKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKNHLC-YGSLLNCYCKELMTE 143
            KR M  +VSD AI LDL+ K +G+ AAE++F +L  S+KNH   YG+L+NCYC EL  E
Sbjct: 130 EKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEE 189

Query: 144 KAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWM 203
           KA+A  E M ELN    S+P+N++M++Y +   PEKV  ++  MK   +     TY++WM
Sbjct: 190 KAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWM 249

Query: 204 RALAALNDISGVERVIDEM-KDGRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNA 263
           ++  +LND+ G+E++IDEM KD  A   W T+SNLA+IY  A +++KA +ALK +E++  
Sbjct: 250 QSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKMN 309

Query: 264 RRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEK 323
             +  +  FL++L+  +    EVYRVW SL+ A P+  N+SYL M+Q + KL DL G +K
Sbjct: 310 PNNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIKK 369

Query: 324 CFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYL 383
            F EW+S C  YD+R+AN  I  Y K  + E+A ++   A ++   P +K  ++ M + L
Sbjct: 370 IFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHLL 429

Query: 384 KNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSV 443
           +N +  LA      AVS    +  +W  S E++  F  H+E+ KDVDGAE F + +  + 
Sbjct: 430 ENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCK-ILSNW 489

Query: 444 DSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKLLDEI 489
             L+SE    LI+TY+AA + S  M  RL  +++EVSE  + LL  +
Sbjct: 490 KPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIEVSEEIQDLLKTV 535
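
Each HSP header gives a bit score followed by the raw score in parentheses, for example 310.8 bits (795). The two are related by the Karlin-Altschul normalisation S' = (lambda * S - ln K) / ln 2. The sketch below applies it under an assumption: the page does not state the scoring parameters, so the values used are the commonly tabulated gapped BLOSUM62 constants (gap open 11, gap extend 1), lambda = 0.267 and K = 0.041. With those values the reported bit scores are reproduced, but this remains an illustration rather than a statement of how the search was configured.

```python
import math

# Assumed gapped BLOSUM62 statistics (gap open 11, extend 1); not stated on the page.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score: float) -> float:
    """Normalise a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

for raw in (1726, 795):
    print(raw, round(bit_score(raw), 1))  # 1726 -> 669.5, 795 -> 310.8
```

Because the bit score is already normalised, the same alignment gets the same bit score against every database, while its E-value changes with the size of the database searched.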

BLAST of Cp4.1LG09g05200 vs. Swiss-Prot
Match: PP300_ARATH (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 297.7 bits (761), Expect = 2.3e-79
Identity = 177/446 (39.69%), Positives = 260/446 (58.30%), Query Frame = 1

Query: 45  LNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAK 104
           LN F+     V K ++    K LR  +    AL++ E M ++ +  T SD AI L+L+AK
Sbjct: 60  LNQFVMEGVPVKKHDLIRYAKDLRKFRQPQRALEIFEWMERKEIAFTGSDHAIRLNLIAK 119

Query: 105 ARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYN 164
           ++G+ AAE++F SL +S KN   YGSLLNCYC E    KA+A  E M +LN    S+P+N
Sbjct: 120 SKGLEAAETYFNSLDDSIKNQSTYGSLLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFN 179

Query: 165 SLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-D 224
           +LM +Y   G PEKV A++  MK   +     TY++W+++  +L D+ GVE+V+DEMK +
Sbjct: 180 NLMAMYMGLGQPEKVPALVVAMKEKSITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAE 239

Query: 225 GRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLE 284
           G  +  W T++NLA+IY+   ++ KA  ALK LE          + FLI L+  + N  E
Sbjct: 240 GEGIFSWNTFANLAAIYIKVGLYGKAEEALKSLENNMNPDVRDCYHFLINLYTGIANASE 299

Query: 285 VYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIG 344
           VYRVW  L+  +P   N SYL M++ L KL D+ G +K F EW+S C TYD+R+AN  I 
Sbjct: 300 VYRVWDLLKKRYPNVNNSSYLTMLRALSKLDDIDGVKKVFAEWESTCWTYDMRMANVAIS 359

Query: 345 AYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLD 404
           +Y K+ + E+A  +   A ++     +K  ++ M + LKN +    AD A K      LD
Sbjct: 360 SYLKQNMYEEAEAVFNGAMKKCKGQFSKARQLLMMHLLKNDQ----ADLALKHFEAAVLD 419

Query: 405 GGK-WVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRR 464
             K W  S E+I +F  H+E+ KDVDGAE F +T+ K    L SE +  L++TY AAG+ 
Sbjct: 420 QDKNWTWSSELISSFFLHFEEAKDVDGAEEFCKTLTK-WSPLSSETYTLLMKTYLAAGKA 479

Query: 465 SCMMSRRLKMEKVEVSEACKKLLDEI 489
              M +RL+ + + V E  + LL +I
Sbjct: 480 CPDMKKRLEEQGILVDEEQECLLSKI 500

BLAST of Cp4.1LG09g05200 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 249.6 bits (636), Expect = 7.1e-65
Identity = 142/412 (34.47%), Positives = 228/412 (55.34%), Query Frame = 1

Query: 45  LNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMAKRS--MNKTVSDQAIHLDLL 104
           LN + K+ +++ KWE+   +K+LR  K    AL++ + M  R      + SD AI LDL+
Sbjct: 87  LNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGERFRLSASDAAIQLDLI 146

Query: 105 AKARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMP 164
            K RGI  AE FF+ LPE+ K+   YGSLLN Y +    EKAEA+   M++    +  +P
Sbjct: 147 GKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEALLNTMRDKGYALHPLP 206

Query: 165 YNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK 224
           +N +MTLY      +KV A++ EMK   +  D Y+YN+W+ +  +L  +  +E V  +MK
Sbjct: 207 FNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMK 266

Query: 225 DGRAV-GDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNL 284
              ++  +WTT+S +A++Y+     +KA +AL+++E R   R+   + +L++L+G +GN 
Sbjct: 267 SDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSLGNK 326

Query: 285 LEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANAL 344
            E+YRVW   +   P   N+ Y  ++ +L+++ D+ GAEK ++EW    S+YD RI N L
Sbjct: 327 KELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLL 386

Query: 345 IGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGR 404
           + AY K   LE A  L     + G KP++ TWEI    + +      A  C   A S   
Sbjct: 387 MNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKRCISEALTCLRNAFSAE- 446

Query: 405 LDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLI 454
                W P   ++  F    E+E DV   E+ +E +++S D LE + + +LI
Sbjct: 447 -GSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD-LEDKSYLALI 495

BLAST of Cp4.1LG09g05200 vs. Swiss-Prot
Match: PP302_ARATH (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 1.6e-64
Identity = 139/450 (30.89%), Positives = 248/450 (55.11%), Query Frame = 1

Query: 37  SEKSVRLQLNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMA-KRSMNKTVSDQ 96
           +++S  + +  + +    V K+E+   +++LR  K Y  AL++ E M  +  +     D 
Sbjct: 73  TKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDY 132

Query: 97  AIHLDLLAKARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELN 156
           A+HLDL++K RG+ +AE FF  +P+  + H    SLL+ Y +  +++KAEA+ EKM E  
Sbjct: 133 AVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECG 192

Query: 157 LPVTSMPYNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVE 216
              + +PYN ++++Y   G  EKV  +I+E+K  +   D  TYN+W+ A A+ ND+ G E
Sbjct: 193 FLKSCLPYNHMLSMYISRGQFEKVPVLIKELKI-RTSPDIVTYNLWLTAFASGNDVEGAE 252

Query: 217 RVIDEMKDGRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLH 276
           +V  + K+ +   DW TYS L ++Y      +KA  ALKE+EK  ++++  A+  LI+LH
Sbjct: 253 KVYLKAKEEKLNPDWVTYSVLTNLYAKTDNVEKARLALKEMEKLVSKKNRVAYASLISLH 312

Query: 277 GQMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDI 336
             +G+   V   W+ ++ +F K  +  YL+MI  ++KL +   A+  + EW+S   T D 
Sbjct: 313 ANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDA 372

Query: 337 RIANALIGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAK 396
           RI N ++  Y     +    +   R  ++G  P+  TWEI    YLK  + +   DC  K
Sbjct: 373 RIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGK 432

Query: 397 AVSKGRLDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLIRT 456
           A+   +    KW  +  +++      E++ +V GAE  +  ++K+   + ++++ SL+RT
Sbjct: 433 AIDSVK----KWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKA-GYVNTQLYNSLLRT 492

Query: 457 YSAAGRRSCMMSRRLKMEKVEVSEACKKLL 486
           Y+ AG  + ++  R+  + VE+ E  K+L+
Sbjct: 493 YAKAGEMALIVEERMAKDNVELDEETKELI 516

BLAST of Cp4.1LG09g05200 vs. TrEMBL
Match: A0A0A0KS91_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G599800 PE=4 SV=1)

HSP 1 Score: 766.5 bits (1978), Expect = 1.9e-218
Identity = 389/487 (79.88%), Positives = 431/487 (88.50%), Query Frame = 1

Query: 5   LRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTL 64
           L++   +K++AKRS +KYLEE LY+RLFKDG SEKSVRLQLN F+KS KRVFKWEVGDTL
Sbjct: 5   LQKFRPSKDLAKRSAEKYLEEALYIRLFKDGGSEKSVRLQLNKFIKSHKRVFKWEVGDTL 64

Query: 65  KKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKN 124
           +KLRDRKLYYPALKLSE MAKR MNKTVSDQAIHLDL+AKARGI AAE++FVSLPESSKN
Sbjct: 65  RKLRDRKLYYPALKLSEIMAKRGMNKTVSDQAIHLDLVAKARGIDAAENYFVSLPESSKN 124

Query: 125 HLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQ 184
           HL Y SLLNCYCKEL+TEKAEA+ EK+KELNLPVT +PYNSLMTLYSK G P+KV  IIQ
Sbjct: 125 HLSYSSLLNCYCKELLTEKAEALFEKIKELNLPVTPVPYNSLMTLYSKIGRPDKVCTIIQ 184

Query: 185 EMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMKDGRAVGDWTTYSNLASIYVDAH 244
           EMKAA V FD YTY VWMRALAALNDISGVERVIDEMK     GDWTTYSNLASIYV+A+
Sbjct: 185 EMKAANVTFDPYTYIVWMRALAALNDISGVERVIDEMKRDGVKGDWTTYSNLASIYVNAN 244

Query: 245 MFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYL 304
           MF+KA  ALK+LEK N RRDL  FQFLITL+GQ+G+L EVYRVWRSLRLAFP+TANISYL
Sbjct: 245 MFEKAAKALKDLEKINTRRDLIGFQFLITLYGQIGDLTEVYRVWRSLRLAFPRTANISYL 304

Query: 305 NMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQR 364
           NMIQTL KLKDLPGAEKCFKEW+SG  TYDIRI NALIGAY K GLLEKA+ LK RA +R
Sbjct: 305 NMIQTLTKLKDLPGAEKCFKEWESGSPTYDIRIPNALIGAYTKGGLLEKAMALKERALRR 364

Query: 365 GAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQE 424
           GA+PNAKTWE F++YYLKNG+FKLA DC AKA+ KG  D GKW+PSPE+I++FMSH+EQE
Sbjct: 365 GARPNAKTWEFFLNYYLKNGDFKLAGDCVAKAIGKG--DRGKWIPSPEIIKSFMSHFEQE 424

Query: 425 KDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKL 484
           KDVDGAESF+E VKK+VDSLESEVFESLIRTYSAAGR S  MSRRLKME VEVSEACKKL
Sbjct: 425 KDVDGAESFLEIVKKTVDSLESEVFESLIRTYSAAGRTSSSMSRRLKMENVEVSEACKKL 484

Query: 485 LDEISIE 492
           L++ISIE
Sbjct: 485 LNKISIE 489

BLAST of Cp4.1LG09g05200 vs. TrEMBL
Match: A0A067H3N9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011226mg PE=4 SV=1)

HSP 1 Score: 743.0 bits (1917), Expect = 2.3e-211
Identity = 376/487 (77.21%), Positives = 420/487 (86.24%), Query Frame = 1

Query: 6   RRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTLK 65
           +R  RTKN+AKRS KK+LEE LY RLFK GSS+ SVR QLN FLKS+KRVFKWEVGDTLK
Sbjct: 5   QRFGRTKNIAKRS-KKHLEEALYDRLFKKGSSDVSVRQQLNQFLKSKKRVFKWEVGDTLK 64

Query: 66  KLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKNH 125
           KLRDRKLYYPALKLSE M KR MNKTVSDQAIHLDL+AK +GI AAE++FV LPE+SKNH
Sbjct: 65  KLRDRKLYYPALKLSENMEKRGMNKTVSDQAIHLDLVAKVQGIDAAENYFVDLPETSKNH 124

Query: 126 LCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQE 185
           L YGSLLNCYCKELMTEKAEA+ EKMKELNL  +SMP+NSLMTLY+KTGHPEK+ AIIQE
Sbjct: 125 LTYGSLLNCYCKELMTEKAEALLEKMKELNLGFSSMPFNSLMTLYAKTGHPEKIPAIIQE 184

Query: 186 MKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYVDAH 245
           MKA+ +M D+YTYNVWMRALAA+NDISG ERVI+EMK DGR   DWTT+SNLASIYV+A 
Sbjct: 185 MKASSIMPDSYTYNVWMRALAAVNDISGAERVIEEMKRDGRVAADWTTFSNLASIYVEAG 244

Query: 246 MFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYL 305
           +F+KA  ALKELE RNA RDLSA+QFLITL+GQ GNL EVYR+WRSLRLAFP TANISYL
Sbjct: 245 LFEKAERALKELENRNAHRDLSAYQFLITLYGQTGNLSEVYRIWRSLRLAFPNTANISYL 304

Query: 306 NMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQR 365
           NMIQ L+ LKDLPGAEKCFKEW+SGC+TYDIR+ N +IGAYAKEG LE A ELK RAR+R
Sbjct: 305 NMIQVLVNLKDLPGAEKCFKEWESGCATYDIRVTNVMIGAYAKEGRLENAEELKERARRR 364

Query: 366 GAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQE 425
           GA PNAKTWEIF DYYL+NG+ KLA DC  KA+  GR DGGKWVPS E IRTFM H+EQE
Sbjct: 365 GADPNAKTWEIFSDYYLRNGDMKLAVDCLEKAIDTGRGDGGKWVPSSETIRTFMRHFEQE 424

Query: 426 KDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKL 485
           KDVDGAE F+E +KK+VD L  EVFE LIRTY+AAGR S +M RRLKMEKVEVSEA KKL
Sbjct: 425 KDVDGAEGFLEILKKAVDDLGVEVFEPLIRTYAAAGRTSPVMLRRLKMEKVEVSEASKKL 484

Query: 486 LDEISIE 492
           L+ I +E
Sbjct: 485 LEAICVE 490

BLAST of Cp4.1LG09g05200 vs. TrEMBL
Match: V4SUJ3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031342mg PE=4 SV=1)

HSP 1 Score: 741.5 bits (1913), Expect = 6.6e-211
Identity = 375/487 (77.00%), Positives = 419/487 (86.04%), Query Frame = 1

Query: 6   RRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTLK 65
           +R  RTKN+AKRS KK+LEE LY RLFK G S+ SVR QLN FLKS+KRVFKWEVGDTLK
Sbjct: 5   QRFGRTKNIAKRS-KKHLEEALYDRLFKKGGSDVSVRQQLNQFLKSKKRVFKWEVGDTLK 64

Query: 66  KLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKNH 125
           KLRDRKLYYPALKLSE M KR MNKTVSDQAIHLDL+AK +GI AAE++FV LPE+SKNH
Sbjct: 65  KLRDRKLYYPALKLSENMEKRGMNKTVSDQAIHLDLVAKVQGIDAAENYFVDLPETSKNH 124

Query: 126 LCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQE 185
           L YGSLLNCYCKELMTEKAEA+ EKMKELNL  +SMP+NSLMTLY+KTGHPEK+ AIIQE
Sbjct: 125 LTYGSLLNCYCKELMTEKAEALLEKMKELNLGFSSMPFNSLMTLYAKTGHPEKIPAIIQE 184

Query: 186 MKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYVDAH 245
           MKA+ +M D+YTYNVWMRALAA+NDISG ERVI+EMK DGR   DWTT+SNLASIYV+A 
Sbjct: 185 MKASSIMPDSYTYNVWMRALAAVNDISGAERVIEEMKRDGRVAADWTTFSNLASIYVEAG 244

Query: 246 MFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYL 305
           +F+KA  ALKELE RNA RDLSA+QFLITL+GQ GNL EVYR+WRSLRLAFPKTANISYL
Sbjct: 245 LFEKAERALKELENRNAHRDLSAYQFLITLYGQTGNLSEVYRIWRSLRLAFPKTANISYL 304

Query: 306 NMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQR 365
           NMIQ L+ LKDLPGAEKCFKEW+SGC+TYDIR+ N +IGAYAKE  LE A ELK RAR+R
Sbjct: 305 NMIQVLVNLKDLPGAEKCFKEWESGCATYDIRVTNVMIGAYAKESRLENAEELKERARRR 364

Query: 366 GAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQE 425
           GA PNAKTWEIF DYYL+NG+ KLA DC  KA+  GR DGGKWVPS E IRTFM H+EQE
Sbjct: 365 GANPNAKTWEIFSDYYLRNGDMKLAVDCLEKAIDTGRGDGGKWVPSSETIRTFMRHFEQE 424

Query: 426 KDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKL 485
           KDVDGAE F+E +KK+VD L  EVFE LIRTY+AAGR S +M RRLKMEKVEVSEA KKL
Sbjct: 425 KDVDGAEGFLEILKKAVDDLGVEVFEPLIRTYAAAGRTSPVMLRRLKMEKVEVSEASKKL 484

Query: 486 LDEISIE 492
           L+ I +E
Sbjct: 485 LEAICVE 490

BLAST of Cp4.1LG09g05200 vs. TrEMBL
Match: F6H851_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0394g00020 PE=4 SV=1)

HSP 1 Score: 735.3 bits (1897), Expect = 4.7e-209
Identity = 368/490 (75.10%), Positives = 426/490 (86.94%), Query Frame = 1

Query: 3   LLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGD 62
           + + +LSRTKN+AKRS KKYLEE LY RLFKDGSSE SVR QLN FLKS KRVFKWEVGD
Sbjct: 1   MAMPQLSRTKNIAKRS-KKYLEEALYDRLFKDGSSEVSVRQQLNHFLKSSKRVFKWEVGD 60

Query: 63  TLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESS 122
           T+KKLRDRK +YPALKLSETMAKR MN T+SDQAI+LDL+ K RG+AAAE++F+ LPE+S
Sbjct: 61  TVKKLRDRKRFYPALKLSETMAKRGMNMTISDQAIYLDLITKTRGVAAAENYFIDLPETS 120

Query: 123 KNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAI 182
           KNHL YG+LLNCYCKEL+TEKAEA+ E+MKEL L ++SMPYNSLMTLY+K G PEK+  I
Sbjct: 121 KNHLTYGALLNCYCKELLTEKAEALMERMKELKLGLSSMPYNSLMTLYTKIGQPEKIPTI 180

Query: 183 IQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYV 242
           IQE+K+  +M D+YTYN+WMRALAA+NDISGVERVI+EMK DGR   DWTTYSNLASIYV
Sbjct: 181 IQELKSLDIMPDSYTYNIWMRALAAVNDISGVERVIEEMKRDGRVASDWTTYSNLASIYV 240

Query: 243 DAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANI 302
           DA +F+KA  ALKELEKRNA RDL+AFQFLITL+G++GNLLEVYRVWRSLRLAFPKTAN+
Sbjct: 241 DAGVFEKAEKALKELEKRNACRDLTAFQFLITLYGRIGNLLEVYRVWRSLRLAFPKTANV 300

Query: 303 SYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRA 362
           SYLNMIQ L+ LKDLPGAEKCF+EW+SGCS YDIR+ANALIGAYAK+GLLEKA ELK  A
Sbjct: 301 SYLNMIQVLVNLKDLPGAEKCFREWESGCSIYDIRVANALIGAYAKDGLLEKAEELKEHA 360

Query: 363 RQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHY 422
           R+RGAKPNAKTWEIF+ Y+LKN E K A DC A A+S GR DG KWVPSPE+I  FM H+
Sbjct: 361 RRRGAKPNAKTWEIFLAYHLKNREMKQAVDCVANAISTGRGDGQKWVPSPEIIGVFMQHF 420

Query: 423 EQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEAC 482
           EQEKDVDGAE F+E +K +V+ L  EVFESLIR Y+AAGR S +M RRLKME VEVS++C
Sbjct: 421 EQEKDVDGAEGFLEILKSTVEDLGVEVFESLIRIYAAAGRTSPVMRRRLKMENVEVSDSC 480

Query: 483 KKLLDEISIE 492
           KKLL+E+S+E
Sbjct: 481 KKLLEEVSVE 489

BLAST of Cp4.1LG09g05200 vs. TrEMBL
Match: A0A061E7F4_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TCM_010463 PE=4 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 8.4e-206
Identity = 371/490 (75.71%), Positives = 421/490 (85.92%), Query Frame = 1

Query: 4   LLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDT 63
           L+++L RTKNV +RS KKYLEE LY RLFKDGSSE SVR QLN FLKS KRV+KWEV DT
Sbjct: 3   LMQQLGRTKNVTRRS-KKYLEEALYHRLFKDGSSEISVRQQLNQFLKSSKRVYKWEVDDT 62

Query: 64  LKKLRDRKLYYPALKLSETMA-KRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESS 123
           LKKLR RKLYYPALKLSETM  KR MNKTVSDQAIHLDL+AKA+GI AAE++F+ LPE+ 
Sbjct: 63  LKKLRHRKLYYPALKLSETMVTKRGMNKTVSDQAIHLDLVAKAQGIPAAENYFIDLPEAL 122

Query: 124 KNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAI 183
           KNHL YG+LLNCYCKELMTEKAEA+ EKMKE NLP+ SM YNSLMTLY+K G PE+V  +
Sbjct: 123 KNHLTYGALLNCYCKELMTEKAEALMEKMKEHNLPLGSMSYNSLMTLYTKIGQPERVPDV 182

Query: 184 IQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYV 243
           IQEMK+  +M D+YTYNVWMRALAA+NDISG ERVIDEMK D     DWTTYSN+AS+YV
Sbjct: 183 IQEMKSCGIMPDSYTYNVWMRALAAMNDISGFERVIDEMKRDAEDDDDWTTYSNIASVYV 242

Query: 244 DAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANI 303
           DA +F KA  ALKELEKRN+RRDLSAF FLITL+G++GNLLEVYR+WRSLRL+F KTAN+
Sbjct: 243 DAGLFKKAEEALKELEKRNSRRDLSAFHFLITLYGKVGNLLEVYRIWRSLRLSFHKTANV 302

Query: 304 SYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRA 363
           S+LNMIQ L+ LKDLPGAEKCF+EW+SGCSTYDIRIANALIGAYAKEGLLEKA ELK RA
Sbjct: 303 SFLNMIQVLVNLKDLPGAEKCFREWESGCSTYDIRIANALIGAYAKEGLLEKAQELKERA 362

Query: 364 RQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHY 423
           R+RG KPNAKTWEIF+DYYLKNG+ KLA DC A A+S GR DGGKWVPS + I T M H+
Sbjct: 363 RKRGVKPNAKTWEIFLDYYLKNGDIKLAVDCVANAISTGRGDGGKWVPSSKTIGTVMWHF 422

Query: 424 EQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEAC 483
           EQEKDVDGAE F+E +KK+VD +  EVFESLIRTY+AAGR S +M  RLKMEKVEVSEA 
Sbjct: 423 EQEKDVDGAEGFLEILKKAVDHVGEEVFESLIRTYAAAGRTSPVMHHRLKMEKVEVSEAS 482

Query: 484 KKLLDEISIE 492
           KKL++ IS+E
Sbjct: 483 KKLVEVISVE 491

BLAST of Cp4.1LG09g05200 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 669.5 bits (1726), Expect = 1.6e-192
Identity = 333/488 (68.24%), Positives = 401/488 (82.17%), Query Frame = 1

Query: 3   LLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGD 62
           + +R LSR+++V KRSTKKY+EEPLY RLFKDG +E  VR QLN FLK  K VFKWEVGD
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 63  TLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESS 122
           T+KKLR+R LYYPALKLSE M +R MNKTVSDQAIHLDL+AKAR I A E++FV LPE+S
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETS 120

Query: 123 KNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAI 182
           K  L YGSLLNCYCKEL+TEKAE +  KMKELN+  +SM YNSLMTLY+KTG  EKV A+
Sbjct: 121 KTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAM 180

Query: 183 IQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEM-KDGRAVGDWTTYSNLASIYV 242
           IQE+KA  VM D+YTYNVWMRALAA NDISGVERVI+EM +DGR   DWTTYSN+ASIYV
Sbjct: 181 IQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYV 240

Query: 243 DAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANI 302
           DA +  KA  AL+ELE +N +RD +A+QFLITL+G++G L EVYR+WRSLRLA PKT+N+
Sbjct: 241 DAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNV 300

Query: 303 SYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRA 362
           +YLNMIQ L+KL DLPGAE  FKEWQ+ CSTYDIRI N LIGAYA+EGL++KA ELK +A
Sbjct: 301 AYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKA 360

Query: 363 RQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHY 422
            +RG K NAKTWEIFMDYY+K+G+   A +C +KAVS G+ DGGKW+PSPE +R  MS++
Sbjct: 361 PRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYF 420

Query: 423 EQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEAC 482
           EQ+KDV+GAE+ +E +K   D++ +E+FE LIRTY+AAG+    M RRLKME VEV+EA 
Sbjct: 421 EQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEAT 480

Query: 483 KKLLDEIS 490
           KKLLDE+S
Sbjct: 481 KKLLDEVS 488

BLAST of Cp4.1LG09g05200 vs. TAIR10
Match: AT1G02370.1 (AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 310.8 bits (795), Expect = 1.5e-84
Identity = 180/467 (38.54%), Positives = 275/467 (58.89%), Query Frame = 1

Query: 24  EEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETM 83
           +  LY +L     +  +V   LN F+     V K ++    K LR  +    A ++ + M
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 84  AKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKNHLC-YGSLLNCYCKELMTE 143
            KR M  +VSD AI LDL+ K +G+ AAE++F +L  S+KNH   YG+L+NCYC EL  E
Sbjct: 130 EKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEE 189

Query: 144 KAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWM 203
           KA+A  E M ELN    S+P+N++M++Y +   PEKV  ++  MK   +     TY++WM
Sbjct: 190 KAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWM 249

Query: 204 RALAALNDISGVERVIDEM-KDGRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNA 263
           ++  +LND+ G+E++IDEM KD  A   W T+SNLA+IY  A +++KA +ALK +E++  
Sbjct: 250 QSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKMN 309

Query: 264 RRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEK 323
             +  +  FL++L+  +    EVYRVW SL+ A P+  N+SYL M+Q + KL DL G +K
Sbjct: 310 PNNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIKK 369

Query: 324 CFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYL 383
            F EW+S C  YD+R+AN  I  Y K  + E+A ++   A ++   P +K  ++ M + L
Sbjct: 370 IFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHLL 429

Query: 384 KNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSV 443
           +N +  LA      AVS    +  +W  S E++  F  H+E+ KDVDGAE F + +  + 
Sbjct: 430 ENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCK-ILSNW 489

Query: 444 DSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKLLDEI 489
             L+SE    LI+TY+AA + S  M  RL  +++EVSE  + LL  +
Sbjct: 490 KPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIEVSEEIQDLLKTV 535

BLAST of Cp4.1LG09g05200 vs. TAIR10
Match: AT4G01990.1 (AT4G01990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 297.7 bits (761), Expect = 1.3e-80
Identity = 177/446 (39.69%), Positives = 260/446 (58.30%), Query Frame = 1

Query: 45  LNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAK 104
           LN F+     V K ++    K LR  +    AL++ E M ++ +  T SD AI L+L+AK
Sbjct: 60  LNQFVMEGVPVKKHDLIRYAKDLRKFRQPQRALEIFEWMERKEIAFTGSDHAIRLNLIAK 119

Query: 105 ARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYN 164
           ++G+ AAE++F SL +S KN   YGSLLNCYC E    KA+A  E M +LN    S+P+N
Sbjct: 120 SKGLEAAETYFNSLDDSIKNQSTYGSLLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFN 179

Query: 165 SLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-D 224
           +LM +Y   G PEKV A++  MK   +     TY++W+++  +L D+ GVE+V+DEMK +
Sbjct: 180 NLMAMYMGLGQPEKVPALVVAMKEKSITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAE 239

Query: 225 GRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLE 284
           G  +  W T++NLA+IY+   ++ KA  ALK LE          + FLI L+  + N  E
Sbjct: 240 GEGIFSWNTFANLAAIYIKVGLYGKAEEALKSLENNMNPDVRDCYHFLINLYTGIANASE 299

Query: 285 VYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIG 344
           VYRVW  L+  +P   N SYL M++ L KL D+ G +K F EW+S C TYD+R+AN  I 
Sbjct: 300 VYRVWDLLKKRYPNVNNSSYLTMLRALSKLDDIDGVKKVFAEWESTCWTYDMRMANVAIS 359

Query: 345 AYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLD 404
           +Y K+ + E+A  +   A ++     +K  ++ M + LKN +    AD A K      LD
Sbjct: 360 SYLKQNMYEEAEAVFNGAMKKCKGQFSKARQLLMMHLLKNDQ----ADLALKHFEAAVLD 419

Query: 405 GGK-WVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRR 464
             K W  S E+I +F  H+E+ KDVDGAE F +T+ K    L SE +  L++TY AAG+ 
Sbjct: 420 QDKNWTWSSELISSFFLHFEEAKDVDGAEEFCKTLTK-WSPLSSETYTLLMKTYLAAGKA 479

Query: 465 SCMMSRRLKMEKVEVSEACKKLLDEI 489
              M +RL+ + + V E  + LL +I
Sbjct: 480 CPDMKKRLEEQGILVDEEQECLLSKI 500

BLAST of Cp4.1LG09g05200 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 249.6 bits (636), Expect = 4.0e-66
Identity = 142/412 (34.47%), Positives = 228/412 (55.34%), Query Frame = 1

Query: 45  LNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMAKRS--MNKTVSDQAIHLDLL 104
           LN + K+ +++ KWE+   +K+LR  K    AL++ + M  R      + SD AI LDL+
Sbjct: 87  LNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGERFRLSASDAAIQLDLI 146

Query: 105 AKARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMP 164
            K RGI  AE FF+ LPE+ K+   YGSLLN Y +    EKAEA+   M++    +  +P
Sbjct: 147 GKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEALLNTMRDKGYALHPLP 206

Query: 165 YNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK 224
           +N +MTLY      +KV A++ EMK   +  D Y+YN+W+ +  +L  +  +E V  +MK
Sbjct: 207 FNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMK 266

Query: 225 DGRAV-GDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNL 284
              ++  +WTT+S +A++Y+     +KA +AL+++E R   R+   + +L++L+G +GN 
Sbjct: 267 SDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSLGNK 326

Query: 285 LEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANAL 344
            E+YRVW   +   P   N+ Y  ++ +L+++ D+ GAEK ++EW    S+YD RI N L
Sbjct: 327 KELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLL 386

Query: 345 IGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGR 404
           + AY K   LE A  L     + G KP++ TWEI    + +      A  C   A S   
Sbjct: 387 MNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKRCISEALTCLRNAFSAE- 446

Query: 405 LDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLI 454
                W P   ++  F    E+E DV   E+ +E +++S D LE + + +LI
Sbjct: 447 -GSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD-LEDKSYLALI 495

BLAST of Cp4.1LG09g05200 vs. TAIR10
Match: AT4G02820.1 (AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 248.4 bits (633), Expect = 8.9e-66
Identity = 139/450 (30.89%), Positives = 248/450 (55.11%), Query Frame = 1

Query: 37  SEKSVRLQLNVFLKSRKRVFKWEVGDTLKKLRDRKLYYPALKLSETMA-KRSMNKTVSDQ 96
           +++S  + +  + +    V K+E+   +++LR  K Y  AL++ E M  +  +     D 
Sbjct: 73  TKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDY 132

Query: 97  AIHLDLLAKARGIAAAESFFVSLPESSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELN 156
           A+HLDL++K RG+ +AE FF  +P+  + H    SLL+ Y +  +++KAEA+ EKM E  
Sbjct: 133 AVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECG 192

Query: 157 LPVTSMPYNSLMTLYSKTGHPEKVSAIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVE 216
              + +PYN ++++Y   G  EKV  +I+E+K  +   D  TYN+W+ A A+ ND+ G E
Sbjct: 193 FLKSCLPYNHMLSMYISRGQFEKVPVLIKELKI-RTSPDIVTYNLWLTAFASGNDVEGAE 252

Query: 217 RVIDEMKDGRAVGDWTTYSNLASIYVDAHMFDKAGNALKELEKRNARRDLSAFQFLITLH 276
           +V  + K+ +   DW TYS L ++Y      +KA  ALKE+EK  ++++  A+  LI+LH
Sbjct: 253 KVYLKAKEEKLNPDWVTYSVLTNLYAKTDNVEKARLALKEMEKLVSKKNRVAYASLISLH 312

Query: 277 GQMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDI 336
             +G+   V   W+ ++ +F K  +  YL+MI  ++KL +   A+  + EW+S   T D 
Sbjct: 313 ANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDA 372

Query: 337 RIANALIGAYAKEGLLEKAIELKVRARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAK 396
           RI N ++  Y     +    +   R  ++G  P+  TWEI    YLK  + +   DC  K
Sbjct: 373 RIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGK 432

Query: 397 AVSKGRLDGGKWVPSPEVIRTFMSHYEQEKDVDGAESFVETVKKSVDSLESEVFESLIRT 456
           A+   +    KW  +  +++      E++ +V GAE  +  ++K+   + ++++ SL+RT
Sbjct: 433 AIDSVK----KWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKA-GYVNTQLYNSLLRT 492

Query: 457 YSAAGRRSCMMSRRLKMEKVEVSEACKKLL 486
           Y+ AG  + ++  R+  + VE+ E  K+L+
Sbjct: 493 YAKAGEMALIVEERMAKDNVELDEETKELI 516

BLAST of Cp4.1LG09g05200 vs. NCBI nr
Match: gi|659090741|ref|XP_008446176.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g60770 [Cucumis melo])

HSP 1 Score: 793.5 bits (2048), Expect = 2.1e-226
Identity = 400/491 (81.47%), Positives = 445/491 (90.63%), Query Frame = 1

Query: 1   MVLLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEV 60
           M L LR+   +K++AKRST+KYLEE LY+RLFKDG SEKSVRLQLN F+KSRKRVFKWEV
Sbjct: 1   MALPLRKFKPSKDLAKRSTEKYLEEALYIRLFKDGGSEKSVRLQLNKFIKSRKRVFKWEV 60

Query: 61  GDTLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPE 120
           GDTLKKLRDRKLYYPALKLSETMAKR MNKTVSDQAIHLDL+AKARGIAAAE++FVSLPE
Sbjct: 61  GDTLKKLRDRKLYYPALKLSETMAKRGMNKTVSDQAIHLDLVAKARGIAAAENYFVSLPE 120

Query: 121 SSKNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVS 180
           SSKNHL Y SLLNCYCKEL+TEKAE++ EKMKELNLP+TSMP N LMTLY+K G P+KV 
Sbjct: 121 SSKNHLSYSSLLNCYCKELLTEKAESLFEKMKELNLPLTSMPCNCLMTLYTKIGQPDKVP 180

Query: 181 AIIQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMKDGRAVGDWTTYSNLASIY 240
           +IIQEMKAA V FD+YTY VWMRALAALNDISGVERVIDEMK     GDWTTYSNLASIY
Sbjct: 181 SIIQEMKAANVTFDSYTYVVWMRALAALNDISGVERVIDEMKRDGVKGDWTTYSNLASIY 240

Query: 241 VDAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTAN 300
           V+A+MF+KA  AL +LEK N  RDL AFQFLITL+GQ+G+L++VY VWRSLRLAFP+TAN
Sbjct: 241 VNANMFEKAAKALMDLEKINTSRDLFAFQFLITLYGQIGDLIKVYSVWRSLRLAFPRTAN 300

Query: 301 ISYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVR 360
           ISYLNMIQTL+KLKDLPGAEKCFKEW+SGCSTYDIRIANALIGAY KEGLLEKA+ LK R
Sbjct: 301 ISYLNMIQTLLKLKDLPGAEKCFKEWESGCSTYDIRIANALIGAYTKEGLLEKAMALKER 360

Query: 361 ARQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSH 420
           A +RGA+PNAKTWEIF+DYYLKNG FKLA DC AKAVS+G+ DGGKW+PSPE+I++FMSH
Sbjct: 361 ALKRGARPNAKTWEIFLDYYLKNGNFKLAGDCVAKAVSRGKGDGGKWMPSPEIIKSFMSH 420

Query: 421 YEQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEA 480
           +EQEKDVDGAESF+E VKK+VDSLESEVFESLIRTYSAAGR S  M+RRLKME VEVSEA
Sbjct: 421 FEQEKDVDGAESFLEIVKKTVDSLESEVFESLIRTYSAAGRTSSSMNRRLKMENVEVSEA 480

Query: 481 CKKLLDEISIE 492
           CKKLL+EISIE
Sbjct: 481 CKKLLNEISIE 491

BLAST of Cp4.1LG09g05200 vs. NCBI nr
Match: gi|449434959|ref|XP_004135263.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g60770 [Cucumis sativus])

HSP 1 Score: 766.5 bits (1978), Expect = 2.8e-218
Identity = 389/487 (79.88%), Positives = 431/487 (88.50%), Query Frame = 1

Query: 5   LRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTL 64
           L++   +K++AKRS +KYLEE LY+RLFKDG SEKSVRLQLN F+KS KRVFKWEVGDTL
Sbjct: 5   LQKFRPSKDLAKRSAEKYLEEALYIRLFKDGGSEKSVRLQLNKFIKSHKRVFKWEVGDTL 64

Query: 65  KKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKN 124
           +KLRDRKLYYPALKLSE MAKR MNKTVSDQAIHLDL+AKARGI AAE++FVSLPESSKN
Sbjct: 65  RKLRDRKLYYPALKLSEIMAKRGMNKTVSDQAIHLDLVAKARGIDAAENYFVSLPESSKN 124

Query: 125 HLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQ 184
           HL Y SLLNCYCKEL+TEKAEA+ EK+KELNLPVT +PYNSLMTLYSK G P+KV  IIQ
Sbjct: 125 HLSYSSLLNCYCKELLTEKAEALFEKIKELNLPVTPVPYNSLMTLYSKIGRPDKVCTIIQ 184

Query: 185 EMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMKDGRAVGDWTTYSNLASIYVDAH 244
           EMKAA V FD YTY VWMRALAALNDISGVERVIDEMK     GDWTTYSNLASIYV+A+
Sbjct: 185 EMKAANVTFDPYTYIVWMRALAALNDISGVERVIDEMKRDGVKGDWTTYSNLASIYVNAN 244

Query: 245 MFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYL 304
           MF+KA  ALK+LEK N RRDL  FQFLITL+GQ+G+L EVYRVWRSLRLAFP+TANISYL
Sbjct: 245 MFEKAAKALKDLEKINTRRDLIGFQFLITLYGQIGDLTEVYRVWRSLRLAFPRTANISYL 304

Query: 305 NMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQR 364
           NMIQTL KLKDLPGAEKCFKEW+SG  TYDIRI NALIGAY K GLLEKA+ LK RA +R
Sbjct: 305 NMIQTLTKLKDLPGAEKCFKEWESGSPTYDIRIPNALIGAYTKGGLLEKAMALKERALRR 364

Query: 365 GAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQE 424
           GA+PNAKTWE F++YYLKNG+FKLA DC AKA+ KG  D GKW+PSPE+I++FMSH+EQE
Sbjct: 365 GARPNAKTWEFFLNYYLKNGDFKLAGDCVAKAIGKG--DRGKWIPSPEIIKSFMSHFEQE 424

Query: 425 KDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKL 484
           KDVDGAESF+E VKK+VDSLESEVFESLIRTYSAAGR S  MSRRLKME VEVSEACKKL
Sbjct: 425 KDVDGAESFLEIVKKTVDSLESEVFESLIRTYSAAGRTSSSMSRRLKMENVEVSEACKKL 484

Query: 485 LDEISIE 492
           L++ISIE
Sbjct: 485 LNKISIE 489

BLAST of Cp4.1LG09g05200 vs. NCBI nr
Match: gi|1009153386|ref|XP_015894607.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g60770-like [Ziziphus jujuba])

HSP 1 Score: 745.3 bits (1923), Expect = 6.6e-212
Identity = 372/490 (75.92%), Positives = 426/490 (86.94%), Query Frame = 1

Query: 3   LLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGD 62
           + L++  R+K+V KRS KKYLEE LY RLFKDGSSE +VR QLN F+KSRKRV+KWEVGD
Sbjct: 1   MALQQFGRSKSVTKRS-KKYLEEALYKRLFKDGSSEVTVRRQLNQFIKSRKRVYKWEVGD 60

Query: 63  TLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESS 122
           TLKKLRDRKLYYPALKLSETMAKR MNKTVSDQAIHLDL+AKARGI AAE++F+ LPES 
Sbjct: 61  TLKKLRDRKLYYPALKLSETMAKRGMNKTVSDQAIHLDLIAKARGIPAAENYFIGLPESL 120

Query: 123 KNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAI 182
           KNHLCYG+LLNCYCKELMTE+AEA+ EKMKELNLP+ SMPYNS+MTLYSKTG  EK+ AI
Sbjct: 121 KNHLCYGALLNCYCKELMTEEAEALMEKMKELNLPLISMPYNSIMTLYSKTGQSEKIPAI 180

Query: 183 IQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYV 242
           IQEMKA+ +M D+YTYNVWMRALAA+N+ISGVER+IDEMK DGR   DWTTYSNLASIYV
Sbjct: 181 IQEMKASNIMLDSYTYNVWMRALAAVNNISGVERIIDEMKRDGRVTRDWTTYSNLASIYV 240

Query: 243 DAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANI 302
           DA MF+KA  ALKELE RN+ RDLSAFQFLITL+G+ GNLLEVYR+WRSLRLAFPKTANI
Sbjct: 241 DAGMFEKAETALKELENRNSCRDLSAFQFLITLYGRTGNLLEVYRIWRSLRLAFPKTANI 300

Query: 303 SYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRA 362
           SYLNM+Q L+ LKDLPGAEKCF+EW+S CS YDIR+AN LIGAY +E LLEKA ELK RA
Sbjct: 301 SYLNMMQVLVNLKDLPGAEKCFREWESQCSIYDIRVANVLIGAYVRESLLEKAEELKERA 360

Query: 363 RQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHY 422
           R+RGAKPNAKTWEIF+ YYLKNGE  LA DC + AVS GR DGGKW+P  E++ TFM H+
Sbjct: 361 RRRGAKPNAKTWEIFLHYYLKNGELGLAVDCVSNAVSTGRGDGGKWIPPQEIVNTFMEHF 420

Query: 423 EQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEAC 482
           EQ KDVDGAE F++ +KK+VD+LE EV ESLIRTY+AAGR+S ++ RRLKME  EVS+A 
Sbjct: 421 EQNKDVDGAEGFLDILKKAVDTLEVEVLESLIRTYAAAGRKSPILHRRLKMENAEVSDAS 480

Query: 483 KKLLDEISIE 492
           KKLL+ I +E
Sbjct: 481 KKLLETICVE 489

BLAST of Cp4.1LG09g05200 vs. NCBI nr
Match: gi|1009177936|ref|XP_015870249.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g60770-like [Ziziphus jujuba])

HSP 1 Score: 743.4 bits (1918), Expect = 2.5e-211
Identity = 372/490 (75.92%), Positives = 426/490 (86.94%), Query Frame = 1

Query: 3   LLLRRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGD 62
           + L++  R+K+V KRS KKYLEE LY RLFKDGSSE +VR QLN F+KSRKRV+KWEVGD
Sbjct: 1   MALQQFGRSKSVTKRS-KKYLEEALYKRLFKDGSSEVTVRHQLNQFIKSRKRVYKWEVGD 60

Query: 63  TLKKLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESS 122
           TLKKLRDRKLYYPALKLSETMAKR MNKTVSDQAIHLDL+AKARGI AAE++F+ LPES 
Sbjct: 61  TLKKLRDRKLYYPALKLSETMAKRGMNKTVSDQAIHLDLIAKARGIPAAENYFIGLPESL 120

Query: 123 KNHLCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAI 182
           KNHLCYG+LLNCYCKELMTE+AEA+ EKMKELNLP++SMPYNS+MTLYSKTG  EK+ AI
Sbjct: 121 KNHLCYGALLNCYCKELMTEEAEALMEKMKELNLPLSSMPYNSIMTLYSKTGQSEKIPAI 180

Query: 183 IQEMKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYV 242
           IQEMKA+ +M D+YTYNVWMRALAA+N+ISGVER+IDEMK DGR   DWTTYSNLASIYV
Sbjct: 181 IQEMKASNIMLDSYTYNVWMRALAAVNNISGVERIIDEMKRDGRVTRDWTTYSNLASIYV 240

Query: 243 DAHMFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANI 302
           DA MF+KA  ALKELE RN+ RDLSAFQFLITL+G+ GNLLEVYR+WRSLRLAFPKTANI
Sbjct: 241 DAGMFEKAETALKELENRNSCRDLSAFQFLITLYGRTGNLLEVYRIWRSLRLAFPKTANI 300

Query: 303 SYLNMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRA 362
           SYLNM+Q L+ LKDLPGAEKCF+EW+S CS YDIR+AN LIGAY +E LLEKA ELK RA
Sbjct: 301 SYLNMMQVLVNLKDLPGAEKCFREWESQCSIYDIRVANVLIGAYVRESLLEKAEELKERA 360

Query: 363 RQRGAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHY 422
           R+RGAKPNAKTWEIF+ YYLKNGE  LA DC + AVS GR DGGKW+P  E++ TFM H+
Sbjct: 361 RRRGAKPNAKTWEIFLHYYLKNGELGLAVDCVSNAVSTGRGDGGKWIPPQEIVNTFMEHF 420

Query: 423 EQEKDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEAC 482
           EQ KDVDGAE F++ +KK+VD+LE EV ESLIRTY+AAGR+S ++ RRLKME  EVS+A 
Sbjct: 421 EQNKDVDGAEGFLDILKKAVDTLEVEVLESLIRTYAAAGRKSPILHRRLKMENAEVSDAS 480

Query: 483 KKLLDEISIE 492
           KKLL+ I  E
Sbjct: 481 KKLLETICEE 489

BLAST of Cp4.1LG09g05200 vs. NCBI nr
Match: gi|641863621|gb|KDO82307.1| (hypothetical protein CISIN_1g011226mg [Citrus sinensis])

HSP 1 Score: 743.0 bits (1917), Expect = 3.3e-211
Identity = 376/487 (77.21%), Positives = 420/487 (86.24%), Query Frame = 1

Query: 6   RRLSRTKNVAKRSTKKYLEEPLYVRLFKDGSSEKSVRLQLNVFLKSRKRVFKWEVGDTLK 65
           +R  RTKN+AKRS KK+LEE LY RLFK GSS+ SVR QLN FLKS+KRVFKWEVGDTLK
Sbjct: 5   QRFGRTKNIAKRS-KKHLEEALYDRLFKKGSSDVSVRQQLNQFLKSKKRVFKWEVGDTLK 64

Query: 66  KLRDRKLYYPALKLSETMAKRSMNKTVSDQAIHLDLLAKARGIAAAESFFVSLPESSKNH 125
           KLRDRKLYYPALKLSE M KR MNKTVSDQAIHLDL+AK +GI AAE++FV LPE+SKNH
Sbjct: 65  KLRDRKLYYPALKLSENMEKRGMNKTVSDQAIHLDLVAKVQGIDAAENYFVDLPETSKNH 124

Query: 126 LCYGSLLNCYCKELMTEKAEAISEKMKELNLPVTSMPYNSLMTLYSKTGHPEKVSAIIQE 185
           L YGSLLNCYCKELMTEKAEA+ EKMKELNL  +SMP+NSLMTLY+KTGHPEK+ AIIQE
Sbjct: 125 LTYGSLLNCYCKELMTEKAEALLEKMKELNLGFSSMPFNSLMTLYAKTGHPEKIPAIIQE 184

Query: 186 MKAAKVMFDTYTYNVWMRALAALNDISGVERVIDEMK-DGRAVGDWTTYSNLASIYVDAH 245
           MKA+ +M D+YTYNVWMRALAA+NDISG ERVI+EMK DGR   DWTT+SNLASIYV+A 
Sbjct: 185 MKASSIMPDSYTYNVWMRALAAVNDISGAERVIEEMKRDGRVAADWTTFSNLASIYVEAG 244

Query: 246 MFDKAGNALKELEKRNARRDLSAFQFLITLHGQMGNLLEVYRVWRSLRLAFPKTANISYL 305
           +F+KA  ALKELE RNA RDLSA+QFLITL+GQ GNL EVYR+WRSLRLAFP TANISYL
Sbjct: 245 LFEKAERALKELENRNAHRDLSAYQFLITLYGQTGNLSEVYRIWRSLRLAFPNTANISYL 304

Query: 306 NMIQTLIKLKDLPGAEKCFKEWQSGCSTYDIRIANALIGAYAKEGLLEKAIELKVRARQR 365
           NMIQ L+ LKDLPGAEKCFKEW+SGC+TYDIR+ N +IGAYAKEG LE A ELK RAR+R
Sbjct: 305 NMIQVLVNLKDLPGAEKCFKEWESGCATYDIRVTNVMIGAYAKEGRLENAEELKERARRR 364

Query: 366 GAKPNAKTWEIFMDYYLKNGEFKLAADCAAKAVSKGRLDGGKWVPSPEVIRTFMSHYEQE 425
           GA PNAKTWEIF DYYL+NG+ KLA DC  KA+  GR DGGKWVPS E IRTFM H+EQE
Sbjct: 365 GADPNAKTWEIFSDYYLRNGDMKLAVDCLEKAIDTGRGDGGKWVPSSETIRTFMRHFEQE 424

Query: 426 KDVDGAESFVETVKKSVDSLESEVFESLIRTYSAAGRRSCMMSRRLKMEKVEVSEACKKL 485
           KDVDGAE F+E +KK+VD L  EVFE LIRTY+AAGR S +M RRLKMEKVEVSEA KKL
Sbjct: 425 KDVDGAEGFLEILKKAVDDLGVEVFEPLIRTYAAAGRTSPVMLRRLKMEKVEVSEASKKL 484

Query: 486 LDEISIE 492
           L+ I +E
Sbjct: 485 LEAICVE 490

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PPR86_ARATH  2.9e-191  68.24     Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...
PPR4_ARATH   2.6e-83   38.54     Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidop...
PP300_ARATH  2.3e-79   39.69     Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop...
PPR3_ARATH   7.1e-65   34.47     Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN...
PP302_ARATH  1.6e-64   30.89     Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0KS91_CUCSA  1.9e-218  79.88     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G599800 PE=4 SV=1
A0A067H3N9_CITSI  2.3e-211  77.21     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011226mg PE=4 SV=1
V4SUJ3_9ROSI      6.6e-211  77.00     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031342mg PE=4 SV=1
F6H851_VITVI      4.7e-209  75.10     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0394g00020 PE=4 SV=...
A0A061E7F4_THECC  8.4e-206  75.71     Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TC...

Match Name   E-value   Identity  Description
AT1G60770.1  1.6e-192  68.24     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02370.1  1.5e-84   38.54     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G01990.1  1.3e-80   39.69     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1  4.0e-66   34.47     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G02820.1  8.9e-66   30.89     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value   Identity  Description
gi|659090741|ref|XP_008446176.1|   2.1e-226  81.47     PREDICTED: pentatricopeptide repeat-containing protein At1g60770 [Cucumis melo]
gi|449434959|ref|XP_004135263.1|   2.8e-218  79.88     PREDICTED: pentatricopeptide repeat-containing protein At1g60770 [Cucumis sativu...
gi|1009153386|ref|XP_015894607.1|  6.6e-212  75.92     PREDICTED: pentatricopeptide repeat-containing protein At1g60770-like [Ziziphus ...
gi|1009177936|ref|XP_015870249.1|  2.5e-211  75.92     PREDICTED: pentatricopeptide repeat-containing protein At1g60770-like [Ziziphus ...
gi|641863621|gb|KDO82307.1|        3.3e-211  77.21     hypothetical protein CISIN_1g011226mg [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG09g05200.1  Cp4.1LG09g05200.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                        Source    Source Term       Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 127..155, score: 0.0033
    coord: 339..357, score: 0.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 162..206, score: 2.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 196..223, score: 8.0E-4
    coord: 339..370, score: 7.8E-4
    coord: 127..157, score: 1.9E-4
    coord: 163..191, score: 7.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 369..403, score: 6.325
    coord: 410..440, score: 5.064
    coord: 264..294, score: 5.481
    coord: 159..193, score: 8.583
    coord: 124..158, score: 9.098
    coord: 299..329, score: 5.086
    coord: 229..263, score: 7.015
    coord: 334..368, score: 9.887
    coord: 194..224, score: 8
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 232..398, score: 3.2E-14
    coord: 117..176, score: 3.2
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like
    coord: 228..370, score: 9.
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 22..484, score: 4.0E
None       No IPR available                       PANTHER   PTHR24015:SF504   SUBFAMILY NOT NAMED
    coord: 22..484, score: 4.0E
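
The alignment entries in the InterPro table give each domain hit as an inclusive residue range in the form `coord: start..end`. A minimal Python sketch for extracting those ranges and counting how many protein positions they cover follows, using the PS51375 (PPR profile) coordinates listed in the table. The helper names `parse_coords` and `covered_positions` are hypothetical, introduced here for illustration only, and treating the ranges as 1-based and inclusive is an assumption about the table's convention.

```python
import re

# "coord: 127..155" -> (127, 155); ranges are assumed 1-based and inclusive.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) range found in a block of alignment text."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def covered_positions(ranges):
    """Count distinct positions covered by the ranges (overlaps counted once)."""
    covered = set()
    for start, end in ranges:
        covered.update(range(start, end + 1))
    return len(covered)


# PS51375 (PPR profile) hit coordinates as listed in the InterPro table.
ppr_profile = """
coord: 369..403 coord: 410..440 coord: 264..294
coord: 159..193 coord: 124..158 coord: 299..329
coord: 229..263 coord: 334..368 coord: 194..224
"""

ranges = parse_coords(ppr_profile)
print(len(ranges), covered_positions(ranges))  # 9 299
```

Using a set keeps the count correct when hits from different sources overlap, as the PFAM and TIGRFAMs ranges do here.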

The following gene(s) are paralogous to this gene:

None