Cp4.1LG11g09290 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG11g09290
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG11 : 7801816 .. 7804116 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATAACATTGTGAAAGTTTCACAGGTTAGTTCGCAGTCATCGTGCCAATCGATTGTGGAGCCGTCGATTGCAGCCGTTGATTCTGCTGCACGCGTTGGGGTTCGCCACTGCTGGGCGGAGCCGTGCGACAACCGGTGGTGCGCCGCCAGTAGTGTGGGTGTTGGAGTTGAGAAGAAAATGAAATCGGAGTTCTGTATTTTTTTTCTTCCTCCAATCTGAACTTTCTCTGATTTTGGCTATGCCTTTCTTTTGATGCCGCCATTCCTTCGCTTCCCTTGTTTGCCTCTCAAAACCTTCATTTCAAACACCTCAACATCCTCCACAAAAAATGCCCTATTCAATACTCAACCATTTTTCTCATTTCTCCAAGAATTTCCCCGTGATCTTCTCTCAGTCAAATCCATACACGCTCAGATCATTATCACAAACGCCATATCCGGAGACCAGCGTTTGGTGGCAAAGCTTGTTGCGGCGTACTCAAGCTTGGGTTCTTTGGAGAATGCACGTAAGGTGTTTGATAAAATTCCTCAACCAAAAACTGTTCTCTGCAATGCCATGGTTAATGGGTATCTTCAAAATCAGCATTATAATGAGACCATCGAGCTGTTTAAATTGATGGGTCGATGTCATTTCGAATTTGATAGCTATACTTGTAATTTTGCTCTTAAGGCGTGCATGTTCTTATTGGATTATGAAATGGGCATAGAAGTGATTAGATTAGCTCTCTGTAAGGGGTTAGCTGGTGGTCGGTTTTTGGGAAGTTCGATTTTAAATTTTTTGGTAAAAGCTGGGGATATTATGAATGCACGAATTTTTTTTCATGAAATGGTTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTTCATGCAGGAAGGCTTGTTTAGTGAAGGCTATAAATTGTTTCTTGATATGCTTTATAATAGAATTGAGCCTAGTGCTGTGACCATGACAAGCTTGGTTCAATCCTGTGGGGAGATGAGGAATTTAGAGTTTGGAAAATGTATTCATAGCTATGTTCTTGGATTTGGAATGAGTAGTGATACAAGGGTGCTTACCTCACTGATTGATATGTATTGTAAAACGGGTGATGTCGTAAGTGCTCGATGGATTTTCGATACAATGCCCTCTAGGAATTTGGTGTCTTGGAATGTTATGATTTCGGGTTATGTTCAAAATGGTTTTCGTGTCGAAACTTTACATCTCTTCCGTATGTTGGTTACGAATGAAGGAGGTTTCGACTCGAATACCGTTGTTAGCCTCCTCCAGCTTTGTTCTCGGACGGCTGATTTGGATGGCGGGAAGATTGTTCATGGTTGCGTCTATCGAAGAGAACTCGATTTGAATTTGATTTTGTCTACTGCAATTGTTGATCTATATGCTAAATGTGGATGTTTGGCCTATGCATATTCTGTTTTTGAAAGAATGAAAACTAAGAATGTGGTATCATGGACTGCCATGCTTGTGGGACTGGCTCAGAATGGGCAAGCTAGAGATGCTTTAAAGCTATTTTCTCAAATGCAAAATGAGAGGGTTACTTTCAATGCTCTCACCTTAGTTAGTTTAGTTCATTGTTGCACGCTCCTCGGCTCGTTGCGTGAAGGGAGAAGTGTTCATGCTGTCTTAATTCGATTTCGTTTTGCATCGGACGTTGTCGCTAAGACAGCCCTCATTGATATGTATGCAAAATGCAGCGAAATCGACTCGGGTGAGAAAGTATTCAACCATGGTTTTACGCCAAAGGATGTGATATTATATAACTCGATGATTTCGGGCTATGGAATGCACGGTCTCGGGCATAAAGCACTGTCTGTCTACCATCAAATGAATCAAGAACTTCAGCCAAATGAGAGCACCTTTGTTTCTCTGCTATCTGCTTGTAGCCATTCAGGCCTAGTGGAGGAAGGGATATCTTTGTTTCGAGATATGGAGAAAGCTCATAACGTAACGCCGACCGATAAACTTTATGCCTGCTTTGTCGATCTTTTAAGTCGAGCAGGTCGCCTTCGGCAAGCTGAGGAAGTGATCAATCAAATGCCTTACAGACCAACCAGTGGCATACTTGAAACTCTGCTGAATGGGTGTCTTTTGCATAAGGACATTGAGTTGGGTGTAAAAATTGCTGACAGATTACTCTCGTTCGAGTCTAGAAATCTGAGCGTCTACGTTAGCTTGTCGAATATATATGCCGAAGCAGGTCAATGGGATACGGTAAACCATCTCCGAGGTCTCATGACCGAGCACGAGCTTAAAAAGATTCCAGGTTATAGCTCAATTGAAGTAAATATTTAG

mRNA sequence

ATGAATAACATTGTGAAAGTTTCACAGGTTAGTTCGCAGTCATCGTGCCAATCGATTGTGGAGCCGTCGATTGCAGCCGTTGATTCTGCTGCACGCGTTGGGGTTCGCCACTGCTGGGCGGAGCCGTGCGACAACCGGTGGTGCGCCGCCAGTAGTGTGGGTGTTGGAGTTGAGAAGAAAATGAAATCGGAGTTCTTCAAATCCATACACGCTCAGATCATTATCACAAACGCCATATCCGGAGACCAGCGTTTGGTGGCAAAGCTTGTTGCGGCGTACTCAAGCTTGGGTTCTTTGGAGAATGCACGTAAGGTGTTTGATAAAATTCCTCAACCAAAAACTGTTCTCTGCAATGCCATGGTTAATGGGTATCTTCAAAATCAGCATTATAATGAGACCATCGAGCTGTTTAAATTGATGGGTCGATGTCATTTCGAATTTGATAGCTATACTTGTAATTTTGCTCTTAAGGCGTGCATGTTCTTATTGGATTATGAAATGGGCATAGAAGTGATTAGATTAGCTCTCTGTAAGGGGTTAGCTGGTGGTCGGTTTTTGGGAAGTTCGATTTTAAATTTTTTGGTAAAAGCTGGGGATATTATGAATGCACGAATTTTTTTTCATGAAATGGTTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTTCATGCAGGAAGGCTTGTTTAGTGAAGGCTATAAATTGTTTCTTGATATGCTTTATAATAGAATTGAGCCTAGTGCTGTGACCATGACAAGCTTGGTTCAATCCTGTGGGGAGATGAGGAATTTAGAGTTTGGAAAATGGAGAAGTGTTCATGCTGTCTTAATTCGATTTCGTTTTGCATCGGACGTTGTCGCTAAGACAGCCCTCATTGATATGTATGCAAAATGCAGCGAAATCGACTCGGGTGAGAAAGTATTCAACCATGGTTTTACGCCAAAGGATGTGATATTATATAACTCGATGATTTCGGGCTATGGAATGCACGGTCTCGGGCATAAAGCACTGTCTGTCTACCATCAAATGAATCAAGAACTTCAGCCAAATGAGAGCACCTTTGTTTCTCTGCTATCTGCTTGTAGCCATTCAGGCCTAGTGGAGGAAGGGATATCTTTGTTTCGAGATATGGAGAAAGCTCATAACGTAACGCCGACCGATAAACTTTATGCCTGCTTTGTCGATCTTTTAAGTCGAGCAGGTCGCCTTCGGCAAGCTGAGGAAGTGATCAATCAAATGCCTTACAGACCAACCAGTGGCATACTTGAAACTCTGCTGAATGGGTGTCTTTTGCATAAGGACATTGAGTTGGGTGTAAAAATTGCTGACAGATTACTCTCGTTCGAGTCTAGAAATCTGAGCGTCTACGTTAGCTTGTCGAATATATATGCCGAAGCAGGTCAATGGGATACGGTAAACCATCTCCGAGGTCTCATGACCGAGCACGAGCTTAAAAAGATTCCAGGTTATAGCTCAATTGAAGTAAATATTTAG

Coding sequence (CDS)

ATGAATAACATTGTGAAAGTTTCACAGGTTAGTTCGCAGTCATCGTGCCAATCGATTGTGGAGCCGTCGATTGCAGCCGTTGATTCTGCTGCACGCGTTGGGGTTCGCCACTGCTGGGCGGAGCCGTGCGACAACCGGTGGTGCGCCGCCAGTAGTGTGGGTGTTGGAGTTGAGAAGAAAATGAAATCGGAGTTCTTCAAATCCATACACGCTCAGATCATTATCACAAACGCCATATCCGGAGACCAGCGTTTGGTGGCAAAGCTTGTTGCGGCGTACTCAAGCTTGGGTTCTTTGGAGAATGCACGTAAGGTGTTTGATAAAATTCCTCAACCAAAAACTGTTCTCTGCAATGCCATGGTTAATGGGTATCTTCAAAATCAGCATTATAATGAGACCATCGAGCTGTTTAAATTGATGGGTCGATGTCATTTCGAATTTGATAGCTATACTTGTAATTTTGCTCTTAAGGCGTGCATGTTCTTATTGGATTATGAAATGGGCATAGAAGTGATTAGATTAGCTCTCTGTAAGGGGTTAGCTGGTGGTCGGTTTTTGGGAAGTTCGATTTTAAATTTTTTGGTAAAAGCTGGGGATATTATGAATGCACGAATTTTTTTTCATGAAATGGTTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTTCATGCAGGAAGGCTTGTTTAGTGAAGGCTATAAATTGTTTCTTGATATGCTTTATAATAGAATTGAGCCTAGTGCTGTGACCATGACAAGCTTGGTTCAATCCTGTGGGGAGATGAGGAATTTAGAGTTTGGAAAATGGAGAAGTGTTCATGCTGTCTTAATTCGATTTCGTTTTGCATCGGACGTTGTCGCTAAGACAGCCCTCATTGATATGTATGCAAAATGCAGCGAAATCGACTCGGGTGAGAAAGTATTCAACCATGGTTTTACGCCAAAGGATGTGATATTATATAACTCGATGATTTCGGGCTATGGAATGCACGGTCTCGGGCATAAAGCACTGTCTGTCTACCATCAAATGAATCAAGAACTTCAGCCAAATGAGAGCACCTTTGTTTCTCTGCTATCTGCTTGTAGCCATTCAGGCCTAGTGGAGGAAGGGATATCTTTGTTTCGAGATATGGAGAAAGCTCATAACGTAACGCCGACCGATAAACTTTATGCCTGCTTTGTCGATCTTTTAAGTCGAGCAGGTCGCCTTCGGCAAGCTGAGGAAGTGATCAATCAAATGCCTTACAGACCAACCAGTGGCATACTTGAAACTCTGCTGAATGGGTGTCTTTTGCATAAGGACATTGAGTTGGGTGTAAAAATTGCTGACAGATTACTCTCGTTCGAGTCTAGAAATCTGAGCGTCTACGTTAGCTTGTCGAATATATATGCCGAAGCAGGTCAATGGGATACGGTAAACCATCTCCGAGGTCTCATGACCGAGCACGAGCTTAAAAAGATTCCAGGTTATAGCTCAATTGAAGTAAATATTTAG

Protein sequence

MNNIVKVSQVSSQSSCQSIVEPSIAAVDSAARVGVRHCWAEPCDNRWCAASSVGVGVEKKMKSEFFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQELQPNESTFVSLLSACSHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPGYSSIEVNI
BLAST of Cp4.1LG11g09290 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 4.5e-75
Identity = 154/435 (35.40%), Postives = 242/435 (55.63%), Query Frame = 1

Query: 65  FFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGY 124
           F KS H  ++   A          L+  Y+S G +ENA+K+FD+IP    V  NAM++GY
Sbjct: 192 FDKSPHRDVVSYTA----------LIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGY 251

Query: 125 LQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGR 184
            +  +Y E +ELFK M + +   D  T    + AC      E+G +V       G     
Sbjct: 252 AETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNL 311

Query: 185 FLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYN 244
            + +++++   K G++  A   F  +  KDV+ WN +IGG+    L+ E   LF +ML +
Sbjct: 312 KIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRS 371

Query: 245 RIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEID 304
              P+ VTM S++ +C  +  ++ G+W  V+         +    +T+LIDMYAKC +I+
Sbjct: 372 GETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIE 431

Query: 305 SGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNESTFVSLLSAC 364
           +  +VFN     K +  +N+MI G+ MHG    +  ++ +M +  +QP++ TFV LLSAC
Sbjct: 432 AAHQVFN-SILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSAC 491

Query: 365 SHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGI 424
           SHSG+++ G  +FR M + + +TP  + Y C +DLL  +G  ++AEE+IN M   P   I
Sbjct: 492 SHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVI 551

Query: 425 LETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTE 484
             +LL  C +H ++ELG   A+ L+  E  N   YV LSNIYA AG+W+ V   R L+ +
Sbjct: 552 WCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLND 611

Query: 485 HELKKIPGYSSIEVN 499
             +KK+PG SSIE++
Sbjct: 612 KGMKKVPGCSSIEID 615

BLAST of Cp4.1LG11g09290 vs. Swiss-Prot
Match: PPR14_ARATH (Pentatricopeptide repeat-containing protein At1g06140, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E61 PE=2 SV=1)

HSP 1 Score: 280.4 bits (716), Expect = 3.8e-74
Identity = 162/427 (37.94%), Postives = 244/427 (57.14%), Query Frame = 1

Query: 73  IIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNE 132
           + + N +  D  +   LV  Y+ LG++E+A+KVFD+IP   +VL   ++ GYL+     E
Sbjct: 134 LAMKNGLDKDDYVAPSLVEMYAQLGTMESAQKVFDEIPVRNSVLWGVLMKGYLKYSKDPE 193

Query: 133 TIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAG-GRFLGSSIL 192
              LF LM       D+ T    +KAC  +   ++G  V  +++ +       +L +SI+
Sbjct: 194 VFRLFCLMRDTGLALDALTLICLVKACGNVFAGKVGKCVHGVSIRRSFIDQSDYLQASII 253

Query: 193 NFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAV 252
           +  VK   + NAR  F   V+++VV W  +I GF +     E + LF  ML   I P+  
Sbjct: 254 DMYVKCRLLDNARKLFETSVDRNVVMWTTLISGFAKCERAVEAFDLFRQMLRESILPNQC 313

Query: 253 TMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFN 312
           T+ +++ SC  + +L  GK  SVH  +IR     D V  T+ IDMYA+C  I     VF+
Sbjct: 314 TLAAILVSCSSLGSLRHGK--SVHGYMIRNGIEMDAVNFTSFIDMYARCGNIQMARTVFD 373

Query: 313 HGFTPKDVILYNSMISGYGMHGLGHKALSVYHQM-NQELQPNESTFVSLLSACSHSGLVE 372
                ++VI ++SMI+ +G++GL  +AL  +H+M +Q + PN  TFVSLLSACSHSG V+
Sbjct: 374 M-MPERNVISWSSMINAFGINGLFEEALDCFHKMKSQNVVPNSVTFVSLLSACSHSGNVK 433

Query: 373 EGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNG 432
           EG   F  M + + V P ++ YAC VDLL RAG + +A+  I+ MP +P +     LL+ 
Sbjct: 434 EGWKQFESMTRDYGVVPEEEHYACMVDLLGRAGEIGEAKSFIDNMPVKPMASAWGALLSA 493

Query: 433 CLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIP 492
           C +HK+++L  +IA++LLS E    SVYV LSNIYA+AG W+ VN +R  M     +K  
Sbjct: 494 CRIHKEVDLAGEIAEKLLSMEPEKSSVYVLLSNIYADAGMWEMVNCVRRKMGIKGYRKHV 553

Query: 493 GYSSIEV 498
           G S+ EV
Sbjct: 554 GQSATEV 557

BLAST of Cp4.1LG11g09290 vs. Swiss-Prot
Match: PP333_ARATH (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 7.4e-70
Identity = 147/450 (32.67%), Postives = 248/450 (55.11%), Query Frame = 1

Query: 51  SSVGVGVEKKMKSEFFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIP 110
           SS+   V K    E+ K IH  I+  ++IS D  L + L+ AY     +  A+ +F +  
Sbjct: 344 SSLLPSVSKFENLEYCKQIHCYIM-RHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCN 403

Query: 111 QPKTVLCNAMVNGYLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIE 170
               V+  AM++GYL N  Y +++E+F+ + +     +  T    L     LL  ++G E
Sbjct: 404 SVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRE 463

Query: 171 VIRLALCKGLAGGRFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGL 230
           +    + KG      +G ++++   K G +  A   F  + ++D+V WN MI    Q   
Sbjct: 464 LHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDN 523

Query: 231 FSEGYKLFLDMLYNRIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAK 290
            S    +F  M  + I    V++++ + +C  + +  FGK  ++H  +I+   ASDV ++
Sbjct: 524 PSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGK--AIHGFMIKHSLASDVYSE 583

Query: 291 TALIDMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-- 350
           + LIDMYAKC  + +   VF      K+++ +NS+I+  G HG    +L ++H+M ++  
Sbjct: 584 STLIDMYAKCGNLKAAMNVFKT-MKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSG 643

Query: 351 LQPNESTFVSLLSACSHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQA 410
           ++P++ TF+ ++S+C H G V+EG+  FR M + + + P  + YAC VDL  RAGRL +A
Sbjct: 644 IRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEA 703

Query: 411 EEVINQMPYRPTSGILETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEA 470
            E +  MP+ P +G+  TLL  C LHK++EL    + +L+  +  N   YV +SN +A A
Sbjct: 704 YETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANA 763

Query: 471 GQWDTVNHLRGLMTEHELKKIPGYSSIEVN 499
            +W++V  +R LM E E++KIPGYS IE+N
Sbjct: 764 REWESVTKVRSLMKEREVQKIPGYSWIEIN 789

BLAST of Cp4.1LG11g09290 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 6.3e-69
Identity = 143/426 (33.57%), Postives = 228/426 (53.52%), Query Frame = 1

Query: 73  IIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNE 132
           +++    + D   +  L++ YS  G L +A K+FD+IP    V   A+ +GY  +  + E
Sbjct: 136 LVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTWTALFSGYTTSGRHRE 195

Query: 133 TIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLGSSILN 192
            I+LFK M     + DSY     L AC+ + D + G  +++      +    F+ ++++N
Sbjct: 196 AIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRTTLVN 255

Query: 193 FLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVT 252
              K G +  AR  F  MVEKD+V W+ MI G+       EG +LFL ML   ++P   +
Sbjct: 256 LYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFS 315

Query: 253 MTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFNH 312
           +   + SC  +  L+ G+W    +++ R  F +++    ALIDMYAKC  +  G +VF  
Sbjct: 316 IVGFLSSCASLGALDLGEWGI--SLIDRHEFLTNLFMANALIDMYAKCGAMARGFEVFKE 375

Query: 313 GFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNESTFVSLLSACSHSGLVEE 372
               KD+++ N+ ISG   +G    + +V+ Q  +  + P+ STF+ LL  C H+GL+++
Sbjct: 376 -MKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQD 435

Query: 373 GISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGC 432
           G+  F  +   + +  T + Y C VDL  RAG L  A  +I  MP RP + +   LL+GC
Sbjct: 436 GLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGALLSGC 495

Query: 433 LLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPG 492
            L KD +L   +   L++ E  N   YV LSNIY+  G+WD    +R +M +  +KKIPG
Sbjct: 496 RLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPG 555

Query: 493 YSSIEV 498
           YS IE+
Sbjct: 556 YSWIEL 558

BLAST of Cp4.1LG11g09290 vs. Swiss-Prot
Match: PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana GN=CRR2 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 3.5e-67
Identity = 148/431 (34.34%), Postives = 237/431 (54.99%), Query Frame = 1

Query: 74  IITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNET 133
           I+ N    D  L  KL+  YS LGS++ ARKVFDK  +    + NA+        H  E 
Sbjct: 103 ILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEV 162

Query: 134 IELFKLMGRCHFEFDSYTCNFALKACMF---LLDYEM-GIEVIRLALCKGLAGGRFLGSS 193
           + L+  M R   E D +T  + LKAC+     +++ M G E+      +G +   ++ ++
Sbjct: 163 LGLYWKMNRIGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTT 222

Query: 194 ILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIE-- 253
           +++   + G +  A   F  M  ++VV W+ MI  + + G   E  + F +M+    +  
Sbjct: 223 LVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRETKDSS 282

Query: 254 PSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGE 313
           P++VTM S++Q+C  +  LE GK   +H  ++R    S +   +AL+ MY +C +++ G+
Sbjct: 283 PNSVTMVSVLQACASLAALEQGKL--IHGYILRRGLDSILPVISALVTMYGRCGKLEVGQ 342

Query: 314 KVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSACSHS 373
           +VF+     +DV+ +NS+IS YG+HG G KA+ ++ +M      P   TFVS+L ACSH 
Sbjct: 343 RVFDR-MHDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHE 402

Query: 374 GLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILET 433
           GLVEEG  LF  M + H + P  + YAC VDLL RA RL +A +++  M   P   +  +
Sbjct: 403 GLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDMRTEPGPKVWGS 462

Query: 434 LLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHEL 493
           LL  C +H ++EL  + + RL + E +N   YV L++IYAEA  WD V  ++ L+    L
Sbjct: 463 LLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGL 522

Query: 494 KKIPGYSSIEV 498
           +K+PG   +EV
Sbjct: 523 QKLPGRCWMEV 530

BLAST of Cp4.1LG11g09290 vs. TrEMBL
Match: A0A0A0LYM5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G573630 PE=4 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 1.1e-125
Identity = 244/433 (56.35%), Postives = 302/433 (69.75%), Query Frame = 1

Query: 65  FFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGY 124
           F K +H   ++   +S D R++  L+  Y   G +E+AR +F+ +P    V  N M++GY
Sbjct: 149 FGKCMHG-FVLGFGMSRDTRVLTTLIDMYCKSGDVESARWIFENMPSRNLVSWNVMISGY 208

Query: 125 LQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGR 184
           +QN    ET+ LF+ +      FDS T    ++ C    D + G  +      +GL    
Sbjct: 209 VQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILHGFIYRRGLDLNL 268

Query: 185 FLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYN 244
            L ++I++   K G +  A   F  M  K+V+ W  M+ G  Q G   +  KLF  M   
Sbjct: 269 VLPTAIVDLYAKCGSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFDQMQNE 328

Query: 245 RIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEID 304
           R+  +A+T+ SLV  C  +  L  G  RSVHA L RF FAS+VV  TALIDMYAKCS+I+
Sbjct: 329 RVTFNALTLVSLVYCCTLLGLLREG--RSVHATLTRFHFASEVVVMTALIDMYAKCSKIN 388

Query: 305 SGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSAC 364
           S E VF +G TPKDVILYNSMISGYGMHGLGHKAL VYH+MN+E LQPNESTFVSLLSAC
Sbjct: 389 SAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNESTFVSLLSAC 448

Query: 365 SHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGI 424
           SHSGLVEEGI+LF++M K HN TPTDKLYAC VDLLSRAGRLRQAEE+INQMP+ PTSGI
Sbjct: 449 SHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLRQAEELINQMPFTPTSGI 508

Query: 425 LETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTE 484
           LETLLNGCLLHKDIELGVK+ADRLLS ESRN S+Y++LSNIYA+A +WD+V ++RGLM E
Sbjct: 509 LETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDSVKYVRGLMME 568

Query: 485 HELKKIPGYSSIE 497
            E+KKIPGYSSIE
Sbjct: 569 QEIKKIPGYSSIE 578

BLAST of Cp4.1LG11g09290 vs. TrEMBL
Match: A0A0B2SD75_GLYSO (Putative pentatricopeptide repeat-containing protein OS=Glycine soja GN=glysoja_007570 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 3.9e-110
Identity = 208/423 (49.17%), Postives = 287/423 (67.85%), Query Frame = 1

Query: 105 VFDKIPQPKTVLCNAMVNGYLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLD 164
           +FD+   P+T +CNAM+ G+L+NQ + E  +LF++MG C+ E D+YTC F+LKAC  LLD
Sbjct: 1   MFDQCSLPETAVCNAMMAGFLRNQQHTEVPKLFRMMGSCNIEIDTYTCMFSLKACASLLD 60

Query: 165 YEMGIEVIRLALCKGLAGGRFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGG 224
            E+G+E++R A+ KG     ++GSS++NFLVK G + +A+  F  M EKD VCWN +IGG
Sbjct: 61  DEIGMEIVRTAVRKGFRLHPYVGSSMVNFLVKCGYLDDAQKVFDGMPEKDAVCWNSIIGG 120

Query: 225 FMQEGLFSEGYKLFLDMLYNRIEPSAVTMTSLVQSCGEMRNLEFG--------------- 284
           ++++GLF+E  ++F +M+   + PS VTM S +++CGE    + G               
Sbjct: 121 YVKKGLFTEAIQMFPEMIGGGLRPSPVTMVSSLKACGESGLKKVGMCAHGCVLALGMGND 180

Query: 285 ------------KWRSVHAVLIRFRFAS---DVVAKTALIDMYAKCSEIDSGEKVFNHGF 344
                       K RS HA      +     D V  +ALIDMYAKC +I S EK+FN+GF
Sbjct: 181 SCACCAHLGSLKKGRSAHA---HLIWHGYAFDAVITSALIDMYAKCGKIHSAEKLFNNGF 240

Query: 345 TPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSACSHSGLVEEGI 404
             KDVIL NSMI  YG++  GH AL VY +M +E L PN++TFVSLL+ACSHSGLVEEG 
Sbjct: 241 HLKDVILCNSMIMSYGIYAHGHYALGVYGRMIEERLNPNQTTFVSLLTACSHSGLVEEGK 300

Query: 405 SLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGCLL 464
           +LF  ME+ HN+ P DK YAC VDLLSRAGRL +A+ ++ QMP++P++ +LE LL+GC  
Sbjct: 301 ALFHCMERDHNIKPQDKHYACLVDLLSRAGRLEEADALVKQMPFQPSTDVLEALLSGCRT 360

Query: 465 HKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPGYS 497
           HK+  +G++IADRL+S +  N  +YV LSNIYAEA +W++VN++RGLM    +KKIPGYS
Sbjct: 361 HKNTNMGIQIADRLISLDYLNSGIYVMLSNIYAEARKWESVNYIRGLMRMQGMKKIPGYS 420

BLAST of Cp4.1LG11g09290 vs. TrEMBL
Match: A0A0L9TN84_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan01g155600 PE=4 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 3.0e-102
Identity = 197/426 (46.24%), Postives = 280/426 (65.73%), Query Frame = 1

Query: 73  IIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNE 132
           ++I   +  D  ++  LV  YS+LG  + A  VF+ +     +  NAM++GY+QN    E
Sbjct: 247 VLIALGMGNDVFVLTSLVDMYSNLGDTDIAALVFNGMFNRSLISWNAMISGYIQNGLIPE 306

Query: 133 TIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLGSSILN 192
           +  LF+ + +    FDS T    ++ C    D E G  +    + KG+     L +SI++
Sbjct: 307 SFALFRRLIQSGSGFDSGTLVSLIRGCSQTSDLENGRILHACIIRKGIESNIVLSTSIVD 366

Query: 193 FLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVT 252
              K G I  A I F  M +++V+ W  M+ G  Q G   +  KLF  M   ++  ++VT
Sbjct: 367 MYSKCGAIKLATIVFGRMEKRNVITWTAMLVGLSQNGYAEDALKLFCQMQEEKVPANSVT 426

Query: 253 MTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFNH 312
           + SLV  C  + +L+ G  RSVHA LIR  +A D V  +AL+DMYAKC +I S EK+FN+
Sbjct: 427 LVSLVHCCAHLGSLKKG--RSVHAYLIRHGYAFDAVNMSALLDMYAKCGKIRSAEKLFNN 486

Query: 313 GFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSACSHSGLVEE 372
           GF  KDVIL N+MI GYGMHGLGH ALS+Y +M +E L+P+++TF+SLL+ACSHSGLVEE
Sbjct: 487 GFHLKDVILCNTMIMGYGMHGLGHYALSLYDRMIEESLKPSQTTFISLLTACSHSGLVEE 546

Query: 373 GISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGC 432
           G +LF  ME+ HN+ P DK YAC VDLLSRAGRL++A+ ++ QMP++P++ + E LL+GC
Sbjct: 547 GKALFNCMERDHNIKPQDKHYACVVDLLSRAGRLKEADALVKQMPFQPSTDVFEALLSGC 606

Query: 433 LLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPG 492
             HK+I +G++IADRL+S +  N  +YV LSNIYAEAG+W++VN++RGLM    LKK+PG
Sbjct: 607 RTHKNINMGIQIADRLISLDYLNSGIYVMLSNIYAEAGRWESVNYIRGLMRMQGLKKVPG 666

Query: 493 YSSIEV 498
           YS IE+
Sbjct: 667 YSLIEI 670

BLAST of Cp4.1LG11g09290 vs. TrEMBL
Match: V7CHX8_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G092700g PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 9.7e-101
Identity = 198/425 (46.59%), Postives = 276/425 (64.94%), Query Frame = 1

Query: 74  IITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNET 133
           +I   +  D  ++  LV  YS+LG  +NA  VF+ +     +  NAM++GY+QN    E+
Sbjct: 271 VIALGMGNDVFVLTSLVDMYSNLGDTDNAALVFNGMFNRSLISWNAMISGYVQNGLIPES 330

Query: 134 IELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLGSSILNF 193
             LF+ + +    FDS T    ++ C    D + G  +    + KG+     L +SI++ 
Sbjct: 331 FALFRRLVQSGSGFDSGTLVSLMRGCSQTSDLKNGRILHSCIIRKGIESNIVLSTSIVDM 390

Query: 194 LVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVTM 253
             K G I  A   F  M +K+V+ W  M+ G  Q G   +  KLF  M   ++  ++VT+
Sbjct: 391 YSKCGAIKLATTVFGRMEKKNVITWTAMLVGLSQNGYAEDALKLFCQMQEEKVLANSVTL 450

Query: 254 TSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFNHG 313
            SLV  C  + +L  G  RSVHA LIR  +A D V  +AL+DMYAKC +I S EK+FN G
Sbjct: 451 VSLVHCCAHLGSLNKG--RSVHAHLIRHGYAFDAVNMSALLDMYAKCGKIRSAEKLFNKG 510

Query: 314 FTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSACSHSGLVEEG 373
           F  KDVIL N+MI GYGMHGLGH AL VY +M +E L+PN++TFVSLL+ACSHSGLVEEG
Sbjct: 511 FHLKDVILCNTMIMGYGMHGLGHYALGVYGRMIEERLKPNQTTFVSLLTACSHSGLVEEG 570

Query: 374 ISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGCL 433
            +LF  +E+ HN+ P +K YAC VDLLSRAGRL++A+ ++ QMP++P++ +LE LL+GC 
Sbjct: 571 KALFDCIERDHNIKPQEKHYACLVDLLSRAGRLKEADALVKQMPFQPSTDVLEALLSGCR 630

Query: 434 LHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPGY 493
            HK+I +G++IADRL+S +  N  +YV LSNIYAEA +W++VN++RGLM    LKK+PGY
Sbjct: 631 THKNINMGIQIADRLISLDYLNSGIYVMLSNIYAEARRWESVNYIRGLMRMQGLKKVPGY 690

Query: 494 SSIEV 498
           S IEV
Sbjct: 691 SLIEV 693

BLAST of Cp4.1LG11g09290 vs. TrEMBL
Match: A0A061F2D9_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_026441 PE=4 SV=1)

HSP 1 Score: 372.9 bits (956), Expect = 6.3e-100
Identity = 194/435 (44.60%), Postives = 277/435 (63.68%), Query Frame = 1

Query: 64  EFFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNG 123
           E  K +H   ++   +  D  ++  LV  YS +G +E+A  +FD IP    V  N M++G
Sbjct: 257 ELGKCVHG-FVLGLGMGSDILVLTALVDMYSKMGEIESAHLLFDSIPAKNLVSWNVMISG 316

Query: 124 YLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGG 183
           Y+QN   +++ +LF+ +     +FDS T    L+ C  + D E G  +      +GL   
Sbjct: 317 YVQNCLVSKSFDLFRELVITGGDFDSGTIISLLQCCAQIADLESGKVLHGCIFRRGLDMN 376

Query: 184 RFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLY 243
             L ++I++   K G +  A   F  M +++V+ W  M+ G  Q G   +  KLF  M  
Sbjct: 377 LILSTAIVDLYSKCGAVKEATFVFDRMKDRNVITWTAMLVGLAQNGKAEDALKLFNQMQE 436

Query: 244 NRIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEI 303
             +  +++T+  LV SC  + +L+ G  RSVHA L R  +  DVV +TALIDMYAKC +I
Sbjct: 437 EGVAANSITLVGLVHSCAHLGSLKKG--RSVHAQLFRHGYDFDVVNRTALIDMYAKCGKI 496

Query: 304 DSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSA 363
           +  E+V   G   KDVIL+NSMI+GYGMHG GHKAL ++ +M +E ++P+++TF+SLLSA
Sbjct: 497 NYAERVLRDGSFFKDVILWNSMITGYGMHGQGHKALDIFCRMLEEGVKPSQTTFISLLSA 556

Query: 364 CSHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSG 423
           CSHSGLV +G SLF  ME  HN+ PT+K YAC+VDLLSRAGRL++AE +I QMP++ +  
Sbjct: 557 CSHSGLVNQGRSLFVSMESDHNIRPTEKHYACYVDLLSRAGRLQEAEALIKQMPFQSSGA 616

Query: 424 ILETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMT 483
           + E LL+GC  HK+I++G+K AD LLS ++ N  +YV LSNIYAEA +WD V+H+RGLM 
Sbjct: 617 VFEALLSGCRTHKNIDIGIKAADHLLSLDATNPGIYVMLSNIYAEARRWDAVDHIRGLMK 676

Query: 484 EHELKKIPGYSSIEV 498
           +  LKK PGYS IEV
Sbjct: 677 KRGLKKTPGYSLIEV 688

BLAST of Cp4.1LG11g09290 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 283.5 bits (724), Expect = 2.5e-76
Identity = 154/435 (35.40%), Postives = 242/435 (55.63%), Query Frame = 1

Query: 65  FFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGY 124
           F KS H  ++   A          L+  Y+S G +ENA+K+FD+IP    V  NAM++GY
Sbjct: 192 FDKSPHRDVVSYTA----------LIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGY 251

Query: 125 LQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGR 184
            +  +Y E +ELFK M + +   D  T    + AC      E+G +V       G     
Sbjct: 252 AETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNL 311

Query: 185 FLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYN 244
            + +++++   K G++  A   F  +  KDV+ WN +IGG+    L+ E   LF +ML +
Sbjct: 312 KIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRS 371

Query: 245 RIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEID 304
              P+ VTM S++ +C  +  ++ G+W  V+         +    +T+LIDMYAKC +I+
Sbjct: 372 GETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIE 431

Query: 305 SGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNESTFVSLLSAC 364
           +  +VFN     K +  +N+MI G+ MHG    +  ++ +M +  +QP++ TFV LLSAC
Sbjct: 432 AAHQVFN-SILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSAC 491

Query: 365 SHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGI 424
           SHSG+++ G  +FR M + + +TP  + Y C +DLL  +G  ++AEE+IN M   P   I
Sbjct: 492 SHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVI 551

Query: 425 LETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTE 484
             +LL  C +H ++ELG   A+ L+  E  N   YV LSNIYA AG+W+ V   R L+ +
Sbjct: 552 WCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLND 611

Query: 485 HELKKIPGYSSIEVN 499
             +KK+PG SSIE++
Sbjct: 612 KGMKKVPGCSSIEID 615

BLAST of Cp4.1LG11g09290 vs. TAIR10
Match: AT1G06140.1 (AT1G06140.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 280.4 bits (716), Expect = 2.1e-75
Identity = 162/427 (37.94%), Postives = 244/427 (57.14%), Query Frame = 1

Query: 73  IIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNE 132
           + + N +  D  +   LV  Y+ LG++E+A+KVFD+IP   +VL   ++ GYL+     E
Sbjct: 134 LAMKNGLDKDDYVAPSLVEMYAQLGTMESAQKVFDEIPVRNSVLWGVLMKGYLKYSKDPE 193

Query: 133 TIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAG-GRFLGSSIL 192
              LF LM       D+ T    +KAC  +   ++G  V  +++ +       +L +SI+
Sbjct: 194 VFRLFCLMRDTGLALDALTLICLVKACGNVFAGKVGKCVHGVSIRRSFIDQSDYLQASII 253

Query: 193 NFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAV 252
           +  VK   + NAR  F   V+++VV W  +I GF +     E + LF  ML   I P+  
Sbjct: 254 DMYVKCRLLDNARKLFETSVDRNVVMWTTLISGFAKCERAVEAFDLFRQMLRESILPNQC 313

Query: 253 TMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFN 312
           T+ +++ SC  + +L  GK  SVH  +IR     D V  T+ IDMYA+C  I     VF+
Sbjct: 314 TLAAILVSCSSLGSLRHGK--SVHGYMIRNGIEMDAVNFTSFIDMYARCGNIQMARTVFD 373

Query: 313 HGFTPKDVILYNSMISGYGMHGLGHKALSVYHQM-NQELQPNESTFVSLLSACSHSGLVE 372
                ++VI ++SMI+ +G++GL  +AL  +H+M +Q + PN  TFVSLLSACSHSG V+
Sbjct: 374 M-MPERNVISWSSMINAFGINGLFEEALDCFHKMKSQNVVPNSVTFVSLLSACSHSGNVK 433

Query: 373 EGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNG 432
           EG   F  M + + V P ++ YAC VDLL RAG + +A+  I+ MP +P +     LL+ 
Sbjct: 434 EGWKQFESMTRDYGVVPEEEHYACMVDLLGRAGEIGEAKSFIDNMPVKPMASAWGALLSA 493

Query: 433 CLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIP 492
           C +HK+++L  +IA++LLS E    SVYV LSNIYA+AG W+ VN +R  M     +K  
Sbjct: 494 CRIHKEVDLAGEIAEKLLSMEPEKSSVYVLLSNIYADAGMWEMVNCVRRKMGIKGYRKHV 553

Query: 493 GYSSIEV 498
           G S+ EV
Sbjct: 554 GQSATEV 557

BLAST of Cp4.1LG11g09290 vs. TAIR10
Match: AT4G21300.1 (AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 266.2 bits (679), Expect = 4.2e-71
Identity = 147/450 (32.67%), Postives = 248/450 (55.11%), Query Frame = 1

Query: 51  SSVGVGVEKKMKSEFFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIP 110
           SS+   V K    E+ K IH  I+  ++IS D  L + L+ AY     +  A+ +F +  
Sbjct: 344 SSLLPSVSKFENLEYCKQIHCYIM-RHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCN 403

Query: 111 QPKTVLCNAMVNGYLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIE 170
               V+  AM++GYL N  Y +++E+F+ + +     +  T    L     LL  ++G E
Sbjct: 404 SVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRE 463

Query: 171 VIRLALCKGLAGGRFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGL 230
           +    + KG      +G ++++   K G +  A   F  + ++D+V WN MI    Q   
Sbjct: 464 LHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDN 523

Query: 231 FSEGYKLFLDMLYNRIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAK 290
            S    +F  M  + I    V++++ + +C  + +  FGK  ++H  +I+   ASDV ++
Sbjct: 524 PSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGK--AIHGFMIKHSLASDVYSE 583

Query: 291 TALIDMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-- 350
           + LIDMYAKC  + +   VF      K+++ +NS+I+  G HG    +L ++H+M ++  
Sbjct: 584 STLIDMYAKCGNLKAAMNVFKT-MKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSG 643

Query: 351 LQPNESTFVSLLSACSHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQA 410
           ++P++ TF+ ++S+C H G V+EG+  FR M + + + P  + YAC VDL  RAGRL +A
Sbjct: 644 IRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEA 703

Query: 411 EEVINQMPYRPTSGILETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEA 470
            E +  MP+ P +G+  TLL  C LHK++EL    + +L+  +  N   YV +SN +A A
Sbjct: 704 YETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANA 763

Query: 471 GQWDTVNHLRGLMTEHELKKIPGYSSIEVN 499
            +W++V  +R LM E E++KIPGYS IE+N
Sbjct: 764 REWESVTKVRSLMKEREVQKIPGYSWIEIN 789

BLAST of Cp4.1LG11g09290 vs. TAIR10
Match: AT3G08820.1 (AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 263.1 bits (671), Expect = 3.5e-70
Identity = 143/426 (33.57%), Postives = 228/426 (53.52%), Query Frame = 1

Query: 73  IIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNE 132
           +++    + D   +  L++ YS  G L +A K+FD+IP    V   A+ +GY  +  + E
Sbjct: 136 LVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTWTALFSGYTTSGRHRE 195

Query: 133 TIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLGSSILN 192
            I+LFK M     + DSY     L AC+ + D + G  +++      +    F+ ++++N
Sbjct: 196 AIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRTTLVN 255

Query: 193 FLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVT 252
              K G +  AR  F  MVEKD+V W+ MI G+       EG +LFL ML   ++P   +
Sbjct: 256 LYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFS 315

Query: 253 MTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFNH 312
           +   + SC  +  L+ G+W    +++ R  F +++    ALIDMYAKC  +  G +VF  
Sbjct: 316 IVGFLSSCASLGALDLGEWGI--SLIDRHEFLTNLFMANALIDMYAKCGAMARGFEVFKE 375

Query: 313 GFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNESTFVSLLSACSHSGLVEE 372
               KD+++ N+ ISG   +G    + +V+ Q  +  + P+ STF+ LL  C H+GL+++
Sbjct: 376 -MKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQD 435

Query: 373 GISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGC 432
           G+  F  +   + +  T + Y C VDL  RAG L  A  +I  MP RP + +   LL+GC
Sbjct: 436 GLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGALLSGC 495

Query: 433 LLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPG 492
            L KD +L   +   L++ E  N   YV LSNIY+  G+WD    +R +M +  +KKIPG
Sbjct: 496 RLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPG 555

Query: 493 YSSIEV 498
           YS IE+
Sbjct: 556 YSWIEL 558

BLAST of Cp4.1LG11g09290 vs. TAIR10
Match: AT2G03380.1 (AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 257.3 bits (656), Expect = 1.9e-68
Identity = 144/424 (33.96%), Postives = 225/424 (53.07%), Query Frame = 1

Query: 74  IITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNET 133
           ++ + I     LV  L+  Y   G + NAR+VF++      V+  AM+ GY  N   NE 
Sbjct: 268 LVKSGIELSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEA 327

Query: 134 IELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLGSSILNF 193
           + LF+ M     + +  T    L  C  + + E+G  V  L++  G+     + +++++ 
Sbjct: 328 LSLFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTN-VANALVHM 387

Query: 194 LVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVTM 253
             K     +A+  F    EKD+V WN +I GF Q G   E   LF  M    + P+ VT+
Sbjct: 388 YAKCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTV 447

Query: 254 TSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGEKVFNHG 313
            SL  +C  + +L  G     ++V + F  +S V   TAL+D YAKC +  S   +F+  
Sbjct: 448 ASLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDT- 507

Query: 314 FTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQELQ-PNESTFVSLLSACSHSGLVEEG 373
              K+ I +++MI GYG  G    +L ++ +M ++ Q PNESTF S+LSAC H+G+V EG
Sbjct: 508 IEEKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVNEG 567

Query: 374 ISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGCL 433
              F  M K +N TP+ K Y C VD+L+RAG L QA ++I +MP +P        L+GC 
Sbjct: 568 KKYFSSMYKDYNFTPSTKHYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCG 627

Query: 434 LHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPGY 493
           +H   +LG  +  ++L     + S YV +SN+YA  G+W+    +R LM +  L KI G+
Sbjct: 628 MHSRFDLGEIVIKKMLDLHPDDASYYVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGH 687

Query: 494 SSIE 497
           S++E
Sbjct: 688 STME 689

BLAST of Cp4.1LG11g09290 vs. NCBI nr
Match: gi|659099713|ref|XP_008450740.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18750, chloroplastic-like [Cucumis melo])

HSP 1 Score: 464.2 bits (1193), Expect = 3.0e-127
Identity = 246/437 (56.29%), Postives = 306/437 (70.02%), Query Frame = 1

Query: 64  EFFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNG 123
           +F K +H+  ++   +S D R++  L+  Y   G +E+AR +FD +P    V  N M++G
Sbjct: 253 KFGKCMHS-FVLGFGMSSDTRVLTTLIDMYCKSGDVESARWIFDNMPSRNLVSWNVMISG 312

Query: 124 YLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGG 183
           Y+QN    ET+ LF+ +      FDS T    ++ C    D + G  +      +GL   
Sbjct: 313 YVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILHGCIYRRGLDLN 372

Query: 184 RFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLY 243
             L ++I++   K G +  A   F  +  K+V+ W  M+ G  Q G   +  KLF  M  
Sbjct: 373 LVLSTAIVDLYAKCGSLAYASSVFERIKNKNVISWTAMLVGLAQNGHARDALKLFDQMQN 432

Query: 244 NRIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEI 303
            R+  + +T+ SLV  C  +R L  G  RSVHA L RF FAS+VV  TALIDMYAKCS+I
Sbjct: 433 ERVTFNVLTLVSLVYCCTLLRLLREG--RSVHATLTRFHFASEVVVMTALIDMYAKCSKI 492

Query: 304 DSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSA 363
           +S E VF +G TPKDVILYNSMISGYGMHGLGHKAL VYH+MN+E LQPNESTFVSLLSA
Sbjct: 493 NSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNESTFVSLLSA 552

Query: 364 CSHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSG 423
           CSHSGLVEEGI+LF++M K HN TPTDKLYAC VDLLSRAGRL+QAEE+INQMP+ PTSG
Sbjct: 553 CSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLQQAEELINQMPFTPTSG 612

Query: 424 ILETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMT 483
           ILETLLNGCLLHKDIELGVK+ADRLLS ESRN S+Y++LSNIYA+A +WD+V H+RGLM 
Sbjct: 613 ILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDSVKHVRGLMM 672

Query: 484 EHELKKIPGYSSIEVNI 500
           E E+KKIPG SSIEVNI
Sbjct: 673 EQEIKKIPGCSSIEVNI 686

BLAST of Cp4.1LG11g09290 vs. NCBI nr
Match: gi|778662656|ref|XP_011659934.1| (PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 463.8 bits (1192), Expect = 3.9e-127
Identity = 247/436 (56.65%), Postives = 305/436 (69.95%), Query Frame = 1

Query: 65  FFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGY 124
           F K +H   ++   +S D R++  L+  Y   G +E+AR +F+ +P    V  N M++GY
Sbjct: 255 FGKCMHG-FVLGFGMSRDTRVLTTLIDMYCKSGDVESARWIFENMPSRNLVSWNVMISGY 314

Query: 125 LQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGR 184
           +QN    ET+ LF+ +      FDS T    ++ C    D + G  +      +GL    
Sbjct: 315 VQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILHGFIYRRGLDLNL 374

Query: 185 FLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYN 244
            L ++I++   K G +  A   F  M  K+V+ W  M+ G  Q G   +  KLF  M   
Sbjct: 375 VLPTAIVDLYAKCGSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFDQMQNE 434

Query: 245 RIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEID 304
           R+  +A+T+ SLV  C  +  L  G  RSVHA L RF FAS+VV  TALIDMYAKCS+I+
Sbjct: 435 RVTFNALTLVSLVYCCTLLGLLREG--RSVHATLTRFHFASEVVVMTALIDMYAKCSKIN 494

Query: 305 SGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSAC 364
           S E VF +G TPKDVILYNSMISGYGMHGLGHKAL VYH+MN+E LQPNESTFVSLLSAC
Sbjct: 495 SAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNESTFVSLLSAC 554

Query: 365 SHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGI 424
           SHSGLVEEGI+LF++M K HN TPTDKLYAC VDLLSRAGRLRQAEE+INQMP+ PTSGI
Sbjct: 555 SHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLRQAEELINQMPFTPTSGI 614

Query: 425 LETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTE 484
           LETLLNGCLLHKDIELGVK+ADRLLS ESRN S+Y++LSNIYA+A +WD+V ++RGLM E
Sbjct: 615 LETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDSVKYVRGLMME 674

Query: 485 HELKKIPGYSSIEVNI 500
            E+KKIPGYSSIEVNI
Sbjct: 675 QEIKKIPGYSSIEVNI 687

BLAST of Cp4.1LG11g09290 vs. NCBI nr
Match: gi|700211048|gb|KGN66144.1| (hypothetical protein Csa_1G573630 [Cucumis sativus])

HSP 1 Score: 458.4 bits (1178), Expect = 1.6e-125
Identity = 244/433 (56.35%), Postives = 302/433 (69.75%), Query Frame = 1

Query: 65  FFKSIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGY 124
           F K +H   ++   +S D R++  L+  Y   G +E+AR +F+ +P    V  N M++GY
Sbjct: 149 FGKCMHG-FVLGFGMSRDTRVLTTLIDMYCKSGDVESARWIFENMPSRNLVSWNVMISGY 208

Query: 125 LQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGR 184
           +QN    ET+ LF+ +      FDS T    ++ C    D + G  +      +GL    
Sbjct: 209 VQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILHGFIYRRGLDLNL 268

Query: 185 FLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYN 244
            L ++I++   K G +  A   F  M  K+V+ W  M+ G  Q G   +  KLF  M   
Sbjct: 269 VLPTAIVDLYAKCGSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFDQMQNE 328

Query: 245 RIEPSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEID 304
           R+  +A+T+ SLV  C  +  L  G  RSVHA L RF FAS+VV  TALIDMYAKCS+I+
Sbjct: 329 RVTFNALTLVSLVYCCTLLGLLREG--RSVHATLTRFHFASEVVVMTALIDMYAKCSKIN 388

Query: 305 SGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSAC 364
           S E VF +G TPKDVILYNSMISGYGMHGLGHKAL VYH+MN+E LQPNESTFVSLLSAC
Sbjct: 389 SAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNESTFVSLLSAC 448

Query: 365 SHSGLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGI 424
           SHSGLVEEGI+LF++M K HN TPTDKLYAC VDLLSRAGRLRQAEE+INQMP+ PTSGI
Sbjct: 449 SHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLRQAEELINQMPFTPTSGI 508

Query: 425 LETLLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTE 484
           LETLLNGCLLHKDIELGVK+ADRLLS ESRN S+Y++LSNIYA+A +WD+V ++RGLM E
Sbjct: 509 LETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDSVKYVRGLMME 568

Query: 485 HELKKIPGYSSIE 497
            E+KKIPGYSSIE
Sbjct: 569 QEIKKIPGYSSIE 578

BLAST of Cp4.1LG11g09290 vs. NCBI nr
Match: gi|734425321|gb|KHN43040.1| (Putative pentatricopeptide repeat-containing protein [Glycine soja])

HSP 1 Score: 406.8 bits (1044), Expect = 5.6e-110
Identity = 208/423 (49.17%), Postives = 287/423 (67.85%), Query Frame = 1

Query: 105 VFDKIPQPKTVLCNAMVNGYLQNQHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLD 164
           +FD+   P+T +CNAM+ G+L+NQ + E  +LF++MG C+ E D+YTC F+LKAC  LLD
Sbjct: 1   MFDQCSLPETAVCNAMMAGFLRNQQHTEVPKLFRMMGSCNIEIDTYTCMFSLKACASLLD 60

Query: 165 YEMGIEVIRLALCKGLAGGRFLGSSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGG 224
            E+G+E++R A+ KG     ++GSS++NFLVK G + +A+  F  M EKD VCWN +IGG
Sbjct: 61  DEIGMEIVRTAVRKGFRLHPYVGSSMVNFLVKCGYLDDAQKVFDGMPEKDAVCWNSIIGG 120

Query: 225 FMQEGLFSEGYKLFLDMLYNRIEPSAVTMTSLVQSCGEMRNLEFG--------------- 284
           ++++GLF+E  ++F +M+   + PS VTM S +++CGE    + G               
Sbjct: 121 YVKKGLFTEAIQMFPEMIGGGLRPSPVTMVSSLKACGESGLKKVGMCAHGCVLALGMGND 180

Query: 285 ------------KWRSVHAVLIRFRFAS---DVVAKTALIDMYAKCSEIDSGEKVFNHGF 344
                       K RS HA      +     D V  +ALIDMYAKC +I S EK+FN+GF
Sbjct: 181 SCACCAHLGSLKKGRSAHA---HLIWHGYAFDAVITSALIDMYAKCGKIHSAEKLFNNGF 240

Query: 345 TPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSACSHSGLVEEGI 404
             KDVIL NSMI  YG++  GH AL VY +M +E L PN++TFVSLL+ACSHSGLVEEG 
Sbjct: 241 HLKDVILCNSMIMSYGIYAHGHYALGVYGRMIEERLNPNQTTFVSLLTACSHSGLVEEGK 300

Query: 405 SLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILETLLNGCLL 464
           +LF  ME+ HN+ P DK YAC VDLLSRAGRL +A+ ++ QMP++P++ +LE LL+GC  
Sbjct: 301 ALFHCMERDHNIKPQDKHYACLVDLLSRAGRLEEADALVKQMPFQPSTDVLEALLSGCRT 360

Query: 465 HKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHELKKIPGYS 497
           HK+  +G++IADRL+S +  N  +YV LSNIYAEA +W++VN++RGLM    +KKIPGYS
Sbjct: 361 HKNTNMGIQIADRLISLDYLNSGIYVMLSNIYAEARKWESVNYIRGLMRMQGMKKIPGYS 420

BLAST of Cp4.1LG11g09290 vs. NCBI nr
Match: gi|950998130|ref|XP_014506168.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g12770-like [Vigna radiata var. radiata])

HSP 1 Score: 381.7 bits (979), Expect = 1.9e-102
Identity = 200/431 (46.40%), Postives = 283/431 (65.66%), Query Frame = 1

Query: 68  SIHAQIIITNAISGDQRLVAKLVAAYSSLGSLENARKVFDKIPQPKTVLCNAMVNGYLQN 127
           S HA ++I   +  D  ++  LV  YS+LG  +NA  VF+ +     +  NAM++GY+QN
Sbjct: 294 SAHA-VLIALGMGNDVFVLTSLVDMYSNLGDTDNAALVFNGMFNRSLISWNAMISGYIQN 353

Query: 128 QHYNETIELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGIEVIRLALCKGLAGGRFLG 187
               E+  LF+ + +    FDS T    ++ C    D E G  +    + KG+     L 
Sbjct: 354 GLIPESFALFRRLIQSGSGFDSGTLVSLIRGCSQTSDLENGRLLHACIIRKGIESNIVLS 413

Query: 188 SSILNFLVKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIE 247
           +SI++   K G I  A   F  M +++V+ W  M+ G  Q G   +  KLF  M   ++ 
Sbjct: 414 TSIVDMYSKCGAIKLATNVFGRMEKRNVITWTAMLVGLSQNGYAEDALKLFCQMQEEKVP 473

Query: 248 PSAVTMTSLVQSCGEMRNLEFGKWRSVHAVLIRFRFASDVVAKTALIDMYAKCSEIDSGE 307
            ++VT+ SLV  C  + +L+ G  RSVHA LIR  +A D V  +AL+DMYAKC +I S E
Sbjct: 474 ANSVTLVSLVHCCAHLGSLKKG--RSVHAYLIRHGYAFDAVNMSALLDMYAKCGKIRSAE 533

Query: 308 KVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQE-LQPNESTFVSLLSACSHS 367
           K+FN+GF  KDVIL N+MI GYGMHGLGH AL++Y +M +E L+P+++TFVSLL+ACSHS
Sbjct: 534 KLFNNGFHLKDVILCNTMIMGYGMHGLGHYALALYDRMIEESLKPSQTTFVSLLTACSHS 593

Query: 368 GLVEEGISLFRDMEKAHNVTPTDKLYACFVDLLSRAGRLRQAEEVINQMPYRPTSGILET 427
           GLVEEG +LF  ME+ HN+ P DK YAC VDLLSRAGRL++A+ ++ QMP++P++ + E 
Sbjct: 594 GLVEEGKALFNCMERDHNIKPQDKHYACLVDLLSRAGRLKEADALVKQMPFQPSTDVFEA 653

Query: 428 LLNGCLLHKDIELGVKIADRLLSFESRNLSVYVSLSNIYAEAGQWDTVNHLRGLMTEHEL 487
           LL+GC  HK+I +G++IADRL+S +  N  +YV LSNIYAEAG+W++VN++RGLM    L
Sbjct: 654 LLSGCRTHKNINMGIQIADRLISLDYLNSGIYVMLSNIYAEAGRWESVNYIRGLMRMKGL 713

Query: 488 KKIPGYSSIEV 498
           KK+PGYS IE+
Sbjct: 714 KKVPGYSLIEI 721

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR21_ARATH4.5e-7535.40Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
PPR14_ARATH3.8e-7437.94Pentatricopeptide repeat-containing protein At1g06140, mitochondrial OS=Arabidop... [more]
PP333_ARATH7.4e-7032.67Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN... [more]
PP219_ARATH6.3e-6933.57Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th... [more]
PP265_ARATH3.5e-6734.34Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LYM5_CUCSA1.1e-12556.35Uncharacterized protein OS=Cucumis sativus GN=Csa_1G573630 PE=4 SV=1[more]
A0A0B2SD75_GLYSO3.9e-11049.17Putative pentatricopeptide repeat-containing protein OS=Glycine soja GN=glysoja_... [more]
A0A0L9TN84_PHAAN3.0e-10246.24Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan01g155600 PE=4 SV=1[more]
V7CHX8_PHAVU9.7e-10146.59Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G092700g PE=4 SV=1[more]
A0A061F2D9_THECC6.3e-10044.60Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_026441 PE... [more]
Match NameE-valueIdentityDescription
AT1G08070.12.5e-7635.40 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G06140.12.1e-7537.94 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G21300.14.2e-7132.67 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G08820.13.5e-7033.57 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G03380.11.9e-6833.96 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659099713|ref|XP_008450740.1|3.0e-12756.29PREDICTED: pentatricopeptide repeat-containing protein At4g18750, chloroplastic-... [more]
gi|778662656|ref|XP_011659934.1|3.9e-12756.65PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like ... [more]
gi|700211048|gb|KGN66144.1|1.6e-12556.35hypothetical protein Csa_1G573630 [Cucumis sativus][more]
gi|734425321|gb|KHN43040.1|5.6e-11049.17Putative pentatricopeptide repeat-containing protein [Glycine soja][more]
gi|950998130|ref|XP_014506168.1|1.9e-10246.40PREDICTED: pentatricopeptide repeat-containing protein At3g12770-like [Vigna rad... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG11g09290.1Cp4.1LG11g09290.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 394..416
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 317..364
score: 1.2E-9coord: 213..260
score: 4.1E-12coord: 112..159
score: 8.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 321..345
score: 5.8E-6coord: 216..249
score: 1.3E-4coord: 115..148
score: 2.4E-4coord: 355..388
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 318..348
score: 9.427coord: 113..147
score: 8.1coord: 388..418
score: 7.092coord: 148..182
score: 5.349coord: 183..213
score: 5.886coord: 454..488
score: 7.07coord: 214..248
score: 10.841coord: 352..382
score: 9.383coord: 286..317
score: 5.36coord: 82..112
score: 6
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 321..415
score: 3.6E-4coord: 88..150
score: 3.6E-4coord: 448..472
score: 3.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 64..495
score: 1.6E

The following gene(s) are paralogous to this gene:

None