Cp4.1LG12g11340 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g11340
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Unknown protein
Location: Cp4.1LG12 : 8811497 .. 8813419 (-)
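
The location above gives a scaffold name, start and end coordinates, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual genome-browser convention, not stated on this page), the span length is end - start + 1. The function name `parse_location` and the regular expression are illustrative, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its four parts.

    Assumption: coordinates are 1-based and inclusive.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG12 : 8811497 .. 8813419 (-)")
length = end - start + 1  # inclusive span in base pairs
print(seqid, strand, length)  # Cp4.1LG12 - 1923
```

Under that assumption the feature spans 1923 bp on the minus strand of Cp4.1LG12.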
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTTCCAAGACTGTAGCCTCCGCTATTTCTTCCTCTCTTTCGCCAGCTCCGTCGAAAGGCAAGCTCTCGTCTCTGGTTACTCGCCCACTAAGAATTGCATCTTCAAATCCCCAAAATCTATCTCCAATTCCTGCCGAATCCCCGCCTATCATCTCGAAACCACCAAATTCGAATGTTTCACCGGAAGTTTCGCTGGAAAAGTCAACTACAGAGATTTCGTTTCATGAACGGCTTCGGACGTTCCTTCAGAATTGCAAGACGGGTAATTTTACCGCATCAGAAGCATTTCAATTCTTTGACCTAATGATGCTTGCGAATCCTACCCCTGCGATGTCTTCATTCAATCTTTTACTTGGTGGACTTGCTAAGACTAAGCACTACTCTGAGGTTTTGAAGCTGTATTATAGAATGAGCTTAGCCGGACTTTTGCCTAATTACATCACGCTTAATATTTTGCTTAATTGCCTTTGTAATGTGAATCGGATTAGCGAAGGTCTAGCAGCCATGGCGGGGATTATAAGGAGAGGTTTTATTCCCAATATAGTGACATATACGTCCCTGATTAAGGGCTTGTGTATGGAGCATAGGATTAGTGAAGCTACGCGGCTGTTTATGAGAATGCAAAAGTTGGGTTGTAGGCCAAATGTGATTACTTATGGGACTTTAATCAAGGGACTTTGTCAAACGGGTAATACTAACATTGCACTTAAGTTGCATGAGGAAATGCTCAATGGGACTGGTCGATACGGCATTAGTTGTAAGCCTAACGTTATTTGTTATAGTACCATTATTGATGGGCTCTGTAAAGATGGGCGGGAAGACAAGGCAAGGGAACTTTTTGAGGAAATGAAAGCTCGGAGAATGCTTCCAGATGTGATTTCTTACAGCTCTTTGATTCATGGATTTTGCAATGGTGGAAAGTGGGAGGAGGCTAAATGTCTGTTCAATGAGATGGTGGATCTAGGTATTCAACCAAATGCCGTCACATTTAATGTGTTGATGGATATACTTTGCAAGGCAGGAAAGGTTATCGAGGCCAACGAGTTGCTAGAGGTGATGATTCAGAGAGGTAATGCTCCTGATTTGTTTACTTATAATACATTGATGGATGGGTTCTGTCTGGTTAGTGATCTAGACAGTGCAAGGGAACTGTTCCTTAGTATGCCAAGTAAGGGGTGTGAACCTAATGTGATTAGCTACAATGTGCTAATCAATGGGTACTGCAAAAATTGGAAGGTGGAAGAAGCAATGAAGATTTACAATGAAATGCTTCAAGTGGGCATTAAACCATCTGTGATAACATATAATGCCTTGCTAACGGGTCTTTTTCAGGCCGGGAAGGTTGATGATGCAAAGAAAATATTTGGTGTCATTCAAGCTCATGGTTTGGTGCCGAGTTCAAGTACATTGAGTATCTTTGTAGATGGGCTGTGTAAGAACGATTGTTTGTTAGAAGCAATGGAAATTTTTAACGAGCTTTCATACAACTTGAAATTGGACATTGATATCTTTAATTGTCTTATCGATGGCCTATGTAAAGCAGGGAAACTTGAAACAGCTTGGGAGTTCTTCGACAAAATTTCCCGTGAAGGGCTTCTGCCAAATGTTGTGACCTATTCCATTCTAATCCATGGGTATTGTAAAGAAGGACAAGTAGAAAAGGCAAATGATTTATTTCGAAAGATGGAAGAAAATGGTTGTACCCCCAATGTAATTACCTACAATACACTTCTCCGTGGTTTCTACAAAAGTAATAAACGAGAGGAAGTGGTTGAGCTTCTTCATAGGATGGTTAAGAAAAATGTGGTGCCAGATGCTTCAACCTGCACCATAGTCTTAGACATGCTTTCCAAAGACGAAAAGTATAGGGGATGTTTGAACTTGCTTCCAACGTTTCCTGTCCAAGAGTATCGAGGTTGA

mRNA sequence

ATGGTTTCCAAGACTGTAGCCTCCGCTATTTCTTCCTCTCTTTCGCCAGCTCCGTCGAAAGGCAAGCTCTCGTCTCTGGTTACTCGCCCACTAAGAATTGCATCTTCAAATCCCCAAAATCTATCTCCAATTCCTGCCGAATCCCCGCCTATCATCTCGAAACCACCAAATTCGAATGTTTCACCGGAAGTTTCGCTGGAAAAGTCAACTACAGAGATTTCGTTTCATGAACGGCTTCGGACGTTCCTTCAGAATTGCAAGACGGGTAATTTTACCGCATCAGAAGCATTTCAATTCTTTGACCTAATGATGCTTGCGAATCCTACCCCTGCGATGTCTTCATTCAATCTTTTACTTGGTGGACTTGCTAAGACTAAGCACTACTCTGAGGTTTTGAAGCTGTATTATAGAATGAGCTTAGCCGGACTTTTGCCTAATTACATCACGCTTAATATTTTGCTTAATTGCCTTTGTAATGTGAATCGGATTAGCGAAGGTCTAGCAGCCATGGCGGGGATTATAAGGAGAGGTTTTATTCCCAATATAGTGACATATACGTCCCTGATTAAGGGCTTGTGTATGGAGCATAGGATTAGTGAAGCTACGCGGCTGTTTATGAGAATGCAAAAGTTGGGTTGTAGGCCAAATGTGATTACTTATGGGACTTTAATCAAGGGACTTTGTCAAACGGGTAATACTAACATTGCACTTAAGTTGCATGAGGAAATGCTCAATGGGACTGGTCGATACGGCATTAGTTGTAAGCCTAACGTTATTTGTTATAGTACCATTATTGATGGGCTCTGTAAAGATGGGCGGGAAGACAAGGCAAGGGAACTTTTTGAGGAAATGAAAGCTCGGAGAATGCTTCCAGATGTGATTTCTTACAGCTCTTTGATTCATGGATTTTGCAATGGTGGAAAGTGGGAGGAGGCTAAATGTCTGTTCAATGAGATGGTGGATCTAGGTATTCAACCAAATGCCGTCACATTTAATGTGTTGATGGATATACTTTGCAAGGCAGGAAAGGTTATCGAGGCCAACGAGTTGCTAGAGGTGATGATTCAGAGAGGTAATGCTCCTGATTTGTTTACTTATAATACATTGATGGATGGGTTCTGTCTGGTTAGTGATCTAGACAGTGCAAGGGAACTGTTCCTTAGTATGCCAAGTAAGGGGTGTGAACCTAATGTGATTAGCTACAATGTGCTAATCAATGGGTACTGCAAAAATTGGAAGGTGGAAGAAGCAATGAAGATTTACAATGAAATGCTTCAAGTGGGCATTAAACCATCTGTGATAACATATAATGCCTTGCTAACGGGTCTTTTTCAGGCCGGGAAGGTTGATGATGCAAAGAAAATATTTGGTGTCATTCAAGCTCATGGTTTGGTGCCGAGTTCAAGTACATTGAGTATCTTTGTAGATGGGCTGTGTAAGAACGATTGTTTGTTAGAAGCAATGGAAATTTTTAACGAGCTTTCATACAACTTGAAATTGGACATTGATATCTTTAATTGTCTTATCGATGGCCTATGTAAAGCAGGGAAACTTGAAACAGCTTGGGAGTTCTTCGACAAAATTTCCCGTGAAGGGCTTCTGCCAAATGTTGTGACCTATTCCATTCTAATCCATGGGTATTGTAAAGAAGGACAAGTAGAAAAGGCAAATGATTTATTTCGAAAGATGGAAGAAAATGGTTGTACCCCCAATGTAATTACCTACAATACACTTCTCCGTGGTTTCTACAAAAGTAATAAACGAGAGGAAGTGGTTGAGCTTCTTCATAGGATGGTTAAGAAAAATGTGGTGCCAGATGCTTCAACCTGCACCATAGTCTTAGACATGCTTTCCAAAGACGAAAAGTATAGGGGATGTTTGAACTTGCTTCCAACGTTTCCTGTCCAAGAGTATCGAGGTTGA

Coding sequence (CDS)

ATGGTTTCCAAGACTGTAGCCTCCGCTATTTCTTCCTCTCTTTCGCCAGCTCCGTCGAAAGGCAAGCTCTCGTCTCTGGTTACTCGCCCACTAAGAATTGCATCTTCAAATCCCCAAAATCTATCTCCAATTCCTGCCGAATCCCCGCCTATCATCTCGAAACCACCAAATTCGAATGTTTCACCGGAAGTTTCGCTGGAAAAGTCAACTACAGAGATTTCGTTTCATGAACGGCTTCGGACGTTCCTTCAGAATTGCAAGACGGGTAATTTTACCGCATCAGAAGCATTTCAATTCTTTGACCTAATGATGCTTGCGAATCCTACCCCTGCGATGTCTTCATTCAATCTTTTACTTGGTGGACTTGCTAAGACTAAGCACTACTCTGAGGTTTTGAAGCTGTATTATAGAATGAGCTTAGCCGGACTTTTGCCTAATTACATCACGCTTAATATTTTGCTTAATTGCCTTTGTAATGTGAATCGGATTAGCGAAGGTCTAGCAGCCATGGCGGGGATTATAAGGAGAGGTTTTATTCCCAATATAGTGACATATACGTCCCTGATTAAGGGCTTGTGTATGGAGCATAGGATTAGTGAAGCTACGCGGCTGTTTATGAGAATGCAAAAGTTGGGTTGTAGGCCAAATGTGATTACTTATGGGACTTTAATCAAGGGACTTTGTCAAACGGGTAATACTAACATTGCACTTAAGTTGCATGAGGAAATGCTCAATGGGACTGGTCGATACGGCATTAGTTGTAAGCCTAACGTTATTTGTTATAGTACCATTATTGATGGGCTCTGTAAAGATGGGCGGGAAGACAAGGCAAGGGAACTTTTTGAGGAAATGAAAGCTCGGAGAATGCTTCCAGATGTGATTTCTTACAGCTCTTTGATTCATGGATTTTGCAATGGTGGAAAGTGGGAGGAGGCTAAATGTCTGTTCAATGAGATGGTGGATCTAGGTATTCAACCAAATGCCGTCACATTTAATGTGTTGATGGATATACTTTGCAAGGCAGGAAAGGTTATCGAGGCCAACGAGTTGCTAGAGGTGATGATTCAGAGAGGTAATGCTCCTGATTTGTTTACTTATAATACATTGATGGATGGGTTCTGTCTGGTTAGTGATCTAGACAGTGCAAGGGAACTGTTCCTTAGTATGCCAAGTAAGGGGTGTGAACCTAATGTGATTAGCTACAATGTGCTAATCAATGGGTACTGCAAAAATTGGAAGGTGGAAGAAGCAATGAAGATTTACAATGAAATGCTTCAAGTGGGCATTAAACCATCTGTGATAACATATAATGCCTTGCTAACGGGTCTTTTTCAGGCCGGGAAGGTTGATGATGCAAAGAAAATATTTGGTGTCATTCAAGCTCATGGTTTGGTGCCGAGTTCAAGTACATTGAGTATCTTTGTAGATGGGCTGTGTAAGAACGATTGTTTGTTAGAAGCAATGGAAATTTTTAACGAGCTTTCATACAACTTGAAATTGGACATTGATATCTTTAATTGTCTTATCGATGGCCTATGTAAAGCAGGGAAACTTGAAACAGCTTGGGAGTTCTTCGACAAAATTTCCCGTGAAGGGCTTCTGCCAAATGTTGTGACCTATTCCATTCTAATCCATGGGTATTGTAAAGAAGGACAAGTAGAAAAGGCAAATGATTTATTTCGAAAGATGGAAGAAAATGGTTGTACCCCCAATGTAATTACCTACAATACACTTCTCCGTGGTTTCTACAAAAGTAATAAACGAGAGGAAGTGGTTGAGCTTCTTCATAGGATGGTTAAGAAAAATGTGGTGCCAGATGCTTCAACCTGCACCATAGTCTTAGACATGCTTTCCAAAGACGAAAAGTATAGGGGATGTTTGAACTTGCTTCCAACGTTTCCTGTCCAAGAGTATCGAGGTTGA

Protein sequence

MVSKTVASAISSSLSPAPSKGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVSLEKSTTEISFHERLRTFLQNCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQEYRG
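
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the first and last codons of the CDS can be checked against the ends of the protein. The helper name `translate` is hypothetical, and the two DNA fragments are copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGGTTTCCAAGACTGTAGCC"))  # MVSKTVA
print(translate("CAAGAGTATCGAGGTTGA"))     # QEYRG*
```

Both fragments agree with the protein sequence, which begins MVSKTVA and ends QEYRG.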
BLAST of Cp4.1LG12g11340 vs. Swiss-Prot
Match: PPR38_ARATH (Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS=Arabidopsis thaliana GN=At1g12700 PE=3 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 6.1e-117
Identity = 209/563 (37.12%), Positives = 339/563 (60.21%), Query Frame = 1

Query: 69  STTEISFHERLRTFLQNCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHY 128
           S   + F ERLR+ + + K       +A   F  M+ + P P++  F+     +A+TK +
Sbjct: 50  SNGNVCFRERLRSGIVDIKK-----DDAIALFQEMIRSRPLPSLVDFSRFFSAIARTKQF 109

Query: 129 SEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSL 188
           + VL    ++ L G+  N  TLNI++NC C   +     + +  +++ G+ P+  T+ +L
Sbjct: 110 NLVLDFCKQLELNGIAHNIYTLNIMINCFCRCCKTCFAYSVLGKVMKLGYEPDTTTFNTL 169

Query: 189 IKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTG 248
           IKGL +E ++SEA  L  RM + GC+P+V+TY +++ G+C++G+T++AL L  +M     
Sbjct: 170 IKGLFLEGKVSEAVVLVDRMVENGCQPDVVTYNSIVNGICRSGDTSLALDLLRKMEER-- 229

Query: 249 RYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGK 308
               + K +V  YSTIID LC+DG  D A  LF+EM+ + +   V++Y+SL+ G C  GK
Sbjct: 230 ----NVKADVFTYSTIIDSLCRDGCIDAAISLFKEMETKGIKSSVVTYNSLVRGLCKAGK 289

Query: 309 WEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNT 368
           W +   L  +MV   I PN +TFNVL+D+  K GK+ EANEL + MI RG +P++ TYNT
Sbjct: 290 WNDGALLLKDMVSREIVPNVITFNVLLDVFVKEGKLQEANELYKEMITRGISPNIITYNT 349

Query: 369 LMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVG 428
           LMDG+C+ + L  A  +   M    C P+++++  LI GYC   +V++ MK++  + + G
Sbjct: 350 LMDGYCMQNRLSEANNMLDLMVRNKCSPDIVTFTSLIKGYCMVKRVDDGMKVFRNISKRG 409

Query: 429 IKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAM 488
           +  + +TY+ L+ G  Q+GK+  A+++F  + +HG++P   T  I +DGLC N  L +A+
Sbjct: 410 LVANAVTYSILVQGFCQSGKIKLAEELFQEMVSHGVLPDVMTYGILLDGLCDNGKLEKAL 469

Query: 489 EIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGY 548
           EIF +L  + + L I ++  +I+G+CK GK+E AW  F  +  +G+ PNV+TY+++I G 
Sbjct: 470 EIFEDLQKSKMDLGIVMYTTIIEGMCKGGKVEDAWNLFCSLPCKGVKPNVMTYTVMISGL 529

Query: 549 CKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDA 608
           CK+G + +AN L RKMEE+G  PN  TYNTL+R   +        +L+  M       DA
Sbjct: 530 CKKGSLSEANILLRKMEEDGNAPNDCTYNTLIRAHLRDGDLTASAKLIEEMKSCGFSADA 589

Query: 609 STCTIVLDMLSKDEKYRGCLNLL 631
           S+  +V+DML   E  +  L++L
Sbjct: 590 SSIKMVIDMLLSGELDKSFLDML 601
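
The percentages on each HSP statistics line are the counts divided by the alignment length. The sketch below recomputes them for the first hit; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (count, alignment_length)} for each 'Label = a/b' field."""
    stats = {}
    for label, count, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[label] = (int(count), int(total))
    return stats

line = "Identity = 209/563 (37.12%), Positives = 339/563 (60.21%), Query Frame = 1"
for label, (count, total) in parse_hsp_stats(line).items():
    print(f"{label}: {100 * count / total:.2f}%")
# Identity: 37.12%
# Positives: 60.21%
```

The `Query Frame = 1` field carries no fraction, so the pattern skips it.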

BLAST of Cp4.1LG12g11340 vs. Swiss-Prot
Match: PPR90_ARATH (Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana GN=At1g62590 PE=2 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 2.6e-112
Identity = 202/518 (39.00%), Positives = 323/518 (62.36%), Query Frame = 1

Query: 114 SFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGI 173
           ++N+L+    +    S  L L  +M   G  P+ +TL+ LLN  C+  RIS+ +A +  +
Sbjct: 122 TYNILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQM 181

Query: 174 IRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNT 233
           +  G+ P+ +T+T+LI GL + ++ SEA  L  RM + GC+PN++TYG ++ GLC+ G+T
Sbjct: 182 VEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKRGDT 241

Query: 234 NIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDV 293
           ++AL L  +M           + +V+ ++TIID LCK    D A  LF+EM+ + + P+V
Sbjct: 242 DLALNLLNKM------EAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNV 301

Query: 294 ISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEV 353
           ++YSSLI   C+ G+W +A  L ++M++  I PN VTFN L+D   K GK +EA +L + 
Sbjct: 302 VTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDD 361

Query: 354 MIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWK 413
           MI+R   PD+FTYN+L++GFC+   LD A+++F  M SK C P+V++YN LI G+CK+ +
Sbjct: 362 MIKRSIDPDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKR 421

Query: 414 VEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSI 473
           VE+  +++ EM   G+    +TY  L+ GLF  G  D+A+K+F  + + G+ P   T SI
Sbjct: 422 VEDGTELFREMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSI 481

Query: 474 FVDGLCKNDCLLEAMEIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREG 533
            +DGLC N  L +A+E+F+ +  + +KLDI I+  +I+G+CKAGK++  W+ F  +S +G
Sbjct: 482 LLDGLCNNGKLEKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKG 541

Query: 534 LLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVV 593
           + PNVVTY+ +I G C +  +++A  L +KM+E+G  PN  TYNTL+R   +   +    
Sbjct: 542 VKPNVVTYNTMISGLCSKRLLQEAYALLKKMKEDGPLPNSGTYNTLIRAHLRDGDKAASA 601

Query: 594 ELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLL 631
           EL+  M     V DAST  +V +ML      +  L++L
Sbjct: 602 ELIREMRSCRFVGDASTIGLVANMLHDGRLDKSFLDML 633

BLAST of Cp4.1LG12g11340 vs. Swiss-Prot
Match: PPR94_ARATH (Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana GN=At1g62910 PE=2 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 2.6e-112
Identity = 204/518 (39.38%), Positives = 323/518 (62.36%), Query Frame = 1

Query: 114 SFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGI 173
           ++++ +    +    S  L +  +M   G  P+ +TL+ LLN  C+  RIS+ +A +  +
Sbjct: 120 TYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQM 179

Query: 174 IRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNT 233
           +  G+ P+  T+T+LI GL + ++ SEA  L  +M + GC+P+++TYGT++ GLC+ G+ 
Sbjct: 180 VEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDI 239

Query: 234 NIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDV 293
           ++AL L ++M  G        + +V+ Y+TIIDGLCK    D A  LF EM  + + PDV
Sbjct: 240 DLALSLLKKMEKG------KIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPDV 299

Query: 294 ISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEV 353
            +YSSLI   CN G+W +A  L ++M++  I PN VTF+ L+D   K GK++EA +L + 
Sbjct: 300 FTYSSLISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDE 359

Query: 354 MIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWK 413
           MI+R   PD+FTY++L++GFC+   LD A+ +F  M SK C PNV++Y+ LI G+CK  +
Sbjct: 360 MIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKR 419

Query: 414 VEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSI 473
           VEE M+++ EM Q G+  + +TY  L+ G FQA   D+A+ +F  + + G+ P+  T +I
Sbjct: 420 VEEGMELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYNI 479

Query: 474 FVDGLCKNDCLLEAMEIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREG 533
            +DGLCKN  L +AM +F  L  + ++ DI  +N +I+G+CKAGK+E  WE F  +S +G
Sbjct: 480 LLDGLCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLKG 539

Query: 534 LLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVV 593
           + PNV+ Y+ +I G+C++G  E+A+ L +KM+E+G  PN  TYNTL+R   +   RE   
Sbjct: 540 VSPNVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDREASA 599

Query: 594 ELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLL 631
           EL+  M       DAST  +V +ML      +  L++L
Sbjct: 600 ELIKEMRSCGFAGDASTIGLVTNMLHDGRLDKSFLDML 631

BLAST of Cp4.1LG12g11340 vs. Swiss-Prot
Match: PP247_ARATH (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana GN=At3g22470 PE=2 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 5.9e-112
Identity = 203/490 (41.43%), Positives = 312/490 (63.67%), Query Frame = 1

Query: 142 GLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHRISEA 201
           G  P+ IT + L+N  C   R+SE +A +  ++     P++VT ++LI GLC++ R+SEA
Sbjct: 135 GYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEA 194

Query: 202 TRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPNVICY 261
             L  RM + G +P+ +TYG ++  LC++GN+ +AL L  +M         + K +V+ Y
Sbjct: 195 LVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEER------NIKASVVQY 254

Query: 262 STIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFNEMVD 321
           S +ID LCKDG  D A  LF EM+ + +  DV++YSSLI G CN GKW++   +  EM+ 
Sbjct: 255 SIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIG 314

Query: 322 LGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDS 381
             I P+ VTF+ L+D+  K GK++EA EL   MI RG APD  TYN+L+DGFC  + L  
Sbjct: 315 RNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHE 374

Query: 382 ARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSVITYNALLT 441
           A ++F  M SKGCEP++++Y++LIN YCK  +V++ M+++ E+   G+ P+ ITYN L+ 
Sbjct: 375 ANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVL 434

Query: 442 GLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNELSYN-LKL 501
           G  Q+GK++ AK++F  + + G+ PS  T  I +DGLC N  L +A+EIF ++  + + L
Sbjct: 435 GFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTL 494

Query: 502 DIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKEGQVEKANDLF 561
            I I+N +I G+C A K++ AW  F  +S +G+ P+VVTY+++I G CK+G + +A+ LF
Sbjct: 495 GIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEADMLF 554

Query: 562 RKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTCTIVLDMLSKD 621
           RKM+E+GCTP+  TYN L+R     +     VEL+  M       D+ST  +V+DMLS  
Sbjct: 555 RKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEMKVCGFSADSSTIKMVIDMLSDR 614

Query: 622 EKYRGCLNLL 631
              +  L++L
Sbjct: 615 RLDKSFLDML 618

BLAST of Cp4.1LG12g11340 vs. Swiss-Prot
Match: PPR96_ARATH (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 406.4 bits (1043), Expect = 5.9e-112
Identity = 207/518 (39.96%), Positives = 321/518 (61.97%), Query Frame = 1

Query: 114 SFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGI 173
           S+N+L+    +       L +  +M   G  P+ +TL+ LLN  C+  RISE +A +  +
Sbjct: 117 SYNILINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRISEAVALVDQM 176

Query: 174 IRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNT 233
               + PN VT+ +LI GL + ++ SEA  L  RM   GC+P++ TYGT++ GLC+ G+ 
Sbjct: 177 FVMEYQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDI 236

Query: 234 NIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDV 293
           ++AL L ++M  G        + +V+ Y+TIID LC     + A  LF EM  + + P+V
Sbjct: 237 DLALSLLKKMEKG------KIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNV 296

Query: 294 ISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEV 353
           ++Y+SLI   CN G+W +A  L ++M++  I PN VTF+ L+D   K GK++EA +L + 
Sbjct: 297 VTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDE 356

Query: 354 MIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWK 413
           MI+R   PD+FTY++L++GFC+   LD A+ +F  M SK C PNV++YN LI G+CK  +
Sbjct: 357 MIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKR 416

Query: 414 VEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSI 473
           VEE M+++ EM Q G+  + +TYN L+ GLFQAG  D A+KIF  + + G+ P   T SI
Sbjct: 417 VEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSI 476

Query: 474 FVDGLCKNDCLLEAMEIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREG 533
            +DGLCK   L +A+ +F  L  + ++ DI  +N +I+G+CKAGK+E  W+ F  +S +G
Sbjct: 477 LLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKG 536

Query: 534 LLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVV 593
           + PNV+ Y+ +I G+C++G  E+A+ LFR+M+E+G  PN  TYNTL+R   +   +    
Sbjct: 537 VKPNVIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASA 596

Query: 594 ELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLL 631
           EL+  M     V DAST ++V++ML      +  L +L
Sbjct: 597 ELIKEMRSCGFVGDASTISMVINMLHDGRLEKSYLEML 628

BLAST of Cp4.1LG12g11340 vs. TrEMBL
Match: D7SQM4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0141g00780 PE=4 SV=1)

HSP 1 Score: 704.5 bits (1817), Expect = 1.2e-199
Identity = 356/636 (55.97%), Positives = 460/636 (72.33%), Query Frame = 1

Query: 7   ASAISSSLSPAPSKGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVSL 66
           ASAISSS +    + KLSSL   P R  S  P +L+         +S  P+         
Sbjct: 12  ASAISSSRTTP--RCKLSSLFEHPHRPISPGPISLTK------DTVSNAPDRG------- 71

Query: 67  EKSTTEISFHERLRTFLQ-NCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKT 126
                      +L  FL+ NCK+G+   SEAF  F+ ++   PTP +SSFN LLG +AK 
Sbjct: 72  -----------QLENFLKSNCKSGHIKRSEAFSVFNHLIDMQPTPPISSFNTLLGAVAKI 131

Query: 127 KHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTY 186
           K Y +V+ LY RMSL GL P++ITLNIL+NC CN+N++  GLA +  ++RRG  PN VT+
Sbjct: 132 KRYFDVISLYKRMSLIGLAPDFITLNILINCYCNLNKVDFGLAVLGEMLRRGHSPNTVTF 191

Query: 187 TSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLN 246
           TSL+KGLC+  RISEAT L  +M ++G RPNV+TYGTL+ GLC TGNT +A+KLHEEMLN
Sbjct: 192 TSLVKGLCLGSRISEATGLLRKMVRMGYRPNVVTYGTLLNGLCMTGNTMLAVKLHEEMLN 251

Query: 247 GTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCN 306
           G G +G++ KPN++CY TIID LCKDG  DK +ELF EMK R + PDV++YSS+IHG C+
Sbjct: 252 GNGGFGVTIKPNLVCYCTIIDSLCKDGLIDKGKELFLEMKGRGISPDVVAYSSIIHGMCH 311

Query: 307 GGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFT 366
            G+WE AK LFNEMVD G+ PN VTFNVL+D LCKAGK+ EAN LL++MIQRG +PD FT
Sbjct: 312 TGRWEGAKGLFNEMVDEGVHPNVVTFNVLIDALCKAGKMEEANHLLKLMIQRGESPDTFT 371

Query: 367 YNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEML 426
           YNTL+DGFCL   +D AR+LF+SM SKG E + +SYNVLINGYCK+ ++ EA K+Y EM+
Sbjct: 372 YNTLIDGFCLEGRIDDARDLFVSMESKGIETDAVSYNVLINGYCKSGRMVEAKKLYREMM 431

Query: 427 QVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLL 486
              I P+VITYN LLTGLF+ GKV DA  +FG ++ H L P S T +I +DGLCKN+ L 
Sbjct: 432 CKEIMPTVITYNTLLTGLFREGKVRDAWNLFGEMKVHDLTPESCTYNILLDGLCKNNHLS 491

Query: 487 EAMEIFNEL-SYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILI 546
           EAME+F+ L +++ +  I IFNCLIDGLCKA K+E A E F+++S EGL PNV+TY+++I
Sbjct: 492 EAMELFHYLENHDFQPSIQIFNCLIDGLCKARKIEIARELFNRLSHEGLEPNVITYTVMI 551

Query: 547 HGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVV 606
           HG CK GQ+E A DLF  MEE GC PN++T+NTL+RGF ++++ ++VVELL  M +K+  
Sbjct: 552 HGLCKSGQLENAKDLFLGMEEKGCAPNLVTFNTLMRGFCQNDEMQKVVELLQEMAEKDFS 611

Query: 607 PDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQEYRG 641
           PDAST +IV+D+LSKDEKYR  L+LLPTFP Q   G
Sbjct: 612 PDASTISIVVDLLSKDEKYREYLHLLPTFPAQGQTG 621

BLAST of Cp4.1LG12g11340 vs. TrEMBL
Match: M5VNT7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003905mg PE=4 SV=1)

HSP 1 Score: 698.0 bits (1800), Expect = 1.1e-197
Identity = 327/533 (61.35%), Positives = 410/533 (76.92%), Query Frame = 1

Query: 103 MMLANPTPAMSSFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNR 162
           M+   PTP + SFN L GGLAK+KH S+V   Y ++   GLLPN+ITLNILLNC CNVNR
Sbjct: 1   MIQMQPTPPIWSFNRLFGGLAKSKHCSQVFLFYNKLISVGLLPNFITLNILLNCFCNVNR 60

Query: 163 ISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGT 222
             +G   M  ++RRG+ P+ VTYT+L+KGLCME RI  ATRLF  M KLGC+P V+T+GT
Sbjct: 61  ARDGFVVMGSLLRRGYRPSTVTYTALLKGLCMEDRIDVATRLFKTMIKLGCQPTVVTFGT 120

Query: 223 LIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFE 282
           LI GLC+TGNTN+AL+LHEEM NG G YG+ CKP+V+ Y TIIDGLCK G  DKA+ELF 
Sbjct: 121 LINGLCRTGNTNVALRLHEEMANGNGVYGVECKPSVVSYGTIIDGLCKAGLVDKAKELFI 180

Query: 283 EMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAG 342
           EMK R  +PDVI YS+LIH      KWE AK L NEMVD G++PN VTFNVL+ +LC+ G
Sbjct: 181 EMKDRGFVPDVIVYSALIHELYYNEKWEAAKALLNEMVDQGVRPNVVTFNVLIGVLCRRG 240

Query: 343 KVIEANELLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYN 402
            + E+++LL++MIQRG  PD+FTYNTLMDGFCL   L+ ARELF S+PS+GCEP+ ISYN
Sbjct: 241 HLKESSDLLKLMIQRGIDPDVFTYNTLMDGFCLAGRLNEARELFHSIPSRGCEPDAISYN 300

Query: 403 VLINGYCKNWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAH 462
           VLINGYCKN  ++EA+ +Y EM+  G +P+VITYN+LLTGLF  GKV DA+++FG +Q  
Sbjct: 301 VLINGYCKNRNIQEAVNLYKEMIGKGTRPTVITYNSLLTGLFHMGKVQDAQELFGEMQTQ 360

Query: 463 GLVPSSSTLSIFVDGLCKNDCLLEAMEIFNEL-SYNLKLDIDIFNCLIDGLCKAGKLETA 522
            L+P+S+T  I +DGLCK DC+ EAME+F  L + N K+ +++ NCLIDG CKAG LE A
Sbjct: 361 NLLPNSTTYKILLDGLCKTDCVPEAMEVFRTLENCNFKISVEMLNCLIDGFCKAGNLEVA 420

Query: 523 WEFFDKISREGLLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRG 582
           W+ F  +S  GL PNV+TYS++IHG C EGQ+EKAN LF +ME NGC PNVI YN L+RG
Sbjct: 421 WDLFLTLSNRGLAPNVITYSVMIHGLCIEGQLEKANGLFIEMEANGCAPNVIIYNILMRG 480

Query: 583 FYKSNKREEVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFP 635
           F +S+   +VVELLH MV +N+ PD+ T +IV+D+LSKDEKYR CL+LLPTFP
Sbjct: 481 FCQSDDSAKVVELLHMMVARNLSPDSCTISIVIDLLSKDEKYRKCLDLLPTFP 533

BLAST of Cp4.1LG12g11340 vs. TrEMBL
Match: A0A061DX07_THECC (Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_004172 PE=4 SV=1)

HSP 1 Score: 683.3 bits (1762), Expect = 2.8e-193
Identity = 350/648 (54.01%), Positives = 461/648 (71.14%), Query Frame = 1

Query: 1   MVSKTV--------ASAISSSLSPAPSKGKLSSLVTRPLR-IASSNPQNLSPIPAESPPI 60
           MVSKT          +A+SSS S +  KGKLSSL T P   I S+NP++ +P        
Sbjct: 1   MVSKTALRFASAANGAALSSSSSKSTPKGKLSSLFTHPKNPIFSNNPKSSNP-------- 60

Query: 61  ISKPPNSNVSPEVSLEKSTTEISFHERLRTFLQ-NCKTGNFTASEAFQFFDLMMLANPTP 120
                           K T E    + L  FL+ +CK+G  T +EA  FFD M    P P
Sbjct: 61  ---------------NKETVESQ--DPLNKFLKTSCKSGTITLNEALNFFDEMTQMKPFP 120

Query: 121 AMSSFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAM 180
            MSSFNLLLG L K K ++ V+ LY ++   G+ P++ITLNILLNCLC+++R+S G A +
Sbjct: 121 PMSSFNLLLGALVKIKQHNHVVVLYKKLGSIGISPDFITLNILLNCLCHMSRVSFGFAVL 180

Query: 181 AGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQT 240
             + R G+ PN VT+TSL+KGLCME++I EATRLF +M   GC+P++++YGTLI GLC+ 
Sbjct: 181 GRVFRWGYRPNTVTFTSLVKGLCMENKICEATRLFRKMVVFGCQPSIVSYGTLINGLCRM 240

Query: 241 GNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRML 300
           GNT++AL+L+EEM+ G G      +PNV+ Y +IID LCK+G  +KARE+F EMK + + 
Sbjct: 241 GNTSVALRLYEEMVRGNG----VLEPNVVIYGSIIDCLCKEGMLEKAREIFLEMKGKGIH 300

Query: 301 PDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANEL 360
           PDV+ YSSL+HGFC  G  EEAK LF EMVD G+QPN VTFNVL+D LCK  K+ EAN L
Sbjct: 301 PDVVVYSSLLHGFCCMGDLEEAKGLFVEMVDQGVQPNVVTFNVLIDALCKVEKLEEANGL 360

Query: 361 LEVMIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCK 420
           L++MIQRG  PD+FTYNTLMDG+CL   L+ AR+LF+SM SK    N ISYN++INGYCK
Sbjct: 361 LDLMIQRGVDPDIFTYNTLMDGYCLAGKLNVARDLFVSMQSKENRQNAISYNIMINGYCK 420

Query: 421 NWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSST 480
           NWKV+EAM +Y EM+   I+P+VITYN LLTG FQAG+V++A+++FG +Q   +  +S T
Sbjct: 421 NWKVDEAMSLYMEMICKRIRPTVITYNTLLTGFFQAGRVEEARELFGKLQVDNITLNSCT 480

Query: 481 LSIFVDGLCKNDCLLEAMEIFNEL-SYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKIS 540
            + FVDGLCKN C+ EA+E+F++L +   K  I++FN LIDGLCK GKL+TAWE F  + 
Sbjct: 481 YNTFVDGLCKNGCVSEALELFHKLENCKFKFSIEMFNSLIDGLCKTGKLQTAWELFYGLP 540

Query: 541 REGLLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKRE 600
            +GL P VVT+SI+IHG CKEGQ+EKANDL  +MEE GC+PNV+T+NTL+ GF ++N+ +
Sbjct: 541 NKGLEPTVVTFSIMIHGLCKEGQLEKANDLLIEMEEKGCSPNVVTFNTLMHGFSQNNETQ 600

Query: 601 EVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQE 638
           ++VELL +MV+K + PDAST + V+D+LSKDE Y   L LLPTFPVQE
Sbjct: 601 KMVELLQKMVEKKLSPDASTISAVVDLLSKDEAYHETLKLLPTFPVQE 619

BLAST of Cp4.1LG12g11340 vs. TrEMBL
Match: W9S0N2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005753 PE=4 SV=1)

HSP 1 Score: 681.0 bits (1756), Expect = 1.4e-192
Identity = 318/536 (59.33%), Positives = 413/536 (77.05%), Query Frame = 1

Query: 103 MMLANPTPAMSSFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNR 162
           M+   PTP +S FN LL  LAK KHYS+V  LY RM+ +GLLPN+ITL+IL+NC CN NR
Sbjct: 1   MIRTKPTPPVSLFNCLLSALAKKKHYSDVFPLYNRMNSSGLLPNFITLSILINCFCNTNR 60

Query: 163 ISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGT 222
           +S+    +  ++RRGF PNIV+YTSLIKGL ME +I EA RL+  M KLGC+P V+TYGT
Sbjct: 61  VSDAFVVVGSMLRRGFSPNIVSYTSLIKGLFMEEKIFEAIRLYKNMFKLGCKPTVVTYGT 120

Query: 223 LIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFE 282
           LI GLC+TGNTNIALKLHEEM +G GR+G+ CKPN++ Y  IIDGLCK+G  +KA+E+F 
Sbjct: 121 LIDGLCRTGNTNIALKLHEEMSSGNGRHGLDCKPNLVSYGAIIDGLCKEGMVEKAKEIFL 180

Query: 283 EMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAG 342
           EMKA  + PDV+ YSSLIHG C   KW+EAK LF EM+D G++PN VTFNV +D L K G
Sbjct: 181 EMKAAGIAPDVVVYSSLIHGLCFDDKWDEAKSLFVEMMDHGVRPNTVTFNVWIDALSKKG 240

Query: 343 KVIEANELLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYN 402
           K  +A ++LE+M++RG + DLFTYN L+DGFC    L+ A+ELF S+P+KGCEPN  SYN
Sbjct: 241 KRKQARDMLEMMVERGVSADLFTYNALLDGFCRAGKLEKAKELFYSLPAKGCEPNSYSYN 300

Query: 403 VLINGYCKNWKVEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAH 462
           +LI  +    K+EE +KI+ EM+  G+ P+V+TYN LL GLF AGKVDDA+K+F  +Q H
Sbjct: 301 MLIRVHIMKRKIEETLKIFREMIPKGVSPTVVTYNTLLCGLFHAGKVDDARKMFDEMQDH 360

Query: 463 GLVPSSSTLSIFVDGLCKNDCLLEAMEIFNEL-SYNLKLDIDIFNCLIDGLCKAGKLETA 522
           G+VP+SST SI +DGLCKN+ + EA+EIF  L   N ++ I + N +I+GLCK+GK E A
Sbjct: 361 GMVPNSSTYSILLDGLCKNNRVEEALEIFRTLKDSNFEMTIQVMNTIINGLCKSGKFEIA 420

Query: 523 WEFFDKISREGLLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRG 582
           WE F ++ + G +PNV+TYSI+I+G CKEGQ+EKAN LF +ME  GC+PN +TYNTL+R 
Sbjct: 421 WELFQRLLQRGPVPNVITYSIIINGLCKEGQLEKANCLFLEMEGQGCSPNYVTYNTLMRS 480

Query: 583 FYKSNKREEVVELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLLPTFPVQE 638
           F  +N+  +VVELLH+M K+NV PDAST +IV+D++SKD+KYR CL+LLPTFP+QE
Sbjct: 481 FCDNNELPKVVELLHQMAKRNVQPDASTFSIVIDLVSKDKKYRECLDLLPTFPMQE 536

BLAST of Cp4.1LG12g11340 vs. TrEMBL
Match: A0A059A8Z6_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K03302 PE=4 SV=1)

HSP 1 Score: 644.8 bits (1662), Expect = 1.1e-181
Identity = 320/629 (50.87%), Positives = 437/629 (69.48%), Query Frame = 1

Query: 7   ASAISSSLSP-APSKGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVS 66
           + A+ SS SP + S+GKLSSL T P R    NP++  P    +  I   PP  +   E+S
Sbjct: 3   SKALRSSSSPVSASEGKLSSLFTYPRR---RNPRSREPRQEPAVEISPPPPQKS---ELS 62

Query: 67  LEKSTTEISFHERLRTFLQN-CKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAK 126
              +T E     RL  FL+  CK+G+FT  EA   FD M+   P P  SSF LLLG LAK
Sbjct: 63  DRNTTQED--RTRLALFLETKCKSGDFTVGEALHHFDRMVSMRPAPPASSFRLLLGALAK 122

Query: 127 TKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVT 186
             H S V+ L  ++  AGLL +YI LNILLNC CN  R+++G A +  + RRG  PN V+
Sbjct: 123 NSHCSAVISLRAKLESAGLLSDYIVLNILLNCCCNARRVADGFAVLGAMFRRGHCPNTVS 182

Query: 187 YTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEML 246
           YT L+KGLC+EH+I EA  LFM+M + G RPN++TYGTLI GLC++GNT++ALKLH +M+
Sbjct: 183 YTQLVKGLCLEHKIGEAVGLFMKMFRSGRRPNIVTYGTLISGLCKSGNTSMALKLHRDMM 242

Query: 247 NGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFC 306
              G  G+ CKPN++CYS II+GLCKDG  ++AR LF EMK   + PDV+ YSSLIHG C
Sbjct: 243 GSNGEDGVLCKPNLVCYSAIINGLCKDGLLEEARGLFSEMKLSGISPDVVVYSSLIHGLC 302

Query: 307 NGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLF 366
           + G+ +EA  L NEMVD  I PN +TFN L+++LC+ GK+ EA  +L+VMI RG  PD +
Sbjct: 303 SIGQCKEATVLLNEMVDKRISPNTITFNSLIEVLCREGKLKEAENVLKVMIDRGVEPDSY 362

Query: 367 TYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEM 426
           TYNT+M G C    +D AREL + M SKGC  + +SYNVLINGY      EEA+ +Y EM
Sbjct: 363 TYNTIMCGLCFAGQIDEARELLVQMVSKGCLIDAVSYNVLINGYFNQGSTEEAICMYKEM 422

Query: 427 LQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCL 486
           +  GIKP+V+T+N L+ G FQ G+V DA+ +F  ++   + P+SST +IF+DGLCKN+ L
Sbjct: 423 IDEGIKPTVVTFNTLMKGFFQKGRVGDARNMFREMKHQEVEPNSSTFTIFIDGLCKNNFL 482

Query: 487 LEAMEIFNELSYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILI 546
            EA+++F+    N+KL + + N LIDG+CKAG +E AW+ F ++ ++GL P VVTYSI+I
Sbjct: 483 SEALDLFHSQDPNIKLSVPMSNSLIDGMCKAGDIEAAWDLFHRLQKDGLAPTVVTYSIMI 542

Query: 547 HGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVV 606
            G C++GQ E+AN+LF KME NGC PNVIT+NTL+ GF ++N+  +V+ELL +M +++++
Sbjct: 543 RGLCQQGQFEEANELFMKMEANGCPPNVITFNTLMHGFCQNNEIVKVIELLKKMAERSIL 602

Query: 607 PDASTCTIVLDMLSKDEKYRGCLNLLPTF 634
           PD  T ++V+D+LSKDE ++  LNLLPTF
Sbjct: 603 PDDFTTSMVVDVLSKDENHKEYLNLLPTF 623

BLAST of Cp4.1LG12g11340 vs. TAIR10
Match: AT1G12700.1 (AT1G12700.1 ATP binding;nucleic acid binding;helicases)

HSP 1 Score: 419.1 bits (1076), Expect = 5.0e-117
Identity = 206/550 (37.45%), Positives = 333/550 (60.55%), Query Frame = 1

Query: 69  STTEISFHERLRTFLQNCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHY 128
           S   + F ERLR+ + + K       +A   F  M+ + P P++  F+     +A+TK +
Sbjct: 50  SNGNVCFRERLRSGIVDIKK-----DDAIALFQEMIRSRPLPSLVDFSRFFSAIARTKQF 109

Query: 129 SEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSL 188
           + VL    ++ L G+  N  TLNI++NC C   +     + +  +++ G+ P+  T+ +L
Sbjct: 110 NLVLDFCKQLELNGIAHNIYTLNIMINCFCRCCKTCFAYSVLGKVMKLGYEPDTTTFNTL 169

Query: 189 IKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTG 248
           IKGL +E ++SEA  L  RM + GC+P+V+TY +++ G+C++G+T++AL L  +M     
Sbjct: 170 IKGLFLEGKVSEAVVLVDRMVENGCQPDVVTYNSIVNGICRSGDTSLALDLLRKMEER-- 229

Query: 249 RYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGK 308
               + K +V  YSTIID LC+DG  D A  LF+EM+ + +   V++Y+SL+ G C  GK
Sbjct: 230 ----NVKADVFTYSTIIDSLCRDGCIDAAISLFKEMETKGIKSSVVTYNSLVRGLCKAGK 289

Query: 309 WEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNT 368
           W +   L  +MV   I PN +TFNVL+D+  K GK+ EANEL + MI RG +P++ TYNT
Sbjct: 290 WNDGALLLKDMVSREIVPNVITFNVLLDVFVKEGKLQEANELYKEMITRGISPNIITYNT 349

Query: 369 LMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVG 428
           LMDG+C+ + L  A  +   M    C P+++++  LI GYC   +V++ MK++  + + G
Sbjct: 350 LMDGYCMQNRLSEANNMLDLMVRNKCSPDIVTFTSLIKGYCMVKRVDDGMKVFRNISKRG 409

Query: 429 IKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAM 488
           +  + +TY+ L+ G  Q+GK+  A+++F  + +HG++P   T  I +DGLC N  L +A+
Sbjct: 410 LVANAVTYSILVQGFCQSGKIKLAEELFQEMVSHGVLPDVMTYGILLDGLCDNGKLEKAL 469

Query: 489 EIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGY 548
           EIF +L  + + L I ++  +I+G+CK GK+E AW  F  +  +G+ PNV+TY+++I G 
Sbjct: 470 EIFEDLQKSKMDLGIVMYTTIIEGMCKGGKVEDAWNLFCSLPCKGVKPNVMTYTVMISGL 529

Query: 549 CKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDA 608
           CK+G + +AN L RKMEE+G  PN  TYNTL+R   +        +L+  M       DA
Sbjct: 530 CKKGSLSEANILLRKMEEDGNAPNDCTYNTLIRAHLRDGDLTASAKLIEEMKSCGFSADA 588

Query: 609 STCTIVLDML 618
           S+  +V+DML
Sbjct: 590 SSIKMVIDML 588
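
The percentages on each HSP statistics line are plain ratios of matched columns to alignment length. As a minimal sketch (the function name `hsp_percentages` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tooling), the following recomputes them from the raw counts and checks them against the reported values:

```python
import re

# Matches the statistics line of an HSP block. The pattern accepts either
# "Positives" or the variant spelling "Postives" for the second field.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Recompute identity and positives percentages from an HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(int(ident) / int(ident_len) * 100, 2)
    positives = round(int(pos) / int(pos_len) * 100, 2)
    # Reported values carry two decimals, so allow a small rounding tolerance.
    consistent = (
        abs(identity - float(ident_pct)) < 0.01
        and abs(positives - float(pos_pct)) < 0.01
    )
    return {"identity": identity, "positives": positives, "consistent": consistent}

# Statistics line of the AT1G12700.1 hit.
result = hsp_percentages(
    "Identity = 206/550 (37.45%), Positives = 333/550 (60.55%), Query Frame = 1"
)
print(result)
```

The same check can be applied to any other HSP in this record by passing its statistics line.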

BLAST of Cp4.1LG12g11340 vs. TAIR10
Match: AT1G62590.1 (AT1G62590.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 407.5 bits (1046), Expect = 1.5e-113
Identity = 202/518 (39.00%), Positives = 323/518 (62.36%), Query Frame = 1

Query: 114 SFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGI 173
           ++N+L+    +    S  L L  +M   G  P+ +TL+ LLN  C+  RIS+ +A +  +
Sbjct: 122 TYNILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQM 181

Query: 174 IRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNT 233
           +  G+ P+ +T+T+LI GL + ++ SEA  L  RM + GC+PN++TYG ++ GLC+ G+T
Sbjct: 182 VEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKRGDT 241

Query: 234 NIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDV 293
           ++AL L  +M           + +V+ ++TIID LCK    D A  LF+EM+ + + P+V
Sbjct: 242 DLALNLLNKM------EAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNV 301

Query: 294 ISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEV 353
           ++YSSLI   C+ G+W +A  L ++M++  I PN VTFN L+D   K GK +EA +L + 
Sbjct: 302 VTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDD 361

Query: 354 MIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWK 413
           MI+R   PD+FTYN+L++GFC+   LD A+++F  M SK C P+V++YN LI G+CK+ +
Sbjct: 362 MIKRSIDPDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKR 421

Query: 414 VEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSI 473
           VE+  +++ EM   G+    +TY  L+ GLF  G  D+A+K+F  + + G+ P   T SI
Sbjct: 422 VEDGTELFREMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSI 481

Query: 474 FVDGLCKNDCLLEAMEIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREG 533
            +DGLC N  L +A+E+F+ +  + +KLDI I+  +I+G+CKAGK++  W+ F  +S +G
Sbjct: 482 LLDGLCNNGKLEKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKG 541

Query: 534 LLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVV 593
           + PNVVTY+ +I G C +  +++A  L +KM+E+G  PN  TYNTL+R   +   +    
Sbjct: 542 VKPNVVTYNTMISGLCSKRLLQEAYALLKKMKEDGPLPNSGTYNTLIRAHLRDGDKAASA 601

Query: 594 ELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLL 631
           EL+  M     V DAST  +V +ML      +  L++L
Sbjct: 602 ELIREMRSCRFVGDASTIGLVANMLHDGRLDKSFLDML 633

BLAST of Cp4.1LG12g11340 vs. TAIR10
Match: AT1G62910.1 (AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 407.5 bits (1046), Expect = 1.5e-113
Identity = 204/518 (39.38%), Positives = 323/518 (62.36%), Query Frame = 1

Query: 114 SFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGI 173
           ++++ +    +    S  L +  +M   G  P+ +TL+ LLN  C+  RIS+ +A +  +
Sbjct: 120 TYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQM 179

Query: 174 IRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNT 233
           +  G+ P+  T+T+LI GL + ++ SEA  L  +M + GC+P+++TYGT++ GLC+ G+ 
Sbjct: 180 VEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDI 239

Query: 234 NIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDV 293
           ++AL L ++M  G        + +V+ Y+TIIDGLCK    D A  LF EM  + + PDV
Sbjct: 240 DLALSLLKKMEKG------KIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPDV 299

Query: 294 ISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEV 353
            +YSSLI   CN G+W +A  L ++M++  I PN VTF+ L+D   K GK++EA +L + 
Sbjct: 300 FTYSSLISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDE 359

Query: 354 MIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWK 413
           MI+R   PD+FTY++L++GFC+   LD A+ +F  M SK C PNV++Y+ LI G+CK  +
Sbjct: 360 MIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKR 419

Query: 414 VEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSI 473
           VEE M+++ EM Q G+  + +TY  L+ G FQA   D+A+ +F  + + G+ P+  T +I
Sbjct: 420 VEEGMELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYNI 479

Query: 474 FVDGLCKNDCLLEAMEIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREG 533
            +DGLCKN  L +AM +F  L  + ++ DI  +N +I+G+CKAGK+E  WE F  +S +G
Sbjct: 480 LLDGLCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLKG 539

Query: 534 LLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVV 593
           + PNV+ Y+ +I G+C++G  E+A+ L +KM+E+G  PN  TYNTL+R   +   RE   
Sbjct: 540 VSPNVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDREASA 599

Query: 594 ELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLL 631
           EL+  M       DAST  +V +ML      +  L++L
Sbjct: 600 ELIKEMRSCGFAGDASTIGLVTNMLHDGRLDKSFLDML 631

BLAST of Cp4.1LG12g11340 vs. TAIR10
Match: AT3G22470.1 (AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 406.4 bits (1043), Expect = 3.3e-113
Identity = 203/490 (41.43%), Positives = 312/490 (63.67%), Query Frame = 1

Query: 142 GLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHRISEA 201
           G  P+ IT + L+N  C   R+SE +A +  ++     P++VT ++LI GLC++ R+SEA
Sbjct: 135 GYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEA 194

Query: 202 TRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPNVICY 261
             L  RM + G +P+ +TYG ++  LC++GN+ +AL L  +M         + K +V+ Y
Sbjct: 195 LVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEER------NIKASVVQY 254

Query: 262 STIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFNEMVD 321
           S +ID LCKDG  D A  LF EM+ + +  DV++YSSLI G CN GKW++   +  EM+ 
Sbjct: 255 SIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIG 314

Query: 322 LGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDS 381
             I P+ VTF+ L+D+  K GK++EA EL   MI RG APD  TYN+L+DGFC  + L  
Sbjct: 315 RNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHE 374

Query: 382 ARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSVITYNALLT 441
           A ++F  M SKGCEP++++Y++LIN YCK  +V++ M+++ E+   G+ P+ ITYN L+ 
Sbjct: 375 ANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVL 434

Query: 442 GLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNELSYN-LKL 501
           G  Q+GK++ AK++F  + + G+ PS  T  I +DGLC N  L +A+EIF ++  + + L
Sbjct: 435 GFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTL 494

Query: 502 DIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKEGQVEKANDLF 561
            I I+N +I G+C A K++ AW  F  +S +G+ P+VVTY+++I G CK+G + +A+ LF
Sbjct: 495 GIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEADMLF 554

Query: 562 RKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTCTIVLDMLSKD 621
           RKM+E+GCTP+  TYN L+R     +     VEL+  M       D+ST  +V+DMLS  
Sbjct: 555 RKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEMKVCGFSADSSTIKMVIDMLSDR 614

Query: 622 EKYRGCLNLL 631
              +  L++L
Sbjct: 615 RLDKSFLDML 618

BLAST of Cp4.1LG12g11340 vs. TAIR10
Match: AT1G62930.1 (AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 406.4 bits (1043), Expect = 3.3e-113
Identity = 207/518 (39.96%), Positives = 321/518 (61.97%), Query Frame = 1

Query: 114 SFNLLLGGLAKTKHYSEVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGI 173
           S+N+L+    +       L +  +M   G  P+ +TL+ LLN  C+  RISE +A +  +
Sbjct: 117 SYNILINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRISEAVALVDQM 176

Query: 174 IRRGFIPNIVTYTSLIKGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNT 233
               + PN VT+ +LI GL + ++ SEA  L  RM   GC+P++ TYGT++ GLC+ G+ 
Sbjct: 177 FVMEYQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDI 236

Query: 234 NIALKLHEEMLNGTGRYGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDV 293
           ++AL L ++M  G        + +V+ Y+TIID LC     + A  LF EM  + + P+V
Sbjct: 237 DLALSLLKKMEKG------KIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNV 296

Query: 294 ISYSSLIHGFCNGGKWEEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEV 353
           ++Y+SLI   CN G+W +A  L ++M++  I PN VTF+ L+D   K GK++EA +L + 
Sbjct: 297 VTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDE 356

Query: 354 MIQRGNAPDLFTYNTLMDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWK 413
           MI+R   PD+FTY++L++GFC+   LD A+ +F  M SK C PNV++YN LI G+CK  +
Sbjct: 357 MIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKR 416

Query: 414 VEEAMKIYNEMLQVGIKPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSI 473
           VEE M+++ EM Q G+  + +TYN L+ GLFQAG  D A+KIF  + + G+ P   T SI
Sbjct: 417 VEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSI 476

Query: 474 FVDGLCKNDCLLEAMEIFNELSYN-LKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREG 533
            +DGLCK   L +A+ +F  L  + ++ DI  +N +I+G+CKAGK+E  W+ F  +S +G
Sbjct: 477 LLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKG 536

Query: 534 LLPNVVTYSILIHGYCKEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVV 593
           + PNV+ Y+ +I G+C++G  E+A+ LFR+M+E+G  PN  TYNTL+R   +   +    
Sbjct: 537 VKPNVIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASA 596

Query: 594 ELLHRMVKKNVVPDASTCTIVLDMLSKDEKYRGCLNLL 631
           EL+  M     V DAST ++V++ML      +  L +L
Sbjct: 597 ELIKEMRSCGFVGDASTISMVINMLHDGRLEKSYLEML 628

BLAST of Cp4.1LG12g11340 vs. NCBI nr
Match: gi|778727543|ref|XP_011659273.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 850.1 bits (2195), Expect = 2.5e-243
Identity = 421/630 (66.83%), Positives = 503/630 (79.84%), Query Frame = 1

Query: 11  SSSLSPAPS-KGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVSLEKS 70
           S +L P+PS K  LS L T    I SS PQ  S          +  PN    P +    +
Sbjct: 3   SKTLLPSPSFKPNLSPLSTHSSTIPSSIPQTSS----------THHPN----PILPAAFN 62

Query: 71  TTEISFHERLRTFLQNCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHYS 130
             EISFH  L  FL+NCKTGN TA +AF FFDLMM ++P P +SSFN LLGGLAK  HYS
Sbjct: 63  REEISFHHPLSLFLRNCKTGNITAIQAFHFFDLMMRSHPIPPISSFNRLLGGLAKINHYS 122

Query: 131 EVLKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLI 190
           ++  LY  M LAGL P+  TL+IL NCLCNVNR+SE LAAMAGI+RRG+IPN+VTYT+LI
Sbjct: 123 QLFSLYNEMRLAGLSPDLFTLSILANCLCNVNRVSEALAAMAGILRRGYIPNVVTYTTLI 182

Query: 191 KGLCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGR 250
           KGLCMEHRISEATRLF+RMQKLGC PN +TYGTLIKGLCQTGN NIALKLH+EMLN   +
Sbjct: 183 KGLCMEHRISEATRLFLRMQKLGCTPNAVTYGTLIKGLCQTGNVNIALKLHKEMLNDASQ 242

Query: 251 YGISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKW 310
           YG++CKP VI YS IIDGLCK GRED+A+ELFEEMKA+ M+PDVISYSSLIHG C  GKW
Sbjct: 243 YGVNCKPGVITYSIIIDGLCKVGREDEAKELFEEMKAQGMIPDVISYSSLIHGLCCAGKW 302

Query: 311 EEAKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTL 370
           EE+K LF+EMVD G+QP+ VTF+VL+D LCK GKV EA +LLEVMIQRG  P+L TYN+L
Sbjct: 303 EESKRLFDEMVDQGVQPDMVTFSVLIDTLCKEGKVTEAKKLLEVMIQRGIVPNLITYNSL 362

Query: 371 MDGFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGI 430
           +DGFC+V DL+SARELFLSMPSKG EP+ ISY  LINGYCK WKV+EAM +YNEMLQVG 
Sbjct: 363 IDGFCMVGDLNSARELFLSMPSKGLEPDEISYTTLINGYCKTWKVKEAMNLYNEMLQVGK 422

Query: 431 KPSVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAME 490
           +P+  TY  LLTGLFQ GKV DAKK+FGV++ +G+  +S    IF+DGLCKNDCL EAME
Sbjct: 423 RPNATTYGTLLTGLFQTGKVGDAKKLFGVMKTYGVSANSQIYGIFLDGLCKNDCLFEAME 482

Query: 491 IFNEL-SYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYC 550
           +FNEL SYN KL+I+ ++CLIDGLCKAGKLETAWE F+K+S+EGL P+VVTY+I+IHG+C
Sbjct: 483 LFNELKSYNFKLNIENYSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYNIMIHGFC 542

Query: 551 KEGQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDAS 610
           K GQV+ AN LF KMEENGCTP++I YNTLL GF + NK EEV++LLH+MV+K+V P+A+
Sbjct: 543 KVGQVDNANILFEKMEENGCTPDIIAYNTLLCGFCEGNKLEEVIKLLHKMVQKDVSPNAA 602

Query: 611 TCTIVLDMLSKDEKYRGCLNLLPTFPVQEY 639
           +CTIV+DML KDEKY+  ++LLP FPVQ +
Sbjct: 603 SCTIVVDMLCKDEKYKKFVDLLPKFPVQRH 618

BLAST of Cp4.1LG12g11340 vs. NCBI nr
Match: gi|659100702|ref|XP_008451225.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Cucumis melo])

HSP 1 Score: 850.1 bits (2195), Expect = 2.5e-243
Identity = 403/560 (71.96%), Positives = 481/560 (85.89%), Query Frame = 1

Query: 82  FLQNCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHYSEVLKLYYRMSLA 141
           FL+NCKTGN TA++AF FFDLM+ + P P +SSF+ LLGGLAK KHYS +  LY +M LA
Sbjct: 10  FLRNCKTGNITATQAFHFFDLMLRSYPIPPISSFDCLLGGLAKIKHYSLLFSLYNKMRLA 69

Query: 142 GLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHRISEA 201
           GL P+ ITLNIL+NCLCNVNR+SE LA MAG++RRG+IPN+VTYT+LIKGLCMEHRISEA
Sbjct: 70  GLSPSAITLNILVNCLCNVNRVSEALAGMAGLLRRGYIPNVVTYTTLIKGLCMEHRISEA 129

Query: 202 TRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPNVICY 261
           TRLF+RMQKLGC PNV+TYGTL+KGLCQTGN NIALKLH+EMLN T +YGI+CKPNV  Y
Sbjct: 130 TRLFLRMQKLGCTPNVVTYGTLVKGLCQTGNVNIALKLHQEMLNDTSQYGINCKPNVFNY 189

Query: 262 STIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFNEMVD 321
           + IIDGLCK GRED+A ELFEEMKA+ M+P+VISYSSLIHGFC   KWEE+K LF+EMVD
Sbjct: 190 NIIIDGLCKVGREDEANELFEEMKAQGMIPNVISYSSLIHGFCCARKWEESKRLFDEMVD 249

Query: 322 LGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMDGFCLVSDLDS 381
            G+QP+ VTF+VL+D LCK GKVIEA +L EVMIQRG  PDLF Y++LM+GFC+V DL+S
Sbjct: 250 QGVQPDKVTFSVLIDTLCKEGKVIEAKKLFEVMIQRGIVPDLFIYSSLMEGFCMVGDLNS 309

Query: 382 ARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSVITYNALLT 441
           ARELF+SMPSKGCEP+VISY VLINGYCK  KVEEAMK+YNEML VG +P+VITY ALLT
Sbjct: 310 ARELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNEMLLVGKRPNVITYGALLT 369

Query: 442 GLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNEL-SYNLKL 501
           GLF AGKV DAKK+F  ++A G+  +S    I +DGLCKN CL EAM++F EL SYN KL
Sbjct: 370 GLFLAGKVGDAKKLFSAMKARGISANSHIYGIILDGLCKNGCLFEAMKLFTELKSYNFKL 429

Query: 502 DIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKEGQVEKANDLF 561
           DI+ ++CLIDGLCK GKLETAWE F+K+S+EGL PNVVTYSI+IHG C+EGQV+KAN L 
Sbjct: 430 DIETYSCLIDGLCKEGKLETAWELFEKLSQEGLQPNVVTYSIMIHGLCREGQVDKANVLI 489

Query: 562 RKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTCTIVLDMLSKD 621
           +KME NGC PN+ITYNTL+RGFY+SNK +EVV+LLH MVKK+V+PDA+TC+IV+DML KD
Sbjct: 490 QKMETNGCNPNIITYNTLMRGFYESNKLDEVVQLLHGMVKKDVLPDATTCSIVVDMLCKD 549

Query: 622 EKYRGCLNLLPTFPVQEYRG 641
           EKY+ CL+LLP F VQ+++G
Sbjct: 550 EKYQECLDLLPRFSVQKHQG 569

BLAST of Cp4.1LG12g11340 vs. NCBI nr
Match: gi|645259328|ref|XP_008235309.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Prunus mume])

HSP 1 Score: 728.0 bits (1878), Expect = 1.4e-206
Identity = 357/624 (57.21%), Positives = 457/624 (73.24%), Query Frame = 1

Query: 13  SLSPAPSKGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVSLEKSTTE 72
           S S   SK KLSSL   P R  SSN +  S + +        P +SN  P  S E   ++
Sbjct: 2   SSSGTASKVKLSSLFANPQRPTSSNSKTSSKLSSLFIKNQGTPTSSN--PTTSSENPISK 61

Query: 73  ISFHERLRTFLQ-NCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHYSEV 132
           +    +L  FL+ NCK+G+ T +EA  +F+ M+   PTP + SFN L GGLAK+KH S V
Sbjct: 62  VPIQTQLEKFLETNCKSGDVTINEALHYFEHMIQMQPTPPIWSFNCLFGGLAKSKHCSHV 121

Query: 133 LKLYYRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKG 192
             LY ++   GLLPN+ITLNILLNC CNVNR  +G   M  ++RRG+ P+ VTYT+L+KG
Sbjct: 122 FLLYNKLISVGLLPNFITLNILLNCFCNVNRARDGFVVMGSLLRRGYRPSTVTYTALLKG 181

Query: 193 LCMEHRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYG 252
           LC E RI EATRLFM M KLGC+P V+T+GTLI GLC+TGNTN+A++LHEEM NG G YG
Sbjct: 182 LCKEDRIDEATRLFMTMIKLGCQPTVVTFGTLINGLCRTGNTNVAVRLHEEMANGNGVYG 241

Query: 253 ISCKPNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEE 312
           + CKP+V+ Y TIIDGLCK G  DKA+ELF EMK R  +PDVI YS+LIH      KWE 
Sbjct: 242 VECKPSVVSYGTIIDGLCKAGLVDKAKELFIEMKDRGFVPDVIVYSALIHELYYNEKWEA 301

Query: 313 AKCLFNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMD 372
           AK L NEMVD G++PN VTFNVL+ +LC+ G + E+++LL++MIQRG  PD+FTYNTL+D
Sbjct: 302 AKALLNEMVDQGVRPNVVTFNVLIGVLCRRGHLKESSDLLKLMIQRGIDPDVFTYNTLID 361

Query: 373 GFCLVSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKP 432
           GFCL   L+ ARELF S+PS+GCEP+ ISYNVLINGYCKN  ++EA+ +Y EM+  G +P
Sbjct: 362 GFCLAGRLNEARELFHSIPSRGCEPDAISYNVLINGYCKNRNIQEAVNLYKEMIGKGTRP 421

Query: 433 SVITYNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIF 492
           +VITYN+LLTGLF  GKV DA+++FG +Q   L+P+S+T  I +DGLCK D + EAME+F
Sbjct: 422 TVITYNSLLTGLFHMGKVQDAQELFGEMQTQNLLPNSTTYKILLDGLCKTDRVPEAMEVF 481

Query: 493 NEL-SYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKE 552
             L + N K+ +++ NCLIDG CKAG LE AW+ F  +S  GL PNV+TYS++IHG C E
Sbjct: 482 RTLENCNFKISVEMLNCLIDGFCKAGNLEVAWDLFLTLSNRGLAPNVITYSVMIHGLCIE 541

Query: 553 GQVEKANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTC 612
           GQ+EKAN LF +ME NGC PNVI YN L+RGF +S+   +VVELLH+MV +N+ PD+ T 
Sbjct: 542 GQLEKANGLFIEMEANGCAPNVIIYNILMRGFCQSDDSAKVVELLHKMVTRNLSPDSCTI 601

Query: 613 TIVLDMLSKDEKYRGCLNLLPTFP 635
           ++V+D+LSKDEKYR CL+LLPTFP
Sbjct: 602 SVVIDLLSKDEKYRKCLDLLPTFP 623

BLAST of Cp4.1LG12g11340 vs. NCBI nr
Match: gi|694328165|ref|XP_009354905.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial isoform X2 [Pyrus x bretschneideri])

HSP 1 Score: 726.9 bits (1875), Expect = 3.2e-206
Identity = 350/620 (56.45%), Positives = 456/620 (73.55%), Query Frame = 1

Query: 17  APSKGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVSLEKSTTEISFH 76
           A SK KLSS  T P +  S NP+  + +     P       ++ +P+   +     +   
Sbjct: 6   ATSKSKLSSFFTNPQKPTSPNPKTPTKLSPFFVPTNQSSQPTSPNPKSPPKTPIPNLPIQ 65

Query: 77  ERLRTFLQ-NCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHYSEVLKLY 136
             L+ FL+ NCK+GNFT  EA Q+F  M+   PTP +SSFNLL  GLAK+KH+S+V  LY
Sbjct: 66  TPLQKFLESNCKSGNFTLYEALQYFSHMIRMQPTPPISSFNLLFSGLAKSKHFSQVFPLY 125

Query: 137 YRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCME 196
             +  AGL+PN+ITLN+LLNC C  NR+ +G   M  + RRGF+P+ V+YT L+KGLC E
Sbjct: 126 NAIISAGLVPNFITLNVLLNCCCTANRVRDGFVVMGSLFRRGFLPSTVSYTCLVKGLCRE 185

Query: 197 HRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCK 256
            +I EATRLF +M KLGC PN  T+ TLIKGLC+TGN N+AL+LHE M NG+G Y + C+
Sbjct: 186 DKIFEATRLFEKMVKLGCHPNEFTFSTLIKGLCRTGNVNVALRLHEGMANGSGAYSVGCR 245

Query: 257 PNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCL 316
           PNV  Y+TIIDG+CK G  DKA+ELF EMK R ++PDV  YSSLIHG     KWE AK +
Sbjct: 246 PNVFTYATIIDGMCKAGLVDKAKELFIEMKGRGIVPDVNVYSSLIHGLYYNEKWEAAKAM 305

Query: 317 FNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMDGFCL 376
            +EMVD G+QPN VTFNVL+ ++C+ G V E+++LL++MI+RG  PD+FTY+ L+DG CL
Sbjct: 306 LSEMVDQGVQPNLVTFNVLIGVICRRGNVKESSDLLKLMIRRGIDPDIFTYSILIDGLCL 365

Query: 377 VSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSVIT 436
           V  L+ ARELF S+PS+GCEP  ISYNVLINGYCK+ K+ EA   Y EM+  G++P++IT
Sbjct: 366 VGRLNEARELFHSIPSRGCEPGAISYNVLINGYCKHKKILEATNFYKEMINKGVRPTIIT 425

Query: 437 YNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNEL- 496
           YNALLTGLFQ GKV DA+++F  I+   L+P+S+T +IF+DGLCKN+CL EAMEIF+ L 
Sbjct: 426 YNALLTGLFQMGKVQDAQELFSEIRTQNLLPNSTTYTIFMDGLCKNNCLTEAMEIFHTLE 485

Query: 497 SYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKEGQVE 556
           + N KL +++F CLIDGLCKAGKLE AW+ F K+S  GL P VVTYS++IHG C +GQ+E
Sbjct: 486 NSNFKLGVEMFGCLIDGLCKAGKLEIAWDLFHKMSNRGLGPTVVTYSMMIHGLCIDGQLE 545

Query: 557 KANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTCTIVL 616
           KANDLF +ME  GC PN+ITYN L+RGF +++   +VVELLH+M ++N+ PDAST +IV+
Sbjct: 546 KANDLFLEMEGKGCAPNIITYNILMRGFCQNDDSAKVVELLHKMAERNLSPDASTVSIVI 605

Query: 617 DMLSKDEKYRGCLNLLPTFP 635
           D+L KDEKYR CL+LLPTFP
Sbjct: 606 DLLLKDEKYRKCLDLLPTFP 625

BLAST of Cp4.1LG12g11340 vs. NCBI nr
Match: gi|694328160|ref|XP_009354903.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 726.9 bits (1875), Expect = 3.2e-206
Identity = 350/620 (56.45%), Positives = 456/620 (73.55%), Query Frame = 1

Query: 17  APSKGKLSSLVTRPLRIASSNPQNLSPIPAESPPIISKPPNSNVSPEVSLEKSTTEISFH 76
           A SK KLSS  T P +  S NP+  + +     P       ++ +P+   +     +   
Sbjct: 6   ATSKSKLSSFFTNPQKPTSPNPKTPTKLSPFFVPTNQSSQPTSPNPKSPPKTPIPNLPIQ 65

Query: 77  ERLRTFLQ-NCKTGNFTASEAFQFFDLMMLANPTPAMSSFNLLLGGLAKTKHYSEVLKLY 136
             L+ FL+ NCK+GNFT  EA Q+F  M+   PTP +SSFNLL  GLAK+KH+S+V  LY
Sbjct: 66  TPLQKFLESNCKSGNFTLYEALQYFSHMIRMQPTPPISSFNLLFSGLAKSKHFSQVFPLY 125

Query: 137 YRMSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCME 196
             +  AGL+PN+ITLN+LLNC C  NR+ +G   M  + RRGF+P+ V+YT L+KGLC E
Sbjct: 126 NAIISAGLVPNFITLNVLLNCCCTANRVRDGFVVMGSLFRRGFLPSTVSYTCLVKGLCRE 185

Query: 197 HRISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCK 256
            +I EATRLF +M KLGC PN  T+ TLIKGLC+TGN N+AL+LHE M NG+G Y + C+
Sbjct: 186 DKIFEATRLFEKMVKLGCHPNEFTFSTLIKGLCRTGNVNVALRLHEGMANGSGAYSVGCR 245

Query: 257 PNVICYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCL 316
           PNV  Y+TIIDG+CK G  DKA+ELF EMK R ++PDV  YSSLIHG     KWE AK +
Sbjct: 246 PNVFTYATIIDGMCKAGLVDKAKELFIEMKGRGIVPDVNVYSSLIHGLYYNEKWEAAKAM 305

Query: 317 FNEMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNAPDLFTYNTLMDGFCL 376
            +EMVD G+QPN VTFNVL+ ++C+ G V E+++LL++MI+RG  PD+FTY+ L+DG CL
Sbjct: 306 LSEMVDQGVQPNLVTFNVLIGVICRRGNVKESSDLLKLMIRRGIDPDIFTYSILIDGLCL 365

Query: 377 VSDLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSVIT 436
           V  L+ ARELF S+PS+GCEP  ISYNVLINGYCK+ K+ EA   Y EM+  G++P++IT
Sbjct: 366 VGRLNEARELFHSIPSRGCEPGAISYNVLINGYCKHKKILEATNFYKEMINKGVRPTIIT 425

Query: 437 YNALLTGLFQAGKVDDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNEL- 496
           YNALLTGLFQ GKV DA+++F  I+   L+P+S+T +IF+DGLCKN+CL EAMEIF+ L 
Sbjct: 426 YNALLTGLFQMGKVQDAQELFSEIRTQNLLPNSTTYTIFMDGLCKNNCLTEAMEIFHTLE 485

Query: 497 SYNLKLDIDIFNCLIDGLCKAGKLETAWEFFDKISREGLLPNVVTYSILIHGYCKEGQVE 556
           + N KL +++F CLIDGLCKAGKLE AW+ F K+S  GL P VVTYS++IHG C +GQ+E
Sbjct: 486 NSNFKLGVEMFGCLIDGLCKAGKLEIAWDLFHKMSNRGLGPTVVTYSMMIHGLCIDGQLE 545

Query: 557 KANDLFRKMEENGCTPNVITYNTLLRGFYKSNKREEVVELLHRMVKKNVVPDASTCTIVL 616
           KANDLF +ME  GC PN+ITYN L+RGF +++   +VVELLH+M ++N+ PDAST +IV+
Sbjct: 546 KANDLFLEMEGKGCAPNIITYNILMRGFCQNDDSAKVVELLHKMAERNLSPDASTVSIVI 605

Query: 617 DMLSKDEKYRGCLNLLPTFP 635
           D+L KDEKYR CL+LLPTFP
Sbjct: 606 DLLLKDEKYRKCLDLLPTFP 625

The following BLAST results are available for this feature:
Match Name    E-value     Identity  Description
PPR38_ARATH   6.1e-117    37.12     Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS...
PPR90_ARATH   2.6e-112    39.00     Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana GN...
PPR94_ARATH   2.6e-112    39.38     Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana GN...
PP247_ARATH   5.9e-112    41.43     Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...
PPR96_ARATH   5.9e-112    39.96     Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop...
Match Name          E-value     Identity  Description
D7SQM4_VITVI        1.2e-199    55.97     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0141g00780 PE=4 SV=...
M5VNT7_PRUPE        1.1e-197    61.35     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003905mg PE=4 SV=1
A0A061DX07_THECC    2.8e-193    54.01     Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cac...
W9S0N2_9ROSA        1.4e-192    59.33     Uncharacterized protein OS=Morus notabilis GN=L484_005753 PE=4 SV=1
A0A059A8Z6_EUCGR    1.1e-181    50.87     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K03302 PE=4 SV=1
Match Name     E-value     Identity  Description
AT1G12700.1    5.0e-117    37.45     ATP binding;nucleic acid binding;helicases
AT1G62590.1    1.5e-113    39.00     pentatricopeptide (PPR) repeat-containing protein
AT1G62910.1    1.5e-113    39.38     Pentatricopeptide repeat (PPR) superfamily protein
AT3G22470.1    3.3e-113    41.43     Pentatricopeptide repeat (PPR) superfamily protein
AT1G62930.1    3.3e-113    39.96     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                          E-value     Identity  Description
gi|778727543|ref|XP_011659273.1|    2.5e-243    66.83     PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-...
gi|659100702|ref|XP_008451225.1|    2.5e-243    71.96     PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-...
gi|645259328|ref|XP_008235309.1|    1.4e-206    57.21     PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Prunus mu...
gi|694328165|ref|XP_009354905.1|    3.2e-206    56.45     PREDICTED: putative pentatricopeptide repeat-containing protein At1g12700, mitoc...
gi|694328160|ref|XP_009354903.1|    3.2e-206    56.45     PREDICTED: putative pentatricopeptide repeat-containing protein At1g12700, mitoc...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term         Definition
IPR011990    TPR-like_helical_dom_sf
IPR002885    Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG12g11340.1    Cp4.1LG12g11340.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                          Source      Source Term         Source Description    Alignment
IPR002885    Pentatricopeptide repeat                 PFAM        PF01535             PPR
    coord: 470..494    score: 0.
IPR002885    Pentatricopeptide repeat                 PFAM        PF12854             PPR_1
    coord: 254..285    score: 8.0E-14
    coord: 497..527    score: 2.5E-9
    coord: 360..389    score: 2.
IPR002885    Pentatricopeptide repeat                 PFAM        PF13041             PPR_2
    coord: 291..340    score: 1.4E-17
    coord: 535..584    score: 3.4E-21
    coord: 396..443    score: 2.2E-18
    coord: 180..229    score: 2.6E-19
    coord: 113..159    score: 4.
IPR002885    Pentatricopeptide repeat                 TIGRFAMs    TIGR00756           TIGR00756
    coord: 504..537    score: 2.4E-9
    coord: 294..328    score: 1.3E-10
    coord: 538..572    score: 3.0E-13
    coord: 259..293    score: 3.6E-10
    coord: 434..467    score: 9.6E-7
    coord: 364..398    score: 2.2E-9
    coord: 573..607    score: 1.3E-7
    coord: 399..433    score: 1.5E-9
    coord: 329..362    score: 3.0E-6
    coord: 114..146    score: 1.9E-4
    coord: 218..244    score: 1.0E-4
    coord: 148..182    score: 3.3E-4
    coord: 183..217    score: 6.
IPR002885    Pentatricopeptide repeat                 PROFILE     PS51375             PPR
    coord: 292..326    score: 14.009
    coord: 536..570    score: 15.083
    coord: 467..497    score: 7.114
    coord: 501..535    score: 12.814
    coord: 146..180    score: 8.868
    coord: 571..605    score: 12.134
    coord: 397..431    score: 14.118
    coord: 216..246    score: 9.909
    coord: 362..396    score: 12.814
    coord: 111..145    score: 9.208
    coord: 257..291    score: 12.902
    coord: 181..215    score: 12.912
    coord: 432..466    score: 11.542
    coord: 327..361    score: 12
IPR011990    Tetratricopeptide-like helical domain    GENE3D      G3DSA:1.25.40.10
    coord: 258..344    score: 1.0E-8
    coord: 400..597    score: 1.
IPR011990    Tetratricopeptide-like helical domain    unknown     SSF48452            TPR-like
    coord: 258..456    score: 3.79E-9
    coord: 94..217     score: 3.7
None         No IPR available                         PANTHER     PTHR24015           FAMILY NOT NAMED
    coord: 81..612     score: 9.5E-245
    coord: 9..63       score: 9.5E
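
The PS51375 (PPR profile) hits are listed out of positional order, which makes it hard to see how the repeats tile the protein. The following is a minimal sketch, assuming the coordinates are residue positions in the predicted protein; the helper name `find_gaps` is illustrative and not part of InterProScan. It sorts the hits and reports any stretches between consecutive repeats that no hit covers.

```python
# PS51375 hit coordinates (start, end), copied from the InterPro annotation
# table in the order they are listed there.
PS51375_HITS = [
    (292, 326), (536, 570), (467, 497), (501, 535), (146, 180),
    (571, 605), (397, 431), (216, 246), (362, 396), (111, 145),
    (257, 291), (181, 215), (432, 466), (327, 361),
]

def find_gaps(hits):
    """Return the hits in positional order and the uncovered ranges between them."""
    ordered = sorted(hits)
    gaps = []
    # Compare each hit's end with the next hit's start.
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return ordered, gaps

ordered, gaps = find_gaps(PS51375_HITS)
print(len(ordered), "repeats spanning", ordered[0][0], "to", ordered[-1][1])
print("uncovered ranges between repeats:", gaps)
```

With these coordinates the sketch reports 14 repeats spanning residues 111 to 605, with two short uncovered stretches at 247..256 and 498..500.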