Cp4.1LG12g11100 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG12g11100
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG12 : 8975615 .. 8979323 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGGAAAAAAAGGATAAAATATATAATGTCACACCTATCGGAGATGGGATACTAAGGCCGTGGATGCGAGGCGAATGGCCTGAGAATCTCAAAGCTCAGCATCCACTATCCATGGCCGCCCACTCCACTCCCTCCATCCTTCTTCCACCGGCAGTCAGCCATTCAAATCCTAAGAACCGTGTATTTCTTTGTCAACAACGAGCGAGCCATCCCAATTTCGAATCTCCCGCCCCATCTTCCTCTTCGTCTTCTACAGATGGTAAGCTCATGCAGAGAAGTCCTCGCGAAGGACGCATGGACATCGCAAAGCTGAAGGCGAAGGAAGCGGGCGAGAGAAAAGAGGAGGTCAACAGGAAGATTGCTTCTCAGAAAGCCATTTCTGTGATTTTGCGCAGGGAAGCCACGAAGGCCGTCATTGAGAGGAAGAGAGGCCCCACTAACTCTAAGAAACTGCTTCCGCGGACTGTTCTTGAGGCTCTCCATGAACGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTCAGTTAATTTTGGTTTTAAGCTGCGTCTGATTTTTCCTTTTCTAGTATTCGTTTGGAAATTTATAATCGCTTTGGTTTGGAGACTCTTGGAAGAACAACGTAAAGGTGTTTTGTTCTTCTCTCTTTTTGGTTCTTGTATTGATGAGTGAAATGACTCTTGTTCGGAATTCCAGCAGTTCCCTTCATGTTTGAGTTCAATGTAATTCATTTTAAGAACAAATTTAATTGACTGCTTTGAGTGCAAATAAACCTCAATTGAAGCCCAGAAGTTGGATCTTTCTTCATTTAGCAACCAGGAAAGAAGAAACAAGTGTATCTTTGGATTGAGTCGGTAAACACGGGCAGCACCGAACGAGAATGAATATGTATAGAACTGGTACAACAAAGACGAGGCATAGAAATATAAATCACCATGGAACCTAAAAGTTTGAGCTTGTGGGTAACAACAAATTTAATTTGTATCAACACTCAACATTTACTAGGTGGATTTGCTCGTGTGCTAGCCATCAACTGGAAGAGGACATTCTGGTTTCCTGTCTTTTTAGGGTATATCAGACTTTGTTGGGACGATCTAAGGCTTTGACTTATCTTACAACCATGGAATAGATTATGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAGCATTATTTACAAGGGTGTGGAAACCTCTCCCTAGTAAACCTGTTTTAAAAACCTTGAGGGGAAGCCCGAAAGAAAAAGCTCAGAGAGAACAATATCTGCTAGCGGTGTGATTGAGCTGTTACATATTCATTGTTTTGTTAAATATAAGCTGCTATTGCTATTTGAGCTTGTTGTCTGCCAGGATTTTAGTGAGATGATGATTGTGTTAAGAGTGGGAATTTGGATGCATGAGAAAAATTTAAATGCACAAGGCCAGCAACTTATGAAAAACTGGCACCATTGACAACTTATGGATTGTGTCATTAGCATTCAATGTTAAATTTTGTTCATTGTTAAGAACAAAAAGAGTTCAAAACTTGGTCAAGAACTACCATGATTAATCTAATTCGTCTTCTTAATTGTGTGAGTTTATTTATTATGGATTTCAGGTTTTTGAACTACTACGCGAACAATTGTGGTACAGACCTTACGCCGGGATGTACATCAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCTGAAAAGGCCTATGAGCTGTTCCAAGAAATGATCGAGGAAGGCTGTGAAGTTAGCCATGAGTCCTATACTGCTCTCCTGTCGGCCTATAGTAGGAGTGGTCTTCTTGACAGGGCATTTTCACTCCTCAACGAGATGAAAAACAGTCCTGATTGCCAGCCTGACGTTCACACTTACTCTATCCTCATAAAATCATGTTTGCAGGTGTTTGCATTCAGCGAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCGAACACTATTACATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTACTCTTACCTGCCTGCCTTTTGAGAACATAATAGATGATTCATCTGGTTGCTGACTGAACTATGTTTAAATTTGAGTACTATAGATGATAATCGACTTCATTTTTAGATGGAGCTTATATCTTTCTGCAGGCGTAACGCCTTCCACGGCCAGTTCTCTTCAACCTTTCCTCCAGAGTCAACTGCTAGAAGTTACGGCCAGTTCTCTTCAACCTTTCCTCCAGAGTCAACTGCTAGAAGTTAAGCTTGGTTATAGCTAGTAGTTGGAGCCAGCATGGGGTGTGTGAGGTCTCCTTTTGTAGGAAAGGGGTGGGTGCCGTGGGGCTATTAGAAGAAAATGGTTTCGTCTTTTATGCTTATGGACTAGTTCTGCTCCTCTATGTCATGGTTGTTATCAATCTCTCTGGAAGTAACAGTCCAAGCCCACCACTAGCAGATATTGTCCTCTTTAGGCTCTCCCTTCTTTCCCTTCTGAGCTTCCCTTCAAGGTTTTAAAATGTTTGCTACGGAGAGGTTTTTATACCCTTATAAGGAATGTTTTGTTCCCATCTCCAACAGACGTGGGATCTCACAACTTCTGTCTCTTCTTCGCATTGTCCTGTCTTATAATCTGCGACCAGATGAATGTGTCTACTACTTCCATAGTAGAAAAGTAACTTCTTTTTGTGTGTATATATTCTGCAGGTTTGCTGAAATGGAGTCCATCCTTGTGGAAATGCTGAGTGATGACGGATGTAAGCCCGACGTTTGGACCATGAACTCGACACTTCGAGGCTTTGGCAGCAGTGGACAATTAGAGACCATGGAGAAATGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTTTGGATTCATACGGCAAAGCTAAAAGCTATGAGAAAATGAGTGCTGTTATGGAGTATATGCAGAAGTACCATTATTCATGGACAATCGTAACCTACAACATCGTTATCGACGCGTTCGGGAGGGCTGGGAATTTGAAACAGATGGAGCATCTATTTCGCCTTATGAGATCAGAGAGGATCCAACCGAGTTGTGTAACACTTTGCTCGCTCGTAAAGGCCTACGGTCAAGCAGGAAAACGCGACAAAATCGACAGTGTACTGCACATAGTCGAGAATTCCAATATAACGCTGGATACCGTCTTTTACAACTGTCTCGTGGACGCTTACGGGCGAATGGAATGTTTCGCAGAGATGAAGGCGGTACTCGGAATGATGGAGCAGAGAGGCTGCAAGCCTGATAAGACTACCTACAGGATCATGGCTCGAGCTTATTCAGATGGAGGAATGGCCAACCATGCCAGGGAGATCCATGATCTTGTAAGCACAGCAGAAGCAAGTAAGAGAACTCGTCCTGACTTATGATATTAATCTCTTATCATATGTAATGAAATAAATCTTAGTATTGATAATTAAAGGAGAAATTTAAAAATTGGATTTGGGCTTTGTGGGGTGGGCCAGGCCCATGCGGTAGTGCAAGGCAAGGGCAACGCTCAAAGGCCCACAATAGCAAGCTAATGTGTTGGGCCTTCCCCCGAGAAAAATTAATTTAATATCATGTGGGCCGATGGCCCACGTATCAGATATTAAACTGATAAGAACAGATACTACACTTGATCTTAGCCAAAAGGCCGAGAAAGGTATGAAGATTTTAAGTCACATGATTCCCCTTTTAT

mRNA sequence

GGGGAAAAAAAGGATAAAATATATAATGTCACACCTATCGGAGATGGGATACTAAGGCCGTGGATGCGAGGCGAATGGCCTGAGAATCTCAAAGCTCAGCATCCACTATCCATGGCCGCCCACTCCACTCCCTCCATCCTTCTTCCACCGGCAGTCAGCCATTCAAATCCTAAGAACCGTGTATTTCTTTGTCAACAACGAGCGAGCCATCCCAATTTCGAATCTCCCGCCCCATCTTCCTCTTCGTCTTCTACAGATGGTAAGCTCATGCAGAGAAGTCCTCGCGAAGGACGCATGGACATCGCAAAGCTGAAGGCGAAGGAAGCGGGCGAGAGAAAAGAGGAGGTCAACAGGAAGATTGCTTCTCAGAAAGCCATTTCTGTGATTTTGCGCAGGGAAGCCACGAAGGCCGTCATTGAGAGGAAGAGAGGCCCCACTAACTCTAAGAAACTGCTTCCGCGGACTGTTCTTGAGGCTCTCCATGAACGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTTTTTGAACTACTACGCGAACAATTGTGGTACAGACCTTACGCCGGGATGTACATCAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCTGAAAAGGCCTATGAGCTGTTCCAAGAAATGATCGAGGAAGGCTGTGAAGTTAGCCATGAGTCCTATACTGCTCTCCTGTCGGCCTATAGTAGGAGTGGTCTTCTTGACAGGGCATTTTCACTCCTCAACGAGATGAAAAACAGTCCTGATTGCCAGCCTGACGTTCACACTTACTCTATCCTCATAAAATCATGTTTGCAGGTGTTTGCATTCAGCGAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCGAACACTATTACATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTTGTGGAAATGCTGAGTGATGACGGATGTAAGCCCGACGTTTGGACCATGAACTCGACACTTCGAGGCTTTGGCAGCAGTGGACAATTAGAGACCATGGAGAAATGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTTTGGATTCATACGGCAAAGCTAAAAGCTATGAGAAAATGAGTGCTGTTATGGAGTATATGCAGAAGTACCATTATTCATGGACAATCGTAACCTACAACATCGTTATCGACGCGTTCGGGAGGGCTGGGAATTTGAAACAGATGGAGCATCTATTTCGCCTTATGAGATCAGAGAGGATCCAACCGAGTTGTGTAACACTTTGCTCGCTCGTAAAGGCCTACGGTCAAGCAGGAAAACGCGACAAAATCGACAGTGTACTGCACATAGTCGAGAATTCCAATATAACGCTGGATACCGTCTTTTACAACTGTCTCGTGGACGCTTACGGGCGAATGGAATGTTTCGCAGAGATGAAGGCGGTACTCGGAATGATGGAGCAGAGAGGCTGCAAGCCTGATAAGACTACCTACAGGATCATGGCTCGAGCTTATTCAGATGGAGGAATGGCCAACCATGCCAGGGAGATCCATGATCTTGTAAGCACAGCAGAAGCAAGTAAGAGAACTCGTCCTGACTTATGATATTAATCTCTTATCATATGTAATGAAATAAATCTTAGTATTGATAATTAAAGGAGAAATTTAAAAATTGGATTTGGGCTTTGTGGGGTGGGCCAGGCCCATGCGGTAGTGCAAGGCAAGGGCAACGCTCAAAGGCCCACAATAGCAAGCTAATGTGTTGGGCCTTCCCCCGAGAAAAATTAATTTAATATCATGTGGGCCGATGGCCCACGTATCAGATATTAAACTGATAAGAACAGATACTACACTTGATCTTAGCCAAAAGGCCGAGAAAGGTATGAAGATTTTAAGTCACATGATTCCCCTTTTAT

Coding sequence (CDS)

ATGCGAGGCGAATGGCCTGAGAATCTCAAAGCTCAGCATCCACTATCCATGGCCGCCCACTCCACTCCCTCCATCCTTCTTCCACCGGCAGTCAGCCATTCAAATCCTAAGAACCGTGTATTTCTTTGTCAACAACGAGCGAGCCATCCCAATTTCGAATCTCCCGCCCCATCTTCCTCTTCGTCTTCTACAGATGGTAAGCTCATGCAGAGAAGTCCTCGCGAAGGACGCATGGACATCGCAAAGCTGAAGGCGAAGGAAGCGGGCGAGAGAAAAGAGGAGGTCAACAGGAAGATTGCTTCTCAGAAAGCCATTTCTGTGATTTTGCGCAGGGAAGCCACGAAGGCCGTCATTGAGAGGAAGAGAGGCCCCACTAACTCTAAGAAACTGCTTCCGCGGACTGTTCTTGAGGCTCTCCATGAACGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTTTTTGAACTACTACGCGAACAATTGTGGTACAGACCTTACGCCGGGATGTACATCAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCTGAAAAGGCCTATGAGCTGTTCCAAGAAATGATCGAGGAAGGCTGTGAAGTTAGCCATGAGTCCTATACTGCTCTCCTGTCGGCCTATAGTAGGAGTGGTCTTCTTGACAGGGCATTTTCACTCCTCAACGAGATGAAAAACAGTCCTGATTGCCAGCCTGACGTTCACACTTACTCTATCCTCATAAAATCATGTTTGCAGGTGTTTGCATTCAGCGAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCGAACACTATTACATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTTGTGGAAATGCTGAGTGATGACGGATGTAAGCCCGACGTTTGGACCATGAACTCGACACTTCGAGGCTTTGGCAGCAGTGGACAATTAGAGACCATGGAGAAATGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTTTGGATTCATACGGCAAAGCTAAAAGCTATGAGAAAATGAGTGCTGTTATGGAGTATATGCAGAAGTACCATTATTCATGGACAATCGTAACCTACAACATCGTTATCGACGCGTTCGGGAGGGCTGGGAATTTGAAACAGATGGAGCATCTATTTCGCCTTATGAGATCAGAGAGGATCCAACCGAGTTGTGTAACACTTTGCTCGCTCGTAAAGGCCTACGGTCAAGCAGGAAAACGCGACAAAATCGACAGTGTACTGCACATAGTCGAGAATTCCAATATAACGCTGGATACCGTCTTTTACAACTGTCTCGTGGACGCTTACGGGCGAATGGAATGTTTCGCAGAGATGAAGGCGGTACTCGGAATGATGGAGCAGAGAGGCTGCAAGCCTGATAAGACTACCTACAGGATCATGGCTCGAGCTTATTCAGATGGAGGAATGGCCAACCATGCCAGGGAGATCCATGATCTTGTAAGCACAGCAGAAGCAAGTAAGAGAACTCGTCCTGACTTATGA

Protein sequence

MRGEWPENLKAQHPLSMAAHSTPSILLPPAVSHSNPKNRVFLCQQRASHPNFESPAPSSSSSSTDGKLMQRSPREGRMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVSTAEASKRTRPDL
BLAST of Cp4.1LG12g11100 vs. Swiss-Prot
Match: PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 642.5 bits (1656), Expect = 4.0e-183
Identity = 328/498 (65.86%), Postives = 395/498 (79.32%), Query Frame = 1

Query: 17  MAAHSTPSILLPPAVSHSNPKNRVFLCQQRASHPNFESPA-PSSSSSSTDGKLMQRSPRE 76
           M + ST +   PP  ++     R F  +  +  P   + A  S  S++T   L +    +
Sbjct: 1   MVSLSTSTSHAPPLPTNRRTAERTFTVRCISISPREPNYAITSDKSNNTSLSLRETRQSK 60

Query: 77  GRMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTV 136
             ++   +  +++ E KE+ N KIAS+KAIS+ILRREATK++IE+K+G   SKKLLPRTV
Sbjct: 61  WLINAEDVNERDSKEIKEDKNTKIASRKAISIILRREATKSIIEKKKG---SKKLLPRTV 120

Query: 137 LEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI 196
           LE+LHERITALRWESA++VFELLREQLWY+P  G+Y+KLIVMLGKCKQPEKA+ELFQEMI
Sbjct: 121 LESLHERITALRWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMI 180

Query: 197 EEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAF 256
            EGC V+HE YTAL+SAYSRSG  D AF+LL  MK+S +CQPDVHTYSILIKS LQVFAF
Sbjct: 181 NEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAF 240

Query: 257 SEAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNS 316
            + Q LLSDM  +GI+PNTITYNT IDAYGKAKMF EMES L++ML +D CKPD WTMNS
Sbjct: 241 DKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMFVEMESTLIQMLGEDDCKPDSWTMNS 300

Query: 317 TLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYH 376
           TLR FG +GQ+E ME CYEKFQ +GI+PNI+TFNILLDSYGK+ +Y+KMSAVMEYMQKYH
Sbjct: 301 TLRAFGGNGQIEMMENCYEKFQSSGIEPNIRTFNILLDSYGKSGNYKKMSAVMEYMQKYH 360

Query: 377 YSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKID 436
           YSWTIVTYN+VIDAFGRAG+LKQME+LFRLM+SERI PSCVTLCSLV+AYG+A K DKI 
Sbjct: 361 YSWTIVTYNVVIDAFGRAGDLKQMEYLFRLMQSERIFPSCVTLCSLVRAYGRASKADKIG 420

Query: 437 SVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAY 496
            VL  +ENS+I LD VF+NCLVDAYGRME FAEMK VL +ME++G KPDK TYR M +AY
Sbjct: 421 GVLRFIENSDIRLDLVFFNCLVDAYGRMEKFAEMKGVLELMEKKGFKPDKITYRTMVKAY 480

Query: 497 SDGGMANHAREIHDLVST 514
              GM  H +E+H +V +
Sbjct: 481 RISGMTTHVKELHGVVES 495

BLAST of Cp4.1LG12g11100 vs. Swiss-Prot
Match: PP279_ARATH (Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN=At3g53170 PE=3 SV=1)

HSP 1 Score: 375.2 bits (962), Expect = 1.2e-102
Identity = 195/438 (44.52%), Postives = 285/438 (65.07%), Query Frame = 1

Query: 86  KEAGERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 145
           K   ER E++N  + S       +K +S ILR +A    IERK        L P+ VLEA
Sbjct: 5   KVPNERTEKMNSGLISTRHQVDPKKELSRILRTDAAVKGIERKANSEKYLTLWPKAVLEA 64

Query: 146 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 205
           L E I   RW+SALK+F LLR+Q WY P    Y KL  +LG CKQP++A  LF+ M+ EG
Sbjct: 65  LDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEG 124

Query: 206 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEA 265
            + + + YT+L+S Y +S LLD+AFS L  MK+  DC+PDV T+++LI  C ++  F   
Sbjct: 125 LKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLISCCCKLGRFDLV 184

Query: 266 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 325
           ++++ +M   G+  +T+TYNT ID YGKA MF EMES+L +M+ D    PDV T+NS + 
Sbjct: 185 KSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNSIIG 244

Query: 326 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 385
            +G+   +  ME  Y +FQ  G+QP+I TFNIL+ S+GKA  Y+KM +VM++M+K  +S 
Sbjct: 245 SYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILSFGKAGMYKKMCSVMDFMEKRFFSL 304

Query: 386 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVL 445
           T VTYNIVI+ FG+AG +++M+ +FR M+ + ++P+ +T CSLV AY +AG   KIDSVL
Sbjct: 305 TTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPNSITYCSLVNAYSKAGLVVKIDSVL 364

Query: 446 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 505
             + NS++ LDT F+NC+++AYG+    A MK +   ME+R CKPDK T+  M + Y+  
Sbjct: 365 RQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYIQMEERKCKPDKITFATMIKTYTAH 424

Query: 506 GMANHAREIH-DLVSTAE 516
           G+ +  +E+   ++S+ E
Sbjct: 425 GIFDAVQELEKQMISSGE 442

BLAST of Cp4.1LG12g11100 vs. Swiss-Prot
Match: PP216_ARATH (Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidopsis thaliana GN=EMB2750 PE=2 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 1.9e-92
Identity = 177/396 (44.70%), Postives = 243/396 (61.36%), Query Frame = 1

Query: 114 TKAVIERKRGPTNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWY 173
           T+ V +R+    N KK L R         TV E L + I   +W  AL+VF++LREQ +Y
Sbjct: 61  TEPVNQRRTPIKNVKKKLDRRSKANGWVNTVTETLSDLIAKKQWLQALEVFDMLREQTFY 120

Query: 174 RPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFS 233
           +P  G Y+KL+V+LGK  QP +A +LF EM+EEG E + E YTALL+AY+RS L+D AFS
Sbjct: 121 QPKEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLEPTVELYTALLAAYTRSNLIDDAFS 180

Query: 234 LLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAY 293
           +L++MK+ P CQPDV TYS L+K+C+    F    +L  +M  R I PNT+T N  +  Y
Sbjct: 181 ILDKMKSFPQCQPDVFTYSTLLKACVDASQFDLVDSLYKEMDERLITPNTVTQNIVLSGY 240

Query: 294 GKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPN 353
           G+   F +ME +L +ML    CKPDVWTMN  L  FG+ G+++ ME  YEKF+  GI+P 
Sbjct: 241 GRVGRFDQMEKVLSDMLVSTACKPDVWTMNIILSVFGNMGKIDMMESWYEKFRNFGIEPE 300

Query: 354 IQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFR 413
            +TFNIL+ SYGK + Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F 
Sbjct: 301 TRTFNILIGSYGKKRMYDKMSSVMEYMRKLEFPWTTSTYNNIIEAFADVGDAKNMELTFD 360

Query: 414 LMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRME 473
            MRSE ++    T C L+  Y  AG   K+ S + +     I  +T FYN ++ A  + +
Sbjct: 361 QMRSEGMKADTKTFCCLINGYANAGLFHKVISSVQLAAKFEIPENTAFYNAVISACAKAD 420

Query: 474 CFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGM 501
              EM+ V   M++R C  D  T+ IM  AY   GM
Sbjct: 421 DLIEMERVYIRMKERQCVCDSRTFEIMVEAYEKEGM 456

BLAST of Cp4.1LG12g11100 vs. Swiss-Prot
Match: PP358_ARATH (Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidopsis thaliana GN=EMB2453 PE=2 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 3.3e-44
Identity = 103/344 (29.94%), Postives = 181/344 (52.62%), Query Frame = 1

Query: 147 RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESY 206
           +W   L+VF  +++Q WY P  G+Y KLI ++GK  Q   A  LF EM   GC      Y
Sbjct: 112 KWLQCLEVFRWMQKQRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVY 171

Query: 207 TALLSAY----SRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLL 266
            AL++A+     ++  L++    L++MK    CQP+V TY+IL+++  Q     +   L 
Sbjct: 172 NALITAHLHTRDKAKALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALF 231

Query: 267 SDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGS 326
            D+    + P+  T+N  +DAYGK  M  EME++L  M S++ CKPD+ T N  +  +G 
Sbjct: 232 KDLDMSPVSPDVYTFNGVMDAYGKNGMIKEMEAVLTRMRSNE-CKPDIITFNVLIDSYGK 291

Query: 327 SGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVT 386
             + E ME+ ++    +  +P + TFN ++ +YGKA+  +K   V + M   +Y  + +T
Sbjct: 292 KQEFEKMEQTFKSLMRSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFIT 351

Query: 387 YNIVIDAFGRAGNLKQMEHLF-RLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIV 446
           Y  +I  +G  G++ +   +F  +  S+R+  +  TL ++++ Y + G   + D + H  
Sbjct: 352 YECMIMMYGYCGSVSRAREIFEEVGESDRVLKAS-TLNAMLEVYCRNGLYIEADKLFHNA 411

Query: 447 ENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDK 486
               +  D   Y  L  AY + +   +++ ++  ME+ G  P+K
Sbjct: 412 SAFRVHPDASTYKFLYKAYTKADMKEQVQILMKKMEKDGIVPNK 453

BLAST of Cp4.1LG12g11100 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 3.2e-39
Identity = 103/339 (30.38%), Postives = 161/339 (47.49%), Query Frame = 1

Query: 171 YIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMK 230
           Y  L+ + GK  +P++A ++  EM+  G   S  +Y +L+SAY+R G+LD A  L N+M 
Sbjct: 317 YNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMA 376

Query: 231 NSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMF 290
                +PDV TY+ L+    +      A ++  +M   G KPN  T+N FI  YG    F
Sbjct: 377 EK-GTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKF 436

Query: 291 AEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNI 350
            EM  I  E ++  G  PD+ T N+ L  FG +G    +   +++ + AG  P  +TFN 
Sbjct: 437 TEMMKIFDE-INVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNT 496

Query: 351 LLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSER 410
           L+ +Y +  S+E+   V   M     +  + TYN V+ A  R G  +Q E +   M   R
Sbjct: 497 LISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGR 556

Query: 411 IQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMK 470
            +P+ +T CSL+ AY    +   + S+   V +  I    V    LV    + +   E +
Sbjct: 557 CKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAE 616

Query: 471 AVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHD 510
                +++RG  PD TT   M   Y    M   A  + D
Sbjct: 617 RAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLD 653

BLAST of Cp4.1LG12g11100 vs. TrEMBL
Match: A0A0A0K8C6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G342750 PE=4 SV=1)

HSP 1 Score: 887.1 bits (2291), Expect = 1.0e-254
Identity = 447/506 (88.34%), Postives = 477/506 (94.27%), Query Frame = 1

Query: 27  LPPAVSHSNP-KNRVFLCQQRASHPNFESPAPSSSSSST-------DGKLMQRSPREGRM 86
           LPP ++H NP  NR++L +Q+ +HP F+SPAPSSSSSS+       DGKLM+ SP EGRM
Sbjct: 6   LPPPINHPNPTNNRLYLRRQQRTHPKFQSPAPSSSSSSSSSTTTTADGKLMKISPHEGRM 65

Query: 87  DIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 146
           D+AKLKAKEA ERKEEVNRKIASQKAISVILRREATKAVIERKRGP NSKKLLPRTVLEA
Sbjct: 66  DVAKLKAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEA 125

Query: 147 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 206
           LH+RIT LRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQ EKAYELFQEMIEEG
Sbjct: 126 LHDRITTLRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQQEKAYELFQEMIEEG 185

Query: 207 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEA 266
           CEVSHESYTALLSAYSRSGLLD AFS+LNEMKNSPDCQPDVHTYSILIKSCLQVFAF++A
Sbjct: 186 CEVSHESYTALLSAYSRSGLLDEAFSILNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 245

Query: 267 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 326
           QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILV+ML+DDGCKPDVWTMNSTLR
Sbjct: 246 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVDMLNDDGCKPDVWTMNSTLR 305

Query: 327 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 386
            FG SGQLETMEKCYEKFQ AGIQP+IQTFNILLDSYGKA+SYEKMSAVMEYMQKYHYSW
Sbjct: 306 AFGRSGQLETMEKCYEKFQEAGIQPSIQTFNILLDSYGKAESYEKMSAVMEYMQKYHYSW 365

Query: 387 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVL 446
           TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERI+PSCVTLCSLV+AYGQAGKR+KIDSVL
Sbjct: 366 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIKPSCVTLCSLVRAYGQAGKREKIDSVL 425

Query: 447 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 506
           ++VENS+I LDTVFYNCLVDAYGR+ECFAEMK VLGMMEQRGCKPDKTTYR MARAYSDG
Sbjct: 426 NLVENSDIMLDTVFYNCLVDAYGRLECFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDG 485

Query: 507 GMANHAREIHDLVSTAEASKRTRPDL 525
           GMANHA+EI +L++TAE SKRTRPDL
Sbjct: 486 GMANHAKEIQELITTAEPSKRTRPDL 511

BLAST of Cp4.1LG12g11100 vs. TrEMBL
Match: B9SV96_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1303150 PE=4 SV=1)

HSP 1 Score: 706.1 bits (1821), Expect = 3.3e-200
Identity = 348/434 (80.18%), Postives = 389/434 (89.63%), Query Frame = 1

Query: 77  RMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVL 136
           RMD   +K KE  E KEE++RKIAS+KAISVILRREATKA+IE+KRGPTNSKKLLPRTVL
Sbjct: 64  RMDWEIVKKKEEKEGKEEMDRKIASRKAISVILRREATKAIIEKKRGPTNSKKLLPRTVL 123

Query: 137 EALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIE 196
           EALHERITALRWESALKVFELLREQ+WYRPY+GMYIKLIVMLGKCKQPEKA+ELFQ MI 
Sbjct: 124 EALHERITALRWESALKVFELLREQIWYRPYSGMYIKLIVMLGKCKQPEKAHELFQAMIH 183

Query: 197 EGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFS 256
           EGC+VSHESYTALLSAY RSGLLD+AFSLL EMK +PDCQPDVHTYSILIKSC+QVFAF 
Sbjct: 184 EGCDVSHESYTALLSAYGRSGLLDKAFSLLEEMKRNPDCQPDVHTYSILIKSCVQVFAFD 243

Query: 257 EAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNST 316
           +A+TLLS+M + GI PNTITYNT IDAYGKAKMF EME+ LV+MLS   C+PDVWTMNST
Sbjct: 244 KAKTLLSNMESLGISPNTITYNTLIDAYGKAKMFEEMEATLVKMLSQQNCEPDVWTMNST 303

Query: 317 LRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHY 376
           LR FG SGQ+ETMEKCYEKFQGAGI+P+I TFN+LLDSYGKA  Y+KMSAVMEYMQKYHY
Sbjct: 304 LRAFGISGQIETMEKCYEKFQGAGIEPSIMTFNVLLDSYGKAGDYKKMSAVMEYMQKYHY 363

Query: 377 SWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDS 436
           SWTI+TYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYGQA K +KI+ 
Sbjct: 364 SWTIITYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAEKPEKIEG 423

Query: 437 VLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYS 496
           VL  +ENS+ITLDTVF+NCLVDAYGRM CFAEMK VL +MEQ+G +PDK TYR M +AYS
Sbjct: 424 VLRFIENSDITLDTVFFNCLVDAYGRMGCFAEMKGVLILMEQKGYRPDKITYRTMIKAYS 483

Query: 497 DGGMANHAREIHDL 511
             GM  H +E+ DL
Sbjct: 484 SKGMTKHVKELQDL 497

BLAST of Cp4.1LG12g11100 vs. TrEMBL
Match: A0A067LME3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17186 PE=4 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 1.5e-197
Identity = 346/448 (77.23%), Postives = 388/448 (86.61%), Query Frame = 1

Query: 78  MDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLE 137
           +D  K+  +EA E+KEE NRKIAS+KAISVILRREATKA+IE+K+GPTNSKKLLPRTVLE
Sbjct: 72  IDWKKVNEREAREKKEEANRKIASRKAISVILRREATKAIIEKKKGPTNSKKLLPRTVLE 131

Query: 138 ALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEE 197
           ALH+RITALRWESALKVFELLREQLWYRP  GMYIKLIVMLGKCKQPEKA+ELF  MI E
Sbjct: 132 ALHDRITALRWESALKVFELLREQLWYRPSPGMYIKLIVMLGKCKQPEKAHELFDAMIAE 191

Query: 198 GCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSE 257
           GC VS ESYTALLSAY RSGLLD AFSLL EMKN+PDC+PDVHTYSILIKSCLQVFAF +
Sbjct: 192 GCVVSRESYTALLSAYGRSGLLDEAFSLLEEMKNNPDCRPDVHTYSILIKSCLQVFAFDK 251

Query: 258 AQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTL 317
           AQ LLSDM   G++ NTITYNT IDAYGKAKMFAEME+ LV+MLS+  C+PDVWTMNSTL
Sbjct: 252 AQELLSDMAPLGVRANTITYNTLIDAYGKAKMFAEMEATLVKMLSEQNCEPDVWTMNSTL 311

Query: 318 RGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYS 377
           R FG SGQ+E ME CYEKFQ AGI+PNI+TFNILLDSYGKA  Y KMSAVMEYMQKYHYS
Sbjct: 312 RAFGGSGQIEMMETCYEKFQSAGIEPNIKTFNILLDSYGKAGDYRKMSAVMEYMQKYHYS 371

Query: 378 WTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSV 437
           WTIVTYN+VIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYG+AG+ +KI  V
Sbjct: 372 WTIVTYNVVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGEAGEPEKIGGV 431

Query: 438 LHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSD 497
           L  +ENS+ITLD VF+NCLVDAYG++ CF EMK VL +MEQ+GCK DK TYR M  AYS 
Sbjct: 432 LRFIENSDITLDIVFFNCLVDAYGKLGCFVEMKGVLELMEQKGCKADKITYRTMINAYSS 491

Query: 498 GGMANHAREIHDLVSTAEASK--RTRPD 524
            GM  HA+E+ DLV +AE  +  R +PD
Sbjct: 492 KGMTKHAKELQDLVVSAERPRLHRNKPD 519

BLAST of Cp4.1LG12g11100 vs. TrEMBL
Match: V4TYK6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019744mg PE=4 SV=1)

HSP 1 Score: 694.1 bits (1790), Expect = 1.3e-196
Identity = 358/507 (70.61%), Postives = 414/507 (81.66%), Query Frame = 1

Query: 23  PSILLPPAVSHSNPKNRVFLCQQRASHPNFES---PAPSSSSSSTDGKLMQRSPREGRM- 82
           P+I   P  ++S P   V       S P  E    P    SS++  G   +     G + 
Sbjct: 14  PAIKRCPTTTNSKP---VLSTSVPLSKPEIEPRKRPHHIISSNNHSGNAGKTPQSRGTLL 73

Query: 83  DIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 142
           D+ KLK +E  ERKEEVNRKIAS+KAISVILRREATKAVIE+KRGP NSKKLLPRTVLEA
Sbjct: 74  DLEKLKEREEKERKEEVNRKIASKKAISVILRREATKAVIEKKRGPVNSKKLLPRTVLEA 133

Query: 143 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 202
           L+ERITALRWESALKVFELLREQLWY+P A +Y+KLIVMLGKCKQPEKA+ELFQ M++EG
Sbjct: 134 LNERITALRWESALKVFELLREQLWYKPNAAVYVKLIVMLGKCKQPEKAHELFQAMVDEG 193

Query: 203 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEA 262
           C+ + +S+TALLSAY RSGL D+AFSLL  MKN+PDCQPDV+TYSILIKSCL+ FAF + 
Sbjct: 194 CDANTQSFTALLSAYGRSGLFDKAFSLLEHMKNTPDCQPDVNTYSILIKSCLKAFAFDKV 253

Query: 263 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 322
           Q LLSDM T+GI+PNT+TYNT IDAYG+AKMFAEME  LV+MLS+D C+PDVWTMN TLR
Sbjct: 254 QALLSDMSTQGIRPNTVTYNTLIDAYGRAKMFAEMELTLVKMLSED-CEPDVWTMNCTLR 313

Query: 323 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 382
            FG+SGQ++TMEKCYEKFQ AGIQP+I TFNILLDSYGKA  +EKMSAVMEYMQKYHYSW
Sbjct: 314 AFGNSGQIDTMEKCYEKFQSAGIQPSINTFNILLDSYGKAGHFEKMSAVMEYMQKYHYSW 373

Query: 383 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVL 442
           TIVTYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYG AGK +K+ SVL
Sbjct: 374 TIVTYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGHAGKPEKLGSVL 433

Query: 443 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 502
             ++NS+I LDTVF+NCLVDAYGR++CFAEMK VL +M+QRGCKPDK TYR M RAYS  
Sbjct: 434 RFIDNSDIMLDTVFFNCLVDAYGRLKCFAEMKGVLEVMQQRGCKPDKVTYRTMVRAYSTN 493

Query: 503 GMANHAREIHDLVSTAEAS--KRTRPD 524
           GM NHA+E  DLV   + +     RPD
Sbjct: 494 GMKNHAKEFQDLVEKMDETCLAMKRPD 516

BLAST of Cp4.1LG12g11100 vs. TrEMBL
Match: A0A0D2QTF2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G155400 PE=4 SV=1)

HSP 1 Score: 694.1 bits (1790), Expect = 1.3e-196
Identity = 350/477 (73.38%), Postives = 401/477 (84.07%), Query Frame = 1

Query: 47  ASHPNFESPAPSSSSSSTDGKLMQRSPREGRMDIAK-LKAKEAGERKEEVNRKIASQKAI 106
           AS     +P       +T   + +R   +GR+ IA+ LK +E   R+EEV+RKIAS+KAI
Sbjct: 33  ASKSETSAPNKDEEEETTSLTVSERGREKGRLVIAEELKQRETRGRREEVSRKIASRKAI 92

Query: 107 SVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYR 166
           SVILRREATKA IE+KRGP NSKKLLPRTVLE+LHERITALRWESALKVFELLREQLWYR
Sbjct: 93  SVILRREATKAFIEKKRGPNNSKKLLPRTVLESLHERITALRWESALKVFELLREQLWYR 152

Query: 167 PYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSL 226
           P A +YIKLIV+LGKCKQP+KAYELFQ M +EGC ++HE+YTALLSAYSRSGL D+AFSL
Sbjct: 153 PNAAIYIKLIVLLGKCKQPDKAYELFQAMSDEGCVMNHEAYTALLSAYSRSGLFDKAFSL 212

Query: 227 LNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAYG 286
           L EMK++P C PDV TYSILIKSCLQVFAF E + LLSDM ++GI+PNT+TYNT IDAYG
Sbjct: 213 LEEMKDTPICHPDVQTYSILIKSCLQVFAFDEVRALLSDMASQGIRPNTVTYNTLIDAYG 272

Query: 287 KAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNI 346
           KAKMF EME  LVEML    C+PDVWTMNST+R FGSSGQ+ETMEKCYEKFQ AGIQPNI
Sbjct: 273 KAKMFQEMEMTLVEMLRGKDCEPDVWTMNSTIRAFGSSGQIETMEKCYEKFQSAGIQPNI 332

Query: 347 QTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRL 406
           +TFNILLDSYGK  +YEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAG+LKQME+LFRL
Sbjct: 333 KTFNILLDSYGKTGNYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGDLKQMEYLFRL 392

Query: 407 MRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRMEC 466
           MRSERI+PSCVTLCSLV+AYGQAGK +KI  VL I+ENS++TLD VF+NCLVDAYGRM C
Sbjct: 393 MRSERIKPSCVTLCSLVRAYGQAGKAEKIAGVLRIIENSDVTLDIVFFNCLVDAYGRMGC 452

Query: 467 FAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVSTAEASKRTRP 523
           FAEMK VL MM+Q+G KPDK TYR M +AYS  GM +HA+E+ +LV +A  S    P
Sbjct: 453 FAEMKGVLEMMKQKGYKPDKITYRTMIKAYSISGMTSHAKELRNLVESAAGSSLGMP 509

BLAST of Cp4.1LG12g11100 vs. TAIR10
Match: AT5G48730.1 (AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 642.5 bits (1656), Expect = 2.3e-184
Identity = 328/498 (65.86%), Postives = 395/498 (79.32%), Query Frame = 1

Query: 17  MAAHSTPSILLPPAVSHSNPKNRVFLCQQRASHPNFESPA-PSSSSSSTDGKLMQRSPRE 76
           M + ST +   PP  ++     R F  +  +  P   + A  S  S++T   L +    +
Sbjct: 1   MVSLSTSTSHAPPLPTNRRTAERTFTVRCISISPREPNYAITSDKSNNTSLSLRETRQSK 60

Query: 77  GRMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTV 136
             ++   +  +++ E KE+ N KIAS+KAIS+ILRREATK++IE+K+G   SKKLLPRTV
Sbjct: 61  WLINAEDVNERDSKEIKEDKNTKIASRKAISIILRREATKSIIEKKKG---SKKLLPRTV 120

Query: 137 LEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI 196
           LE+LHERITALRWESA++VFELLREQLWY+P  G+Y+KLIVMLGKCKQPEKA+ELFQEMI
Sbjct: 121 LESLHERITALRWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMI 180

Query: 197 EEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAF 256
            EGC V+HE YTAL+SAYSRSG  D AF+LL  MK+S +CQPDVHTYSILIKS LQVFAF
Sbjct: 181 NEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAF 240

Query: 257 SEAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNS 316
            + Q LLSDM  +GI+PNTITYNT IDAYGKAKMF EMES L++ML +D CKPD WTMNS
Sbjct: 241 DKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMFVEMESTLIQMLGEDDCKPDSWTMNS 300

Query: 317 TLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYH 376
           TLR FG +GQ+E ME CYEKFQ +GI+PNI+TFNILLDSYGK+ +Y+KMSAVMEYMQKYH
Sbjct: 301 TLRAFGGNGQIEMMENCYEKFQSSGIEPNIRTFNILLDSYGKSGNYKKMSAVMEYMQKYH 360

Query: 377 YSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKID 436
           YSWTIVTYN+VIDAFGRAG+LKQME+LFRLM+SERI PSCVTLCSLV+AYG+A K DKI 
Sbjct: 361 YSWTIVTYNVVIDAFGRAGDLKQMEYLFRLMQSERIFPSCVTLCSLVRAYGRASKADKIG 420

Query: 437 SVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAY 496
            VL  +ENS+I LD VF+NCLVDAYGRME FAEMK VL +ME++G KPDK TYR M +AY
Sbjct: 421 GVLRFIENSDIRLDLVFFNCLVDAYGRMEKFAEMKGVLELMEKKGFKPDKITYRTMVKAY 480

Query: 497 SDGGMANHAREIHDLVST 514
              GM  H +E+H +V +
Sbjct: 481 RISGMTTHVKELHGVVES 495

BLAST of Cp4.1LG12g11100 vs. TAIR10
Match: AT3G53170.1 (AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 376.3 bits (965), Expect = 3.0e-104
Identity = 194/441 (43.99%), Postives = 286/441 (64.85%), Query Frame = 1

Query: 86  KEAGERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 145
           K   ER E++N  + S       +K +S ILR +A    IERK        L P+ VLEA
Sbjct: 55  KVPNERTEKMNSGLISTRHQVDPKKELSRILRTDAAVKGIERKANSEKYLTLWPKAVLEA 114

Query: 146 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 205
           L E I   RW+SALK+F LLR+Q WY P    Y KL  +LG CKQP++A  LF+ M+ EG
Sbjct: 115 LDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEG 174

Query: 206 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEA 265
            + + + YT+L+S Y +S LLD+AFS L  MK+  DC+PDV T+++LI  C ++  F   
Sbjct: 175 LKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLISCCCKLGRFDLV 234

Query: 266 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 325
           ++++ +M   G+  +T+TYNT ID YGKA MF EMES+L +M+ D    PDV T+NS + 
Sbjct: 235 KSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNSIIG 294

Query: 326 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 385
            +G+   +  ME  Y +FQ  G+QP+I TFNIL+ S+GKA  Y+KM +VM++M+K  +S 
Sbjct: 295 SYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILSFGKAGMYKKMCSVMDFMEKRFFSL 354

Query: 386 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVL 445
           T VTYNIVI+ FG+AG +++M+ +FR M+ + ++P+ +T CSLV AY +AG   KIDSVL
Sbjct: 355 TTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPNSITYCSLVNAYSKAGLVVKIDSVL 414

Query: 446 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 505
             + NS++ LDT F+NC+++AYG+    A MK +   ME+R CKPDK T+  M + Y+  
Sbjct: 415 RQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYIQMEERKCKPDKITFATMIKTYTAH 474

Query: 506 GMANHAREIHDLVSTAEASKR 520
           G+ +  +E+   + +++  K+
Sbjct: 475 GIFDAVQELEKQMISSDIGKK 495

BLAST of Cp4.1LG12g11100 vs. TAIR10
Match: AT3G06430.1 (AT3G06430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 341.3 bits (874), Expect = 1.1e-93
Identity = 177/396 (44.70%), Postives = 243/396 (61.36%), Query Frame = 1

Query: 114 TKAVIERKRGPTNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWY 173
           T+ V +R+    N KK L R         TV E L + I   +W  AL+VF++LREQ +Y
Sbjct: 61  TEPVNQRRTPIKNVKKKLDRRSKANGWVNTVTETLSDLIAKKQWLQALEVFDMLREQTFY 120

Query: 174 RPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFS 233
           +P  G Y+KL+V+LGK  QP +A +LF EM+EEG E + E YTALL+AY+RS L+D AFS
Sbjct: 121 QPKEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLEPTVELYTALLAAYTRSNLIDDAFS 180

Query: 234 LLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAY 293
           +L++MK+ P CQPDV TYS L+K+C+    F    +L  +M  R I PNT+T N  +  Y
Sbjct: 181 ILDKMKSFPQCQPDVFTYSTLLKACVDASQFDLVDSLYKEMDERLITPNTVTQNIVLSGY 240

Query: 294 GKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPN 353
           G+   F +ME +L +ML    CKPDVWTMN  L  FG+ G+++ ME  YEKF+  GI+P 
Sbjct: 241 GRVGRFDQMEKVLSDMLVSTACKPDVWTMNIILSVFGNMGKIDMMESWYEKFRNFGIEPE 300

Query: 354 IQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFR 413
            +TFNIL+ SYGK + Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F 
Sbjct: 301 TRTFNILIGSYGKKRMYDKMSSVMEYMRKLEFPWTTSTYNNIIEAFADVGDAKNMELTFD 360

Query: 414 LMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRME 473
            MRSE ++    T C L+  Y  AG   K+ S + +     I  +T FYN ++ A  + +
Sbjct: 361 QMRSEGMKADTKTFCCLINGYANAGLFHKVISSVQLAAKFEIPENTAFYNAVISACAKAD 420

Query: 474 CFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGM 501
              EM+ V   M++R C  D  T+ IM  AY   GM
Sbjct: 421 DLIEMERVYIRMKERQCVCDSRTFEIMVEAYEKEGM 456

BLAST of Cp4.1LG12g11100 vs. TAIR10
Match: AT4G39620.1 (AT4G39620.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 181.0 bits (458), Expect = 1.9e-45
Identity = 103/344 (29.94%), Postives = 181/344 (52.62%), Query Frame = 1

Query: 147 RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESY 206
           +W   L+VF  +++Q WY P  G+Y KLI ++GK  Q   A  LF EM   GC      Y
Sbjct: 112 KWLQCLEVFRWMQKQRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVY 171

Query: 207 TALLSAY----SRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLL 266
            AL++A+     ++  L++    L++MK    CQP+V TY+IL+++  Q     +   L 
Sbjct: 172 NALITAHLHTRDKAKALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALF 231

Query: 267 SDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGS 326
            D+    + P+  T+N  +DAYGK  M  EME++L  M S++ CKPD+ T N  +  +G 
Sbjct: 232 KDLDMSPVSPDVYTFNGVMDAYGKNGMIKEMEAVLTRMRSNE-CKPDIITFNVLIDSYGK 291

Query: 327 SGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVT 386
             + E ME+ ++    +  +P + TFN ++ +YGKA+  +K   V + M   +Y  + +T
Sbjct: 292 KQEFEKMEQTFKSLMRSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFIT 351

Query: 387 YNIVIDAFGRAGNLKQMEHLF-RLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIV 446
           Y  +I  +G  G++ +   +F  +  S+R+  +  TL ++++ Y + G   + D + H  
Sbjct: 352 YECMIMMYGYCGSVSRAREIFEEVGESDRVLKAS-TLNAMLEVYCRNGLYIEADKLFHNA 411

Query: 447 ENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDK 486
               +  D   Y  L  AY + +   +++ ++  ME+ G  P+K
Sbjct: 412 SAFRVHPDASTYKFLYKAYTKADMKEQVQILMKKMEKDGIVPNK 453

BLAST of Cp4.1LG12g11100 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 164.5 bits (415), Expect = 1.8e-40
Identity = 103/339 (30.38%), Postives = 161/339 (47.49%), Query Frame = 1

Query: 171 YIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMK 230
           Y  L+ + GK  +P++A ++  EM+  G   S  +Y +L+SAY+R G+LD A  L N+M 
Sbjct: 317 YNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMA 376

Query: 231 NSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMF 290
                +PDV TY+ L+    +      A ++  +M   G KPN  T+N FI  YG    F
Sbjct: 377 EK-GTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKF 436

Query: 291 AEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNI 350
            EM  I  E ++  G  PD+ T N+ L  FG +G    +   +++ + AG  P  +TFN 
Sbjct: 437 TEMMKIFDE-INVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNT 496

Query: 351 LLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSER 410
           L+ +Y +  S+E+   V   M     +  + TYN V+ A  R G  +Q E +   M   R
Sbjct: 497 LISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGR 556

Query: 411 IQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMK 470
            +P+ +T CSL+ AY    +   + S+   V +  I    V    LV    + +   E +
Sbjct: 557 CKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAE 616

Query: 471 AVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHD 510
                +++RG  PD TT   M   Y    M   A  + D
Sbjct: 617 RAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLD 653

BLAST of Cp4.1LG12g11100 vs. NCBI nr
Match: gi|659116302|ref|XP_008458009.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucumis melo])

HSP 1 Score: 892.9 bits (2306), Expect = 2.7e-256
Identity = 448/501 (89.42%), Postives = 479/501 (95.61%), Query Frame = 1

Query: 27  LPPAVSHSNPKNRVFLCQQRASHPNFESPAPSSSSSS---TDGKLMQRSPREGRMDIAKL 86
           LPP ++H +P  R+FL +Q+ ++P F+SPAPSSSSSS   TDGKL++ SP EGRMD+AKL
Sbjct: 6   LPPPINHRDPNKRLFLRRQQPTYPKFQSPAPSSSSSSTTTTDGKLVKISPHEGRMDVAKL 65

Query: 87  KAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERI 146
           KAKEA ERKEEVNRKIASQKAISVILRREATKAVIERKRGP NSKKLLPRTVLEALH+RI
Sbjct: 66  KAKEAAERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEALHDRI 125

Query: 147 TALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSH 206
           TALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAY+LFQEMIEEGCEVSH
Sbjct: 126 TALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYKLFQEMIEEGCEVSH 185

Query: 207 ESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLS 266
           ESYTALLSAYSRSGLLD+AFS+LNEMKNSPDCQPDVHTYSILIKSCLQVFAF++AQTLLS
Sbjct: 186 ESYTALLSAYSRSGLLDKAFSILNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLS 245

Query: 267 DMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSS 326
           DMVT+GIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR FG S
Sbjct: 246 DMVTQGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRAFGHS 305

Query: 327 GQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTY 386
           GQ+ETMEKCYEKFQ AGIQPNIQTFNILLDSYGKA+SYEKMSAVMEYMQKYHYSWTIVTY
Sbjct: 306 GQIETMEKCYEKFQEAGIQPNIQTFNILLDSYGKAESYEKMSAVMEYMQKYHYSWTIVTY 365

Query: 387 NIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVEN 446
           NIVIDAFGRAGNLKQMEHLFRLMRSERI+PSCVTLCSLVKAYGQAGK +KI+SVL++VEN
Sbjct: 366 NIVIDAFGRAGNLKQMEHLFRLMRSERIKPSCVTLCSLVKAYGQAGKCEKINSVLNLVEN 425

Query: 447 SNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANH 506
           S+I LDTVFYNCLVDAYGRMECFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANH
Sbjct: 426 SDIMLDTVFYNCLVDAYGRMECFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANH 485

Query: 507 AREIHDLVSTAEASKRTRPDL 525
           A+EI +L++TAEASKRTRPDL
Sbjct: 486 AKEIQELITTAEASKRTRPDL 506

BLAST of Cp4.1LG12g11100 vs. NCBI nr
Match: gi|778726970|ref|XP_004139628.2| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucumis sativus])

HSP 1 Score: 887.1 bits (2291), Expect = 1.5e-254
Identity = 447/506 (88.34%), Postives = 477/506 (94.27%), Query Frame = 1

Query: 27  LPPAVSHSNP-KNRVFLCQQRASHPNFESPAPSSSSSST-------DGKLMQRSPREGRM 86
           LPP ++H NP  NR++L +Q+ +HP F+SPAPSSSSSS+       DGKLM+ SP EGRM
Sbjct: 6   LPPPINHPNPTNNRLYLRRQQRTHPKFQSPAPSSSSSSSSSTTTTADGKLMKISPHEGRM 65

Query: 87  DIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 146
           D+AKLKAKEA ERKEEVNRKIASQKAISVILRREATKAVIERKRGP NSKKLLPRTVLEA
Sbjct: 66  DVAKLKAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEA 125

Query: 147 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 206
           LH+RIT LRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQ EKAYELFQEMIEEG
Sbjct: 126 LHDRITTLRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQQEKAYELFQEMIEEG 185

Query: 207 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEA 266
           CEVSHESYTALLSAYSRSGLLD AFS+LNEMKNSPDCQPDVHTYSILIKSCLQVFAF++A
Sbjct: 186 CEVSHESYTALLSAYSRSGLLDEAFSILNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 245

Query: 267 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 326
           QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILV+ML+DDGCKPDVWTMNSTLR
Sbjct: 246 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVDMLNDDGCKPDVWTMNSTLR 305

Query: 327 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 386
            FG SGQLETMEKCYEKFQ AGIQP+IQTFNILLDSYGKA+SYEKMSAVMEYMQKYHYSW
Sbjct: 306 AFGRSGQLETMEKCYEKFQEAGIQPSIQTFNILLDSYGKAESYEKMSAVMEYMQKYHYSW 365

Query: 387 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVL 446
           TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERI+PSCVTLCSLV+AYGQAGKR+KIDSVL
Sbjct: 366 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIKPSCVTLCSLVRAYGQAGKREKIDSVL 425

Query: 447 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 506
           ++VENS+I LDTVFYNCLVDAYGR+ECFAEMK VLGMMEQRGCKPDKTTYR MARAYSDG
Sbjct: 426 NLVENSDIMLDTVFYNCLVDAYGRLECFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDG 485

Query: 507 GMANHAREIHDLVSTAEASKRTRPDL 525
           GMANHA+EI +L++TAE SKRTRPDL
Sbjct: 486 GMANHAKEIQELITTAEPSKRTRPDL 511

BLAST of Cp4.1LG12g11100 vs. NCBI nr
Match: gi|743833914|ref|XP_011024658.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Populus euphratica])

HSP 1 Score: 713.0 bits (1839), Expect = 3.9e-202
Identity = 354/481 (73.60%), Postives = 416/481 (86.49%), Query Frame = 1

Query: 47  ASHPNFESPAPSSSSSSTD--GKLMQRSPREGRMDIAKLKAKEAGERKEEVNRKIASQKA 106
           AS P  +S  PSSS+++    G+  +       ++  KLK KE  ERKEEVNRKIASQKA
Sbjct: 30  ASRPETKSNFPSSSTNNNVAIGRSERGEMTRREIEWEKLKKKEEKERKEEVNRKIASQKA 89

Query: 107 ISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWY 166
           ISVILRREATKAVIE+KRGPTNSKKLLP+TVLEALHERITALRW SAL+VFELLREQLWY
Sbjct: 90  ISVILRREATKAVIEKKRGPTNSKKLLPQTVLEALHERITALRWASALEVFELLREQLWY 149

Query: 167 RPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFS 226
           RPYAGMY+KLIVMLGKCKQP+KA++LFQ MI+EGC V+HESYTALLSAY RSGL D+AFS
Sbjct: 150 RPYAGMYVKLIVMLGKCKQPDKAHQLFQAMIDEGCAVTHESYTALLSAYGRSGLFDKAFS 209

Query: 227 LLNEMKNSPDCQPDVHTYSILIKSCLQVFAFSEAQTLLSDMVTRGIKPNTITYNTFIDAY 286
           ++ EMKN+PDC+PDVHTYSILIKSCLQVFAF + Q LLSDM + GI+PNT+TYNT IDAY
Sbjct: 210 IMEEMKNTPDCRPDVHTYSILIKSCLQVFAFDKVQVLLSDMESLGIRPNTVTYNTLIDAY 269

Query: 287 GKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPN 346
           GKAKMFAEME+ L+EMLS   C+PDVWTMNST+R FG SGQ+E ME CYEKFQ AGI+PN
Sbjct: 270 GKAKMFAEMEATLMEMLSQQDCEPDVWTMNSTIRAFGGSGQMEMMENCYEKFQSAGIEPN 329

Query: 347 IQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFR 406
           I+TFNILLDSYGKA +Y+KMSAVMEYMQ+YHYSWTIVTYN+VIDAFGRAG+LKQME+LFR
Sbjct: 330 IKTFNILLDSYGKAGNYQKMSAVMEYMQRYHYSWTIVTYNVVIDAFGRAGDLKQMEYLFR 389

Query: 407 LMRSERIQPSCVTLCSLVKAYGQAGKRDKIDSVLHIVENSNITLDTVFYNCLVDAYGRME 466
           LMRSERI+PSCVTLCSLV+AY +AGK +KI SVL  ++NS++TLDTVF+NCLVDAYGR+E
Sbjct: 390 LMRSERIKPSCVTLCSLVRAYREAGKPEKIRSVLRFIDNSDVTLDTVFFNCLVDAYGRLE 449

Query: 467 CFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVSTAEA--SKRTRP 524
           CFAEMK VL +ME++GCKPDK TYR M +AYS  GM +HA+E+ +L+ + E   S++ +P
Sbjct: 450 CFAEMKEVLELMEEKGCKPDKVTYRTMIKAYSIKGMTSHAKELRNLLGSVEVTRSQKKKP 509

BLAST of Cp4.1LG12g11100 vs. NCBI nr
Match: gi|1000945412|ref|XP_015581300.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Ricinus communis])

HSP 1 Score: 712.2 bits (1837), Expect = 6.6e-202
Identity = 353/449 (78.62%), Postives = 398/449 (88.64%), Query Frame = 1

Query: 77  RMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVL 136
           RMD   +K KE  E KEE++RKIAS+KAISVILRREATKA+IE+KRGPTNSKKLLPRTVL
Sbjct: 64  RMDWEIVKKKEEKEGKEEMDRKIASRKAISVILRREATKAIIEKKRGPTNSKKLLPRTVL 123

Query: 137 EALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIE 196
           EALHERITALRWESALKVFELLREQ+WYRPY+GMYIKLIVMLGKCKQPEKA+ELFQ MI 
Sbjct: 124 EALHERITALRWESALKVFELLREQIWYRPYSGMYIKLIVMLGKCKQPEKAHELFQAMIH 183

Query: 197 EGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFS 256
           EGC+VSHESYTALLSAY RSGLLD+AFSLL EMK +PDCQPDVHTYSILIKSC+QVFAF 
Sbjct: 184 EGCDVSHESYTALLSAYGRSGLLDKAFSLLEEMKRNPDCQPDVHTYSILIKSCVQVFAFD 243

Query: 257 EAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNST 316
           +A+TLLS+M + GI PNTITYNT IDAYGKAKMF EME+ LV+MLS   C+PDVWTMNST
Sbjct: 244 KAKTLLSNMESLGISPNTITYNTLIDAYGKAKMFEEMEATLVKMLSQQNCEPDVWTMNST 303

Query: 317 LRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHY 376
           LR FG SGQ+ETMEKCYEKFQGAGI+P+I TFN+LLDSYGKA  Y+KMSAVMEYMQKYHY
Sbjct: 304 LRAFGISGQIETMEKCYEKFQGAGIEPSIMTFNVLLDSYGKAGDYKKMSAVMEYMQKYHY 363

Query: 377 SWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDS 436
           SWTI+TYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYGQA K +KI+ 
Sbjct: 364 SWTIITYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAEKPEKIEG 423

Query: 437 VLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYS 496
           VL  +ENS+ITLDTVF+NCLVDAYGRM CFAEMK VL +MEQ+G +PDK TYR M +AYS
Sbjct: 424 VLRFIENSDITLDTVFFNCLVDAYGRMGCFAEMKGVLILMEQKGYRPDKITYRTMIKAYS 483

Query: 497 DGGMANHAREIHDLVSTAEASK--RTRPD 524
             GM  H +E+ DLV++ E  +  R +PD
Sbjct: 484 SKGMTKHVKELQDLVASVEGPQLHRKKPD 512

BLAST of Cp4.1LG12g11100 vs. NCBI nr
Match: gi|223530592|gb|EEF32469.1| (pentatricopeptide repeat-containing protein, putative [Ricinus communis])

HSP 1 Score: 706.1 bits (1821), Expect = 4.7e-200
Identity = 348/434 (80.18%), Postives = 389/434 (89.63%), Query Frame = 1

Query: 77  RMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVL 136
           RMD   +K KE  E KEE++RKIAS+KAISVILRREATKA+IE+KRGPTNSKKLLPRTVL
Sbjct: 64  RMDWEIVKKKEEKEGKEEMDRKIASRKAISVILRREATKAIIEKKRGPTNSKKLLPRTVL 123

Query: 137 EALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIE 196
           EALHERITALRWESALKVFELLREQ+WYRPY+GMYIKLIVMLGKCKQPEKA+ELFQ MI 
Sbjct: 124 EALHERITALRWESALKVFELLREQIWYRPYSGMYIKLIVMLGKCKQPEKAHELFQAMIH 183

Query: 197 EGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFS 256
           EGC+VSHESYTALLSAY RSGLLD+AFSLL EMK +PDCQPDVHTYSILIKSC+QVFAF 
Sbjct: 184 EGCDVSHESYTALLSAYGRSGLLDKAFSLLEEMKRNPDCQPDVHTYSILIKSCVQVFAFD 243

Query: 257 EAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNST 316
           +A+TLLS+M + GI PNTITYNT IDAYGKAKMF EME+ LV+MLS   C+PDVWTMNST
Sbjct: 244 KAKTLLSNMESLGISPNTITYNTLIDAYGKAKMFEEMEATLVKMLSQQNCEPDVWTMNST 303

Query: 317 LRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHY 376
           LR FG SGQ+ETMEKCYEKFQGAGI+P+I TFN+LLDSYGKA  Y+KMSAVMEYMQKYHY
Sbjct: 304 LRAFGISGQIETMEKCYEKFQGAGIEPSIMTFNVLLDSYGKAGDYKKMSAVMEYMQKYHY 363

Query: 377 SWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIDS 436
           SWTI+TYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYGQA K +KI+ 
Sbjct: 364 SWTIITYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAEKPEKIEG 423

Query: 437 VLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYS 496
           VL  +ENS+ITLDTVF+NCLVDAYGRM CFAEMK VL +MEQ+G +PDK TYR M +AYS
Sbjct: 424 VLRFIENSDITLDTVFFNCLVDAYGRMGCFAEMKGVLILMEQKGYRPDKITYRTMIKAYS 483

Query: 497 DGGMANHAREIHDL 511
             GM  H +E+ DL
Sbjct: 484 SKGMTKHVKELQDL 497

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP424_ARATH4.0e-18365.86Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop... [more]
PP279_ARATH1.2e-10244.52Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN... [more]
PP216_ARATH1.9e-9244.70Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidop... [more]
PP358_ARATH3.3e-4429.94Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidop... [more]
PP362_ARATH3.2e-3930.38Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0K8C6_CUCSA1.0e-25488.34Uncharacterized protein OS=Cucumis sativus GN=Csa_7G342750 PE=4 SV=1[more]
B9SV96_RICCO3.3e-20080.18Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A067LME3_JATCU1.5e-19777.23Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17186 PE=4 SV=1[more]
V4TYK6_9ROSI1.3e-19670.61Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019744mg PE=4 SV=1[more]
A0A0D2QTF2_GOSRA1.3e-19673.38Uncharacterized protein OS=Gossypium raimondii GN=B456_007G155400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G48730.12.3e-18465.86 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53170.13.0e-10443.99 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G06430.11.1e-9344.70 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G39620.11.9e-4529.94 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G02860.11.8e-4030.38 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659116302|ref|XP_008458009.1|2.7e-25689.42PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|778726970|ref|XP_004139628.2|1.5e-25488.34PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|743833914|ref|XP_011024658.1|3.9e-20273.60PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|1000945412|ref|XP_015581300.1|6.6e-20278.62PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|223530592|gb|EEF32469.1|4.7e-20080.18pentatricopeptide repeat-containing protein, putative [Ricinus communis][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG12g11100.1Cp4.1LG12g11100.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 174..199
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 272..320
score: 3.2E-9coord: 205..249
score: 2.1E-13coord: 449..495
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 334..391
score: 4.9
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 452..484
score: 4.5E-7coord: 347..377
score: 3.5E-5coord: 275..310
score: 2.4E-9coord: 381..415
score: 2.7E-9coord: 171..202
score: 3.7E-6coord: 205..239
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 344..378
score: 8.364coord: 449..483
score: 10.83coord: 238..272
score: 10.479coord: 202..232
score: 9.986coord: 167..201
score: 9.317coord: 414..448
score: 6.95coord: 273..308
score: 10.797coord: 484..518
score: 6.818coord: 379..413
score: 11.772coord: 309..343
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 147..282
score: 1.
NoneNo IPR availableunknownCoilCoilcoord: 78..98
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 47..511
score: 1.6E
NoneNo IPR availablePANTHERPTHR24015:SF664SUBFAMILY NOT NAMEDcoord: 47..511
score: 1.6E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG12g11100CmaCh17G012570Cucurbita maxima (Rimu)cmacpeB371
Cp4.1LG12g11100Cla014951Watermelon (97103) v1cpewmB135
Cp4.1LG12g11100MELO3C021027Melon (DHL92) v3.5.1cpemeB119
Cp4.1LG12g11100ClCG09G008970Watermelon (Charleston Gray)cpewcgB129
Cp4.1LG12g11100MELO3C021027.2Melon (DHL92) v3.6.1cpemedB146
Cp4.1LG12g11100Carg25528Silver-seed gourdcarcpeB0597
The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG12g11100Cp4.1LG17g10180Cucurbita pepo (Zucchini)cpecpeB165