Cp4.1LG09g10520 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g10520
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Plastid transcriptionally active 2 isoform 2
Location: Cp4.1LG09 : 9183071 .. 9188935 (+)
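
The Location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention, which this record does not state explicitly); the helper name parse_location is hypothetical.

```python
import re

# Hypothetical helper; assumes 1-based, inclusive coordinates (GFF3-style).
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its parts and add the span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Cp4.1LG09 : 9183071 .. 9188935 (+)")
print(loc["length"])  # 5865 under the inclusive-coordinate assumption
```

If the coordinates were instead 0-based and half-open, the length would be end - start, so the convention is worth confirming against the source annotation file.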
The following sequences are available for this feature:

Gene sequence (with intron)

Feature types: CDS, three_prime_UTR
ATGTTCGCCGTAGATTTTTTTTCAGATGACTGGAGATTGTCCTCCGATGCGGGGAAGTGCCGGGCGAAGCCGAAGGATCTTGTTCTTGGGAATATGTCGGTAATTTTTTAGACGGGCGAGTACAGCTATGATGTTGAAACCGCCACGAGGTAGCATAGCTCGTTGTCTTGATATCTTCAAGAATAGGCTCTCACTCAATGACTTCAGCTTGGATTTTAAGGAATTCGCGGCGTGCGGCGATTGGCAAAGCTCTTTACGCCTCTTTAATACACGCAGCGTCAGATATGGTGCAAGCCGAACGAGTACATCTATGCCATCGTGATCAGCTTTCTCGGGCGCAAAGGATTGCTAGAGAACTGTAGCGAGATATTCGATGAAATGGCGAGCCAGGGCGTGATACGTAGCATGTTTTCTTATACCGCTTTGATAAATGCCTACGGGTGCAATGGTCAGTACGAAACCTCACTCGCACTTCTTGAAAGGATGAAGAGAGAGAGAGAGAGTGTCGCCTAATATATTGACTTACAATACCGTGATAAATGCATGTGCTAGAGGTGATTTAGATTGCGAGGGACTGTTGGGACTGTTTGCCGAGATGAGGCATGAAGGGGTTCAACCTGATCTGGTTACTTAAAATACTTTGCTTAGTGCATGTGCTGCCCGCGGTTTAGGTGATGAGGCAGTGATGGTCTTCAAAACTATGATAGAGGGAGGGATCGTCCCGGAGATAACAACATACAGTTTATAGTTCTAACATTTGAAAAACTGGGTAAGCTCGAGAAAGTTGCTATGTTTTGCTAAAAGAAATGGAGTCCGAAGGTTATATGCCTGACATATCATCCAACAATGTGTGAATAGAGGCATATGAAAATCAGCGTCGATTAAGGAGATCAATGGATGTGTTTAAGCAGATGCAAGCAGCAGGATGCGTGCCAAATTCGTCAACTTACAGCATTCTGTTGAATCTATATGGGAAGCATGGGAGGTATGATGATGTTCGAGAGCTTTTCCTTGAAATGAAAGAGAGTAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGAGAGGGTGGATACTTCAAGGAGGTTGTAGCTTTGTTTCATGACTTTCTGGATGAAAAGATTGACCCAAATATCGAAACATAGTATTTGCTTGTGGAAAGGGAGGGTTGCATGAGGATGCCAAGAAAATTTTACTTCGCATGAATGAGAAAGGAATAGTTCCAAGTTCTAATGCGTATAGTGGACTGTTTGAAGCTTATGGACGTGCTGCATTGTATGATGAAGCTCTTGTTGCATTTAACACAATGAACGAAGTGGGAAATAAGTCAACTGTTGATACCTACAATTCGCTAATTCACTCATTTGCTAGAGGTGGACTGTACATGGAGTTTGAAGCAATCTTGTTGAGAATGAGAGAATCCGGCAATTCAAGGAATGTGAATTCATTTAGTGGTATTGTTGAAGGTTATAGGCAAAGTGGTAAGTTTGAAGAAGCTATAAAGTCCTTTGTTGAGATGGAAAAGCTGAGATGTGAACCCATTGAGAAGGCCCTTGAGGCAGTTTTAGGTGTTTATTGCTTTGCAGGTCTTGTTGATGAGAGCAAGGAGCAGTTTCATGAGATTAAAGCTTCAGGAATATTACCTAGTGTATTATGCTACTGCATGATGCTGGCAGTTTATGCCAAGAATGACCGGTAATAATTACTGTGTGTATGTGTGTGTGTGTGTGGTATTTGTCTATGAAAGTTATGTTCATTATTAACTCATGTTTTCTTAGTTTTATTTCCAATTGATATAGACTGTTCAGGGTACCTTTCTCTTAAACAAATTTTGTGCTCATGATTCAGTCTATCATAATTTTATTCTATACAGAAATCTGTCACGCCTTGAAGGCTGGTTACTTAGAAAATGTGTAGTTATATGAGATTGAGAGATCCCACATCGGTTGGGGAGGAGAACAAAACATTCTTTATAAGAGTGTGGA
AACCTCTCCCTAGCTTACGCGTTTTAAAAACCTTGAGGAAAAGCCCGAAAGGGAAAGTCAAAGAAGACAATATCTACTAGCGGTGGCTTGGGCCGATATAGATACAGAGATGGTTGGAGCATTTTAGAGATTTTGAGGGGGAATTGTTAATGGACGACAGTTAGATAAACATCCTTCACTTTCCCTTTTCTGTTAATGTTGTAAGGTCCTCAGAATAGTCTGTTTTGTCTCGAAAAAATTCTTCTTCCTTAAAATCCCTATATCATGCAACATGATCATGACCTTGCCATGTGCAAAAAAGGAACTCATTGTATGGACCATGAAGTCAATGGTCATTTTATACCAAGTTAGAGTTATAGGAAGAGTTTGTCAGTTGTCATCGGTTGTTAACTCTTACCTTGTAAGAACATCATTTGTTTCCTCATTGTCGACATTTTTCTTAGATGATTTCTCCAGCAATTATCACTAATTAACTATCAGAGTTGGACCATTTTAAGAAATATATGGTTTTATCATTACATCTCATAGTCCCTTCTTTCCAGGTGGGATGATGCCTGTGAATTACTTGATGAGACGATTACAAACAGGGTATCCAGTGTTCATCAAGTCATTGGCCAGATGATCAAGGGAGACTACGATAATGGTTGAATATGTTTTTGACAAACTTAATGCTCAAGGGTGTGGATTGGGAATGAGATTTTACAATACACTGTTAGAAGCACTTTGGTGGCTTGGACAGAAAGGGCGTGCTGCAAGGGTTCTCACTGAAGCAACAAAACGAGGCCTTTTGCCTGAAATTTTCCGCAAAAACAATCTCGTGTGGTCTGTGGATGTGCACAGGTAAAATTGGAATCGTCTTTTCTTCCATGCTGGCTGGCTGATACCTGTGCTTGAGTTTATACTAATTCGTATGTCTGGTGAAATTATTACTCTAAATCATTCCCCTCTAAATCTTCCAGAAGCAAAGATATCCAGATCTAGTATAGATCACAAGTACAAAGATGTTGATTCTAAATCAAGTTATAGGACAGGAACTGAGCTCAGACCTCAACCGTCCGAAGGAAATTATGATGTCCCACATTGGTTGGGAGAAGAACAAAACACCCTTTATGAGGGTATGGAAACCTCCCATGGCGGACACGTTTTAAAAACCTTGAGAGGAAGTCCGAAAGGGAAAGTCCAAAGAGGACAATATCTGCTAGAAGTGGATTTGGGCCATTACAAATGGTATTAGAGCTAGACACTGGATTATGTGCCAGCGAGGAGGCTGTTCCCCAAAGGGGTAGATACGAGACGGTGTCCTAGTAAGGACGGGCCCCAAAGGGGGGTGGATTAGACGGGGGTCCCACATCGATTGGAGAAAGGAAAGAGTGCCAGTGAGGACGCTGGCCCCGAAGGGGGGTGGATTGTGATGTCCCACATTGGTTGGGGAGGAGAACAAAACACATTTTATAAGGGTATGGAAACCTCCCCTAGTATACGTGTTTTAAAAAGCCTTGAGGGGAGGTTTGAACGAGAAAGTCCAAAAAGAACAATGTCTGCTAGCGGTGGATCTAGACCGTTACAGAAATACAAGGCTTTGTTGAGGATTGTTGGGAGAGGAGTCCCACGTCGGCTAATTAAGAGGTTGATCATGAGTTTATAAGTAAGGAATACATCTCGATTGGTATGAGGTCTTTTGGAGAAACCAAAAGTAAAACCACTCTAACTTATGCTCAAAGTGGACAGTATCATACCAATGTGGAAGTCCGTAGTTCCTACCGAGCTGAGCATCTAGTTTTCAAGTCTTGGAATCCATCCCCTCCCATTTTAATTGGTAGGATGTGATCCACAGCGCCAATATCAGCTCTTTTGAGTTTGGAAAGAACTCACCAAATGATTTCATCACACCATTCAGGTGATGGCGGAGTTTTCTTGCAACATTGATCGTGATAATTCAGTGTTGAATTAAGGGAAGAAGATGCATGCATAAGAAAGAGGTTAGTACTTCTGTAGCTGTA
GAATTAATATTCTTTATTGAAAAAAAGCCGTTTATAGGTAATCATTAGACCAACGGTTTAAATCTCTCTGAACTAATCAGGGAGATGCGATGTGAATATTCTTTATTGGAAAAAACCGTTTATAGGTAATCATTAGACCATTGGTTTAAATCTCTCTGAATTAATCAGGGAGATGCGATGTGAGATCCCACATCGATCAGAGAAAGGAACGAGCGCCAGCAAAGACGCTGGCTCCAAATGGGGGTGGATTATAAGATCTTATATTGGTTAGAAAGGAAGACAAAACATTATTTATAAGAGTTTAAAAACCTCTCTCTAGCAGATGTGAAAACCCAAAAGAGAAAATAAAAAATAGAAAACAGAAAGAGAACAATATCTGCTAGCCCCGTTTGTTTCTATGTATTTATTTATTTATTTATCGATATTAGAAAGAAAAACTATTCTTTTTTTTACATGAATATTACATAACTCTTGGGCGTGTATCCGTAGAGAGATTTATTGATTGGAAAGGTCACATCTGTGGTTTCTTATATTCTTTTTTAAGATATGAAATGGAAGAAAGAAGAAAGTAGAAAAGATCCTTAAAAAAAGTACTCTTTGATGGGATGTTCCTCTTTCACGCTTGCCCACACACACATTTCCTGAAGATGCCTTATTCATTTTCTCTCCTATCTTTTTTTTCTCCTTAAGATATGATTTTGACTACTGCTAGGAAAATGAAAAACTTTTGAATGAAAAATTATTAACCACACTTTTTAATCTAAGAGTCGAAGCCTACTACTAGCCGATATTGTCTGTTTAGTCTGTTATATATCGTCTTCAGCCTCATGATTTTAAAATGTGTTTGCTAGAGAGAGGTTTTTATGCCGTTATAAAGAATACTTTGTTCCCTTCTCCAATTGAGTAAACTTTTGTTGATCCATTAATATATAATTTTTTTTGTCTTTCAATAATTTTTTTGAAGTTGCAGAGGGAAATCAAACTAACGGGAGAGTTTCAAATGGTGTGAGTTGCTGGAGAAGATGAAGAGTTTTTGATTGATTCATTAAATGGTGACAGGTAATTATACTCTCAAATTGACATTTTTCATCGGTGAATTTGGTTCATTTTACATAGATAAGTTAAGATTACTCATTTTTTTGCTCAACAAATTGATCTTTAGATTCTAAAATACTCATATTTTTAAATATGCTATGACAATAGAACCCGCCCAAAGCAAAGTCATGAGAGTTTATGCTCAAAGTAGACAATATCATACACGGTGGAGAGTTGTGTTCATCTAATATGGTATAAGAGTCATGCCCTAAATTTAGTCATGTCAATAGAATCCTCAAATGTCGAACAAAGGACTCCAAAAGGAGGCTCCTCGAAGGCATCGTAAAAAATGACTAAAACTCCAAAAGAAAAGGAGTCGAGCCTCGATTAAGGGGAGGCGTACTTTGTTCGAGGGTAGGTGTTACATCAACTAATTTAGAGAATGATCATGGGTTTATAAGTAAGGAATATATCTCTGTTTTTTCGGGGAAGTCCAAAGCAAAGCCATAAGAGCTTATGTTCAAAGTAGACAATATCATACTATTGTAGAGAGTCGTGTTCATCTAACACATCTAATTACAATTATAAGTGCACAATAGAAGTTATCCACGACGTCACTTTTATCAATTTGTTTATTTGTATTTGTATTTACAATTATTGATCCCATTAAAAATTTCTTAGGTCTATATGCACATTTTAGTACAGATGAATTAATGTTGCATATTCTACTACACATATATTTAATTCAATACAATGTAATCATGTATGGATTATTGATCCATAAACTCTTTTCCTAAGTCTGCACATTTTAGTCGTTGAATTTGCAATAG

mRNA sequence

ATGTTCGCCGTAGATTTTTTTTCAGATGACTGGAGATTGTCCTCCGATGCGGGGAAGTGCCGGGCGAAGCCGAAGGATCTTGTTCTTGGGAATATGTCGCGTCAGATATGGTGCAAGCCGAACGAGTACATCTATGCCATCGTGATCAGCTTTCTCGGGCGCAAAGGATTGCTAGAGAACTGTAGCGAGATATTCGATGAAATGGCGAGCCAGGGCGTGATACGTAGCATGTTTTCTTATACCGCTTTGATAAATGCCTACGGGTGCAATGGTCAAGGTGATTTAGATTGCGAGGGACTGTTGGGACTGTTTGCCGAGATGAGGCATGAAGGGGTTCAACCTGATCTGATGCAAGCAGCAGGATGCGTGCCAAATTCGTCAACTTACAGCATTCTGTTGAATCTATATGGGAAGCATGGGAGGTATGATGATGTTCGAGAGCTTTTCCTTGAAATGAAAGAGAGTAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGAGAGGGTGGATACTTCAAGGAGGGAGGGTTGCATGAGGATGCCAAGAAAATTTTACTTCGCATGAATGAGAAAGGAATAGTTCCAAGTTCTAATGCGTATAGTGGACTGTTTGAAGCTTATGGACGTGCTGCATTGTATGATGAAGCTCTTGTTGCATTTAACACAATGAACGAAGTGGGAAATAAGTCAACTGTTGATACCTACAATTCGCTAATTCACTCATTTGCTAGAGGTGGACTGTACATGGAGTTTGAAGCAATCTTGTTGAGAATGAGAGAATCCGGCAATTCAAGGAATGTGAATTCATTTAGTGGTATTGTTGAAGGTTATAGGCAAAGTGGTAAGTTTGAAGAAGCTATAAAGTCCTTTGTTGAGATGGAAAAGCTGAGATGTGAACCCATTGAGAAGGCCCTTGAGGCAGTTTTAGGTGTTTATTGCTTTGCAGGTCTTGTTGATGAGAGCAAGGAGCAGTTTCATGAGATTAAAGCTTCAGGAATATTACCTAGTGTATTATGCTACTGCATGATGCTGGCAGTTTATGCCAAGAATGACCGGTGGGATGATGCCTGTGAATTACTTGATGAGACGATTACAAACAGGGTATCCAGTGTTCATCAAGTCATTGGCCAGATGATCAAGGGAGACTACGATAATGGTTGAATATGTTTTTGACAAACTTAATGCTCAAGGGTGTGGATTGGGAATGAGATTTTACAATACACTGTTAGAAGCACTTTGGTGGCTTGGACAGAAAGGGCGTGCTGCAAGGGTTCTCACTGAAGCAACAAAACGAGGCCTTTTGCCTGAAATTTTCCGCAAAAACAATCTCGTGTGGTCTGTGGATGTGCACAGGTAAAATTGGAATCGTCTTTTCTTCCATGCTGGCTGGCTGATACCTGTGCTTGAGTTTATACTAATTCGTATGTCTGGTGAAATTATTACTCTAAATCATTCCCCTCTAAATCTTCCAGAAGCAAAGATATCCAGATCTAGTATAGATCACAAGTACAAAGATGTTGATTCTAAATCAAGTTATAGGACAGGAACTGAGCTCAGACCTCAACCGTCCGAAGGAAATTATGATGTCCCACATTGGTTGGGAGAAGAACAAAACACCCTTTATGAGGGTATGGAAACCTCCCATGGCGGACACGTTTTAAAAACCTTGAGAGGAAGTCCGAAAGGGAAAGTCCAAAGAGGACAATATCTGCTAGAAGTGGATTTGGGCCATTACAAATGGTATTAGAGCTAGACACTGGATTATGTGCCAGCGAGGAGGCTGTTCCCCAAAGGGGTAGATACGAGACGGTGTCCTAGTAAGGACGGGCCCCAAAGGGGGGTGGATTAGACGGGGGTCCCACATCGATTGGAGAAAGGAAAGAGTGCCAGTGAGGACGCTGGCCCCGAAGGGGGGTGGATTGTGATGTCCCACATTGGTTGGGGAGGAGAACAAAACACATTTTATAAGGGTATGGAAACCTCCC
CTAGTATACGTGTTTTAAAAAGCCTTGAGGGGAGGTTTGAACGAGAAAGTCCAAAAAGAACAATGTCTGCTAGCGGTGGATCTAGACCGTTACAGAAATACAAGGCTTTGTTGAGGATTGTTGGGAGAGGAGTCCCACGTCGGCTAATTAAGAGGTTGATCATGAGTTTATAAGTAAGGAATACATCTCGATTGGTATGAGGTCTTTTGGAGAAACCAAAAGTAAAACCACTCTAACTTATGCTCAAAGTGGACAGTATCATACCAATGTGGAAGTCCGTAGTTCCTACCGAGCTGAGCATCTAGTTTTCAAGTCTTGGAATCCATCCCCTCCCATTTTAATTGGTAGGATGTGATCCACAGCGCCAATATCAGCTCTTTTGAGTTTGGAAAGAACTCACCAAATGATTTCATCACACCATTCAGGTGATGGCGGAGTTTTCTTGCAACATTGATCGTGATAATTCAGTGTTGAATTAAGGGAAGAAGATGCATGCATAAGAAAGAGAGGGAAATCAAACTAACGGGAGAGTTTCAAATGGTGTGAGTTGCTGGAGAAGATGAAGAGTTTTTGATTGATTCATTAAATGGTGACAGGTAATTATACTCTCAAATTGACATTTTTCATCGGTGAATTTGGTTCATTTTACATAGATAAGTTAAGATTACTCATTTTTTTGCTCAACAAATTGATCTTTAGATTCTAAAATACTCATATTTTTAAATATGCTATGACAATAGAACCCGCCCAAAGCAAAGTCATGAGAGTTTATGCTCAAAGTAGACAATATCATACACGGTGGAGAGTTGTGTTCATCTAATATGGTATAAGAGTCATGCCCTAAATTTAGTCATGTCAATAGAATCCTCAAATGTCGAACAAAGGACTCCAAAAGGAGGCTCCTCGAAGGCATCGTAAAAAATGACTAAAACTCCAAAAGAAAAGGAGTCGAGCCTCGATTAAGGGGAGGCGTACTTTGTTCGAGGGTAGGTGTTACATCAACTAATTTAGAGAATGATCATGGGTTTATAAGTAAGGAATATATCTCTGTTTTTTCGGGGAAGTCCAAAGCAAAGCCATAAGAGCTTATGTTCAAAGTAGACAATATCATACTATTGTAGAGAGTCGTGTTCATCTAACACATCTAATTACAATTATAAGTGCACAATAGAAGTTATCCACGACGTCACTTTTATCAATTTGTTTATTTGTATTTGTATTTACAATTATTGATCCCATTAAAAATTTCTTAGGTCTATATGCACATTTTAGTACAGATGAATTAATGTTGCATATTCTACTACACATATATTTAATTCAATACAATGTAATCATGTATGGATTATTGATCCATAAACTCTTTTCCTAAGTCTGCACATTTTAGTCGTTGAATTTGCAATAG

Coding sequence (CDS)

ATGTTCGCCGTAGATTTTTTTTCAGATGACTGGAGATTGTCCTCCGATGCGGGGAAGTGCCGGGCGAAGCCGAAGGATCTTGTTCTTGGGAATATGTCGCGTCAGATATGGTGCAAGCCGAACGAGTACATCTATGCCATCGTGATCAGCTTTCTCGGGCGCAAAGGATTGCTAGAGAACTGTAGCGAGATATTCGATGAAATGGCGAGCCAGGGCGTGATACGTAGCATGTTTTCTTATACCGCTTTGATAAATGCCTACGGGTGCAATGGTCAAGGTGATTTAGATTGCGAGGGACTGTTGGGACTGTTTGCCGAGATGAGGCATGAAGGGGTTCAACCTGATCTGATGCAAGCAGCAGGATGCGTGCCAAATTCGTCAACTTACAGCATTCTGTTGAATCTATATGGGAAGCATGGGAGGTATGATGATGTTCGAGAGCTTTTCCTTGAAATGAAAGAGAGTAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGAGAGGGTGGATACTTCAAGGAGGGAGGGTTGCATGAGGATGCCAAGAAAATTTTACTTCGCATGAATGAGAAAGGAATAGTTCCAAGTTCTAATGCGTATAGTGGACTGTTTGAAGCTTATGGACGTGCTGCATTGTATGATGAAGCTCTTGTTGCATTTAACACAATGAACGAAGTGGGAAATAAGTCAACTGTTGATACCTACAATTCGCTAATTCACTCATTTGCTAGAGGTGGACTGTACATGGAGTTTGAAGCAATCTTGTTGAGAATGAGAGAATCCGGCAATTCAAGGAATGTGAATTCATTTAGTGGTATTGTTGAAGGTTATAGGCAAAGTGGTAAGTTTGAAGAAGCTATAAAGTCCTTTGTTGAGATGGAAAAGCTGAGATGTGAACCCATTGAGAAGGCCCTTGAGGCAGTTTTAGGTGTTTATTGCTTTGCAGGTCTTGTTGATGAGAGCAAGGAGCAGTTTCATGAGATTAAAGCTTCAGGAATATTACCTAGTGTATTATGCTACTGCATGATGCTGGCAGTTTATGCCAAGAATGACCGGTGGGATGATGCCTGTGAATTACTTGATGAGACGATTACAAACAGGGTATCCAGTGTTCATCAAGTCATTGGCCAGATGATCAAGGGAGACTACGATAATGGTTGA

Protein sequence

MFAVDFFSDDWRLSSDAGKCRAKPKDLVLGNMSRQIWCKPNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEGLLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYDEALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIVEGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGILPSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDNG
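
The protein sequence above should be recoverable from the CDS by translation. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table) and uses a hypothetical helper, translate, written without external libraries.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First twelve codons of the CDS listed above, spaced for readability.
head = "ATG TTC GCC GTA GAT TTT TTT TCA GAT GAC TGG AGA".replace(" ", "")
print(translate(head))  # MFAVDFFSDDWR
```

Passing the full CDS string to translate should reproduce the protein sequence if the standard-code assumption holds.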
BLAST of Cp4.1LG09g10520 vs. Swiss-Prot
Match: PP124_ARATH (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 2.2e-93
Identity = 172/351 (49.00%), Positives = 243/351 (69.23%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y +++    + G ++    +F +M + G   +  +Y+ L+N +G +G+ D     
Sbjct: 315 PDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYD----D 374

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF EM+     PD          ++TY+IL+ ++G+ G + +V  LF +M E + EP
Sbjct: 375 VRQLFLEMKSSNTDPD----------AATYNILIEVFGEGGYFKEVVTLFHDMVEENIEP 434

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           D  TY  +I   G      +GGLHEDA+KIL  M    IVPSS AY+G+ EA+G+AALY+
Sbjct: 435 DMETYEGIIFACG------KGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYE 494

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTM+EVG+  +++T++SL++SFARGGL  E EAIL R+ +SG  RN ++F+  +
Sbjct: 495 EALVAFNTMHEVGSNPSIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQI 554

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           E Y+Q GKFEEA+K++V+MEK RC+P E+ LEAVL VY FA LVDE +EQF E+KAS IL
Sbjct: 555 EAYKQGGKFEEAVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDIL 614

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PS++CYCMMLAVY K +RWDD  ELL+E ++NRVS++HQVIGQMIKGDYD+
Sbjct: 615 PSIMCYCMMLAVYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDD 645
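
The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal sketch that re-derives them from such a line is shown here; the function name hsp_percentages is hypothetical, and the pattern deliberately tolerates spelling variants of the Positives label.

```python
import re

# Matches 'Identity = a/b ... Positives = c/d', tolerating label misspellings.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*ives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

line = "Identity = 172/351 (49.00%), Positives = 243/351 (69.23%), Query Frame = 1"
print(hsp_percentages(line))  # (49.0, 69.23)
```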

BLAST of Cp4.1LG09g10520 vs. Swiss-Prot
Match: PP178_ARATH (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.7e-35
Identity = 102/360 (28.33%), Positives = 171/360 (47.50%), Query Frame = 1

Query: 43  YIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEGLLG 102
           Y ++ +IS  GR GL E    +F+ M   G+  ++ +Y A+I+A G   +G ++ + +  
Sbjct: 269 YAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVIDACG---KGGMEFKQVAK 328

Query: 103 LFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDAT 162
            F EM+  GVQPD +          T++ LL +  + G ++  R LF EM     E D  
Sbjct: 329 FFDEMQRNGVQPDRI----------TFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVF 388

Query: 163 TYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYDEAL 222
           +YN L+    +GG        + A +IL +M  K I+P+  +YS + + + +A  +DEAL
Sbjct: 389 SYNTLLDAICKGGQM------DLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEAL 448

Query: 223 VAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIVEGY 282
             F  M  +G      +YN+L+  + + G   E   IL  M   G  ++V +++ ++ GY
Sbjct: 449 NLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGY 508

Query: 283 RQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGILPSV 342
            + GK++E  K F EM++    P       ++  Y   GL  E+ E F E K++G+   V
Sbjct: 509 GKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADV 568

Query: 343 LCYCMMLAVYAKNDRWDDACELLDETITNRVS-------SVHQVIGQMI----KGDYDNG 392
           + Y  ++    KN     A  L+DE     +S       S+    G+        DY NG
Sbjct: 569 VLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSNG 609

BLAST of Cp4.1LG09g10520 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 3.3e-33
Identity = 81/327 (24.77%), Positives = 152/327 (46.48%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y  +IS   R G+L+   E+ ++MA +G    +F+YT L++ +   G+     E 
Sbjct: 347 PSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGK----VES 406

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
            + +F EMR+          AGC PN  T++  + +YG  G++ ++ ++F E+      P
Sbjct: 407 AMSIFEEMRN----------AGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSP 466

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           D  T+N L+ VFG+ G      +  +   +   M   G VP    ++ L  AY R   ++
Sbjct: 467 DIVTWNTLLAVFGQNG------MDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFE 526

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           +A+  +  M + G    + TYN+++ + ARGG++ + E +L  M +     N  ++  ++
Sbjct: 527 QAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLL 586

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
             Y    +         E+     EP    L+ ++ V     L+ E++  F E+K  G  
Sbjct: 587 HAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFS 646

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLD 367
           P +     M+++Y +      A  +LD
Sbjct: 647 PDITTLNSMVSIYGRRQMVAKANGVLD 653

BLAST of Cp4.1LG09g10520 vs. Swiss-Prot
Match: PP413_ARATH (Pentatricopeptide repeat-containing protein At5g42310, mitochondrial OS=Arabidopsis thaliana GN=At5g42310 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 7.7e-30
Identity = 82/276 (29.71%), Positives = 136/276 (49.28%), Query Frame = 1

Query: 34  RQIWCKPNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQG 93
           + I  KP+   Y +VI   G+   L++    FD M S+G+     ++  LI         
Sbjct: 436 KSIGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWNTLI--------- 495

Query: 94  DLDCEGLLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMK 153
           D  C+    + AE   E      M+  GC+P ++TY+I++N YG   R+DD++ L  +MK
Sbjct: 496 DCHCKHGRHIVAEEMFEA-----MERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMK 555

Query: 154 ESSAEPDATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYG 213
                P+  T+  L+ V+G+ G F       DA + L  M   G+ PSS  Y+ L  AY 
Sbjct: 556 SQGILPNVVTHTTLVDVYGKSGRFN------DAIECLEEMKSVGLKPSSTMYNALINAYA 615

Query: 214 RAALYDEALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVN 273
           +  L ++A+ AF  M   G K ++   NSLI++F       E  A+L  M+E+G   +V 
Sbjct: 616 QRGLSEQAVNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVKPDVV 675

Query: 274 SFSGIVEGYRQSGKFEEAIKSFVEMEKLRCEPIEKA 310
           +++ +++   +  KF++    + EM    C+P  KA
Sbjct: 676 TYTTLMKALIRVDKFQKVPVVYEEMIMSGCKPDRKA 691

BLAST of Cp4.1LG09g10520 vs. Swiss-Prot
Match: PP213_ARATH (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 2.9e-29
Identity = 87/334 (26.05%), Positives = 154/334 (46.11%), Query Frame = 1

Query: 38  CKPNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDC 97
           C+P    Y I+I     +G ++   ++ DEM S+G+   MF+Y  +I    C        
Sbjct: 224 CQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM-CK------- 283

Query: 98  EGLLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSA 157
           EG++    EM         ++  GC P+  +Y+ILL      G++++  +L  +M     
Sbjct: 284 EGMVDRAFEMVRN------LELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKC 343

Query: 158 EPDATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAAL 217
           +P+  TY+ILI      G        E+A  +L  M EKG+ P + +Y  L  A+ R   
Sbjct: 344 DPNVVTYSILITTLCRDGKI------EEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGR 403

Query: 218 YDEALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSG 277
            D A+    TM   G    +  YN+++ +  + G   +   I  ++ E G S N +S++ 
Sbjct: 404 LDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNT 463

Query: 278 IVEGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASG 337
           +      SG    A+   +EM     +P E    +++   C  G+VDE+ E   ++++  
Sbjct: 464 MFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCE 523

Query: 338 ILPSVLCYCMMLAVYAKNDRWDDACELLDETITN 372
             PSV+ Y ++L  + K  R +DA  +L+  + N
Sbjct: 524 FHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGN 537

BLAST of Cp4.1LG09g10520 vs. TrEMBL
Match: A0A0A0LW62_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G045740 PE=4 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 1.5e-112
Identity = 211/351 (60.11%), Positives = 261/351 (74.36%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y ++I    + G ++   ++F +M + G + +  +Y+ L+N YG +G+ D     
Sbjct: 322 PDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYD----D 381

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF +M+    +PD          ++TY+IL+ ++G+ G + +V  LF ++ + + +P
Sbjct: 382 VRELFLQMKESSAEPD----------ATTYNILIRVFGEGGYFKEVVTLFHDLVDENIDP 441

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  L+   G+GG      LHEDAKKIL  MN KGIVPSS AYSGL EAYG+AALYD
Sbjct: 442 NMETYEGLVFACGKGG------LHEDAKKILFHMNGKGIVPSSKAYSGLIEAYGQAALYD 501

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTMNEVG+KST+DTYNSLIH+FARGGLY EFEAIL RMRE G SRN  SFSGI+
Sbjct: 502 EALVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGII 561

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           EGYRQSG++EEAIK+FVEMEK+RCE  E+ LE VLGVYCFAGLVDESKEQF EIKASGIL
Sbjct: 562 EGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEGVLGVYCFAGLVDESKEQFIEIKASGIL 621

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSVLCYCMMLAVYAKN RWDDA ELLDE I  RVSS+HQVIGQMIKGDYD+
Sbjct: 622 PSVLCYCMMLAVYAKNGRWDDASELLDEMIKTRVSSIHQVIGQMIKGDYDD 652

BLAST of Cp4.1LG09g10520 vs. TrEMBL
Match: F6GSY1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g07500 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 3.0e-105
Identity = 196/351 (55.84%), Positives = 258/351 (73.50%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y +++    + G ++    +F +M   G + +  +Y+ L+N YG +G+ D     
Sbjct: 320 PDITSYNVLLEAHAQSGSIKEAMGVFRQMQGAGCVPNAATYSILLNLYGRHGRYD----D 379

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF EM+    +P          N++TY+IL+N++G+ G + +V  LF +M E + EP
Sbjct: 380 VRDLFLEMKVSNTEP----------NAATYNILINVFGEGGYFKEVVTLFHDMVEENVEP 439

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  LI   G+GG      LHEDAKKILL MNEKG+VPSS AY+G+ EAYG+AALY+
Sbjct: 440 NMETYEGLIFACGKGG------LHEDAKKILLHMNEKGVVPSSKAYTGVIEAYGQAALYE 499

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTMNEVG+K TV+TYNSLI  FA+GGLY E EAILL+M +SG +RN ++F+G++
Sbjct: 500 EALVAFNTMNEVGSKPTVETYNSLIQMFAKGGLYKESEAILLKMGQSGVARNRDTFNGVI 559

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           E +RQ G+FEEAIK++VEMEK RC+P E+ LEAVL VYCFAGLV+ES+EQF EIKA GIL
Sbjct: 560 EAFRQGGQFEEAIKAYVEMEKARCDPDEQTLEAVLSVYCFAGLVEESEEQFGEIKALGIL 619

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSV+CYCMMLAVYAK DRWDDA +LLDE  TNRVS++HQVIGQMI+GDYD+
Sbjct: 620 PSVMCYCMMLAVYAKADRWDDAHQLLDEMFTNRVSNIHQVIGQMIRGDYDD 650

BLAST of Cp4.1LG09g10520 vs. TrEMBL
Match: V4SYF8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10018817mg PE=4 SV=1)

HSP 1 Score: 384.8 bits (987), Expect = 1.3e-103
Identity = 191/351 (54.42%), Positives = 257/351 (73.22%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y +++    + G ++   ++F +M + G + +  +Y+ L+N YG NG+ D     
Sbjct: 321 PDVTCYNVLLEAHAKMGSIKEAMDVFRQMQAAGSVANATTYSILLNLYGRNGRYD----D 380

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF EM+          A+   PN++TY+IL+ ++G+ G + +V  LF +M E + EP
Sbjct: 381 VRELFLEMK----------ASNTEPNAATYNILIQVFGEGGYFKEVVTLFHDMVEENVEP 440

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  LI   G      +GGLHED KKILL MNE+G VPSS AY+G+ EAYG AALY+
Sbjct: 441 NMETYEGLIFACG------KGGLHEDVKKILLYMNERGTVPSSKAYTGVIEAYGLAALYE 500

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTMNEV +K T++TYNSL+H+FARGGLY E +AIL RM ESG +RN +SF+ ++
Sbjct: 501 EALVAFNTMNEVESKPTIETYNSLLHTFARGGLYKECQAILSRMSESGVARNSDSFNAVI 560

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           E +RQ G+FEEAIK++VEMEK+RC+P E+ LEAVL VYCFAGLVDESKEQF EIK+SGIL
Sbjct: 561 EAFRQGGRFEEAIKAYVEMEKVRCDPNERTLEAVLSVYCFAGLVDESKEQFQEIKSSGIL 620

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSV+CYCM+LAVYAK++RWDDA  LLDE  TNR+S++HQV GQMIKG++D+
Sbjct: 621 PSVMCYCMLLAVYAKSNRWDDAYGLLDEMHTNRISNIHQVTGQMIKGEFDD 651

BLAST of Cp4.1LG09g10520 vs. TrEMBL
Match: A0A067K823_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11544 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 4.8e-103
Identity = 191/346 (55.20%), Positives = 257/346 (74.28%), Query Frame = 1

Query: 45  YAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEGLLGLF 104
           Y +++    RKG +++   +F +M   G + +  +Y+ L+N YG +G+ D     +  LF
Sbjct: 325 YNVLLEAYARKGNIKDAMGVFRQMQEAGCVPNAVTYSILLNLYGRHGRYD----DVRELF 384

Query: 105 AEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTY 164
            EM+    +PD+          +TY+IL+ ++G+ G + +V  LF +M E + EP+  TY
Sbjct: 385 LEMKVSNTEPDV----------ATYNILIEVFGEGGYFKEVVTLFHDMVEENVEPNMGTY 444

Query: 165 NILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYDEALVA 224
             LI   G+GG      LHEDAKKILL M+E+G+VPSS AY+ + EAYG+AALYDEALV 
Sbjct: 445 EGLIYACGKGG------LHEDAKKILLHMDEQGVVPSSKAYTSVIEAYGQAALYDEALVM 504

Query: 225 FNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIVEGYRQ 284
           FNTMNE+G+K TVDTYNSLI+ FARGGLY E EAIL +M ESG +++ NSF+G++EGY+Q
Sbjct: 505 FNTMNEMGSKPTVDTYNSLIYMFARGGLYKESEAILWKMGESGVAQDRNSFNGLIEGYKQ 564

Query: 285 SGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGILPSVLC 344
            G+FEEAIK++VEMEK R EP E++LEAVL VYC AGL+DES+EQF EI+ASGILPSV+C
Sbjct: 565 GGQFEEAIKAYVEMEKARFEPDERSLEAVLSVYCAAGLIDESEEQFREIRASGILPSVMC 624

Query: 345 YCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           YCMMLAVYAK++RW++  E+LDE +TNRVS++HQVIGQMIKGDYD+
Sbjct: 625 YCMMLAVYAKSNRWNEVYEVLDEMVTNRVSNIHQVIGQMIKGDYDD 650

BLAST of Cp4.1LG09g10520 vs. TrEMBL
Match: A0A061FUI6_THECC (Plastid transcriptionally active 2 isoform 3 OS=Theobroma cacao GN=TCM_011951 PE=4 SV=1)

HSP 1 Score: 374.4 bits (960), Expect = 1.7e-100
Identity = 186/351 (52.99%), Positives = 250/351 (71.23%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y +++    + G ++    +F +M   G   +  +Y+ L+N YG NG+ D     
Sbjct: 310 PDIMSYNVLLEAYAKSGSIKEAMGVFKQMQVAGCAPNATTYSILLNLYGRNGRYD----D 369

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF EM+    +PD          ++TY+IL+ ++G+ G + +V  LF +M E + EP
Sbjct: 370 VRELFLEMKESNTEPD----------AATYNILIQVFGEGGYFKEVVTLFHDMVEENIEP 429

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY+ LI   G+GG      LHEDAKKILL MNEK IVPSS AY+G+ EAYG+AALY+
Sbjct: 430 NVKTYDGLIFACGKGG------LHEDAKKILLHMNEKCIVPSSRAYTGVIEAYGQAALYE 489

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           E LVAFNTMNEV +  T++TYNSL+ +FARGGLY E  AIL RM E+G ++N +SF+ ++
Sbjct: 490 EVLVAFNTMNEVESNPTIETYNSLLQTFARGGLYKEANAILSRMNETGVAKNRDSFNALI 549

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           E +RQ G+FE+AIK++VEMEK RC+P E+ LEAVL VYCFAGLVDES EQF EIKA G+L
Sbjct: 550 EAFRQGGQFEDAIKAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESNEQFQEIKALGVL 609

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSV+CYCMMLAVYAK DRWDDA +L DE +TN+VS++HQVIG+MI+GDYD+
Sbjct: 610 PSVMCYCMMLAVYAKCDRWDDAYQLFDEMLTNKVSNIHQVIGKMIRGDYDD 640

BLAST of Cp4.1LG09g10520 vs. TAIR10
Match: AT1G74850.1 (AT1G74850.1 plastid transcriptionally active 2)

HSP 1 Score: 344.0 bits (881), Expect = 1.2e-94
Identity = 172/351 (49.00%), Positives = 243/351 (69.23%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y +++    + G ++    +F +M + G   +  +Y+ L+N +G +G+ D     
Sbjct: 315 PDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYD----D 374

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF EM+     PD          ++TY+IL+ ++G+ G + +V  LF +M E + EP
Sbjct: 375 VRQLFLEMKSSNTDPD----------AATYNILIEVFGEGGYFKEVVTLFHDMVEENIEP 434

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           D  TY  +I   G      +GGLHEDA+KIL  M    IVPSS AY+G+ EA+G+AALY+
Sbjct: 435 DMETYEGIIFACG------KGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYE 494

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTM+EVG+  +++T++SL++SFARGGL  E EAIL R+ +SG  RN ++F+  +
Sbjct: 495 EALVAFNTMHEVGSNPSIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQI 554

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           E Y+Q GKFEEA+K++V+MEK RC+P E+ LEAVL VY FA LVDE +EQF E+KAS IL
Sbjct: 555 EAYKQGGKFEEAVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDIL 614

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PS++CYCMMLAVY K +RWDD  ELL+E ++NRVS++HQVIGQMIKGDYD+
Sbjct: 615 PSIMCYCMMLAVYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDD 645
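
The Swiss-Prot and TAIR10 hits to PTAC2 (At1g74850) report the same bit score (344.0) but different E-values (2.2e-93 versus 1.2e-94). Under the standard Karlin-Altschul relation E = m * n * 2^(-S'), where S' is the bit score and m * n is the search space, that difference would come from database size alone. The sketch below is a rough illustration under that assumption only; real BLAST runs also apply effective-length and composition corrections, and the function name implied_search_space is hypothetical.

```python
def implied_search_space(evalue, bit_score):
    """Invert E = m * n * 2**(-S') to estimate the search space m * n."""
    return evalue * 2.0 ** bit_score

# Values taken from the two HSP 1 lines for the PTAC2 hit in this record.
swissprot_space = implied_search_space(2.2e-93, 344.0)
tair10_space = implied_search_space(1.2e-94, 344.0)

# Ratio of implied search spaces (Swiss-Prot relative to TAIR10).
print(f"{swissprot_space / tair10_space:.1f}")  # 18.3
```

Because the query is the same in both searches, the ratio suggests the Swiss-Prot search space was roughly 18 times larger than the TAIR10 one, which is why E-values from different databases should not be compared directly.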

BLAST of Cp4.1LG09g10520 vs. TAIR10
Match: AT2G31400.1 (AT2G31400.1 genomes uncoupled 1)

HSP 1 Score: 151.0 bits (380), Expect = 1.5e-36
Identity = 102/360 (28.33%), Positives = 171/360 (47.50%), Query Frame = 1

Query: 43  YIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEGLLG 102
           Y ++ +IS  GR GL E    +F+ M   G+  ++ +Y A+I+A G   +G ++ + +  
Sbjct: 269 YAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVIDACG---KGGMEFKQVAK 328

Query: 103 LFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDAT 162
            F EM+  GVQPD +          T++ LL +  + G ++  R LF EM     E D  
Sbjct: 329 FFDEMQRNGVQPDRI----------TFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVF 388

Query: 163 TYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYDEAL 222
           +YN L+    +GG        + A +IL +M  K I+P+  +YS + + + +A  +DEAL
Sbjct: 389 SYNTLLDAICKGGQM------DLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEAL 448

Query: 223 VAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIVEGY 282
             F  M  +G      +YN+L+  + + G   E   IL  M   G  ++V +++ ++ GY
Sbjct: 449 NLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGY 508

Query: 283 RQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGILPSV 342
            + GK++E  K F EM++    P       ++  Y   GL  E+ E F E K++G+   V
Sbjct: 509 GKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADV 568

Query: 343 LCYCMMLAVYAKNDRWDDACELLDETITNRVS-------SVHQVIGQMI----KGDYDNG 392
           + Y  ++    KN     A  L+DE     +S       S+    G+        DY NG
Sbjct: 569 VLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSNG 609

BLAST of Cp4.1LG09g10520 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 144.1 bits (362), Expect = 1.9e-34
Identity = 81/327 (24.77%), Positives = 152/327 (46.48%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y  +IS   R G+L+   E+ ++MA +G    +F+YT L++ +   G+     E 
Sbjct: 347 PSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGK----VES 406

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
            + +F EMR+          AGC PN  T++  + +YG  G++ ++ ++F E+      P
Sbjct: 407 AMSIFEEMRN----------AGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSP 466

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           D  T+N L+ VFG+ G      +  +   +   M   G VP    ++ L  AY R   ++
Sbjct: 467 DIVTWNTLLAVFGQNG------MDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFE 526

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           +A+  +  M + G    + TYN+++ + ARGG++ + E +L  M +     N  ++  ++
Sbjct: 527 QAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLL 586

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
             Y    +         E+     EP    L+ ++ V     L+ E++  F E+K  G  
Sbjct: 587 HAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFS 646

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLD 367
           P +     M+++Y +      A  +LD
Sbjct: 647 PDITTLNSMVSIYGRRQMVAKANGVLD 653

BLAST of Cp4.1LG09g10520 vs. TAIR10
Match: AT5G42310.1 (AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 132.9 bits (333), Expect = 4.3e-31
Identity = 82/276 (29.71%), Positives = 136/276 (49.28%), Query Frame = 1

Query: 34  RQIWCKPNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQG 93
           + I  KP+   Y +VI   G+   L++    FD M S+G+     ++  LI         
Sbjct: 436 KSIGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWNTLI--------- 495

Query: 94  DLDCEGLLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMK 153
           D  C+    + AE   E      M+  GC+P ++TY+I++N YG   R+DD++ L  +MK
Sbjct: 496 DCHCKHGRHIVAEEMFEA-----MERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMK 555

Query: 154 ESSAEPDATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYG 213
                P+  T+  L+ V+G+ G F       DA + L  M   G+ PSS  Y+ L  AY 
Sbjct: 556 SQGILPNVVTHTTLVDVYGKSGRFN------DAIECLEEMKSVGLKPSSTMYNALINAYA 615

Query: 214 RAALYDEALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVN 273
           +  L ++A+ AF  M   G K ++   NSLI++F       E  A+L  M+E+G   +V 
Sbjct: 616 QRGLSEQAVNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVKPDVV 675

Query: 274 SFSGIVEGYRQSGKFEEAIKSFVEMEKLRCEPIEKA 310
           +++ +++   +  KF++    + EM    C+P  KA
Sbjct: 676 TYTTLMKALIRVDKFQKVPVVYEEMIMSGCKPDRKA 691

BLAST of Cp4.1LG09g10520 vs. TAIR10
Match: AT3G04760.1 (AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 131.0 bits (328), Expect = 1.6e-30
Identity = 87/334 (26.05%), Positives = 154/334 (46.11%), Query Frame = 1

Query: 38  CKPNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDC 97
           C+P    Y I+I     +G ++   ++ DEM S+G+   MF+Y  +I    C        
Sbjct: 224 CQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM-CK------- 283

Query: 98  EGLLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSA 157
           EG++    EM         ++  GC P+  +Y+ILL      G++++  +L  +M     
Sbjct: 284 EGMVDRAFEMVRN------LELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKC 343

Query: 158 EPDATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAAL 217
           +P+  TY+ILI      G        E+A  +L  M EKG+ P + +Y  L  A+ R   
Sbjct: 344 DPNVVTYSILITTLCRDGKI------EEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGR 403

Query: 218 YDEALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSG 277
            D A+    TM   G    +  YN+++ +  + G   +   I  ++ E G S N +S++ 
Sbjct: 404 LDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNT 463

Query: 278 IVEGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASG 337
           +      SG    A+   +EM     +P E    +++   C  G+VDE+ E   ++++  
Sbjct: 464 MFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCE 523

Query: 338 ILPSVLCYCMMLAVYAKNDRWDDACELLDETITN 372
             PSV+ Y ++L  + K  R +DA  +L+  + N
Sbjct: 524 FHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGN 537

BLAST of Cp4.1LG09g10520 vs. NCBI nr
Match: gi|659067140|ref|XP_008437858.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 419.5 bits (1077), Expect = 6.6e-114
Identity = 213/351 (60.68%), Positives = 264/351 (75.21%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y ++I    + G ++   ++F +M + G + +  +Y+ L+N YG +G+ D     
Sbjct: 322 PDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYD----D 381

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF +M+    +PD          ++TY+IL+ ++G+ G + +V  LF ++ E + +P
Sbjct: 382 VRELFLQMKESSAEPD----------ATTYNILIRVFGEGGYFKEVVTLFHDLVEENIDP 441

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  L+   G+GG      LHEDAKKIL  MNEKGIVPSS AY+GL EAYG+AALYD
Sbjct: 442 NMETYEGLVFACGKGG------LHEDAKKILFHMNEKGIVPSSKAYTGLIEAYGQAALYD 501

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EA+VAFNTMNEVG+KST+DTYNSLIH+FARGGLY EFEAIL RMRE G SRN  SFSGI+
Sbjct: 502 EAVVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGII 561

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           EGYRQSG++EEAIK+FVEMEK+RCE  E+ LEAVLGVYCFAGLVDESKEQF EIKASGIL
Sbjct: 562 EGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEAVLGVYCFAGLVDESKEQFVEIKASGIL 621

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSVLCYCMMLAVYAKN RWDDA ELLDE I NRVSS+HQVIGQMIKGDYD+
Sbjct: 622 PSVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDD 652

BLAST of Cp4.1LG09g10520 vs. NCBI nr
Match: gi|659067138|ref|XP_008437850.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 419.5 bits (1077), Expect = 6.6e-114
Identity = 213/351 (60.68%), Positives = 264/351 (75.21%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y ++I    + G ++   ++F +M + G + +  +Y+ L+N YG +G+ D     
Sbjct: 322 PDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYD----D 381

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF +M+    +PD          ++TY+IL+ ++G+ G + +V  LF ++ E + +P
Sbjct: 382 VRELFLQMKESSAEPD----------ATTYNILIRVFGEGGYFKEVVTLFHDLVEENIDP 441

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  L+   G+GG      LHEDAKKIL  MNEKGIVPSS AY+GL EAYG+AALYD
Sbjct: 442 NMETYEGLVFACGKGG------LHEDAKKILFHMNEKGIVPSSKAYTGLIEAYGQAALYD 501

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EA+VAFNTMNEVG+KST+DTYNSLIH+FARGGLY EFEAIL RMRE G SRN  SFSGI+
Sbjct: 502 EAVVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGII 561

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           EGYRQSG++EEAIK+FVEMEK+RCE  E+ LEAVLGVYCFAGLVDESKEQF EIKASGIL
Sbjct: 562 EGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEAVLGVYCFAGLVDESKEQFVEIKASGIL 621

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSVLCYCMMLAVYAKN RWDDA ELLDE I NRVSS+HQVIGQMIKGDYD+
Sbjct: 622 PSVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDD 652

BLAST of Cp4.1LG09g10520 vs. NCBI nr
Match: gi|449469490|ref|XP_004152453.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 414.5 bits (1064), Expect = 2.1e-112
Identity = 211/351 (60.11%), Positives = 261/351 (74.36%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y ++I    + G ++   ++F +M + G + +  +Y+ L+N YG +G+ D     
Sbjct: 322 PDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYD----D 381

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF +M+    +PD          ++TY+IL+ ++G+ G + +V  LF ++ + + +P
Sbjct: 382 VRELFLQMKESSAEPD----------ATTYNILIRVFGEGGYFKEVVTLFHDLVDENIDP 441

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  L+   G+GG      LHEDAKKIL  MN KGIVPSS AYSGL EAYG+AALYD
Sbjct: 442 NMETYEGLVFACGKGG------LHEDAKKILFHMNGKGIVPSSKAYSGLIEAYGQAALYD 501

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTMNEVG+KST+DTYNSLIH+FARGGLY EFEAIL RMRE G SRN  SFSGI+
Sbjct: 502 EALVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGII 561

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           EGYRQSG++EEAIK+FVEMEK+RCE  E+ LE VLGVYCFAGLVDESKEQF EIKASGIL
Sbjct: 562 EGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEGVLGVYCFAGLVDESKEQFIEIKASGIL 621

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSVLCYCMMLAVYAKN RWDDA ELLDE I  RVSS+HQVIGQMIKGDYD+
Sbjct: 622 PSVLCYCMMLAVYAKNGRWDDASELLDEMIKTRVSSIHQVIGQMIKGDYDD 652

BLAST of Cp4.1LG09g10520 vs. NCBI nr
Match: gi|778657681|ref|XP_011651334.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 414.5 bits (1064), Expect = 2.1e-112
Identity = 211/351 (60.11%), Positives = 261/351 (74.36%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y ++I    + G ++   ++F +M + G + +  +Y+ L+N YG +G+ D     
Sbjct: 322 PDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYD----D 381

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF +M+    +PD          ++TY+IL+ ++G+ G + +V  LF ++ + + +P
Sbjct: 382 VRELFLQMKESSAEPD----------ATTYNILIRVFGEGGYFKEVVTLFHDLVDENIDP 441

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  L+   G+GG      LHEDAKKIL  MN KGIVPSS AYSGL EAYG+AALYD
Sbjct: 442 NMETYEGLVFACGKGG------LHEDAKKILFHMNGKGIVPSSKAYSGLIEAYGQAALYD 501

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTMNEVG+KST+DTYNSLIH+FARGGLY EFEAIL RMRE G SRN  SFSGI+
Sbjct: 502 EALVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGII 561

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           EGYRQSG++EEAIK+FVEMEK+RCE  E+ LE VLGVYCFAGLVDESKEQF EIKASGIL
Sbjct: 562 EGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEGVLGVYCFAGLVDESKEQFIEIKASGIL 621

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSVLCYCMMLAVYAKN RWDDA ELLDE I  RVSS+HQVIGQMIKGDYD+
Sbjct: 622 PSVLCYCMMLAVYAKNGRWDDASELLDEMIKTRVSSIHQVIGQMIKGDYDD 652

BLAST of Cp4.1LG09g10520 vs. NCBI nr
Match: gi|297733858|emb|CBI15105.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 390.2 bits (1001), Expect = 4.3e-105
Identity = 196/351 (55.84%), Positives = 258/351 (73.50%), Query Frame = 1

Query: 40  PNEYIYAIVISFLGRKGLLENCSEIFDEMASQGVIRSMFSYTALINAYGCNGQGDLDCEG 99
           P+   Y +++    + G ++    +F +M   G + +  +Y+ L+N YG +G+ D     
Sbjct: 78  PDITSYNVLLEAHAQSGSIKEAMGVFRQMQGAGCVPNAATYSILLNLYGRHGRYD----D 137

Query: 100 LLGLFAEMRHEGVQPDLMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEP 159
           +  LF EM+    +P          N++TY+IL+N++G+ G + +V  LF +M E + EP
Sbjct: 138 VRDLFLEMKVSNTEP----------NAATYNILINVFGEGGYFKEVVTLFHDMVEENVEP 197

Query: 160 DATTYNILIRVFGEGGYFKEGGLHEDAKKILLRMNEKGIVPSSNAYSGLFEAYGRAALYD 219
           +  TY  LI   G      +GGLHEDAKKILL MNEKG+VPSS AY+G+ EAYG+AALY+
Sbjct: 198 NMETYEGLIFACG------KGGLHEDAKKILLHMNEKGVVPSSKAYTGVIEAYGQAALYE 257

Query: 220 EALVAFNTMNEVGNKSTVDTYNSLIHSFARGGLYMEFEAILLRMRESGNSRNVNSFSGIV 279
           EALVAFNTMNEVG+K TV+TYNSLI  FA+GGLY E EAILL+M +SG +RN ++F+G++
Sbjct: 258 EALVAFNTMNEVGSKPTVETYNSLIQMFAKGGLYKESEAILLKMGQSGVARNRDTFNGVI 317

Query: 280 EGYRQSGKFEEAIKSFVEMEKLRCEPIEKALEAVLGVYCFAGLVDESKEQFHEIKASGIL 339
           E +RQ G+FEEAIK++VEMEK RC+P E+ LEAVL VYCFAGLV+ES+EQF EIKA GIL
Sbjct: 318 EAFRQGGQFEEAIKAYVEMEKARCDPDEQTLEAVLSVYCFAGLVEESEEQFGEIKALGIL 377

Query: 340 PSVLCYCMMLAVYAKNDRWDDACELLDETITNRVSSVHQVIGQMIKGDYDN 391
           PSV+CYCMMLAVYAK DRWDDA +LLDE  TNRVS++HQVIGQMI+GDYD+
Sbjct: 378 PSVMCYCMMLAVYAKADRWDDAHQLLDEMFTNRVSNIHQVIGQMIRGDYDD 408

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
PP124_ARATH  2.2e-93  49.00     Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
PP178_ARATH  2.7e-35  28.33     Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop...
PP362_ARATH  3.3e-33  24.77     Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN...
PP413_ARATH  7.7e-30  29.71     Pentatricopeptide repeat-containing protein At5g42310, mitochondrial OS=Arabidop...
PP213_ARATH  2.9e-29  26.05     Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
Match Name        E-value   Identity  Description
A0A0A0LW62_CUCSA  1.5e-112  60.11     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G045740 PE=4 SV=1
F6GSY1_VITVI      3.0e-105  55.84     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g07500 PE=4 SV=...
V4SYF8_9ROSI      1.3e-103  54.42     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10018817mg PE=4 SV=1
A0A067K823_JATCU  4.8e-103  55.20     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11544 PE=4 SV=1
A0A061FUI6_THECC  1.7e-100  52.99     Plastid transcriptionally active 2 isoform 3 OS=Theobroma cacao GN=TCM_011951 PE...
Match Name   E-value  Identity  Description
AT1G74850.1  1.2e-94  49.00     plastid transcriptionally active 2
AT2G31400.1  1.5e-36  28.33     genomes uncoupled 1
AT5G02860.1  1.9e-34  24.77     Pentatricopeptide repeat (PPR) superfamily protein
AT5G42310.1  4.3e-31  29.71     Pentatricopeptide repeat (PPR-like) superfamily protein
AT3G04760.1  1.6e-30  26.05     Pentatricopeptide repeat (PPR-like) superfamily protein
Match Name                       E-value   Identity  Description
gi|659067140|ref|XP_008437858.1|  6.6e-114  60.68     PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic ...
gi|659067138|ref|XP_008437850.1|  6.6e-114  60.68     PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic ...
gi|449469490|ref|XP_004152453.1|  2.1e-112  60.11     PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic ...
gi|778657681|ref|XP_011651334.1|  2.1e-112  60.11     PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic ...
gi|297733858|emb|CBI15105.3|      4.3e-105  55.84     unnamed protein product [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0045893      positive regulation of transcription, DNA-templated
biological_process  GO:0042793      transcription from plastid promoter
biological_process  GO:0044767      single-organism developmental process
biological_process  GO:0044763      single-organism cellular process
biological_process  GO:0009657      plastid organization
biological_process  GO:0034660      ncRNA metabolic process
biological_process  GO:0010467      gene expression
biological_process  GO:0006399      tRNA metabolic process
biological_process  GO:0010027      thylakoid membrane organization
biological_process  GO:0010103      stomatal complex morphogenesis
biological_process  GO:0006364      rRNA processing
biological_process  GO:0035304      regulation of protein dephosphorylation
biological_process  GO:0045036      protein targeting to chloroplast
biological_process  GO:0010207      photosystem II assembly
biological_process  GO:0009965      leaf morphogenesis
biological_process  GO:0009902      chloroplast relocation
biological_process  GO:0030154      cell differentiation
biological_process  GO:0008150      biological_process
cellular_component  GO:0009508      plastid chromosome
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG09g10520.1  Cp4.1LG09g10520.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 313..338  score: 0.28
    coord: 274..303  score: 0.0064
    coord: 344..368  score: 0.
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 124..171  score: 7.6
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 189..247  score: 1.7E-6
    coord: 64..116  score: 7.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 274..305  score: 0.0024
    coord: 344..369  score: 3.2E-4
    coord: 128..161  score: 1.5E-8
    coord: 239..271  score: 4.5E-5
    coord: 43..73  score: 7.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 341..375  score: 8.385
    coord: 160..200  score: 8.451
    coord: 125..159  score: 12.375
    coord: 76..114  score: 9.602
    coord: 306..340  score: 7.739
    coord: 271..305  score: 10.643
    coord: 236..270  score: 9.383
    coord: 41..75  score: 9.219
    coord: 201..235  score: 7
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10  -
    coord: 128..301  score: 1.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 287..384  score: 5.7E-174
    coord: 32..251  score: 5.7E
None  No IPR available  PANTHER  PTHR24015:SF726  SUBFAMILY NOT NAMED
    coord: 287..384  score: 5.7E-174
    coord: 32..251  score: 5.7E
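
The PROFILE PS51375 hits above are listed out of order. Sorted by start coordinate, they tile the protein as consecutive repeat units. The following is a minimal sketch in Python of that ordering, using the coordinates copied from the listing above. The name `order_hits` is a hypothetical helper introduced for illustration.

```python
# PS51375 (PPR) hit coordinates copied from the InterPro listing above,
# in the order they appear on the page.
ps51375_hits = [
    (341, 375), (160, 200), (125, 159), (76, 114), (306, 340),
    (271, 305), (236, 270), (41, 75), (201, 235),
]

def order_hits(hits):
    """Sort hits by start; return (start, end, length, gap_to_previous) rows.

    gap_to_previous is the number of residues between this hit and the
    preceding one (0 means directly adjacent, None for the first hit).
    """
    rows = []
    prev_end = None
    for start, end in sorted(hits):
        gap = None if prev_end is None else start - prev_end - 1
        rows.append((start, end, end - start + 1, gap))
        prev_end = end
    return rows

for start, end, length, gap in order_hits(ps51375_hits):
    print(f"{start:>3}..{end:<3}  length={length:<2}  gap={gap}")
```

With these coordinates, the only non-adjacent pair is 76..114 followed by 125..159; every other hit starts immediately after the previous one ends.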

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG09g10520  Cp4.1LG20g02190  Cucurbita pepo (Zucchini)  cpecpeB049