CSPI05G23780 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI05G23780
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr5: 23714177 .. 23716589 (+)
RNA-Seq Expression: CSPI05G23780
Synteny: CSPI05G23780
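
The Location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and compute the length of the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr5: 23714177 .. 23716589 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5: 23714177 .. 23716589 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the span covers 2413 bases on the plus strand of Chr5.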
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAGTTTGTGGCGGTGGAAAATTTTCCGGGCAGGTGGACGCTCTTTCTTTTCTTCCCCGCTTCAACGGACAATCCTTTTCACAATTTCAAGGTGAAAATTTTCTTTTTGTCATTGCTTGAAACATTATCTTCTGCAATGGATCTATTAAGCTTCATCCTGCTCTCGTTTCTGCTACGTTTACCACTCTGAAACACCGCCTACTTTTAACTAATTTCATTTTGGATTTGCTTTATCTCTTCCATGGGCAGAATGTTTTTCAAATTGTTTAGAAGTTGAGATAATTTGCTCTTGTTTTGCTATTCTTGTTCTCTGTAATGCACTGCGTCTCGTTAGAGTTTAATTATTGAGTATTGAAATGGCAATCTCCTGTATTCCAACTTGCCAATTCACTCTCACAAAACCAAGCTCCGCCTTTTCCAAGAATGAGTTCGTAATAAATCAACTCCACCCTCTTTCTCTTTTGTCCAAATGTACATCTCTCAACGAGCTCAAGCAAATTCAAGCATATACCATTAAAACCAATCTTCAGAGTGATATCTCTGTTCTCACCAAGCTCATTAATTTCTGCACACTCAACCCGACAACTTCGTATATGGACCATGCCCACCATCTGTTTGATCAAATTCTTGACAAGGATATTATTCTGTTTAACATAATGGCACGGGGTTATGCTCGCTCTAATTCTCCCTATCTTGCCTTTTCTCTTTTTGGTGAACTCTTGTGCTCTGGTCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCGTGTGCGAGTTCTAAGGCCTTGAGAGAAGGTATGGGGTTGCATTGTTTTGCTGTTAAACTTGGACTGAATCATAATATTTACATATGCCCAACTCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCTGCGCGTGGAGTGTTTGATGAAATGGAACAGCCATGCATAGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCCTTGTTTAGAGAATTGCAAGCGAGCAATATTGAGCCAACTGATGTAACTATGCTTAGTGTAATTATGTCATGTGCTCTGTTGGGCGCACTAGACCTGGGAAAGTGGATTCATGAATATGTTAAGAAGAAAGGTTTTGACAAATATGTGAAGGTGAACACTGCACTTATAGATATGTTTGCGAAATGTGGAAGTCTAACTGATGCTATTTCTATCTTTGAGGGCATGCGTGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTTGCCTTTGCAACTCATGGGGATGGCTTGAAAGCTATCTCAATGTTTGAAGAGATGAAGAGGGAAGGAGTTAGACCTGATGAGATCACCTTTTTGGGGCTTTTGTATGCTTGTAGTCATGCTGGGCTAGTAGAACAAGGTAGAGGGTATTTCTATAGTATGTCTAAAACCTATGGGATAACTCCAGGGATCAAGCATTACGGTTGTATGGTGGATTTGCTTGGTCGAGCAGGTCATTTAGATGAGGCTTATAACTTTGTAGATAAACTGGAAACTAAGGCCACACCTATACTCTGGCGCACCCTTTTATCTGCTTGCAGCACCCATGGTAATGTCGAAATGGCAAAGCGGGTTATTGAAAGAATTTTTGAATTAGATGACGCCCATGGAGGGGACTATGTTATATTATCAAACTTGTATGCTAGAGTAGGAAGATGGGAAGATGTGAATCATTTAAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGTTCCGTGGAGGTAAACAATGTTGTACATGAGTTCTTCTCTGGAGATGGAGTTCACTGCGTTTCGGTGGAGTTACGACGAGCACTTGACGAGTTAATGAAAGAAATAAAGTTGGTGGGATATGTTCCGGATACATCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTAAGATATCACAGTGAAAAATTGGCCATGGCTTTTGGGCTCCTAAAT
ACACCTCCCGGTACAACAATAAGGGTAGCTAAGAACCTCCGTATTTGTGGAGATTGTCATAGTGCTGCAAAACTTATATCTTTTATTTTTGGGAGGAAAATCGTCATTAGGGACGTTCAACGATTCCATCGATTTGAAGATGGGAAATGCTCGTGTGGTGATTTCTGGTAATAGAATATGTAGAAATCCCTGGCTATTCTTGCAGTTCGATGTTTCTTTCCTGCAATGGCCATGTTTTGAGATTCATTTGGTATATCTTCATTGGGAACCTCTATGTTTGTATATCGACCAAGGAATTCAGAGTAAGATTGATTTGAAGGATGTACATATTTCTACCATCATTACACCGTATGTTAACATGAGTTTAACTATACGGTAGTCAATATATACTCATTTCTATGATAAAACCTCCA

mRNA sequence

AAAGTTTGTGGCGGTGGAAAATTTTCCGGGCAGGTGGACGCTCTTTCTTTTCTTCCCCGCTTCAACGGACAATCCTTTTCACAATTTCAAGGTGAAAATTTTCTTTTTGTCATTGCTTGAAACATTATCTTCTGCAATGGATCTATTAAGCTTCATCCTGCTCTCGTTTCTGCTACGTTTACCACTCTGAAACACCGCCTACTTTTAACTAATTTCATTTTGGATTTGCTTTATCTCTTCCATGGGCAGAATGTTTTTCAAATTGTTTAGAAGTTGAGATAATTTGCTCTTGTTTTGCTATTCTTGTTCTCTGTAATGCACTGCGTCTCGTTAGAGTTTAATTATTGAGTATTGAAATGGCAATCTCCTGTATTCCAACTTGCCAATTCACTCTCACAAAACCAAGCTCCGCCTTTTCCAAGAATGAGTTCGTAATAAATCAACTCCACCCTCTTTCTCTTTTGTCCAAATGTACATCTCTCAACGAGCTCAAGCAAATTCAAGCATATACCATTAAAACCAATCTTCAGAGTGATATCTCTGTTCTCACCAAGCTCATTAATTTCTGCACACTCAACCCGACAACTTCGTATATGGACCATGCCCACCATCTGTTTGATCAAATTCTTGACAAGGATATTATTCTGTTTAACATAATGGCACGGGGTTATGCTCGCTCTAATTCTCCCTATCTTGCCTTTTCTCTTTTTGGTGAACTCTTGTGCTCTGGTCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCGTGTGCGAGTTCTAAGGCCTTGAGAGAAGGTATGGGGTTGCATTGTTTTGCTGTTAAACTTGGACTGAATCATAATATTTACATATGCCCAACTCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCTGCGCGTGGAGTGTTTGATGAAATGGAACAGCCATGCATAGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCCTTGTTTAGAGAATTGCAAGCGAGCAATATTGAGCCAACTGATGTAACTATGCTTAGTGTAATTATGTCATGTGCTCTGTTGGGCGCACTAGACCTGGGAAAGTGGATTCATGAATATGTTAAGAAGAAAGGTTTTGACAAATATGTGAAGGTGAACACTGCACTTATAGATATGTTTGCGAAATGTGGAAGTCTAACTGATGCTATTTCTATCTTTGAGGGCATGCGTGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTTGCCTTTGCAACTCATGGGGATGGCTTGAAAGCTATCTCAATGTTTGAAGAGATGAAGAGGGAAGGAGTTAGACCTGATGAGATCACCTTTTTGGGGCTTTTGTATGCTTGTAGTCATGCTGGGCTAGTAGAACAAGGTAGAGGGTATTTCTATAGTATGTCTAAAACCTATGGGATAACTCCAGGGATCAAGCATTACGGTTGTATGGTGGATTTGCTTGGTCGAGCAGGTCATTTAGATGAGGCTTATAACTTTGTAGATAAACTGGAAACTAAGGCCACACCTATACTCTGGCGCACCCTTTTATCTGCTTGCAGCACCCATGGTAATGTCGAAATGGCAAAGCGGGTTATTGAAAGAATTTTTGAATTAGATGACGCCCATGGAGGGGACTATGTTATATTATCAAACTTGTATGCTAGAGTAGGAAGATGGGAAGATGTGAATCATTTAAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGTTCCGTGGAGGTAAACAATGTTGTACATGAGTTCTTCTCTGGAGATGGAGTTCACTGCGTTTCGGTGGAGTTACGACGAGCACTTGACGAGTTAATGAAAGAAATAAAGTTGGTGGGATATGTTCCGGATACATCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTAAGATATCACAGTGAAAAATTGGCCATGGCTTTTGGGCTCCTAAAT
ACACCTCCCGGTACAACAATAAGGGTAGCTAAGAACCTCCGTATTTGTGGAGATTGTCATAGTGCTGCAAAACTTATATCTTTTATTTTTGGGAGGAAAATCGTCATTAGGGACGTTCAACGATTCCATCGATTTGAAGATGGGAAATGCTCGTGTGGTGATTTCTGGTAATAGAATATGTAGAAATCCCTGGCTATTCTTGCAGTTCGATGTTTCTTTCCTGCAATGGCCATGTTTTGAGATTCATTTGGTATATCTTCATTGGGAACCTCTATGTTTGTATATCGACCAAGGAATTCAGAGTAAGATTGATTTGAAGGATGTACATATTTCTACCATCATTACACCGTATGTTAACATGAGTTTAACTATACGGTAGTCAATATATACTCATTTCTATGATAAAACCTCCA

Coding sequence (CDS)

ATGGCAATCTCCTGTATTCCAACTTGCCAATTCACTCTCACAAAACCAAGCTCCGCCTTTTCCAAGAATGAGTTCGTAATAAATCAACTCCACCCTCTTTCTCTTTTGTCCAAATGTACATCTCTCAACGAGCTCAAGCAAATTCAAGCATATACCATTAAAACCAATCTTCAGAGTGATATCTCTGTTCTCACCAAGCTCATTAATTTCTGCACACTCAACCCGACAACTTCGTATATGGACCATGCCCACCATCTGTTTGATCAAATTCTTGACAAGGATATTATTCTGTTTAACATAATGGCACGGGGTTATGCTCGCTCTAATTCTCCCTATCTTGCCTTTTCTCTTTTTGGTGAACTCTTGTGCTCTGGTCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCGTGTGCGAGTTCTAAGGCCTTGAGAGAAGGTATGGGGTTGCATTGTTTTGCTGTTAAACTTGGACTGAATCATAATATTTACATATGCCCAACTCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCTGCGCGTGGAGTGTTTGATGAAATGGAACAGCCATGCATAGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCCTTGTTTAGAGAATTGCAAGCGAGCAATATTGAGCCAACTGATGTAACTATGCTTAGTGTAATTATGTCATGTGCTCTGTTGGGCGCACTAGACCTGGGAAAGTGGATTCATGAATATGTTAAGAAGAAAGGTTTTGACAAATATGTGAAGGTGAACACTGCACTTATAGATATGTTTGCGAAATGTGGAAGTCTAACTGATGCTATTTCTATCTTTGAGGGCATGCGTGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTTGCCTTTGCAACTCATGGGGATGGCTTGAAAGCTATCTCAATGTTTGAAGAGATGAAGAGGGAAGGAGTTAGACCTGATGAGATCACCTTTTTGGGGCTTTTGTATGCTTGTAGTCATGCTGGGCTAGTAGAACAAGGTAGAGGGTATTTCTATAGTATGTCTAAAACCTATGGGATAACTCCAGGGATCAAGCATTACGGTTGTATGGTGGATTTGCTTGGTCGAGCAGGTCATTTAGATGAGGCTTATAACTTTGTAGATAAACTGGAAACTAAGGCCACACCTATACTCTGGCGCACCCTTTTATCTGCTTGCAGCACCCATGGTAATGTCGAAATGGCAAAGCGGGTTATTGAAAGAATTTTTGAATTAGATGACGCCCATGGAGGGGACTATGTTATATTATCAAACTTGTATGCTAGAGTAGGAAGATGGGAAGATGTGAATCATTTAAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGTTCCGTGGAGGTAAACAATGTTGTACATGAGTTCTTCTCTGGAGATGGAGTTCACTGCGTTTCGGTGGAGTTACGACGAGCACTTGACGAGTTAATGAAAGAAATAAAGTTGGTGGGATATGTTCCGGATACATCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTAAGATATCACAGTGAAAAATTGGCCATGGCTTTTGGGCTCCTAAATACACCTCCCGGTACAACAATAAGGGTAGCTAAGAACCTCCGTATTTGTGGAGATTGTCATAGTGCTGCAAAACTTATATCTTTTATTTTTGGGAGGAAAATCGTCATTAGGGACGTTCAACGATTCCATCGATTTGAAGATGGGAAATGCTCGTGTGGTGATTTCTGGTAA

Protein sequence

MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGDFW*
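
The protein sequence above is the translation of the coding sequence, with `*` marking the stop codon. As a minimal sketch of that relationship, the code below translates DNA with the standard genetic code and checks it against the first and last codons of this CDS. The `translate` helper is hypothetical, and the use of the standard nuclear code is an assumption rather than something stated in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six and last four codons of the CDS shown above.
print(translate("ATGGCAATCTCCTGTATT"))  # MAISCI
print(translate("GATTTCTGGTAA"))        # DFW*
```

Passing the complete CDS string to `translate` should reproduce the full protein sequence if the assumption about the genetic code holds.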
Homology
BLAST of CSPI05G23780 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 832.4 bits (2149), Expect = 3.3e-240
Identity = 390/590 (66.10%), Positives = 481/590 (81.53%), Query Frame = 0

Query: 17  SSAFSKNEFV--INQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLN 76
           +  F+K+  +  +N  +P+ L+SKC SL EL QIQAY IK++++ D+S + KLINFCT +
Sbjct: 15  AETFTKHSKIDTVNTQNPILLISKCNSLRELMQIQAYAIKSHIE-DVSFVAKLINFCTES 74

Query: 77  PTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFS 136
           PT S M +A HLF+ + + DI++FN MARGY+R  +P   FSLF E+L  G+LPD+YTF 
Sbjct: 75  PTESSMSYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFP 134

Query: 137 SLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQP 196
           SLLKACA +KAL EG  LHC ++KLGL+ N+Y+CPTLINMY EC D+++AR VFD + +P
Sbjct: 135 SLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEP 194

Query: 197 CIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIH 256
           C+V YNA+ITGYAR ++PNEALSLFRE+Q   ++P ++T+LSV+ SCALLG+LDLGKWIH
Sbjct: 195 CVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIH 254

Query: 257 EYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGL 316
           +Y KK  F KYVKVNTALIDMFAKCGSL DA+SIFE MR +DTQAWSAMIVA+A HG   
Sbjct: 255 KYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAE 314

Query: 317 KAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCM 376
           K++ MFE M+ E V+PDEITFLGLL ACSH G VE+GR YF  M   +GI P IKHYG M
Sbjct: 315 KSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSM 374

Query: 377 VDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHG 436
           VDLL RAG+L++AY F+DKL    TP+LWR LL+ACS+H N+++A++V ERIFELDD+HG
Sbjct: 375 VDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHG 434

Query: 437 GDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSV 496
           GDYVILSNLYAR  +WE V+ LRK+MKDR  VKVPGCSS+EVNNVVHEFFSGDGV   + 
Sbjct: 435 GDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATT 494

Query: 497 ELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTT 556
           +L RALDE++KE+KL GYVPDTS+V HA+M ++ KE+ LRYHSEKLA+ FGLLNTPPGTT
Sbjct: 495 KLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTT 554

Query: 557 IRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGDFW 605
           IRV KNLR+C DCH+AAKLIS IFGRK+V+RDVQRFH FEDGKCSCGDFW
Sbjct: 555 IRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603
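
Each hit's summary line reports identity and positives as counts over the alignment length, followed by a percentage. The sketch below shows one way to pull those counts out of the line and recompute the percentages. The `parse_hsp_summary` function is hypothetical and its pattern is written only against the line format seen on this page, not against any official BLAST output specification.

```python
import re

# Accepts both "Positives" and the misspelling "Postives" that occurs in some exports.
HSP_PATTERN = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def parse_hsp_summary(line):
    """Return identity and positive counts plus recomputed percentages."""
    m = HSP_PATTERN.search(line)
    if m is None:
        raise ValueError(f"unrecognized HSP summary: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return {
        "identical": ident,
        "positives": pos,
        "aligned_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

summary = parse_hsp_summary(
    "Identity = 390/590 (66.10%), Positives = 481/590 (81.53%), Query Frame = 0"
)
print(summary)
```

For the hit above, 390 of 590 aligned positions are identical and 481 of 590 are scored as positives, which matches the reported 66.10% and 81.53%.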

BLAST of CSPI05G23780 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 5.0e-140
Identity = 238/608 (39.14%), Positives = 365/608 (60.03%), Query Frame = 0

Query: 28  NQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLF 87
           N    +S L +C+   ELKQI A  +KT L  D   +TK ++FC  + ++ ++ +A  +F
Sbjct: 13  NLYETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVF 72

Query: 88  DQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKACASSKALR 147
           D     D  L+N+M RG++ S+ P  +  L+  +LCS    + YTF SLLKAC++  A  
Sbjct: 73  DGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFE 132

Query: 148 EGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYA 207
           E   +H    KLG  +++Y   +LIN YA   +   A  +FD + +P  VS+N++I GY 
Sbjct: 133 ETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYV 192

Query: 208 RSSQPN-------------------------------EALSLFRELQASNIEPTDVTMLS 267
           ++ + +                               EAL LF E+Q S++EP +V++ +
Sbjct: 193 KAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLAN 252

Query: 268 VIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRD 327
            + +CA LGAL+ GKWIH Y+ K        +   LIDM+AKCG + +A+ +F+ ++ + 
Sbjct: 253 ALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKS 312

Query: 328 TQAWSAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFY 387
            QAW+A+I  +A HG G +AIS F EM++ G++P+ ITF  +L ACS+ GLVE+G+  FY
Sbjct: 313 VQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFY 372

Query: 388 SMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNV 447
           SM + Y + P I+HYGC+VDLLGRAG LDEA  F+ ++  K   ++W  LL AC  H N+
Sbjct: 373 SMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNI 432

Query: 448 EMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEV 507
           E+ + + E +  +D  HGG YV  +N++A   +W+     R+LMK++GV KVPGCS++ +
Sbjct: 433 ELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISL 492

Query: 508 NNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYH 567
               HEF +GD  H    +++     + ++++  GYVP+   +    ++++ +E ++  H
Sbjct: 493 EGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQH 552

Query: 568 SEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDG 605
           SEKLA+ +GL+ T PGT IR+ KNLR+C DCH   KLIS I+ R IV+RD  RFH F DG
Sbjct: 553 SEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDG 612

BLAST of CSPI05G23780 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 3.0e-137
Identity = 238/602 (39.53%), Positives = 365/602 (60.63%), Query Frame = 0

Query: 33  LSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLN------------------ 92
           L   +K  +  E +QI  + +K     D+ V T LI+    N                  
Sbjct: 141 LKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDV 200

Query: 93  ----------PTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCS 152
                      +  Y+++A  LFD+I  KD++ +N M  GYA + +   A  LF +++ +
Sbjct: 201 VSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKT 260

Query: 153 GLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAA 212
            + PD+ T  +++ ACA S ++  G  +H +    G   N+ I   LI++Y++C ++  A
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 213 RGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALL 272
            G+F+ +    ++S+N +I GY   +   EAL LF+E+  S   P DVTMLS++ +CA L
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHL 380

Query: 273 GALDLGKWIHEYVKK--KGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSA 332
           GA+D+G+WIH Y+ K  KG      + T+LIDM+AKCG +  A  +F  +  +   +W+A
Sbjct: 381 GAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNA 440

Query: 333 MIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTY 392
           MI  FA HG    +  +F  M++ G++PD+ITF+GLL ACSH+G+++ GR  F +M++ Y
Sbjct: 441 MIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDY 500

Query: 393 GITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAKRV 452
            +TP ++HYGCM+DLLG +G   EA   ++ +E +   ++W +LL AC  HGNVE+ +  
Sbjct: 501 KMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESF 560

Query: 453 IERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHE 512
            E + +++  + G YV+LSN+YA  GRW +V   R L+ D+G+ KVPGCSS+E+++VVHE
Sbjct: 561 AENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHE 620

Query: 513 FFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAM 572
           F  GD  H  + E+   L+E+   ++  G+VPDTS V   +MEEE KE  LR+HSEKLA+
Sbjct: 621 FIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRHHSEKLAI 680

Query: 573 AFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGD 605
           AFGL++T PGT + + KNLR+C +CH A KLIS I+ R+I+ RD  RFH F DG CSC D
Sbjct: 681 AFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCND 740

BLAST of CSPI05G23780 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 2.4e-134
Identity = 232/559 (41.50%), Positives = 354/559 (63.33%), Query Frame = 0

Query: 46  KQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGY 105
           K+I  Y +++   S +++ T L++   +      ++ A  LFD +L+++++ +N M   Y
Sbjct: 256 KEIHGYAMRSGFDSLVNISTALVD---MYAKCGSLETARQLFDGMLERNVVSWNSMIDAY 315

Query: 106 ARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNI 165
            ++ +P  A  +F ++L  G+ P D +    L ACA    L  G  +H  +V+LGL+ N+
Sbjct: 316 VQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNV 375

Query: 166 YICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQAS 225
            +  +LI+MY +C +++ A  +F +++   +VS+NA+I G+A++ +P +AL+ F ++++ 
Sbjct: 376 SVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSR 435

Query: 226 NIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDA 285
            ++P   T +SVI + A L      KWIH  V +   DK V V TAL+DM+AKCG++  A
Sbjct: 436 TVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIA 495

Query: 286 ISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHA 345
             IF+ M  R    W+AMI  + THG G  A+ +FEEM++  ++P+ +TFL ++ ACSH+
Sbjct: 496 RLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHS 555

Query: 346 GLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRT 405
           GLVE G   FY M + Y I   + HYG MVDLLGRAG L+EA++F+ ++  K    ++  
Sbjct: 556 GLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGA 615

Query: 406 LLSACSTHGNVEMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGV 465
           +L AC  H NV  A++  ER+FEL+   GG +V+L+N+Y     WE V  +R  M  +G+
Sbjct: 616 MLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGL 675

Query: 466 VKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADME 525
            K PGCS VE+ N VH FFSG   H  S ++   L++L+  IK  GYVPDT+LV    +E
Sbjct: 676 RKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV--LGVE 735

Query: 526 EEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIR 585
            + KE +L  HSEKLA++FGLLNT  GTTI V KNLR+C DCH+A K IS + GR+IV+R
Sbjct: 736 NDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVR 795

Query: 586 DVQRFHRFEDGKCSCGDFW 605
           D+QRFH F++G CSCGD+W
Sbjct: 796 DMQRFHHFKNGACSCGDYW 809

BLAST of CSPI05G23780 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 480.3 bits (1235), Expect = 3.1e-134
Identity = 236/569 (41.48%), Positives = 369/569 (64.85%), Query Frame = 0

Query: 40  TSLNELKQIQAYTIKTNLQ-SDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDK-DIIL 99
           +S+ +L+QI A++I+  +  SD  +   LI +    P+   M +AH +F +I    ++ +
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 100 FNIMARGYARSNSPYLAFSLFGELLCSGLL-PDDYTFSSLLKACASSKALREGMGLHCFA 159
           +N + RGYA   +   AFSL+ E+  SGL+ PD +T+  L+KA  +   +R G  +H   
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVV 147

Query: 160 VKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEAL 219
           ++ G    IY+  +L+++YA C D+ +A  VFD+M +  +V++N++I G+A + +P EAL
Sbjct: 148 IRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEAL 207

Query: 220 SLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMF 279
           +L+ E+ +  I+P   T++S++ +CA +GAL LGK +H Y+ K G  + +  +  L+D++
Sbjct: 208 ALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLY 267

Query: 280 AKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMK-REGVRPDEITF 339
           A+CG + +A ++F+ M  +++ +W+++IV  A +G G +AI +F+ M+  EG+ P EITF
Sbjct: 268 ARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITF 327

Query: 340 LGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLE 399
           +G+LYACSH G+V++G  YF  M + Y I P I+H+GCMVDLL RAG + +AY ++  + 
Sbjct: 328 VGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMP 387

Query: 400 TKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNH 459
            +   ++WRTLL AC+ HG+ ++A+    +I +L+  H GDYV+LSN+YA   RW DV  
Sbjct: 388 MQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQK 447

Query: 460 LRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPD 519
           +RK M   GV KVPG S VEV N VHEF  GD  H  S  +   L E+   ++  GYVP 
Sbjct: 448 IRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQ 507

Query: 520 TSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLIS 579
            S VY  D+EEE KE  + YHSEK+A+AF L++TP  + I V KNLR+C DCH A KL+S
Sbjct: 508 ISNVY-VDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVS 567

Query: 580 FIFGRKIVIRDVQRFHRFEDGKCSCGDFW 605
            ++ R+IV+RD  RFH F++G CSC D+W
Sbjct: 568 KVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CSPI05G23780 vs. ExPASy TrEMBL
Match: A0A0A0KU15 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G605160 PE=3 SV=1)

HSP 1 Score: 1233.8 bits (3191), Expect = 0.0e+00
Identity = 602/604 (99.67%), Positives = 603/604 (99.83%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD
Sbjct: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK
Sbjct: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
           TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLE KATPILWRTLLSACSTHGNVEMAK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLEIKATPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRVAKNLRICGDCH+AAKLISFIFGRKIVIRDVQRFHRFEDGKCSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600

Query: 601 GDFW 605
           GDFW
Sbjct: 601 GDFW 604

BLAST of CSPI05G23780 vs. ExPASy TrEMBL
Match: A0A5A7STH8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1932G00400 PE=3 SV=1)

HSP 1 Score: 1203.7 bits (3113), Expect = 0.0e+00
Identity = 584/604 (96.69%), Positives = 596/604 (98.68%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPTCQFTLTKPSS FSKNEFVINQLHPLSLLSKCTSL ELKQIQAYTIKTNLQSD
Sbjct: 1   MAISCIPTCQFTLTKPSSTFSKNEFVINQLHPLSLLSKCTSLKELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLF +
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFAQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           LLCSGLLPDDYTFSSLLKACASSKALR+GMGLHCFAVKLGLNHNIYICPTLINMYAECND
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQAS+IEPTDVTMLSVIMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASDIEPTDVTMLSVIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLK+IS+FEEMKR GVRPDEITFLGLLYACSHAGLVEQGRGYFYSMS+
Sbjct: 301 SAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSR 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
           TYGITPGIKHYGCMVDLLGR G LDEAYNFVD+LE K TPILWRTLLSACSTHGNVEMAK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPTPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVHCVSVELRRALDELMKEIKLVGY+PDTSLVYHADM+EEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLVYHADMDEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRVAKNLRICGDCH+AAKLISFIFGRKIVIRDVQRFH+FEDGKCSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHQFEDGKCSC 600

Query: 601 GDFW 605
           GDFW
Sbjct: 601 GDFW 604

BLAST of CSPI05G23780 vs. ExPASy TrEMBL
Match: A0A1S3BFK0 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103489124 PE=3 SV=1)

HSP 1 Score: 1203.7 bits (3113), Expect = 0.0e+00
Identity = 584/604 (96.69%), Positives = 596/604 (98.68%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPTCQFTLTKPSS FSKNEFVINQLHPLSLLSKCTSL ELKQIQAYTIKTNLQSD
Sbjct: 1   MAISCIPTCQFTLTKPSSTFSKNEFVINQLHPLSLLSKCTSLKELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLF +
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFAQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           LLCSGLLPDDYTFSSLLKACASSKALR+GMGLHCFAVKLGLNHNIYICPTLINMYAECND
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQAS+IEPTDVTMLSVIMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASDIEPTDVTMLSVIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLK+IS+FEEMKR GVRPDEITFLGLLYACSHAGLVEQGRGYFYSMS+
Sbjct: 301 SAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSR 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
           TYGITPGIKHYGCMVDLLGR G LDEAYNFVD+LE K TPILWRTLLSACSTHGNVEMAK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPTPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVHCVSVELRRALDELMKEIKLVGY+PDTSLVYHADM+EEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLVYHADMDEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRVAKNLRICGDCH+AAKLISFIFGRKIVIRDVQRFH+FEDGKCSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHQFEDGKCSC 600

Query: 601 GDFW 605
           GDFW
Sbjct: 601 GDFW 604

BLAST of CSPI05G23780 vs. ExPASy TrEMBL
Match: A0A6J1DA68 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019083 PE=3 SV=1)

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 535/604 (88.58%), Positives = 567/604 (93.87%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           M ISCIP+CQFTL KP+SAF  NEF  N  HPL LLSKCTSL ELKQIQA+TIKTNLQ+D
Sbjct: 1   MGISCIPSCQFTLAKPNSAFPNNEFT-NPPHPLFLLSKCTSLRELKQIQAFTIKTNLQND 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTK+INFCTLNP+TS MDHAHHLFDQI DKDI+LFNIMARGYARSN+PYLAFSLF +
Sbjct: 61  ISVLTKIINFCTLNPSTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNTPYLAFSLFSQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           +LCSGLLPDDYTFSSLLKACASSKA  EG  LHCFA+KLGLNHNIYICP+LIN+YAECND
Sbjct: 121 VLCSGLLPDDYTFSSLLKACASSKAFSEGRQLHCFAIKLGLNHNIYICPSLINLYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEME PCIVSYNAIITG+ARSSQPNEALSLFRELQASN+EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDEMEAPCIVSYNAIITGHARSSQPNEALSLFRELQASNLEPTDVTMLSIIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLG+WIHEYVKKKGFDK+VKVNTALIDM+AKCGSL DAISIFE MRVRDTQAW
Sbjct: 241 CALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSLVDAISIFEDMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVA+ATHGDGLKAISMFEEMKR GVRPDEITFLGLLYACSHAGLVE+GRGYF SMSK
Sbjct: 301 SAMIVAYATHGDGLKAISMFEEMKRAGVRPDEITFLGLLYACSHAGLVEEGRGYFNSMSK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
            YGI PGIKHYGCMVDLLGR GHLDEAY F+D  E K TPILWRTLLSACS  GNV++AK
Sbjct: 361 YYGIAPGIKHYGCMVDLLGRTGHLDEAYKFIDGSEIKPTPILWRTLLSACSNRGNVDLAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNL ARVGRWEDVNH+RKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHIRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVH +SVELRRALDEL+KEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHSISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           A+AFGLLN+PPGT IRV KNLRICGDCH+AAKLISFIFGR+IVIRDVQRFHRFEDGKCSC
Sbjct: 541 AIAFGLLNSPPGTPIRVVKNLRICGDCHTAAKLISFIFGRQIVIRDVQRFHRFEDGKCSC 600

Query: 601 GDFW 605
            DFW
Sbjct: 601 CDFW 603

BLAST of CSPI05G23780 vs. ExPASy TrEMBL
Match: A0A6J1HTT7 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111467412 PE=3 SV=1)

HSP 1 Score: 1094.0 bits (2828), Expect = 0.0e+00
Identity = 529/604 (87.58%), Positives = 560/604 (92.72%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPTCQF LTKP     KNEF INQ HPLSL SKC SL ELKQIQAYTIKTNL +D
Sbjct: 1   MAISCIPTCQFALTKP-----KNEF-INQPHPLSLFSKCASLRELKQIQAYTIKTNLHND 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCT  PTTS MDHAHHLFD++LDKDI+LFNIMARGYARSNSPYL FSLF +
Sbjct: 61  ISVLTKLINFCTRYPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           +LCSGLLPDDYTFSSLLKACA SKAL EG  LHCFA+KLG  HNIYICPTLINMYA CND
Sbjct: 121 VLCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGFGHNIYICPTLINMYAACND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFD ME+PCIVSYNAIITGYARSSQPNEALSLFRELQASN+EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLG+WIHEYVKKKGFDK+VKVNTALIDM+AKCGS+ DAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVA+ATHGDGLKAISMFEEMK+ GVRPDEITFLGLLYACSHAGLVE+GRGYFYSM K
Sbjct: 301 SAMIVAYATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRGYFYSMYK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
            +G+TPGIKHYGCMVDLLGR G LDEAY F+D+L  K TPILWRTLLSACS HGNV++AK
Sbjct: 361 NHGMTPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNL AR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVH +SVELRRALDEL++EIKL GYVPDTSLVYHADMEEE KELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHSISVELRRALDELIQEIKLAGYVPDTSLVYHADMEEEAKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRV KNLRICGDCH+AAKLIS IFGR+IVIRDVQRFHRFEDG+CSC
Sbjct: 541 AMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSC 598

Query: 601 GDFW 605
            DFW
Sbjct: 601 CDFW 598
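The percentages on each HSP summary line are the match counts divided by the alignment length, rounded to two decimals. The sketch below recomputes them from one summary line; `parse_hsp_stats` is a hypothetical helper name, and the regular expression assumes the `label = hits/length (percent%)` layout used on this page.

```python
import re

# Hypothetical helper, not part of any BLAST toolkit.
# Matches "Identity = 529/604" and the positives field; the label match is
# deliberately lenient about how the positives label is spelled.
_PAIR = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")

def parse_hsp_stats(line):
    """Recompute identity/positive percentages from an HSP summary line."""
    stats = {}
    for label, hits, length in _PAIR.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100.0 * int(hits) / int(length), 2)
    return stats

line = "Identity = 529/604 (87.58%), Positives = 560/604 (92.72%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 87.58, 'positives': 92.72}
```

The denominator is the alignment length including gap columns, so it can differ from the length of either sequence.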

BLAST of CSPI05G23780 vs. NCBI nr
Match: XP_031742214.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucumis sativus])

HSP 1 Score: 1233.8 bits (3191), Expect = 0.0e+00
Identity = 602/604 (99.67%), Positives = 603/604 (99.83%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD
Sbjct: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK
Sbjct: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
           TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLE KATPILWRTLLSACSTHGNVEMAK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLEIKATPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRVAKNLRICGDCH+AAKLISFIFGRKIVIRDVQRFHRFEDGKCSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600

Query: 601 GDFW 605
           GDFW
Sbjct: 601 GDFW 604

BLAST of CSPI05G23780 vs. NCBI nr
Match: XP_008446357.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucumis melo] >KAA0034422.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK17712.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1203.7 bits (3113), Expect = 0.0e+00
Identity = 584/604 (96.69%), Positives = 596/604 (98.68%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPTCQFTLTKPSS FSKNEFVINQLHPLSLLSKCTSL ELKQIQAYTIKTNLQSD
Sbjct: 1   MAISCIPTCQFTLTKPSSTFSKNEFVINQLHPLSLLSKCTSLKELKQIQAYTIKTNLQSD 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLF +
Sbjct: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFAQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           LLCSGLLPDDYTFSSLLKACASSKALR+GMGLHCFAVKLGLNHNIYICPTLINMYAECND
Sbjct: 121 LLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQAS+IEPTDVTMLSVIMS
Sbjct: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASDIEPTDVTMLSVIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLK+IS+FEEMKR GVRPDEITFLGLLYACSHAGLVEQGRGYFYSMS+
Sbjct: 301 SAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSR 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
           TYGITPGIKHYGCMVDLLGR G LDEAYNFVD+LE K TPILWRTLLSACSTHGNVEMAK
Sbjct: 361 TYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPTPILWRTLLSACSTHGNVEMAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVHCVSVELRRALDELMKEIKLVGY+PDTSLVYHADM+EEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLVYHADMDEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRVAKNLRICGDCH+AAKLISFIFGRKIVIRDVQRFH+FEDGKCSC
Sbjct: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFGRKIVIRDVQRFHQFEDGKCSC 600

Query: 601 GDFW 605
           GDFW
Sbjct: 601 GDFW 604
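In these pairwise alignments the unlabeled middle line encodes the comparison column by column: a letter marks an identical residue, `+` marks a conservative (positive-scoring) substitution, and a space marks a mismatch or gap. A minimal sketch of tallying one midline segment follows; `summarize_midline` is a hypothetical name, and the caller is assumed to have already cut the midline out of the report with its leading indentation removed.

```python
def summarize_midline(midline):
    """Return (identities, positives, columns) for one BLAST midline segment.

    Letters are identities, '+' are conservative substitutions, and every
    character (including spaces) counts as one alignment column.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline for query residues 541..600 of the Cucumis melo hit above.
segment = "AMAFGLLNTPPGTTIRVAKNLRICGDCH+AAKLISFIFGRKIVIRDVQRFH+FEDGKCSC"
print(summarize_midline(segment))  # (58, 60, 60)
```

Summing these tuples over every segment of an HSP should reproduce the Identity and Positives counts on its summary line.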

BLAST of CSPI05G23780 vs. NCBI nr
Match: XP_038893049.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Benincasa hispida])

HSP 1 Score: 1138.3 bits (2943), Expect = 0.0e+00
Identity = 549/604 (90.89%), Positives = 576/604 (95.36%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           M +SCIPTCQFTLTKPSS FS NEF INQ HPLSLLSKCTSL ELKQIQAYTIKTNLQ+D
Sbjct: 1   MTVSCIPTCQFTLTKPSSTFSNNEF-INQPHPLSLLSKCTSLRELKQIQAYTIKTNLQND 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           +SVLTKLIN CT NPTTS MD+AHHLFDQI DKDI+LFNIMARGYARSNSP LAFSLF +
Sbjct: 61  VSVLTKLINICTRNPTTSSMDYAHHLFDQISDKDIVLFNIMARGYARSNSPNLAFSLFAK 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           +L SGLLPDDYTFSSLLKACASSKA ++GM LHCFA+KLGLNHNIYICPTLINMYAECND
Sbjct: 121 VLSSGLLPDDYTFSSLLKACASSKAFKQGMELHCFAIKLGLNHNIYICPTLINMYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDE+EQPCIVSYNAIITGYARSSQPNEALSLFRELQAS++EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDEIEQPCIVSYNAIITGYARSSQPNEALSLFRELQASHLEPTDVTMLSIIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLG+WIHEYVKKKGFDKYVKVNTALIDMFAKCGSL DAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGRWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLKAISMFEEMKR GVRPDEITFLGLLYACSHAGLVEQGR YFY+MSK
Sbjct: 301 SAMIVAFATHGDGLKAISMFEEMKRTGVRPDEITFLGLLYACSHAGLVEQGREYFYNMSK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
            YGITPGIKHYGCMVDLLGR G LDEAYNF+D+LE K TP+LWRTLLSACSTHGNV+MAK
Sbjct: 361 NYGITPGIKHYGCMVDLLGRTGRLDEAYNFIDELEIKPTPVLWRTLLSACSTHGNVDMAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RV ERIFELDD+HGGDYVILSNL ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVTERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVHC+SVELRRALDEL+KEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHCISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRV KNLRICGDCH+AAKLIS IFGR+I+IRDVQRFHRFE+GKCSC
Sbjct: 541 AMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIIIRDVQRFHRFENGKCSC 600

Query: 601 GDFW 605
            DFW
Sbjct: 601 SDFW 603

BLAST of CSPI05G23780 vs. NCBI nr
Match: XP_022151060.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 535/604 (88.58%), Positives = 567/604 (93.87%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           M ISCIP+CQFTL KP+SAF  NEF  N  HPL LLSKCTSL ELKQIQA+TIKTNLQ+D
Sbjct: 1   MGISCIPSCQFTLAKPNSAFPNNEFT-NPPHPLFLLSKCTSLRELKQIQAFTIKTNLQND 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTK+INFCTLNP+TS MDHAHHLFDQI DKDI+LFNIMARGYARSN+PYLAFSLF +
Sbjct: 61  ISVLTKIINFCTLNPSTSSMDHAHHLFDQIPDKDIVLFNIMARGYARSNTPYLAFSLFSQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           +LCSGLLPDDYTFSSLLKACASSKA  EG  LHCFA+KLGLNHNIYICP+LIN+YAECND
Sbjct: 121 VLCSGLLPDDYTFSSLLKACASSKAFSEGRQLHCFAIKLGLNHNIYICPSLINLYAECND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFDEME PCIVSYNAIITG+ARSSQPNEALSLFRELQASN+EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDEMEAPCIVSYNAIITGHARSSQPNEALSLFRELQASNLEPTDVTMLSIIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLG+WIHEYVKKKGFDK+VKVNTALIDM+AKCGSL DAISIFE MRVRDTQAW
Sbjct: 241 CALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSLVDAISIFEDMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVA+ATHGDGLKAISMFEEMKR GVRPDEITFLGLLYACSHAGLVE+GRGYF SMSK
Sbjct: 301 SAMIVAYATHGDGLKAISMFEEMKRAGVRPDEITFLGLLYACSHAGLVEEGRGYFNSMSK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
            YGI PGIKHYGCMVDLLGR GHLDEAY F+D  E K TPILWRTLLSACS  GNV++AK
Sbjct: 361 YYGIAPGIKHYGCMVDLLGRTGHLDEAYKFIDGSEIKPTPILWRTLLSACSNRGNVDLAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNL ARVGRWEDVNH+RKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHIRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVH +SVELRRALDEL+KEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHSISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           A+AFGLLN+PPGT IRV KNLRICGDCH+AAKLISFIFGR+IVIRDVQRFHRFEDGKCSC
Sbjct: 541 AIAFGLLNSPPGTPIRVVKNLRICGDCHTAAKLISFIFGRQIVIRDVQRFHRFEDGKCSC 600

Query: 601 GDFW 605
            DFW
Sbjct: 601 CDFW 603

BLAST of CSPI05G23780 vs. NCBI nr
Match: XP_023541252.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1095.1 bits (2831), Expect = 0.0e+00
Identity = 531/604 (87.91%), Positives = 561/604 (92.88%), Query Frame = 0

Query: 1   MAISCIPTCQFTLTKPSSAFSKNEFVINQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSD 60
           MAISCIPT QF L+KP     KNEF INQ HPLSL SKC+SL ELKQIQAYTIKTNL +D
Sbjct: 1   MAISCIPTSQFALSKP-----KNEF-INQPHPLSLFSKCSSLRELKQIQAYTIKTNLHND 60

Query: 61  ISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGE 120
           ISVLTKLINFCT NPT S MDHAHHLFD++LDKDI+LFNIMARGYARSNSPYL FSLF +
Sbjct: 61  ISVLTKLINFCTRNPTISSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQ 120

Query: 121 LLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECND 180
           +LCSGLLPDDYTFSSLLKACA SKAL EG  LHCFA+KLGL HNIYICPTLINMYA CND
Sbjct: 121 VLCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGLGHNIYICPTLINMYAACND 180

Query: 181 MNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMS 240
           MNAARGVFD ME+PCIVSYNAIITGYARSSQPNEALSLFRELQASN+EPTDVTMLS+IMS
Sbjct: 181 MNAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMS 240

Query: 241 CALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAW 300
           CALLGALDLG+WIHEYVKKKGFDK+VKVNTALIDM+AKCGS+ DAISIFEGMRVRDTQAW
Sbjct: 241 CALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAW 300

Query: 301 SAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK 360
           SAMIVAFATHGDGLKAISMFEEMK+ GVRPDEITFLGLLYACSHAGLVE+GRGYFYSM K
Sbjct: 301 SAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVEEGRGYFYSMYK 360

Query: 361 TYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAK 420
            +GITPGIKHYGCMVDLLGR G LDEAY F+D+L  K TPILWRTLLSACS HGNV++AK
Sbjct: 361 NHGITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAK 420

Query: 421 RVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVV 480
           RVIERIFELDD+HGGDYVILSNL AR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVV
Sbjct: 421 RVIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVV 480

Query: 481 HEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKL 540
           HEFFSGDGVH +SVELRRALDEL+KEIKL GYVPDTSLVYHADMEEE KELVLRYHSEKL
Sbjct: 481 HEFFSGDGVHSISVELRRALDELIKEIKLAGYVPDTSLVYHADMEEEAKELVLRYHSEKL 540

Query: 541 AMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSC 600
           AMAFGLLNTPPGTTIRV KNLRICGDCH+AAKLIS IFGR+IVIRDVQRFHRFEDG+CSC
Sbjct: 541 AMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSC 598

Query: 601 GDFW 605
            DFW
Sbjct: 601 CDFW 598

BLAST of CSPI05G23780 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 832.4 bits (2149), Expect = 2.3e-241
Identity = 390/590 (66.10%), Positives = 481/590 (81.53%), Query Frame = 0

Query: 17  SSAFSKNEFV--INQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLN 76
           +  F+K+  +  +N  +P+ L+SKC SL EL QIQAY IK++++ D+S + KLINFCT +
Sbjct: 15  AETFTKHSKIDTVNTQNPILLISKCNSLRELMQIQAYAIKSHIE-DVSFVAKLINFCTES 74

Query: 77  PTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFS 136
           PT S M +A HLF+ + + DI++FN MARGY+R  +P   FSLF E+L  G+LPD+YTF 
Sbjct: 75  PTESSMSYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFP 134

Query: 137 SLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQP 196
           SLLKACA +KAL EG  LHC ++KLGL+ N+Y+CPTLINMY EC D+++AR VFD + +P
Sbjct: 135 SLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEP 194

Query: 197 CIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIH 256
           C+V YNA+ITGYAR ++PNEALSLFRE+Q   ++P ++T+LSV+ SCALLG+LDLGKWIH
Sbjct: 195 CVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIH 254

Query: 257 EYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGL 316
           +Y KK  F KYVKVNTALIDMFAKCGSL DA+SIFE MR +DTQAWSAMIVA+A HG   
Sbjct: 255 KYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAE 314

Query: 317 KAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCM 376
           K++ MFE M+ E V+PDEITFLGLL ACSH G VE+GR YF  M   +GI P IKHYG M
Sbjct: 315 KSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSM 374

Query: 377 VDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHG 436
           VDLL RAG+L++AY F+DKL    TP+LWR LL+ACS+H N+++A++V ERIFELDD+HG
Sbjct: 375 VDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHG 434

Query: 437 GDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSV 496
           GDYVILSNLYAR  +WE V+ LRK+MKDR  VKVPGCSS+EVNNVVHEFFSGDGV   + 
Sbjct: 435 GDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATT 494

Query: 497 ELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTT 556
           +L RALDE++KE+KL GYVPDTS+V HA+M ++ KE+ LRYHSEKLA+ FGLLNTPPGTT
Sbjct: 495 KLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTT 554

Query: 557 IRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGDFW 605
           IRV KNLR+C DCH+AAKLIS IFGRK+V+RDVQRFH FEDGKCSCGDFW
Sbjct: 555 IRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CSPI05G23780 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 499.6 bits (1285), Expect = 3.6e-141
Identity = 238/608 (39.14%), Positives = 365/608 (60.03%), Query Frame = 0

Query: 28  NQLHPLSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLF 87
           N    +S L +C+   ELKQI A  +KT L  D   +TK ++FC  + ++ ++ +A  +F
Sbjct: 13  NLYETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVF 72

Query: 88  DQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKACASSKALR 147
           D     D  L+N+M RG++ S+ P  +  L+  +LCS    + YTF SLLKAC++  A  
Sbjct: 73  DGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFE 132

Query: 148 EGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYA 207
           E   +H    KLG  +++Y   +LIN YA   +   A  +FD + +P  VS+N++I GY 
Sbjct: 133 ETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYV 192

Query: 208 RSSQPN-------------------------------EALSLFRELQASNIEPTDVTMLS 267
           ++ + +                               EAL LF E+Q S++EP +V++ +
Sbjct: 193 KAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLAN 252

Query: 268 VIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRD 327
            + +CA LGAL+ GKWIH Y+ K        +   LIDM+AKCG + +A+ +F+ ++ + 
Sbjct: 253 ALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKS 312

Query: 328 TQAWSAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFY 387
            QAW+A+I  +A HG G +AIS F EM++ G++P+ ITF  +L ACS+ GLVE+G+  FY
Sbjct: 313 VQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFY 372

Query: 388 SMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNV 447
           SM + Y + P I+HYGC+VDLLGRAG LDEA  F+ ++  K   ++W  LL AC  H N+
Sbjct: 373 SMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNI 432

Query: 448 EMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEV 507
           E+ + + E +  +D  HGG YV  +N++A   +W+     R+LMK++GV KVPGCS++ +
Sbjct: 433 ELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISL 492

Query: 508 NNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYH 567
               HEF +GD  H    +++     + ++++  GYVP+   +    ++++ +E ++  H
Sbjct: 493 EGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQH 552

Query: 568 SEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDG 605
           SEKLA+ +GL+ T PGT IR+ KNLR+C DCH   KLIS I+ R IV+RD  RFH F DG
Sbjct: 553 SEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDG 612

BLAST of CSPI05G23780 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 490.3 bits (1261), Expect = 2.2e-138
Identity = 238/602 (39.53%), Positives = 365/602 (60.63%), Query Frame = 0

Query: 33  LSLLSKCTSLNELKQIQAYTIKTNLQSDISVLTKLINFCTLN------------------ 92
           L   +K  +  E +QI  + +K     D+ V T LI+    N                  
Sbjct: 141 LKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDV 200

Query: 93  ----------PTTSYMDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCS 152
                      +  Y+++A  LFD+I  KD++ +N M  GYA + +   A  LF +++ +
Sbjct: 201 VSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKT 260

Query: 153 GLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAA 212
            + PD+ T  +++ ACA S ++  G  +H +    G   N+ I   LI++Y++C ++  A
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 213 RGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALL 272
            G+F+ +    ++S+N +I GY   +   EAL LF+E+  S   P DVTMLS++ +CA L
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHL 380

Query: 273 GALDLGKWIHEYVKK--KGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSA 332
           GA+D+G+WIH Y+ K  KG      + T+LIDM+AKCG +  A  +F  +  +   +W+A
Sbjct: 381 GAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNA 440

Query: 333 MIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTY 392
           MI  FA HG    +  +F  M++ G++PD+ITF+GLL ACSH+G+++ GR  F +M++ Y
Sbjct: 441 MIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDY 500

Query: 393 GITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRTLLSACSTHGNVEMAKRV 452
            +TP ++HYGCM+DLLG +G   EA   ++ +E +   ++W +LL AC  HGNVE+ +  
Sbjct: 501 KMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESF 560

Query: 453 IERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHE 512
            E + +++  + G YV+LSN+YA  GRW +V   R L+ D+G+ KVPGCSS+E+++VVHE
Sbjct: 561 AENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHE 620

Query: 513 FFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAM 572
           F  GD  H  + E+   L+E+   ++  G+VPDTS V   +MEEE KE  LR+HSEKLA+
Sbjct: 621 FIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRHHSEKLAI 680

Query: 573 AFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGD 605
           AFGL++T PGT + + KNLR+C +CH A KLIS I+ R+I+ RD  RFH F DG CSC D
Sbjct: 681 AFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCND 740

BLAST of CSPI05G23780 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 480.7 bits (1236), Expect = 1.7e-135
Identity = 232/559 (41.50%), Positives = 354/559 (63.33%), Query Frame = 0

Query: 46  KQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFNIMARGY 105
           K+I  Y +++   S +++ T L++   +      ++ A  LFD +L+++++ +N M   Y
Sbjct: 256 KEIHGYAMRSGFDSLVNISTALVD---MYAKCGSLETARQLFDGMLERNVVSWNSMIDAY 315

Query: 106 ARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKACASSKALREGMGLHCFAVKLGLNHNI 165
            ++ +P  A  +F ++L  G+ P D +    L ACA    L  G  +H  +V+LGL+ N+
Sbjct: 316 VQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNV 375

Query: 166 YICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQAS 225
            +  +LI+MY +C +++ A  +F +++   +VS+NA+I G+A++ +P +AL+ F ++++ 
Sbjct: 376 SVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSR 435

Query: 226 NIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLTDA 285
            ++P   T +SVI + A L      KWIH  V +   DK V V TAL+DM+AKCG++  A
Sbjct: 436 TVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIA 495

Query: 286 ISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKREGVRPDEITFLGLLYACSHA 345
             IF+ M  R    W+AMI  + THG G  A+ +FEEM++  ++P+ +TFL ++ ACSH+
Sbjct: 496 RLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHS 555

Query: 346 GLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLETKATPILWRT 405
           GLVE G   FY M + Y I   + HYG MVDLLGRAG L+EA++F+ ++  K    ++  
Sbjct: 556 GLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGA 615

Query: 406 LLSACSTHGNVEMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNHLRKLMKDRGV 465
           +L AC  H NV  A++  ER+FEL+   GG +V+L+N+Y     WE V  +R  M  +G+
Sbjct: 616 MLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGL 675

Query: 466 VKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPDTSLVYHADME 525
            K PGCS VE+ N VH FFSG   H  S ++   L++L+  IK  GYVPDT+LV    +E
Sbjct: 676 RKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV--LGVE 735

Query: 526 EEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLISFIFGRKIVIR 585
            + KE +L  HSEKLA++FGLLNT  GTTI V KNLR+C DCH+A K IS + GR+IV+R
Sbjct: 736 NDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVR 795

Query: 586 DVQRFHRFEDGKCSCGDFW 605
           D+QRFH F++G CSCGD+W
Sbjct: 796 DMQRFHHFKNGACSCGDYW 809

BLAST of CSPI05G23780 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 480.3 bits (1235), Expect = 2.2e-135
Identity = 236/569 (41.48%), Positives = 369/569 (64.85%), Query Frame = 0

Query: 40  TSLNELKQIQAYTIKTNLQ-SDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDK-DIIL 99
           +S+ +L+QI A++I+  +  SD  +   LI +    P+   M +AH +F +I    ++ +
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 100 FNIMARGYARSNSPYLAFSLFGELLCSGLL-PDDYTFSSLLKACASSKALREGMGLHCFA 159
           +N + RGYA   +   AFSL+ E+  SGL+ PD +T+  L+KA  +   +R G  +H   
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVV 147

Query: 160 VKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEAL 219
           ++ G    IY+  +L+++YA C D+ +A  VFD+M +  +V++N++I G+A + +P EAL
Sbjct: 148 IRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEAL 207

Query: 220 SLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMF 279
           +L+ E+ +  I+P   T++S++ +CA +GAL LGK +H Y+ K G  + +  +  L+D++
Sbjct: 208 ALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLY 267

Query: 280 AKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMK-REGVRPDEITF 339
           A+CG + +A ++F+ M  +++ +W+++IV  A +G G +AI +F+ M+  EG+ P EITF
Sbjct: 268 ARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITF 327

Query: 340 LGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLGRAGHLDEAYNFVDKLE 399
           +G+LYACSH G+V++G  YF  M + Y I P I+H+GCMVDLL RAG + +AY ++  + 
Sbjct: 328 VGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMP 387

Query: 400 TKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHGGDYVILSNLYARVGRWEDVNH 459
            +   ++WRTLL AC+ HG+ ++A+    +I +L+  H GDYV+LSN+YA   RW DV  
Sbjct: 388 MQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQK 447

Query: 460 LRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYVPD 519
           +RK M   GV KVPG S VEV N VHEF  GD  H  S  +   L E+   ++  GYVP 
Sbjct: 448 IRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQ 507

Query: 520 TSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHSAAKLIS 579
            S VY  D+EEE KE  + YHSEK+A+AF L++TP  + I V KNLR+C DCH A KL+S
Sbjct: 508 ISNVY-VDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVS 567

Query: 580 FIFGRKIVIRDVQRFHRFEDGKCSCGDFW 605
            ++ R+IV+RD  RFH F++G CSC D+W
Sbjct: 568 KVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
Q8LK93    3.3e-240    66.10    Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
Q9FJY7    5.0e-140    39.14    Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
Q9LN01    3.0e-137    39.53    Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q3E6Q1    2.4e-134    41.50    Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
A8MQA3    3.1e-134    41.48    Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...

Match Name    E-value    Identity (%)    Description
A0A0A0KU15    0.0e+00    99.67    DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G6051...
A0A5A7STH8    0.0e+00    96.69    Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BFK0    0.0e+00    96.69    pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucumis ...
A0A6J1DA68    0.0e+00    88.58    pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 ...
A0A6J1HTT7    0.0e+00    87.58    pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbit...

Match Name    E-value    Identity (%)    Description
XP_031742214.1    0.0e+00    99.67    pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucumis sa...
XP_008446357.1    0.0e+00    96.69    PREDICTED: pentatricopeptide repeat-containing protein At2g02980, chloroplastic ...
XP_038893049.1    0.0e+00    90.89    pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Benincasa ...
XP_022151060.1    0.0e+00    88.58    pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 ...
XP_023541252.1    0.0e+00    87.91    pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita ...

Match Name    E-value    Identity (%)    Description
AT2G02980.1    2.3e-241    66.10    Pentatricopeptide repeat (PPR) superfamily protein
AT5G66520.1    3.6e-141    39.14    Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1    2.2e-138    39.53    Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1    1.7e-135    41.50    Pentatricopeptide repeat (PPR) superfamily protein
AT4G21065.1    2.2e-135    41.48    Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment (hits listed on the indented lines)
IPR002885    Pentatricopeptide repeat    TIGRFAM    TIGR00756    TIGR00756
    coord: 299..332; e-value: 1.1E-6; score: 26.5
    coord: 97..129; e-value: 8.4E-4; score: 17.4
    coord: 197..230; e-value: 7.7E-8; score: 30.1
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 296..342; e-value: 9.0E-10; score: 38.6
    coord: 194..241; e-value: 2.6E-10; score: 40.3
    coord: 93..141; e-value: 1.5E-8; score: 34.7
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 437..465; e-value: 0.41; score: 11.0
    coord: 270..295; e-value: 0.015; score: 15.5
    coord: 402..426; e-value: 0.11; score: 12.7
    coord: 170..192; e-value: 0.049; score: 13.9
    coord: 371..395; e-value: 0.28; score: 11.5
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 195..229; score: 11.717688
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 94..128; score: 9.624079
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 296..330; score: 11.717688
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    1.25.40.10    Tetratricopeptide repeat domain
    coord: 152..246; e-value: 2.2E-21; score: 78.0
    coord: 14..151; e-value: 2.2E-15; score: 58.4
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    1.25.40.10    Tetratricopeptide repeat domain
    coord: 361..519; e-value: 3.0E-14; score: 55.2
    coord: 247..360; e-value: 5.7E-24; score: 87.1
IPR032867    DYW domain    PFAM    PF14432    DYW_deaminase
    coord: 469..594; e-value: 1.0E-37; score: 128.8
None    No IPR available    PANTHER    PTHR47926:SF132    BNAA02G26650D PROTEIN
    coord: 25..590
None    No IPR available    PANTHER    PTHR47926    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 25..590
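
Each `coord: start..end` entry above gives a domain span on the protein. A short sketch of turning such an entry into a span length follows. It assumes the coordinates are 1-based and inclusive (an assumption about this page, not something it states), and `span_length` is a hypothetical helper name.

```python
import re

# Matches entries such as "coord: 469..594".
_COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_length(text):
    """Length in residues of the first coord range in text.

    Assumes 1-based inclusive coordinates, hence the +1.
    Raises ValueError when no coord entry is present.
    """
    match = _COORD.search(text)
    if match is None:
        raise ValueError("no coord entry found")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

print(span_length("coord: 469..594"))  # 126
```

Under that assumption the DYW_deaminase hit (469..594) covers 126 residues.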

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
CSPI05G23780.1    CSPI05G23780.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category    Term Accession    Term Name
biological_process    GO:1900865    chloroplast RNA modification
biological_process    GO:0031425    chloroplast RNA processing
cellular_component    GO:0009507    chloroplast
molecular_function    GO:0005515    protein binding
molecular_function    GO:0008270    zinc ion binding