CSPI03G37740 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI03G37740
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr3: 32680641 .. 32683896 (+)
RNA-Seq ExpressionCSPI03G37740
SyntenyCSPI03G37740
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGTTCTTCTGACTTCCTCACTGCTAATATCATCTCACCATGTGGGCTCTTCGTAGAGCTTCTACTCCTCTTAGGTATGCTTCTCTTCCTTCCTATATTCGCCTCTTCCTCACACTACACCTCTCTCTTTCTCTACCTTACATACTTCATTCGATAACTAAACTTTGAATCTCCTTTTTGGCTGTTCACTTTTTAATTCATTTCATCTGTTTTGTTTTTCAATTACACTATGCATCCATAATCTCCCTTTCTCGCTTCTGCTCGACTTTCCGGTGCTTATTAAAGAAGAAAATCAAGTAAATAGTACTGCGTTATGCGTTCTGTTTTTGTGTCTACCTTTGCCTTCCATTTCTGTTTTCTTTAACAGGTTCTTGCTTGTTTTTTATTGATATTTTCTTTTGTCTCCTACGGATAACCTGGTCCGAGTGATGAATTTAGAATCAGTGCAAAATGTCGACAATGGAATTGGGATGTAAGAACTTGTAATATTGTGAAGAAAATAAGAAATCCTTGTAATTTGGACATTCTTCTTCTTATTCTAATGTTCTAAACATTAAATTTTCAAATGGCAGGACTGTTTTTTGTATTTATTTACTTCTGTATACACATACTACATGCATATATCATGGTGGGTCAGGTATAATGCCTCCAACGTTTGAGACCCCTCTCTGTAAAATCTTTTTGTGTTCTCTTTTCAGGAATCAAGGGTATAGAGTAAGAACTTCATATGTCTTTGGCAAACTAGAGGTACCATATTTTTGGGAAGGAAATGTTGCTGGTTTTGGAACCGCCGCCGCTTTATCCGACAGATTCATTTATTTTGACAGAAATAACCTTACAACATGGCCGTCTTCTGAGGTTTATATTAGTAGTCATGGTCTATCTACACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGGTAATGTGGAAGATGGATGTTCCGAACTTGATGAAACACTTCCAAGCACTAGTCCACTGGAAGATAGTAAGACAGCTGATGATAATGAAGAGGAACTAACTTCTGGATCAGAAATTGATGATGACGATGACGTTGTAGATGATAGGACTGAACTGGATTTACCTGAGGGAGAAACTGGACTTGTTGAAAAGATATCTATAAAAAGGGCTCCTTCAGAACTTCTCAATGTTATTTGGAAGGCTCCAGGTTTAACTGTCTCTAGTGCACTTGACAAGTGGGTCAGTGAAGGAAAAGAACTAAGCCGGGACGATATCTCTTCAGCCATGCTCAATCTTCGCAAATGTCGGATGTATGGGAAGGCTTTGCAGGTAAATTGACTTTGATGTTGCTTTTCAAGCAATGAAATTCATTCATGTGGCATGATGTTTGCAATTAATCTCGTTTCACTTGGGATATGTGGGACTCTATGTATGCTCACTTGATTAGTGGACACAATTTATTTCAGGATAGAAATGTTGATCAAATTTTGTAAAGAACAACAAACAATTGTAGTATTATGGTCTAACAAGCATAAAATGGTTAAGCAATTTTTAACATTGAAGATTATTGTACACTGAAGATGCATCCGTTGGGCTTGTAATTAGTAAAACCGATTCTGTAGATCAACTTGTAAGAAGTACCATTATCAAATGCTTGTATAAAAAACTGTTGTTCTGGCCTATAGCATGCAAATGGTGCTGGAGCCTTTTACAGAAATTTTAGTCGGTTTGATGTTTAACTGCTGCACCAGTATATTTGGATCAAAATTATATATTGTTGGCCAAAAGTCATTATGGTATAAAAATGGTTATGCAATTTTTGACACTGAAGATTTTTCTGGCCGTTGTATATGGTAAATATTTTCTCCTATTTTTAGACGTTGGTTCATCATTGCTCGAATTCATTTCTGGCATTTCTGTATGGCTTTAACAGTTTTCAGAGTGGTTGGAAACAAGTGGGAAACTCGATTTTATTGAAAATGATTATGCTTCTCGGCTCTACTTGATTGGAAAATTAAGAGGTCTCCGTATGGCAGAGAATTACATTGCTAAAATTCCAAAGTCCTTCCAAGGTGAGGTGGTATACCAAACTCTTTTGGTTAACTGTGTGATTGCGAGCAATGTACACAAAGCGGAGAAAGTATTCAACAAAATGAAGAACCTTGAGTTCCCCATCACAGCATTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGGACTGACAAGAGGAAAATAGCCGACGTTTTGTTGTTGATGAAGAAGGAAAATGTCAAGTATTCTACGTCAACTTACAGAATCTTAATAGATGTTAACGGCCTTTCTAATGACATAACTGGGATGGAAGAAGTTGTTGATTCAATGAAGGCTGAAGGAATTAAGCTGGATGTTGAGACACTTTCCCGATTAGTTAAACACTATGTTTCAGGTGGGCTTAAAGACAAAGCCAAGGCCGTTTTGAAGGAAATGGAAGAAATTAACTCCGAAGGTTCTCGAAGGCCATGCAGGATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGGAGATCTGTGAGTCTAATCCTCATATTGAAGAATGTATGGCTGCCCTTGTTGCTTGGGGAAAGCTGAAGAACGTCCAGGAAGCAGAGAAAATTTTTGATAGAGTTTTAAAAACAGGGAAGAAGCTATCTGCAAGACACTATTCTACCATGATGAACGTTTATAGAAAGGACAGTAAGATGCTGACGAAGGGTAAGGAACTAGTCAATCAGATGGCAGAGAGCGGTTGCCGCATGGATCCGTTTACATTGGATGCAGTTGTGAAGCTCTATGTGGAAGCAGGGGAGGTAGAAAAGGCAGACTCTTTCTTGGTTAAGGCTGTTCTACAAAACAAGAAGAAACCAATGTTTACCACATACATAACTCTCATGGATCGCTATGCAAGTAGGGGCGATGTTCCCAATGTTGAAAAAAATTTTGCTATGATGAGAAGATTGGGTTATGTTGGTCGATTAAGCCAATTTCAAACTCTAATACAGGCATACGTTAATGCCAAGGCTCCAGCCTATGGTATGAGAGAGAGAATGAAGGCAGATAATGTATTTCCTAACAAAGATTTGGCAGGAAAATTAGCTCAAGTTGATTGTTTGAAGATGAGAAAAGTGTCCGATTTACTTGATTGAAAAATATAACCCGTTACTCTTGGTATGTATAAGCTACCCAGATTTCAAATTTATTGTTAGAATCATGTATTTGTTCATTTGTCTGGTGGATGCGATTTGTTAGATGAAAGTATA

mRNA sequence

GAGTTCTTCTGACTTCCTCACTGCTAATATCATCTCACCATGTGGGCTCTTCGTAGAGCTTCTACTCCTCTTAGGAATCAAGGGTATAGAGTAAGAACTTCATATGTCTTTGGCAAACTAGAGGTACCATATTTTTGGGAAGGAAATGTTGCTGGTTTTGGAACCGCCGCCGCTTTATCCGACAGATTCATTTATTTTGACAGAAATAACCTTACAACATGGCCGTCTTCTGAGGTTTATATTAGTAGTCATGGTCTATCTACACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGGTAATGTGGAAGATGGATGTTCCGAACTTGATGAAACACTTCCAAGCACTAGTCCACTGGAAGATAGTAAGACAGCTGATGATAATGAAGAGGAACTAACTTCTGGATCAGAAATTGATGATGACGATGACGTTGTAGATGATAGGACTGAACTGGATTTACCTGAGGGAGAAACTGGACTTGTTGAAAAGATATCTATAAAAAGGGCTCCTTCAGAACTTCTCAATGTTATTTGGAAGGCTCCAGGTTTAACTGTCTCTAGTGCACTTGACAAGTGGGTCAGTGAAGGAAAAGAACTAAGCCGGGACGATATCTCTTCAGCCATGCTCAATCTTCGCAAATGTCGGATGTATGGGAAGGCTTTGCAGTTTTCAGAGTGGTTGGAAACAAGTGGGAAACTCGATTTTATTGAAAATGATTATGCTTCTCGGCTCTACTTGATTGGAAAATTAAGAGGTCTCCGTATGGCAGAGAATTACATTGCTAAAATTCCAAAGTCCTTCCAAGGTGAGGTGGTATACCAAACTCTTTTGGTTAACTGTGTGATTGCGAGCAATGTACACAAAGCGGAGAAAGTATTCAACAAAATGAAGAACCTTGAGTTCCCCATCACAGCATTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGGACTGACAAGAGGAAAATAGCCGACGTTTTGTTGTTGATGAAGAAGGAAAATGTCAAGTATTCTACGTCAACTTACAGAATCTTAATAGATGTTAACGGCCTTTCTAATGACATAACTGGGATGGAAGAAGTTGTTGATTCAATGAAGGCTGAAGGAATTAAGCTGGATGTTGAGACACTTTCCCGATTAGTTAAACACTATGTTTCAGGTGGGCTTAAAGACAAAGCCAAGGCCGTTTTGAAGGAAATGGAAGAAATTAACTCCGAAGGTTCTCGAAGGCCATGCAGGATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGGAGATCTGTGAGTCTAATCCTCATATTGAAGAATGTATGGCTGCCCTTGTTGCTTGGGGAAAGCTGAAGAACGTCCAGGAAGCAGAGAAAATTTTTGATAGAGTTTTAAAAACAGGGAAGAAGCTATCTGCAAGACACTATTCTACCATGATGAACGTTTATAGAAAGGACAGTAAGATGCTGACGAAGGGTAAGGAACTAGTCAATCAGATGGCAGAGAGCGGTTGCCGCATGGATCCGTTTACATTGGATGCAGTTGTGAAGCTCTATGTGGAAGCAGGGGAGGTAGAAAAGGCAGACTCTTTCTTGGTTAAGGCTGTTCTACAAAACAAGAAGAAACCAATGTTTACCACATACATAACTCTCATGGATCGCTATGCAAGTAGGGGCGATGTTCCCAATGTTGAAAAAAATTTTGCTATGATGAGAAGATTGGGTTATGTTGGTCGATTAAGCCAATTTCAAACTCTAATACAGGCATACGTTAATGCCAAGGCTCCAGCCTATGGTATGAGAGAGAGAATGAAGGCAGATAATGTATTTCCTAACAAAGATTTGGCAGGAAAATTAGCTCAAGTTGATTGTTTGAAGATGAGAAAAGTGTCCGATTTACTTGATTGAAAAATATAACCCGTTACTCTTGGTATGTATAAGCTACCCAGATTTCAAATTTATTGTTAGAATCATGTATTTGTTCATTTGTCTGGTGGATGCGATTTGTTAGATGAAAGTATA

Coding sequence (CDS)

ATGTGGGCTCTTCGTAGAGCTTCTACTCCTCTTAGGAATCAAGGGTATAGAGTAAGAACTTCATATGTCTTTGGCAAACTAGAGGTACCATATTTTTGGGAAGGAAATGTTGCTGGTTTTGGAACCGCCGCCGCTTTATCCGACAGATTCATTTATTTTGACAGAAATAACCTTACAACATGGCCGTCTTCTGAGGTTTATATTAGTAGTCATGGTCTATCTACACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGGTAATGTGGAAGATGGATGTTCCGAACTTGATGAAACACTTCCAAGCACTAGTCCACTGGAAGATAGTAAGACAGCTGATGATAATGAAGAGGAACTAACTTCTGGATCAGAAATTGATGATGACGATGACGTTGTAGATGATAGGACTGAACTGGATTTACCTGAGGGAGAAACTGGACTTGTTGAAAAGATATCTATAAAAAGGGCTCCTTCAGAACTTCTCAATGTTATTTGGAAGGCTCCAGGTTTAACTGTCTCTAGTGCACTTGACAAGTGGGTCAGTGAAGGAAAAGAACTAAGCCGGGACGATATCTCTTCAGCCATGCTCAATCTTCGCAAATGTCGGATGTATGGGAAGGCTTTGCAGTTTTCAGAGTGGTTGGAAACAAGTGGGAAACTCGATTTTATTGAAAATGATTATGCTTCTCGGCTCTACTTGATTGGAAAATTAAGAGGTCTCCGTATGGCAGAGAATTACATTGCTAAAATTCCAAAGTCCTTCCAAGGTGAGGTGGTATACCAAACTCTTTTGGTTAACTGTGTGATTGCGAGCAATGTACACAAAGCGGAGAAAGTATTCAACAAAATGAAGAACCTTGAGTTCCCCATCACAGCATTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGGACTGACAAGAGGAAAATAGCCGACGTTTTGTTGTTGATGAAGAAGGAAAATGTCAAGTATTCTACGTCAACTTACAGAATCTTAATAGATGTTAACGGCCTTTCTAATGACATAACTGGGATGGAAGAAGTTGTTGATTCAATGAAGGCTGAAGGAATTAAGCTGGATGTTGAGACACTTTCCCGATTAGTTAAACACTATGTTTCAGGTGGGCTTAAAGACAAAGCCAAGGCCGTTTTGAAGGAAATGGAAGAAATTAACTCCGAAGGTTCTCGAAGGCCATGCAGGATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGGAGATCTGTGAGTCTAATCCTCATATTGAAGAATGTATGGCTGCCCTTGTTGCTTGGGGAAAGCTGAAGAACGTCCAGGAAGCAGAGAAAATTTTTGATAGAGTTTTAAAAACAGGGAAGAAGCTATCTGCAAGACACTATTCTACCATGATGAACGTTTATAGAAAGGACAGTAAGATGCTGACGAAGGGTAAGGAACTAGTCAATCAGATGGCAGAGAGCGGTTGCCGCATGGATCCGTTTACATTGGATGCAGTTGTGAAGCTCTATGTGGAAGCAGGGGAGGTAGAAAAGGCAGACTCTTTCTTGGTTAAGGCTGTTCTACAAAACAAGAAGAAACCAATGTTTACCACATACATAACTCTCATGGATCGCTATGCAAGTAGGGGCGATGTTCCCAATGTTGAAAAAAATTTTGCTATGATGAGAAGATTGGGTTATGTTGGTCGATTAAGCCAATTTCAAACTCTAATACAGGCATACGTTAATGCCAAGGCTCCAGCCTATGGTATGAGAGAGAGAATGAAGGCAGATAATGTATTTCCTAACAAAGATTTGGCAGGAAAATTAGCTCAAGTTGATTGTTTGAAGATGAGAAAAGTGTCCGATTTACTTGATTGA

Protein sequence

MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTTWPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGKLAQVDCLKMRKVSDLLD*
Homology
BLAST of CSPI03G37740 vs. ExPASy Swiss-Prot
Match: Q9C977 (Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g80270 PE=2 SV=1)

HSP 1 Score: 557.8 bits (1436), Expect = 1.6e-157
Identity = 293/557 (52.60%), Postives = 401/557 (71.99%), Query Frame = 0

Query: 68  ISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDD 127
           +S+  LS+ AG ++  EE ++EDG SEL+     +   + S ++D++E +L++  E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 128 DDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSR 187
                +  ELDL E +   V + ++++  SEL   I  APGL++ SALDKWV EG E++R
Sbjct: 117 -----EEEELDLIETD---VSRKTVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 188 DDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIA 247
            +I+ AML LR+ RMYG+ALQ SEWLE + K++  E DYASRL L  K+RGL   E  + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 248 KIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKR 307
           KIPKSF+GEV+Y+TLL NCV A NV K+E VFNKMK+L FP++ F C+Q+LLL+KR D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 308 KIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLV 367
           KIADVLLLM+KEN+K S  TY+ILIDV G +NDI+GME+++++MK EG++LD +T +   
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 368 KHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHI 427
           +HY   GLKDKA+ VLKEME  + E +RR  + LL +Y  L  EDEV+R+W+ICES P+ 
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 428 EECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVN 487
           EE +AA+ A+GKL  VQEAE IF++++K  ++ S+  YS ++ VY  D KML+KGK+LV 
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVY-VDHKMLSKGKDLVK 476

Query: 488 QMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASR 547
           +MAESGCR++  T DA++KLYVEAGEVEKADS L KA  Q+  K M  +++ +MD Y+ R
Sbjct: 477 RMAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKR 536

Query: 548 GDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGK 607
           GDV N EK F  MR  GY  RL QFQ L+QAY+NAK+PAYGMR+R+KADN+FPNK +A +
Sbjct: 537 GDVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQ 596

Query: 608 LAQVDCLKMRKVSDLLD 625
           LAQ D  K   +SD+LD
Sbjct: 597 LAQGDPFKKTAISDILD 596

BLAST of CSPI03G37740 vs. ExPASy Swiss-Prot
Match: Q9XI21 (Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g15480 PE=2 SV=2)

HSP 1 Score: 538.5 bits (1386), Expect = 1.0e-151
Identity = 298/614 (48.53%), Postives = 413/614 (67.26%), Query Frame = 0

Query: 12  RNQGYRV-RTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTTWPSSEVYISS 71
           R+Q  R+   + V+ KL++P   E N+A   + A + D+     R    +W SS      
Sbjct: 10  RSQSLRLGACNAVYSKLDIP-LGERNIA-IESNALIHDKHEALPRFYELSWSSS---TGR 69

Query: 72  HGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDDDDV 131
             LS+ AGA+ +G++ ++E      D+ +   +P E S  ++D EE   SG    D+ D+
Sbjct: 70  RSLSSDAGAKTTGDDDDLE------DKNVDLATPDETSSDSEDGEE--FSG----DEGDI 129

Query: 132 VDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSRDDI 191
                EL +PE            + PSE+   I    GL+V SALDKWV +GK+ +R + 
Sbjct: 130 EGAELELHVPE-----------SKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEF 189

Query: 192 SSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIAKIP 251
            SAML LRK RM+G+ALQ +EWL+ + + +  E DYA RL LI K+RG    E YI  IP
Sbjct: 190 ESAMLQLRKRRMFGRALQMTEWLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIP 249

Query: 252 KSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKRKIA 311
           +SF+GE+VY+TLL N V  SNV  AE VFNKMK+L FP++ F CNQ+L+LYKR DK+KIA
Sbjct: 250 ESFRGELVYRTLLANHVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIA 309

Query: 312 DVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLVKHY 371
           DVLLL++KEN+K + +TY+ILID  G SNDITGME++V++MK+EG++LD+   + + +HY
Sbjct: 310 DVLLLLEKENLKPNLNTYKILIDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHY 369

Query: 372 VSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHIEEC 431
            S GLK+KA+ VLKEME  + E +R  C+ LL +YG LQ EDEVRR+W+ICE NP   E 
Sbjct: 370 ASAGLKEKAEKVLKEMEGESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRYNEV 429

Query: 432 MAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVNQMA 491
           +AA++A+GK+  V++AE +F++VLK   ++S+  YS ++ VY  D KM+++GK+LV QM+
Sbjct: 430 LAAILAFGKIDKVKDAEAVFEKVLKMSHRVSSNVYSVLLRVY-VDHKMVSEGKDLVKQMS 489

Query: 492 ESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASRGDV 551
           +SGC +   T DAV+KLYVEAGEVEKA+S L KA+   + KP+ ++++ LM  Y  RGDV
Sbjct: 490 DSGCNIGALTWDAVIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDV 549

Query: 552 PNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGKLAQ 611
            N EK F  M++ GY  R   +QTLIQAYVNAKAPAYGM+ERMKADN+FPNK LA +LA+
Sbjct: 550 HNTEKIFQRMKQAGYQSRFWAYQTLIQAYVNAKAPAYGMKERMKADNIFPNKRLAAQLAK 594

Query: 612 VDCLKMRKVSDLLD 625
            D  K   +SDLLD
Sbjct: 610 ADPFKKTPLSDLLD 594

BLAST of CSPI03G37740 vs. ExPASy Swiss-Prot
Match: Q9LRP6 (Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g15590 PE=1 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 2.4e-129
Identity = 255/558 (45.70%), Postives = 365/558 (65.41%), Query Frame = 0

Query: 67  YISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDD 126
           +   H LS+ A A++ G+E   E+  SE +E +P +  + +    DD+  E   GS+ DD
Sbjct: 65  FFGIHKLSSIADAKDKGDEVVREEELSESEEAVPVSGDVPEGVVDDDSLFEPELGSDNDD 124

Query: 127 DDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELS 186
                     L++ E  +    K + KR  SEL   I      +V   L+KWV EGK+LS
Sbjct: 125 ----------LEIEEKHSKDGGKPTKKRGQSELYESI--VAYKSVKHVLEKWVKEGKDLS 184

Query: 187 RDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYI 246
           + +++ A+ NLRK + Y   LQ  EWL  + + +F E +YAS+L L+ K+  L+ AE ++
Sbjct: 185 QAEVTLAIHNLRKRKSYAMCLQLWEWLGANTQFEFTEANYASQLDLVAKVHSLQKAEIFL 244

Query: 247 AKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDK 306
             IP+S +GEVVY+TLL NCV+  +V+KAE +FNKMK L+FP + FACNQLLLLY   D+
Sbjct: 245 KDIPESSRGEVVYRTLLANCVLKHHVNKAEDIFNKMKELKFPTSVFACNQLLLLYSMHDR 304

Query: 307 RKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRL 366
           +KI+DVLLLM++EN+K S +TY  LI+  GL+ DITGME++V+++K EGI+LD E  S L
Sbjct: 305 KKISDVLLLMERENIKPSRATYHFLINSKGLAGDITGMEKIVETIKEEGIELDPELQSIL 364

Query: 367 VKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPH 426
            K+Y+  GLK++A+ ++KE+E    + +   CR LLPLY ++   D VRRL    + NP 
Sbjct: 365 AKYYIRAGLKERAQDLMKEIEGKGLQQTPWVCRSLLPLYADIGDSDNVRRLSRFVDQNPR 424

Query: 427 IEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELV 486
            + C++A+ AWGKLK V+EAE +F+R+++  K      Y  +M +Y  ++KML KG++LV
Sbjct: 425 YDNCISAIKAWGKLKEVEEAEAVFERLVEKYKIFPMMPYFALMEIY-TENKMLAKGRDLV 484

Query: 487 NQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYAS 546
            +M  +G  + P T  A+VKLY++AGEV KA+  L +A   NK +PMFTTY+ +++ YA 
Sbjct: 485 KRMGNAGIAIGPSTWHALVKLYIKAGEVGKAELILNRATKDNKMRPMFTTYMAILEEYAK 544

Query: 547 RGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAG 606
           RGDV N EK F  M+R  Y  +L Q++T++ AY+NAK PAYGM ERMKADNVFPNK LA 
Sbjct: 545 RGDVHNTEKVFMKMKRASYAAQLMQYETVLLAYINAKTPAYGMIERMKADNVFPNKSLAA 604

Query: 607 KLAQVDCLKMRKVSDLLD 625
           KLAQV+  K   VS LLD
Sbjct: 605 KLAQVNPFKKCPVSVLLD 609

BLAST of CSPI03G37740 vs. ExPASy Swiss-Prot
Match: Q940Q2 (Pentatricopeptide repeat-containing protein At1g07590, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g07590 PE=2 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 2.0e-35
Identity = 120/471 (25.48%), Postives = 217/471 (46.07%), Query Frame = 0

Query: 156 PSELLNV-IWKAP-GLTVSSALDKWVSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWL 215
           P++ L++ I K P G+TV SAL  W+ +G  +   D+  A+  LRK     +AL+  EW+
Sbjct: 64  PNKCLSLRIEKLPKGVTVGSALQSWMGDGFPVHGGDVYHAINRLRKLGRNKRALELMEWI 123

Query: 216 ETSGKLDFIENDYASRLYLIGKLRGLRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVH 275
                    E +Y+  L    KL G+   E    ++P+ FQ E++Y  L++ C+    + 
Sbjct: 124 IRERPYRLGELEYSYLLEFTVKLHGVSQGEKLFTRVPQEFQNELLYNNLVIACLDQGVIR 183

Query: 276 KAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKRK-IADVLLLMKKENVKYSTSTYRILI 335
            A +   KM+ L +  +    N+L++      +RK IA  L LMK +      STY IL+
Sbjct: 184 LALEYMKKMRELGYRTSHLVYNRLIIRNSAPGRRKLIAKDLALMKADKATPHVSTYHILM 243

Query: 336 DVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSE 395
            +    ++I G+ +  D MK  G++ +  +   L   +    L   A+A  +E+E+  + 
Sbjct: 244 KLEANEHNIDGVLKAFDGMKKAGVEPNEVSYCILAMAHAVARLYTVAEAYTEEIEKSITG 303

Query: 396 GSRRPCRILLPLYGELQMEDEVRRLWEICESNPHI--EECMAALVAWGKLKNVQEAEKIF 455
            +     IL+ LYG L  E E+ R W +     H+  +  + A  A+ ++ N+  AE+++
Sbjct: 304 DNWSTLDILMILYGRLGKEKELARTWNVIRGFHHVRSKSYLLATEAFARVGNLDRAEELW 363

Query: 456 DRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVNQMAESGCRMDPFTLD------AV 515
             +           ++++++VY KD  ++ K   +  +M  +G + +  T        A 
Sbjct: 364 LEMKNVKGLKETEQFNSLLSVYCKDG-LIEKAIGVFREMTGNGFKPNSITYRHLALGCAK 423

Query: 516 VKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASRGDVPNVEKNFAMMRRLG 575
            KL  EA +  +    L  +       P   T +++++ +A +GDV N EK F  ++   
Sbjct: 424 AKLMKEALKNIEMGLNLKTSKSIGSSTPWLETTLSIIECFAEKGDVENSEKLFEEVKNAK 483

Query: 576 YVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGKLAQVDCLK 616
           Y      +  L +AYV AK     + +RM      P+ +    L  V+  K
Sbjct: 484 YNRYAFVYNALFKAYVKAKVYDPNLFKRMVLGGARPDAESYSLLKLVEQYK 533

BLAST of CSPI03G37740 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 7.7e-35
Identity = 130/473 (27.48%), Postives = 221/473 (46.72%), Query Frame = 0

Query: 147 VEKISIKRAPSE-LLNVIWKAPG--LTVSSALDKWVSEGKELSRDDISSAMLNLRKCRMY 206
           V K S K+   E L N ++K  G  + V   L++++   K + + ++   +  LR   +Y
Sbjct: 12  VTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLY 71

Query: 207 GKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIAKIPKSFQGEVVYQTLL 266
             AL+ SE +E  G ++   +D A  L L+ K R +   ENY   +P++ + E+ Y +LL
Sbjct: 72  YPALKLSEVMEERG-MNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLL 131

Query: 267 VNCVIASNV-HKAEKVFNKMKNLEFPITAFACNQLLLLYKRT-DKRKIADVLLLMKKENV 326
            NC     +  KAE + NKMK L    ++ + N L+ LY +T +  K+  ++  +K ENV
Sbjct: 132 -NCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENV 191

Query: 327 KYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEG-IKLDVETLSRLVKHYVSGGLKDKAK 386
              + TY + +     +NDI+G+E V++ M  +G +  D  T S +   YV  GL  KA+
Sbjct: 192 MPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAE 251

Query: 387 AVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICE------SNPHIEECMAAL 446
             L+E+E  N++      + L+ LYG L    EV R+W          SN      +  L
Sbjct: 252 KALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVL 311

Query: 447 VAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVNQMAESGC 506
           V   KL ++  AE +F            R  + ++  Y ++  ++ K  EL  +    G 
Sbjct: 312 V---KLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEG-LIQKANELKEKAPRRGG 371

Query: 507 RMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKK-----PMFTTYITLMDRYASRGD 566
           +++  T +  +  YV++G++ +A   + KAV   K       P   T   LM  +  + D
Sbjct: 372 KLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKD 431

Query: 567 VPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNK 603
           V   E    +++          F+ LI+ Y  A      MR R+K +NV  N+
Sbjct: 432 VNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNE 478

BLAST of CSPI03G37740 vs. ExPASy TrEMBL
Match: A0A0A0LHD8 (PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G819930 PE=3 SV=1)

HSP 1 Score: 1206.4 bits (3120), Expect = 0.0e+00
Identity = 616/626 (98.40%), Postives = 622/626 (99.36%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 GSEIDDDDDVVDDRT--ELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW 180
           GSEIDDDDDVVDD T  ELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW
Sbjct: 121 GSEIDDDDDVVDDGTQNELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW 180

Query: 181 VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG 240
           VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG
Sbjct: 181 VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG 240

Query: 241 LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL 300
           LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL
Sbjct: 241 LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL 300

Query: 301 LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL 360
           LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL
Sbjct: 301 LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL 360

Query: 361 DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW 420
           DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW
Sbjct: 361 DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW 420

Query: 421 EICESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKM 480
           EICESNPHIEECMAA+VAWGKLKNVQEAEKIFDRV+KTG+KLSARHYSTM+NVYR+DSKM
Sbjct: 421 EICESNPHIEECMAAIVAWGKLKNVQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKM 480

Query: 481 LTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI 540
           LTKGKE+V QMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI
Sbjct: 481 LTKGKEVVKQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI 540

Query: 541 TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV 600
           TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV
Sbjct: 541 TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV 600

Query: 601 FPNKDLAGKLAQVDCLKMRKVSDLLD 625
           FPNKDLAGKLAQVDCLKMRKVSDLLD
Sbjct: 601 FPNKDLAGKLAQVDCLKMRKVSDLLD 626

BLAST of CSPI03G37740 vs. ExPASy TrEMBL
Match: A0A0A0LEU5 (PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G819910 PE=3 SV=1)

HSP 1 Score: 1120.5 bits (2897), Expect = 0.0e+00
Identity = 570/624 (91.35%), Postives = 594/624 (95.19%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 GSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180
           GSEIDDD+DVVDD TELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS
Sbjct: 121 GSEIDDDNDVVDDGTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180

Query: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLR 240
           EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLE +GKLDF+E DYASRL LIGKLRGLR
Sbjct: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLEANGKLDFVEKDYASRLDLIGKLRGLR 240

Query: 241 MAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLL 300
           MAENYIAKIPKSFQGEVVY+TLL NCVIA NV KAE+VFNKMK+LEFPITAFACNQLLLL
Sbjct: 241 MAENYIAKIPKSFQGEVVYRTLLANCVIACNVQKAEEVFNKMKDLEFPITAFACNQLLLL 300

Query: 301 YKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDV 360
           YKRTDKRK+AD+LLLM+KENVK S  TYRILID  GLSNDITGME+VVD+MKAEGI+LDV
Sbjct: 301 YKRTDKRKVADILLLMEKENVKPSRFTYRILIDTKGLSNDITGMEQVVDTMKAEGIELDV 360

Query: 361 ETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEI 420
            TLS L KHY+SGGLKDKAKA+LKEMEEINSEGSR PCRILLPLYGELQMEDEVRRLWEI
Sbjct: 361 STLSVLAKHYISGGLKDKAKAILKEMEEINSEGSRWPCRILLPLYGELQMEDEVRRLWEI 420

Query: 421 CESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLT 480
           C SNPHIEECMAA+VAWGKLKN+QEAEKIFDRV+KTG+KLSARHYSTM+NVYR+DSKMLT
Sbjct: 421 CGSNPHIEECMAAIVAWGKLKNIQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKMLT 480

Query: 481 KGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITL 540
           KGKE+V QMAESG RMDP TLDAVVKLYVEAGE EKADSFLVK VLQ KKKPMFTTYITL
Sbjct: 481 KGKEVVKQMAESGSRMDPVTLDAVVKLYVEAGEGEKADSFLVKTVLQYKKKPMFTTYITL 540

Query: 541 MDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFP 600
           MDRYASRGDVPN EK F MMR+ GYVGRLS FQTLIQAYVNAKAPAYGMRERMKAD+VFP
Sbjct: 541 MDRYASRGDVPNAEKIFGMMRKYGYVGRLSHFQTLIQAYVNAKAPAYGMRERMKADSVFP 600

Query: 601 NKDLAGKLAQVDCLKMRKVSDLLD 625
           NK LAGKLAQVD LKMR+VSDLLD
Sbjct: 601 NKALAGKLAQVDSLKMREVSDLLD 624

BLAST of CSPI03G37740 vs. ExPASy TrEMBL
Match: A0A5A7UK78 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G006660 PE=3 SV=1)

HSP 1 Score: 1012.7 bits (2617), Expect = 6.7e-292
Identity = 521/627 (83.09%), Postives = 566/627 (90.27%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGY+VRTSYVFGKLEVP+FWEGNVAGFGT  ALSDRFI F+RNNL T
Sbjct: 1   MWALRRASTPLRNQGYKVRTSYVFGKLEVPFFWEGNVAGFGTTTALSDRFISFERNNLAT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPS+ VYISSHGLSTQAGAENSGEE NV+DG SELDETL STSPLEDSK ADDNEEELTS
Sbjct: 61  WPSAGVYISSHGLSTQAGAENSGEEDNVKDGFSELDETLASTSPLEDSKAADDNEEELTS 120

Query: 121 GSEIDDDDD-VVDDRT--ELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDK 180
           GSEIDDDDD  VDD T  ELDL EGETGL EK S KR PSEL NVIWKAPGL+V++ALDK
Sbjct: 121 GSEIDDDDDNAVDDGTQNELDLLEGETGLAEKKSTKRGPSELFNVIWKAPGLSVANALDK 180

Query: 181 WVSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLR 240
           WVSEGKELSR DIS AML LRK +M+GKALQFSEWLE SGKL+F + DYASRL LIGKLR
Sbjct: 181 WVSEGKELSRADISLAMLYLRKRQMFGKALQFSEWLEASGKLNFTDKDYASRLDLIGKLR 240

Query: 241 GLRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQL 300
           GLRMAENY+AKIPKSFQGEVVY+TLL NCVIASNV KAE+VFNKMK+LEFPITAFAC+QL
Sbjct: 241 GLRMAENYLAKIPKSFQGEVVYRTLLANCVIASNVQKAEEVFNKMKDLEFPITAFACSQL 300

Query: 301 LLLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIK 360
           LLLY+RTDKRKIAD+LLLM+KENVK S  TY+ILID  GLSNDI+GME+VVD+MKA+GI+
Sbjct: 301 LLLYRRTDKRKIADILLLMEKENVKPSRFTYKILIDAKGLSNDISGMEQVVDTMKADGIE 360

Query: 361 LDVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRL 420
           LD +TL+ L KHYVSGGLKDKAKA LK+MEEINS+GSR PCRILLP YGEL+MEDEVRRL
Sbjct: 361 LDFDTLALLAKHYVSGGLKDKAKATLKQMEEINSQGSRWPCRILLPRYGELEMEDEVRRL 420

Query: 421 WEICESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSK 480
           WEICES+PHIEECMAA+VAWGKLKNVQEAEKIFDRV+K+GKKLSARHYSTMMNVYR+ +K
Sbjct: 421 WEICESDPHIEECMAAIVAWGKLKNVQEAEKIFDRVVKSGKKLSARHYSTMMNVYRQ-TK 480

Query: 481 MLTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTY 540
           MLTKGKELVNQMAESGCRMDP T DAVVK YVEAGEVEKADSFLVKAV QNKKKP+F TY
Sbjct: 481 MLTKGKELVNQMAESGCRMDPLTWDAVVKFYVEAGEVEKADSFLVKAVQQNKKKPLFATY 540

Query: 541 ITLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADN 600
           +TLM  YASRGDVPN E  F  MRRLGY+GR +QFQTL+QAYVNAKAPAYGMRERM  DN
Sbjct: 541 MTLMHHYASRGDVPNAENIFDRMRRLGYMGRFTQFQTLVQAYVNAKAPAYGMRERMMVDN 600

Query: 601 VFPNKDLAGKLAQVDCLKMRKVSDLLD 625
           +FPNK LAGKLAQVD  +M +VSDLLD
Sbjct: 601 IFPNKALAGKLAQVDPFRMTEVSDLLD 626

BLAST of CSPI03G37740 vs. ExPASy TrEMBL
Match: A0A1S3B8B0 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486882 PE=3 SV=1)

HSP 1 Score: 966.8 bits (2498), Expect = 4.2e-278
Identity = 503/624 (80.61%), Postives = 539/624 (86.38%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRAS PLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGT AALSDRFI F+RNNL T
Sbjct: 1   MWALRRASAPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTTAALSDRFISFERNNLET 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSS VYISSHGLSTQAGAENSGEEGNVE                       DNEEELTS
Sbjct: 61  WPSSGVYISSHGLSTQAGAENSGEEGNVE-----------------------DNEEELTS 120

Query: 121 GSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180
           GSEIDDDD+    + ELDLPEGETGL EKIS K APSEL N+IWKAPGL+V SALDKWVS
Sbjct: 121 GSEIDDDDET---QNELDLPEGETGLAEKISTKGAPSELFNIIWKAPGLSVPSALDKWVS 180

Query: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLR 240
           EGKELSR DIS  ML LR+ RM+GKAL+FSEWLE +GKL   + DYAS+L LIGKLRGLR
Sbjct: 181 EGKELSRADISLTMLYLRRRRMFGKALKFSEWLEANGKL-VTDRDYASQLDLIGKLRGLR 240

Query: 241 MAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLL 300
           MAENYI+KIPKSFQGEVVY+TLL NCV+++NV KAE+VFNKMK+LEFPITAFACNQLLLL
Sbjct: 241 MAENYISKIPKSFQGEVVYRTLLANCVMSTNVRKAEEVFNKMKDLEFPITAFACNQLLLL 300

Query: 301 YKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDV 360
           YKRTDK+KIADVLLLM+KENVK S  TY+ILID  GLSNDI+GME+VVD+MKAEGIKL V
Sbjct: 301 YKRTDKKKIADVLLLMEKENVKPSPFTYKILIDAKGLSNDISGMEQVVDTMKAEGIKLGV 360

Query: 361 ETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEI 420
            TL  L KHYVS GLKDKAKA LKE EEINS+GSRRPCR LLPLYGELQMEDEVRRLWEI
Sbjct: 361 GTLLLLAKHYVSAGLKDKAKATLKETEEINSKGSRRPCRFLLPLYGELQMEDEVRRLWEI 420

Query: 421 CESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLT 480
           CESNPH+EECMAA+VAWGKLKNVQEAEKIFDRV+KTGKKLS RHYSTMMNVYR DSKMLT
Sbjct: 421 CESNPHVEECMAAIVAWGKLKNVQEAEKIFDRVVKTGKKLSTRHYSTMMNVYR-DSKMLT 480

Query: 481 KGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITL 540
           KGKELVNQMAESGC MDPFT DAVVKLYVEAGEVEKADSFLVKAV Q+KKKP+F TYI L
Sbjct: 481 KGKELVNQMAESGCSMDPFTWDAVVKLYVEAGEVEKADSFLVKAVQQSKKKPLFATYIAL 540

Query: 541 MDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFP 600
           MD YASRGDVPN E+ F  +R LGYVGR +Q+QTLIQAYVNAK PAYGMRERMKADN+FP
Sbjct: 541 MDHYASRGDVPNAERIFDKLRILGYVGRFTQYQTLIQAYVNAKTPAYGMRERMKADNIFP 596

Query: 601 NKDLAGKLAQVDCLKMRKVSDLLD 625
           NK LAG+LAQVD  KM  VSDLLD
Sbjct: 601 NKALAGQLAQVDSFKMTDVSDLLD 596

BLAST of CSPI03G37740 vs. ExPASy TrEMBL
Match: A0A5A7UD76 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G006630 PE=3 SV=1)

HSP 1 Score: 966.8 bits (2498), Expect = 4.2e-278
Identity = 503/624 (80.61%), Postives = 539/624 (86.38%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRAS PLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGT AALSDRFI F+RNNL T
Sbjct: 1   MWALRRASAPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTTAALSDRFISFERNNLET 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSS VYISSHGLSTQAGAENSGEEGNVE                       DNEEELTS
Sbjct: 61  WPSSGVYISSHGLSTQAGAENSGEEGNVE-----------------------DNEEELTS 120

Query: 121 GSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180
           GSEIDDDD+    + ELDLPEGETGL EKIS K APSEL N+IWKAPGL+V SALDKWVS
Sbjct: 121 GSEIDDDDET---QNELDLPEGETGLAEKISTKGAPSELFNIIWKAPGLSVPSALDKWVS 180

Query: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLR 240
           EGKELSR DIS  ML LR+ RM+GKAL+FSEWLE +GKL   + DYAS+L LIGKLRGLR
Sbjct: 181 EGKELSRADISLTMLYLRRRRMFGKALKFSEWLEANGKL-VTDRDYASQLDLIGKLRGLR 240

Query: 241 MAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLL 300
           MAENYI+KIPKSFQGEVVY+TLL NCV+++NV KAE+VFNKMK+LEFPITAFACNQLLLL
Sbjct: 241 MAENYISKIPKSFQGEVVYRTLLANCVMSTNVRKAEEVFNKMKDLEFPITAFACNQLLLL 300

Query: 301 YKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDV 360
           YKRTDK+KIADVLLLM+KENVK S  TY+ILID  GLSNDI+GME+VVD+MKAEGIKL V
Sbjct: 301 YKRTDKKKIADVLLLMEKENVKPSPFTYKILIDAKGLSNDISGMEQVVDTMKAEGIKLGV 360

Query: 361 ETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEI 420
            TL  L KHYVS GLKDKAKA LKE EEINS+GSRRPCR LLPLYGELQMEDEVRRLWEI
Sbjct: 361 GTLLLLAKHYVSAGLKDKAKATLKETEEINSKGSRRPCRFLLPLYGELQMEDEVRRLWEI 420

Query: 421 CESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLT 480
           CESNPH+EECMAA+VAWGKLKNVQEAEKIFDRV+KTGKKLS RHYSTMMNVYR DSKMLT
Sbjct: 421 CESNPHVEECMAAIVAWGKLKNVQEAEKIFDRVVKTGKKLSTRHYSTMMNVYR-DSKMLT 480

Query: 481 KGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITL 540
           KGKELVNQMAESGC MDPFT DAVVKLYVEAGEVEKADSFLVKAV Q+KKKP+F TYI L
Sbjct: 481 KGKELVNQMAESGCSMDPFTWDAVVKLYVEAGEVEKADSFLVKAVQQSKKKPLFATYIAL 540

Query: 541 MDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFP 600
           MD YASRGDVPN E+ F  +R LGYVGR +Q+QTLIQAYVNAK PAYGMRERMKADN+FP
Sbjct: 541 MDHYASRGDVPNAERIFDKLRILGYVGRFTQYQTLIQAYVNAKTPAYGMRERMKADNIFP 596

Query: 601 NKDLAGKLAQVDCLKMRKVSDLLD 625
           NK LAG+LAQVD  KM  VSDLLD
Sbjct: 601 NKALAGQLAQVDSFKMTDVSDLLD 596

BLAST of CSPI03G37740 vs. NCBI nr
Match: XP_011652178.2 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial isoform X1 [Cucumis sativus] >XP_031739460.1 pentatricopeptide repeat-containing protein At1g80270, mitochondrial isoform X2 [Cucumis sativus] >KAE8651176.1 hypothetical protein Csa_001864 [Cucumis sativus])

HSP 1 Score: 1215.3 bits (3143), Expect = 0.0e+00
Identity = 622/626 (99.36%), Postives = 623/626 (99.52%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 GSEIDDDDDVVDDRT--ELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW 180
           GSEIDDDDDVVDD T  ELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW
Sbjct: 121 GSEIDDDDDVVDDGTQNELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW 180

Query: 181 VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG 240
           VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG
Sbjct: 181 VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG 240

Query: 241 LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL 300
           LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL
Sbjct: 241 LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL 300

Query: 301 LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL 360
           LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL
Sbjct: 301 LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL 360

Query: 361 DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW 420
           DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW
Sbjct: 361 DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW 420

Query: 421 EICESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKM 480
           EICESNPHIEECMAA+VAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKM
Sbjct: 421 EICESNPHIEECMAAIVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKM 480

Query: 481 LTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI 540
           LTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI
Sbjct: 481 LTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI 540

Query: 541 TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV 600
           TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV
Sbjct: 541 TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV 600

Query: 601 FPNKDLAGKLAQVDCLKMRKVSDLLD 625
           FPNKDLAGKLAQVDCLKMRKVSDLLD
Sbjct: 601 FPNKDLAGKLAQVDCLKMRKVSDLLD 626

BLAST of CSPI03G37740 vs. NCBI nr
Match: XP_031739463.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g80270, mitochondrial [Cucumis sativus])

HSP 1 Score: 1114.8 bits (2882), Expect = 0.0e+00
Identity = 570/633 (90.05%), Postives = 595/633 (94.00%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 GSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180
           GSEIDDD+DVVDD TELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS
Sbjct: 121 GSEIDDDNDVVDDGTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180

Query: 181 EGKELSRDDISSAMLNLRKCRMYGK---------ALQFSEWLETSGKLDFIENDYASRLY 240
           EGKELSRDDISSAMLNLRKCRMYG+         A QFSEWLE +GKLDF+E DYASRL 
Sbjct: 181 EGKELSRDDISSAMLNLRKCRMYGRLCRXIDFDVAFQFSEWLEANGKLDFVEKDYASRLD 240

Query: 241 LIGKLRGLRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITA 300
           LIGKLRGLRMAENYIAKIPKSFQGEVVY+TLL NCVIA NV KAE+VFNKMK+LEFPITA
Sbjct: 241 LIGKLRGLRMAENYIAKIPKSFQGEVVYRTLLANCVIACNVQKAEEVFNKMKDLEFPITA 300

Query: 301 FACNQLLLLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSM 360
           FACNQLLLLYKRTDKRK+AD+LLLM+KENVK S  TYRILID  GLSNDITGME+VVD+M
Sbjct: 301 FACNQLLLLYKRTDKRKVADILLLMEKENVKPSRFTYRILIDTKGLSNDITGMEQVVDTM 360

Query: 361 KAEGIKLDVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQME 420
           KAEGI+LDV TLS L KHY+SGGLKDKAKA+LKEMEEINSEGSR PCRILLPLYGELQME
Sbjct: 361 KAEGIELDVSTLSVLAKHYISGGLKDKAKAILKEMEEINSEGSRWPCRILLPLYGELQME 420

Query: 421 DEVRRLWEICESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNV 480
           DEVRRLWEIC SNPHIEECMAA+VAWGKLKN+QEAEKIFDRV+KTG+KLSARHYSTM+NV
Sbjct: 421 DEVRRLWEICGSNPHIEECMAAIVAWGKLKNIQEAEKIFDRVVKTGEKLSARHYSTMLNV 480

Query: 481 YRKDSKMLTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKK 540
           YR+DSKMLTKGKE+V QMAESG RMDP TLDAVVKLYVEAGEVEKADSFLVK VLQ KKK
Sbjct: 481 YREDSKMLTKGKEVVKQMAESGSRMDPVTLDAVVKLYVEAGEVEKADSFLVKTVLQYKKK 540

Query: 541 PMFTTYITLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRE 600
           PMFTTYITLMDRYASRGDVPN EK F MMR+ GYVGRLSQFQTLIQAYVNAKAPAYGMRE
Sbjct: 541 PMFTTYITLMDRYASRGDVPNAEKIFGMMRKYGYVGRLSQFQTLIQAYVNAKAPAYGMRE 600

Query: 601 RMKADNVFPNKDLAGKLAQVDCLKMRKVSDLLD 625
           RMKAD+VFPNK LAGKLAQVD LKMR+VSDLLD
Sbjct: 601 RMKADSVFPNKALAGKLAQVDSLKMREVSDLLD 633

BLAST of CSPI03G37740 vs. NCBI nr
Match: KAE8651174.1 (hypothetical protein Csa_002089 [Cucumis sativus])

HSP 1 Score: 1099.0 bits (2841), Expect = 0.0e+00
Identity = 568/664 (85.54%), Postives = 594/664 (89.46%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 GSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180
           GSEIDDD+DVVDD TELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS
Sbjct: 121 GSEIDDDNDVVDDGTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180

Query: 181 EGKELSRDDISSAMLNLRKCRMYGKALQ-------------------------------- 240
           EGKELSRDDISSAMLNLRKCRMYG+  +                                
Sbjct: 181 EGKELSRDDISSAMLNLRKCRMYGRLCRIEMLIKFCKVQQTFIVLWSNKHKMVKQFLTLK 240

Query: 241 --------FSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIAKIPKSFQGEVVYQ 300
                   FSEWLE +GKLDF+E DYASRL LIGKLRGLRMAENYIAKIPKSFQGEVVY+
Sbjct: 241 IHPIGLESFSEWLEANGKLDFVEKDYASRLDLIGKLRGLRMAENYIAKIPKSFQGEVVYR 300

Query: 301 TLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKRKIADVLLLMKKEN 360
           TLL NCVIA NV KAE+VFNKMK+LEFPITAFACNQLLLLYKRTDKRK+AD+LLLM+KEN
Sbjct: 301 TLLANCVIACNVQKAEEVFNKMKDLEFPITAFACNQLLLLYKRTDKRKVADILLLMEKEN 360

Query: 361 VKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLVKHYVSGGLKDKAK 420
           VK S  TYRILID  GLSNDITGME+VVD+MKAEGI+LDV TLS L KHY+SGGLKDKAK
Sbjct: 361 VKPSRFTYRILIDTKGLSNDITGMEQVVDTMKAEGIELDVSTLSVLAKHYISGGLKDKAK 420

Query: 421 AVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHIEECMAALVAWGKL 480
           A+LKEMEEINSEGSR PCRILLPLYGELQMEDEVRRLWEIC SNPHIEECMAA+VAWGKL
Sbjct: 421 AILKEMEEINSEGSRWPCRILLPLYGELQMEDEVRRLWEICGSNPHIEECMAAIVAWGKL 480

Query: 481 KNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVNQMAESGCRMDPFT 540
           KN+QEAEKIFDRV+KTG+KLSARHYSTM+NVYR+DSKMLTKGKE+V QMAESG RMDP T
Sbjct: 481 KNIQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKMLTKGKEVVKQMAESGSRMDPVT 540

Query: 541 LDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASRGDVPNVEKNFAMM 600
           LDAVVKLYVEAGEVEKADSFLVK VLQ KKKPMFTTYITLMDRYASRGDVPN EK F MM
Sbjct: 541 LDAVVKLYVEAGEVEKADSFLVKTVLQYKKKPMFTTYITLMDRYASRGDVPNAEKIFGMM 600

Query: 601 RRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGKLAQVDCLKMRKVS 625
           R+ GYVGRLSQFQTLIQAYVNAKAPAYGMRERMKAD+VFPNK LAGKLAQVD LKMR+VS
Sbjct: 601 RKYGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADSVFPNKALAGKLAQVDSLKMREVS 660

BLAST of CSPI03G37740 vs. NCBI nr
Match: KAA0053889.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25516.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1012.7 bits (2617), Expect = 1.4e-291
Identity = 521/627 (83.09%), Postives = 566/627 (90.27%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRASTPLRNQGY+VRTSYVFGKLEVP+FWEGNVAGFGT  ALSDRFI F+RNNL T
Sbjct: 1   MWALRRASTPLRNQGYKVRTSYVFGKLEVPFFWEGNVAGFGTTTALSDRFISFERNNLAT 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPS+ VYISSHGLSTQAGAENSGEE NV+DG SELDETL STSPLEDSK ADDNEEELTS
Sbjct: 61  WPSAGVYISSHGLSTQAGAENSGEEDNVKDGFSELDETLASTSPLEDSKAADDNEEELTS 120

Query: 121 GSEIDDDDD-VVDDRT--ELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDK 180
           GSEIDDDDD  VDD T  ELDL EGETGL EK S KR PSEL NVIWKAPGL+V++ALDK
Sbjct: 121 GSEIDDDDDNAVDDGTQNELDLLEGETGLAEKKSTKRGPSELFNVIWKAPGLSVANALDK 180

Query: 181 WVSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLR 240
           WVSEGKELSR DIS AML LRK +M+GKALQFSEWLE SGKL+F + DYASRL LIGKLR
Sbjct: 181 WVSEGKELSRADISLAMLYLRKRQMFGKALQFSEWLEASGKLNFTDKDYASRLDLIGKLR 240

Query: 241 GLRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQL 300
           GLRMAENY+AKIPKSFQGEVVY+TLL NCVIASNV KAE+VFNKMK+LEFPITAFAC+QL
Sbjct: 241 GLRMAENYLAKIPKSFQGEVVYRTLLANCVIASNVQKAEEVFNKMKDLEFPITAFACSQL 300

Query: 301 LLLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIK 360
           LLLY+RTDKRKIAD+LLLM+KENVK S  TY+ILID  GLSNDI+GME+VVD+MKA+GI+
Sbjct: 301 LLLYRRTDKRKIADILLLMEKENVKPSRFTYKILIDAKGLSNDISGMEQVVDTMKADGIE 360

Query: 361 LDVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRL 420
           LD +TL+ L KHYVSGGLKDKAKA LK+MEEINS+GSR PCRILLP YGEL+MEDEVRRL
Sbjct: 361 LDFDTLALLAKHYVSGGLKDKAKATLKQMEEINSQGSRWPCRILLPRYGELEMEDEVRRL 420

Query: 421 WEICESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSK 480
           WEICES+PHIEECMAA+VAWGKLKNVQEAEKIFDRV+K+GKKLSARHYSTMMNVYR+ +K
Sbjct: 421 WEICESDPHIEECMAAIVAWGKLKNVQEAEKIFDRVVKSGKKLSARHYSTMMNVYRQ-TK 480

Query: 481 MLTKGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTY 540
           MLTKGKELVNQMAESGCRMDP T DAVVK YVEAGEVEKADSFLVKAV QNKKKP+F TY
Sbjct: 481 MLTKGKELVNQMAESGCRMDPLTWDAVVKFYVEAGEVEKADSFLVKAVQQNKKKPLFATY 540

Query: 541 ITLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADN 600
           +TLM  YASRGDVPN E  F  MRRLGY+GR +QFQTL+QAYVNAKAPAYGMRERM  DN
Sbjct: 541 MTLMHHYASRGDVPNAENIFDRMRRLGYMGRFTQFQTLVQAYVNAKAPAYGMRERMMVDN 600

Query: 601 VFPNKDLAGKLAQVDCLKMRKVSDLLD 625
           +FPNK LAGKLAQVD  +M +VSDLLD
Sbjct: 601 IFPNKALAGKLAQVDPFRMTEVSDLLD 626

BLAST of CSPI03G37740 vs. NCBI nr
Match: XP_008443248.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X1 [Cucumis melo] >XP_016899662.1 PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X2 [Cucumis melo] >KAA0053893.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25513.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 966.8 bits (2498), Expect = 8.7e-278
Identity = 503/624 (80.61%), Postives = 539/624 (86.38%), Query Frame = 0

Query: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60
           MWALRRAS PLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGT AALSDRFI F+RNNL T
Sbjct: 1   MWALRRASAPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTTAALSDRFISFERNNLET 60

Query: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120
           WPSS VYISSHGLSTQAGAENSGEEGNVE                       DNEEELTS
Sbjct: 61  WPSSGVYISSHGLSTQAGAENSGEEGNVE-----------------------DNEEELTS 120

Query: 121 GSEIDDDDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180
           GSEIDDDD+    + ELDLPEGETGL EKIS K APSEL N+IWKAPGL+V SALDKWVS
Sbjct: 121 GSEIDDDDET---QNELDLPEGETGLAEKISTKGAPSELFNIIWKAPGLSVPSALDKWVS 180

Query: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLR 240
           EGKELSR DIS  ML LR+ RM+GKAL+FSEWLE +GKL   + DYAS+L LIGKLRGLR
Sbjct: 181 EGKELSRADISLTMLYLRRRRMFGKALKFSEWLEANGKL-VTDRDYASQLDLIGKLRGLR 240

Query: 241 MAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLL 300
           MAENYI+KIPKSFQGEVVY+TLL NCV+++NV KAE+VFNKMK+LEFPITAFACNQLLLL
Sbjct: 241 MAENYISKIPKSFQGEVVYRTLLANCVMSTNVRKAEEVFNKMKDLEFPITAFACNQLLLL 300

Query: 301 YKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDV 360
           YKRTDK+KIADVLLLM+KENVK S  TY+ILID  GLSNDI+GME+VVD+MKAEGIKL V
Sbjct: 301 YKRTDKKKIADVLLLMEKENVKPSPFTYKILIDAKGLSNDISGMEQVVDTMKAEGIKLGV 360

Query: 361 ETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEI 420
            TL  L KHYVS GLKDKAKA LKE EEINS+GSRRPCR LLPLYGELQMEDEVRRLWEI
Sbjct: 361 GTLLLLAKHYVSAGLKDKAKATLKETEEINSKGSRRPCRFLLPLYGELQMEDEVRRLWEI 420

Query: 421 CESNPHIEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLT 480
           CESNPH+EECMAA+VAWGKLKNVQEAEKIFDRV+KTGKKLS RHYSTMMNVYR DSKMLT
Sbjct: 421 CESNPHVEECMAAIVAWGKLKNVQEAEKIFDRVVKTGKKLSTRHYSTMMNVYR-DSKMLT 480

Query: 481 KGKELVNQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITL 540
           KGKELVNQMAESGC MDPFT DAVVKLYVEAGEVEKADSFLVKAV Q+KKKP+F TYI L
Sbjct: 481 KGKELVNQMAESGCSMDPFTWDAVVKLYVEAGEVEKADSFLVKAVQQSKKKPLFATYIAL 540

Query: 541 MDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFP 600
           MD YASRGDVPN E+ F  +R LGYVGR +Q+QTLIQAYVNAK PAYGMRERMKADN+FP
Sbjct: 541 MDHYASRGDVPNAERIFDKLRILGYVGRFTQYQTLIQAYVNAKTPAYGMRERMKADNIFP 596

Query: 601 NKDLAGKLAQVDCLKMRKVSDLLD 625
           NK LAG+LAQVD  KM  VSDLLD
Sbjct: 601 NKALAGQLAQVDSFKMTDVSDLLD 596

BLAST of CSPI03G37740 vs. TAIR 10
Match: AT1G80270.1 (PENTATRICOPEPTIDE REPEAT 596 )

HSP 1 Score: 557.8 bits (1436), Expect = 1.1e-158
Identity = 293/557 (52.60%), Postives = 401/557 (71.99%), Query Frame = 0

Query: 68  ISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDD 127
           +S+  LS+ AG ++  EE ++EDG SEL+     +   + S ++D++E +L++  E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 128 DDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSR 187
                +  ELDL E +   V + ++++  SEL   I  APGL++ SALDKWV EG E++R
Sbjct: 117 -----EEEELDLIETD---VSRKTVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 188 DDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIA 247
            +I+ AML LR+ RMYG+ALQ SEWLE + K++  E DYASRL L  K+RGL   E  + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 248 KIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKR 307
           KIPKSF+GEV+Y+TLL NCV A NV K+E VFNKMK+L FP++ F C+Q+LLL+KR D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 308 KIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLV 367
           KIADVLLLM+KEN+K S  TY+ILIDV G +NDI+GME+++++MK EG++LD +T +   
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 368 KHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHI 427
           +HY   GLKDKA+ VLKEME  + E +RR  + LL +Y  L  EDEV+R+W+ICES P+ 
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 428 EECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVN 487
           EE +AA+ A+GKL  VQEAE IF++++K  ++ S+  YS ++ VY  D KML+KGK+LV 
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVY-VDHKMLSKGKDLVK 476

Query: 488 QMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASR 547
           +MAESGCR++  T DA++KLYVEAGEVEKADS L KA  Q+  K M  +++ +MD Y+ R
Sbjct: 477 RMAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKR 536

Query: 548 GDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGK 607
           GDV N EK F  MR  GY  RL QFQ L+QAY+NAK+PAYGMR+R+KADN+FPNK +A +
Sbjct: 537 GDVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQ 596

Query: 608 LAQVDCLKMRKVSDLLD 625
           LAQ D  K   +SD+LD
Sbjct: 597 LAQGDPFKKTAISDILD 596

BLAST of CSPI03G37740 vs. TAIR 10
Match: AT1G80270.2 (PENTATRICOPEPTIDE REPEAT 596 )

HSP 1 Score: 557.8 bits (1436), Expect = 1.1e-158
Identity = 293/557 (52.60%), Postives = 401/557 (71.99%), Query Frame = 0

Query: 68  ISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDD 127
           +S+  LS+ AG ++  EE ++EDG SEL+     +   + S ++D++E +L++  E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 128 DDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSR 187
                +  ELDL E +   V + ++++  SEL   I  APGL++ SALDKWV EG E++R
Sbjct: 117 -----EEEELDLIETD---VSRKTVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 188 DDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIA 247
            +I+ AML LR+ RMYG+ALQ SEWLE + K++  E DYASRL L  K+RGL   E  + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 248 KIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKR 307
           KIPKSF+GEV+Y+TLL NCV A NV K+E VFNKMK+L FP++ F C+Q+LLL+KR D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 308 KIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLV 367
           KIADVLLLM+KEN+K S  TY+ILIDV G +NDI+GME+++++MK EG++LD +T +   
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 368 KHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHI 427
           +HY   GLKDKA+ VLKEME  + E +RR  + LL +Y  L  EDEV+R+W+ICES P+ 
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 428 EECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVN 487
           EE +AA+ A+GKL  VQEAE IF++++K  ++ S+  YS ++ VY  D KML+KGK+LV 
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVY-VDHKMLSKGKDLVK 476

Query: 488 QMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASR 547
           +MAESGCR++  T DA++KLYVEAGEVEKADS L KA  Q+  K M  +++ +MD Y+ R
Sbjct: 477 RMAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKR 536

Query: 548 GDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGK 607
           GDV N EK F  MR  GY  RL QFQ L+QAY+NAK+PAYGMR+R+KADN+FPNK +A +
Sbjct: 537 GDVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQ 596

Query: 608 LAQVDCLKMRKVSDLLD 625
           LAQ D  K   +SD+LD
Sbjct: 597 LAQGDPFKKTAISDILD 596

BLAST of CSPI03G37740 vs. TAIR 10
Match: AT1G80270.3 (PENTATRICOPEPTIDE REPEAT 596 )

HSP 1 Score: 557.8 bits (1436), Expect = 1.1e-158
Identity = 293/557 (52.60%), Postives = 401/557 (71.99%), Query Frame = 0

Query: 68  ISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDD 127
           +S+  LS+ AG ++  EE ++EDG SEL+     +   + S ++D++E +L++  E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 128 DDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSR 187
                +  ELDL E +   V + ++++  SEL   I  APGL++ SALDKWV EG E++R
Sbjct: 117 -----EEEELDLIETD---VSRKTVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 188 DDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIA 247
            +I+ AML LR+ RMYG+ALQ SEWLE + K++  E DYASRL L  K+RGL   E  + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 248 KIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKR 307
           KIPKSF+GEV+Y+TLL NCV A NV K+E VFNKMK+L FP++ F C+Q+LLL+KR D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 308 KIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLV 367
           KIADVLLLM+KEN+K S  TY+ILIDV G +NDI+GME+++++MK EG++LD +T +   
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 368 KHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHI 427
           +HY   GLKDKA+ VLKEME  + E +RR  + LL +Y  L  EDEV+R+W+ICES P+ 
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 428 EECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVN 487
           EE +AA+ A+GKL  VQEAE IF++++K  ++ S+  YS ++ VY  D KML+KGK+LV 
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVY-VDHKMLSKGKDLVK 476

Query: 488 QMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASR 547
           +MAESGCR++  T DA++KLYVEAGEVEKADS L KA  Q+  K M  +++ +MD Y+ R
Sbjct: 477 RMAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKR 536

Query: 548 GDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGK 607
           GDV N EK F  MR  GY  RL QFQ L+QAY+NAK+PAYGMR+R+KADN+FPNK +A +
Sbjct: 537 GDVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQ 596

Query: 608 LAQVDCLKMRKVSDLLD 625
           LAQ D  K   +SD+LD
Sbjct: 597 LAQGDPFKKTAISDILD 596

BLAST of CSPI03G37740 vs. TAIR 10
Match: AT1G15480.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 538.5 bits (1386), Expect = 7.1e-153
Identity = 298/614 (48.53%), Postives = 413/614 (67.26%), Query Frame = 0

Query: 12  RNQGYRV-RTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTTWPSSEVYISS 71
           R+Q  R+   + V+ KL++P   E N+A   + A + D+     R    +W SS      
Sbjct: 10  RSQSLRLGACNAVYSKLDIP-LGERNIA-IESNALIHDKHEALPRFYELSWSSS---TGR 69

Query: 72  HGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDDDDDV 131
             LS+ AGA+ +G++ ++E      D+ +   +P E S  ++D EE   SG    D+ D+
Sbjct: 70  RSLSSDAGAKTTGDDDDLE------DKNVDLATPDETSSDSEDGEE--FSG----DEGDI 129

Query: 132 VDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELSRDDI 191
                EL +PE            + PSE+   I    GL+V SALDKWV +GK+ +R + 
Sbjct: 130 EGAELELHVPE-----------SKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEF 189

Query: 192 SSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYIAKIP 251
            SAML LRK RM+G+ALQ +EWL+ + + +  E DYA RL LI K+RG    E YI  IP
Sbjct: 190 ESAMLQLRKRRMFGRALQMTEWLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIP 249

Query: 252 KSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDKRKIA 311
           +SF+GE+VY+TLL N V  SNV  AE VFNKMK+L FP++ F CNQ+L+LYKR DK+KIA
Sbjct: 250 ESFRGELVYRTLLANHVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIA 309

Query: 312 DVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRLVKHY 371
           DVLLL++KEN+K + +TY+ILID  G SNDITGME++V++MK+EG++LD+   + + +HY
Sbjct: 310 DVLLLLEKENLKPNLNTYKILIDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHY 369

Query: 372 VSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPHIEEC 431
            S GLK+KA+ VLKEME  + E +R  C+ LL +YG LQ EDEVRR+W+ICE NP   E 
Sbjct: 370 ASAGLKEKAEKVLKEMEGESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRYNEV 429

Query: 432 MAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELVNQMA 491
           +AA++A+GK+  V++AE +F++VLK   ++S+  YS ++ VY  D KM+++GK+LV QM+
Sbjct: 430 LAAILAFGKIDKVKDAEAVFEKVLKMSHRVSSNVYSVLLRVY-VDHKMVSEGKDLVKQMS 489

Query: 492 ESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYASRGDV 551
           +SGC +   T DAV+KLYVEAGEVEKA+S L KA+   + KP+ ++++ LM  Y  RGDV
Sbjct: 490 DSGCNIGALTWDAVIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDV 549

Query: 552 PNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAGKLAQ 611
            N EK F  M++ GY  R   +QTLIQAYVNAKAPAYGM+ERMKADN+FPNK LA +LA+
Sbjct: 550 HNTEKIFQRMKQAGYQSRFWAYQTLIQAYVNAKAPAYGMKERMKADNIFPNKRLAAQLAK 594

Query: 612 VDCLKMRKVSDLLD 625
            D  K   +SDLLD
Sbjct: 610 ADPFKKTPLSDLLD 594

BLAST of CSPI03G37740 vs. TAIR 10
Match: AT3G15590.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 464.2 bits (1193), Expect = 1.7e-130
Identity = 255/558 (45.70%), Postives = 365/558 (65.41%), Query Frame = 0

Query: 67  YISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTSGSEIDD 126
           +   H LS+ A A++ G+E   E+  SE +E +P +  + +    DD+  E   GS+ DD
Sbjct: 65  FFGIHKLSSIADAKDKGDEVVREEELSESEEAVPVSGDVPEGVVDDDSLFEPELGSDNDD 124

Query: 127 DDDVVDDRTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVSEGKELS 186
                     L++ E  +    K + KR  SEL   I      +V   L+KWV EGK+LS
Sbjct: 125 ----------LEIEEKHSKDGGKPTKKRGQSELYESI--VAYKSVKHVLEKWVKEGKDLS 184

Query: 187 RDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRGLRMAENYI 246
           + +++ A+ NLRK + Y   LQ  EWL  + + +F E +YAS+L L+ K+  L+ AE ++
Sbjct: 185 QAEVTLAIHNLRKRKSYAMCLQLWEWLGANTQFEFTEANYASQLDLVAKVHSLQKAEIFL 244

Query: 247 AKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLLLLYKRTDK 306
             IP+S +GEVVY+TLL NCV+  +V+KAE +FNKMK L+FP + FACNQLLLLY   D+
Sbjct: 245 KDIPESSRGEVVYRTLLANCVLKHHVNKAEDIFNKMKELKFPTSVFACNQLLLLYSMHDR 304

Query: 307 RKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKLDVETLSRL 366
           +KI+DVLLLM++EN+K S +TY  LI+  GL+ DITGME++V+++K EGI+LD E  S L
Sbjct: 305 KKISDVLLLMERENIKPSRATYHFLINSKGLAGDITGMEKIVETIKEEGIELDPELQSIL 364

Query: 367 VKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLWEICESNPH 426
            K+Y+  GLK++A+ ++KE+E    + +   CR LLPLY ++   D VRRL    + NP 
Sbjct: 365 AKYYIRAGLKERAQDLMKEIEGKGLQQTPWVCRSLLPLYADIGDSDNVRRLSRFVDQNPR 424

Query: 427 IEECMAALVAWGKLKNVQEAEKIFDRVLKTGKKLSARHYSTMMNVYRKDSKMLTKGKELV 486
            + C++A+ AWGKLK V+EAE +F+R+++  K      Y  +M +Y  ++KML KG++LV
Sbjct: 425 YDNCISAIKAWGKLKEVEEAEAVFERLVEKYKIFPMMPYFALMEIY-TENKMLAKGRDLV 484

Query: 487 NQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYITLMDRYAS 546
            +M  +G  + P T  A+VKLY++AGEV KA+  L +A   NK +PMFTTY+ +++ YA 
Sbjct: 485 KRMGNAGIAIGPSTWHALVKLYIKAGEVGKAELILNRATKDNKMRPMFTTYMAILEEYAK 544

Query: 547 RGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNVFPNKDLAG 606
           RGDV N EK F  M+R  Y  +L Q++T++ AY+NAK PAYGM ERMKADNVFPNK LA 
Sbjct: 545 RGDVHNTEKVFMKMKRASYAAQLMQYETVLLAYINAKTPAYGMIERMKADNVFPNKSLAA 604

Query: 607 KLAQVDCLKMRKVSDLLD 625
           KLAQV+  K   VS LLD
Sbjct: 605 KLAQVNPFKKCPVSVLLD 609

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9C9771.6e-15752.60Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidop... [more]
Q9XI211.0e-15148.53Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidop... [more]
Q9LRP62.4e-12945.70Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidop... [more]
Q940Q22.0e-3525.48Pentatricopeptide repeat-containing protein At1g07590, mitochondrial OS=Arabidop... [more]
O227147.7e-3527.48Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0LHD80.0e+0098.40PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G819930 PE... [more]
A0A0A0LEU50.0e+0091.35PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G819910 PE... [more]
A0A5A7UK786.7e-29283.09Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B8B04.2e-27880.61pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isofor... [more]
A0A5A7UD764.2e-27880.61Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
XP_011652178.20.0e+0099.36pentatricopeptide repeat-containing protein At1g80270, mitochondrial isoform X1 ... [more]
XP_031739463.10.0e+0090.05LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g80270, mito... [more]
KAE8651174.10.0e+0085.54hypothetical protein Csa_002089 [Cucumis sativus][more]
KAA0053889.11.4e-29183.09pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25516... [more]
XP_008443248.18.7e-27880.61PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-... [more]
Match NameE-valueIdentityDescription
AT1G80270.11.1e-15852.60PENTATRICOPEPTIDE REPEAT 596 [more]
AT1G80270.21.1e-15852.60PENTATRICOPEPTIDE REPEAT 596 [more]
AT1G80270.31.1e-15852.60PENTATRICOPEPTIDE REPEAT 596 [more]
AT1G15480.17.1e-15348.53Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G15590.11.7e-13045.70Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 169..309
e-value: 3.1E-6
score: 28.6
coord: 310..419
e-value: 1.0E-11
score: 46.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 427..620
e-value: 1.9E-24
score: 88.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 257..288
e-value: 0.0014
score: 16.6
coord: 465..497
e-value: 0.0021
score: 16.1
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 277..335
e-value: 0.0044
score: 17.1
coord: 347..388
e-value: 0.0043
score: 17.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 461..496
score: 8.648523
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 75..131
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 110..131
NoneNo IPR availablePANTHERPTHR45717OS12G0527900 PROTEINcoord: 27..624
NoneNo IPR availablePANTHERPTHR45717:SF15OS01G0280400 PROTEINcoord: 27..624

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G37740.1CSPI03G37740.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding