CSPI02G07210 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI02G07210
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr2: 6058820 .. 6061181 (+)
RNA-Seq ExpressionCSPI02G07210
SyntenyCSPI02G07210
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTCAGTCTTCAAATGGCTTTGGTTTGATGGATTTGAATTCGGCTTTTTGGGTTTTTCCTACTTTTTTGAACAAGGTAGTTTAAGATGAGGTGGATGAATCTAAGCAGTTCTTGCTTTCCTTCTCCTGCTTTTCTGAAACTTTCTCATTCTATTTCTCAAGGTACAATGACCCATAAAATCATATCATTCAACTTGTCTGAGCATCACTTGTTCAAGTCATTTTCCTACCACACTTCAAATCATTTTTCATCCAATACCCTTCATGCCAAAATGGTCAAGATTGGTTCTATTTTTGTATCAGGCAAGTTTGTTTTGACCTCTTATGTGAAATCTGAGAAATTAAACGATGCTCAGAAACTGTTCGACGAAATGCCCAATAGAGATGTACTTACATGGACGGCCCTTATATCGGGTTTTTCTAGAGTCAATTCTTCTGGAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTCTCCAAATCACTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAAAGTAGGTGATGTGCGAATGGGTAAGGGAATTCATGGATGGATACTGAGAAATGGGGTTAAATTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGTTTATGCCAGAAAGTTGTATGATTCAATGAGAGAAAAGAGTACTGATACTGACAACATAATACTTGGTGTGTACGTCCGTAGTTGTGATGTTAACAAATCTCTTCATTTATTCAGAAACTTGCCCTGCAGAAATGCTGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATTTGAATGCAGCATTGGAGCTACTCTATGAAATGGTGGAGAACGAATCTGAGTTTAACAATTTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTCTTGAGCTAGGTAGACAGGTACATGGCCGAATTGTCAGGTGTGGTCTTCATAATGATGGATTTGTAAAGAGTGCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCATCAGTGATATATAGTCGACTGCCTTCAGGTTTTGCAACAAAACAAGGTTCCAATATTGTATGCAGTGACACGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTATGGATATGTCCGAAATGGCAAATATGAAGATGCCTTCAAAACTTTTGTGTCTATGGTCCGTGAACGGGTTCTAATGGACAAATTTACCATTGCAAATGTTGTGTCTGCTTGTTCTAATGCTGGTGTTTTGGAGCTTGGACGTCAAGTCCATGGATTCATTCATAAAACTGTGGAACAACTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGACCAAATGACCAATTACTTAAATGTTGTAATATGGACTTCCATGATCGTTGGATGTGCTTTACACGGGCATGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATTATACCAAATGAGGTCACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCGGGGCTGCTTGAAGATGGTCATCTATATTTTAATATGATGAAAGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTACACTTGTATGGTAGATCTTTACGGCCGAGCTGGACTCTTGAATGAAGTCAAAGAATTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTCTCATCCTGTCGGCTTTACAGGGACCTTGAAATGGGGAAGTGGGTTTCTGAAAAATTGTTTAGACTCAAACCACAAGATGAAGGGTCTTACGTTTTACTATCAAACATGTGCTCCGGCAGTCAAAAGTGGGAAGAAGCTTCAAGAGCAAGAAGATCTATGCAACACAGTGGGATTAACAAAACACCTGGTCAATCTTGGATTCATTTGAAAAATCAAGTCCACTCTTTCGTTGCAGGAGACCAATCACACCCTCAACATGCTCAAATATATGAATATCTGGACAAGCTAATTGGAAGGTTGAAGGAAATCGGGTACTTGCATGATGTGAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGTATAATCAGCTTGGGTTCTGCCATTCCAATCCGAATCATGAAGAACCTTCGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGAGAGATCATTGTTCGAGATATTTATCGTTTCCATCATTTCAATTCCGGTCATTGCTCTTGTGGTGATTATTGGTGA

mRNA sequence

CTTCAGTCTTCAAATGGCTTTGGTTTGATGGATTTGAATTCGGCTTTTTGGGTTTTTCCTACTTTTTTGAACAAGGTAGTTTAAGATGAGGTGGATGAATCTAAGCAGTTCTTGCTTTCCTTCTCCTGCTTTTCTGAAACTTTCTCATTCTATTTCTCAAGGTACAATGACCCATAAAATCATATCATTCAACTTGTCTGAGCATCACTTGTTCAAGTCATTTTCCTACCACACTTCAAATCATTTTTCATCCAATACCCTTCATGCCAAAATGGTCAAGATTGGTTCTATTTTTGTATCAGGCAAGTTTGTTTTGACCTCTTATGTGAAATCTGAGAAATTAAACGATGCTCAGAAACTGTTCGACGAAATGCCCAATAGAGATGTACTTACATGGACGGCCCTTATATCGGGTTTTTCTAGAGTCAATTCTTCTGGAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTCTCCAAATCACTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAAAGTAGGTGATGTGCGAATGGGTAAGGGAATTCATGGATGGATACTGAGAAATGGGGTTAAATTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGTTTATGCCAGAAAGTTGTATGATTCAATGAGAGAAAAGAGTACTGATACTGACAACATAATACTTGGTGTGTACGTCCGTAGTTGTGATGTTAACAAATCTCTTCATTTATTCAGAAACTTGCCCTGCAGAAATGCTGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATTTGAATGCAGCATTGGAGCTACTCTATGAAATGGTGGAGAACGAATCTGAGTTTAACAATTTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTCTTGAGCTAGGTAGACAGGTACATGGCCGAATTGTCAGGTGTGGTCTTCATAATGATGGATTTGTAAAGAGTGCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCATCAGTGATATATAGTCGACTGCCTTCAGGTTTTGCAACAAAACAAGGTTCCAATATTGTATGCAGTGACACGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTATGGATATGTCCGAAATGGCAAATATGAAGATGCCTTCAAAACTTTTGTGTCTATGGTCCGTGAACGGGTTCTAATGGACAAATTTACCATTGCAAATGTTGTGTCTGCTTGTTCTAATGCTGGTGTTTTGGAGCTTGGACGTCAAGTCCATGGATTCATTCATAAAACTGTGGAACAACTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGACCAAATGACCAATTACTTAAATGTTGTAATATGGACTTCCATGATCGTTGGATGTGCTTTACACGGGCATGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATTATACCAAATGAGGTCACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCGGGGCTGCTTGAAGATGGTCATCTATATTTTAATATGATGAAAGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTACACTTGTATGGTAGATCTTTACGGCCGAGCTGGACTCTTGAATGAAGTCAAAGAATTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTCTCATCCTGTCGGCTTTACAGGGACCTTGAAATGGGGAAGTGGGTTTCTGAAAAATTGTTTAGACTCAAACCACAAGATGAAGGGTCTTACGTTTTACTATCAAACATGTGCTCCGGCAGTCAAAAGTGGGAAGAAGCTTCAAGAGCAAGAAGATCTATGCAACACAGTGGGATTAACAAAACACCTGGTCAATCTTGGATTCATTTGAAAAATCAAGTCCACTCTTTCGTTGCAGGAGACCAATCACACCCTCAACATGCTCAAATATATGAATATCTGGACAAGCTAATTGGAAGGTTGAAGGAAATCGGGTACTTGCATGATGTGAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGTATAATCAGCTTGGGTTCTGCCATTCCAATCCGAATCATGAAGAACCTTCGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGAGAGATCATTGTTCGAGATATTTATCGTTTCCATCATTTCAATTCCGGTCATTGCTCTTGTGGTGATTATTGGTGA

Coding sequence (CDS)

ATGAGGTGGATGAATCTAAGCAGTTCTTGCTTTCCTTCTCCTGCTTTTCTGAAACTTTCTCATTCTATTTCTCAAGGTACAATGACCCATAAAATCATATCATTCAACTTGTCTGAGCATCACTTGTTCAAGTCATTTTCCTACCACACTTCAAATCATTTTTCATCCAATACCCTTCATGCCAAAATGGTCAAGATTGGTTCTATTTTTGTATCAGGCAAGTTTGTTTTGACCTCTTATGTGAAATCTGAGAAATTAAACGATGCTCAGAAACTGTTCGACGAAATGCCCAATAGAGATGTACTTACATGGACGGCCCTTATATCGGGTTTTTCTAGAGTCAATTCTTCTGGAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTCTCCAAATCACTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAAAGTAGGTGATGTGCGAATGGGTAAGGGAATTCATGGATGGATACTGAGAAATGGGGTTAAATTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGTTTATGCCAGAAAGTTGTATGATTCAATGAGAGAAAAGAGTACTGATACTGACAACATAATACTTGGTGTGTACGTCCGTAGTTGTGATGTTAACAAATCTCTTCATTTATTCAGAAACTTGCCCTGCAGAAATGCTGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATTTGAATGCAGCATTGGAGCTACTCTATGAAATGGTGGAGAACGAATCTGAGTTTAACAATTTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTCTTGAGCTAGGTAGACAGGTACATGGCCGAATTGTCAGGTGTGGTCTTCATAATGATGGATTTGTAAAGAGTGCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCATCAGTGATATATAGTCGACTGCCTTCAGGTTTTGCAACAAAACAAGGTTCCAATATTGTATGCAGTGACACGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTATGGATATGTCCGAAATGGCAAATATGAAGATGCCTTCAAAACTTTTGTGTCTATGGTCCGTGAACGGGTTCTAATGGACAAATTTACCATTGCAAATGTTGTGTCTGCTTGTTCTAATGCTGGTGTTTTGGAGCTTGGACGTCAAGTCCATGGATTCATTCATAAAACTGTGGAACAACTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGACCAAATGACCAATTACTTAAATGTTGTAATATGGACTTCCATGATCGTTGGATGTGCTTTACACGGGCATGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATTATACCAAATGAGGTCACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCGGGGCTGCTTGAAGATGGTCATCTATATTTTAATATGATGAAAGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTACACTTGTATGGTAGATCTTTACGGCCGAGCTGGACTCTTGAATGAAGTCAAAGAATTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTCTCATCCTGTCGGCTTTACAGGGACCTTGAAATGGGGAAGTGGGTTTCTGAAAAATTGTTTAGACTCAAACCACAAGATGAAGGGTCTTACGTTTTACTATCAAACATGTGCTCCGGCAGTCAAAAGTGGGAAGAAGCTTCAAGAGCAAGAAGATCTATGCAACACAGTGGGATTAACAAAACACCTGGTCAATCTTGGATTCATTTGAAAAATCAAGTCCACTCTTTCGTTGCAGGAGACCAATCACACCCTCAACATGCTCAAATATATGAATATCTGGACAAGCTAATTGGAAGGTTGAAGGAAATCGGGTACTTGCATGATGTGAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGTATAATCAGCTTGGGTTCTGCCATTCCAATCCGAATCATGAAGAACCTTCGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGAGAGATCATTGTTCGAGATATTTATCGTTTCCATCATTTCAATTCCGGTCATTGCTCTTGTGGTGATTATTGGTGA

Protein sequence

MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW*
Homology
BLAST of CSPI02G07210 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 520.0 bits (1338), Expect = 4.5e-146
Identity = 262/706 (37.11%), Postives = 431/706 (61.05%), Query Frame = 0

Query: 59  LHAKMVKIGSI-FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSS 118
           LHA+ ++  S+   S   V++ Y   + L++A  LF  + +  VL W ++I  F+  +  
Sbjct: 27  LHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLF 86

Query: 119 GMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSM 178
             AL  F EM   G  P+H    +VLK C+ + D+R G+ +HG+I+R G+  D+   N++
Sbjct: 87  SKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNAL 146

Query: 179 LDLYAK---FDEFVYARKLYDSMREKSTDT--DNIILGVYVRSCDVNKSLHLFRNLPCRN 238
           +++YAK       +    ++D M ++++++  +++     +    ++    +F  +P ++
Sbjct: 147 MNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKD 206

Query: 239 AASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHG 298
             S+NTII G  Q G    AL ++ EM   + + ++FT S  L + S  + +  G+++HG
Sbjct: 207 VVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHG 266

Query: 299 RIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVS 358
            ++R G+ +D ++ S+L++MY K   +E +  ++SRL             C D     +S
Sbjct: 267 YVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRL------------YCRDG----IS 326

Query: 359 RSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIH 418
            +S+V GYV+NG+Y +A + F  MV  +V       ++V+ AC++   L LG+Q+HG++ 
Sbjct: 327 WNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVL 386

Query: 419 KTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAI 478
           +     +  +AS+L+DMY+K G++  A +IFD+M N L+ V WT++I+G ALHGHG EA+
Sbjct: 387 RGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM-NVLDEVSWTAIIMGHALHGHGHEAV 446

Query: 479 RLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDL 538
            LFE+M+ +G+ PN+V F+ VLTACSH GL+++   YFN M  VY +  ++EHY  + DL
Sbjct: 447 SLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADL 506

Query: 539 YGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSY 598
            GRAG L E   FI +  +    +VW   LSSC ++++LE+ + V+EK+F +  ++ G+Y
Sbjct: 507 LGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAY 566

Query: 599 VLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIY 658
           VL+ NM + + +W+E ++ R  M+  G+ K P  SWI +KN+ H FV+GD+SHP   +I 
Sbjct: 567 VLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKIN 626

Query: 659 EYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIM 718
           E+L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+      IR+ 
Sbjct: 627 EFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVT 686

Query: 719 KNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           KN+RICTDCH  +K  S++  REIIVRD  RFHHFN G+CSCGDYW
Sbjct: 687 KNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of CSPI02G07210 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 517.3 bits (1331), Expect = 2.9e-145
Identity = 271/754 (35.94%), Postives = 437/754 (57.96%), Query Frame = 0

Query: 23  ISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVK 82
           I  G M    +  NL   +    ++ H    F    L            S   VL++Y K
Sbjct: 41  IKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMPLRTAF--------SWNTVLSAYSK 100

Query: 83  SEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTV 142
              ++   + FD++P RD ++WT +I G+  +     A+++  +M+ EG+ P  FTL+ V
Sbjct: 101 RGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNV 160

Query: 143 LKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKST 202
           L   +    +  GK +H +I++ G++ +V + NS+L++YAK  + + A+ ++D M  +  
Sbjct: 161 LASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDI 220

Query: 203 DTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVEN 262
            + N ++ ++++   ++ ++  F  +  R+  +WN++I G  Q GY   AL++  +M+ +
Sbjct: 221 SSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRD 280

Query: 263 E-SEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEK 322
                + FT +  LS  ++L  L +G+Q+H  IV  G    G V +ALI+MY +CG +E 
Sbjct: 281 SLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVET 340

Query: 323 ASVI-------------YSRLPSGFA----TKQGSNIVCSDTMTEIVSRSSMVYGYVRNG 382
           A  +             ++ L  G+       Q  NI  S    ++V+ ++M+ GY ++G
Sbjct: 341 ARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHG 400

Query: 383 KYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLAS 442
            Y +A   F SMV      + +T+A ++S  S+   L  G+Q+HG   K+ E     +++
Sbjct: 401 SYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN 460

Query: 443 SLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGII 502
           +LI MYAK G++  A R FD +    + V WTSMI+  A HGH +EA+ LFE M  EG+ 
Sbjct: 461 ALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLR 520

Query: 503 PNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKE 562
           P+ +T++GV +AC+HAGL+  G  YF+MMKDV  I P + HY CMVDL+GRAGLL E +E
Sbjct: 521 PDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQE 580

Query: 563 FIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQK 622
           FI +  +      W + LS+CR+++++++GK  +E+L  L+P++ G+Y  L+N+ S   K
Sbjct: 581 FIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGK 640

Query: 623 WEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKE 682
           WEEA++ R+SM+   + K  G SWI +K++VH F   D +HP+  +IY  + K+   +K+
Sbjct: 641 WEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKK 700

Query: 683 IGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNF 742
           +GY+ D   V+ D+EEE  E +L  HSEKLA+A+G+IS      +RIMKNLR+C DCH  
Sbjct: 701 MGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTA 760

Query: 743 MKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           +K  S+L+GREIIVRD  RFHHF  G CSC DYW
Sbjct: 761 IKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of CSPI02G07210 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 2.6e-130
Identity = 251/704 (35.65%), Postives = 397/704 (56.39%), Query Frame = 0

Query: 59  LHAKMVKIGSIFVSGKFVLTS----YVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRV 118
           +H  +VK G  F    F +T     Y K  ++N+A+K+FD MP RD+++W  +++G+S+ 
Sbjct: 157 IHGLLVKSG--FSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 119 NSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLE 178
             + MAL++ + M  E + P+  T+ +VL   S +  + +GK IHG+ +R+G    V + 
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 179 NSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAA 238
            +++D+YAK      AR+L+D M E                               RN  
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLE-------------------------------RNVV 336

Query: 239 SWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRI 298
           SWN++I   +Q      A+ +  +M++   +  + +   AL   + L  LE GR +H   
Sbjct: 337 SWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLS 396

Query: 299 VRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRS 358
           V  GL  +  V ++LI+MY KC  ++ A+ ++ +L S                  +VS +
Sbjct: 397 VELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS----------------RTLVSWN 456

Query: 359 SMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKT 418
           +M+ G+ +NG+  DA   F  M    V  D FT  +V++A +   +    + +HG + ++
Sbjct: 457 AMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRS 516

Query: 419 VEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRL 478
               +  + ++L+DMYAK G++  A  IFD M+   +V  W +MI G   HG GK A+ L
Sbjct: 517 CLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE-RHVTTWNAMIDGYGTHGFGKAALEL 576

Query: 479 FEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYG 538
           FE+M+   I PN VTF+ V++ACSH+GL+E G   F MMK+ Y+I+  ++HY  MVDL G
Sbjct: 577 FEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLG 636

Query: 539 RAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVL 598
           RAG LNE  +FI +  +     V+ A L +C++++++   +  +E+LF L P D G +VL
Sbjct: 637 RAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVL 696

Query: 599 LSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEY 658
           L+N+   +  WE+  + R SM   G+ KTPG S + +KN+VHSF +G  +HP   +IY +
Sbjct: 697 LANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAF 756

Query: 659 LDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKN 718
           L+KLI  +KE GY+ D  LV+  VE +  E LL  HSEKLA+++G+++  +   I + KN
Sbjct: 757 LEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKN 809

Query: 719 LRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           LR+C DCHN  K  S + GREI+VRD+ RFHHF +G CSCGDYW
Sbjct: 817 LRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CSPI02G07210 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 467.2 bits (1201), Expect = 3.5e-130
Identity = 242/709 (34.13%), Postives = 395/709 (55.71%), Query Frame = 0

Query: 54   FSSNTLHAKMVKIGSIFVSGKFV----LTSYVKSEKLNDAQKLFDEMPNRDVLTWTALIS 113
            F    LHA   K+G  F S   +    L  Y K   +  A   F E    +V+ W  ++ 
Sbjct: 406  FRGQQLHAYTTKLG--FASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLV 465

Query: 114  GFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKL 173
             +  ++    + ++FR+M +E + PN +T  ++LK C ++GD+ +G+ IH  I++   +L
Sbjct: 466  AYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQL 525

Query: 174  DVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLP 233
            +  + + ++D+YAK  +               T  D +I                     
Sbjct: 526  NAYVCSVLIDMYAKLGKL-------------DTAWDILI------------------RFA 585

Query: 234  CRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQ 293
             ++  SW T+I G  Q  + + AL    +M++     +    + A+S  + L  L+ G+Q
Sbjct: 586  GKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQ 645

Query: 294  VHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTE 353
            +H +    G  +D   ++AL+ +Y +CG +E++ + + +      T+ G NI        
Sbjct: 646  IHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ------TEAGDNI-------- 705

Query: 354  IVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHG 413
              + +++V G+ ++G  E+A + FV M RE +  + FT  + V A S    ++ G+QVH 
Sbjct: 706  --AWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHA 765

Query: 414  FIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGK 473
             I KT    +  + ++LI MYAK GS+  A + F +++   N V W ++I   + HG G 
Sbjct: 766  VITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVST-KNEVSWNAIINAYSKHGFGS 825

Query: 474  EAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCM 533
            EA+  F+QM +  + PN VT +GVL+ACSH GL++ G  YF  M   Y + PK EHY C+
Sbjct: 826  EALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCV 885

Query: 534  VDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDE 593
            VD+  RAGLL+  KEFI E  +   + VW+  LS+C +++++E+G++ +  L  L+P+D 
Sbjct: 886  VDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDS 945

Query: 594  GSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHA 653
             +YVLLSN+ + S+KW+     R+ M+  G+ K PGQSWI +KN +HSF  GDQ+HP   
Sbjct: 946  ATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLAD 1005

Query: 654  QIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPI 713
            +I+EY   L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++SL + +PI
Sbjct: 1006 EIHEYFQDLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPI 1064

Query: 714  RIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
             +MKNLR+C DCH ++K  S++  REIIVRD YRFHHF  G CSC DYW
Sbjct: 1066 NVMKNLRVCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of CSPI02G07210 vs. ExPASy Swiss-Prot
Match: Q9CAA8 (Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H22 PE=3 SV=1)

HSP 1 Score: 460.7 bits (1184), Expect = 3.2e-128
Identity = 246/687 (35.81%), Postives = 381/687 (55.46%), Query Frame = 0

Query: 76  VLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVE-GVSP 135
           +L +Y K+  +++ +  F+++P+RD +TW  LI G+S     G A++ +  M+ +   + 
Sbjct: 78  LLLAYSKAGLISEMESTFEKLPDRDGVTWNVLIEGYSLSGLVGAAVKAYNTMMRDFSANL 137

Query: 136 NHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLY 195
              TL T+LKL S  G V +GK IHG +++ G +  +++ + +L +YA       A+K++
Sbjct: 138 TRVTLMTMLKLSSSNGHVSLGKQIHGQVIKLGFESYLLVGSPLLYMYANVGCISDAKKVF 197

Query: 196 DSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALE 255
             + +++T   N ++G  +    +  +L LFR +  +++ SW  +I GL Q G    A+E
Sbjct: 198 YGLDDRNTVMYNSLMGGLLACGMIEDALQLFRGME-KDSVSWAAMIKGLAQNGLAKEAIE 257

Query: 256 LLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYI 315
              EM     + + +     L     L  +  G+Q+H  I+R    +  +V SALI+MY 
Sbjct: 258 CFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVGSALIDMYC 317

Query: 316 KCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKTFV 375
           KC  L  A  ++ R+                    +VS ++MV GY + G+ E+A K F+
Sbjct: 318 KCKCLHYAKTVFDRM----------------KQKNVVSWTAMVVGYGQTGRAEEAVKIFL 377

Query: 376 SMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAH---LASSLIDMYA 435
            M R  +  D +T+   +SAC+N   LE G Q HG   K +     H   +++SL+ +Y 
Sbjct: 378 DMQRSGIDPDHYTLGQAISACANVSSLEEGSQFHG---KAITSGLIHYVTVSNSLVTLYG 437

Query: 436 KGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFI 495
           K G +D + R+F++M N  + V WT+M+   A  G   E I+LF++M   G+ P+ VT  
Sbjct: 438 KCGDIDDSTRLFNEM-NVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPDGVTLT 497

Query: 496 GVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDL 555
           GV++ACS AGL+E G  YF +M   Y I P + HY+CM+DL+ R+G L E   FI     
Sbjct: 498 GVISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMRFINGMPF 557

Query: 556 SHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRA 615
              +  W   LS+CR   +LE+GKW +E L  L P     Y LLS++ +   KW+  ++ 
Sbjct: 558 PPDAIGWTTLLSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGKWDSVAQL 617

Query: 616 RRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDV 675
           RR M+   + K PGQSWI  K ++HSF A D+S P   QIY  L++L  ++ + GY  D 
Sbjct: 618 RRGMREKNVKKEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNNKIIDNGYKPDT 677

Query: 676 KLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQL 735
             V  DVEE     +L +HSE+LA+A+G+I + S  PIR+ KNLR+C DCHN  K  S +
Sbjct: 678 SFVHHDVEEAVKVKMLNYHSERLAIAFGLIFVPSGQPIRVGKNLRVCVDCHNATKHISSV 737

Query: 736 LGREIIVRDIYRFHHFNSGHCSCGDYW 759
            GREI+VRD  RFH F  G CSCGD+W
Sbjct: 738 TGREILVRDAVRFHRFKDGTCSCGDFW 743

BLAST of CSPI02G07210 vs. ExPASy TrEMBL
Match: A0A0A0LKI4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074230 PE=3 SV=1)

HSP 1 Score: 1537.7 bits (3980), Expect = 0.0e+00
Identity = 757/758 (99.87%), Postives = 757/758 (99.87%), Query Frame = 0

Query: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60
           MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH
Sbjct: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60

Query: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120
           AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA
Sbjct: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120

Query: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180
           LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180

Query: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240
           YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII
Sbjct: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240

Query: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300
           CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH
Sbjct: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300

Query: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGY 360
           NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQ SNIVCSDTMTEIVSRSSMVYGY
Sbjct: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420
           VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA
Sbjct: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540
           EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600

Query: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660
           GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG
Sbjct: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660

Query: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720
           RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD
Sbjct: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 758

BLAST of CSPI02G07210 vs. ExPASy TrEMBL
Match: A0A1S3B4E3 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucumis melo OX=3656 GN=LOC103485889 PE=3 SV=1)

HSP 1 Score: 1400.2 bits (3623), Expect = 0.0e+00
Identity = 696/746 (93.30%), Postives = 712/746 (95.44%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHL--FKSFSYHTSNHFSSNTLHAKMVKIGSIFVS 78
            L+  ISQGT+        + F+LS +       F YHTSN FSSNTLHAKMVKIGSI  S
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSYFFPPLXKFCYHTSNSFSSNTLHAKMVKIGSIIES 326

Query: 79   GKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGV 138
            GKFVLTSYVKS+KLNDAQKLFDEMPNRDVLTWTA+ISGFSRVN SGMALQLFREMLVEGV
Sbjct: 327  GKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTAIISGFSRVNCSGMALQLFREMLVEGV 386

Query: 139  SPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARK 198
             PNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENS+LDLYAKFDEFVYARK
Sbjct: 387  CPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSLLDLYAKFDEFVYARK 446

Query: 199  LYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 258
            LYDSM EKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA
Sbjct: 447  LYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 506

Query: 259  LELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 318
            LELLYEMVENESEFNNFTSSIALSV SSLLILELGRQVHGRIVRCGLHNDGFVKSALINM
Sbjct: 507  LELLYEMVENESEFNNFTSSIALSVASSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 566

Query: 319  YIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 378
            YIKCGNLEKASVIYS+LPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT
Sbjct: 567  YIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 626

Query: 379  FVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAK 438
            FVSMVRERVLMDKFTIA+VVSAC+NAGVLELGRQVHGFI K+VEQLDAHLASSLIDMYAK
Sbjct: 627  FVSMVRERVLMDKFTIASVVSACANAGVLELGRQVHGFIQKSVEQLDAHLASSLIDMYAK 686

Query: 439  GGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFIG 498
            GGSLDCAHRIFDQMT YLNVVIWTSMIVGC+LHGHGKEAIRLFEQMRYEGIIPNEVTFIG
Sbjct: 687  GGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHGHGKEAIRLFEQMRYEGIIPNEVTFIG 746

Query: 499  VLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 558
            VLTACSHAGLLEDG LYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS
Sbjct: 747  VLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 806

Query: 559  HLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRAR 618
            HLS VWKAFLSSC LYRDLEMGKWVSEKLFRL+PQDEGSYVLLSNMCSGSQKW+EASRAR
Sbjct: 807  HLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEPQDEGSYVLLSNMCSGSQKWQEASRAR 866

Query: 619  RSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 678
             SMQHSGINKTPGQSWIHLKNQVHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYLHDVK
Sbjct: 867  SSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 926

Query: 679  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 738
            LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL
Sbjct: 927  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 986

Query: 739  GREIIVRDIYRFHHFNSGHCSCGDYW 759
            GREIIVRDI RFHHFNSGHCSCGDYW
Sbjct: 987  GREIIVRDICRFHHFNSGHCSCGDYW 1012

BLAST of CSPI02G07210 vs. ExPASy TrEMBL
Match: A0A5D3C6T4 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold105G00140 PE=3 SV=1)

HSP 1 Score: 1365.5 bits (3533), Expect = 0.0e+00
Identity = 673/713 (94.39%), Postives = 687/713 (96.35%), Query Frame = 0

Query: 46  FSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWT 105
           F ++  NH          +KIGSI  SGKFVLTSYVKS+KLNDAQKLFDEMPNRDVLTWT
Sbjct: 178 FYHNCGNHGVLKDRTLLFLKIGSIIESGKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWT 237

Query: 106 ALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRN 165
           A+ISGFSRVN SGMALQLFREMLVEGV PNHFTLSTVLKLCSKVGDVRMGKGIHGWILRN
Sbjct: 238 AIISGFSRVNCSGMALQLFREMLVEGVCPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRN 297

Query: 166 GVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLF 225
           GVKLDVVLENS+LDLYAKFDEFVYARKLYDSM EKSTDTDNIILGVYVRSCDVNKSLHLF
Sbjct: 298 GVKLDVVLENSLLDLYAKFDEFVYARKLYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLF 357

Query: 226 RNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILE 285
           RNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSV SSLLILE
Sbjct: 358 RNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVASSLLILE 417

Query: 286 LGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSD 345
           LGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYS+LPSGFATKQGSNIVCSD
Sbjct: 418 LGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSQLPSGFATKQGSNIVCSD 477

Query: 346 TMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGR 405
           TMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIA+VVSAC+NAGVLELGR
Sbjct: 478 TMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACANAGVLELGR 537

Query: 406 QVHGFIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALH 465
           QVHGFI K+VEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMT YLNVVIWTSMIVGC+LH
Sbjct: 538 QVHGFIQKSVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLH 597

Query: 466 GHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEH 525
           GHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDG LYFNMMKDVYAIKPKVEH
Sbjct: 598 GHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEH 657

Query: 526 YTCMVDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLK 585
           YTCMVDLYGRAGLLNEVKEFIYENDLSHLS VWKAFLSSC LYRDLEMGKWVSEKLFRL+
Sbjct: 658 YTCMVDLYGRAGLLNEVKEFIYENDLSHLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLE 717

Query: 586 PQDEGSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSH 645
           PQDEGSYVLLSNMCSGSQKW+EASRAR SMQHSGINKTPGQSWIHLKNQVHSFVAGD+SH
Sbjct: 718 PQDEGSYVLLSNMCSGSQKWQEASRARSSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSH 777

Query: 646 PQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGS 705
           PQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGS
Sbjct: 778 PQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGS 837

Query: 706 AIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           AIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDI RFHHFNSGHCSCGDYW
Sbjct: 838 AIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDICRFHHFNSGHCSCGDYW 890

BLAST of CSPI02G07210 vs. ExPASy TrEMBL
Match: A0A6J1EPP7 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita moschata OX=3662 GN=LOC111436248 PE=3 SV=1)

HSP 1 Score: 1258.8 bits (3256), Expect = 0.0e+00
Identity = 627/749 (83.71%), Postives = 677/749 (90.39%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHLFKSF-----SYHTSNHFSSNTLHAKMVKIGSI 78
            L+  ISQGT+        + F+LS +     F     ++H+SN    NTLHAKMVK GSI
Sbjct: 268  LASKISQGTVATVGGLLFLGFSLSSYFFPPLFLVALENFHSSNDSLPNTLHAKMVKNGSI 327

Query: 79   FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLV 138
            F S KF+L+SYVKSEKLNDA+K+FDEMP+RDVLTWT LISGF+RVN S MALQLFREMLV
Sbjct: 328  FESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLV 387

Query: 139  EGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVY 198
            EGV PN FTLSTVLKLCS+VGDV+MGKGIHGWILR+GV LDVVLENSMLDLYAKFDEF Y
Sbjct: 388  EGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDY 447

Query: 199  ARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYL 258
              KL+DSMREKST T NI+LGV+VRS DVNKSL LFRNLPCR+ ASWNT+ICGLMQGGYL
Sbjct: 448  VTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTASWNTVICGLMQGGYL 507

Query: 259  NAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSAL 318
            N ALELLYEMVENE EFN  TSSIALSVVSSLLI+ELGRQVHGRIVRCGLHNDGFVKS+L
Sbjct: 508  NEALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRCGLHNDGFVKSSL 567

Query: 319  INMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDA 378
            INMYIKCGNLEKASVIYS++PSGFATKQ  NIVCSDTMTEIVSRSSMV GYVRNGKYEDA
Sbjct: 568  INMYIKCGNLEKASVIYSQMPSGFATKQDFNIVCSDTMTEIVSRSSMVSGYVRNGKYEDA 627

Query: 379  FKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDM 438
            FKTFVSMVRERVLMDKFTIA+VVSACSNAGV ELGRQ+H +I KT EQLDAHL SSLIDM
Sbjct: 628  FKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHLTSSLIDM 687

Query: 439  YAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVT 498
            YAKGGSLDCA +IF+Q T YLNVVIWTSMI GCALHG GKEAIRLFE+MRYEG+IPNEVT
Sbjct: 688  YAKGGSLDCARQIFEQ-TTYLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGMIPNEVT 747

Query: 499  FIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYEN 558
            FIGVL ACSHAGLLEDG LYFNMMKDVYAIKPKVEH+TCMVDLYGRAG LNEVK+FIYEN
Sbjct: 748  FIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGHLNEVKKFIYEN 807

Query: 559  DLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEAS 618
            DLSHL+AVWKAFLSSC+LY+D+EMG WVSE+LFRL+P DEG YVLLSNMCS +QKWEEA 
Sbjct: 808  DLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYVLLSNMCSSNQKWEEAF 867

Query: 619  RARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLH 678
            R RRSMQH GI+KTPGQSWIH+KN+VHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYL 
Sbjct: 868  RTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLF 927

Query: 679  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTS 738
            DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+ISLGS+IPIRIMKNLRICTDCHNFMKLTS
Sbjct: 928  DVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIPIRIMKNLRICTDCHNFMKLTS 987

Query: 739  QLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
            QLL REIIVRDI+RFHHFNSGHCSCGDYW
Sbjct: 988  QLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of CSPI02G07210 vs. ExPASy TrEMBL
Match: A0A6J1KA70 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111492492 PE=3 SV=1)

HSP 1 Score: 1242.6 bits (3214), Expect = 0.0e+00
Identity = 618/749 (82.51%), Postives = 672/749 (89.72%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHLFKSF-----SYHTSNHFSSNTLHAKMVKIGSI 78
            L+  ISQGT+        + F+LS +     F     +YH+SN    NTLHAKMVK GSI
Sbjct: 268  LASKISQGTVATVGGLLFLGFSLSSYFFPPLFLVALENYHSSNDSLPNTLHAKMVKNGSI 327

Query: 79   FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLV 138
            F S KF+L+SYVKSEKLNDA+K+FDEMP+RDVLTWT LISGF+RVN S MALQLFREMLV
Sbjct: 328  FESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLV 387

Query: 139  EGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVY 198
            EGV PN FTLSTVLKLCS+VGDV+MGKGIHGWILR+GV LDVVLENSMLDLYAKFDEF Y
Sbjct: 388  EGVYPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDY 447

Query: 199  ARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYL 258
             +KL+DSMREKST T NI+LGV+VRS DVNKSL LFRNLPCR+ ASWNT+ICGLMQGGYL
Sbjct: 448  VKKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTASWNTVICGLMQGGYL 507

Query: 259  NAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSAL 318
            N ALELLYEMVEN+ EFN  TSSIALSVVSSLLI+ELGRQVHGRI+RCG HNDGFVKS+L
Sbjct: 508  NEALELLYEMVENQPEFNKVTSSIALSVVSSLLIIELGRQVHGRILRCGFHNDGFVKSSL 567

Query: 319  INMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDA 378
            INMYIKCGNLEKASVIYS++PSGF  KQ  +IV SDTMTEIVSRSSMV GYVRNGKYEDA
Sbjct: 568  INMYIKCGNLEKASVIYSQMPSGFGKKQDFDIVYSDTMTEIVSRSSMVSGYVRNGKYEDA 627

Query: 379  FKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDM 438
            FKTFVSMVRERVLMDKFTIA+VVSACSNAGV ELGRQ+H +I KT EQLDAHL SSLIDM
Sbjct: 628  FKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHLTSSLIDM 687

Query: 439  YAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVT 498
            YAKGGSLDCA +IF+QMT YLNVVIWTSMI GCALHG GKEAIRLFE+MRYEG+IPNEVT
Sbjct: 688  YAKGGSLDCARQIFEQMT-YLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGMIPNEVT 747

Query: 499  FIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYEN 558
            FIGVL ACSHAGL+EDG LYFNMMKDVYAIKPKVEH+TCMVDLYGRAG LNEVK+FIYEN
Sbjct: 748  FIGVLAACSHAGLIEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGRLNEVKKFIYEN 807

Query: 559  DLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEAS 618
            DLSHL+AVWKAFLSSC+LY+D+EMG WVSE+LFRL+P DEG Y+LLSNMCS +QKWEEA 
Sbjct: 808  DLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYILLSNMCSSNQKWEEAF 867

Query: 619  RARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLH 678
            R RR MQH GI+KTPGQSWIH+KNQVHSFVAGD+SHPQHAQIYEYLD LIGRLKEIGYL 
Sbjct: 868  RTRRFMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYEYLDNLIGRLKEIGYLF 927

Query: 679  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTS 738
            DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+ISL SAIPIRIMKNLR+CTDCHNFMKLTS
Sbjct: 928  DVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLDSAIPIRIMKNLRMCTDCHNFMKLTS 987

Query: 739  QLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
            QLL REIIVRDI+RFHHFNSGHCSCGDYW
Sbjct: 988  QLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of CSPI02G07210 vs. NCBI nr
Match: XP_011648996.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucumis sativus])

HSP 1 Score: 1463.0 bits (3786), Expect = 0.0e+00
Identity = 727/751 (96.80%), Postives = 731/751 (97.34%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSE-------HHLFKSFSYHTSNHFSSNTLHAKMVKIG 78
            L+  ISQGT+        + F+LS        HHLFKSFSYHTSNHFSSNTLHAKMVKIG
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSYFFPPLXHHLFKSFSYHTSNHFSSNTLHAKMVKIG 326

Query: 79   SIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREM 138
            SIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREM
Sbjct: 327  SIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREM 386

Query: 139  LVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEF 198
            LVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEF
Sbjct: 387  LVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEF 446

Query: 199  VYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGG 258
            VYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGG
Sbjct: 447  VYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGG 506

Query: 259  YLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKS 318
            YLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKS
Sbjct: 507  YLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKS 566

Query: 319  ALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYE 378
            ALINMYIKCGNLEKASVIYSRLPSGFATKQ SNIVCSDTMTEIVSRSSMVYGYVRNGKYE
Sbjct: 567  ALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGYVRNGKYE 626

Query: 379  DAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLI 438
            DAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLI
Sbjct: 627  DAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLI 686

Query: 439  DMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNE 498
            DMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNE
Sbjct: 687  DMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNE 746

Query: 499  VTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIY 558
            VTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIY
Sbjct: 747  VTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIY 806

Query: 559  ENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEE 618
            ENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEE
Sbjct: 807  ENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEE 866

Query: 619  ASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGY 678
            ASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGY
Sbjct: 867  ASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGY 926

Query: 679  LHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKL 738
            LHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKL
Sbjct: 927  LHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKL 986

Query: 739  TSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
            TSQLLGREIIVRDIYRFHHFNSGHCSCGDYW
Sbjct: 987  TSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 1017

BLAST of CSPI02G07210 vs. NCBI nr
Match: XP_008441858.1 (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucumis melo])

HSP 1 Score: 1400.2 bits (3623), Expect = 0.0e+00
Identity = 696/746 (93.30%), Postives = 712/746 (95.44%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHL--FKSFSYHTSNHFSSNTLHAKMVKIGSIFVS 78
            L+  ISQGT+        + F+LS +       F YHTSN FSSNTLHAKMVKIGSI  S
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSYFFPPLXKFCYHTSNSFSSNTLHAKMVKIGSIIES 326

Query: 79   GKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGV 138
            GKFVLTSYVKS+KLNDAQKLFDEMPNRDVLTWTA+ISGFSRVN SGMALQLFREMLVEGV
Sbjct: 327  GKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTAIISGFSRVNCSGMALQLFREMLVEGV 386

Query: 139  SPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARK 198
             PNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENS+LDLYAKFDEFVYARK
Sbjct: 387  CPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSLLDLYAKFDEFVYARK 446

Query: 199  LYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 258
            LYDSM EKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA
Sbjct: 447  LYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 506

Query: 259  LELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 318
            LELLYEMVENESEFNNFTSSIALSV SSLLILELGRQVHGRIVRCGLHNDGFVKSALINM
Sbjct: 507  LELLYEMVENESEFNNFTSSIALSVASSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 566

Query: 319  YIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 378
            YIKCGNLEKASVIYS+LPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT
Sbjct: 567  YIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 626

Query: 379  FVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAK 438
            FVSMVRERVLMDKFTIA+VVSAC+NAGVLELGRQVHGFI K+VEQLDAHLASSLIDMYAK
Sbjct: 627  FVSMVRERVLMDKFTIASVVSACANAGVLELGRQVHGFIQKSVEQLDAHLASSLIDMYAK 686

Query: 439  GGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFIG 498
            GGSLDCAHRIFDQMT YLNVVIWTSMIVGC+LHGHGKEAIRLFEQMRYEGIIPNEVTFIG
Sbjct: 687  GGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHGHGKEAIRLFEQMRYEGIIPNEVTFIG 746

Query: 499  VLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 558
            VLTACSHAGLLEDG LYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS
Sbjct: 747  VLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 806

Query: 559  HLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRAR 618
            HLS VWKAFLSSC LYRDLEMGKWVSEKLFRL+PQDEGSYVLLSNMCSGSQKW+EASRAR
Sbjct: 807  HLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEPQDEGSYVLLSNMCSGSQKWQEASRAR 866

Query: 619  RSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 678
             SMQHSGINKTPGQSWIHLKNQVHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYLHDVK
Sbjct: 867  SSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 926

Query: 679  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 738
            LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL
Sbjct: 927  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 986

Query: 739  GREIIVRDIYRFHHFNSGHCSCGDYW 759
            GREIIVRDI RFHHFNSGHCSCGDYW
Sbjct: 987  GREIIVRDICRFHHFNSGHCSCGDYW 1012

BLAST of CSPI02G07210 vs. NCBI nr
Match: KAA0049879.1 (putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK07617.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1365.5 bits (3533), Expect = 0.0e+00
Identity = 673/713 (94.39%), Postives = 687/713 (96.35%), Query Frame = 0

Query: 46  FSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWT 105
           F ++  NH          +KIGSI  SGKFVLTSYVKS+KLNDAQKLFDEMPNRDVLTWT
Sbjct: 178 FYHNCGNHGVLKDRTLLFLKIGSIIESGKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWT 237

Query: 106 ALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRN 165
           A+ISGFSRVN SGMALQLFREMLVEGV PNHFTLSTVLKLCSKVGDVRMGKGIHGWILRN
Sbjct: 238 AIISGFSRVNCSGMALQLFREMLVEGVCPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRN 297

Query: 166 GVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLF 225
           GVKLDVVLENS+LDLYAKFDEFVYARKLYDSM EKSTDTDNIILGVYVRSCDVNKSLHLF
Sbjct: 298 GVKLDVVLENSLLDLYAKFDEFVYARKLYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLF 357

Query: 226 RNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILE 285
           RNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSV SSLLILE
Sbjct: 358 RNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVASSLLILE 417

Query: 286 LGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSD 345
           LGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYS+LPSGFATKQGSNIVCSD
Sbjct: 418 LGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSQLPSGFATKQGSNIVCSD 477

Query: 346 TMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGR 405
           TMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIA+VVSAC+NAGVLELGR
Sbjct: 478 TMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACANAGVLELGR 537

Query: 406 QVHGFIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALH 465
           QVHGFI K+VEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMT YLNVVIWTSMIVGC+LH
Sbjct: 538 QVHGFIQKSVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLH 597

Query: 466 GHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEH 525
           GHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDG LYFNMMKDVYAIKPKVEH
Sbjct: 598 GHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEH 657

Query: 526 YTCMVDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLK 585
           YTCMVDLYGRAGLLNEVKEFIYENDLSHLS VWKAFLSSC LYRDLEMGKWVSEKLFRL+
Sbjct: 658 YTCMVDLYGRAGLLNEVKEFIYENDLSHLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLE 717

Query: 586 PQDEGSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSH 645
           PQDEGSYVLLSNMCSGSQKW+EASRAR SMQHSGINKTPGQSWIHLKNQVHSFVAGD+SH
Sbjct: 718 PQDEGSYVLLSNMCSGSQKWQEASRARSSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSH 777

Query: 646 PQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGS 705
           PQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGS
Sbjct: 778 PQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGS 837

Query: 706 AIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           AIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDI RFHHFNSGHCSCGDYW
Sbjct: 838 AIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDICRFHHFNSGHCSCGDYW 890

BLAST of CSPI02G07210 vs. NCBI nr
Match: XP_038889548.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida] >XP_038889549.1 putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida])

HSP 1 Score: 1344.7 bits (3479), Expect = 0.0e+00
Identity = 664/758 (87.60%), Postives = 702/758 (92.61%), Query Frame = 0

Query: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60
           MR MNLSS CF + AFLKL H I Q TM  KIISFNLSEH LFKS  YHTSN    NTLH
Sbjct: 1   MRLMNLSSCCF-ATAFLKLPHPICQVTMAQKIISFNLSEHQLFKSCCYHTSNDSLVNTLH 60

Query: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120
           AKMVK GSI  SGKFVL+SYVKSEKLNDAQKLFDEMP+RDVLTWT LISGFSR+N S MA
Sbjct: 61  AKMVKNGSILESGKFVLSSYVKSEKLNDAQKLFDEMPSRDVLTWTVLISGFSRINCSEMA 120

Query: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180
           LQLFR+MLVEGV PNHFTLSTVLKLCS+VGD++MGKGIHGWILRNGV LDVVLENSMLDL
Sbjct: 121 LQLFRKMLVEGVCPNHFTLSTVLKLCSRVGDMQMGKGIHGWILRNGVNLDVVLENSMLDL 180

Query: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240
           YAKFD+F  A+KL+DSMREKST T NI+LGVYVRSCDVNKSL LFRN+PCRN ASWNTII
Sbjct: 181 YAKFDDFYCAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNMPCRNTASWNTII 240

Query: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300
           CGLMQGG+LNAALELLYEMVENE EFN  TSSIALSVV+SLLI+ELGRQVHGRI+RCGLH
Sbjct: 241 CGLMQGGHLNAALELLYEMVENEPEFNKVTSSIALSVVASLLIIELGRQVHGRIIRCGLH 300

Query: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGY 360
           NDGFVKS+LINMYIKCGNLEKASVIYS++PSGF TKQ SNIVCSD MTEIVSRSSMV GY
Sbjct: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFVTKQDSNIVCSDMMTEIVSRSSMVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420
           + NGKYE+AFKT VSMVRERVLMDKFTIA+VVSACSNAGVLELGRQ+HG+I KT EQLDA
Sbjct: 361 IWNGKYENAFKTVVSMVRERVLMDKFTIASVVSACSNAGVLELGRQIHGYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCAHRIF+Q TNYLNVV+WTSMI G ALHG GKEAIRLFE+MRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFEQTTNYLNVVLWTSMIAGYALHGQGKEAIRLFERMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540
           EGIIPNEVTF+GVLTACSHAGLLE G LYFNMMKDVYAIKPKVEH+TCMVDLYGRAG LN
Sbjct: 481 EGIIPNEVTFVGVLTACSHAGLLEHGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCRLY++LEMG WVSEKLF L+ QDEGSYVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYKNLEMGNWVSEKLFSLEQQDEGSYVLLSNMCS 600

Query: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660
           GSQKWEEASR RRSMQH GINKTPGQSWIH+KNQVHSFVAGDQSHPQH QIYEYLDKLIG
Sbjct: 601 GSQKWEEASRTRRSMQHRGINKTPGQSWIHVKNQVHSFVAGDQSHPQHVQIYEYLDKLIG 660

Query: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720
           RLKEIGYL+DVKLVMQDVEEEQGEVLLGWHSEKLA+AYGIISLGSAIPIRIMKNLR+CTD
Sbjct: 661 RLKEIGYLYDVKLVMQDVEEEQGEVLLGWHSEKLALAYGIISLGSAIPIRIMKNLRVCTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           CHNFMKLTSQLLGREIIVRDI+RFH FNSGHCSCGDYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIHRFHRFNSGHCSCGDYW 757

BLAST of CSPI02G07210 vs. NCBI nr
Match: KAG7020981.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1280.0 bits (3311), Expect = 0.0e+00
Identity = 624/758 (82.32%), Postives = 687/758 (90.63%), Query Frame = 0

Query: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60
           MRWMN  S  F S AFLKL+HS+SQ +M  KII FNLSEH LFKS  YH+SN  SSNTLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVSMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120
           AKMVK GSI   GK V++SYVKSEKL+DAQK+FDEMP+RDVL+WT LISGF+RVN S  A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180
           LQLFREMLVEGV PNHFTLS VLKLCS+VGD++MGKGIHGWILR+GV LDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240
           Y KFD F YA KL+DSMREKST + NI+LGVYVRSCDVNKSL LFRNLPCR+ ASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300
           CGLMQGGYLN A+ELLYEMV+NE EFN  TSSIALSVVSSLLI+ELGRQVHGRI R GLH
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGY 360
           NDGFV S+LINMYIKCGNLEKASVIYS++PS F  ++ SNIVCS+TMTEIVSRSS+V GY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420
           V+NGKYED+F+TFVSMVRER +MD+FTIA+++SACSNAGVLELGRQ+H +I KT EQLDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480
           HLASS+IDMYAKGGSLDCAH++F+Q T YLNVV WTSMI GCALHG GKEAIRLFEQMRY
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQ-TTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540
           EGIIPNEVTFIGVLTACSHAGLL++G LYFNMMKDVYAI+PKVEH+TCMVD+YGRAG LN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGRLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600
           EVKEFIY+NDLSH SAVWKAFLSSCRLY+D+EMG WVSEKLF+L+P+DEG YVLLSNMCS
Sbjct: 541 EVKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCS 600

Query: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660
            +QKWEEAS+ RRSMQH GI+KTPGQSWIH+KNQVHSF+AGD+SH QHAQIY YLDKLIG
Sbjct: 601 SNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIG 660

Query: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720
           RLKEIGY  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGII+L S IPIRIMKNLR+CTD
Sbjct: 661 RLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           CHNFMKLTSQLL REIIVRDI+RFHHFNSGHCSCGDYW
Sbjct: 721 CHNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of CSPI02G07210 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 520.0 bits (1338), Expect = 3.2e-147
Identity = 262/706 (37.11%), Postives = 431/706 (61.05%), Query Frame = 0

Query: 59  LHAKMVKIGSI-FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSS 118
           LHA+ ++  S+   S   V++ Y   + L++A  LF  + +  VL W ++I  F+  +  
Sbjct: 27  LHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLF 86

Query: 119 GMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSM 178
             AL  F EM   G  P+H    +VLK C+ + D+R G+ +HG+I+R G+  D+   N++
Sbjct: 87  SKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNAL 146

Query: 179 LDLYAK---FDEFVYARKLYDSMREKSTDT--DNIILGVYVRSCDVNKSLHLFRNLPCRN 238
           +++YAK       +    ++D M ++++++  +++     +    ++    +F  +P ++
Sbjct: 147 MNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKD 206

Query: 239 AASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHG 298
             S+NTII G  Q G    AL ++ EM   + + ++FT S  L + S  + +  G+++HG
Sbjct: 207 VVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHG 266

Query: 299 RIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVS 358
            ++R G+ +D ++ S+L++MY K   +E +  ++SRL             C D     +S
Sbjct: 267 YVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRL------------YCRDG----IS 326

Query: 359 RSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIH 418
            +S+V GYV+NG+Y +A + F  MV  +V       ++V+ AC++   L LG+Q+HG++ 
Sbjct: 327 WNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVL 386

Query: 419 KTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAI 478
           +     +  +AS+L+DMY+K G++  A +IFD+M N L+ V WT++I+G ALHGHG EA+
Sbjct: 387 RGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM-NVLDEVSWTAIIMGHALHGHGHEAV 446

Query: 479 RLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDL 538
            LFE+M+ +G+ PN+V F+ VLTACSH GL+++   YFN M  VY +  ++EHY  + DL
Sbjct: 447 SLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADL 506

Query: 539 YGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSY 598
            GRAG L E   FI +  +    +VW   LSSC ++++LE+ + V+EK+F +  ++ G+Y
Sbjct: 507 LGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAY 566

Query: 599 VLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIY 658
           VL+ NM + + +W+E ++ R  M+  G+ K P  SWI +KN+ H FV+GD+SHP   +I 
Sbjct: 567 VLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKIN 626

Query: 659 EYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIM 718
           E+L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+      IR+ 
Sbjct: 627 EFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVT 686

Query: 719 KNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           KN+RICTDCH  +K  S++  REIIVRD  RFHHFN G+CSCGDYW
Sbjct: 687 KNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of CSPI02G07210 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 517.3 bits (1331), Expect = 2.1e-146
Identity = 271/754 (35.94%), Postives = 437/754 (57.96%), Query Frame = 0

Query: 23  ISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVK 82
           I  G M    +  NL   +    ++ H    F    L            S   VL++Y K
Sbjct: 41  IKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMPLRTAF--------SWNTVLSAYSK 100

Query: 83  SEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTV 142
              ++   + FD++P RD ++WT +I G+  +     A+++  +M+ EG+ P  FTL+ V
Sbjct: 101 RGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNV 160

Query: 143 LKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKST 202
           L   +    +  GK +H +I++ G++ +V + NS+L++YAK  + + A+ ++D M  +  
Sbjct: 161 LASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDI 220

Query: 203 DTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVEN 262
            + N ++ ++++   ++ ++  F  +  R+  +WN++I G  Q GY   AL++  +M+ +
Sbjct: 221 SSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRD 280

Query: 263 E-SEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEK 322
                + FT +  LS  ++L  L +G+Q+H  IV  G    G V +ALI+MY +CG +E 
Sbjct: 281 SLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVET 340

Query: 323 ASVI-------------YSRLPSGFA----TKQGSNIVCSDTMTEIVSRSSMVYGYVRNG 382
           A  +             ++ L  G+       Q  NI  S    ++V+ ++M+ GY ++G
Sbjct: 341 ARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHG 400

Query: 383 KYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLAS 442
            Y +A   F SMV      + +T+A ++S  S+   L  G+Q+HG   K+ E     +++
Sbjct: 401 SYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN 460

Query: 443 SLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGII 502
           +LI MYAK G++  A R FD +    + V WTSMI+  A HGH +EA+ LFE M  EG+ 
Sbjct: 461 ALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLR 520

Query: 503 PNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKE 562
           P+ +T++GV +AC+HAGL+  G  YF+MMKDV  I P + HY CMVDL+GRAGLL E +E
Sbjct: 521 PDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQE 580

Query: 563 FIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQK 622
           FI +  +      W + LS+CR+++++++GK  +E+L  L+P++ G+Y  L+N+ S   K
Sbjct: 581 FIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGK 640

Query: 623 WEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKE 682
           WEEA++ R+SM+   + K  G SWI +K++VH F   D +HP+  +IY  + K+   +K+
Sbjct: 641 WEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKK 700

Query: 683 IGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNF 742
           +GY+ D   V+ D+EEE  E +L  HSEKLA+A+G+IS      +RIMKNLR+C DCH  
Sbjct: 701 MGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTA 760

Query: 743 MKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           +K  S+L+GREIIVRD  RFHHF  G CSC DYW
Sbjct: 761 IKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of CSPI02G07210 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 467.6 bits (1202), Expect = 1.9e-131
Identity = 251/704 (35.65%), Postives = 397/704 (56.39%), Query Frame = 0

Query: 59  LHAKMVKIGSIFVSGKFVLTS----YVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRV 118
           +H  +VK G  F    F +T     Y K  ++N+A+K+FD MP RD+++W  +++G+S+ 
Sbjct: 157 IHGLLVKSG--FSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 119 NSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLE 178
             + MAL++ + M  E + P+  T+ +VL   S +  + +GK IHG+ +R+G    V + 
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 179 NSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAA 238
            +++D+YAK      AR+L+D M E                               RN  
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLE-------------------------------RNVV 336

Query: 239 SWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRI 298
           SWN++I   +Q      A+ +  +M++   +  + +   AL   + L  LE GR +H   
Sbjct: 337 SWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLS 396

Query: 299 VRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRS 358
           V  GL  +  V ++LI+MY KC  ++ A+ ++ +L S                  +VS +
Sbjct: 397 VELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS----------------RTLVSWN 456

Query: 359 SMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKT 418
           +M+ G+ +NG+  DA   F  M    V  D FT  +V++A +   +    + +HG + ++
Sbjct: 457 AMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRS 516

Query: 419 VEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRL 478
               +  + ++L+DMYAK G++  A  IFD M+   +V  W +MI G   HG GK A+ L
Sbjct: 517 CLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE-RHVTTWNAMIDGYGTHGFGKAALEL 576

Query: 479 FEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYG 538
           FE+M+   I PN VTF+ V++ACSH+GL+E G   F MMK+ Y+I+  ++HY  MVDL G
Sbjct: 577 FEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLG 636

Query: 539 RAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVL 598
           RAG LNE  +FI +  +     V+ A L +C++++++   +  +E+LF L P D G +VL
Sbjct: 637 RAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVL 696

Query: 599 LSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEY 658
           L+N+   +  WE+  + R SM   G+ KTPG S + +KN+VHSF +G  +HP   +IY +
Sbjct: 697 LANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAF 756

Query: 659 LDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKN 718
           L+KLI  +KE GY+ D  LV+  VE +  E LL  HSEKLA+++G+++  +   I + KN
Sbjct: 757 LEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKN 809

Query: 719 LRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
           LR+C DCHN  K  S + GREI+VRD+ RFHHF +G CSCGDYW
Sbjct: 817 LRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CSPI02G07210 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 467.2 bits (1201), Expect = 2.5e-131
Identity = 242/709 (34.13%), Postives = 395/709 (55.71%), Query Frame = 0

Query: 54   FSSNTLHAKMVKIGSIFVSGKFV----LTSYVKSEKLNDAQKLFDEMPNRDVLTWTALIS 113
            F    LHA   K+G  F S   +    L  Y K   +  A   F E    +V+ W  ++ 
Sbjct: 406  FRGQQLHAYTTKLG--FASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLV 465

Query: 114  GFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKL 173
             +  ++    + ++FR+M +E + PN +T  ++LK C ++GD+ +G+ IH  I++   +L
Sbjct: 466  AYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQL 525

Query: 174  DVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLP 233
            +  + + ++D+YAK  +               T  D +I                     
Sbjct: 526  NAYVCSVLIDMYAKLGKL-------------DTAWDILI------------------RFA 585

Query: 234  CRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQ 293
             ++  SW T+I G  Q  + + AL    +M++     +    + A+S  + L  L+ G+Q
Sbjct: 586  GKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQ 645

Query: 294  VHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTE 353
            +H +    G  +D   ++AL+ +Y +CG +E++ + + +      T+ G NI        
Sbjct: 646  IHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ------TEAGDNI-------- 705

Query: 354  IVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHG 413
              + +++V G+ ++G  E+A + FV M RE +  + FT  + V A S    ++ G+QVH 
Sbjct: 706  --AWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHA 765

Query: 414  FIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGK 473
             I KT    +  + ++LI MYAK GS+  A + F +++   N V W ++I   + HG G 
Sbjct: 766  VITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVST-KNEVSWNAIINAYSKHGFGS 825

Query: 474  EAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCM 533
            EA+  F+QM +  + PN VT +GVL+ACSH GL++ G  YF  M   Y + PK EHY C+
Sbjct: 826  EALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCV 885

Query: 534  VDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDE 593
            VD+  RAGLL+  KEFI E  +   + VW+  LS+C +++++E+G++ +  L  L+P+D 
Sbjct: 886  VDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDS 945

Query: 594  GSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHA 653
             +YVLLSN+ + S+KW+     R+ M+  G+ K PGQSWI +KN +HSF  GDQ+HP   
Sbjct: 946  ATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLAD 1005

Query: 654  QIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPI 713
            +I+EY   L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++SL + +PI
Sbjct: 1006 EIHEYFQDLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPI 1064

Query: 714  RIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 759
             +MKNLR+C DCH ++K  S++  REIIVRD YRFHHF  G CSC DYW
Sbjct: 1066 NVMKNLRVCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of CSPI02G07210 vs. TAIR 10
Match: AT1G68930.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 460.7 bits (1184), Expect = 2.3e-129
Identity = 246/687 (35.81%), Postives = 381/687 (55.46%), Query Frame = 0

Query: 76  VLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVE-GVSP 135
           +L +Y K+  +++ +  F+++P+RD +TW  LI G+S     G A++ +  M+ +   + 
Sbjct: 78  LLLAYSKAGLISEMESTFEKLPDRDGVTWNVLIEGYSLSGLVGAAVKAYNTMMRDFSANL 137

Query: 136 NHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLY 195
              TL T+LKL S  G V +GK IHG +++ G +  +++ + +L +YA       A+K++
Sbjct: 138 TRVTLMTMLKLSSSNGHVSLGKQIHGQVIKLGFESYLLVGSPLLYMYANVGCISDAKKVF 197

Query: 196 DSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALE 255
             + +++T   N ++G  +    +  +L LFR +  +++ SW  +I GL Q G    A+E
Sbjct: 198 YGLDDRNTVMYNSLMGGLLACGMIEDALQLFRGME-KDSVSWAAMIKGLAQNGLAKEAIE 257

Query: 256 LLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYI 315
              EM     + + +     L     L  +  G+Q+H  I+R    +  +V SALI+MY 
Sbjct: 258 CFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVGSALIDMYC 317

Query: 316 KCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKTFV 375
           KC  L  A  ++ R+                    +VS ++MV GY + G+ E+A K F+
Sbjct: 318 KCKCLHYAKTVFDRM----------------KQKNVVSWTAMVVGYGQTGRAEEAVKIFL 377

Query: 376 SMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAH---LASSLIDMYA 435
            M R  +  D +T+   +SAC+N   LE G Q HG   K +     H   +++SL+ +Y 
Sbjct: 378 DMQRSGIDPDHYTLGQAISACANVSSLEEGSQFHG---KAITSGLIHYVTVSNSLVTLYG 437

Query: 436 KGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFI 495
           K G +D + R+F++M N  + V WT+M+   A  G   E I+LF++M   G+ P+ VT  
Sbjct: 438 KCGDIDDSTRLFNEM-NVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPDGVTLT 497

Query: 496 GVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDL 555
           GV++ACS AGL+E G  YF +M   Y I P + HY+CM+DL+ R+G L E   FI     
Sbjct: 498 GVISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMRFINGMPF 557

Query: 556 SHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRA 615
              +  W   LS+CR   +LE+GKW +E L  L P     Y LLS++ +   KW+  ++ 
Sbjct: 558 PPDAIGWTTLLSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGKWDSVAQL 617

Query: 616 RRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDV 675
           RR M+   + K PGQSWI  K ++HSF A D+S P   QIY  L++L  ++ + GY  D 
Sbjct: 618 RRGMREKNVKKEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNNKIIDNGYKPDT 677

Query: 676 KLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQL 735
             V  DVEE     +L +HSE+LA+A+G+I + S  PIR+ KNLR+C DCHN  K  S +
Sbjct: 678 SFVHHDVEEAVKVKMLNYHSERLAIAFGLIFVPSGQPIRVGKNLRVCVDCHNATKHISSV 737

Query: 736 LGREIIVRDIYRFHHFNSGHCSCGDYW 759
            GREI+VRD  RFH F  G CSCGD+W
Sbjct: 738 TGREILVRDAVRFHRFKDGTCSCGDFW 743

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LW634.5e-14637.11Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SHZ82.9e-14535.94Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q3E6Q12.6e-13035.65Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9SVP73.5e-13034.13Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... [more]
Q9CAA83.2e-12835.81Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A0A0LKI40.0e+0099.87DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0742... [more]
A0A1S3B4E30.0e+0093.30LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
A0A5D3C6T40.0e+0094.39Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa... [more]
A0A6J1EPP70.0e+0083.71putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita mosc... [more]
A0A6J1KA700.0e+0082.51putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxi... [more]
Match NameE-valueIdentityDescription
XP_011648996.10.0e+0096.80LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
XP_008441858.10.0e+0093.30PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro... [more]
KAA0049879.10.0e+0094.39putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] ... [more]
XP_038889548.10.0e+0087.60putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispid... [more]
KAG7020981.10.0e+0082.32putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
Match NameE-valueIdentityDescription
AT3G23330.13.2e-14737.11Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G22070.12.1e-14635.94pentatricopeptide (PPR) repeat-containing protein [more]
AT1G11290.11.9e-13135.65Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G13650.12.5e-13134.13Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G68930.12.3e-12935.81pentatricopeptide (PPR) repeat-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 417..649
e-value: 2.5E-36
score: 127.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 68..151
e-value: 1.4E-19
score: 72.2
coord: 283..408
e-value: 2.4E-12
score: 48.5
coord: 152..282
e-value: 3.4E-15
score: 57.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 453..487
e-value: 1.9E-5
score: 22.5
coord: 103..135
e-value: 7.6E-7
score: 26.9
coord: 235..264
e-value: 6.2E-5
score: 20.9
coord: 354..384
e-value: 5.5E-5
score: 21.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 424..447
e-value: 0.043
score: 14.1
coord: 525..547
e-value: 0.15
score: 12.4
coord: 235..262
e-value: 1.7E-4
score: 21.6
coord: 175..200
e-value: 0.058
score: 13.7
coord: 307..328
e-value: 0.38
score: 11.1
coord: 352..380
e-value: 1.9E-4
score: 21.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 451..498
e-value: 4.5E-10
score: 39.6
coord: 100..146
e-value: 6.1E-11
score: 42.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 232..266
score: 9.13082
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 100..134
score: 11.651919
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 451..485
score: 11.586152
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 349..383
score: 8.889672
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 624..747
e-value: 1.9E-39
score: 134.4
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 34..739
NoneNo IPR availablePANTHERPTHR24015:SF1922OS07G0239600 PROTEINcoord: 34..739

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G07210.1CSPI02G07210.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding