CSPI07G04370 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI07G04370
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr7: 3310427 .. 3312724 (+)
RNA-Seq ExpressionCSPI07G04370
SyntenyCSPI07G04370
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCAAAGTTTTGAGGTGCCATCGTTTCTACAAAAGTCCGCTCTTTTTTTACTTCCTTGGAAAAACTGAAGACGTTTCTCGTCAAATGATGCTCTTCTTGAAACGCCATTGCAGTAGTAGTTTCACCTCTCAAAATTTCAAATATTCTACTCACCCGTCAATCAAACTGTCTCAAATTCTCCAATTTTGCAAGTCTGGCTTACTCAACGATGCGCTACACCTCTTAAACTCCATTGATTTGTACGATTCTAGAATTAACAAACCACTTCTCTACGCTTCTCTCTTACAAACCTGCATCAAGGTAGATTCCTTCACCCGTGGCCGTCAATTTCATGCCCATGTTGTTAAATCTGGCCTTGAGACTGACCGCTTTGTTGGGAATAGCTTGCTTTCTCTCTACTTTAAATTGGGTTCAGATTCTCTGCTGACTCGAAGAGTATTTGATGGTCTTTTTGTCAAAGATGTGGTGTCTTGGGCATCCATGATTACGGGGTATGTTCGAGAAGGTAAATCTGGAATTGCTATTGAATTGTTTTGGGATATGTTGGATTCGGGGATTGAGCCGAATGGCTTTACTTTATCTGCTGTGATCAAAGCGTGCTCCGAGATTGAGAATTTGGTACTTGGTAAGTGCTTTCATGGGGTTGTTGTAAGGCGTGGATTCGATTCAAATCCTGTCATTTTGAGTTCTTTGATTGATATGTACGGGAGAAACTTTGTGTCAAGTGATGCACGCCAATTGTTTGATGAATTGCTTGAACCAGATCCAGTATGTTGGACAACGGTTATTTCAGCGTTTACAAGGAACGATTTGTATGAGGAAGCATTGGGGTTCTTTTATTTGAAGCATAGAGCTCATAGGTTGTGTCCTGATAATTATACATTTGGAAGTGTATTGACTGCCTGTGGTAATTTGGGGAGGTTGAGGCAAGGTGAAGAGATCCATGCTAAGGTGATTGCTTACGGATTTAGTGGGAATGTGGTGACTGAGAGTAGTCTGGTGGATATGTATGGAAAATGTGGAGCGGTTGAGAAATCTCAACGCTTATTCGATAGAATGTCGAACAGGAACTCAGTTTCATGGTCTGCATTGCTTGCAGTATATTGCCACAATGGTGACTACGAAAAGGCTGTAAACCTTTTCAGAGAGATGAAGGAGGTTGACCTCTACAGTTTTGGGACAGTTATTCGTGCGTGTGCCGGGTTGGCAGCTGTTACTCCTGGGAAGGAGATTCACTGTCAGTATATAAGAAAGGGTGGATGGAGAGATGTCATTGTAGAATCAGCTCTAGTCGACTTGTATGCAAAATGTGGTTGCATTAATTTTGCATATAGAGTCTTTGATCGTATGCCAACAAGAAACTTGATCACGTGGAATTCAATGATTCATGGTTTTGCTCAGAATGGAAGTAGTGGAATTGCTATTCAGATCTTTGAAGCAATGATTAAGGAAGGGATTAAGCCTGATTGTATCAGTTTTATTGGTTTACTTTTTGCTTGTAGTCATACAGGTTTGGTCGATCAAGCACGGCACTACTTTGATCTAATGACTGGGAAATATGGAATTAAACCAGGGGTTGAGCATTATAACTGCATGGTTGATCTTCTAGGCCGTGCCGGGCTGCTAGAAGAAGCTGAGAATTTGATAGAAAATGCAGAATGTAGAAATGATTCGTCTCTTTGGCTGGTTCTTCTAGGAGCTTGCACTACTACATGCACGAACTCTGCTACTGCAGAACGCATTGCCAAGAAGTTGATGGAGCTTGAGCCTCAATGCTATTTAAGTTATGTTCACCTGGCTAATGTTTATAGAGCAGTAGGCCGATGGGATGACGCTGTAAAGGTTAGAGAGTTGATGAAAAACCGACAGCTGAAGAAGATGCCAGGTCAGAGTTGGATGTAAAGAGGTAAATCATGAAGGACAAGGTTATGAGCCTCATATTGAACCTGAATTCAAGACTGATGATGAGTTGACATTGATTATTAAAACATGATCATGTCCTTTTGCTCTTCCAAGATGAAGAAGAACAGCCATGAGAAGCCACTGAATATATCAATGATATACCAACCTGGTTGAAAAGGTTATATAGAGAGAGTTAGTGGATTTTCAGTTAATTGGTTTGGAGAAGAAGAGATAGACTTGCCGGCATTAACAACAAGAAATTGAATGTTTTGTTTTGATTTTTCTTTCTTTTCCCCCTGGAAAAACTAACTTATTTATTTGTTACATGTGTTCTTACCATTCTTTACATCTCTCCATTTCAACAGAAATAAATGGTTGTGGAATCAGCAAGAGCACC

mRNA sequence

GCAAAGTTTTGAGGTGCCATCGTTTCTACAAAAGTCCGCTCTTTTTTTACTTCCTTGGAAAAACTGAAGACGTTTCTCGTCAAATGATGCTCTTCTTGAAACGCCATTGCAGTAGTAGTTTCACCTCTCAAAATTTCAAATATTCTACTCACCCGTCAATCAAACTGTCTCAAATTCTCCAATTTTGCAAGTCTGGCTTACTCAACGATGCGCTACACCTCTTAAACTCCATTGATTTGTACGATTCTAGAATTAACAAACCACTTCTCTACGCTTCTCTCTTACAAACCTGCATCAAGGTAGATTCCTTCACCCGTGGCCGTCAATTTCATGCCCATGTTGTTAAATCTGGCCTTGAGACTGACCGCTTTGTTGGGAATAGCTTGCTTTCTCTCTACTTTAAATTGGGTTCAGATTCTCTGCTGACTCGAAGAGTATTTGATGGTCTTTTTGTCAAAGATGTGGTGTCTTGGGCATCCATGATTACGGGGTATGTTCGAGAAGGTAAATCTGGAATTGCTATTGAATTGTTTTGGGATATGTTGGATTCGGGGATTGAGCCGAATGGCTTTACTTTATCTGCTGTGATCAAAGCGTGCTCCGAGATTGAGAATTTGGTACTTGGTAAGTGCTTTCATGGGGTTGTTGTAAGGCGTGGATTCGATTCAAATCCTGTCATTTTGAGTTCTTTGATTGATATGTACGGGAGAAACTTTGTGTCAAGTGATGCACGCCAATTGTTTGATGAATTGCTTGAACCAGATCCAGTATGTTGGACAACGGTTATTTCAGCGTTTACAAGGAACGATTTGTATGAGGAAGCATTGGGGTTCTTTTATTTGAAGCATAGAGCTCATAGGTTGTGTCCTGATAATTATACATTTGGAAGTGTATTGACTGCCTGTGGTAATTTGGGGAGGTTGAGGCAAGGTGAAGAGATCCATGCTAAGGTGATTGCTTACGGATTTAGTGGGAATGTGGTGACTGAGAGTAGTCTGGTGGATATGTATGGAAAATGTGGAGCGGTTGAGAAATCTCAACGCTTATTCGATAGAATGTCGAACAGGAACTCAGTTTCATGGTCTGCATTGCTTGCAGTATATTGCCACAATGGTGACTACGAAAAGGCTGTAAACCTTTTCAGAGAGATGAAGGAGGTTGACCTCTACAGTTTTGGGACAGTTATTCGTGCGTGTGCCGGGTTGGCAGCTGTTACTCCTGGGAAGGAGATTCACTGTCAGTATATAAGAAAGGGTGGATGGAGAGATGTCATTGTAGAATCAGCTCTAGTCGACTTGTATGCAAAATGTGGTTGCATTAATTTTGCATATAGAGTCTTTGATCGTATGCCAACAAGAAACTTGATCACGTGGAATTCAATGATTCATGGTTTTGCTCAGAATGGAAGTAGTGGAATTGCTATTCAGATCTTTGAAGCAATGATTAAGGAAGGGATTAAGCCTGATTGTATCAGTTTTATTGGTTTACTTTTTGCTTGTAGTCATACAGGTTTGGTCGATCAAGCACGGCACTACTTTGATCTAATGACTGGGAAATATGGAATTAAACCAGGGGTTGAGCATTATAACTGCATGGTTGATCTTCTAGGCCGTGCCGGGCTGCTAGAAGAAGCTGAGAATTTGATAGAAAATGCAGAATGTAGAAATGATTCGTCTCTTTGGCTGGTTCTTCTAGGAGCTTGCACTACTACATGCACGAACTCTGCTACTGCAGAACGCATTGCCAAGAAGTTGATGGAGCTTGAGCCTCAATGCTATTTAAGTTATGTTCACCTGGCTAATGTTTATAGAGCAGTAGGCCGATGGGATGACGCTGTAAAGGTTAGAGAGTTGATGAAAAACCGACAGCTGAAGAAGATGCCAGGTCAGAGTTGGATGTAAAGAGGTAAATCATGAAGGACAAGGTTATGAGCCTCATATTGAACCTGAATTCAAGACTGATGATGAGTTGACATTGATTATTAAAACATGATCATGTCCTTTTGCTCTTCCAAGATGAAGAAGAACAGCCATGAGAAGCCACTGAATATATCAATGATATACCAACCTGGTTGAAAAGGTTATATAGAGAGAGTTAGTGGATTTTCAGTTAATTGGTTTGGAGAAGAAGAGATAGACTTGCCGGCATTAACAACAAGAAATTGAATGTTTTGTTTTGATTTTTCTTTCTTTTCCCCCTGGAAAAACTAACTTATTTATTTGTTACATGTGTTCTTACCATTCTTTACATCTCTCCATTTCAACAGAAATAAATGGTTGTGGAATCAGCAAGAGCACC

Coding sequence (CDS)

ATGATGCTCTTCTTGAAACGCCATTGCAGTAGTAGTTTCACCTCTCAAAATTTCAAATATTCTACTCACCCGTCAATCAAACTGTCTCAAATTCTCCAATTTTGCAAGTCTGGCTTACTCAACGATGCGCTACACCTCTTAAACTCCATTGATTTGTACGATTCTAGAATTAACAAACCACTTCTCTACGCTTCTCTCTTACAAACCTGCATCAAGGTAGATTCCTTCACCCGTGGCCGTCAATTTCATGCCCATGTTGTTAAATCTGGCCTTGAGACTGACCGCTTTGTTGGGAATAGCTTGCTTTCTCTCTACTTTAAATTGGGTTCAGATTCTCTGCTGACTCGAAGAGTATTTGATGGTCTTTTTGTCAAAGATGTGGTGTCTTGGGCATCCATGATTACGGGGTATGTTCGAGAAGGTAAATCTGGAATTGCTATTGAATTGTTTTGGGATATGTTGGATTCGGGGATTGAGCCGAATGGCTTTACTTTATCTGCTGTGATCAAAGCGTGCTCCGAGATTGAGAATTTGGTACTTGGTAAGTGCTTTCATGGGGTTGTTGTAAGGCGTGGATTCGATTCAAATCCTGTCATTTTGAGTTCTTTGATTGATATGTACGGGAGAAACTTTGTGTCAAGTGATGCACGCCAATTGTTTGATGAATTGCTTGAACCAGATCCAGTATGTTGGACAACGGTTATTTCAGCGTTTACAAGGAACGATTTGTATGAGGAAGCATTGGGGTTCTTTTATTTGAAGCATAGAGCTCATAGGTTGTGTCCTGATAATTATACATTTGGAAGTGTATTGACTGCCTGTGGTAATTTGGGGAGGTTGAGGCAAGGTGAAGAGATCCATGCTAAGGTGATTGCTTACGGATTTAGTGGGAATGTGGTGACTGAGAGTAGTCTGGTGGATATGTATGGAAAATGTGGAGCGGTTGAGAAATCTCAACGCTTATTCGATAGAATGTCGAACAGGAACTCAGTTTCATGGTCTGCATTGCTTGCAGTATATTGCCACAATGGTGACTACGAAAAGGCTGTAAACCTTTTCAGAGAGATGAAGGAGGTTGACCTCTACAGTTTTGGGACAGTTATTCGTGCGTGTGCCGGGTTGGCAGCTGTTACTCCTGGGAAGGAGATTCACTGTCAGTATATAAGAAAGGGTGGATGGAGAGATGTCATTGTAGAATCAGCTCTAGTCGACTTGTATGCAAAATGTGGTTGCATTAATTTTGCATATAGAGTCTTTGATCGTATGCCAACAAGAAACTTGATCACGTGGAATTCAATGATTCATGGTTTTGCTCAGAATGGAAGTAGTGGAATTGCTATTCAGATCTTTGAAGCAATGATTAAGGAAGGGATTAAGCCTGATTGTATCAGTTTTATTGGTTTACTTTTTGCTTGTAGTCATACAGGTTTGGTCGATCAAGCACGGCACTACTTTGATCTAATGACTGGGAAATATGGAATTAAACCAGGGGTTGAGCATTATAACTGCATGGTTGATCTTCTAGGCCGTGCCGGGCTGCTAGAAGAAGCTGAGAATTTGATAGAAAATGCAGAATGTAGAAATGATTCGTCTCTTTGGCTGGTTCTTCTAGGAGCTTGCACTACTACATGCACGAACTCTGCTACTGCAGAACGCATTGCCAAGAAGTTGATGGAGCTTGAGCCTCAATGCTATTTAAGTTATGTTCACCTGGCTAATGTTTATAGAGCAGTAGGCCGATGGGATGACGCTGTAAAGGTTAGAGAGTTGATGAAAAACCGACAGCTGAAGAAGATGCCAGGTCAGAGTTGGATGTAA

Protein sequence

MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKPLLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFDGLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRNDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVDLYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMPGQSWM*
Homology
BLAST of CSPI07G04370 vs. ExPASy Swiss-Prot
Match: Q9LR69 (Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E4 PE=2 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 4.3e-208
Identity = 359/611 (58.76%), Postives = 450/611 (73.65%), Query Frame = 0

Query: 2   MLFLKRHCSSSFTSQNFKYSTHPSI------KLSQILQFCKSGLLNDALHLLNSIDLYDS 61
           ++ LKRH      SQ+      PSI      K S+IL+ CK G L +A+ +LNS   + S
Sbjct: 3   LIILKRH-----FSQHASLCLTPSISSSAPTKQSRILELCKLGQLTEAIRILNS--THSS 62

Query: 62  RI-NKPLLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLL 121
            I   P LYASLLQTC KV SF  G QFHAHVVKSGLETDR VGNSLLSLYFKLG     
Sbjct: 63  EIPATPKLYASLLQTCNKVFSFIHGIQFHAHVVKSGLETDRNVGNSLLSLYFKLGPGMRE 122

Query: 122 TRRVFDGLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSE 181
           TRRVFDG FVKD +SW SM++GYV   +   A+E+F +M+  G++ N FTLS+ +KACSE
Sbjct: 123 TRRVFDGRFVKDAISWTSMMSGYVTGKEHVKALEVFVEMVSFGLDANEFTLSSAVKACSE 182

Query: 182 IENLVLGKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTV 241
           +  + LG+CFHGVV+  GF+ N  I S+L  +YG N    DAR++FDE+ EPD +CWT V
Sbjct: 183 LGEVRLGRCFHGVVITHGFEWNHFISSTLAYLYGVNREPVDARRVFDEMPEPDVICWTAV 242

Query: 242 ISAFTRNDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYG 301
           +SAF++NDLYEEALG FY  HR   L PD  TFG+VLTACGNL RL+QG+EIH K+I  G
Sbjct: 243 LSAFSKNDLYEEALGLFYAMHRGKGLVPDGSTFGTVLTACGNLRRLKQGKEIHGKLITNG 302

Query: 302 FSGNVVTESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFR 361
              NVV ESSL+DMYGKCG+V +++++F+ MS +NSVSWSALL  YC NG++EKA+ +FR
Sbjct: 303 IGSNVVVESSLLDMYGKCGSVREARQVFNGMSKKNSVSWSALLGGYCQNGEHEKAIEIFR 362

Query: 362 EMKEVDLYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINF 421
           EM+E DLY FGTV++ACAGLAAV  GKEIH QY+R+G + +VIVESAL+DLY K GCI+ 
Sbjct: 363 EMEEKDLYCFGTVLKACAGLAAVRLGKEIHGQYVRRGCFGNVIVESALIDLYGKSGCIDS 422

Query: 422 AYRVFDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSH 481
           A RV+ +M  RN+ITWN+M+   AQNG    A+  F  M+K+GIKPD ISFI +L AC H
Sbjct: 423 ASRVYSKMSIRNMITWNAMLSALAQNGRGEEAVSFFNDMVKKGIKPDYISFIAILTACGH 482

Query: 482 TGLVDQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWL 541
           TG+VD+ R+YF LM   YGIKPG EHY+CM+DLLGRAGL EEAENL+E AECRND+SLW 
Sbjct: 483 TGMVDEGRNYFVLMAKSYGIKPGTEHYSCMIDLLGRAGLFEEAENLLERAECRNDASLWG 542

Query: 542 VLLGACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNR 601
           VLLG C      S  AERIAK++MELEP+ ++SYV L+N+Y+A+GR  DA+ +R+LM  R
Sbjct: 543 VLLGPCAANADASRVAERIAKRMMELEPKYHMSYVLLSNMYKAIGRHGDALNIRKLMVRR 602

Query: 602 QLKKMPGQSWM 606
            + K  GQSW+
Sbjct: 603 GVAKTVGQSWI 606

BLAST of CSPI07G04370 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 385.2 bits (988), Expect = 1.4e-105
Identity = 201/545 (36.88%), Postives = 322/545 (59.08%), Query Frame = 0

Query: 66  LLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFDGLFVK 125
           +L T +KVDS   G+Q H   +K GL+    V NSL+++Y KL       R VFD +  +
Sbjct: 321 MLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFG-FARTVFDNMSER 380

Query: 126 DVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEI-ENLVLGKCF 185
           D++SW S+I G  + G    A+ LF  +L  G++P+ +T+++V+KA S + E L L K  
Sbjct: 381 DLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQV 440

Query: 186 HGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRNDLY 245
           H   ++    S+  + ++LID Y RN    +A  LF E    D V W  +++ +T++   
Sbjct: 441 HVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILF-ERHNFDLVAWNAMMAGYTQSHDG 500

Query: 246 EEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESS 305
            + L  F L H+      D++T  +V   CG L  + QG+++HA  I  G+  ++   S 
Sbjct: 501 HKTLKLFALMHKQGER-SDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSG 560

Query: 306 LVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEV----D 365
           ++DMY KCG +  +Q  FD +   + V+W+ +++    NG+ E+A ++F +M+ +    D
Sbjct: 561 ILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPD 620

Query: 366 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 425
            ++  T+ +A + L A+  G++IH   ++     D  V ++LVD+YAKCG I+ AY +F 
Sbjct: 621 EFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFK 680

Query: 426 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 485
           R+   N+  WN+M+ G AQ+G     +Q+F+ M   GIKPD ++FIG+L ACSH+GLV +
Sbjct: 681 RIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSE 740

Query: 486 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 545
           A  +   M G YGIKP +EHY+C+ D LGRAGL+++AENLIE+      +S++  LL AC
Sbjct: 741 AYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAAC 800

Query: 546 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 605
                ++ T +R+A KL+ELEP    +YV L+N+Y A  +WD+    R +MK  ++KK P
Sbjct: 801 RVQ-GDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDP 860

BLAST of CSPI07G04370 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 2.6e-104
Identity = 209/616 (33.93%), Postives = 335/616 (54.38%), Query Frame = 0

Query: 63  YASLLQTCIKVD-SFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGS--------DSL 122
           +A LL +CIK   S    R  HA V+KSG   + F+ N L+  Y K GS        D +
Sbjct: 22  FAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKM 81

Query: 123 LTRRVF------------------DGLF----VKDVVSWASMITGYVREGKSGIAIELFW 182
             R ++                  D LF     +D  +W SM++G+ +  +   A+  F 
Sbjct: 82  PQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFA 141

Query: 183 DMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSNPVILSSLIDMYGRNF 242
            M   G   N ++ ++V+ ACS + ++  G   H ++ +  F S+  I S+L+DMY +  
Sbjct: 142 MMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCG 201

Query: 243 VSSDARQLFDELLEPDPVCWTTVISAFTRNDLYEEALGFFYLKHRAHRLCPDNYTFGSVL 302
             +DA+++FDE+ + + V W ++I+ F +N    EAL  F +   + R+ PD  T  SV+
Sbjct: 202 NVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLES-RVEPDEVTLASVI 261

Query: 303 TACGNLGRLRQGEEIHAKVIAYG-FSGNVVTESSLVDMYGKCGAVEKSQRLFD------- 362
           +AC +L  ++ G+E+H +V+       +++  ++ VDMY KC  +++++ +FD       
Sbjct: 262 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNV 321

Query: 363 ------------------------RMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEV 422
                                   +M+ RN VSW+AL+A Y  NG+ E+A++LF  +K  
Sbjct: 322 IAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRE 381

Query: 423 DL----YSFGTVIRACAGLAAVTPGKEIHCQYIR------KGGWRDVIVESALVDLYAKC 482
            +    YSF  +++ACA LA +  G + H   ++       G   D+ V ++L+D+Y KC
Sbjct: 382 SVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKC 441

Query: 483 GCINFAYRVFDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLL 542
           GC+   Y VF +M  R+ ++WN+MI GFAQNG    A+++F  M++ G KPD I+ IG+L
Sbjct: 442 GCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVL 501

Query: 543 FACSHTGLVDQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRND 602
            AC H G V++ RHYF  MT  +G+ P  +HY CMVDLLGRAG LEEA+++IE    + D
Sbjct: 502 SACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPD 561

Query: 603 SSLWLVLLGACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRE 606
           S +W  LL AC     N    + +A+KL+E+EP     YV L+N+Y  +G+W+D + VR+
Sbjct: 562 SVIWGSLLAACKVH-RNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRK 621

BLAST of CSPI07G04370 vs. ExPASy Swiss-Prot
Match: Q9FWA6 (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 370.2 bits (949), Expect = 4.6e-101
Identity = 206/607 (33.94%), Postives = 331/607 (54.53%), Query Frame = 0

Query: 25  SIKLSQILQFC-KSGLLNDALHLLNSIDLYDSRINKPLLYASLLQTCIKVDSFTRGRQFH 84
           S+  S I+  C ++ LL+ AL     +   ++ +++  +YAS+L++C  +     G Q H
Sbjct: 246 SVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQS-IYASVLRSCAALSELRLGGQLH 305

Query: 85  AHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRV-FDGLFVKDVVSWASMITGYVREGK 144
           AH +KS    D  V  + L +Y K   D++   ++ FD     +  S+ +MITGY +E  
Sbjct: 306 AHALKSDFAADGIVRTATLDMYAK--CDNMQDAQILFDNSENLNRQSYNAMITGYSQEEH 365

Query: 145 SGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSNPVILSS 204
              A+ LF  ++ SG+  +  +LS V +AC+ ++ L  G   +G+ ++     +  + ++
Sbjct: 366 GFKALLLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANA 425

Query: 205 LIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRNDLYEEALGFFYLKHRAHRLCP 264
            IDMYG+    ++A ++FDE+   D V W  +I+A  +N    E L F ++     R+ P
Sbjct: 426 AIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETL-FLFVSMLRSRIEP 485

Query: 265 DNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESSLVDMYGKCGAVEKSQRLF 324
           D +TFGS+L AC   G L  G EIH+ ++  G + N     SL+DMY KCG +E+++++ 
Sbjct: 486 DEFTFGSILKACTG-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIH 545

Query: 325 DRMSNRNS--------------------VSWSALLAVYCHNGDYEKAVNLFREMKEV--- 384
            R   R +                    VSW+++++ Y      E A  LF  M E+   
Sbjct: 546 SRFFQRANVSGTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGIT 605

Query: 385 -DLYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRV 444
            D +++ TV+  CA LA+   GK+IH Q I+K    DV + S LVD+Y+KCG ++ +  +
Sbjct: 606 PDKFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLM 665

Query: 445 FDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLV 504
           F++   R+ +TWN+MI G+A +G    AIQ+FE MI E IKP+ ++FI +L AC+H GL+
Sbjct: 666 FEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLI 725

Query: 505 DQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLG 564
           D+   YF +M   YG+ P + HY+ MVD+LG++G ++ A  LI       D  +W  LLG
Sbjct: 726 DKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLG 785

Query: 565 ACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKK 606
            CT    N   AE     L+ L+PQ   +Y  L+NVY   G W+    +R  M+  +LKK
Sbjct: 786 VCTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKK 845

BLAST of CSPI07G04370 vs. ExPASy Swiss-Prot
Match: Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 367.9 bits (943), Expect = 2.3e-100
Identity = 194/538 (36.06%), Postives = 318/538 (59.11%), Query Frame = 0

Query: 78  RGRQFHAHVVKSGLETDRF-VGNSLLSLYFKLGSDSLLTRRVFDGLFVKDVVSWASMITG 137
           +GR+ H HV+ +GL      +GN L+++Y K GS +   RRVF  +  KD VSW SMITG
Sbjct: 331 KGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIA-DARRVFYFMTDKDSVSWNSMITG 390

Query: 138 YVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSN 197
             + G    A+E +  M    I P  FTL + + +C+ ++   LG+  HG  ++ G D N
Sbjct: 391 LDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLN 450

Query: 198 PVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRND--LYEEALGFFYLK 257
             + ++L+ +Y      ++ R++F  + E D V W ++I A  R++  L E  + F   +
Sbjct: 451 VSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQ 510

Query: 258 HRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESSLVDMYGKCGA 317
               +L  +  TF SVL+A  +L     G++IH   +    +    TE++L+  YGKCG 
Sbjct: 511 RAGQKL--NRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGE 570

Query: 318 VEKSQRLFDRMS-NRNSVSWSALLAVYCHNGDYEKAVNLFREM----KEVDLYSFGTVIR 377
           ++  +++F RM+  R++V+W+++++ Y HN    KA++L   M    + +D + + TV+ 
Sbjct: 571 MDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLS 630

Query: 378 ACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFDRMPTRNLIT 437
           A A +A +  G E+H   +R     DV+V SALVD+Y+KCG +++A R F+ MP RN  +
Sbjct: 631 AFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYS 690

Query: 438 WNSMIHGFAQNGSSGIAIQIFEAMIKEG-IKPDCISFIGLLFACSHTGLVDQARHYFDLM 497
           WNSMI G+A++G    A+++FE M  +G   PD ++F+G+L ACSH GL+++   +F+ M
Sbjct: 691 WNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESM 750

Query: 498 TGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGACTTTCTNSA 557
           +  YG+ P +EH++CM D+LGRAG L++ E+ IE    + +  +W  +LGAC       A
Sbjct: 751 SDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKA 810

Query: 558 -TAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMPGQSWM 606
              ++ A+ L +LEP+  ++YV L N+Y A GRW+D VK R+ MK+  +KK  G SW+
Sbjct: 811 ELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWV 865

BLAST of CSPI07G04370 vs. ExPASy TrEMBL
Match: A0A0A0K5P0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G047230 PE=4 SV=1)

HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 603/605 (99.67%), Postives = 603/605 (99.67%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP
Sbjct: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD
Sbjct: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           GKCFHGVVVRRGFDSNPVILSSLIDMYGRN VSSDARQLFDELLEPDPVCWTTVISAFTR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNSVSSDARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV
Sbjct: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
           TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD
Sbjct: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD
Sbjct: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ
Sbjct: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
           ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC
Sbjct: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP
Sbjct: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 605

BLAST of CSPI07G04370 vs. ExPASy TrEMBL
Match: A0A5A7SR37 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003650 PE=4 SV=1)

HSP 1 Score: 1148.3 bits (2969), Expect = 0.0e+00
Identity = 553/605 (91.40%), Postives = 578/605 (95.54%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           MMLF KRHC+SSFTSQNFKYSTH S KL QILQFCKSGLLNDALH+LNS+DLYDSRINKP
Sbjct: 1   MMLFFKRHCTSSFTSQNFKYSTHLSNKLFQILQFCKSGLLNDALHILNSVDLYDSRINKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           LLYASLLQTC KVDSF+ G QFHAHVVKSGLETDRFVGNSLLSLYFKLGS+ LLTRRVFD
Sbjct: 61  LLYASLLQTCTKVDSFSSGCQFHAHVVKSGLETDRFVGNSLLSLYFKLGSNCLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSWASMITGYVREGKSG+AIELFWDMLDSGIEPN FTLS VIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGMAIELFWDMLDSGIEPNDFTLSTVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           GKCFHGVVVRRGFDSNPVILSSLIDMYGRN++SS+ARQLFDELLEPDPVCWTTVISAFTR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNYLSSEARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           ND YEEALGFFYL HRA+RL PDNYTFGSVLTACGNLGRL+QGEEIHAKVIAYGF GNVV
Sbjct: 241 NDFYEEALGFFYLMHRAYRLSPDNYTFGSVLTACGNLGRLKQGEEIHAKVIAYGFGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAVEKSQR+FDRMSNRNSVSWSALLAVYC NGD+EK V+LFREMK+VD
Sbjct: 301 VESSLVDMYGKCGAVEKSQRVFDRMSNRNSVSWSALLAVYCQNGDFEKVVSLFREMKKVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTV+RACAGLAAV PGKE+HCQYIRKGGWRDVIVESALVDLYAKCG I+FAYR+F+
Sbjct: 361 LYSFGTVLRACAGLAAVAPGKEVHCQYIRKGGWRDVIVESALVDLYAKCGSIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           RMPTRNLITWN+MIHGFAQNG S IAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ
Sbjct: 421 RMPTRNLITWNAMIHGFAQNGRSEIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
           ARHYFDLMTG+YGIKPG+EHYNCMVDLLGRAGLLEEAENLIENA+CRNDS+LWLVLLGA 
Sbjct: 481 ARHYFDLMTGEYGIKPGIEHYNCMVDLLGRAGLLEEAENLIENADCRNDSALWLVLLGAS 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           T T TNSA AERIAKKLMELEPQCYLSYVHLAN YRAVGRWDDAVKVRELMKNRQLKKMP
Sbjct: 541 TATWTNSAIAERIAKKLMELEPQCYLSYVHLANFYRAVGRWDDAVKVRELMKNRQLKKMP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 605

BLAST of CSPI07G04370 vs. ExPASy TrEMBL
Match: A0A1S3C0U7 (pentatricopeptide repeat-containing protein At1g03540 OS=Cucumis melo OX=3656 GN=LOC103495534 PE=4 SV=1)

HSP 1 Score: 1148.3 bits (2969), Expect = 0.0e+00
Identity = 553/605 (91.40%), Postives = 578/605 (95.54%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           MMLF KRHC+SSFTSQNFKYSTH S KL QILQFCKSGLLNDALH+LNS+DLYDSRINKP
Sbjct: 1   MMLFFKRHCTSSFTSQNFKYSTHLSNKLFQILQFCKSGLLNDALHILNSVDLYDSRINKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           LLYASLLQTC KVDSF+ G QFHAHVVKSGLETDRFVGNSLLSLYFKLGS+ LLTRRVFD
Sbjct: 61  LLYASLLQTCTKVDSFSSGCQFHAHVVKSGLETDRFVGNSLLSLYFKLGSNCLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSWASMITGYVREGKSG+AIELFWDMLDSGIEPN FTLS VIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGMAIELFWDMLDSGIEPNDFTLSTVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           GKCFHGVVVRRGFDSNPVILSSLIDMYGRN++SS+ARQLFDELLEPDPVCWTTVISAFTR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNYLSSEARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           ND YEEALGFFYL HRA+RL PDNYTFGSVLTACGNLGRL+QGEEIHAKVIAYGF GNVV
Sbjct: 241 NDFYEEALGFFYLMHRAYRLSPDNYTFGSVLTACGNLGRLKQGEEIHAKVIAYGFGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAVEKSQR+FDRMSNRNSVSWSALLAVYC NGD+EK V+LFREMK+VD
Sbjct: 301 VESSLVDMYGKCGAVEKSQRVFDRMSNRNSVSWSALLAVYCQNGDFEKVVSLFREMKKVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTV+RACAGLAAV PGKE+HCQYIRKGGWRDVIVESALVDLYAKCG I+FAYR+F+
Sbjct: 361 LYSFGTVLRACAGLAAVAPGKEVHCQYIRKGGWRDVIVESALVDLYAKCGSIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           RMPTRNLITWN+MIHGFAQNG S IAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ
Sbjct: 421 RMPTRNLITWNAMIHGFAQNGRSEIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
           ARHYFDLMTG+YGIKPG+EHYNCMVDLLGRAGLLEEAENLIENA+CRNDS+LWLVLLGA 
Sbjct: 481 ARHYFDLMTGEYGIKPGIEHYNCMVDLLGRAGLLEEAENLIENADCRNDSALWLVLLGAS 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           T T TNSA AERIAKKLMELEPQCYLSYVHLAN YRAVGRWDDAVKVRELMKNRQLKKMP
Sbjct: 541 TATWTNSAIAERIAKKLMELEPQCYLSYVHLANFYRAVGRWDDAVKVRELMKNRQLKKMP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 605

BLAST of CSPI07G04370 vs. ExPASy TrEMBL
Match: A0A6J1D3F1 (pentatricopeptide repeat-containing protein At1g03540 OS=Momordica charantia OX=3673 GN=LOC111016671 PE=4 SV=1)

HSP 1 Score: 1041.2 bits (2691), Expect = 1.7e-300
Identity = 499/605 (82.48%), Postives = 548/605 (90.58%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           M LF KRHC SSFT  N K  T+ S K SQ+LQ C+SGLL+DALH+LNS+DL+D+  NKP
Sbjct: 1   MRLFFKRHC-SSFTFHNLKNFTYASTKGSQVLQHCRSGLLHDALHILNSVDLFDTATNKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           +LYASLLQTCIKV SF+ GRQ HAHVVKSGLETDRFVGNSLLSLYFKLGSD LLTRRVFD
Sbjct: 61  ILYASLLQTCIKVASFSHGRQIHAHVVKSGLETDRFVGNSLLSLYFKLGSDYLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSW SMITGYVREGK G AIELFWDMLD GIEPNGFT+SAVIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWTSMITGYVREGKPGNAIELFWDMLDLGIEPNGFTISAVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           G+CFHG+V+R GFDSN VI+SSLIDMYGRN  S+DARQLFDELLEPD +CWT+VISAFTR
Sbjct: 181 GRCFHGLVLRHGFDSNHVIVSSLIDMYGRNCASNDARQLFDELLEPDAICWTSVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYEEALGFFY   R +RL PD +TFG+VLTACGNLGRLRQGEE+HAKVIA+G  GNVV
Sbjct: 241 NDLYEEALGFFYSMQRTYRLSPDGFTFGTVLTACGNLGRLRQGEEVHAKVIAHGLGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAV+KSQ +FDRMS RNSVSWSALL VYC NGD+E  +NLFREM+EVD
Sbjct: 301 VESSLVDMYGKCGAVDKSQLVFDRMSRRNSVSWSALLGVYCQNGDFEMVINLFREMEEVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVT GKE+HCQY+RKGGWRDVIVESALVDLYAKCGCI+FAYR+F+
Sbjct: 361 LYSFGTVIRACAGLAAVTQGKEVHCQYVRKGGWRDVIVESALVDLYAKCGCIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           +MP++NLITWNSMI GFAQNG  GIA+QIFE MIKEGIKPD ISFIG+LFACSHTGLVDQ
Sbjct: 421 QMPSKNLITWNSMIRGFAQNGQGGIALQIFEEMIKEGIKPDYISFIGVLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
            RHYF LMT +YGIKPG+EHYNCMVDLLGR GLLEEAENLIENA+CRNDSSLW VLLGAC
Sbjct: 481 GRHYFALMTEQYGIKPGIEHYNCMVDLLGRTGLLEEAENLIENADCRNDSSLWQVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
            TTCTNSATAERIAKK+MELEP+ +LSYV LANVYRAVGRWDDA+K+R+LMKNRQ+KKMP
Sbjct: 541 -TTCTNSATAERIAKKMMELEPRHHLSYVLLANVYRAVGRWDDALKIRKLMKNRQVKKMP 600

Query: 601 GQSWM 606
           GQSW+
Sbjct: 601 GQSWI 603

BLAST of CSPI07G04370 vs. ExPASy TrEMBL
Match: A0A6J1JMC8 (pentatricopeptide repeat-containing protein At1g03540 OS=Cucurbita maxima OX=3661 GN=LOC111485968 PE=4 SV=1)

HSP 1 Score: 1019.2 bits (2634), Expect = 7.0e-294
Identity = 492/605 (81.32%), Postives = 541/605 (89.42%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           M LF+KRHC  SFTSQN K STHP  K SQILQFC SGLL+DALH LNS+D ++S  NK 
Sbjct: 1   MRLFIKRHC-RSFTSQNLKNSTHPPTKESQILQFCGSGLLHDALHTLNSLDSFNSTTNKS 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           +LYASLLQTC KV SF+ GRQ HAHV+KSGLETDRFVGNSLLSLYFKLGSD  LTRRVFD
Sbjct: 61  ILYASLLQTCTKVASFSHGRQIHAHVLKSGLETDRFVGNSLLSLYFKLGSDFRLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSW SMIT YVREGK G AIE FWDMLD GIEPNGFTLSAVIKACSEI NL+L
Sbjct: 121 GLFVKDVVSWTSMITSYVREGKPGNAIEFFWDMLDLGIEPNGFTLSAVIKACSEIGNLIL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           G+CFHG+VVR GF+SN VI+SSLIDMYGRNF SSDARQLFDE+ EPD +CWT+VISA TR
Sbjct: 181 GRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYE+ALGFFYL  R + L PD +TFGSVLTAC NLGRLRQGEE+HAKVIA+G  GNVV
Sbjct: 241 NDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGA+EKSQ +FDRMS RNSVSWSALL VYC NGD+EK +N+FR M+++D
Sbjct: 301 VESSLVDMYGKCGAIEKSQLVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKID 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVT GKE+HCQY+RKGGWRDVIVESALVDLYAKCGCI+FAYR+F+
Sbjct: 361 LYSFGTVIRACAGLAAVTQGKEVHCQYVRKGGWRDVIVESALVDLYAKCGCIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           RMPTRNLITWNSMI GFAQNG SGI+I+IFE MIKEGIKPD ISFIG+LFACSHTGLVDQ
Sbjct: 421 RMPTRNLITWNSMIRGFAQNGRSGISIEIFEEMIKEGIKPDYISFIGVLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
            RHYF LMT +YGIKPG+EHYNCMVDLLGRAGLLEEAENLIENA+ R+DSSLW VLLGAC
Sbjct: 481 GRHYFVLMTEEYGIKPGIEHYNCMVDLLGRAGLLEEAENLIENADFRSDSSLWQVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           TT+ TNS TAERIAKK+MELEPQ +LSYV LANVYRAVGRWDDA+ VR+LMK+RQ+KK+P
Sbjct: 541 TTS-TNSGTAERIAKKMMELEPQHHLSYVLLANVYRAVGRWDDALTVRKLMKSRQVKKVP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 603

BLAST of CSPI07G04370 vs. NCBI nr
Match: XP_004137012.1 (pentatricopeptide repeat-containing protein At1g03540 [Cucumis sativus] >KGN43587.1 hypothetical protein Csa_020336 [Cucumis sativus])

HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 603/605 (99.67%), Postives = 603/605 (99.67%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP
Sbjct: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD
Sbjct: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           GKCFHGVVVRRGFDSNPVILSSLIDMYGRN VSSDARQLFDELLEPDPVCWTTVISAFTR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNSVSSDARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV
Sbjct: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
           TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD
Sbjct: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD
Sbjct: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ
Sbjct: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
           ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC
Sbjct: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP
Sbjct: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 605

BLAST of CSPI07G04370 vs. NCBI nr
Match: XP_008455346.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Cucumis melo] >KAA0031595.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK07047.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1148.3 bits (2969), Expect = 0.0e+00
Identity = 553/605 (91.40%), Postives = 578/605 (95.54%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           MMLF KRHC+SSFTSQNFKYSTH S KL QILQFCKSGLLNDALH+LNS+DLYDSRINKP
Sbjct: 1   MMLFFKRHCTSSFTSQNFKYSTHLSNKLFQILQFCKSGLLNDALHILNSVDLYDSRINKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           LLYASLLQTC KVDSF+ G QFHAHVVKSGLETDRFVGNSLLSLYFKLGS+ LLTRRVFD
Sbjct: 61  LLYASLLQTCTKVDSFSSGCQFHAHVVKSGLETDRFVGNSLLSLYFKLGSNCLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSWASMITGYVREGKSG+AIELFWDMLDSGIEPN FTLS VIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGMAIELFWDMLDSGIEPNDFTLSTVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           GKCFHGVVVRRGFDSNPVILSSLIDMYGRN++SS+ARQLFDELLEPDPVCWTTVISAFTR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNYLSSEARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           ND YEEALGFFYL HRA+RL PDNYTFGSVLTACGNLGRL+QGEEIHAKVIAYGF GNVV
Sbjct: 241 NDFYEEALGFFYLMHRAYRLSPDNYTFGSVLTACGNLGRLKQGEEIHAKVIAYGFGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAVEKSQR+FDRMSNRNSVSWSALLAVYC NGD+EK V+LFREMK+VD
Sbjct: 301 VESSLVDMYGKCGAVEKSQRVFDRMSNRNSVSWSALLAVYCQNGDFEKVVSLFREMKKVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTV+RACAGLAAV PGKE+HCQYIRKGGWRDVIVESALVDLYAKCG I+FAYR+F+
Sbjct: 361 LYSFGTVLRACAGLAAVAPGKEVHCQYIRKGGWRDVIVESALVDLYAKCGSIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           RMPTRNLITWN+MIHGFAQNG S IAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ
Sbjct: 421 RMPTRNLITWNAMIHGFAQNGRSEIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
           ARHYFDLMTG+YGIKPG+EHYNCMVDLLGRAGLLEEAENLIENA+CRNDS+LWLVLLGA 
Sbjct: 481 ARHYFDLMTGEYGIKPGIEHYNCMVDLLGRAGLLEEAENLIENADCRNDSALWLVLLGAS 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           T T TNSA AERIAKKLMELEPQCYLSYVHLAN YRAVGRWDDAVKVRELMKNRQLKKMP
Sbjct: 541 TATWTNSAIAERIAKKLMELEPQCYLSYVHLANFYRAVGRWDDAVKVRELMKNRQLKKMP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 605

BLAST of CSPI07G04370 vs. NCBI nr
Match: XP_038888158.1 (pentatricopeptide repeat-containing protein At1g03540 [Benincasa hispida])

HSP 1 Score: 1062.4 bits (2746), Expect = 1.5e-306
Identity = 517/605 (85.45%), Postives = 553/605 (91.40%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           M LF KRH  SSFTSQN K S+HP  K SQILQFC+SGLL+DALH+LNSIDL++S  NKP
Sbjct: 1   MRLFFKRHW-SSFTSQNLKNSSHPLNKQSQILQFCRSGLLHDALHILNSIDLFNSITNKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           ++YASLLQTC KV SF+ G Q HA VVKSGLETDRFVGNSLLSLYFKLGSD LLTRRVFD
Sbjct: 61  IVYASLLQTCTKVASFSHGCQIHAQVVKSGLETDRFVGNSLLSLYFKLGSDLLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSW SMITGYVREGKSG AIELFWDMLD GI+PN FTLSAVIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWTSMITGYVREGKSGNAIELFWDMLDWGIQPNSFTLSAVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           GKCFHGVV+R GFDSN VI+SSLIDMYGRN+VSSDARQLFDELLEPD +CWT+VISAFTR
Sbjct: 181 GKCFHGVVIRHGFDSNHVIVSSLIDMYGRNYVSSDARQLFDELLEPDAICWTSVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYEEALGFFY   RA+RL PD YTFG+VLTACGNLGRLRQGEE+HAKVIAYG  GNVV
Sbjct: 241 NDLYEEALGFFYFMQRAYRLSPDGYTFGTVLTACGNLGRLRQGEEVHAKVIAYGLGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAVEKS+R+FDRMS RNSVSWSALL VYC NGD+EK VNLFREMKEVD
Sbjct: 301 VESSLVDMYGKCGAVEKSRRVFDRMSKRNSVSWSALLGVYCQNGDFEKVVNLFREMKEVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVT GKE+HCQY+RKGGWRDVIVESALVDLYAKCGCI+FAYR+F+
Sbjct: 361 LYSFGTVIRACAGLAAVTQGKEVHCQYVRKGGWRDVIVESALVDLYAKCGCIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           +MPTRNLITWNSMI G AQNG  GIAIQIFE MIKEGIKPD ISFIG+LFACSHTGLVDQ
Sbjct: 421 QMPTRNLITWNSMIGGLAQNGRGGIAIQIFEEMIKEGIKPDYISFIGVLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
            RHYF LMTG+YGIKPGVEHYNCMVDLLGRAGLLEEAENLIENA+CRN+SSLW VLLGAC
Sbjct: 481 GRHYFALMTGEYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENADCRNNSSLWQVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
            TTCTNSATAERIAKK+MELEPQ +LSYV LANVYRAVGRWDDA+K+R LMKNRQ+KKMP
Sbjct: 541 -TTCTNSATAERIAKKMMELEPQHHLSYVLLANVYRAVGRWDDALKIRNLMKNRQVKKMP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 603

BLAST of CSPI07G04370 vs. NCBI nr
Match: XP_022147827.1 (pentatricopeptide repeat-containing protein At1g03540 [Momordica charantia])

HSP 1 Score: 1041.2 bits (2691), Expect = 3.5e-300
Identity = 499/605 (82.48%), Postives = 548/605 (90.58%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           M LF KRHC SSFT  N K  T+ S K SQ+LQ C+SGLL+DALH+LNS+DL+D+  NKP
Sbjct: 1   MRLFFKRHC-SSFTFHNLKNFTYASTKGSQVLQHCRSGLLHDALHILNSVDLFDTATNKP 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           +LYASLLQTCIKV SF+ GRQ HAHVVKSGLETDRFVGNSLLSLYFKLGSD LLTRRVFD
Sbjct: 61  ILYASLLQTCIKVASFSHGRQIHAHVVKSGLETDRFVGNSLLSLYFKLGSDYLLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSW SMITGYVREGK G AIELFWDMLD GIEPNGFT+SAVIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWTSMITGYVREGKPGNAIELFWDMLDLGIEPNGFTISAVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           G+CFHG+V+R GFDSN VI+SSLIDMYGRN  S+DARQLFDELLEPD +CWT+VISAFTR
Sbjct: 181 GRCFHGLVLRHGFDSNHVIVSSLIDMYGRNCASNDARQLFDELLEPDAICWTSVISAFTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYEEALGFFY   R +RL PD +TFG+VLTACGNLGRLRQGEE+HAKVIA+G  GNVV
Sbjct: 241 NDLYEEALGFFYSMQRTYRLSPDGFTFGTVLTACGNLGRLRQGEEVHAKVIAHGLGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAV+KSQ +FDRMS RNSVSWSALL VYC NGD+E  +NLFREM+EVD
Sbjct: 301 VESSLVDMYGKCGAVDKSQLVFDRMSRRNSVSWSALLGVYCQNGDFEMVINLFREMEEVD 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVT GKE+HCQY+RKGGWRDVIVESALVDLYAKCGCI+FAYR+F+
Sbjct: 361 LYSFGTVIRACAGLAAVTQGKEVHCQYVRKGGWRDVIVESALVDLYAKCGCIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           +MP++NLITWNSMI GFAQNG  GIA+QIFE MIKEGIKPD ISFIG+LFACSHTGLVDQ
Sbjct: 421 QMPSKNLITWNSMIRGFAQNGQGGIALQIFEEMIKEGIKPDYISFIGVLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
            RHYF LMT +YGIKPG+EHYNCMVDLLGR GLLEEAENLIENA+CRNDSSLW VLLGAC
Sbjct: 481 GRHYFALMTEQYGIKPGIEHYNCMVDLLGRTGLLEEAENLIENADCRNDSSLWQVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
            TTCTNSATAERIAKK+MELEP+ +LSYV LANVYRAVGRWDDA+K+R+LMKNRQ+KKMP
Sbjct: 541 -TTCTNSATAERIAKKMMELEPRHHLSYVLLANVYRAVGRWDDALKIRKLMKNRQVKKMP 600

Query: 601 GQSWM 606
           GQSW+
Sbjct: 601 GQSWI 603

BLAST of CSPI07G04370 vs. NCBI nr
Match: XP_023529479.1 (pentatricopeptide repeat-containing protein At1g03540 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1020.8 bits (2638), Expect = 4.9e-294
Identity = 494/605 (81.65%), Postives = 540/605 (89.26%), Query Frame = 0

Query: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60
           M LF+KRHC  SFTSQN K STHP  K SQILQFC S LL+DALH LNS+D +DS  NK 
Sbjct: 1   MRLFIKRHC-RSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKS 60

Query: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120
           +LYASLLQTC KV SFT GRQ HAHV+KSGLE DRFVGNSLLSLYFKLGSD  LTRRVFD
Sbjct: 61  ILYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFD 120

Query: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVL 180
           GLFVKDVVSW SMIT YVREGK G AIELFWDMLD GIEPNGFTLSAVIKACSEI NLVL
Sbjct: 121 GLFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVL 180

Query: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTR 240
           G+CFHG+VVR GF+SN VI+SSLIDMYGRNF SSDARQLFDE+ EPD +CWT+VISA TR
Sbjct: 181 GRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTR 240

Query: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300
           NDLYE+ALGFFYL  R + L PD +TFGSVLTAC NLGRLRQGEE+HAKVIA+G  GNVV
Sbjct: 241 NDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVV 300

Query: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360
            ESSLVDMYGKCGAVEKSQR+FDRMS RNSVSWSALL VYC NGD+EK +N+FR M+++D
Sbjct: 301 VESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKID 360

Query: 361 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 420
           LYSFGTVIRACAGLAAVT GKE+HCQY+RKGGWRDVIVESALVDLYAKCGCI+FAYR+F+
Sbjct: 361 LYSFGTVIRACAGLAAVTQGKEVHCQYVRKGGWRDVIVESALVDLYAKCGCIDFAYRIFE 420

Query: 421 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 480
           +MPTRNLITWNSMI GFAQNG SGI+I+IFE MIKEGIKPD ISFIG+LFACSHTGLVDQ
Sbjct: 421 QMPTRNLITWNSMIRGFAQNGRSGISIEIFEEMIKEGIKPDYISFIGVLFACSHTGLVDQ 480

Query: 481 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 540
            RHYF  MT +YGIKPG+EHYNCMVDLLGRAGLLEEAENLIENA+ RNDSSLW VLLGAC
Sbjct: 481 GRHYFVRMTEEYGIKPGIEHYNCMVDLLGRAGLLEEAENLIENADFRNDSSLWQVLLGAC 540

Query: 541 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 600
           TT+ TNS TAERIAKK+MELEPQ +LSYV LANVYRAVGRWDDA+ +R+LMK+RQ+KK+P
Sbjct: 541 TTS-TNSGTAERIAKKMMELEPQHHLSYVLLANVYRAVGRWDDALTIRKLMKSRQVKKVP 600

Query: 601 GQSWM 606
           GQSWM
Sbjct: 601 GQSWM 603

BLAST of CSPI07G04370 vs. TAIR 10
Match: AT1G03540.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 725.7 bits (1872), Expect = 3.1e-209
Identity = 359/611 (58.76%), Postives = 450/611 (73.65%), Query Frame = 0

Query: 2   MLFLKRHCSSSFTSQNFKYSTHPSI------KLSQILQFCKSGLLNDALHLLNSIDLYDS 61
           ++ LKRH      SQ+      PSI      K S+IL+ CK G L +A+ +LNS   + S
Sbjct: 3   LIILKRH-----FSQHASLCLTPSISSSAPTKQSRILELCKLGQLTEAIRILNS--THSS 62

Query: 62  RI-NKPLLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLL 121
            I   P LYASLLQTC KV SF  G QFHAHVVKSGLETDR VGNSLLSLYFKLG     
Sbjct: 63  EIPATPKLYASLLQTCNKVFSFIHGIQFHAHVVKSGLETDRNVGNSLLSLYFKLGPGMRE 122

Query: 122 TRRVFDGLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSE 181
           TRRVFDG FVKD +SW SM++GYV   +   A+E+F +M+  G++ N FTLS+ +KACSE
Sbjct: 123 TRRVFDGRFVKDAISWTSMMSGYVTGKEHVKALEVFVEMVSFGLDANEFTLSSAVKACSE 182

Query: 182 IENLVLGKCFHGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTV 241
           +  + LG+CFHGVV+  GF+ N  I S+L  +YG N    DAR++FDE+ EPD +CWT V
Sbjct: 183 LGEVRLGRCFHGVVITHGFEWNHFISSTLAYLYGVNREPVDARRVFDEMPEPDVICWTAV 242

Query: 242 ISAFTRNDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYG 301
           +SAF++NDLYEEALG FY  HR   L PD  TFG+VLTACGNL RL+QG+EIH K+I  G
Sbjct: 243 LSAFSKNDLYEEALGLFYAMHRGKGLVPDGSTFGTVLTACGNLRRLKQGKEIHGKLITNG 302

Query: 302 FSGNVVTESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFR 361
              NVV ESSL+DMYGKCG+V +++++F+ MS +NSVSWSALL  YC NG++EKA+ +FR
Sbjct: 303 IGSNVVVESSLLDMYGKCGSVREARQVFNGMSKKNSVSWSALLGGYCQNGEHEKAIEIFR 362

Query: 362 EMKEVDLYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINF 421
           EM+E DLY FGTV++ACAGLAAV  GKEIH QY+R+G + +VIVESAL+DLY K GCI+ 
Sbjct: 363 EMEEKDLYCFGTVLKACAGLAAVRLGKEIHGQYVRRGCFGNVIVESALIDLYGKSGCIDS 422

Query: 422 AYRVFDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSH 481
           A RV+ +M  RN+ITWN+M+   AQNG    A+  F  M+K+GIKPD ISFI +L AC H
Sbjct: 423 ASRVYSKMSIRNMITWNAMLSALAQNGRGEEAVSFFNDMVKKGIKPDYISFIAILTACGH 482

Query: 482 TGLVDQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWL 541
           TG+VD+ R+YF LM   YGIKPG EHY+CM+DLLGRAGL EEAENL+E AECRND+SLW 
Sbjct: 483 TGMVDEGRNYFVLMAKSYGIKPGTEHYSCMIDLLGRAGLFEEAENLLERAECRNDASLWG 542

Query: 542 VLLGACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNR 601
           VLLG C      S  AERIAK++MELEP+ ++SYV L+N+Y+A+GR  DA+ +R+LM  R
Sbjct: 543 VLLGPCAANADASRVAERIAKRMMELEPKYHMSYVLLSNMYKAIGRHGDALNIRKLMVRR 602

Query: 602 QLKKMPGQSWM 606
            + K  GQSW+
Sbjct: 603 GVAKTVGQSWI 606

BLAST of CSPI07G04370 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 385.2 bits (988), Expect = 9.8e-107
Identity = 201/545 (36.88%), Postives = 322/545 (59.08%), Query Frame = 0

Query: 66  LLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFDGLFVK 125
           +L T +KVDS   G+Q H   +K GL+    V NSL+++Y KL       R VFD +  +
Sbjct: 321 MLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFG-FARTVFDNMSER 380

Query: 126 DVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEI-ENLVLGKCF 185
           D++SW S+I G  + G    A+ LF  +L  G++P+ +T+++V+KA S + E L L K  
Sbjct: 381 DLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQV 440

Query: 186 HGVVVRRGFDSNPVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRNDLY 245
           H   ++    S+  + ++LID Y RN    +A  LF E    D V W  +++ +T++   
Sbjct: 441 HVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILF-ERHNFDLVAWNAMMAGYTQSHDG 500

Query: 246 EEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESS 305
            + L  F L H+      D++T  +V   CG L  + QG+++HA  I  G+  ++   S 
Sbjct: 501 HKTLKLFALMHKQGER-SDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSG 560

Query: 306 LVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEV----D 365
           ++DMY KCG +  +Q  FD +   + V+W+ +++    NG+ E+A ++F +M+ +    D
Sbjct: 561 ILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPD 620

Query: 366 LYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFD 425
            ++  T+ +A + L A+  G++IH   ++     D  V ++LVD+YAKCG I+ AY +F 
Sbjct: 621 EFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFK 680

Query: 426 RMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLVDQ 485
           R+   N+  WN+M+ G AQ+G     +Q+F+ M   GIKPD ++FIG+L ACSH+GLV +
Sbjct: 681 RIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSE 740

Query: 486 ARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGAC 545
           A  +   M G YGIKP +EHY+C+ D LGRAGL+++AENLIE+      +S++  LL AC
Sbjct: 741 AYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAAC 800

Query: 546 TTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMP 605
                ++ T +R+A KL+ELEP    +YV L+N+Y A  +WD+    R +MK  ++KK P
Sbjct: 801 RVQ-GDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDP 860

BLAST of CSPI07G04370 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 380.9 bits (977), Expect = 1.8e-105
Identity = 209/616 (33.93%), Postives = 335/616 (54.38%), Query Frame = 0

Query: 63  YASLLQTCIKVD-SFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGS--------DSL 122
           +A LL +CIK   S    R  HA V+KSG   + F+ N L+  Y K GS        D +
Sbjct: 22  FAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKM 81

Query: 123 LTRRVF------------------DGLF----VKDVVSWASMITGYVREGKSGIAIELFW 182
             R ++                  D LF     +D  +W SM++G+ +  +   A+  F 
Sbjct: 82  PQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFA 141

Query: 183 DMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSNPVILSSLIDMYGRNF 242
            M   G   N ++ ++V+ ACS + ++  G   H ++ +  F S+  I S+L+DMY +  
Sbjct: 142 MMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCG 201

Query: 243 VSSDARQLFDELLEPDPVCWTTVISAFTRNDLYEEALGFFYLKHRAHRLCPDNYTFGSVL 302
             +DA+++FDE+ + + V W ++I+ F +N    EAL  F +   + R+ PD  T  SV+
Sbjct: 202 NVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLES-RVEPDEVTLASVI 261

Query: 303 TACGNLGRLRQGEEIHAKVIAYG-FSGNVVTESSLVDMYGKCGAVEKSQRLFD------- 362
           +AC +L  ++ G+E+H +V+       +++  ++ VDMY KC  +++++ +FD       
Sbjct: 262 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNV 321

Query: 363 ------------------------RMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEV 422
                                   +M+ RN VSW+AL+A Y  NG+ E+A++LF  +K  
Sbjct: 322 IAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRE 381

Query: 423 DL----YSFGTVIRACAGLAAVTPGKEIHCQYIR------KGGWRDVIVESALVDLYAKC 482
            +    YSF  +++ACA LA +  G + H   ++       G   D+ V ++L+D+Y KC
Sbjct: 382 SVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKC 441

Query: 483 GCINFAYRVFDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLL 542
           GC+   Y VF +M  R+ ++WN+MI GFAQNG    A+++F  M++ G KPD I+ IG+L
Sbjct: 442 GCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVL 501

Query: 543 FACSHTGLVDQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRND 602
            AC H G V++ RHYF  MT  +G+ P  +HY CMVDLLGRAG LEEA+++IE    + D
Sbjct: 502 SACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPD 561

Query: 603 SSLWLVLLGACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRE 606
           S +W  LL AC     N    + +A+KL+E+EP     YV L+N+Y  +G+W+D + VR+
Sbjct: 562 SVIWGSLLAACKVH-RNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRK 621

BLAST of CSPI07G04370 vs. TAIR 10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 370.2 bits (949), Expect = 3.3e-102
Identity = 206/607 (33.94%), Postives = 331/607 (54.53%), Query Frame = 0

Query: 25  SIKLSQILQFC-KSGLLNDALHLLNSIDLYDSRINKPLLYASLLQTCIKVDSFTRGRQFH 84
           S+  S I+  C ++ LL+ AL     +   ++ +++  +YAS+L++C  +     G Q H
Sbjct: 246 SVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQS-IYASVLRSCAALSELRLGGQLH 305

Query: 85  AHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRV-FDGLFVKDVVSWASMITGYVREGK 144
           AH +KS    D  V  + L +Y K   D++   ++ FD     +  S+ +MITGY +E  
Sbjct: 306 AHALKSDFAADGIVRTATLDMYAK--CDNMQDAQILFDNSENLNRQSYNAMITGYSQEEH 365

Query: 145 SGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSNPVILSS 204
              A+ LF  ++ SG+  +  +LS V +AC+ ++ L  G   +G+ ++     +  + ++
Sbjct: 366 GFKALLLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANA 425

Query: 205 LIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRNDLYEEALGFFYLKHRAHRLCP 264
            IDMYG+    ++A ++FDE+   D V W  +I+A  +N    E L F ++     R+ P
Sbjct: 426 AIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETL-FLFVSMLRSRIEP 485

Query: 265 DNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESSLVDMYGKCGAVEKSQRLF 324
           D +TFGS+L AC   G L  G EIH+ ++  G + N     SL+DMY KCG +E+++++ 
Sbjct: 486 DEFTFGSILKACTG-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIH 545

Query: 325 DRMSNRNS--------------------VSWSALLAVYCHNGDYEKAVNLFREMKEV--- 384
            R   R +                    VSW+++++ Y      E A  LF  M E+   
Sbjct: 546 SRFFQRANVSGTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGIT 605

Query: 385 -DLYSFGTVIRACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRV 444
            D +++ TV+  CA LA+   GK+IH Q I+K    DV + S LVD+Y+KCG ++ +  +
Sbjct: 606 PDKFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLM 665

Query: 445 FDRMPTRNLITWNSMIHGFAQNGSSGIAIQIFEAMIKEGIKPDCISFIGLLFACSHTGLV 504
           F++   R+ +TWN+MI G+A +G    AIQ+FE MI E IKP+ ++FI +L AC+H GL+
Sbjct: 666 FEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLI 725

Query: 505 DQARHYFDLMTGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLG 564
           D+   YF +M   YG+ P + HY+ MVD+LG++G ++ A  LI       D  +W  LLG
Sbjct: 726 DKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLG 785

Query: 565 ACTTTCTNSATAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKK 606
            CT    N   AE     L+ L+PQ   +Y  L+NVY   G W+    +R  M+  +LKK
Sbjct: 786 VCTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKK 845

BLAST of CSPI07G04370 vs. TAIR 10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 367.9 bits (943), Expect = 1.6e-101
Identity = 194/538 (36.06%), Postives = 318/538 (59.11%), Query Frame = 0

Query: 78  RGRQFHAHVVKSGLETDRF-VGNSLLSLYFKLGSDSLLTRRVFDGLFVKDVVSWASMITG 137
           +GR+ H HV+ +GL      +GN L+++Y K GS +   RRVF  +  KD VSW SMITG
Sbjct: 331 KGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIA-DARRVFYFMTDKDSVSWNSMITG 390

Query: 138 YVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIENLVLGKCFHGVVVRRGFDSN 197
             + G    A+E +  M    I P  FTL + + +C+ ++   LG+  HG  ++ G D N
Sbjct: 391 LDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLN 450

Query: 198 PVILSSLIDMYGRNFVSSDARQLFDELLEPDPVCWTTVISAFTRND--LYEEALGFFYLK 257
             + ++L+ +Y      ++ R++F  + E D V W ++I A  R++  L E  + F   +
Sbjct: 451 VSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQ 510

Query: 258 HRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVVTESSLVDMYGKCGA 317
               +L  +  TF SVL+A  +L     G++IH   +    +    TE++L+  YGKCG 
Sbjct: 511 RAGQKL--NRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGE 570

Query: 318 VEKSQRLFDRMS-NRNSVSWSALLAVYCHNGDYEKAVNLFREM----KEVDLYSFGTVIR 377
           ++  +++F RM+  R++V+W+++++ Y HN    KA++L   M    + +D + + TV+ 
Sbjct: 571 MDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLS 630

Query: 378 ACAGLAAVTPGKEIHCQYIRKGGWRDVIVESALVDLYAKCGCINFAYRVFDRMPTRNLIT 437
           A A +A +  G E+H   +R     DV+V SALVD+Y+KCG +++A R F+ MP RN  +
Sbjct: 631 AFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYS 690

Query: 438 WNSMIHGFAQNGSSGIAIQIFEAMIKEG-IKPDCISFIGLLFACSHTGLVDQARHYFDLM 497
           WNSMI G+A++G    A+++FE M  +G   PD ++F+G+L ACSH GL+++   +F+ M
Sbjct: 691 WNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESM 750

Query: 498 TGKYGIKPGVEHYNCMVDLLGRAGLLEEAENLIENAECRNDSSLWLVLLGACTTTCTNSA 557
           +  YG+ P +EH++CM D+LGRAG L++ E+ IE    + +  +W  +LGAC       A
Sbjct: 751 SDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKA 810

Query: 558 -TAERIAKKLMELEPQCYLSYVHLANVYRAVGRWDDAVKVRELMKNRQLKKMPGQSWM 606
              ++ A+ L +LEP+  ++YV L N+Y A GRW+D VK R+ MK+  +KK  G SW+
Sbjct: 811 ELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWV 865

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LR694.3e-20858.76Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana OX... [more]
Q9SMZ21.4e-10536.88Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
Q9SIT72.6e-10433.93Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
Q9FWA64.6e-10133.94Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop... [more]
Q9FIB22.3e-10036.06Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A0A0K5P00.0e+0099.67Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G047230 PE=4 SV=1[more]
A0A5A7SR370.0e+0091.40Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C0U70.0e+0091.40pentatricopeptide repeat-containing protein At1g03540 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1D3F11.7e-30082.48pentatricopeptide repeat-containing protein At1g03540 OS=Momordica charantia OX=... [more]
A0A6J1JMC87.0e-29481.32pentatricopeptide repeat-containing protein At1g03540 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
XP_004137012.10.0e+0099.67pentatricopeptide repeat-containing protein At1g03540 [Cucumis sativus] >KGN4358... [more]
XP_008455346.10.0e+0091.40PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Cucumis melo] ... [more]
XP_038888158.11.5e-30685.45pentatricopeptide repeat-containing protein At1g03540 [Benincasa hispida][more]
XP_022147827.13.5e-30082.48pentatricopeptide repeat-containing protein At1g03540 [Momordica charantia][more]
XP_023529479.14.9e-29481.65pentatricopeptide repeat-containing protein At1g03540 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
AT1G03540.13.1e-20958.76Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT4G33170.19.8e-10736.88Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G13600.11.8e-10533.93Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G02330.13.3e-10233.94Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G09950.11.6e-10136.06Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 426..472
e-value: 5.1E-9
score: 36.2
coord: 329..372
e-value: 2.1E-9
score: 37.4
coord: 125..172
e-value: 3.7E-12
score: 46.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 500..522
e-value: 0.0046
score: 17.1
coord: 199..224
e-value: 1.3
score: 9.5
coord: 229..251
e-value: 0.0014
score: 18.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 128..161
e-value: 5.5E-5
score: 21.1
coord: 428..462
e-value: 3.1E-8
score: 31.3
coord: 331..360
e-value: 2.3E-7
score: 28.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 126..160
score: 12.101333
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 426..460
score: 12.660359
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 329..363
score: 11.564229
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 298..328
score: 8.714292
IPR019734Tetratricopeptide repeatPFAMPF13176TPR_7coord: 567..590
e-value: 0.011
score: 15.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 464..600
e-value: 3.0E-16
score: 61.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 360..462
e-value: 4.6E-21
score: 77.5
coord: 28..182
e-value: 2.3E-22
score: 81.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 289..359
e-value: 1.5E-10
score: 42.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 183..288
e-value: 2.6E-15
score: 58.2
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 331..585
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 17..605
NoneNo IPR availablePANTHERPTHR24015:SF328OS06G0611200 PROTEINcoord: 17..605

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G04370.1CSPI07G04370.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding