CSPI01G33570 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G33570
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr1: 28542288 .. 28544585 (-)
RNA-Seq Expression: CSPI01G33570
Synteny: CSPI01G33570
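
The Location field can be checked against the sequences listed further down. The sketch below parses the location string and computes the span length, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its coordinate convention). The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Location string as shown in the Overview of this record.
LOCATION = "Chr1: 28542288 .. 28544585 (-)"


def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location(LOCATION)
length = span_length(start, end)

# A span whose length is a multiple of 3 is consistent with an
# intronless gene whose whole extent is coding sequence.
print(chrom, strand, length, length % 3 == 0)
```

Under that assumption the span is 2298 bp, a multiple of three, which is consistent with the gene, mRNA and CDS sequences below appearing identical (no intron).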
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATCAACTAATTCTGTGCTCCCCTTCCTTCTCTGGCTGCTCCGGCCATTCCCATTTGAATCTCCAACAAACCCACCAACTCCATGCCCATTTCATCAAAACCCAATTTCATAATCCTCACCCTTTCTTCTCTCAATCCCACTTCACCCCTGAAGCCAATTACAATCTCCTCATTTCATCTTACACCAACAACCACCTCCCGCAAGCTTCTTTCAACTGCTACCTCCATATGCGTTCAAACGATGCTGCTGCACTTGACAACTTCATTCTCCCTTCACTTCTCAAAGCTTGTGCCCAAGCTTCCTCTGGGTATTTAGGCAGGGAACTCCACGGTTTCGCCCAAAAGAACGGCTTTGCTTCAGACGTTTTTGTGTGCAACGCTCTTATGAACATGTATGAGAAATGTGGGTGCTTGGTTTCTGCTCGCTTGGTGTTTGATCAAATGCCCGAAAGAGATGTTGTCTCTTGGACTACTATGCTTGGGTGCTATGTACGGAGCAAAGCTTTTGGTGAAGCGCTTCGACTCGTACGGGAGATGCAGTTTGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGCTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGGGAAGATGGAAGTTTCAATGACTACTGCACTGATCGATATGTATTGCAAAGGTGGATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCATGGACGGTGATGATAGCAGGTTGTATTCGCAGTTGCAGATTAGATGAAGGGGCAAAGAACTTTAATATAATGCTCGAAGAAAAATTATTCCCCAATGAGATTACACTACTAAGTTTGATTACTGAATGTGGTTTCGTGGGAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTCTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAACGGTGTCAAGAAAAAAGATGTCAAGATTTGGAGTGTTTTAATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCGTCGAGATGTTGAACAATGACATGAAACCAAACAACGTGACAATGGTTAGCCTTCTTTCTTTGTGTGCAGAGGCTGGAGCCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGTCTTGAAGTAGATGTCATTCTAGAAACAGCTCTAATCAACATGTATGCAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTCAATGAAGCTATGCAACGGGACATTCGCATGTGGAACACAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCGTTTCCATTTTCCATGCTTGTAGTCATTCCGGATTGGTAGTAGAAGGGAAAAAGTATTTCAACAAAATGGTTCACGACTTTGGAATTGTTCCAAAGATGGAGCACTATGGATGCTTGGTGGATCTTCTTGGTCGAGCTGGACATCTTGACGAAGCTCACAACATCATTGAAAACATGCCCATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAGCAATGAGCCATTCAGGAATAAAGAAAGAACCAGGCCTCAGCTGGATTGAAGTAAATGGTTCAGTTCACCACTTCAAATCCGGAGACAAGGCATGCACACAAACTACAAAAGTATATGAAATGGTGACCGAAATGTGCAT
CAAATTGAGAGAGTCGGGATACACACCGAACACAGCGGCAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGCGAGAAACTTGCTACAGCATTTGGACTCATAAGCACAGCTCCTGGTACACCCATCCGAATCGTTAAGAATTTGAGGATTTGTGATGATTGTCATGCTTCAACGAAGCTATTATCAAAAATCTATGGACGAACAATAATAGTTAGAGATAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTATGGGTTATTGGTAA

mRNA sequence

ATGAATCAACTAATTCTGTGCTCCCCTTCCTTCTCTGGCTGCTCCGGCCATTCCCATTTGAATCTCCAACAAACCCACCAACTCCATGCCCATTTCATCAAAACCCAATTTCATAATCCTCACCCTTTCTTCTCTCAATCCCACTTCACCCCTGAAGCCAATTACAATCTCCTCATTTCATCTTACACCAACAACCACCTCCCGCAAGCTTCTTTCAACTGCTACCTCCATATGCGTTCAAACGATGCTGCTGCACTTGACAACTTCATTCTCCCTTCACTTCTCAAAGCTTGTGCCCAAGCTTCCTCTGGGTATTTAGGCAGGGAACTCCACGGTTTCGCCCAAAAGAACGGCTTTGCTTCAGACGTTTTTGTGTGCAACGCTCTTATGAACATGTATGAGAAATGTGGGTGCTTGGTTTCTGCTCGCTTGGTGTTTGATCAAATGCCCGAAAGAGATGTTGTCTCTTGGACTACTATGCTTGGGTGCTATGTACGGAGCAAAGCTTTTGGTGAAGCGCTTCGACTCGTACGGGAGATGCAGTTTGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGCTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGGGAAGATGGAAGTTTCAATGACTACTGCACTGATCGATATGTATTGCAAAGGTGGATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCATGGACGGTGATGATAGCAGGTTGTATTCGCAGTTGCAGATTAGATGAAGGGGCAAAGAACTTTAATATAATGCTCGAAGAAAAATTATTCCCCAATGAGATTACACTACTAAGTTTGATTACTGAATGTGGTTTCGTGGGAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTCTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAACGGTGTCAAGAAAAAAGATGTCAAGATTTGGAGTGTTTTAATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCGTCGAGATGTTGAACAATGACATGAAACCAAACAACGTGACAATGGTTAGCCTTCTTTCTTTGTGTGCAGAGGCTGGAGCCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGTCTTGAAGTAGATGTCATTCTAGAAACAGCTCTAATCAACATGTATGCAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTCAATGAAGCTATGCAACGGGACATTCGCATGTGGAACACAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCGTTTCCATTTTCCATGCTTGTAGTCATTCCGGATTGGTAGTAGAAGGGAAAAAGTATTTCAACAAAATGGTTCACGACTTTGGAATTGTTCCAAAGATGGAGCACTATGGATGCTTGGTGGATCTTCTTGGTCGAGCTGGACATCTTGACGAAGCTCACAACATCATTGAAAACATGCCCATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAGCAATGAGCCATTCAGGAATAAAGAAAGAACCAGGCCTCAGCTGGATTGAAGTAAATGGTTCAGTTCACCACTTCAAATCCGGAGACAAGGCATGCACACAAACTACAAAAGTATATGAAATGGTGACCGAAATGTGCAT
CAAATTGAGAGAGTCGGGATACACACCGAACACAGCGGCAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGCGAGAAACTTGCTACAGCATTTGGACTCATAAGCACAGCTCCTGGTACACCCATCCGAATCGTTAAGAATTTGAGGATTTGTGATGATTGTCATGCTTCAACGAAGCTATTATCAAAAATCTATGGACGAACAATAATAGTTAGAGATAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTATGGGTTATTGGTAA

Coding sequence (CDS)

ATGAATCAACTAATTCTGTGCTCCCCTTCCTTCTCTGGCTGCTCCGGCCATTCCCATTTGAATCTCCAACAAACCCACCAACTCCATGCCCATTTCATCAAAACCCAATTTCATAATCCTCACCCTTTCTTCTCTCAATCCCACTTCACCCCTGAAGCCAATTACAATCTCCTCATTTCATCTTACACCAACAACCACCTCCCGCAAGCTTCTTTCAACTGCTACCTCCATATGCGTTCAAACGATGCTGCTGCACTTGACAACTTCATTCTCCCTTCACTTCTCAAAGCTTGTGCCCAAGCTTCCTCTGGGTATTTAGGCAGGGAACTCCACGGTTTCGCCCAAAAGAACGGCTTTGCTTCAGACGTTTTTGTGTGCAACGCTCTTATGAACATGTATGAGAAATGTGGGTGCTTGGTTTCTGCTCGCTTGGTGTTTGATCAAATGCCCGAAAGAGATGTTGTCTCTTGGACTACTATGCTTGGGTGCTATGTACGGAGCAAAGCTTTTGGTGAAGCGCTTCGACTCGTACGGGAGATGCAGTTTGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGCTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGGGAAGATGGAAGTTTCAATGACTACTGCACTGATCGATATGTATTGCAAAGGTGGATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCATGGACGGTGATGATAGCAGGTTGTATTCGCAGTTGCAGATTAGATGAAGGGGCAAAGAACTTTAATATAATGCTCGAAGAAAAATTATTCCCCAATGAGATTACACTACTAAGTTTGATTACTGAATGTGGTTTCGTGGGAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTCTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAACGGTGTCAAGAAAAAAGATGTCAAGATTTGGAGTGTTTTAATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCGTCGAGATGTTGAACAATGACATGAAACCAAACAACGTGACAATGGTTAGCCTTCTTTCTTTGTGTGCAGAGGCTGGAGCCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGTCTTGAAGTAGATGTCATTCTAGAAACAGCTCTAATCAACATGTATGCAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTCAATGAAGCTATGCAACGGGACATTCGCATGTGGAACACAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCGTTTCCATTTTCCATGCTTGTAGTCATTCCGGATTGGTAGTAGAAGGGAAAAAGTATTTCAACAAAATGGTTCACGACTTTGGAATTGTTCCAAAGATGGAGCACTATGGATGCTTGGTGGATCTTCTTGGTCGAGCTGGACATCTTGACGAAGCTCACAACATCATTGAAAACATGCCCATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAGCAATGAGCCATTCAGGAATAAAGAAAGAACCAGGCCTCAGCTGGATTGAAGTAAATGGTTCAGTTCACCACTTCAAATCCGGAGACAAGGCATGCACACAAACTACAAAAGTATATGAAATGGTGACCGAAATGTGCAT
CAAATTGAGAGAGTCGGGATACACACCGAACACAGCGGCAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGCGAGAAACTTGCTACAGCATTTGGACTCATAAGCACAGCTCCTGGTACACCCATCCGAATCGTTAAGAATTTGAGGATTTGTGATGATTGTCATGCTTCAACGAAGCTATTATCAAAAATCTATGGACGAACAATAATAGTTAGAGATAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTATGGGTTATTGGTAA

Protein sequence

MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW*
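
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not a function of the database. Only the first and last few codons of the CDS are used here so the example stays short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS (length must be a multiple of 3); '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 30 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGAATCAACTAATTCTGTGCTCCCCTTCC"
cds_end = "TGTATGGGTTATTGGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues and the last five residues (plus the stop symbol) of the protein sequence shown above.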
Homology
BLAST of CSPI01G33570 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 1.3e-153
Identity = 275/712 (38.62%), Positives = 423/712 (59.41%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASFNCYLHMRSNDA-AALDNFILPSLLKACAQASSGYLGRELHGF 114
           Y+ ++  +        +   ++ MR +D    + NF    LLK C   +   +G+E+HG 
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTY--LLKVCGDEAELRVGKEIHGL 162

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEA 174
             K+GF+ D+F    L NMY KC  +  AR VFD+MPERD+VSW T++  Y ++     A
Sbjct: 163 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 222

Query: 175 LRLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALI 234
           L +V+ M    +K S + ++S++     L  +  G+ +HGY +R+  D  + +S  TAL+
Sbjct: 223 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS--TALV 282

Query: 235 DMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEI 294
           DMY K G L +A++LFD + +R+VVSW  MI   +++    E    F  ML+E + P ++
Sbjct: 283 DMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDV 342

Query: 295 TLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV 354
           +++  +  C  +G L+ G++ H   +  G   ++++V +LI MY KC +V  A ++F  +
Sbjct: 343 SVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKL 402

Query: 355 KKKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGK 414
           + + +  W+ +I  +A         N F +M +  +KP+  T VS+++  AE       K
Sbjct: 403 QSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAK 462

Query: 415 WTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHG 474
           W H  + R  L+ +V + TAL++MYAKCG + IAR +F+   +R +  WN M+ G+  HG
Sbjct: 463 WIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHG 522

Query: 475 CGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHY 534
            GK ALELF EM+   ++PN +TF+S+  ACSHSGLV  G K F  M  ++ I   M+HY
Sbjct: 523 FGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHY 582

Query: 535 GCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDP 594
           G +VDLLGRAG L+EA + I  MP++P   ++GA+L AC++HKN+   E AA ++ EL+P
Sbjct: 583 GAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNP 642

Query: 595 QNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACT 654
            + GY VL +NIY +A  W  V  VR +M   G++K PG S +E+   VH F SG  A  
Sbjct: 643 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 702

Query: 655 QTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPG 714
            + K+Y  + ++   ++E+GY P+T  V L ++ + KE  LS HSEKLA +FGL++T  G
Sbjct: 703 DSKKIYAFLEKLICHIKEAGYVPDTNLV-LGVENDVKEQLLSTHSEKLAISFGLLNTTAG 762

Query: 715 TPIRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           T I + KNLR+C DCH +TK +S + GR I+VRD  RFHHF  G CSC  YW
Sbjct: 763 TTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809
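
The percentages in each HSP header are the identity and positive counts divided by the alignment length. The sketch below recomputes them from a header line; the regular expression and the helper name `hsp_percentages` are assumptions made for illustration, and the pattern accepts both the "Positives" spelling and the "Postives" variant that appears in some exports of this page.

```python
import re

# Matches e.g. "Identity = 275/712 (38.62%), Positives = 423/712 (59.41%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


header = "Identity = 275/712 (38.62%), Positives = 423/712 (59.41%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(header)
print(identity_pct, positives_pct)
```

For the Q3E6Q1 hit, 275/712 and 423/712 reproduce the reported 38.62% and 59.41%.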

BLAST of CSPI01G33570 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 2.9e-153
Identity = 294/770 (38.18%), Positives = 443/770 (57.53%), Query Frame = 0

Query: 20  LNLQQTHQLHAHFIKT-QFHNPH---PFFSQSHFT---------------PEAN---YNL 79
           ++L+Q  Q H H I+T  F +P+     F+ +  +               P+ N   +N 
Sbjct: 41  VSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNT 100

Query: 80  LISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKN 139
           LI +Y +   P  S   +L M S      + +  P L+KA A+ SS  LG+ LHG A K+
Sbjct: 101 LIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKS 160

Query: 140 GFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLV 199
              SDVFV N+L++ Y  CG L SA  VF  + E+DVVSW +M+  +V+  +  +AL L 
Sbjct: 161 AVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELF 220

Query: 200 REMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYC 259
           ++M+   VK S V ++ +++    + +++ GR V  YI  N     + +++  A++DMY 
Sbjct: 221 KKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEEN--RVNVNLTLANAMLDMYT 280

Query: 260 KGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLS 319
           K G +  A+RLFD + ++  V+WT M+ G                               
Sbjct: 281 KCGSIEDAKRLFDAMEEKDNVTWTTMLDG------------------------------- 340

Query: 320 LITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKD 379
                                                  Y        AR + N + +KD
Sbjct: 341 ---------------------------------------YAISEDYEAAREVLNSMPQKD 400

Query: 380 VKIWSVLISAYAHVSCMDQVFNLFVEM-LNNDMKPNNVTMVSLLSLCAEAGALDLGKWTH 439
           +  W+ LISAY      ++   +F E+ L  +MK N +T+VS LS CA+ GAL+LG+W H
Sbjct: 401 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 460

Query: 440 AYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGK 499
           +YI +HG+ ++  + +ALI+MY+KCGD+  +R +FN   +RD+ +W+ M+ G +MHGCG 
Sbjct: 461 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 520

Query: 500 EALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCL 559
           EA+++F +M+   V+PN +TF ++F ACSH+GLV E +  F++M  ++GIVP+ +HY C+
Sbjct: 521 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 580

Query: 560 VDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNC 619
           VD+LGR+G+L++A   IE MP+ P+T +WGALL ACK+H NL L E+A  ++LEL+P+N 
Sbjct: 581 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 640

Query: 620 GYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTT 679
           G  VL SNIYA   +W +V+ +R+ M  +G+KKEPG S IE++G +H F SGD A   + 
Sbjct: 641 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 700

Query: 680 KVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEE-KESALSYHSEKLATAFGLISTAPGTP 739
           KVY  + E+  KL+ +GY P  + VL  I+EEE KE +L+ HSEKLA  +GLIST     
Sbjct: 701 KVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKV 738

Query: 740 IRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           IR++KNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Sbjct: 761 IRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CSPI01G33570 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 540.8 bits (1392), Expect = 2.5e-152
Identity = 292/774 (37.73%), Positives = 428/774 (55.30%), Query Frame = 0

Query: 17  HSHLNLQQTHQLHAHFIKTQFHNPHPFFSQ--------SHF------------TPEAN-- 76
           H+   LQ    +HA  IK   HN +   S+         HF              E N  
Sbjct: 41  HNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLL 100

Query: 77  -YNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGF 136
            +N +   +  +  P ++   Y+ M S      +++  P +LK+CA++ +   G+++HG 
Sbjct: 101 IWNTMFRGHALSSDPVSALKLYVCMISLGLLP-NSYTFPFVLKSCAKSKAFKEGQQIHGH 160

Query: 137 AQKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEA 196
             K G   D++V  +L++MY + G L  A  VFD+ P RDVVS+                
Sbjct: 161 VLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSY---------------- 220

Query: 197 LRLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALI 256
                                                                   TALI
Sbjct: 221 --------------------------------------------------------TALI 280

Query: 257 DMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEI 316
             Y   G + +AQ+LFD +  + VVSW  MI+G   +    E  + F  M++  + P+E 
Sbjct: 281 KGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDES 340

Query: 317 TLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV 376
           T++++++ C   G+++LG+  H ++  +GFG +L +V ALID+Y KCG++  A  LF  +
Sbjct: 341 TMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERL 400

Query: 377 KKKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGK 436
             KDV  W+ LI  Y H++   +   LF EML +   PN+VTM+S+L  CA  GA+D+G+
Sbjct: 401 PYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGR 460

Query: 437 WTHAYINRH--GLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSM 496
           W H YI++   G+     L T+LI+MYAKCGD+  A  +FN  + + +  WN M+ GF+M
Sbjct: 461 WIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAM 520

Query: 497 HGCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKME 556
           HG    + +LFS M   G++P+DITFV +  ACSHSG++  G+  F  M  D+ + PK+E
Sbjct: 521 HGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLE 580

Query: 557 HYGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILEL 616
           HYGC++DLLG +G   EA  +I  M M P+ +IW +LL ACK+H N+ LGE  A  ++++
Sbjct: 581 HYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKI 640

Query: 617 DPQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKA 676
           +P+N G  VL SNIYASA RWN+V   R  ++  G+KK PG S IE++  VH F  GDK 
Sbjct: 641 EPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKF 700

Query: 677 CTQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTA 736
             +  ++Y M+ EM + L ++G+ P+T+ VL  ++EE KE AL +HSEKLA AFGLIST 
Sbjct: 701 HPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTK 741

Query: 737 PGTPIRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           PGT + IVKNLR+C +CH +TKL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 761 PGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CSPI01G33570 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 537.7 bits (1384), Expect = 2.1e-151
Identity = 291/749 (38.85%), Positives = 433/749 (57.81%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFA 114
           YN LI  Y ++ L   +   +L M  N   + D +  P  L ACA++ +   G ++HG  
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRM-MNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLI 161

Query: 115 QKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEAL 174
            K G+A D+FV N+L++ Y +CG L SAR VFD+M ER+VVSWT+M+  Y R     +A+
Sbjct: 162 VKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAV 221

Query: 175 ----RLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVS--M 234
               R+VR+ +   V  + V ++ +I+    L D+++G  V+ +I RN G   +EV+  M
Sbjct: 222 DLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAFI-RNSG---IEVNDLM 281

Query: 235 TTALIDMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKL 294
            +AL+DMY K   +  A+RLFD     ++     M +  +R     E    FN+M++  +
Sbjct: 282 VSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGV 341

Query: 295 FPNEITLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKC-------- 354
            P+ I++LS I+ C  +  +  GK  H Y+LRNGF     +  ALIDMY KC        
Sbjct: 342 RPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFR 401

Query: 355 -----------------------GQVGYARALFNGVKKKDVKIWSVLISAYAHVSCMDQV 414
                                  G+V  A   F  + +K++  W+ +IS     S  ++ 
Sbjct: 402 IFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEA 461

Query: 415 FNLFVEMLNND-MKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEVDVILETALIN 474
             +F  M + + +  + VTM+S+ S C   GALDL KW + YI ++G+++DV L T L++
Sbjct: 462 IEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVD 521

Query: 475 MYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEMESHGVEPNDIT 534
           M+++CGD   A S+FN    RD+  W   +   +M G  + A+ELF +M   G++P+ + 
Sbjct: 522 MFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVA 581

Query: 535 FVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENM 594
           FV    ACSH GLV +GK+ F  M+   G+ P+  HYGC+VDLLGRAG L+EA  +IE+M
Sbjct: 582 FVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDM 641

Query: 595 PMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVT 654
           PM PN +IW +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+ 
Sbjct: 642 PMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMA 701

Query: 655 SVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTP 714
            VR +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P
Sbjct: 702 KVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVP 761

Query: 715 NTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICDDCHASTKLLS 766
           + + VL+++DE+EK   LS HSEKLA A+GLIS+  GT IRIVKNLR+C DCH+  K  S
Sbjct: 762 DLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFAS 821

BLAST of CSPI01G33570 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 533.1 bits (1372), Expect = 5.2e-150
Identity = 275/712 (38.62%), Positives = 418/712 (58.71%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFA 114
           +N+L++    +     S   +  M S+    +D++    + K+ +   S + G +LHGF 
Sbjct: 163 WNILMNELAKSGDFSGSIGLFKKMMSS-GVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFI 222

Query: 115 QKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEAL 174
            K+GF     V N+L+  Y K   + SAR VFD+M ERDV+SW +++  YV +    + L
Sbjct: 223 LKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGL 282

Query: 175 RLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALID 234
            +  +M   G+++    ++S+ A   +   +  GRAVH   V+       E      L+D
Sbjct: 283 SVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVK--ACFSREDRFCNTLLD 342

Query: 235 MYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEIT 294
           MY K G L SA+ +F  +S RSVVS+T MIAG  R     E  K F  M EE + P+  T
Sbjct: 343 MYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYT 402

Query: 295 LLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVK 354
           + +++  C     LD GK  H ++  N  G  + +  AL+DMY KCG +  A  +F+ ++
Sbjct: 403 VTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMR 462

Query: 355 KKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNND-MKPNNVTMVSLLSLCAEAGALDLGK 414
            KD+  W+ +I  Y+     ++  +LF  +L      P+  T+  +L  CA   A D G+
Sbjct: 463 VKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGR 522

Query: 415 WTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHG 474
             H YI R+G   D  +  +L++MYAKCG + +A  LF++   +D+  W  M+AG+ MHG
Sbjct: 523 EIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHG 582

Query: 475 CGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHY 534
            GKEA+ LF++M   G+E ++I+FVS+ +ACSHSGLV EG ++FN M H+  I P +EHY
Sbjct: 583 FGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHY 642

Query: 535 GCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDP 594
            C+VD+L R G L +A+  IENMP+ P+  IWGALL  C++H ++ L E  A K+ EL+P
Sbjct: 643 ACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEP 702

Query: 595 QNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACT 654
           +N GY VL +NIYA A++W  V  +R+ +   G++K PG SWIE+ G V+ F +GD +  
Sbjct: 703 ENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNP 762

Query: 655 QTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPG 714
           +T  +   + ++  ++ E GY+P T   L++ +E EKE AL  HSEKLA A G+IS+  G
Sbjct: 763 ETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHG 822

Query: 715 TPIRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
             IR+ KNLR+C DCH   K +SK+  R I++RD NRFH F +G+CSC G+W
Sbjct: 823 KIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of CSPI01G33570 vs. ExPASy TrEMBL
Match: A0A0A0LYC2 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G690260 PE=3 SV=1)

HSP 1 Score: 1553.1 bits (4020), Expect = 0.0e+00
Identity = 758/765 (99.08%), Positives = 762/765 (99.61%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGFA 120
           SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSG LGRELHGFAQKNGFA
Sbjct: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGDLGRELHGFAQKNGFA 120

Query: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180
           SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM
Sbjct: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180

Query: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKGG 240
           QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGD KMEVSMTTALIDMYCKGG
Sbjct: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGG 240

Query: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLIT 300
           CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFN MLEEKLFPNEITLLSLIT
Sbjct: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLIT 300

Query: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360
           ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI
Sbjct: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360

Query: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420
           WSVLISAYAHVSCMDQVFNLFVEMLNND+KPNNVTMVSLLSLCAEAGALDLGKWTHAYIN
Sbjct: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420

Query: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480
           RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE
Sbjct: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480

Query: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540
           LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL
Sbjct: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540

Query: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600
           GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV
Sbjct: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600

Query: 601 LKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYE 660
           LKSNIYASAKRWNDVTSVREAMSHSG+KKEPGLSWIEV+GSVHHFKSGDKACTQTTKVYE
Sbjct: 601 LKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYE 660

Query: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720
           MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK
Sbjct: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720

Query: 721 NLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           NLRICDDCHA+TKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW
Sbjct: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of CSPI01G33570 vs. ExPASy TrEMBL
Match: A0A5A7V2V9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold403G001350 PE=3 SV=1)

HSP 1 Score: 1464.9 bits (3791), Expect = 0.0e+00
Identity = 715/766 (93.34%), Positives = 735/766 (95.95%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLIL SPS SGCSG+SHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSND-AAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGF 120
           SYTNNHLPQAS NCYLHMR+ND AAALDNFILPSLLKACAQASS  LGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSA LVFD+MPERDVVSW+TMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKG 240
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYIVRNVGD KMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLI 300
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFN MLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 360
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 420
           IWS LISAYAHVSCMDQVFNLF+EML+N++KPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRE MSH G+KKEPGLSWIEVNGSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 720
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           KNLRICDDCHA+ KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of CSPI01G33570 vs. ExPASy TrEMBL
Match: A0A1S3CJ58 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501009 PE=3 SV=1)

HSP 1 Score: 1464.1 bits (3789), Expect = 0.0e+00
Identity = 714/766 (93.21%), Positives = 736/766 (96.08%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLIL SPS SGCSG+SHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSND-AAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGF 120
           SYTNNHLPQAS NCYLHMR+ND AAALDNFILPSLLKACAQASS  LGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSA LVFD+MPERDVVSW+TMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKG 240
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYI+RNVGD KMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLI 300
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFN MLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 360
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 420
           IWS LISAYAHVSCMDQVFNLF+EML+N++KPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH+FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRE MSH G+KKEPGLSWIEVNGSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 720
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           KNLRICDDCHA+ KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
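
The percentages in each HSP header are the reported counts divided by the alignment length (for example 714/766 for the hit above). The following is a minimal Python sketch, not part of the database output, that parses such a header line and recomputes the percentages as a consistency check. The function names parse_hsp_stats and recomputed_pct are hypothetical, and the regular expression assumes the exact header layout shown in this report; it accepts either spelling of the "Positives" label.

```python
import re

# Pattern for header lines of the form used in this report, e.g.
# "Identity = 714/766 (93.21%), Positives = 736/766 (96.08%), Query Frame = 0".
# "Pos\w*" is deliberately loose so a misspelled label still matches.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages from one header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    if ident_len != pos_len:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 714/766 (93.21%), Positives = 736/766 (96.08%), Query Frame = 0"
)
# Both recomputed values agree with the percentages printed in the header.
print(recomputed_pct(stats["identities"], stats["align_len"]), stats["identity_pct"])
print(recomputed_pct(stats["positives"], stats["align_len"]), stats["positive_pct"])
```

Note that the denominator is the alignment length including gap columns, which is why it can differ from the query length between hits.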

BLAST of CSPI01G33570 vs. ExPASy TrEMBL
Match: A0A6J1HA74 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111461539 PE=3 SV=1)

HSP 1 Score: 1314.3 bits (3400), Expect = 0.0e+00
Identity = 631/765 (82.48%), Positives = 698/765 (91.24%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           M+QLIL + S SG SGHSHLNLQQTHQ+HAH IKTQF NPH FFS+SHFTPEAN+NLLIS
Sbjct: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGFA 120
           SYT+NHLPQA+F  Y HMR+ DAAA+DNFI+PSLLKACAQASS   GRE+HGFA KNGF 
Sbjct: 61  SYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFV 120

Query: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180
           SDVFVCNALMNMYEKCG LVSA LVFD+MP+RDVVSW+TMLGCYVRSK+FGEA RLVREM
Sbjct: 121 SDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREM 180

Query: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKGG 240
            FVGVKLS VALIS+I VFG L DMKSGRA+HGY+VRNVG+ ++E+ +TTALIDMYCKG 
Sbjct: 181 HFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGD 240

Query: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLIT 300
            LASA RLFD LS+R+VVSWT +IAGCIRSCR  EGAKNF+ MLEE + PNEITLLSLIT
Sbjct: 241 KLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLIT 300

Query: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360
           ECGFVG LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGV++KDVKI
Sbjct: 301 ECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKI 360

Query: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420
           WS LISAYAH SC+DQ F+LF++ML++++KPN VTMVSLLSLCAE GALDLG+WTHAYIN
Sbjct: 361 WSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYIN 420

Query: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480
           RHG+EVDV+LETALINMYAKCGD+  AR LF+EA +RDI MWN MMAGFS+HGCGKEALE
Sbjct: 421 RHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALE 480

Query: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540
           LFS+M  HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDLL
Sbjct: 481 LFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLL 540

Query: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600
           GRA  LD AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY V
Sbjct: 541 GRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRV 600

Query: 601 LKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYE 660
           LKSNIYAS KRW DVTSVRE MSH G+KKEPGLSWIEVNGSVHHF+SGDK CTQT KV+E
Sbjct: 601 LKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHE 660

Query: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720
           MVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+K
Sbjct: 661 MVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIK 720

Query: 721 NLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           NLRICDDCHA+TKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of CSPI01G33570 vs. ExPASy TrEMBL
Match: A0A6J1JKG9 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima OX=3661 GN=LOC111485395 PE=3 SV=1)

HSP 1 Score: 1299.3 bits (3361), Expect = 0.0e+00
Identity = 623/765 (81.44%), Positives = 694/765 (90.72%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           M+QLIL + S SG SGHSHLNLQQTHQ+HAHFIKTQF NPH FFS+S+FTPEAN+NLLIS
Sbjct: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHFIKTQFRNPHNFFSRSNFTPEANFNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGFA 120
           SYT+NH PQA+FN Y HMR+ DAAA+DNFI+PSLLKACAQASS  LGRE+HGFA KNGF 
Sbjct: 61  SYTDNHRPQAAFNLYHHMRTTDAAAVDNFIVPSLLKACAQASSTNLGREVHGFAVKNGFV 120

Query: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180
           SDVFVCNALMNMYEKCG LVSA LVFD+MP+RDVVSW+TMLGCYVRSK+FGEA RLVREM
Sbjct: 121 SDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREM 180

Query: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKGG 240
            FVGVKLS VALIS+I VFG L DMKSGRA+HGY+VRNVG  ++E+ +TTALIDMYCKG 
Sbjct: 181 HFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGKERIELPLTTALIDMYCKGD 240

Query: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLIT 300
            LASA RLF+ LS+R+VVSWT +IAGCIRSCR  EGAKNF+ MLEE + PNEITLLSLIT
Sbjct: 241 NLASAMRLFNGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIVPNEITLLSLIT 300

Query: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360
           ECGFVG LDLGKW H+YLLRNGFGMSL L TALIDMYGKCGQV YARALFN V +KDVKI
Sbjct: 301 ECGFVGALDLGKWLHSYLLRNGFGMSLTLTTALIDMYGKCGQVAYARALFNVVDEKDVKI 360

Query: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420
           WS LISAYAH SC+DQ F+LF++ML++++KPN VTMVSLLSLCAE GALDLG+WTHAYI 
Sbjct: 361 WSALISAYAHTSCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYII 420

Query: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480
            HG+EVD++LETALINMYAKCGD+  ARSLF+EA QRDI MWN MMAGFS+HGCGKEALE
Sbjct: 421 HHGVEVDIVLETALINMYAKCGDLKTARSLFDEATQRDIHMWNAMMAGFSIHGCGKEALE 480

Query: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540
           LFS+ME HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDLL
Sbjct: 481 LFSDMECHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLL 540

Query: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600
           GRA  LD AH+IIENMPMRPNTIIWGALLAACKLHKNL LG+VAARKILELDP+NCGY V
Sbjct: 541 GRAKRLDAAHSIIENMPMRPNTIIWGALLAACKLHKNLPLGKVAARKILELDPENCGYRV 600

Query: 601 LKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYE 660
           LKSNIYAS KRW +VTS+RE+MSH G+KKEPGLSW EVNGSVHHF+SGDK CTQ  KV+E
Sbjct: 601 LKSNIYASEKRWTNVTSIRESMSHLGMKKEPGLSWTEVNGSVHHFRSGDKTCTQARKVHE 660

Query: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720
           MVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+K
Sbjct: 661 MVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIK 720

Query: 721 NLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           NLRICDDCHA+TKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of CSPI01G33570 vs. NCBI nr
Match: XP_011660280.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial [Cucumis sativus])

HSP 1 Score: 1553.1 bits (4020), Expect = 0.0e+00
Identity = 758/765 (99.08%), Positives = 762/765 (99.61%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGFA 120
           SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSG LGRELHGFAQKNGFA
Sbjct: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGDLGRELHGFAQKNGFA 120

Query: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180
           SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM
Sbjct: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180

Query: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKGG 240
           QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGD KMEVSMTTALIDMYCKGG
Sbjct: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGG 240

Query: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLIT 300
           CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFN MLEEKLFPNEITLLSLIT
Sbjct: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLIT 300

Query: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360
           ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI
Sbjct: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360

Query: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420
           WSVLISAYAHVSCMDQVFNLFVEMLNND+KPNNVTMVSLLSLCAEAGALDLGKWTHAYIN
Sbjct: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420

Query: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480
           RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE
Sbjct: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480

Query: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540
           LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL
Sbjct: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540

Query: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600
           GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV
Sbjct: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600

Query: 601 LKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYE 660
           LKSNIYASAKRWNDVTSVREAMSHSG+KKEPGLSWIEV+GSVHHFKSGDKACTQTTKVYE
Sbjct: 601 LKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYE 660

Query: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720
           MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK
Sbjct: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720

Query: 721 NLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           NLRICDDCHA+TKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW
Sbjct: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of CSPI01G33570 vs. NCBI nr
Match: KAA0062552.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK28774.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1464.9 bits (3791), Expect = 0.0e+00
Identity = 715/766 (93.34%), Positives = 735/766 (95.95%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLIL SPS SGCSG+SHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSND-AAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGF 120
           SYTNNHLPQAS NCYLHMR+ND AAALDNFILPSLLKACAQASS  LGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSA LVFD+MPERDVVSW+TMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKG 240
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYIVRNVGD KMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLI 300
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFN MLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 360
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 420
           IWS LISAYAHVSCMDQVFNLF+EML+N++KPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRE MSH G+KKEPGLSWIEVNGSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 720
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           KNLRICDDCHA+ KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of CSPI01G33570 vs. NCBI nr
Match: XP_008462708.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucumis melo])

HSP 1 Score: 1464.1 bits (3789), Expect = 0.0e+00
Identity = 714/766 (93.21%), Positives = 736/766 (96.08%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLIL SPS SGCSG+SHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSND-AAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGF 120
           SYTNNHLPQAS NCYLHMR+ND AAALDNFILPSLLKACAQASS  LGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSA LVFD+MPERDVVSW+TMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKG 240
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYI+RNVGD KMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLI 300
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFN MLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 360
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 420
           IWS LISAYAHVSCMDQVFNLF+EML+N++KPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH+FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRE MSH G+KKEPGLSWIEVNGSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 720
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           KNLRICDDCHA+ KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of CSPI01G33570 vs. NCBI nr
Match: XP_038879151.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 1376.3 bits (3561), Expect = 0.0e+00
Identity = 666/767 (86.83%), Positives = 716/767 (93.35%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCS-GHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLI 60
           MNQLIL SPSFSG   GHSHL+LQQT Q+HAHFIKTQFH PHPFFSQ+HF+PEANYNLLI
Sbjct: 1   MNQLILSSPSFSGSGHGHSHLSLQQTQQIHAHFIKTQFHRPHPFFSQTHFSPEANYNLLI 60

Query: 61  SSYTNNHLPQASFNCYLHMRSND-AAALDNFILPSLLKACAQASSGYLGRELHGFAQKNG 120
           SSYTNNHLPQASF  YLHMR+ D AAALDNFILPSLLKACAQAS G LGRELHGFA KNG
Sbjct: 61  SSYTNNHLPQASFKLYLHMRTTDAAAALDNFILPSLLKACAQASCGVLGRELHGFAIKNG 120

Query: 121 FASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVR 180
           FA DVFVCNALMNMYEKCG LV ARLVFD+MP+RDVVSW+TMLGCYVRSK++ EAL LVR
Sbjct: 121 FAPDVFVCNALMNMYEKCGSLVFARLVFDKMPDRDVVSWSTMLGCYVRSKSYDEALVLVR 180

Query: 181 EMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCK 240
           EM FVGVKLSGVALIS+I  FG LLDMKSGRAVHGYIVRNV D KMEV +TTALI+MYCK
Sbjct: 181 EMHFVGVKLSGVALISMIGAFGELLDMKSGRAVHGYIVRNVVDEKMEVPLTTALINMYCK 240

Query: 241 GGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSL 300
           G  L SAQRLFD L ++SVVSWTVMIAGCIR+CRL EGA NFN MLEE++FPNEITLL+L
Sbjct: 241 GERLESAQRLFDVLPQKSVVSWTVMIAGCIRNCRLVEGANNFNRMLEEEVFPNEITLLNL 300

Query: 301 ITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDV 360
           ITECGFVGTLDLGKWFHAYLLRN FGMSLALVTALIDMYGKCGQVGYARALFNG+++KDV
Sbjct: 301 ITECGFVGTLDLGKWFHAYLLRNEFGMSLALVTALIDMYGKCGQVGYARALFNGIEEKDV 360

Query: 361 KIWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAY 420
           KIWS L+ AYAH SC+DQ FNLF+EML++++KPN VTMV LLSLCAEAGAL+LGKWTH Y
Sbjct: 361 KIWSALLLAYAHASCIDQAFNLFLEMLDSEVKPNKVTMVGLLSLCAEAGALNLGKWTHTY 420

Query: 421 INRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEA 480
           INRHGLEVDV+LETALINMYAKCGD+TIARSLF+EA QRDI MWN MMAGFSMHGCGKEA
Sbjct: 421 INRHGLEVDVVLETALINMYAKCGDLTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEA 480

Query: 481 LELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVD 540
           LELFSEME +GVEPNDITF+S+FHACSHSGLVV+GKK+FN+MVHDFGIVPK+EHYGCLVD
Sbjct: 481 LELFSEMEGYGVEPNDITFISVFHACSHSGLVVDGKKHFNRMVHDFGIVPKIEHYGCLVD 540

Query: 541 LLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGY 600
           LLGRAGHLDEAHNIIENMPM+PNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGY
Sbjct: 541 LLGRAGHLDEAHNIIENMPMKPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGY 600

Query: 601 SVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKV 660
           SVLKSNIYASAKRW DVTSVRE MSH G+KKEPGLSWIEVNGSVHHFKSGDK CTQTT+V
Sbjct: 601 SVLKSNIYASAKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTEV 660

Query: 661 YEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRI 720
           YEMVTEMCIKLRE+GYTPNT+AVLLN++EEEKES LSYHSEKLA AFGLISTAPGTPIRI
Sbjct: 661 YEMVTEMCIKLRETGYTPNTSAVLLNVEEEEKESTLSYHSEKLAMAFGLISTAPGTPIRI 720

Query: 721 VKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           VKNLRICDDCHA+TKLLSKIYGRTIIVRDRNRFHHFSEG+CSC+GYW
Sbjct: 721 VKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGFCSCLGYW 767

BLAST of CSPI01G33570 vs. NCBI nr
Match: XP_023533718.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1319.7 bits (3414), Expect = 0.0e+00
Identity = 633/765 (82.75%), Positives = 698/765 (91.24%), Query Frame = 0

Query: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           M+QLIL + S S  SGHSHLNLQQTHQ+HAHFIKTQF NPH FFS+S+FTPEAN+NLLIS
Sbjct: 1   MDQLILSAASPSR-SGHSHLNLQQTHQIHAHFIKTQFRNPHSFFSRSNFTPEANFNLLIS 60

Query: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKNGFA 120
           SYT+NHLPQA+FN Y HMR+ DAAA+DNFI+PSLLKACAQASS   GRE+HGFA KNGF 
Sbjct: 61  SYTDNHLPQAAFNLYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFV 120

Query: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180
           SDVFVCNALMNMYEKCG LVSA LVFD+MP+RDVVSW+TMLGCYVRSK+FGEA RLVREM
Sbjct: 121 SDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREM 180

Query: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYCKGG 240
            FVGV+LS VALIS+I VFG L DMKSGRA+HGY+VRNVG+ +MEV +TTALIDMYCKG 
Sbjct: 181 HFVGVRLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERMEVPLTTALIDMYCKGD 240

Query: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLSLIT 300
            LASA RLFD LS+R+VVSWT +IAGCIRSCR DEGAKNF+ MLEE + PNEITLLSLIT
Sbjct: 241 NLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFDEGAKNFSRMLEENIVPNEITLLSLIT 300

Query: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360
           ECGFVG LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGV++KDVKI
Sbjct: 301 ECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKI 360

Query: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420
           WS LISAYAH SC+DQ F LF++ML++++KPN VTMVSLLSLCAE GALDLG+WTHAYIN
Sbjct: 361 WSALISAYAHASCIDQAFGLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYIN 420

Query: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480
           RHGLEVDV+LETALINMYAKCGD+  ARSLF+EA +RDI MWN MMAGFS+HGCGKEALE
Sbjct: 421 RHGLEVDVVLETALINMYAKCGDLKTARSLFDEATRRDIHMWNAMMAGFSIHGCGKEALE 480

Query: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540
           LF +ME HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDLL
Sbjct: 481 LFLDMECHGVEPNDITFISLFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLL 540

Query: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600
           GRA  LD AH+IIENMPMRPNTI+WGALLAACKLHKNL LGEVAARKILELDP+NCGY V
Sbjct: 541 GRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLTLGEVAARKILELDPENCGYRV 600

Query: 601 LKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYE 660
           LKSNIYAS KRW DVTSVRE MSH G+KKEPGLSWIEVNGSVHHF+SGDK CTQT KV+E
Sbjct: 601 LKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHE 660

Query: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720
           MVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+K
Sbjct: 661 MVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIK 720

Query: 721 NLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           NLRICDDCHA+TKLLSKIYGRTIIVRDRNRFHHF EGYCSC+GYW
Sbjct: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFREGYCSCLGYW 764

BLAST of CSPI01G33570 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 545.0 bits (1403), Expect = 9.3e-155
Identity = 275/712 (38.62%), Positives = 423/712 (59.41%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASFNCYLHMRSNDA-AALDNFILPSLLKACAQASSGYLGRELHGF 114
           Y+ ++  +        +   ++ MR +D    + NF    LLK C   +   +G+E+HG 
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTY--LLKVCGDEAELRVGKEIHGL 162

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEA 174
             K+GF+ D+F    L NMY KC  +  AR VFD+MPERD+VSW T++  Y ++     A
Sbjct: 163 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 222

Query: 175 LRLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALI 234
           L +V+ M    +K S + ++S++     L  +  G+ +HGY +R+  D  + +S  TAL+
Sbjct: 223 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS--TALV 282

Query: 235 DMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEI 294
           DMY K G L +A++LFD + +R+VVSW  MI   +++    E    F  ML+E + P ++
Sbjct: 283 DMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDV 342

Query: 295 TLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV 354
           +++  +  C  +G L+ G++ H   +  G   ++++V +LI MY KC +V  A ++F  +
Sbjct: 343 SVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKL 402

Query: 355 KKKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGK 414
           + + +  W+ +I  +A         N F +M +  +KP+  T VS+++  AE       K
Sbjct: 403 QSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAK 462

Query: 415 WTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHG 474
           W H  + R  L+ +V + TAL++MYAKCG + IAR +F+   +R +  WN M+ G+  HG
Sbjct: 463 WIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHG 522

Query: 475 CGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHY 534
            GK ALELF EM+   ++PN +TF+S+  ACSHSGLV  G K F  M  ++ I   M+HY
Sbjct: 523 FGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHY 582

Query: 535 GCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDP 594
           G +VDLLGRAG L+EA + I  MP++P   ++GA+L AC++HKN+   E AA ++ EL+P
Sbjct: 583 GAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNP 642

Query: 595 QNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACT 654
            + GY VL +NIY +A  W  V  VR +M   G++K PG S +E+   VH F SG  A  
Sbjct: 643 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 702

Query: 655 QTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPG 714
            + K+Y  + ++   ++E+GY P+T  V L ++ + KE  LS HSEKLA +FGL++T  G
Sbjct: 703 DSKKIYAFLEKLICHIKEAGYVPDTNLV-LGVENDVKEQLLSTHSEKLAISFGLLNTTAG 762

Query: 715 TPIRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           T I + KNLR+C DCH +TK +S + GR I+VRD  RFHHF  G CSC  YW
Sbjct: 763 TTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CSPI01G33570 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 543.9 bits (1400), Expect = 2.1e-154
Identity = 294/770 (38.18%), Positives = 443/770 (57.53%), Query Frame = 0

Query: 20  LNLQQTHQLHAHFIKT-QFHNPH---PFFSQSHFT---------------PEAN---YNL 79
           ++L+Q  Q H H I+T  F +P+     F+ +  +               P+ N   +N 
Sbjct: 41  VSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNT 100

Query: 80  LISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFAQKN 139
           LI +Y +   P  S   +L M S      + +  P L+KA A+ SS  LG+ LHG A K+
Sbjct: 101 LIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKS 160

Query: 140 GFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLV 199
              SDVFV N+L++ Y  CG L SA  VF  + E+DVVSW +M+  +V+  +  +AL L 
Sbjct: 161 AVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELF 220

Query: 200 REMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALIDMYC 259
           ++M+   VK S V ++ +++    + +++ GR V  YI  N     + +++  A++DMY 
Sbjct: 221 KKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEEN--RVNVNLTLANAMLDMYT 280

Query: 260 KGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEITLLS 319
           K G +  A+RLFD + ++  V+WT M+ G                               
Sbjct: 281 KCGSIEDAKRLFDAMEEKDNVTWTTMLDG------------------------------- 340

Query: 320 LITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKD 379
                                                  Y        AR + N + +KD
Sbjct: 341 ---------------------------------------YAISEDYEAAREVLNSMPQKD 400

Query: 380 VKIWSVLISAYAHVSCMDQVFNLFVEM-LNNDMKPNNVTMVSLLSLCAEAGALDLGKWTH 439
           +  W+ LISAY      ++   +F E+ L  +MK N +T+VS LS CA+ GAL+LG+W H
Sbjct: 401 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 460

Query: 440 AYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGK 499
           +YI +HG+ ++  + +ALI+MY+KCGD+  +R +FN   +RD+ +W+ M+ G +MHGCG 
Sbjct: 461 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 520

Query: 500 EALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCL 559
           EA+++F +M+   V+PN +TF ++F ACSH+GLV E +  F++M  ++GIVP+ +HY C+
Sbjct: 521 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 580

Query: 560 VDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNC 619
           VD+LGR+G+L++A   IE MP+ P+T +WGALL ACK+H NL L E+A  ++LEL+P+N 
Sbjct: 581 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 640

Query: 620 GYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTT 679
           G  VL SNIYA   +W +V+ +R+ M  +G+KKEPG S IE++G +H F SGD A   + 
Sbjct: 641 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 700

Query: 680 KVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEE-KESALSYHSEKLATAFGLISTAPGTP 739
           KVY  + E+  KL+ +GY P  + VL  I+EEE KE +L+ HSEKLA  +GLIST     
Sbjct: 701 KVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKV 738

Query: 740 IRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           IR++KNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Sbjct: 761 IRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CSPI01G33570 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 540.8 bits (1392), Expect = 1.8e-153
Identity = 292/774 (37.73%), Positives = 428/774 (55.30%), Query Frame = 0

Query: 17  HSHLNLQQTHQLHAHFIKTQFHNPHPFFSQ--------SHF------------TPEAN-- 76
           H+   LQ    +HA  IK   HN +   S+         HF              E N  
Sbjct: 41  HNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLL 100

Query: 77  -YNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGF 136
            +N +   +  +  P ++   Y+ M S      +++  P +LK+CA++ +   G+++HG 
Sbjct: 101 IWNTMFRGHALSSDPVSALKLYVCMISLGLLP-NSYTFPFVLKSCAKSKAFKEGQQIHGH 160

Query: 137 AQKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEA 196
             K G   D++V  +L++MY + G L  A  VFD+ P RDVVS+                
Sbjct: 161 VLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSY---------------- 220

Query: 197 LRLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVSMTTALI 256
                                                                   TALI
Sbjct: 221 --------------------------------------------------------TALI 280

Query: 257 DMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKLFPNEI 316
             Y   G + +AQ+LFD +  + VVSW  MI+G   +    E  + F  M++  + P+E 
Sbjct: 281 KGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDES 340

Query: 317 TLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV 376
           T++++++ C   G+++LG+  H ++  +GFG +L +V ALID+Y KCG++  A  LF  +
Sbjct: 341 TMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERL 400

Query: 377 KKKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDMKPNNVTMVSLLSLCAEAGALDLGK 436
             KDV  W+ LI  Y H++   +   LF EML +   PN+VTM+S+L  CA  GA+D+G+
Sbjct: 401 PYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGR 460

Query: 437 WTHAYINRH--GLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSM 496
           W H YI++   G+     L T+LI+MYAKCGD+  A  +FN  + + +  WN M+ GF+M
Sbjct: 461 WIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAM 520

Query: 497 HGCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKME 556
           HG    + +LFS M   G++P+DITFV +  ACSHSG++  G+  F  M  D+ + PK+E
Sbjct: 521 HGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLE 580

Query: 557 HYGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILEL 616
           HYGC++DLLG +G   EA  +I  M M P+ +IW +LL ACK+H N+ LGE  A  ++++
Sbjct: 581 HYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKI 640

Query: 617 DPQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKA 676
           +P+N G  VL SNIYASA RWN+V   R  ++  G+KK PG S IE++  VH F  GDK 
Sbjct: 641 EPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKF 700

Query: 677 CTQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTA 736
             +  ++Y M+ EM + L ++G+ P+T+ VL  ++EE KE AL +HSEKLA AFGLIST 
Sbjct: 701 HPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTK 741

Query: 737 PGTPIRIVKNLRICDDCHASTKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 766
           PGT + IVKNLR+C +CH +TKL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 761 PGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CSPI01G33570 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 537.7 bits (1384), Expect = 1.5e-152
Identity = 291/749 (38.85%), Positives = 433/749 (57.81%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFA 114
           YN LI  Y ++ L   +   +L M  N   + D +  P  L ACA++ +   G ++HG  
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRM-MNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLI 161

Query: 115 QKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEAL 174
            K G+A D+FV N+L++ Y +CG L SAR VFD+M ER+VVSWT+M+  Y R     +A+
Sbjct: 162 VKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAV 221

Query: 175 ----RLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVS--M 234
               R+VR+ +   V  + V ++ +I+    L D+++G  V+ +I RN G   +EV+  M
Sbjct: 222 DLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAFI-RNSG---IEVNDLM 281

Query: 235 TTALIDMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKL 294
            +AL+DMY K   +  A+RLFD     ++     M +  +R     E    FN+M++  +
Sbjct: 282 VSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGV 341

Query: 295 FPNEITLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKC-------- 354
            P+ I++LS I+ C  +  +  GK  H Y+LRNGF     +  ALIDMY KC        
Sbjct: 342 RPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFR 401

Query: 355 -----------------------GQVGYARALFNGVKKKDVKIWSVLISAYAHVSCMDQV 414
                                  G+V  A   F  + +K++  W+ +IS     S  ++ 
Sbjct: 402 IFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEA 461

Query: 415 FNLFVEMLNND-MKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEVDVILETALIN 474
             +F  M + + +  + VTM+S+ S C   GALDL KW + YI ++G+++DV L T L++
Sbjct: 462 IEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVD 521

Query: 475 MYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEMESHGVEPNDIT 534
           M+++CGD   A S+FN    RD+  W   +   +M G  + A+ELF +M   G++P+ + 
Sbjct: 522 MFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVA 581

Query: 535 FVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENM 594
           FV    ACSH GLV +GK+ F  M+   G+ P+  HYGC+VDLLGRAG L+EA  +IE+M
Sbjct: 582 FVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDM 641

Query: 595 PMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVT 654
           PM PN +IW +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+ 
Sbjct: 642 PMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMA 701

Query: 655 SVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTP 714
            VR +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P
Sbjct: 702 KVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVP 761

Query: 715 NTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICDDCHASTKLLS 766
           + + VL+++DE+EK   LS HSEKLA A+GLIS+  GT IRIVKNLR+C DCH+  K  S
Sbjct: 762 DLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFAS 821

BLAST of CSPI01G33570 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 533.9 bits (1374), Expect = 2.2e-151
Identity = 290/745 (38.93%), Positives = 431/745 (57.85%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGYLGRELHGFA 114
           YN LI  Y ++ L   +   +L M  N   + D +  P  L ACA++ +   G ++HG  
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRM-MNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLI 161

Query: 115 QKNGFASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEAL 174
            K G+A D+FV N+L++ Y +CG L SAR VFD+M ER+VVSWT+M+  Y R     +A+
Sbjct: 162 VKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAV 221

Query: 175 ----RLVREMQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDGKMEVS--M 234
               R+VR+ +   V  + V ++ +I+    L D+++G  V+ +I RN G   +EV+  M
Sbjct: 222 DLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAFI-RNSG---IEVNDLM 281

Query: 235 TTALIDMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNIMLEEKL 294
            +AL+DMY K   +  A+RLFD     ++     M +  +R     E    FN+M++  +
Sbjct: 282 VSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGV 341

Query: 295 FPNEITLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKC-------- 354
            P+ I++LS I+ C  +  +  GK  H Y+LRNGF     +  ALIDMY KC        
Sbjct: 342 RPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFR 401

Query: 355 -----------------------GQVGYARALFNGVKKKDVKIWSVLISAYAHVSCMDQV 414
                                  G+V  A   F  + +K++  W+ +IS     S  ++ 
Sbjct: 402 IFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEA 461

Query: 415 FNLFVEMLNND-MKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEVDVILETALIN 474
             +F  M + + +  + VTM+S+ S C   GALDL KW + YI ++G+++DV L T L++
Sbjct: 462 IEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVD 521

Query: 475 MYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEMESHGVEPNDIT 534
           M+++CGD   A S+FN    RD+  W   +   +M G  + A+ELF +M   G++P+ + 
Sbjct: 522 MFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVA 581

Query: 535 FVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENM 594
           FV    ACSH GLV +GK+ F  M+   G+ P+  HYGC+VDLLGRAG L+EA  +IE+M
Sbjct: 582 FVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDM 641

Query: 595 PMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVT 654
           PM PN +IW +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+ 
Sbjct: 642 PMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMA 701

Query: 655 SVREAMSHSGIKKEPGLSWIEVNGSVHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTP 714
            VR +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P
Sbjct: 702 KVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVP 761

Query: 715 NTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICDDCHASTKLLS 762
           + + VL+++DE+EK   LS HSEKLA A+GLIS+  GT IRIVKNLR+C DCH+  K  S
Sbjct: 762 DLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFAS 821
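
The Identity and Positives percentages in the HSP summary lines above are the match counts divided by the alignment length. The following is a minimal Python sketch that recomputes them, assuming the summary line format shown on this page; the helper name `parse_hsp_summary` and the returned keys are hypothetical and not part of any BLAST tool.

```python
import re

# Hypothetical pattern for the summary line format used on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, positive, length_pos = (int(g) for g in match.groups())
    if length != length_pos:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        # Percentages are rounded to two decimals, as displayed on the page.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 291/749 (38.85%), Positives = 433/749 (57.81%), Query Frame = 0"
    print(parse_hsp_summary(line))
```

Recomputing the percentages from the counts is a simple consistency check on a record before the values are used for filtering hits.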

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3E6Q1          1.3e-153  38.62     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
O82380          2.9e-153  38.18     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LN01          2.5e-152  37.73     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9LUJ2          2.1e-151  38.85     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9SN39          5.2e-150  38.62     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...

Match Name      E-value   Identity  Description
A0A0A0LYC2      0.0e+00   99.08     DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G6902...
A0A5A7V2V9      0.0e+00   93.34     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CJ58      0.0e+00   93.21     pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc...
A0A6J1HA74      0.0e+00   82.48     pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc...
A0A6J1JKG9      0.0e+00   81.44     pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima O...

Match Name      E-value   Identity  Description
XP_011660280.1  0.0e+00   99.08     pentatricopeptide repeat-containing protein At3g26782, mitochondrial [Cucumis sa...
KAA0062552.1    0.0e+00   93.34     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK28774...
XP_008462708.1  0.0e+00   93.21     PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-...
XP_038879151.1  0.0e+00   86.83     pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benin...
XP_023533718.1  0.0e+00   82.75     pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp...

Match Name      E-value   Identity  Description
AT1G11290.1     9.3e-155  38.62     Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1     2.1e-154  38.18     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1     1.8e-153  37.73     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.2     1.5e-152  38.85     INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT3G22690.1     2.2e-151  38.93     CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 50..218; e-value: 8.3E-26; score: 93.1
    coord: 432..674; e-value: 2.3E-42; score: 147.5
    coord: 306..428; e-value: 2.5E-20; score: 75.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 219..305; e-value: 2.2E-15; score: 58.5
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 631..754; e-value: 3.5E-37; score: 127.1
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 495..528; e-value: 0.0029; score: 15.7
    coord: 360..392; e-value: 1.8E-4; score: 19.5
    coord: 125..155; e-value: 9.7E-4; score: 17.2
    coord: 155..187; e-value: 1.0E-4; score: 20.3
    coord: 461..493; e-value: 1.4E-8; score: 32.4
    coord: 258..292; e-value: 0.0021; score: 16.1
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 258..286; e-value: 0.3; score: 11.4
    coord: 230..255; e-value: 0.013; score: 15.7
    coord: 331..357; e-value: 0.098; score: 12.9
    coord: 432..457; e-value: 0.021; score: 15.0
    coord: 155..182; e-value: 2.9E-4; score: 20.9
    coord: 360..387; e-value: 0.0055; score: 16.9
    coord: 125..153; e-value: 0.0046; score: 17.1
    coord: 532..557; e-value: 0.27; score: 11.6
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 458..505; e-value: 3.5E-12; score: 46.3
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 256..290; score: 9.361008
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 357..391; score: 9.952918
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 122..152; score: 8.53891
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 458..492; score: 12.441133
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 153..187; score: 10.676364
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 166..306
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 34..182
    coord: 243..369
None  No IPR available  PANTHER  PTHR47924:SF1  SUBFAMILY NOT NAMED
    coord: 166..306
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 339..759
None  No IPR available  PANTHER  PTHR47924:SF1  SUBFAMILY NOT NAMED
    coord: 243..369
    coord: 339..759
None  No IPR available  PANTHER  PTHR47924:SF1  SUBFAMILY NOT NAMED
    coord: 34..182

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G33570.1  CSPI01G33570.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding