CSPI04G22400 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G22400
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr4: 20710811 .. 20712427 (+)
RNA-Seq ExpressionCSPI04G22400
SyntenyCSPI04G22400
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCGCCAGAAAGCATGGGGATGAAGAAAACAAGCTTCAAGCTTCCCTTGGCCTTGATCGTTTCTTTTGGATGTAACAATGGCCGCTCTCTCCCTTTCCCCCTCTCTCAACCATTCACAATCCTTTCCCCCCATTTGTACCCCAAATCTTTTTGTTCTCAGTCGGTGAACGGCAAATCGAGGAACAGTACTGGCCACCGGGAAACCAACGACACCCAAATGAAGACCCAAAAGCCAATTTCTCCTTTTCGATTGTCTTCACTTCTTCGTCTCCAAAAGGACCCAACGCTCGCTCTTCAACTCTTCCTCAACCCTAATCCCAGCTCTTCAGAACCCCCCAAACCCTTTCGTTACTCTTTACTTTCCTACGATCTCATCATCTCTAAGCTCGGACGTGCCAAAATGTTCGATGAAATGGAAGAAATCCTCCAACAGCTCAAGCAAGAGACTCGTTTTGCTCCTCACGAGGTCATTTTCTGTAATGTCATTGCCTTCTATGGCCGAGCCCACCTCCCTGATCGTGCCTTCCAAGTGTTCGAAAGAATTCCATCGTTCCGTTGCAAGCGGACGGTGAAATCTGTAAATTCTTTGTTGGCTGCATTGTTGAAGAATCGGCAACTTGAGAAAATGACGCAAGTTTTTGTGGACATTAGTAACTATGGCTCCCCTGATGCTTGTACTTTTAATATTTTGATTCATGCTGCTTGTTTATGTGGTGATTTGGATGCTGTATGGGGAGTGTTTGATGAAATGCAAAAGAGAGGTGTAAAACCGAACGTGGTTACTTTTGGGACTCTGATTTATGGGCTTTCTCTGAATTCTAAGTTGAAAGAAGCATTGAGATTGAAAGAGGATATGGTGAAAGTGTACATGATTAAACCTAATGCGTCAATATATACTACTTTGATCAAAGGATTTTGTGGGGTTGGAGAATTGAATTTCGCTTTCAAGCTAAAGGAGGAGATGGTTACTAGCAACGTAAAATTGGTCTCGGCAGTTTACTCTACTTTGATCAGTGCGCTCTTCAAGCATGGTAGGAAAGAAGAAGTTTCTGACATTTTAAGAGAAATGGGAGAGAATGGGTGCAAACCTGATACCGTTACCTACAATGCCATTATCAATGGACATTGTAAAGAAAACGATCTCGAATCTGCACATAGAGTTATGGATGAAATGGTGGAGAAAGGGTGTAAGCCAGATGTGTTCAGTTTCAACACAATCATTGGATGGTTATGTAAGGAGGGGAAATTAGATAAAGCAATGGACTTGCTTGAAGATATGCCGAGACGAGGTTGCCCTCCTGATGTGTTATCATACAGGATAATTTTTGATGGGCTGTGTGAAATGATGCAGCTAAAGGAAGCAACTTCCATACTGGACGAGATGATCTTCAAGGGTTATGTGCCTCGTAATGAAAGCATAAACAAACTCGTAGACAGGTTGTGCCAGGAATGCAATATGGAGCTGTTGTGGATGGTCTTAAACAGTCTGGGAAGAGGAAATCGCATGAATATGGACATGTGGGCTAGAGTTGTTGCTTTTGTTTATAAGGAGAACCTATCGGAATCCTCCAACTTAATTGACTCATTGATTAGTTGATTTCATAGCTCTATCAGGGAC

mRNA sequence

CCGCCAGAAAGCATGGGGATGAAGAAAACAAGCTTCAAGCTTCCCTTGGCCTTGATCGTTTCTTTTGGATGTAACAATGGCCGCTCTCTCCCTTTCCCCCTCTCTCAACCATTCACAATCCTTTCCCCCCATTTGTACCCCAAATCTTTTTGTTCTCAGTCGGTGAACGGCAAATCGAGGAACAGTACTGGCCACCGGGAAACCAACGACACCCAAATGAAGACCCAAAAGCCAATTTCTCCTTTTCGATTGTCTTCACTTCTTCGTCTCCAAAAGGACCCAACGCTCGCTCTTCAACTCTTCCTCAACCCTAATCCCAGCTCTTCAGAACCCCCCAAACCCTTTCGTTACTCTTTACTTTCCTACGATCTCATCATCTCTAAGCTCGGACGTGCCAAAATGTTCGATGAAATGGAAGAAATCCTCCAACAGCTCAAGCAAGAGACTCGTTTTGCTCCTCACGAGGTCATTTTCTGTAATGTCATTGCCTTCTATGGCCGAGCCCACCTCCCTGATCGTGCCTTCCAAGTGTTCGAAAGAATTCCATCGTTCCGTTGCAAGCGGACGGTGAAATCTGTAAATTCTTTGTTGGCTGCATTGTTGAAGAATCGGCAACTTGAGAAAATGACGCAAGTTTTTGTGGACATTAGTAACTATGGCTCCCCTGATGCTTGTACTTTTAATATTTTGATTCATGCTGCTTGTTTATGTGGTGATTTGGATGCTGTATGGGGAGTGTTTGATGAAATGCAAAAGAGAGGTGTAAAACCGAACGTGGTTACTTTTGGGACTCTGATTTATGGGCTTTCTCTGAATTCTAAGTTGAAAGAAGCATTGAGATTGAAAGAGGATATGGTGAAAGTGTACATGATTAAACCTAATGCGTCAATATATACTACTTTGATCAAAGGATTTTGTGGGGTTGGAGAATTGAATTTCGCTTTCAAGCTAAAGGAGGAGATGGTTACTAGCAACGTAAAATTGGTCTCGGCAGTTTACTCTACTTTGATCAGTGCGCTCTTCAAGCATGGTAGGAAAGAAGAAGTTTCTGACATTTTAAGAGAAATGGGAGAGAATGGGTGCAAACCTGATACCGTTACCTACAATGCCATTATCAATGGACATTGTAAAGAAAACGATCTCGAATCTGCACATAGAGTTATGGATGAAATGGTGGAGAAAGGGTGTAAGCCAGATGTGTTCAGTTTCAACACAATCATTGGATGGTTATGTAAGGAGGGGAAATTAGATAAAGCAATGGACTTGCTTGAAGATATGCCGAGACGAGGTTGCCCTCCTGATGTGTTATCATACAGGATAATTTTTGATGGGCTGTGTGAAATGATGCAGCTAAAGGAAGCAACTTCCATACTGGACGAGATGATCTTCAAGGGTTATGTGCCTCGTAATGAAAGCATAAACAAACTCGTAGACAGGTTGTGCCAGGAATGCAATATGGAGCTGTTGTGGATGGTCTTAAACAGTCTGGGAAGAGGAAATCGCATGAATATGGACATGTGGGCTAGAGTTGTTGCTTTTGTTTATAAGGAGAACCTATCGGAATCCTCCAACTTAATTGACTCATTGATTAGTTGATTTCATAGCTCTATCAGGGAC

Coding sequence (CDS)

ATGGGGATGAAGAAAACAAGCTTCAAGCTTCCCTTGGCCTTGATCGTTTCTTTTGGATGTAACAATGGCCGCTCTCTCCCTTTCCCCCTCTCTCAACCATTCACAATCCTTTCCCCCCATTTGTACCCCAAATCTTTTTGTTCTCAGTCGGTGAACGGCAAATCGAGGAACAGTACTGGCCACCGGGAAACCAACGACACCCAAATGAAGACCCAAAAGCCAATTTCTCCTTTTCGATTGTCTTCACTTCTTCGTCTCCAAAAGGACCCAACGCTCGCTCTTCAACTCTTCCTCAACCCTAATCCCAGCTCTTCAGAACCCCCCAAACCCTTTCGTTACTCTTTACTTTCCTACGATCTCATCATCTCTAAGCTCGGACGTGCCAAAATGTTCGATGAAATGGAAGAAATCCTCCAACAGCTCAAGCAAGAGACTCGTTTTGCTCCTCACGAGGTCATTTTCTGTAATGTCATTGCCTTCTATGGCCGAGCCCACCTCCCTGATCGTGCCTTCCAAGTGTTCGAAAGAATTCCATCGTTCCGTTGCAAGCGGACGGTGAAATCTGTAAATTCTTTGTTGGCTGCATTGTTGAAGAATCGGCAACTTGAGAAAATGACGCAAGTTTTTGTGGACATTAGTAACTATGGCTCCCCTGATGCTTGTACTTTTAATATTTTGATTCATGCTGCTTGTTTATGTGGTGATTTGGATGCTGTATGGGGAGTGTTTGATGAAATGCAAAAGAGAGGTGTAAAACCGAACGTGGTTACTTTTGGGACTCTGATTTATGGGCTTTCTCTGAATTCTAAGTTGAAAGAAGCATTGAGATTGAAAGAGGATATGGTGAAAGTGTACATGATTAAACCTAATGCGTCAATATATACTACTTTGATCAAAGGATTTTGTGGGGTTGGAGAATTGAATTTCGCTTTCAAGCTAAAGGAGGAGATGGTTACTAGCAACGTAAAATTGGTCTCGGCAGTTTACTCTACTTTGATCAGTGCGCTCTTCAAGCATGGTAGGAAAGAAGAAGTTTCTGACATTTTAAGAGAAATGGGAGAGAATGGGTGCAAACCTGATACCGTTACCTACAATGCCATTATCAATGGACATTGTAAAGAAAACGATCTCGAATCTGCACATAGAGTTATGGATGAAATGGTGGAGAAAGGGTGTAAGCCAGATGTGTTCAGTTTCAACACAATCATTGGATGGTTATGTAAGGAGGGGAAATTAGATAAAGCAATGGACTTGCTTGAAGATATGCCGAGACGAGGTTGCCCTCCTGATGTGTTATCATACAGGATAATTTTTGATGGGCTGTGTGAAATGATGCAGCTAAAGGAAGCAACTTCCATACTGGACGAGATGATCTTCAAGGGTTATGTGCCTCGTAATGAAAGCATAAACAAACTCGTAGACAGGTTGTGCCAGGAATGCAATATGGAGCTGTTGTGGATGGTCTTAAACAGTCTGGGAAGAGGAAATCGCATGAATATGGACATGTGGGCTAGAGTTGTTGCTTTTGTTTATAAGGAGAACCTATCGGAATCCTCCAACTTAATTGACTCATTGATTAGTTGA

Protein sequence

MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTGHRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS*
Homology
BLAST of CSPI04G22400 vs. ExPASy Swiss-Prot
Match: Q9MAG8 (Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis thaliana OX=3702 GN=At1g53330 PE=3 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 3.8e-144
Identity = 250/455 (54.95%), Postives = 331/455 (72.75%), Query Frame = 0

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           M   K +S FRL+SLLR + DP+ A++LF NP+P S+ P +PFRYSLL YD+II+KLG +
Sbjct: 1   MSAVKSVSSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGS 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           KMFDE++++L  LK +TR  P E+IFCNVI F+GR  LP RA  +F+ +P +RC+RTVKS
Sbjct: 61  KMFDELDQVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           +NSLL+ALLK  +LEKM +    I  +G PDACT+NILIH     G  D    +FDEM K
Sbjct: 121 LNSLLSALLKCGELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKLFDEMVK 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           + VKP  VTFGTLI+GL  +S++KEAL++K DM+KVY ++P   IY +LIK  C +GEL+
Sbjct: 181 KKVKPTGVTFGTLIHGLCKDSRVKEALKMKHDMLKVYGVRPTVHIYASLIKALCQIGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
           FAFKLK+E     +K+ +A+YSTLIS+L K GR  EVS IL EM E GCKPDTVTYN +I
Sbjct: 241 FAFKLKDEAYEGKIKVDAAIYSTLISSLIKAGRSNEVSMILEEMSEKGCKPDTVTYNVLI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG C END ESA+RV+DEMVEKG KPDV S+N I+G   +  K ++A  L EDMPRRGC 
Sbjct: 301 NGFCVENDSESANRVLDEMVEKGLKPDVISYNMILGVFFRIKKWEEATYLFEDMPRRGCS 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PD LSYRI+FDGLCE +Q +EA  ILDEM+FKGY PR + +   + +LC+   +E+L  V
Sbjct: 361 PDTLSYRIVFDGLCEGLQFEEAAVILDEMLFKGYKPRRDRLEGFLQKLCESGKLEILSKV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLI 523
           ++SL RG   + D+W+ ++  + KE  +S+S +L+
Sbjct: 421 ISSLHRGIAGDADVWSVMIPTMCKEPVISDSIDLL 455

BLAST of CSPI04G22400 vs. ExPASy Swiss-Prot
Match: Q3E9F0 (Pentatricopeptide repeat-containing protein At5g18475 OS=Arabidopsis thaliana OX=3702 GN=At5g18475 PE=2 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 1.7e-51
Identity = 142/469 (30.28%), Postives = 245/469 (52.24%), Query Frame = 0

Query: 66  DTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKL 125
           +T  KT K IS     SL++ ++DP   L +F   N +S +  K F ++  +Y +++  L
Sbjct: 46  ETNPKT-KFISHESAVSLMKRERDPQGVLDIF---NKASQQ--KGFNHNNATYSVLLDNL 105

Query: 126 GRAKMFDEMEEILQQLKQET-RFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF-RCK 185
            R K F  ++ IL Q+K ET RF   E +F N++  + R+ L D+  ++F  I    R K
Sbjct: 106 VRHKKFLAVDAILHQMKYETCRF--QESLFLNLMRHFSRSDLHDKVMEMFNLIQVIARVK 165

Query: 186 RTVKSVNSLLAALLKNRQLEKMTQVFVDIS-NYG-SPDACTFNILIHAACLCGDLDAVWG 245
            ++ ++++ L  L+ + ++    ++ +    N G  P+ C FNIL+   C  GD++  + 
Sbjct: 166 PSLNAISTCLNLLIDSGEVNLSRKLLLYAKHNLGLQPNTCIFNILVKHHCKNGDINFAFL 225

Query: 246 VFDEMQKRGVK-PNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 305
           V +EM++ G+  PN +T+ TL+  L  +S+ KEA+ L EDM+    I P+   +  +I G
Sbjct: 226 VVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISKEGISPDPVTFNVMING 285

Query: 306 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 365
           FC  GE+  A K+ + M  +        YS L++   K G+ +E      E+ + G K D
Sbjct: 286 FCRAGEVERAKKILDFMKKNGCNPNVYNYSALMNGFCKVGKIQEAKQTFDEVKKTGLKLD 345

Query: 366 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 425
           TV Y  ++N  C+  + + A +++ EM    C+ D  ++N I+  L  EG+ ++A+ +L+
Sbjct: 346 TVGYTTLMNCFCRNGETDEAMKLLGEMKASRCRADTLTYNVILRGLSSEGRSEEALQMLD 405

Query: 426 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 485
                G   +  SYRII + LC   +L++A   L  M  +G  P + + N+LV RLC+  
Sbjct: 406 QWGSEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSERGIWPHHATWNELVVRLCESG 465

Query: 486 NMEL-LWMVLNSLGRGNRMNMDMWARVVAFVYKE-NLSESSNLIDSLIS 528
             E+ + +++  L  G       W  VV  + KE  L     L+DSL+S
Sbjct: 466 YTEIGVRVLIGFLRIGLIPGPKSWGAVVESICKERKLVHVFELLDSLVS 506

BLAST of CSPI04G22400 vs. ExPASy Swiss-Prot
Match: Q9FFE3 (Pentatricopeptide repeat-containing protein At5g16420, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g16420 PE=2 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 6.5e-51
Identity = 125/429 (29.14%), Postives = 233/429 (54.31%), Query Frame = 0

Query: 68  QMKTQKP--------ISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYD 127
           Q  T+KP        + P RL S++  Q++  LALQ+FL    S       F ++  +Y 
Sbjct: 32  QYCTEKPPIKPWPQRLFPKRLVSMITQQQNIDLALQIFLYAGKSH----PGFTHNYDTYH 91

Query: 128 LIISKLGRAKMFDEMEEILQQLKQETRFAP---HEVIFCNVIAFYGRAHLPDRAFQVFER 187
            I+ KL RA+ FD +E ++  L+    + P    E +F +++  YG A   + + ++F R
Sbjct: 92  SILFKLSRARAFDPVESLMADLRNS--YPPIKCGENLFIDLLRNYGLAGRYESSMRIFLR 151

Query: 188 IPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDI-SNYG-SPDACTFNILIHAACLCG 247
           IP F  KR+V+S+N+LL  L++N++ + +  +F +   ++G +P+  T N+L+ A C   
Sbjct: 152 IPDFGVKRSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKN 211

Query: 248 DLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIY 307
           D+++ + V DE+   G+ PN+VT+ T++ G      ++ A R+ E+M+      P+A+ Y
Sbjct: 212 DIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWY-PDATTY 271

Query: 308 TTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGE 367
           T L+ G+C +G  + A  + ++M  + ++     Y  +I AL K  +  E  ++  EM E
Sbjct: 272 TVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSGEARNMFDEMLE 331

Query: 368 NGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDK 427
               PD+     +I+  C+++ ++ A  +  +M++  C PD    +T+I WLCKEG++ +
Sbjct: 332 RSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTE 391

Query: 428 AMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVD 484
           A  L ++   +G  P +L+Y  +  G+CE  +L EA  + D+M  +   P   + N L++
Sbjct: 392 ARKLFDEF-EKGSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIE 451

BLAST of CSPI04G22400 vs. ExPASy Swiss-Prot
Match: Q9LQQ1 (Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g07740 PE=2 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 4.6e-49
Identity = 140/413 (33.90%), Postives = 209/413 (50.61%), Query Frame = 0

Query: 67  TQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLG 126
           T   T+KP       + L+  +DP  AL LF             FR+   SY  +I KL 
Sbjct: 39  THKFTRKPWEEVPFLTDLKEIEDPEEALSLF------HQYQEMGFRHDYPSYSSLIYKLA 98

Query: 127 RAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTV 186
           +++ FD +++IL +L +       E +F  +I  YG+A   D+A  VF +I SF C RT+
Sbjct: 99  KSRNFDAVDQIL-RLVRYRNVRCRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTI 158

Query: 187 KSVNSLLAALLKNRQLEKMTQVFVDISNYG-SPDACTFNILIHAACLCGDLDAVWGVFDE 246
           +S+N+L+  L+ N +LEK    F    +    P++ +FNILI       D +A   VFDE
Sbjct: 159 QSLNTLINVLVDNGELEKAKSFFDGAKDMRLRPNSVSFNILIKGFLDKCDWEAACKVFDE 218

Query: 247 MQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVG 306
           M +  V+P+VVT+ +LI  L  N  + +A  L EDM+K   I+PNA  +  L+KG C  G
Sbjct: 219 MLEMEVQPSVVTYNSLIGFLCRNDDMGKAKSLLEDMIK-KRIRPNAVTFGLLMKGLCCKG 278

Query: 307 ELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYN 366
           E N A KL  +M     K     Y  L+S L K GR +E   +L EM +   KPD V YN
Sbjct: 279 EYNEAKKLMFDMEYRGCKPGLVNYGILMSDLGKRGRIDEAKLLLGEMKKRRIKPDVVIYN 338

Query: 367 AIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDM-PR 426
            ++N  C E  +  A+RV+ EM  KGCKP+  ++  +I   C+    D  +++L  M   
Sbjct: 339 ILVNHLCTECRVPEAYRVLTEMQMKGCKPNAATYRMMIDGFCRIEDFDSGLNVLNAMLAS 398

Query: 427 RGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLC 478
           R CP    ++  +  GL +   L  A  +L+ M  K     + +   L+  LC
Sbjct: 399 RHCPTPA-TFVCMVAGLIKGGNLDHACFVLEVMGKKNLSFGSGAWQNLLSDLC 442

BLAST of CSPI04G22400 vs. ExPASy Swiss-Prot
Match: O49436 (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX=3702 GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 6.1e-49
Identity = 126/423 (29.79%), Postives = 194/423 (45.86%), Query Frame = 0

Query: 131 FDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFER-IPSFRCKRTVKSV 190
           FD +E++L +++ E R    E  F  V   YG+AHLPD+A  +F R +  FRCKR+VKS 
Sbjct: 93  FDSVEKLLSRIRLENRVI-IERSFIVVFRAYGKAHLPDKAVDLFHRMVDEFRCKRSVKSF 152

Query: 191 NSLL---------------------------------------AALLKNRQLEKMTQVFV 250
           NS+L                                        AL K R +++  +VF 
Sbjct: 153 NSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFR 212

Query: 251 DI------------------------------------SNYGSPDACTFNILIHAACLCG 310
            +                                    S   SP    +N+LI   C  G
Sbjct: 213 GMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKG 272

Query: 311 DLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIY 370
           DL  V  + D M  +G  PN VT+ TLI+GL L  KL +A+ L E MV    I PN   Y
Sbjct: 273 DLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCI-PNDVTY 332

Query: 371 TTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGE 430
            TLI G         A +L   M      L   +YS LIS LFK G+ EE   + R+M E
Sbjct: 333 GTLINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAE 392

Query: 431 NGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDK 478
            GCKP+ V Y+ +++G C+E     A  +++ M+  GC P+ +++++++    K G  ++
Sbjct: 393 KGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEE 452

BLAST of CSPI04G22400 vs. ExPASy TrEMBL
Match: A0A0A0KZF4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G622790 PE=4 SV=1)

HSP 1 Score: 980.7 bits (2534), Expect = 2.4e-282
Identity = 493/527 (93.55%), Postives = 494/527 (93.74%), Query Frame = 0

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPH YPKSFCSQSVNGKSRNSTG
Sbjct: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHFYPKSFCSQSVNGKSRNSTG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDA  
Sbjct: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDA-- 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
                                         LKEALRLKEDMVKVYMIKPNASIYTTLIKG
Sbjct: 241 ------------------------------LKEALRLKEDMVKVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE
Sbjct: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWM+LNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS
Sbjct: 481 NMELLWMILNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 495

BLAST of CSPI04G22400 vs. ExPASy TrEMBL
Match: A0A1S3BT71 (putative pentatricopeptide repeat-containing protein At1g53330 OS=Cucumis melo OX=3656 GN=LOC103493464 PE=4 SV=1)

HSP 1 Score: 971.5 bits (2510), Expect = 1.4e-279
Identity = 481/527 (91.27%), Postives = 499/527 (94.69%), Query Frame = 0

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSF+LP  LIVSFGC NG SLPF LS PFT L PH YPKSFCS S+NGKSRNS+G
Sbjct: 1   MGMKKTSFELP--LIVSFGCKNGGSLPFLLSHPFTFLFPHFYPKSFCSHSMNGKSRNSSG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETN TQMKTQKPISPFRLSSLLRL+KDP LALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNGTQMKTQKPISPFRLSSLLRLEKDPKLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRA LPDRAFQVFERIP+F
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRARLPDRAFQVFERIPTF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLL ALLK+R+  KM +V V I N+GSPDACTFNILIHAACLCGDLDAVW
Sbjct: 181 RCKRTVKSVNSLLDALLKSREFGKMMEVLVGIGNHGSPDACTFNILIHAACLCGDLDAVW 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
           GVFDEM+KRGV+PNVVTFGTLIYGLSLNSKLKEALRLKEDMV+VYMIKPNASIYTTLIKG
Sbjct: 241 GVFDEMRKRGVQPNVVTFGTLIYGLSLNSKLKEALRLKEDMVRVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTS VKL S +YSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSKVKLDSKIYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           T TYNA+INGHCKENDLESA+RV+DEMVEKGCKPDV SFNTIIG LCKEGKLDKAMDLLE
Sbjct: 361 TATYNAMINGHCKENDLESANRVVDEMVEKGCKPDVISFNTIIGGLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRI+FDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIVFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWMVLNSLGRGNRMNMD WARVVAF+YKENL ESSNLIDSLIS
Sbjct: 481 NMELLWMVLNSLGRGNRMNMDTWARVVAFLYKENLLESSNLIDSLIS 525

BLAST of CSPI04G22400 vs. ExPASy TrEMBL
Match: A0A5A7VBA5 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G004190 PE=4 SV=1)

HSP 1 Score: 896.3 bits (2315), Expect = 5.9e-257
Identity = 442/477 (92.66%), Postives = 459/477 (96.23%), Query Frame = 0

Query: 51  VNGKSRNSTGHRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKP 110
           +NGKSRNS+GHRETN TQMKTQKPISPFRLSSLLRL+KDP LALQLFLNPNPSSSEPPKP
Sbjct: 1   MNGKSRNSSGHRETNGTQMKTQKPISPFRLSSLLRLEKDPKLALQLFLNPNPSSSEPPKP 60

Query: 111 FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRA 170
           FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRA LPDRA
Sbjct: 61  FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRARLPDRA 120

Query: 171 FQVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAA 230
           FQVFERIP+FRCKRTVKSVNSLL ALLK+R+  KM +V V I N+GSPDACTFNILIHAA
Sbjct: 121 FQVFERIPTFRCKRTVKSVNSLLDALLKSREFGKMMEVLVGIGNHGSPDACTFNILIHAA 180

Query: 231 CLCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPN 290
           CLCGDLDAVWGVFDEM+KRGV+PNVVTFGTLIYGLSLNSKLKEALRLKEDMV+VYMIKPN
Sbjct: 181 CLCGDLDAVWGVFDEMRKRGVQPNVVTFGTLIYGLSLNSKLKEALRLKEDMVRVYMIKPN 240

Query: 291 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILR 350
           ASIYTTLIKGFCGVGELNFAFKLKEEMVTS VKL S +YSTLISALFKHGRKEEVSDILR
Sbjct: 241 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSKVKLDSKIYSTLISALFKHGRKEEVSDILR 300

Query: 351 EMGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEG 410
           EMGENGCKPDT TYNA+INGHCKENDLESA+RV+DEMVEKGCKPDV SFNTIIG LCKEG
Sbjct: 301 EMGENGCKPDTATYNAMINGHCKENDLESANRVVDEMVEKGCKPDVISFNTIIGGLCKEG 360

Query: 411 KLDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 470
           KLDKAMDLLEDMPRRGCPPDVLSYRI+FDGLCEMMQLKEATSILDEMIFKGYVPRNESIN
Sbjct: 361 KLDKAMDLLEDMPRRGCPPDVLSYRIVFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 420

Query: 471 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           KLVDRLCQECNMELLWMVLNSLGRGNRMNMD WARVVAF+YKENL ESSNLIDSLIS
Sbjct: 421 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDTWARVVAFLYKENLLESSNLIDSLIS 477

BLAST of CSPI04G22400 vs. ExPASy TrEMBL
Match: A0A6J1FLC2 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g53330 OS=Cucurbita moschata OX=3662 GN=LOC111446799 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 2.1e-225
Identity = 393/477 (82.39%), Postives = 423/477 (88.68%), Query Frame = 0

Query: 51  VNGKSRNSTGHRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKP 110
           +N KSR+ +G R+ N  QMKT K ISPFRL SLLRLQKDP LA QLFLNPNPSS++PPKP
Sbjct: 1   MNNKSRHISGQRKXNSIQMKTHKLISPFRLCSLLRLQKDPKLAFQLFLNPNPSSTKPPKP 60

Query: 111 FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRA 170
           FRYSLLSYDLII+KLGRAKMFDEMEEIL QLKQETRFAPHE+IFCNVI+FYGRA L DRA
Sbjct: 61  FRYSLLSYDLIITKLGRAKMFDEMEEILHQLKQETRFAPHEIIFCNVISFYGRARLLDRA 120

Query: 171 FQVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAA 230
           FQVF++IPSF CKRTVKS NSLL ALLK R+ EKM +VFV I NYGSPDACTFNILIHAA
Sbjct: 121 FQVFDKIPSFGCKRTVKSANSLLDALLKCREFEKMAEVFVGIGNYGSPDACTFNILIHAA 180

Query: 231 CLCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPN 290
           CLCGDLD  W VFDEM KRGVKPNVVTFGTLIYGLSLNSKL EALRLKEDMV+VY I PN
Sbjct: 181 CLCGDLDDAWEVFDEMPKRGVKPNVVTFGTLIYGLSLNSKLDEALRLKEDMVRVYRILPN 240

Query: 291 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILR 350
           ASIY TLIKGFCG G+LN AFKLKEEMVT+ VKL SA+YST ISALFKHGRKEEV  IL 
Sbjct: 241 ASIYATLIKGFCGSGDLNLAFKLKEEMVTNKVKLDSAIYSTFISALFKHGRKEEVPGILA 300

Query: 351 EMGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEG 410
           EMGENGCKPDTVTYNA+ING CKENDLESA+RV+DEMV+KGCKPDV SFNTI+G LCKEG
Sbjct: 301 EMGENGCKPDTVTYNAMINGQCKENDLESAYRVLDEMVKKGCKPDVISFNTIMGRLCKEG 360

Query: 411 KLDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 470
           K DKAMDLL DMPRRGC PDV+SYR +FDGLCEMMQL+EATSILDEMIFKGYVPR+ESIN
Sbjct: 361 KWDKAMDLLADMPRRGCSPDVVSYRTLFDGLCEMMQLQEATSILDEMIFKGYVPRSESIN 420

Query: 471 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           KLVDRLCQE NMELLWMV+NSL RGN MN+D+WARVVAFV KE  SE+S L D LIS
Sbjct: 421 KLVDRLCQESNMELLWMVVNSLARGNVMNVDIWARVVAFVCKEKASEASKLCDLLIS 477

BLAST of CSPI04G22400 vs. ExPASy TrEMBL
Match: A0A6J1J467 (putative pentatricopeptide repeat-containing protein At1g53330 OS=Cucurbita maxima OX=3661 GN=LOC111481084 PE=4 SV=1)

HSP 1 Score: 773.1 bits (1995), Expect = 7.6e-220
Identity = 379/459 (82.57%), Postives = 412/459 (89.76%), Query Frame = 0

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           MKT K ISPFRL SLLRLQKDP LA QLFLNPNP+S++PPKPFRYSLLSYDLII+KLGRA
Sbjct: 1   MKTHKLISPFRLCSLLRLQKDPKLAFQLFLNPNPNSTKPPKPFRYSLLSYDLIITKLGRA 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           KMFDEMEEIL QLKQETRF+PHE+IFCNVI+FYGRA L DRAFQVF++IPS+ CKRTVKS
Sbjct: 61  KMFDEMEEILHQLKQETRFSPHEIIFCNVISFYGRARLLDRAFQVFDKIPSYGCKRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
            NSLL ALLK R+ E+M +VFV I NYGSPDACTFNILIHAACLCGDLD  W VFDEM K
Sbjct: 121 ANSLLDALLKCREFERMAEVFVGIGNYGSPDACTFNILIHAACLCGDLDDAWEVFDEMPK 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           RGVKPNVVTFGTLIYGLSLNSKL EALRLKEDMV+VY I PNASIY TLIKGFCG+G+LN
Sbjct: 181 RGVKPNVVTFGTLIYGLSLNSKLDEALRLKEDMVRVYRILPNASIYATLIKGFCGIGDLN 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
            AFKLKEEMVT+ VKL SA+YST ISALFKHGRKEEV +IL EMGENGCKPDTVTYN +I
Sbjct: 241 LAFKLKEEMVTNKVKLDSAIYSTFISALFKHGRKEEVPEILAEMGENGCKPDTVTYNTMI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG CKENDLESA+RV+DEMV+KGCKPDV SFNTI+  LCKEGK DKAMDLL DMPRRGC 
Sbjct: 301 NGQCKENDLESAYRVLDEMVKKGCKPDVISFNTIMERLCKEGKWDKAMDLLADMPRRGCS 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PDV+SYR +FDGLCEMMQL+EATSILDEMIFKGYVPR+ESINKL+DRLCQE NMELLWMV
Sbjct: 361 PDVVSYRTLFDGLCEMMQLQEATSILDEMIFKGYVPRSESINKLIDRLCQESNMELLWMV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
            NSLGRGN MN+D+WARVVAFV KE  SE+S L+D LIS
Sbjct: 421 ANSLGRGNVMNVDIWARVVAFVCKEKPSEASKLVDLLIS 459

BLAST of CSPI04G22400 vs. NCBI nr
Match: XP_004146077.1 (putative pentatricopeptide repeat-containing protein At1g53330 [Cucumis sativus] >KAE8649815.1 hypothetical protein Csa_012179 [Cucumis sativus])

HSP 1 Score: 1063.9 bits (2750), Expect = 4.4e-307
Identity = 525/527 (99.62%), Postives = 526/527 (99.81%), Query Frame = 0

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPH YPKSFCSQSVNGKSRNSTG
Sbjct: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHFYPKSFCSQSVNGKSRNSTG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW
Sbjct: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
           GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG
Sbjct: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE
Sbjct: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWM+LNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS
Sbjct: 481 NMELLWMILNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 527

BLAST of CSPI04G22400 vs. NCBI nr
Match: XP_008452430.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Cucumis melo])

HSP 1 Score: 971.5 bits (2510), Expect = 3.0e-279
Identity = 481/527 (91.27%), Postives = 499/527 (94.69%), Query Frame = 0

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSF+LP  LIVSFGC NG SLPF LS PFT L PH YPKSFCS S+NGKSRNS+G
Sbjct: 1   MGMKKTSFELP--LIVSFGCKNGGSLPFLLSHPFTFLFPHFYPKSFCSHSMNGKSRNSSG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETN TQMKTQKPISPFRLSSLLRL+KDP LALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNGTQMKTQKPISPFRLSSLLRLEKDPKLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRA LPDRAFQVFERIP+F
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRARLPDRAFQVFERIPTF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLL ALLK+R+  KM +V V I N+GSPDACTFNILIHAACLCGDLDAVW
Sbjct: 181 RCKRTVKSVNSLLDALLKSREFGKMMEVLVGIGNHGSPDACTFNILIHAACLCGDLDAVW 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
           GVFDEM+KRGV+PNVVTFGTLIYGLSLNSKLKEALRLKEDMV+VYMIKPNASIYTTLIKG
Sbjct: 241 GVFDEMRKRGVQPNVVTFGTLIYGLSLNSKLKEALRLKEDMVRVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTS VKL S +YSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSKVKLDSKIYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           T TYNA+INGHCKENDLESA+RV+DEMVEKGCKPDV SFNTIIG LCKEGKLDKAMDLLE
Sbjct: 361 TATYNAMINGHCKENDLESANRVVDEMVEKGCKPDVISFNTIIGGLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRI+FDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIVFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWMVLNSLGRGNRMNMD WARVVAF+YKENL ESSNLIDSLIS
Sbjct: 481 NMELLWMVLNSLGRGNRMNMDTWARVVAFLYKENLLESSNLIDSLIS 525

BLAST of CSPI04G22400 vs. NCBI nr
Match: KAA0064307.1 (putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK20279.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 896.3 bits (2315), Expect = 1.2e-256
Identity = 442/477 (92.66%), Postives = 459/477 (96.23%), Query Frame = 0

Query: 51  VNGKSRNSTGHRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKP 110
           +NGKSRNS+GHRETN TQMKTQKPISPFRLSSLLRL+KDP LALQLFLNPNPSSSEPPKP
Sbjct: 1   MNGKSRNSSGHRETNGTQMKTQKPISPFRLSSLLRLEKDPKLALQLFLNPNPSSSEPPKP 60

Query: 111 FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRA 170
           FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRA LPDRA
Sbjct: 61  FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRARLPDRA 120

Query: 171 FQVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAA 230
           FQVFERIP+FRCKRTVKSVNSLL ALLK+R+  KM +V V I N+GSPDACTFNILIHAA
Sbjct: 121 FQVFERIPTFRCKRTVKSVNSLLDALLKSREFGKMMEVLVGIGNHGSPDACTFNILIHAA 180

Query: 231 CLCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPN 290
           CLCGDLDAVWGVFDEM+KRGV+PNVVTFGTLIYGLSLNSKLKEALRLKEDMV+VYMIKPN
Sbjct: 181 CLCGDLDAVWGVFDEMRKRGVQPNVVTFGTLIYGLSLNSKLKEALRLKEDMVRVYMIKPN 240

Query: 291 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILR 350
           ASIYTTLIKGFCGVGELNFAFKLKEEMVTS VKL S +YSTLISALFKHGRKEEVSDILR
Sbjct: 241 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSKVKLDSKIYSTLISALFKHGRKEEVSDILR 300

Query: 351 EMGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEG 410
           EMGENGCKPDT TYNA+INGHCKENDLESA+RV+DEMVEKGCKPDV SFNTIIG LCKEG
Sbjct: 301 EMGENGCKPDTATYNAMINGHCKENDLESANRVVDEMVEKGCKPDVISFNTIIGGLCKEG 360

Query: 411 KLDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 470
           KLDKAMDLLEDMPRRGCPPDVLSYRI+FDGLCEMMQLKEATSILDEMIFKGYVPRNESIN
Sbjct: 361 KLDKAMDLLEDMPRRGCPPDVLSYRIVFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 420

Query: 471 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           KLVDRLCQECNMELLWMVLNSLGRGNRMNMD WARVVAF+YKENL ESSNLIDSLIS
Sbjct: 421 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDTWARVVAFLYKENLLESSNLIDSLIS 477

BLAST of CSPI04G22400 vs. NCBI nr
Match: XP_038898311.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g53330 [Benincasa hispida])

HSP 1 Score: 833.2 bits (2151), Expect = 1.3e-237
Identity = 411/477 (86.16%), Postives = 437/477 (91.61%), Query Frame = 0

Query: 51  VNGKSRNSTGHRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKP 110
           +N KSR+ TG  +TN   MKT KPISPFRLSSLLRLQKDP LA QLFLNPNPSS +PPKP
Sbjct: 1   MNDKSRHCTGQXKTNCIHMKTHKPISPFRLSSLLRLQKDPKLAFQLFLNPNPSSPKPPKP 60

Query: 111 FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRA 170
           FRYSLLSYDLIISKLGRAKMFDEMEEIL QLKQETRFAPHEVIFCNVI+FYGRA LPDRA
Sbjct: 61  FRYSLLSYDLIISKLGRAKMFDEMEEILNQLKQETRFAPHEVIFCNVISFYGRARLPDRA 120

Query: 171 FQVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAA 230
           FQVF++IPSFRCKRTVKS NSLL ALLK R+ EKM +VFV I NYGSPDACTFNILI+AA
Sbjct: 121 FQVFDKIPSFRCKRTVKSANSLLDALLKCREFEKMAEVFVGIVNYGSPDACTFNILINAA 180

Query: 231 CLCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPN 290
           CL GDLDAVW VFDEMQKRGVKPNVVTFGT+IYGLSLNSKLKEALRLKEDMV+VYMIKPN
Sbjct: 181 CLWGDLDAVWYVFDEMQKRGVKPNVVTFGTMIYGLSLNSKLKEALRLKEDMVRVYMIKPN 240

Query: 291 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILR 350
           ASIY TLIKG CGVGELN AFKLKEEMVT+ VKL SA+YSTLISALFKHGRKEE SDIL 
Sbjct: 241 ASIYATLIKGLCGVGELNLAFKLKEEMVTNKVKLDSAIYSTLISALFKHGRKEEASDILG 300

Query: 351 EMGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEG 410
           EMGEN CKPDTVTYNA+INGHCK+NDLESA R++DEM++KGCKPDV SFN IIG LCKEG
Sbjct: 301 EMGENRCKPDTVTYNAMINGHCKDNDLESAQRILDEMIDKGCKPDVISFNIIIGGLCKEG 360

Query: 411 KLDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 470
           K DKAMDLLEDMPRRGCPPDV+SYR +FDGLCEMMQLKEATSILDEMIFKGYVP +ESIN
Sbjct: 361 KWDKAMDLLEDMPRRGCPPDVVSYRTLFDGLCEMMQLKEATSILDEMIFKGYVPHSESIN 420

Query: 471 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           KLVDRLCQECNMELLWMV+NSLGRGNRMN+D WAR+VAFV KE LSESSNLIDSLIS
Sbjct: 421 KLVDRLCQECNMELLWMVINSLGRGNRMNVDTWARIVAFVCKEKLSESSNLIDSLIS 477

BLAST of CSPI04G22400 vs. NCBI nr
Match: XP_022941521.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g53330 [Cucurbita moschata])

HSP 1 Score: 791.6 bits (2043), Expect = 4.2e-225
Identity = 393/477 (82.39%), Postives = 423/477 (88.68%), Query Frame = 0

Query: 51  VNGKSRNSTGHRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKP 110
           +N KSR+ +G R+ N  QMKT K ISPFRL SLLRLQKDP LA QLFLNPNPSS++PPKP
Sbjct: 1   MNNKSRHISGQRKXNSIQMKTHKLISPFRLCSLLRLQKDPKLAFQLFLNPNPSSTKPPKP 60

Query: 111 FRYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRA 170
           FRYSLLSYDLII+KLGRAKMFDEMEEIL QLKQETRFAPHE+IFCNVI+FYGRA L DRA
Sbjct: 61  FRYSLLSYDLIITKLGRAKMFDEMEEILHQLKQETRFAPHEIIFCNVISFYGRARLLDRA 120

Query: 171 FQVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAA 230
           FQVF++IPSF CKRTVKS NSLL ALLK R+ EKM +VFV I NYGSPDACTFNILIHAA
Sbjct: 121 FQVFDKIPSFGCKRTVKSANSLLDALLKCREFEKMAEVFVGIGNYGSPDACTFNILIHAA 180

Query: 231 CLCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPN 290
           CLCGDLD  W VFDEM KRGVKPNVVTFGTLIYGLSLNSKL EALRLKEDMV+VY I PN
Sbjct: 181 CLCGDLDDAWEVFDEMPKRGVKPNVVTFGTLIYGLSLNSKLDEALRLKEDMVRVYRILPN 240

Query: 291 ASIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILR 350
           ASIY TLIKGFCG G+LN AFKLKEEMVT+ VKL SA+YST ISALFKHGRKEEV  IL 
Sbjct: 241 ASIYATLIKGFCGSGDLNLAFKLKEEMVTNKVKLDSAIYSTFISALFKHGRKEEVPGILA 300

Query: 351 EMGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEG 410
           EMGENGCKPDTVTYNA+ING CKENDLESA+RV+DEMV+KGCKPDV SFNTI+G LCKEG
Sbjct: 301 EMGENGCKPDTVTYNAMINGQCKENDLESAYRVLDEMVKKGCKPDVISFNTIMGRLCKEG 360

Query: 411 KLDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESIN 470
           K DKAMDLL DMPRRGC PDV+SYR +FDGLCEMMQL+EATSILDEMIFKGYVPR+ESIN
Sbjct: 361 KWDKAMDLLADMPRRGCSPDVVSYRTLFDGLCEMMQLQEATSILDEMIFKGYVPRSESIN 420

Query: 471 KLVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           KLVDRLCQE NMELLWMV+NSL RGN MN+D+WARVVAFV KE  SE+S L D LIS
Sbjct: 421 KLVDRLCQESNMELLWMVVNSLARGNVMNVDIWARVVAFVCKEKASEASKLCDLLIS 477

BLAST of CSPI04G22400 vs. TAIR 10
Match: AT1G53330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 513.1 bits (1320), Expect = 2.7e-145
Identity = 250/455 (54.95%), Postives = 331/455 (72.75%), Query Frame = 0

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           M   K +S FRL+SLLR + DP+ A++LF NP+P S+ P +PFRYSLL YD+II+KLG +
Sbjct: 1   MSAVKSVSSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGS 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           KMFDE++++L  LK +TR  P E+IFCNVI F+GR  LP RA  +F+ +P +RC+RTVKS
Sbjct: 61  KMFDELDQVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           +NSLL+ALLK  +LEKM +    I  +G PDACT+NILIH     G  D    +FDEM K
Sbjct: 121 LNSLLSALLKCGELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKLFDEMVK 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           + VKP  VTFGTLI+GL  +S++KEAL++K DM+KVY ++P   IY +LIK  C +GEL+
Sbjct: 181 KKVKPTGVTFGTLIHGLCKDSRVKEALKMKHDMLKVYGVRPTVHIYASLIKALCQIGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
           FAFKLK+E     +K+ +A+YSTLIS+L K GR  EVS IL EM E GCKPDTVTYN +I
Sbjct: 241 FAFKLKDEAYEGKIKVDAAIYSTLISSLIKAGRSNEVSMILEEMSEKGCKPDTVTYNVLI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG C END ESA+RV+DEMVEKG KPDV S+N I+G   +  K ++A  L EDMPRRGC 
Sbjct: 301 NGFCVENDSESANRVLDEMVEKGLKPDVISYNMILGVFFRIKKWEEATYLFEDMPRRGCS 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PD LSYRI+FDGLCE +Q +EA  ILDEM+FKGY PR + +   + +LC+   +E+L  V
Sbjct: 361 PDTLSYRIVFDGLCEGLQFEEAAVILDEMLFKGYKPRRDRLEGFLQKLCESGKLEILSKV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLI 523
           ++SL RG   + D+W+ ++  + KE  +S+S +L+
Sbjct: 421 ISSLHRGIAGDADVWSVMIPTMCKEPVISDSIDLL 455

BLAST of CSPI04G22400 vs. TAIR 10
Match: AT5G18475.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 205.3 bits (521), Expect = 1.2e-52
Identity = 142/469 (30.28%), Postives = 245/469 (52.24%), Query Frame = 0

Query: 66  DTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKL 125
           +T  KT K IS     SL++ ++DP   L +F   N +S +  K F ++  +Y +++  L
Sbjct: 46  ETNPKT-KFISHESAVSLMKRERDPQGVLDIF---NKASQQ--KGFNHNNATYSVLLDNL 105

Query: 126 GRAKMFDEMEEILQQLKQET-RFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF-RCK 185
            R K F  ++ IL Q+K ET RF   E +F N++  + R+ L D+  ++F  I    R K
Sbjct: 106 VRHKKFLAVDAILHQMKYETCRF--QESLFLNLMRHFSRSDLHDKVMEMFNLIQVIARVK 165

Query: 186 RTVKSVNSLLAALLKNRQLEKMTQVFVDIS-NYG-SPDACTFNILIHAACLCGDLDAVWG 245
            ++ ++++ L  L+ + ++    ++ +    N G  P+ C FNIL+   C  GD++  + 
Sbjct: 166 PSLNAISTCLNLLIDSGEVNLSRKLLLYAKHNLGLQPNTCIFNILVKHHCKNGDINFAFL 225

Query: 246 VFDEMQKRGVK-PNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 305
           V +EM++ G+  PN +T+ TL+  L  +S+ KEA+ L EDM+    I P+   +  +I G
Sbjct: 226 VVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISKEGISPDPVTFNVMING 285

Query: 306 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 365
           FC  GE+  A K+ + M  +        YS L++   K G+ +E      E+ + G K D
Sbjct: 286 FCRAGEVERAKKILDFMKKNGCNPNVYNYSALMNGFCKVGKIQEAKQTFDEVKKTGLKLD 345

Query: 366 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 425
           TV Y  ++N  C+  + + A +++ EM    C+ D  ++N I+  L  EG+ ++A+ +L+
Sbjct: 346 TVGYTTLMNCFCRNGETDEAMKLLGEMKASRCRADTLTYNVILRGLSSEGRSEEALQMLD 405

Query: 426 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 485
                G   +  SYRII + LC   +L++A   L  M  +G  P + + N+LV RLC+  
Sbjct: 406 QWGSEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSERGIWPHHATWNELVVRLCESG 465

Query: 486 NMEL-LWMVLNSLGRGNRMNMDMWARVVAFVYKE-NLSESSNLIDSLIS 528
             E+ + +++  L  G       W  VV  + KE  L     L+DSL+S
Sbjct: 466 YTEIGVRVLIGFLRIGLIPGPKSWGAVVESICKERKLVHVFELLDSLVS 506

BLAST of CSPI04G22400 vs. TAIR 10
Match: AT5G16420.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 203.4 bits (516), Expect = 4.6e-52
Identity = 125/429 (29.14%), Postives = 233/429 (54.31%), Query Frame = 0

Query: 68  QMKTQKP--------ISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYD 127
           Q  T+KP        + P RL S++  Q++  LALQ+FL    S       F ++  +Y 
Sbjct: 32  QYCTEKPPIKPWPQRLFPKRLVSMITQQQNIDLALQIFLYAGKSH----PGFTHNYDTYH 91

Query: 128 LIISKLGRAKMFDEMEEILQQLKQETRFAP---HEVIFCNVIAFYGRAHLPDRAFQVFER 187
            I+ KL RA+ FD +E ++  L+    + P    E +F +++  YG A   + + ++F R
Sbjct: 92  SILFKLSRARAFDPVESLMADLRNS--YPPIKCGENLFIDLLRNYGLAGRYESSMRIFLR 151

Query: 188 IPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDI-SNYG-SPDACTFNILIHAACLCG 247
           IP F  KR+V+S+N+LL  L++N++ + +  +F +   ++G +P+  T N+L+ A C   
Sbjct: 152 IPDFGVKRSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKN 211

Query: 248 DLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIY 307
           D+++ + V DE+   G+ PN+VT+ T++ G      ++ A R+ E+M+      P+A+ Y
Sbjct: 212 DIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWY-PDATTY 271

Query: 308 TTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGE 367
           T L+ G+C +G  + A  + ++M  + ++     Y  +I AL K  +  E  ++  EM E
Sbjct: 272 TVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSGEARNMFDEMLE 331

Query: 368 NGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDK 427
               PD+     +I+  C+++ ++ A  +  +M++  C PD    +T+I WLCKEG++ +
Sbjct: 332 RSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTE 391

Query: 428 AMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVD 484
           A  L ++   +G  P +L+Y  +  G+CE  +L EA  + D+M  +   P   + N L++
Sbjct: 392 ARKLFDEF-EKGSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIE 451

BLAST of CSPI04G22400 vs. TAIR 10
Match: AT1G07740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 197.2 bits (500), Expect = 3.3e-50
Identity = 140/413 (33.90%), Postives = 209/413 (50.61%), Query Frame = 0

Query: 67  TQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLG 126
           T   T+KP       + L+  +DP  AL LF             FR+   SY  +I KL 
Sbjct: 39  THKFTRKPWEEVPFLTDLKEIEDPEEALSLF------HQYQEMGFRHDYPSYSSLIYKLA 98

Query: 127 RAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTV 186
           +++ FD +++IL +L +       E +F  +I  YG+A   D+A  VF +I SF C RT+
Sbjct: 99  KSRNFDAVDQIL-RLVRYRNVRCRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTI 158

Query: 187 KSVNSLLAALLKNRQLEKMTQVFVDISNYG-SPDACTFNILIHAACLCGDLDAVWGVFDE 246
           +S+N+L+  L+ N +LEK    F    +    P++ +FNILI       D +A   VFDE
Sbjct: 159 QSLNTLINVLVDNGELEKAKSFFDGAKDMRLRPNSVSFNILIKGFLDKCDWEAACKVFDE 218

Query: 247 MQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVG 306
           M +  V+P+VVT+ +LI  L  N  + +A  L EDM+K   I+PNA  +  L+KG C  G
Sbjct: 219 MLEMEVQPSVVTYNSLIGFLCRNDDMGKAKSLLEDMIK-KRIRPNAVTFGLLMKGLCCKG 278

Query: 307 ELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYN 366
           E N A KL  +M     K     Y  L+S L K GR +E   +L EM +   KPD V YN
Sbjct: 279 EYNEAKKLMFDMEYRGCKPGLVNYGILMSDLGKRGRIDEAKLLLGEMKKRRIKPDVVIYN 338

Query: 367 AIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDM-PR 426
            ++N  C E  +  A+RV+ EM  KGCKP+  ++  +I   C+    D  +++L  M   
Sbjct: 339 ILVNHLCTECRVPEAYRVLTEMQMKGCKPNAATYRMMIDGFCRIEDFDSGLNVLNAMLAS 398

Query: 427 RGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLC 478
           R CP    ++  +  GL +   L  A  +L+ M  K     + +   L+  LC
Sbjct: 399 RHCPTPA-TFVCMVAGLIKGGNLDHACFVLEVMGKKNLSFGSGAWQNLLSDLC 442

BLAST of CSPI04G22400 vs. TAIR 10
Match: AT4G20090.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 196.8 bits (499), Expect = 4.3e-50
Identity = 126/423 (29.79%), Postives = 194/423 (45.86%), Query Frame = 0

Query: 131 FDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFER-IPSFRCKRTVKSV 190
           FD +E++L +++ E R    E  F  V   YG+AHLPD+A  +F R +  FRCKR+VKS 
Sbjct: 93  FDSVEKLLSRIRLENRVI-IERSFIVVFRAYGKAHLPDKAVDLFHRMVDEFRCKRSVKSF 152

Query: 191 NSLL---------------------------------------AALLKNRQLEKMTQVFV 250
           NS+L                                        AL K R +++  +VF 
Sbjct: 153 NSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFR 212

Query: 251 DI------------------------------------SNYGSPDACTFNILIHAACLCG 310
            +                                    S   SP    +N+LI   C  G
Sbjct: 213 GMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKG 272

Query: 311 DLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIY 370
           DL  V  + D M  +G  PN VT+ TLI+GL L  KL +A+ L E MV    I PN   Y
Sbjct: 273 DLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCI-PNDVTY 332

Query: 371 TTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGE 430
            TLI G         A +L   M      L   +YS LIS LFK G+ EE   + R+M E
Sbjct: 333 GTLINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAE 392

Query: 431 NGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDK 478
            GCKP+ V Y+ +++G C+E     A  +++ M+  GC P+ +++++++    K G  ++
Sbjct: 393 KGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEE 452

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9MAG83.8e-14454.95Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis th... [more]
Q3E9F01.7e-5130.28Pentatricopeptide repeat-containing protein At5g18475 OS=Arabidopsis thaliana OX... [more]
Q9FFE36.5e-5129.14Pentatricopeptide repeat-containing protein At5g16420, mitochondrial OS=Arabidop... [more]
Q9LQQ14.6e-4933.90Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidop... [more]
O494366.1e-4929.79Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0KZF42.4e-28293.55Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G622790 PE=4 SV=1[more]
A0A1S3BT711.4e-27991.27putative pentatricopeptide repeat-containing protein At1g53330 OS=Cucumis melo O... [more]
A0A5A7VBA55.9e-25792.66Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa... [more]
A0A6J1FLC22.1e-22582.39LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g53... [more]
A0A6J1J4677.6e-22082.57putative pentatricopeptide repeat-containing protein At1g53330 OS=Cucurbita maxi... [more]
Match NameE-valueIdentityDescription
XP_004146077.14.4e-30799.62putative pentatricopeptide repeat-containing protein At1g53330 [Cucumis sativus]... [more]
XP_008452430.13.0e-27991.27PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Cucum... [more]
KAA0064307.11.2e-25692.66putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] ... [more]
XP_038898311.11.3e-23786.16LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g53... [more]
XP_022941521.14.2e-22582.39LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g53... [more]
Match NameE-valueIdentityDescription
AT1G53330.12.7e-14554.95Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G18475.11.2e-5230.28Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G16420.14.6e-5229.14Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G07740.13.3e-5033.90Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G20090.14.3e-5029.79Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 397..431
e-value: 1.2E-10
score: 38.9
coord: 221..255
e-value: 1.7E-9
score: 35.3
coord: 328..361
e-value: 4.5E-8
score: 30.8
coord: 433..464
e-value: 5.7E-4
score: 17.9
coord: 293..323
e-value: 2.3E-4
score: 19.1
coord: 362..396
e-value: 2.1E-12
score: 44.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 117..143
e-value: 0.64
score: 10.4
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 287..318
e-value: 3.4E-8
score: 33.1
coord: 355..388
e-value: 4.0E-14
score: 52.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 218..265
e-value: 1.3E-14
score: 54.1
coord: 394..442
e-value: 5.9E-19
score: 68.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 430..464
score: 10.435215
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 325..359
score: 11.060009
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 360..394
score: 14.370322
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 290..324
score: 9.930995
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 395..429
score: 13.646876
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 219..253
score: 12.693243
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 238..350
e-value: 6.9E-32
score: 113.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 351..521
e-value: 3.6E-40
score: 140.2
coord: 86..237
e-value: 2.2E-20
score: 75.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 50..72
NoneNo IPR availablePANTHERPTHR47933:SF24OS05G0207200 PROTEINcoord: 69..525
NoneNo IPR availablePANTHERPTHR47933PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 1, MITOCHONDRIALcoord: 69..525
NoneNo IPR availablePROSITEPS51257PROKAR_LIPOPROTEINcoord: 1..20
score: 6.0

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G22400.1CSPI04G22400.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0048364 root development
biological_process GO:0048367 shoot system development
molecular_function GO:0005515 protein binding