CSPI03G23130 (gene) Wild cucumber (PI 183967)

Name: CSPI03G23130
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr3 : 19948178 .. 19952273 (+)
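
As a quick check on the Location field, the length of the region the gene occupies on Chr3 can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF convention; this page does not state it explicitly). `span_length` is an illustrative helper name, not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field above.
gene_bp = span_length(19948178, 19952273)
print(gene_bp)
```

Under that assumption the gene spans 4096 bp, introns included.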
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
GAACCAGCCAAAATCGGAAAGTTGCCCTGGCCGATAAACCCCGAAAACCCAGCTGAGAGCGGTGCAGCTCAATTCGCCGGAGACCGGGAAGTTGCGGACCTCTTGGAATCTGATTCTCCAATGCGTTCCAAACTAAGCTCACTAGTCCGCTCCGCCATCATATCTTCAAAATCTTCACAGAATGCCCAAGATGCCGCTCTTCAAAACTACGTTTCCACTATTGATCCTTTATCTCCCAGCACCTCTCTCTCTAACGCCATCAACTCCCCGACCTCCAAGAAGCTTCCTCAGAACCCCAATTCTGACGTTCAATTTCCTGCTCTCATTCTAGAAGAGTCTTCGGATTCTGGTACGTGTATTGCATTTTATTACTTTGGTTATGTTTACATCGTGTTTTTGTCATTTTGGTTAATGTGTCTATAGTTAATCTTCAATCCAGCTTAATTTTGCCGTAGGCTTCGTGGTTTAGTGTTTTGTTGAAGCTGAAAAGAAGTTTTATTTTAATTATATTTCATTTGCATTGAAGGGCATGTGAATGACATTTGATTTGTCTTGTTTTACTTATCCCAAAAGTTCATGTTTTAATGCAGGGGATCCTACGAAGCATCTAGCCAAGGCCATATCTTCCGTTCTGTGCGGTACGTGTCATTGAATGTTTGTTGTAGGAATTCAATCTCAATTTGTTTTAGATTTGTTATAACTCATTAGTTACAATTTCCTTCTTTAAGTTGTTAATAACCTAATAAGCATATGGTAAATATTGAAGGTCAACGGCTAAAATATCTAATGAGTTTCTTTGGCTACTAAACGTAGGGTCAGGCAATTGTCCAATCCTCGAAGGTTTGGTCCAGCTCACGTATTAAAAAAAACATGTAGTGCCTTAGAAATTTTAGAGTGTATTCCTCTTGATGTATCTTTCAGCATTAGTCTCTTTTCATTTTACTAGTTTTCTTTAAAAAAAATAGCCACCTTTTGGATCTTACGTTTCAGATTTTTATGTTGTCTATTTCATCATACTGAGAATGAAGGGATGTGAGTAATCGAATTAATTTCCTGTCCAGTTATACAAGCTGATTAAATAAATTCTTATGTAGTAATACGTTCTATAAATTATAATAATTTTGATGATCAAACTGTAAACTAAAAATTTGGTTTCTTACATGGGCAGGTTTCACCATCCTAAACAGGTTTTAAAGTTGCATTTACTATTTATAATTATTTGATTTGTAATTTTCATAGACATTATTTCATATGTTATCGTCTTGTTTTGCCTCCAATTATGAATCTATCTTAATGCAGTTCTCAGTTTTTCTGTTTTCTCCAAGTGAACTGCTTTAGTAACGTATTATTTCCTTTTGCAGAAGGCTCATCTGTTATGTCTCCTGAAGCACAAGGAAATTGTGTTGAAGAATCTTTGGAAAAATTATTAGATATACCATGGTTTTCCATCAAAACAAATCATAGCTTAACATTACATCGCAAAGAAATATCCCGGGAAAGAAAGCATAACTGGGTATTAAAAAATACCCAGAGCGATAGATTTAGACGATTAGTTAGAAGTTGCGCAAACAGGCTGGGGAGTGATGTTACTCTAGAAGTCTTTGGTAAATTGGGACGAGAAACTGGTGTGAAAGAATACAATGCACTAGTAGGCATATGTTTAGAGAAGGCTAAGGCAAGTAAGGACGTGGAGGTTGTATTGGAACAGATTGGGAAGGTTTATCAGCTATTTAAACTAATGAAAGAACAGGGTTTCTCATTAGAAGATGAAACTTATGGCCCAGTGCTTGCGTGTTTAATTGACATGGACATGATGGAAGAATTTAATTTTTTCTGTGAGGCTATAAAAGATGGAAATCCAGGCTCAATATCAAGGTTGGGTTACTATAAGATGTTGTTCTATATTAAAATCAATGATGAAGAAAAAGTTCAGGAGCTTTGTTACCGTGCTACAGTTGATGATGGAGTGGACAAGTTCAGTTTACAAGGTACGGCCAACT
GTTTATGGATACACAAACCTCAGCTGCACCTTTTGTCTTCTTGCTCTGAAATAAACAATGGGGAAGTTTTGCTTATCGGGAAACTCTATTTTTGTTTAAGATAATTGTTAAATTGGCGCCAAACCTAGAAGTTTAAGCTGATGGGTTTGGGTAAAGTTAATTATAATTTATACAAATACTTTTAACCATAAGAAATAACCTGTTGATAAAAGCATGAATAACAAGGGATCTTTTTTTGGCTGAGAAATTAGTAGAAAGCTCTCAATCAGCTATAGTAAATAACAACAAGAAATAACTAAATGTCTTGGTTGCAGCCTAAAAGAAATGCATATAGTCAGGTGGCGAGTCTTTGATGTAACATGTACTATGTTTGAGACTTAAGGCTGACGAGTATTATGTATTTTAAGCTTCTCTGATTTGTAGGTCCCGTTTGGTAAGCATTTCATTTTTGTTTTTGGTTTTTGAAAGTTAAGCCTATTTCCTTTCATTTTATTACCATGACATTAATACGTAAGAGTTGCATTTATAGTTAGACTCTAAAAACAAAAATGAGGTTTTTAAAAACTATTTTATTATTATTATTTTTAATTTTCAAACTTCGATTTTGTTTTTAAAAAAAGGGTAAAAAGTAGATATATCAAAACAACAAATTTAGAGGTGGAAGAGGTGTTTATAGGCTTAATTTGCAAAAATCAAAACAAAAAATGAAATGGTTACCAAAAGGGACCTTAGTATTTATTGTTTTTACGACAAGTTATTTTTTCAAGTTGTTTTTGCAGCAAGTTAAATTTTCAAGTTTGGTTTAGTTACTTCTTAGTTCATTATTTTAAGTTATTCTTTTAAATCTCTCAAAGGCTGGTTTGTAGCCTTGAAATAGGGATCCCCAGGCTGTAAGAGGCATTAGTTCTAAGCGAGTTCTTCAAGAATTCTCTAGATTTTGTTCAAGAAATTAGTTCTACACCAGACTCTCTCCCACCTCTTTCTATACCCTGGAATATTTTAGCTTTCCTTTTGAGAATAGGGTCAGCAATGATATTGGACAATTCTTGGAAATTATATTGAGGTTTAGAGTTCAATTTTGAAGTTGTGATAATTTTAAGGCTTTAGGGACAATCAAACTTGAAGTTGAATTCTTATTTAGCGTTTGCAATGAAAAATAACAAGCTATGTGAAAGAAATTTATAATAGAAGGAACCTATAATAAATAACACCTTTGGCATCCTCCTTGGGCAGCCCATAAAAATTCTTTGAAGTGATTAGAACAGTGTTTTCTCTGTTTAACTATTTCTGGAAAAAGGTTGTAGCTATTGTTTTAAAGATAATGCAGAGTTTAGCACATTTATCAACTGGGTACATACCAAAGGGAATATGACCGAGTTTCCCACTTTCCCACCCTTCGTTACCTGTACGGATAAAATGATAATCTCTCTAGATGCTTTTACATATTATATAGTAGCACACACTACTTCTATCCACTCCTTCATCAGGTCAGTTAGTCTTACCAGTCATGATGGATATGATCTTCTACAGGAGTTCAGTAGGTTGTCATGGTTGTTTCATATGATTTCAGTACTTTCTTACACATTAAATAAGAACACACCCACACCCACACCTACACCCACGCCCACACACAACTTCTATCTAATCCACTCATTCATCAGATCGGTTAGTCTTATTGGTCATGATGGATATGACCTTCTACATGAGTTCAGTAGGTTATGGATGTCCCATATGGCATAAGCAATTATAAAATTGAAATTTACATTCACAAACTTTTTTTATCAGCTATGTTCATGTTTTTGGTGAGGCATATATCTTACGATGAAATGGATACATTTCAGAGATAAACCTGTTTGGTTAAAAATTTGGAACCTTTGGTTACAGAAAATTATTTGTTGGCACTCTGTGGAAGTGAGCAGAAGAAGGAACTTTTACAGATGCTGGAAGTTATAGACATCACAAAACTTTCGACAACTGTAGTTGCACCTAACATCTTCAAA
TCCTTAGGGAGGCTATCACTTCACACTTTTGCAGAGAAGTCACTTTTGGCGTTTAAAACTTCTGGTATGTTGATGATTGCCTTAAAATTTTATTGA

mRNA sequence

ATGCGTTCCAAACTAAGCTCACTAGTCCGCTCCGCCATCATATCTTCAAAATCTTCACAGAATGCCCAAGATGCCGCTCTTCAAAACTACGTTTCCACTATTGATCCTTTATCTCCCAGCACCTCTCTCTCTAACGCCATCAACTCCCCGACCTCCAAGAAGCTTCCTCAGAACCCCAATTCTGACGTTCAATTTCCTGCTCTCATTCTAGAAGAGTCTTCGGATTCTGGGGATCCTACGAAGCATCTAGCCAAGGCCATATCTTCCGTTCTGTGCGAAGGCTCATCTGTTATGTCTCCTGAAGCACAAGGAAATTGTGTTGAAGAATCTTTGGAAAAATTATTAGATATACCATGGTTTTCCATCAAAACAAATCATAGCTTAACATTACATCGCAAAGAAATATCCCGGGAAAGAAAGCATAACTGGGTATTAAAAAATACCCAGAGCGATAGATTTAGACGATTAGTTAGAAGTTGCGCAAACAGGCTGGGGAGTGATGTTACTCTAGAAGTCTTTGGTAAATTGGGACGAGAAACTGGTGTGAAAGAATACAATGCACTAGTAGGCATATGTTTAGAGAAGGCTAAGGCAAGTAAGGACGTGGAGGTTGTATTGGAACAGATTGGGAAGGTTTATCAGCTATTTAAACTAATGAAAGAACAGGGTTTCTCATTAGAAGATGAAACTTATGGCCCAGTGCTTGCGTGTTTAATTGACATGGACATGATGGAAGAATTTAATTTTTTCTGTGAGGCTATAAAAGATGGAAATCCAGGCTCAATATCAAGGTTGGGTTACTATAAGATGTTGTTCTATATTAAAATCAATGATGAAGAAAAAGTTCAGGAGCTTTGTTACCGTGCTACAGTTGATGATGGAGTGGACAAGTTCAGTTTACAAGAAAATTATTTGTTGGCACTCTGTGGAAGTGAGCAGAAGAAGGAACTTTTACAGATGCTGGAAGTTATAGACATCACAAAACTTTCGACAACTGTAGTTGCACCTAACATCTTCAAATCCTTAGGGAGGCTATCACTTCACACTTTTGCAGAGAAGTCACTTTTGGCGTTTAAAACTTCTGGTATGTTGATGATTGCCTTAAAATTTTATTGA

Coding sequence (CDS)

ATGCGTTCCAAACTAAGCTCACTAGTCCGCTCCGCCATCATATCTTCAAAATCTTCACAGAATGCCCAAGATGCCGCTCTTCAAAACTACGTTTCCACTATTGATCCTTTATCTCCCAGCACCTCTCTCTCTAACGCCATCAACTCCCCGACCTCCAAGAAGCTTCCTCAGAACCCCAATTCTGACGTTCAATTTCCTGCTCTCATTCTAGAAGAGTCTTCGGATTCTGGGGATCCTACGAAGCATCTAGCCAAGGCCATATCTTCCGTTCTGTGCGAAGGCTCATCTGTTATGTCTCCTGAAGCACAAGGAAATTGTGTTGAAGAATCTTTGGAAAAATTATTAGATATACCATGGTTTTCCATCAAAACAAATCATAGCTTAACATTACATCGCAAAGAAATATCCCGGGAAAGAAAGCATAACTGGGTATTAAAAAATACCCAGAGCGATAGATTTAGACGATTAGTTAGAAGTTGCGCAAACAGGCTGGGGAGTGATGTTACTCTAGAAGTCTTTGGTAAATTGGGACGAGAAACTGGTGTGAAAGAATACAATGCACTAGTAGGCATATGTTTAGAGAAGGCTAAGGCAAGTAAGGACGTGGAGGTTGTATTGGAACAGATTGGGAAGGTTTATCAGCTATTTAAACTAATGAAAGAACAGGGTTTCTCATTAGAAGATGAAACTTATGGCCCAGTGCTTGCGTGTTTAATTGACATGGACATGATGGAAGAATTTAATTTTTTCTGTGAGGCTATAAAAGATGGAAATCCAGGCTCAATATCAAGGTTGGGTTACTATAAGATGTTGTTCTATATTAAAATCAATGATGAAGAAAAAGTTCAGGAGCTTTGTTACCGTGCTACAGTTGATGATGGAGTGGACAAGTTCAGTTTACAAGAAAATTATTTGTTGGCACTCTGTGGAAGTGAGCAGAAGAAGGAACTTTTACAGATGCTGGAAGTTATAGACATCACAAAACTTTCGACAACTGTAGTTGCACCTAACATCTTCAAATCCTTAGGGAGGCTATCACTTCACACTTTTGCAGAGAAGTCACTTTTGGCGTTTAAAACTTCTGGTATGTTGATGATTGCCTTAAAATTTTATTGA
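
The protein used as the BLAST query below is the translation of this coding sequence. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative, and only the first nine codons of the CDS are used as input to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS above.
print(translate("ATGCGTTCCAAACTAAGCTCACTAGTC"))  # MRSKLSSLV
```

The result matches the first residues of the query sequence shown in the alignments (`MRSKLSSLV...`). Passing the full CDS string to `translate` should give the complete query protein under the same assumption.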
BLAST of CSPI03G23130 vs. Swiss-Prot
Match: PP304_ARATH (Pentatricopeptide repeat-containing protein At4g04790, mitochondrial OS=Arabidopsis thaliana GN=At4g04790 PE=2 SV=2)

HSP 1 Score: 212.2 bits (539), Expect = 9.5e-54
Identity = 111/257 (43.19%), Positives = 166/257 (64.59%), Query Frame = 1

Query: 108 EESLEK--LLDIPWFSIKTNHSLTLHRKEISRERKHNWVLK-NTQSDRFRRLVRSCANRL 167
           + SLEK   L IP F+ K  + ++L  KE+SRERK   V K N  S RF ++ R  A +L
Sbjct: 74  KSSLEKNLFLKIPSFTTKIPYDISLRTKELSRERKERRVYKQNGLSRRFAKIFRDSAQKL 133

Query: 168 GSDVTLEVFGKLGRETGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGF 227
           G++     F ++ +E  V EYNA++G+ LE A+ S D++  L  I K ++L K M+++GF
Sbjct: 134 GTEAMFGAFDRVAKEMSVTEYNAMIGVYLEHAEKSNDLDYALGHIEKAFELLKSMRDRGF 193

Query: 228 SLEDETYGPVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQE 287
            +E+  YGP+L  LI MDM++EF+ F + I++ +PGS+ RLGYY+ML +I + D EK++E
Sbjct: 194 LIEERVYGPLLGYLIGMDMVDEFHSFKDVIREASPGSVERLGYYEMLLWIHLGDGEKIEE 253

Query: 288 LCYRATVDDGVDKFSLQENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGR 347
           LC     D+G     LQENYLLALC  +QK  L ++LE++DITK+ ++ +  NIF+ LGR
Sbjct: 254 LCSTIDGDNGESLSVLQENYLLALCKKDQKYHLERLLEIVDITKVRSSDLLANIFEYLGR 313

Query: 348 LSLHTFAEKSLLAFKTS 362
            SL + A + L   + S
Sbjct: 314 FSLDSVASRFLWELRES 330

BLAST of CSPI03G23130 vs. Swiss-Prot
Match: PP335_ARATH (Pentatricopeptide repeat-containing protein At4g21880, mitochondrial OS=Arabidopsis thaliana GN=At4g21880 PE=3 SV=2)

HSP 1 Score: 174.5 bits (441), Expect = 2.2e-42
Identity = 89/223 (39.91%), Positives = 136/223 (60.99%), Query Frame = 1

Query: 133 KEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRETGVKEYNALVGIC 192
           K+ SR R      +N QS+   +L + C  +LG++   EV  K+G+E G KEYNA+  +C
Sbjct: 122 KQASRGRMLTENYQNKQSEIMEKLAKGCVRKLGTETMFEVLTKMGKEAGEKEYNAMTKLC 181

Query: 193 LEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLIDMDMMEEFNFFCE 252
           +++A+ S D E  L+QIGK  +  K M++ GFS+ +  YGP    L+DM+M+ EF    +
Sbjct: 182 IQRARRSNDAEYALDQIGKAIEHLKEMRQLGFSIGEGAYGPFFKYLVDMEMVAEFQILKD 241

Query: 253 AIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSLQENYLLALCGSE 312
            IK+  P S  RL YY+ML +I++NDEEK+ +LC +   D G+    LQE YL+ALC  +
Sbjct: 242 FIKEACPESCGRLVYYEMLLWIQVNDEEKIHKLCNKVD-DSGLSLSILQEYYLVALCEKD 301

Query: 313 QKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSL 356
            K+   ++LE++DIT +S+     +IF  LG+  L + A K L
Sbjct: 302 SKENFQKLLEIVDITTVSSPDALKSIFGYLGKSLLESVAMKLL 343

BLAST of CSPI03G23130 vs. TrEMBL
Match: A0A0A0L854_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G415080 PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 1.5e-202
Identity = 370/371 (99.73%), Positives = 370/371 (99.73%), Query Frame = 1

Query: 1   MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN 60
           MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN
Sbjct: 1   MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN 60

Query: 61  SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF 120
           SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF
Sbjct: 61  SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF 120

Query: 121 SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET 180
           SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET
Sbjct: 121 SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET 180

Query: 181 GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID 240
           GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID
Sbjct: 181 GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID 240

Query: 241 MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL 300
           MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL
Sbjct: 241 MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL 300

Query: 301 QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 360
           QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT
Sbjct: 301 QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 360

Query: 361 SGMLMIALKFY 372
           SG LMIALKFY
Sbjct: 361 SGTLMIALKFY 371

BLAST of CSPI03G23130 vs. TrEMBL
Match: A0A061F5D3_THECC (Tetratricopeptide repeat-like superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_031088 PE=4 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 6.4e-81
Identity = 169/346 (48.84%), Positives = 234/346 (67.63%), Query Frame = 1

Query: 15  SSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPNSDVQFPALILEESS 74
           SS SS +A+D +L+ +VS++D  S S+  S +  SP + K P    +D     L+ + +S
Sbjct: 29  SSSSSLDARDKSLKEFVSSLDTSSLSSPASFSKRSPIAIKKP----TDGSLFNLLRDSAS 88

Query: 75  DSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWFSIKTNHSLTLHRKE 134
            S D    L   +SS+LC+GS V S +  G        + L IPW S+ +N+  +L +KE
Sbjct: 89  LSEDSMNELTHEVSSLLCDGS-VNSSKDSG--------RALTIPWLSM-SNNKTSLMQKE 148

Query: 135 ISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRETGVKEYNALVGICLE 194
           +SRERK  WV K +Q  RF RL++ C ++LG+  T+EVF KLGRETG+KEYNAL+ +CLE
Sbjct: 149 VSRERKQKWVFKTSQVIRFNRLIKMCGDKLGTKATMEVFDKLGRETGLKEYNALIELCLE 208

Query: 195 KAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLIDMDMMEEFNFFCEAI 254
            A+ S D +V LE I + ++ FK M+E+GF +E+ET GP L   ID  M+EEF FFC  I
Sbjct: 209 NARTSDDEDVALEHISEAFRTFKKMRERGFQVEEETIGPFLMYFIDRGMVEEFFFFCGPI 268

Query: 255 KDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSLQENYLLALCGSEQK 314
           KDGNP  + RLGYY+ML +I +N+E+K+QELC      DG+D F L+ENYLLALC S +K
Sbjct: 269 KDGNPSLLPRLGYYEMLLWIGVNNEKKIQELCNYIAATDGIDDFELKENYLLALCESGRK 328

Query: 315 KELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 361
           ++L+Q+LE+IDI ++S+     NIFKSLGRLSL +FAEK LLAFK+
Sbjct: 329 EDLMQLLEIIDIKRISSVNKVANIFKSLGRLSLESFAEKFLLAFKS 360

BLAST of CSPI03G23130 vs. TrEMBL
Match: A0A061F6A5_THECC (Tetratricopeptide repeat-like superfamily protein, putative isoform 4 OS=Theobroma cacao GN=TCM_031088 PE=4 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 6.4e-81
Identity = 169/346 (48.84%), Positives = 234/346 (67.63%), Query Frame = 1

Query: 15  SSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPNSDVQFPALILEESS 74
           SS SS +A+D +L+ +VS++D  S S+  S +  SP + K P    +D     L+ + +S
Sbjct: 29  SSSSSLDARDKSLKEFVSSLDTSSLSSPASFSKRSPIAIKKP----TDGSLFNLLRDSAS 88

Query: 75  DSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWFSIKTNHSLTLHRKE 134
            S D    L   +SS+LC+GS V S +  G        + L IPW S+ +N+  +L +KE
Sbjct: 89  LSEDSMNELTHEVSSLLCDGS-VNSSKDSG--------RALTIPWLSM-SNNKTSLMQKE 148

Query: 135 ISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRETGVKEYNALVGICLE 194
           +SRERK  WV K +Q  RF RL++ C ++LG+  T+EVF KLGRETG+KEYNAL+ +CLE
Sbjct: 149 VSRERKQKWVFKTSQVIRFNRLIKMCGDKLGTKATMEVFDKLGRETGLKEYNALIELCLE 208

Query: 195 KAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLIDMDMMEEFNFFCEAI 254
            A+ S D +V LE I + ++ FK M+E+GF +E+ET GP L   ID  M+EEF FFC  I
Sbjct: 209 NARTSDDEDVALEHISEAFRTFKKMRERGFQVEEETIGPFLMYFIDRGMVEEFFFFCGPI 268

Query: 255 KDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSLQENYLLALCGSEQK 314
           KDGNP  + RLGYY+ML +I +N+E+K+QELC      DG+D F L+ENYLLALC S +K
Sbjct: 269 KDGNPSLLPRLGYYEMLLWIGVNNEKKIQELCNYIAATDGIDDFELKENYLLALCESGRK 328

Query: 315 KELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 361
           ++L+Q+LE+IDI ++S+     NIFKSLGRLSL +FAEK LLAFK+
Sbjct: 329 EDLMQLLEIIDIKRISSVNKVANIFKSLGRLSLESFAEKFLLAFKS 360

BLAST of CSPI03G23130 vs. TrEMBL
Match: A0A0D2Q9M0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G082600 PE=4 SV=1)

HSP 1 Score: 299.3 bits (765), Expect = 6.6e-78
Identity = 174/377 (46.15%), Positives = 243/377 (64.46%), Query Frame = 1

Query: 5   LSSLVRSAIISS---KSSQNAQDAALQNYVSTIDPLS-----PSTSLSNA---INSPTSK 64
           LSSL ++A+ ++    SS  + D  L+ +VS++D  S     PS S S+A   I SP   
Sbjct: 9   LSSLFKTAVKNATKESSSLPSGDKPLKQFVSSLDTSSAPSPSPSPSPSSASFRIRSPQPT 68

Query: 65  KLPQNPNSDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVE----- 124
           K P N  S     + +L+ +S S +        + S+LC G     PE+  +  E     
Sbjct: 69  KKPANDGS---LHSWLLQPASTSENSIGGFTDELHSILCTGH----PESSKDIEEMMDNG 128

Query: 125 ESLEKLLDIPWFSIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDV 184
            SL ++L+IPW S  +N++++L RKE+SRERK  WV K TQS RF RL++ C ++LG+  
Sbjct: 129 SSLGRVLNIPWLSNVSNNNISLRRKELSRERKQKWVFKKTQSGRFNRLIKMCGDKLGTKA 188

Query: 185 TLEVFGKLGRETGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLED 244
           T+EVF KLGR+TG+KEYNAL+ ICLEKA+ S D +  LE + + ++  K M+E+GF +E+
Sbjct: 189 TIEVFDKLGRDTGLKEYNALIAICLEKARTSNDEDDALEHMSEAFKTLKKMRERGFQVEE 248

Query: 245 ETYGPVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYR 304
            TYGP L   IDM M+EEF FFC  IK+GNP S++RLGYY+ML +I +N+EEK+QELC  
Sbjct: 249 GTYGPFLMYFIDMGMVEEFFFFCGPIKEGNPSSVTRLGYYEMLLWIGVNNEEKIQELCNC 308

Query: 305 ATVDDGVDKFSLQENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLH 364
               D  D F L+ENYLLALC S  +K+++Q+LEVIDIT++S+  V  NIF+SLGRLS  
Sbjct: 309 IVAADEEDDFKLKENYLLALCES-GRKDIMQLLEVIDITRISSVNVVANIFESLGRLSFD 368

Query: 365 TFAEKSLLAFKTSGMLM 366
           +FAEK L   K +   M
Sbjct: 369 SFAEKFLWTLKNNDYKM 377

BLAST of CSPI03G23130 vs. TrEMBL
Match: A0A0D2M8B9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G082600 PE=4 SV=1)

HSP 1 Score: 299.3 bits (765), Expect = 6.6e-78
Identity = 174/377 (46.15%), Positives = 243/377 (64.46%), Query Frame = 1

Query: 5   LSSLVRSAIISS---KSSQNAQDAALQNYVSTIDPLS-----PSTSLSNA---INSPTSK 64
           LSSL ++A+ ++    SS  + D  L+ +VS++D  S     PS S S+A   I SP   
Sbjct: 9   LSSLFKTAVKNATKESSSLPSGDKPLKQFVSSLDTSSAPSPSPSPSPSSASFRIRSPQPT 68

Query: 65  KLPQNPNSDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVE----- 124
           K P N  S     + +L+ +S S +        + S+LC G     PE+  +  E     
Sbjct: 69  KKPANDGS---LHSWLLQPASTSENSIGGFTDELHSILCTGH----PESSKDIEEMMDNG 128

Query: 125 ESLEKLLDIPWFSIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDV 184
            SL ++L+IPW S  +N++++L RKE+SRERK  WV K TQS RF RL++ C ++LG+  
Sbjct: 129 SSLGRVLNIPWLSNVSNNNISLRRKELSRERKQKWVFKKTQSGRFNRLIKMCGDKLGTKA 188

Query: 185 TLEVFGKLGRETGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLED 244
           T+EVF KLGR+TG+KEYNAL+ ICLEKA+ S D +  LE + + ++  K M+E+GF +E+
Sbjct: 189 TIEVFDKLGRDTGLKEYNALIAICLEKARTSNDEDDALEHMSEAFKTLKKMRERGFQVEE 248

Query: 245 ETYGPVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYR 304
            TYGP L   IDM M+EEF FFC  IK+GNP S++RLGYY+ML +I +N+EEK+QELC  
Sbjct: 249 GTYGPFLMYFIDMGMVEEFFFFCGPIKEGNPSSVTRLGYYEMLLWIGVNNEEKIQELCNC 308

Query: 305 ATVDDGVDKFSLQENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLH 364
               D  D F L+ENYLLALC S  +K+++Q+LEVIDIT++S+  V  NIF+SLGRLS  
Sbjct: 309 IVAADEEDDFKLKENYLLALCES-GRKDIMQLLEVIDITRISSVNVVANIFESLGRLSFD 368

Query: 365 TFAEKSLLAFKTSGMLM 366
           +FAEK L   K +   M
Sbjct: 369 SFAEKFLWTLKNNDYKM 377

BLAST of CSPI03G23130 vs. TAIR10
Match: AT4G04790.1 (AT4G04790.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 212.2 bits (539), Expect = 5.3e-55
Identity = 111/257 (43.19%), Positives = 166/257 (64.59%), Query Frame = 1

Query: 108 EESLEK--LLDIPWFSIKTNHSLTLHRKEISRERKHNWVLK-NTQSDRFRRLVRSCANRL 167
           + SLEK   L IP F+ K  + ++L  KE+SRERK   V K N  S RF ++ R  A +L
Sbjct: 74  KSSLEKNLFLKIPSFTTKIPYDISLRTKELSRERKERRVYKQNGLSRRFAKIFRDSAQKL 133

Query: 168 GSDVTLEVFGKLGRETGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGF 227
           G++     F ++ +E  V EYNA++G+ LE A+ S D++  L  I K ++L K M+++GF
Sbjct: 134 GTEAMFGAFDRVAKEMSVTEYNAMIGVYLEHAEKSNDLDYALGHIEKAFELLKSMRDRGF 193

Query: 228 SLEDETYGPVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQE 287
            +E+  YGP+L  LI MDM++EF+ F + I++ +PGS+ RLGYY+ML +I + D EK++E
Sbjct: 194 LIEERVYGPLLGYLIGMDMVDEFHSFKDVIREASPGSVERLGYYEMLLWIHLGDGEKIEE 253

Query: 288 LCYRATVDDGVDKFSLQENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGR 347
           LC     D+G     LQENYLLALC  +QK  L ++LE++DITK+ ++ +  NIF+ LGR
Sbjct: 254 LCSTIDGDNGESLSVLQENYLLALCKKDQKYHLERLLEIVDITKVRSSDLLANIFEYLGR 313

Query: 348 LSLHTFAEKSLLAFKTS 362
            SL + A + L   + S
Sbjct: 314 FSLDSVASRFLWELRES 330

BLAST of CSPI03G23130 vs. TAIR10
Match: AT4G21880.1 (AT4G21880.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 174.5 bits (441), Expect = 1.2e-43
Identity = 89/223 (39.91%), Positives = 136/223 (60.99%), Query Frame = 1

Query: 133 KEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRETGVKEYNALVGIC 192
           K+ SR R      +N QS+   +L + C  +LG++   EV  K+G+E G KEYNA+  +C
Sbjct: 122 KQASRGRMLTENYQNKQSEIMEKLAKGCVRKLGTETMFEVLTKMGKEAGEKEYNAMTKLC 181

Query: 193 LEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLIDMDMMEEFNFFCE 252
           +++A+ S D E  L+QIGK  +  K M++ GFS+ +  YGP    L+DM+M+ EF    +
Sbjct: 182 IQRARRSNDAEYALDQIGKAIEHLKEMRQLGFSIGEGAYGPFFKYLVDMEMVAEFQILKD 241

Query: 253 AIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSLQENYLLALCGSE 312
            IK+  P S  RL YY+ML +I++NDEEK+ +LC +   D G+    LQE YL+ALC  +
Sbjct: 242 FIKEACPESCGRLVYYEMLLWIQVNDEEKIHKLCNKVD-DSGLSLSILQEYYLVALCEKD 301

Query: 313 QKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSL 356
            K+   ++LE++DIT +S+     +IF  LG+  L + A K L
Sbjct: 302 SKENFQKLLEIVDITTVSSPDALKSIFGYLGKSLLESVAMKLL 343

BLAST of CSPI03G23130 vs. NCBI nr
Match: gi|700202830|gb|KGN57963.1| (hypothetical protein Csa_3G415080 [Cucumis sativus])

HSP 1 Score: 713.4 bits (1840), Expect = 2.1e-202
Identity = 370/371 (99.73%), Positives = 370/371 (99.73%), Query Frame = 1

Query: 1   MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN 60
           MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN
Sbjct: 1   MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN 60

Query: 61  SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF 120
           SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF
Sbjct: 61  SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF 120

Query: 121 SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET 180
           SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET
Sbjct: 121 SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET 180

Query: 181 GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID 240
           GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID
Sbjct: 181 GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID 240

Query: 241 MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL 300
           MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL
Sbjct: 241 MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL 300

Query: 301 QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 360
           QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT
Sbjct: 301 QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 360

Query: 361 SGMLMIALKFY 372
           SG LMIALKFY
Sbjct: 361 SGTLMIALKFY 371

BLAST of CSPI03G23130 vs. NCBI nr
Match: gi|778681163|ref|XP_011651464.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial [Cucumis sativus])

HSP 1 Score: 699.9 bits (1805), Expect = 2.4e-198
Identity = 362/362 (100.00%), Positives = 362/362 (100.00%), Query Frame = 1

Query: 1   MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN 60
           MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN
Sbjct: 1   MRSKLSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAINSPTSKKLPQNPN 60

Query: 61  SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF 120
           SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF
Sbjct: 61  SDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPWF 120

Query: 121 SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET 180
           SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET
Sbjct: 121 SIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRET 180

Query: 181 GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID 240
           GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID
Sbjct: 181 GVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLID 240

Query: 241 MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL 300
           MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL
Sbjct: 241 MDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFSL 300

Query: 301 QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 360
           QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT
Sbjct: 301 QENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFKT 360

Query: 361 SG 363
           SG
Sbjct: 361 SG 362

BLAST of CSPI03G23130 vs. NCBI nr
Match: gi|659109137|ref|XP_008454564.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X1 [Cucumis melo])

HSP 1 Score: 613.6 bits (1581), Expect = 2.3e-172
Identity = 328/370 (88.65%), Positives = 338/370 (91.35%), Query Frame = 1

Query: 1   MRSK---LSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAI-----NSPTS 60
           MRSK   LSSL RSAI ++KSS+N QDAALQN VS+IDPLS STSL NAI     NSPT 
Sbjct: 1   MRSKSKQLSSLFRSAIKAAKSSKNPQDAALQNNVSSIDPLSLSTSLYNAIYKRSVNSPTF 60

Query: 61  KKLPQNPNSDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLE 120
           KKLPQNPNSDVQFPALILEESSDSGD  KHL +AISSVLCEGSSV SPEAQGNCVE+SLE
Sbjct: 61  KKLPQNPNSDVQFPALILEESSDSGDSVKHLTEAISSVLCEGSSVRSPEAQGNCVEKSLE 120

Query: 121 KLLDIPWFSIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEV 180
           KLLDIPWFSIKTNHSLTLH KEIS ERK  WVLKNTQSDRFRRLVRSCA+RLGSDVTLEV
Sbjct: 121 KLLDIPWFSIKTNHSLTLHHKEISWERKQKWVLKNTQSDRFRRLVRSCADRLGSDVTLEV 180

Query: 181 FGKLGRETGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYG 240
           FGKLGRETGVKEY+ALVGICLEKAKAS DVEVVL QIGKVYQ+FK MKEQGFSLED TYG
Sbjct: 181 FGKLGRETGVKEYDALVGICLEKAKASNDVEVVLGQIGKVYQIFKSMKEQGFSLEDGTYG 240

Query: 241 PVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVD 300
           PVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVD
Sbjct: 241 PVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVD 300

Query: 301 DGVDKFSLQENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAE 360
           DGVDKFSLQENYLLALC SEQ K+LLQMLEVIDITKLSTTVVAP IFK LGRLSLHTFAE
Sbjct: 301 DGVDKFSLQENYLLALCESEQTKQLLQMLEVIDITKLSTTVVAPIIFKCLGRLSLHTFAE 360

Query: 361 KSLLAFKTSG 363
           K LLAFKTSG
Sbjct: 361 KLLLAFKTSG 370

BLAST of CSPI03G23130 vs. NCBI nr
Match: gi|659109139|ref|XP_008454565.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 516.5 bits (1329), Expect = 3.8e-143
Identity = 274/311 (88.10%), Positives = 283/311 (91.00%), Query Frame = 1

Query: 1   MRSK---LSSLVRSAIISSKSSQNAQDAALQNYVSTIDPLSPSTSLSNAI-----NSPTS 60
           MRSK   LSSL RSAI ++KSS+N QDAALQN VS+IDPLS STSL NAI     NSPT 
Sbjct: 1   MRSKSKQLSSLFRSAIKAAKSSKNPQDAALQNNVSSIDPLSLSTSLYNAIYKRSVNSPTF 60

Query: 61  KKLPQNPNSDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLE 120
           KKLPQNPNSDVQFPALILEESSDSGD  KHL +AISSVLCEGSSV SPEAQGNCVE+SLE
Sbjct: 61  KKLPQNPNSDVQFPALILEESSDSGDSVKHLTEAISSVLCEGSSVRSPEAQGNCVEKSLE 120

Query: 121 KLLDIPWFSIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEV 180
           KLLDIPWFSIKTNHSLTLH KEIS ERK  WVLKNTQSDRFRRLVRSCA+RLGSDVTLEV
Sbjct: 121 KLLDIPWFSIKTNHSLTLHHKEISWERKQKWVLKNTQSDRFRRLVRSCADRLGSDVTLEV 180

Query: 181 FGKLGRETGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYG 240
           FGKLGRETGVKEY+ALVGICLEKAKAS DVEVVL QIGKVYQ+FK MKEQGFSLED TYG
Sbjct: 181 FGKLGRETGVKEYDALVGICLEKAKASNDVEVVLGQIGKVYQIFKSMKEQGFSLEDGTYG 240

Query: 241 PVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVD 300
           PVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVD
Sbjct: 241 PVLACLIDMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVD 300

Query: 301 DGVDKFSLQEN 304
           DGVDKFSLQ N
Sbjct: 301 DGVDKFSLQGN 311

BLAST of CSPI03G23130 vs. NCBI nr
Match: gi|731432614|ref|XP_010644336.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like [Vitis vinifera])

HSP 1 Score: 336.7 bits (862), Expect = 5.3e-89
Identity = 186/360 (51.67%), Positives = 248/360 (68.89%), Query Frame = 1

Query: 5   LSSLVRSAII--SSKSSQNAQDAALQNYVSTIDPLS---PSTSLSNAINSPTSKKLPQNP 64
           LSSL RSA+   S  SS  A DA L+++VS++D  S   P  S   +I+S T  +   +P
Sbjct: 9   LSSLFRSAVKARSKPSSSPADDATLKHFVSSLDTSSSEFPKISPKTSISSGTGSETTSHP 68

Query: 65  NSDVQFPALILEESSDSGDPTKHLAKAISSVLCEGSSVMSPEAQGNCVEESLEKLLDIPW 124
                             D TK L++ +SS+LC G    SP++Q    E+ LEK+LD+PW
Sbjct: 69  E-----------------DSTKQLSQTLSSILCGG----SPDSQLTNDEKPLEKVLDVPW 128

Query: 125 FSIKTNHSLTLHRKEISRERKHNWVLKNTQSDRFRRLVRSCANRLGSDVTLEVFGKLGRE 184
           F   ++++++L RKE+SRERK  WV KNTQ  R  RLV++CA +LG++ T++VFGKLGRE
Sbjct: 129 FPTLSHNNISLRRKEVSRERKQKWVFKNTQGGRLDRLVKTCAQKLGTEATIQVFGKLGRE 188

Query: 185 TGVKEYNALVGICLEKAKASKDVEVVLEQIGKVYQLFKLMKEQGFSLEDETYGPVLACLI 244
           TGVKEY AL+GIC+EKA+ S D E  LEQI K +QLF+ MKEQGF +E+ETYGP    LI
Sbjct: 189 TGVKEYKALLGICIEKARTSDDEEASLEQIYKAFQLFEEMKEQGFHIEEETYGPFFMYLI 248

Query: 245 DMDMMEEFNFFCEAIKDGNPGSISRLGYYKMLFYIKINDEEKVQELCYRATVDDGVDKFS 304
           DM M+EEF+FF   I D N  S+SRLGYY+ML ++++N+EEK+QELC     DDG DK +
Sbjct: 249 DMGMIEEFHFFHGVITDENSRSLSRLGYYEMLLWVRVNNEEKIQELCNGIAADDGADKPN 308

Query: 305 LQENYLLALCGSEQKKELLQMLEVIDITKLSTTVVAPNIFKSLGRLSLHTFAEKSLLAFK 360
           L ENY+LALC S +K+ELL++LE+IDITK+S+     +IFKSLGRLSL +F EK + AFK
Sbjct: 309 LTENYVLALCESGRKEELLKVLEIIDITKVSSVDYVASIFKSLGRLSLASFMEKFVSAFK 347

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value    Identity (%)    Description
PP304_ARATH    9.5e-54    43.19    Pentatricopeptide repeat-containing protein At4g04790, mitochondrial OS=Arabidop...
PP335_ARATH    2.2e-42    39.91    Pentatricopeptide repeat-containing protein At4g21880, mitochondrial OS=Arabidop...

TrEMBL
Match Name    E-value    Identity (%)    Description
A0A0A0L854_CUCSA    1.5e-202    99.73    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G415080 PE=4 SV=1
A0A061F5D3_THECC    6.4e-81    48.84    Tetratricopeptide repeat-like superfamily protein, putative isoform 2 OS=Theobro...
A0A061F6A5_THECC    6.4e-81    48.84    Tetratricopeptide repeat-like superfamily protein, putative isoform 4 OS=Theobro...
A0A0D2Q9M0_GOSRA    6.6e-78    46.15    Uncharacterized protein OS=Gossypium raimondii GN=B456_002G082600 PE=4 SV=1
A0A0D2M8B9_GOSRA    6.6e-78    46.15    Uncharacterized protein OS=Gossypium raimondii GN=B456_002G082600 PE=4 SV=1

TAIR10
Match Name    E-value    Identity (%)    Description
AT4G04790.1    5.3e-55    43.19    Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21880.1    1.2e-43    39.91    Pentatricopeptide repeat (PPR) superfamily protein

NCBI nr
Match Name    E-value    Identity (%)    Description
gi|700202830|gb|KGN57963.1|    2.1e-202    99.73    hypothetical protein Csa_3G415080 [Cucumis sativus]
gi|778681163|ref|XP_011651464.1|    2.4e-198    100.00    PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial ...
gi|659109137|ref|XP_008454564.1|    2.3e-172    88.65    PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-...
gi|659109139|ref|XP_008454565.1|    3.8e-143    88.10    PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-...
gi|731432614|ref|XP_010644336.1|    5.3e-89    51.67    PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category    Term Accession    Term Name
biological_process    GO:0008150    biological_process
cellular_component    GO:0005575    cellular_component
molecular_function    GO:0003674    molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
CSPI03G23130.1    CSPI03G23130.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED    coord: 296..325, score: 4.6E-35; coord: 151..260, score: 4.6
None    No IPR available    PANTHER    PTHR24015:SF623    SUBFAMILY NOT NAMED    coord: 151..260, score: 4.6E-35; coord: 296..325, score: 4.6