CSPI04G03820 (gene) Wild cucumber (PI 183967)

Name: CSPI04G03820
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Unplaced genomic scaffold supercont1.21, whole genome shotgun sequence
Location: Chr4 : 2371085 .. 2374100 (-)
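The location field packs the sequence name, the start and end coordinates, and the strand into one string. A minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper, not part of any database API:

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr4 : 2371085 .. 2374100 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr4 : 2371085 .. 2374100 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3016 bases on the minus strand of Chr4.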
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TAGAAAAAATGGTGGGTTTAGAAAATGCAGACAAGTCTTCTCTTCGAAACCCTAGAAATAAACGTAAACAGCGGCCTCCTAATAAGCCTAGGAACGCCAAGAAGAGAAAGCCCGATGTGAACCACAATCGCCATGCTCAAGAGACCACAAATGGTAGTGAAGCCATTGCATCAGAGCCACTTCCATTGGAGCCCTTCAGCTTTTTCCTGAATGAGTTCCAAACGGCTAATGATGTGCAGGTTTCTTCTCTTGAGTTGGATTCCATGAAAGGTGCTTTACTAATCTGTGTCATATTCTTATTCTTTTTCCATGGAATATAATGTAATATGTTTTGTAGTGTTTAATTGTATTCGGCATCCATTTTTTCTTTTCTGAATGCGTCAGAATTTATCTGGTTTTTGTTATGATTGAAATTTTTTGTAGATAGATGCATTCTGGGTCCTCCTGAAAGTTCAGTTCAGGATGATAAAAGTTTGGTGAAACATGTTAAGGAAGCTTTTGGATCGTCGTGGAAAGAAATTCTATGTAAGGGGGAACTCTTGGAAGGACGGACTGAACCTGGGAGTCCTGCAGTTCTTATAATCAGTACATCTGCACTTAGGTCGATTGAACTTTTAAAGTAAGATGGATAATTTTTTTCTGCTAGTATGTTTTTGAAATAAGTGCGTGCTTTTGTTTAACTCATTTATATTTATTAGGGGTTTTCGATCGATAACTCAAGAATGCCATGCTGTAAAGCTATTTTCGAAGCACATGAAGGTTGAGGAACAGGTAGGTGGAAGTTTCTTTCCATGATATTTTCTCTTTTAGATGATTCTGAGATAGGGGTAACATATGTTTTTGTTGTTTGATGATGTTTCAAGTTGTATGGTTTACCAATTCTTTCAATTGAATGCTTGTTGTATATCAACGCTAAACAGAGTGTGGATGATAATTGTTTATATTTTGCTATCTGGAACAAGTGGGAAGAGTTCTTGATATGTTTTAAGAGGCCTGAAGTAGTCATTCTTTTCTTTATGTTGAAAACCTCGTAGTTTCAAAATTTTGTTTGGGAGATGAAGATAATTCAAAAAACCATCTGAGGCAAAAACACTGCTGAATTTAATGGTTTGCACGTCAGTCTGATCCTGTATTTTCATTACCTTTGCTTTTAATTTTCCTGATTTCTGAAACTGATCTTATCTTATCTTTGTGATTCATTCCAGTTTTGATCTTCATGGTCACATGATTAATTTTGTGAGAAGTGTTTGTTATACACTGCTTCTCTTCATGAGCTTGGAAAGCTCAAGCAGTGATTTAGCTTCTTGATTCTTCAACACATAACAAGTAACATTATTCAGCATTAGCAATTGGTGATTTTTATGGCTCTATTCTTTTTTTGGTAACTGGGTGTCCTGGCCAATTAGTGGTTTCCATGTGCATCCATGCAACTGCTTTTCTTTATTTTGTTTGGGAAGTGATGACAAACTAGAAAAGTTTCTCAGATGCAGTTGCTATGCTGAACTTTTTATTTGCACGTCAGTCTGATCCCGTTTTCTCAATAACATGGTTGTCATTTCTGAGCTTATGAATGCTAGACTTGAATTTTTCTTGTTATTTTTAATGGTTTTTTAAAAAACAAGGATATAGAATTGGCATTTTTGCATTCCCCGTTGACCTCTTCAAACAATCATTAACATCCTTAACTTTTCCTTTAAATTTTTTGCGTTCCCTCTTCTTTTTGTTTTCTTACATCCATTTTGGACACAGATCTCTTATTTTTATGTGAAATCTTCAACTGAACTCCCAATATTATTTTCAGTTGGAAATTTTTATGGTGATCACAATTATGTTATCTTATTTTTAAAGTATGGGAAATGAAGACAAATTGAAAATTTCATCAGAGGTAGATACTATGCTGAACTCTATTGCACGTCAGTCTGATCCCTATAATTGTTTATTGGGTGTAATCTATGCGATTATTACATAAAGTATTTTTTGGGTCCAAGATGAAAGGGGGG
ATGGGGCAATTCCAGGTTTTGTTATTCTGTTGTGATTTAGATAATTCCGTCTAGTTCACATTTCTGCTGAATTCGTTCAAGATCCTTTTGAATTGTGCAAAATTGTTTCAAATGACATCTAATTTCAATTGTGTCTCTAACCATCTGTGTAATGCAACATTACTTGGTGGTTTTTCTTAAAACATCCAATTGTCGGTTTTGGTTTTCATATCTCTTGAACTGTTATTGTTATTCCCTTTATGATGGGTGCAGGTACAGCTGTTGAAGAACCGTGTTAACATTGCGAGTGGTACACCGAGCAGGTTAGGCTTTTTTTGTATTGGGTTTATCGTTCAGGAATTCTAGTTCCAGATTTTGTTTCTAATGGAGTACTGTTGAGCATCTATATTGATTATCTACATTCTTCCGGGATCTATCTCTTCTCATATACATACTCTTCTGTGTCTATTTTAGGATTAAAAAATTGATTGACATTGAGGCGTTGGGGCTGTCAAGACTAGCCGTCATTGTGCTGGACGTTCAACCAGATGTCAAGGGATATTCTTTATTTTCACTTCCACAAGTCAGGTTTGAAAACCTCCTTTATATAACTTGTCTCAACTTCCTCGATCTATGTTCAAATTAGAAGAGAATGTTTTTATATAAATAGCTGCTCATAAAATTCATATTGCAGAGATGAATTCTGGGATTTGTACAAGAGCTACTTACACCCGCGGATAGTCGAGGGTGAGCTGCGAATTTGCCTCTTTGGACCTTTACAACCTACTCGTAAACGAAGAAAAAAGGGAATTTAGTTATTGTTAATGCATCCTCAAAATTCATCAGTTCCCTCAAATTATCTTTCATCACTTGATCATTTTTCCCCAACTTCCGTTTCATTCCTATAATTCCGATTTGGTTTGTTGTACCAAAATTTTGCAATCTGAGCTGAGAGTCAATTTTGAAGTCAAGCTGTGTGAAATATTACATTGTGTAATAGATTTTGGAATTATGTATAAATTATTCACCTTTTCTAA

mRNA sequence

ATGGTGGGTTTAGAAAATGCAGACAAGTCTTCTCTTCGAAACCCTAGAAATAAACGTAAACAGCGGCCTCCTAATAAGCCTAGGAACGCCAAGAAGAGAAAGCCCGATGTGAACCACAATCGCCATGCTCAAGAGACCACAAATGGTAGTGAAGCCATTGCATCAGAGCCACTTCCATTGGAGCCCTTCAGCTTTTTCCTGAATGAGTTCCAAACGGCTAATGATGTGCAGGTTTCTTCTCTTGAGTTGGATTCCATGAAAGATAGATGCATTCTGGGTCCTCCTGAAAGTTCAGTTCAGGATGATAAAAGTTTGGTGAAACATGTTAAGGAAGCTTTTGGATCGTCGTGGAAAGAAATTCTATGTAAGGGGGAACTCTTGGAAGGACGGACTGAACCTGGGAGTCCTGCAGTTCTTATAATCAGTACATCTGCACTTAGGTCGATTGAACTTTTAAAGGGTTTTCGATCGATAACTCAAGAATGCCATGCTGTAAAGCTATTTTCGAAGCACATGAAGGTTGAGGAACAGGTACAGCTGTTGAAGAACCGTGTTAACATTGCGAGTGGTACACCGAGCAGGATTAAAAAATTGATTGACATTGAGGCGTTGGGGCTGTCAAGACTAGCCGTCATTGTGCTGGACGTTCAACCAGATGTCAAGGGATATTCTTTATTTTCACTTCCACAAGTCAGAGATGAATTCTGGGATTTGTACAAGAGCTACTTACACCCGCGGATAGTCGAGGGTGAGCTGCGAATTTGCCTCTTTGGACCTTTACAACCTACTCGTAAACGAAGAAAAAAGGGAATTTAG

Coding sequence (CDS)

ATGGTGGGTTTAGAAAATGCAGACAAGTCTTCTCTTCGAAACCCTAGAAATAAACGTAAACAGCGGCCTCCTAATAAGCCTAGGAACGCCAAGAAGAGAAAGCCCGATGTGAACCACAATCGCCATGCTCAAGAGACCACAAATGGTAGTGAAGCCATTGCATCAGAGCCACTTCCATTGGAGCCCTTCAGCTTTTTCCTGAATGAGTTCCAAACGGCTAATGATGTGCAGGTTTCTTCTCTTGAGTTGGATTCCATGAAAGATAGATGCATTCTGGGTCCTCCTGAAAGTTCAGTTCAGGATGATAAAAGTTTGGTGAAACATGTTAAGGAAGCTTTTGGATCGTCGTGGAAAGAAATTCTATGTAAGGGGGAACTCTTGGAAGGACGGACTGAACCTGGGAGTCCTGCAGTTCTTATAATCAGTACATCTGCACTTAGGTCGATTGAACTTTTAAAGGGTTTTCGATCGATAACTCAAGAATGCCATGCTGTAAAGCTATTTTCGAAGCACATGAAGGTTGAGGAACAGGTACAGCTGTTGAAGAACCGTGTTAACATTGCGAGTGGTACACCGAGCAGGATTAAAAAATTGATTGACATTGAGGCGTTGGGGCTGTCAAGACTAGCCGTCATTGTGCTGGACGTTCAACCAGATGTCAAGGGATATTCTTTATTTTCACTTCCACAAGTCAGAGATGAATTCTGGGATTTGTACAAGAGCTACTTACACCCGCGGATAGTCGAGGGTGAGCTGCGAATTTGCCTCTTTGGACCTTTACAACCTACTCGTAAACGAAGAAAAAAGGGAATTTAG
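The coding sequence can be translated to check it against the protein used as the BLAST query. A minimal sketch, assuming the standard genetic code applies; `translate` is an illustrative helper written for this note, and only the first and last codons of the CDS are used so the example stays short:

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First nine codons and last four codons of the CDS listed above.
print(translate("ATGGTGGGTTTAGAAAATGCAGACAAG"))
print(translate("AAGGGAATTTAG"))
```

The first call yields the start of the query protein (MVGLENADK), and the second ends in the stop codon, shown as `*`.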
BLAST of CSPI04G03820 vs. Swiss-Prot
Match: CMS1_RAT (Protein CMSS1 OS=Rattus norvegicus GN=Cmss1 PE=1 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 3.5e-13
Identity = 49/149 (32.89%), Positives = 85/149 (57.05%), Query Frame = 1

Query: 112 AFGSSWKEILCKG-ELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSK 171
           +  S  KEI  K  +L +   E  S  +LI+ +SA+R++EL++   +   +   +KLF+K
Sbjct: 128 SLSSYLKEICPKWVKLRKTHNEKKSVLMLILCSSAVRALELIRSLTAFKGDAKVMKLFAK 187

Query: 172 HMKVEEQVQLLKNR-VNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQ-PDVKGYSLFSL 231
           H+KV+EQV+LL+ R +++  GTP RIK+L+  + L L+ L  +V D    D K   +  +
Sbjct: 188 HIKVQEQVKLLEKRAIHLGVGTPGRIKELVKQDGLNLNPLKFLVFDWNWRDQKLRRMMDI 247

Query: 232 PQVRDEFWDLYKSYLHPRIVEGELRICLF 258
           P++R E ++L    +        L++ LF
Sbjct: 248 PEIRKEVFELLDMGVFSLCKSDSLKLGLF 276
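Each HSP header reports identical and positive (similar) positions as a count over the alignment length, followed by the percentage. A minimal sketch of how such a line could be parsed and the percentages rechecked; `parse_hsp_stats` is a hypothetical helper, and the pattern also accepts the "Postives" spelling that appears in some exports:

```python
import re

# Matches e.g. "Identity = 49/149 (32.89%)" or "Positives = 85/149 (57.05%)".
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return counts, alignment length, and reported vs. recomputed percentages."""
    stats = {}
    for label, hits, length, pct in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = {
            "count": int(hits),
            "length": int(length),
            "reported_pct": float(pct),
            "computed_pct": round(100 * int(hits) / int(length), 2),
        }
    return stats

line = "Identity = 49/149 (32.89%), Positives = 85/149 (57.05%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identity"]["computed_pct"], stats["positives"]["computed_pct"])
```

For this HSP the recomputed values agree with the reported ones: 49/149 is 32.89% and 85/149 is 57.05%.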

BLAST of CSPI04G03820 vs. Swiss-Prot
Match: CMS1_MOUSE (Protein CMSS1 OS=Mus musculus GN=Cmss1 PE=2 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 4.6e-13
Identity = 50/149 (33.56%), Positives = 85/149 (57.05%), Query Frame = 1

Query: 112 AFGSSWKEILCKG-ELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSK 171
           +  S  KEI  K  +L +   E  S  +LI+ +SA+R++EL++   +   +   +KLF+K
Sbjct: 128 SLSSYLKEICPKWVKLRKTHNEKKSVLMLILCSSAVRALELIRSLTAFKGDAKVMKLFAK 187

Query: 172 HMKVEEQVQLLKNRV-NIASGTPSRIKKLIDIEALGLSRLAVIVLDVQ-PDVKGYSLFSL 231
           H+KV+EQV+LL+ RV ++  GTP RIK+L+  + L L+ L  +V D    D K   +  +
Sbjct: 188 HIKVQEQVKLLEKRVIHLGVGTPGRIKELVKQDGLHLNPLKFLVFDWNWRDQKLRRMMDI 247

Query: 232 PQVRDEFWDLYKSYLHPRIVEGELRICLF 258
           P++R E ++L    +        L++ LF
Sbjct: 248 PEIRKEVFELLDMGVFSLCKSDSLKLGLF 276

BLAST of CSPI04G03820 vs. Swiss-Prot
Match: CMS1_DANRE (Protein CMSS1 OS=Danio rerio GN=cmss1 PE=2 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 5.9e-13
Identity = 70/242 (28.93%), Positives = 116/242 (47.93%), Query Frame = 1

Query: 20  KQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPLEPFSFFLNEFQTANDVQVS 79
           ++R   KP N   +    N  R  ++ T      +S+P+P  P               VS
Sbjct: 66  QERSEEKPDNESNK----NKKRRKKKKTITDVLTSSKPVPGSPVDL------------VS 125

Query: 80  SLELDSMKDRCILGPPESSVQDDKSL-VKHVKEAFGSSWKEILCK-GELLEGRTEPGSPA 139
            L+    + R ++   E ++QD   L    +  +  S  KE+  K  ++ +  T+  S  
Sbjct: 126 LLKTYHSQTRSVIEQEELTLQDSCFLSCNDLTHSLSSYLKEVCPKWAKMQKQHTQTSSVV 185

Query: 140 VLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQ-LLKNRVNIASGTPSRIK 199
           +LI+  SALR+I+L+K   +   +   +KLF+KH+KVEEQ++ L K   +IA GTP RI 
Sbjct: 186 LLIVCGSALRTIDLIKQLVTFKGQAKVLKLFAKHIKVEEQIKSLSKGVTHIAVGTPGRIC 245

Query: 200 KLIDIEALGLSRLAVIVLDVQ-PDVKGYSLFSLPQVRDEFWDLYKSYLHPRIVEGELRIC 258
            L++ E L +  L  +VLD    D K   +  +P+V+ +   +    L     EG ++I 
Sbjct: 246 ALLEKEGLTVQGLRYLVLDWNYRDQKQRRMVDVPEVKGDLLKMMDQGLIQSCREGTVKIG 291

BLAST of CSPI04G03820 vs. Swiss-Prot
Match: CMS1_XENLA (Protein CMSS1 OS=Xenopus laevis GN=cmss1 PE=2 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.1e-11
Identity = 71/253 (28.06%), Positives = 123/253 (48.62%), Query Frame = 1

Query: 9   KSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPLEPFSFFLN 68
           K S  +  NK+K+ P  K +  +K++  V+     +++    +    + +  +  +    
Sbjct: 27  KESEESKGNKKKKIPSGKTQVKRKKEVKVSQEAEKEDSAPKKKRRKKKTIS-DVLAQHAP 86

Query: 69  EFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKH-VKEAFGSSWKEILCK-GEL 128
              T  D+Q   LE    K R ++   E  + D   + ++ +     S  KEI  K  +L
Sbjct: 87  TAGTPEDMQKLVLEHFENK-RSVIELEELQIPDTCFVKENDLTHTLSSYLKEICPKWSKL 146

Query: 129 LEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQLL-KNRV 188
            +   E  S  +L++ +SA R++EL+K   +   +   +KLF+KH+K+++Q+ LL KN  
Sbjct: 147 SKNHKEKKSVLLLVVCSSAHRTLELIKLINAFKADTKVMKLFAKHIKIKDQINLLEKNVT 206

Query: 189 NIASGTPSRIKKLIDIEALGLSRLAVIVLDVQ-PDVKGYSLFSLPQVRDEFWDLYKSYLH 248
           +I  GTP RIK LID + L L  +  +V D    D K   +  +P+V+ E  +L  S L 
Sbjct: 207 HIGIGTPGRIKALIDQDGLSLESMKYLVFDWNWRDQKLRRMMDIPEVKKETLELLDSGLI 266

Query: 249 PRIVEGELRICLF 258
                G L+I LF
Sbjct: 267 RASRAGTLKIGLF 277

BLAST of CSPI04G03820 vs. Swiss-Prot
Match: CMS1_HUMAN (Protein CMSS1 OS=Homo sapiens GN=CMSS1 PE=1 SV=2)

HSP 1 Score: 70.9 bits (172), Expect = 2.5e-11
Identity = 69/248 (27.82%), Positives = 118/248 (47.58%), Query Frame = 1

Query: 15  PRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLP--LEPFSFFLNEFQT 74
           P  K KQ         K+RK +    R  ++         SEP P   E     + ++ +
Sbjct: 45  PSEKTKQPKECFLIQPKERKENTTKTRKRRKKKITDVLAKSEPKPGLPEDLQKLMKDYYS 104

Query: 75  ANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEILCKG-ELLEGRT 134
           +  + +   EL+ + D C L   +            +  +  S  KEI  K  +L +  +
Sbjct: 105 SRRLVIELEELN-LPDSCFLKAND------------LTHSLSSYLKEICPKWVKLRKNHS 164

Query: 135 EPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQLLKNR-VNIASG 194
           E  S  +LII +SA+R++EL++   +   +   +KLF+KH+KV+ QV+LL+ R V++  G
Sbjct: 165 EKKSVLMLIICSSAVRALELIRSMTAFRGDGKVIKLFAKHIKVQAQVKLLEKRVVHLGVG 224

Query: 195 TPSRIKKLIDIEALGLSRLAVIVLDVQ-PDVKGYSLFSLPQVRDEFWDLYKSYLHPRIVE 254
           TP RIK+L+    L LS L  +V D    D K   +  +P++R E ++L +  +      
Sbjct: 225 TPGRIKELVKQGGLNLSPLKFLVFDWNWRDQKLRRMMDIPEIRKEVFELLEMGVLSLCKS 279

Query: 255 GELRICLF 258
             L++ LF
Sbjct: 285 ESLKLGLF 279

BLAST of CSPI04G03820 vs. TrEMBL
Match: A0A0A0KUA9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G017160 PE=4 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 6.4e-147
Identity = 270/271 (99.63%), Positives = 270/271 (99.63%), Query Frame = 1

Query: 1   MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL 60
           MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL
Sbjct: 1   MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL 60

Query: 61  EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI 120
           EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI
Sbjct: 61  EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI 120

Query: 121 LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL 180
           LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL
Sbjct: 121 LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL 180

Query: 181 LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK 240
           LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK
Sbjct: 181 LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK 240

Query: 241 SYLHPRIVEGELRICLFGPLQPTRKRRKKGI 272
           SYLHPRIVEGELRICLFGPLQPTRKRRKK I
Sbjct: 241 SYLHPRIVEGELRICLFGPLQPTRKRRKKEI 271

BLAST of CSPI04G03820 vs. TrEMBL
Match: M5XLJ0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009444mg PE=4 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 1.5e-79
Identity = 169/280 (60.36%), Positives = 205/280 (73.21%), Query Frame = 1

Query: 5   ENADKSSLRNPRNKRKQRPPN--KPRNAKKRKPDVNH---------NRHAQETTNG---S 64
           +N  K SLRNP+NK K   PN  K +  KK+K  VN+         N++ Q + N    S
Sbjct: 7   KNLSKLSLRNPKNKGKPLGPNNTKRQKEKKKKQRVNNQNPIEDKATNQNTQNSNNNNSLS 66

Query: 65  EAIASEPLPLEP-FSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHV 124
            A  ++P       SFFL++FQ+ N V++SSLEL+S+ D+CIL   ES  QD   L KHV
Sbjct: 67  NATKAQPASASAQLSFFLDQFQSGNGVKISSLELESVNDKCILDLSESLDQDVTLLGKHV 126

Query: 125 KEAFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFS 184
             AFG SWKE LC+  LL G+ +PGSPA+LIISTSALRSIELL+G R++T+ECHA KLFS
Sbjct: 127 MAAFGPSWKEELCEKHLLNGKIDPGSPAILIISTSALRSIELLRGLRALTKECHAAKLFS 186

Query: 185 KHMKVEEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLP 244
           KHMKVEEQV LLKNRVNIASGTP+RIKKLIDIEALGLSRL+VIVLD  PDVKGYSLF+LP
Sbjct: 187 KHMKVEEQVSLLKNRVNIASGTPNRIKKLIDIEALGLSRLSVIVLDTHPDVKGYSLFTLP 246

Query: 245 QVRDEFWDLYKSYLHPRIVEGELRICLFGPLQPTRKRRKK 270
           QVRDEFWDLYKSY H R+++G LR+ L+GPL    + + K
Sbjct: 247 QVRDEFWDLYKSYFHDRLLQGSLRLGLYGPLSSGNELKGK 286

BLAST of CSPI04G03820 vs. TrEMBL
Match: A0A061EKJ3_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_020062 PE=4 SV=1)

HSP 1 Score: 299.3 bits (765), Expect = 4.8e-78
Identity = 158/269 (58.74%), Positives = 205/269 (76.21%), Query Frame = 1

Query: 9   KSSLRNPRNKRKQRPPNK--PRN-AKKRKPDVNHNRHAQET-----------TNGSEAIA 68
           +++ +NPRN RK   P    P++ +KK+K   N+N H  +T             G+ +  
Sbjct: 18  QNARQNPRNNRKPLGPKSDIPKSKSKKKKKTKNNNNHQNDTDPIETKNKTTVVTGNNSNN 77

Query: 69  SEPLPLEP---FSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKE 128
           + P P  P    S+FL++FQ+AN VQ+SSLEL+S+KD CIL   + S QD   L K++KE
Sbjct: 78  ARPPPATPRQQLSYFLSQFQSANGVQLSSLELESIKDSCILDVSQESGQDVMRLEKYIKE 137

Query: 129 AFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKH 188
           AFG+ WKE LC+G+L+ G+TE GSPAVL+++TSALRSIELL+G RS T+EC AVKLFSKH
Sbjct: 138 AFGAKWKEELCEGKLIGGKTEAGSPAVLVVATSALRSIELLRGMRSFTKECCAVKLFSKH 197

Query: 189 MKVEEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQV 248
           MK++EQV LLKNRVNIASGTPSRIKKLIDIEALGLSRL++I+LD+  DVKGYSL +LPQV
Sbjct: 198 MKIDEQVSLLKNRVNIASGTPSRIKKLIDIEALGLSRLSLILLDIHTDVKGYSLLTLPQV 257

Query: 249 RDEFWDLYKSYLHPRIVEGELRICLFGPL 261
           RDEFWDLYK+Y H ++V+G+LRICL+GP+
Sbjct: 258 RDEFWDLYKNYFHQQVVQGDLRICLYGPI 286

BLAST of CSPI04G03820 vs. TrEMBL
Match: A0A061EJS9_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_020062 PE=4 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 1.2e-76
Identity = 158/270 (58.52%), Positives = 205/270 (75.93%), Query Frame = 1

Query: 9   KSSLRNPRNKRKQRPPNK--PRN-AKKRKPDVNHNRHAQET-----------TNGSEAIA 68
           +++ +NPRN RK   P    P++ +KK+K   N+N H  +T             G+ +  
Sbjct: 18  QNARQNPRNNRKPLGPKSDIPKSKSKKKKKTKNNNNHQNDTDPIETKNKTTVVTGNNSNN 77

Query: 69  SEPLPLEP---FSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKE 128
           + P P  P    S+FL++FQ+AN VQ+SSLEL+S+KD CIL   + S QD   L K++KE
Sbjct: 78  ARPPPATPRQQLSYFLSQFQSANGVQLSSLELESIKDSCILDVSQESGQDVMRLEKYIKE 137

Query: 129 AFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKH 188
           AFG+ WKE LC+G+L+ G+TE GSPAVL+++TSALRSIELL+G RS T+EC AVKLFSKH
Sbjct: 138 AFGAKWKEELCEGKLIGGKTEAGSPAVLVVATSALRSIELLRGMRSFTKECCAVKLFSKH 197

Query: 189 MKV-EEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQ 248
           MK+ E+QV LLKNRVNIASGTPSRIKKLIDIEALGLSRL++I+LD+  DVKGYSL +LPQ
Sbjct: 198 MKIDEQQVSLLKNRVNIASGTPSRIKKLIDIEALGLSRLSLILLDIHTDVKGYSLLTLPQ 257

Query: 249 VRDEFWDLYKSYLHPRIVEGELRICLFGPL 261
           VRDEFWDLYK+Y H ++V+G+LRICL+GP+
Sbjct: 258 VRDEFWDLYKNYFHQQVVQGDLRICLYGPI 287

BLAST of CSPI04G03820 vs. TrEMBL
Match: A0A067LFJ7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23821 PE=4 SV=1)

HSP 1 Score: 292.4 bits (747), Expect = 5.9e-76
Identity = 156/270 (57.78%), Positives = 202/270 (74.81%), Query Frame = 1

Query: 9   KSSLRNPR-----NKRKQRPPN----KPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLP 68
           K+S +NP+      K+K++PP     K +N KK+   + +N + +   N +    +    
Sbjct: 15  KASFKNPKIDNNFKKKKKQPPTLKAEKKKNKKKKVHQIKNNDNNEIINNNNNNELTPTSV 74

Query: 69  LEPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKE 128
            +  SFF+NEF++AND+Q+SSLEL+S+K+   +  P    QD KSL K ++ AFG SWKE
Sbjct: 75  SQELSFFVNEFESANDLQLSSLELESIKETSFVELP----QDVKSLGKQIQAAFGPSWKE 134

Query: 129 ILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQ 188
           +LC+G+L+EG+ E G+P+VLIISTSALR IELL+G R +T+ECHAVKLFSKHMK+EEQV 
Sbjct: 135 VLCEGQLVEGKIEAGNPSVLIISTSALRCIELLRGVRPLTKECHAVKLFSKHMKIEEQVA 194

Query: 189 LLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLY 248
           LLKNRVN ASGTPSR+KKLIDIEALGLSRLAVIVLD+  DVKGYSL +LPQVRDEFWDLY
Sbjct: 195 LLKNRVNFASGTPSRLKKLIDIEALGLSRLAVIVLDIHTDVKGYSLLTLPQVRDEFWDLY 254

Query: 249 KSYLHPRIVEGELRICLFGPLQPTRKRRKK 270
           K+Y H R+++ +LRICLFGPL    + R K
Sbjct: 255 KNYFHQRLLQRDLRICLFGPLLNGNQLRGK 280

BLAST of CSPI04G03820 vs. TAIR10
Match: AT2G43110.1 (AT2G43110.1 unknown protein)

HSP 1 Score: 264.6 bits (675), Expect = 6.6e-71
Identity = 145/278 (52.16%), Positives = 196/278 (70.50%), Query Frame = 1

Query: 10  SSLRNP-RNKRKQRPPNKPRNAKKRK--------------PDVNHNRHAQETTNGSE--A 69
           S+++NP RN+R    P K    KK K                +  +R   + T+  E   
Sbjct: 9   SAVKNPKRNRRPSHGPKKDLKKKKTKITKKTKKSKAPTFDKTIEKSRSNDQKTDNDEDEQ 68

Query: 70  IASEPLPL-EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKE 129
           + SEP+   E  ++FLN   +A  ++VSSLEL+ +KD CI+   +   QD  +L +H+K 
Sbjct: 69  LYSEPVSASEQLNYFLNHLDSAIGIKVSSLELEPIKDTCIVELSQGLDQDVSNLGEHIKL 128

Query: 130 AFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKH 189
           + GSSW+E LC+GE LE + EPG+P+VL+IS+SALRS+ELL+G  S+T++C AVKLFSKH
Sbjct: 129 SCGSSWRETLCEGESLERKVEPGNPSVLVISSSALRSLELLRGLHSLTKQCPAVKLFSKH 188

Query: 190 MKVEEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQV 249
           +KVEEQV LLK RVNI SGTP+RIKKL+DIEALGLSRL +IV+D+ PDVKG+SLF+LPQV
Sbjct: 189 LKVEEQVSLLKKRVNIGSGTPNRIKKLVDIEALGLSRLDMIVIDMHPDVKGFSLFTLPQV 248

Query: 250 RDEFWDLYKSYLHPRIVEGELRICLFGPLQPTRKRRKK 270
           RDEFWDLYK+  H R++EG LRIC++GP +P+   +KK
Sbjct: 249 RDEFWDLYKNCFHQRVLEGRLRICMYGP-KPSPNLKKK 285

BLAST of CSPI04G03820 vs. NCBI nr
Match: gi|449468740|ref|XP_004152079.1| (PREDICTED: protein CMSS1 [Cucumis sativus])

HSP 1 Score: 528.1 bits (1359), Expect = 9.1e-147
Identity = 270/271 (99.63%), Positives = 270/271 (99.63%), Query Frame = 1

Query: 1   MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL 60
           MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL
Sbjct: 1   MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL 60

Query: 61  EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI 120
           EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI
Sbjct: 61  EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI 120

Query: 121 LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL 180
           LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL
Sbjct: 121 LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL 180

Query: 181 LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK 240
           LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK
Sbjct: 181 LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK 240

Query: 241 SYLHPRIVEGELRICLFGPLQPTRKRRKKGI 272
           SYLHPRIVEGELRICLFGPLQPTRKRRKK I
Sbjct: 241 SYLHPRIVEGELRICLFGPLQPTRKRRKKEI 271

BLAST of CSPI04G03820 vs. NCBI nr
Match: gi|659107923|ref|XP_008453922.1| (PREDICTED: protein CMSS1 [Cucumis melo])

HSP 1 Score: 511.5 bits (1316), Expect = 8.8e-142
Identity = 258/271 (95.20%), Positives = 264/271 (97.42%), Query Frame = 1

Query: 1   MVGLENADKSSLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQETTNGSEAIASEPLPL 60
           MVGLENAD +SLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQ  TN SE IASEPLPL
Sbjct: 1   MVGLENADNASLRNPRNKRKQRPPNKPRNAKKRKPDVNHNRHAQNATNSSEIIASEPLPL 60

Query: 61  EPFSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKEAFGSSWKEI 120
           EPFSFFLNEFQTANDVQVSSLEL+SMKDRCILGPPESSVQDDKSLVKH+KEA GSSWKE+
Sbjct: 61  EPFSFFLNEFQTANDVQVSSLELESMKDRCILGPPESSVQDDKSLVKHIKEALGSSWKEV 120

Query: 121 LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL 180
           LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL
Sbjct: 121 LCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKHMKVEEQVQL 180

Query: 181 LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK 240
           LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK
Sbjct: 181 LKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQVRDEFWDLYK 240

Query: 241 SYLHPRIVEGELRICLFGPLQPTRKRRKKGI 272
           SYLHPR+VEGELRICLFGPLQPTRKRRKK +
Sbjct: 241 SYLHPRVVEGELRICLFGPLQPTRKRRKKEV 271

BLAST of CSPI04G03820 vs. NCBI nr
Match: gi|645232023|ref|XP_008222671.1| (PREDICTED: protein CMSS1 [Prunus mume])

HSP 1 Score: 307.0 bits (785), Expect = 3.3e-80
Identity = 170/280 (60.71%), Positives = 207/280 (73.93%), Query Frame = 1

Query: 5   ENADKSSLRNPRNKRKQRPPN--KPRNAKKRKPDVNH---------NRHAQETTNG---S 64
           +N  K SLRNP+NK K   PN  K +  KK+K  VN+         N++ Q + N    S
Sbjct: 7   KNHSKLSLRNPKNKGKPLGPNNTKRKKEKKKKQRVNNQNPIEDKASNQNTQNSNNNNSLS 66

Query: 65  EAIASEPLPLEP-FSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHV 124
            A  ++P       SFFL++FQ+ N V++SSLEL+S+ D+CIL   ES  QD   L KHV
Sbjct: 67  NATKAQPASASAQLSFFLDQFQSGNGVKISSLELESVNDKCILDLSESLDQDVSLLGKHV 126

Query: 125 KEAFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFS 184
             AFG SWKE LC+  LL+G+ +PGSPA+LIISTSALRSIELL+G RS+T+ECHA KLFS
Sbjct: 127 MAAFGPSWKEELCEKHLLDGKIDPGSPAILIISTSALRSIELLRGLRSLTKECHAAKLFS 186

Query: 185 KHMKVEEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLP 244
           KHMKVEEQV LLKNRVNIASGTP+RIKKLIDIEALGLSRL+VIVLD+ PDVKGYSLF+LP
Sbjct: 187 KHMKVEEQVSLLKNRVNIASGTPNRIKKLIDIEALGLSRLSVIVLDMHPDVKGYSLFTLP 246

Query: 245 QVRDEFWDLYKSYLHPRIVEGELRICLFGPLQPTRKRRKK 270
           QVRDEFWDLYKSY H R+++G LR+ L+GPL    + + K
Sbjct: 247 QVRDEFWDLYKSYFHDRLLQGSLRLGLYGPLSSGNELKGK 286

BLAST of CSPI04G03820 vs. NCBI nr
Match: gi|596287299|ref|XP_007225753.1| (hypothetical protein PRUPE_ppa009444mg [Prunus persica])

HSP 1 Score: 304.3 bits (778), Expect = 2.1e-79
Identity = 169/280 (60.36%), Positives = 205/280 (73.21%), Query Frame = 1

Query: 5   ENADKSSLRNPRNKRKQRPPN--KPRNAKKRKPDVNH---------NRHAQETTNG---S 64
           +N  K SLRNP+NK K   PN  K +  KK+K  VN+         N++ Q + N    S
Sbjct: 7   KNLSKLSLRNPKNKGKPLGPNNTKRQKEKKKKQRVNNQNPIEDKATNQNTQNSNNNNSLS 66

Query: 65  EAIASEPLPLEP-FSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHV 124
            A  ++P       SFFL++FQ+ N V++SSLEL+S+ D+CIL   ES  QD   L KHV
Sbjct: 67  NATKAQPASASAQLSFFLDQFQSGNGVKISSLELESVNDKCILDLSESLDQDVTLLGKHV 126

Query: 125 KEAFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFS 184
             AFG SWKE LC+  LL G+ +PGSPA+LIISTSALRSIELL+G R++T+ECHA KLFS
Sbjct: 127 MAAFGPSWKEELCEKHLLNGKIDPGSPAILIISTSALRSIELLRGLRALTKECHAAKLFS 186

Query: 185 KHMKVEEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLP 244
           KHMKVEEQV LLKNRVNIASGTP+RIKKLIDIEALGLSRL+VIVLD  PDVKGYSLF+LP
Sbjct: 187 KHMKVEEQVSLLKNRVNIASGTPNRIKKLIDIEALGLSRLSVIVLDTHPDVKGYSLFTLP 246

Query: 245 QVRDEFWDLYKSYLHPRIVEGELRICLFGPLQPTRKRRKK 270
           QVRDEFWDLYKSY H R+++G LR+ L+GPL    + + K
Sbjct: 247 QVRDEFWDLYKSYFHDRLLQGSLRLGLYGPLSSGNELKGK 286

BLAST of CSPI04G03820 vs. NCBI nr
Match: gi|590655450|ref|XP_007033991.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 299.3 bits (765), Expect = 6.9e-78
Identity = 158/269 (58.74%), Positives = 205/269 (76.21%), Query Frame = 1

Query: 9   KSSLRNPRNKRKQRPPNK--PRN-AKKRKPDVNHNRHAQET-----------TNGSEAIA 68
           +++ +NPRN RK   P    P++ +KK+K   N+N H  +T             G+ +  
Sbjct: 18  QNARQNPRNNRKPLGPKSDIPKSKSKKKKKTKNNNNHQNDTDPIETKNKTTVVTGNNSNN 77

Query: 69  SEPLPLEP---FSFFLNEFQTANDVQVSSLELDSMKDRCILGPPESSVQDDKSLVKHVKE 128
           + P P  P    S+FL++FQ+AN VQ+SSLEL+S+KD CIL   + S QD   L K++KE
Sbjct: 78  ARPPPATPRQQLSYFLSQFQSANGVQLSSLELESIKDSCILDVSQESGQDVMRLEKYIKE 137

Query: 129 AFGSSWKEILCKGELLEGRTEPGSPAVLIISTSALRSIELLKGFRSITQECHAVKLFSKH 188
           AFG+ WKE LC+G+L+ G+TE GSPAVL+++TSALRSIELL+G RS T+EC AVKLFSKH
Sbjct: 138 AFGAKWKEELCEGKLIGGKTEAGSPAVLVVATSALRSIELLRGMRSFTKECCAVKLFSKH 197

Query: 189 MKVEEQVQLLKNRVNIASGTPSRIKKLIDIEALGLSRLAVIVLDVQPDVKGYSLFSLPQV 248
           MK++EQV LLKNRVNIASGTPSRIKKLIDIEALGLSRL++I+LD+  DVKGYSL +LPQV
Sbjct: 198 MKIDEQVSLLKNRVNIASGTPSRIKKLIDIEALGLSRLSLILLDIHTDVKGYSLLTLPQV 257

Query: 249 RDEFWDLYKSYLHPRIVEGELRICLFGPL 261
           RDEFWDLYK+Y H ++V+G+LRICL+GP+
Sbjct: 258 RDEFWDLYKNYFHQQVVQGDLRICLYGPI 286

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
CMS1_RAT    3.5e-13   32.89         Protein CMSS1 OS=Rattus norvegicus GN=Cmss1 PE=1 SV=1
CMS1_MOUSE  4.6e-13   33.56         Protein CMSS1 OS=Mus musculus GN=Cmss1 PE=2 SV=1
CMS1_DANRE  5.9e-13   28.93         Protein CMSS1 OS=Danio rerio GN=cmss1 PE=2 SV=1
CMS1_XENLA  1.1e-11   28.06         Protein CMSS1 OS=Xenopus laevis GN=cmss1 PE=2 SV=1
CMS1_HUMAN  2.5e-11   27.82         Protein CMSS1 OS=Homo sapiens GN=CMSS1 PE=1 SV=2

Match Name        E-value   Identity (%)  Description
A0A0A0KUA9_CUCSA  6.4e-147  99.63         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G017160 PE=4 SV=1
M5XLJ0_PRUPE      1.5e-79   60.36         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009444mg PE=4 SV=1
A0A061EKJ3_THECC  4.8e-78   58.74         Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_020062 PE=4 SV=1
A0A061EJS9_THECC  1.2e-76   58.52         Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_020062 PE=4 SV=1
A0A067LFJ7_JATCU  5.9e-76   57.78         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23821 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT2G43110.1  6.6e-71  52.16         unknown protein

Match Name                        E-value   Identity (%)  Description
gi|449468740|ref|XP_004152079.1|  9.1e-147  99.63         PREDICTED: protein CMSS1 [Cucumis sativus]
gi|659107923|ref|XP_008453922.1|  8.8e-142  95.20         PREDICTED: protein CMSS1 [Cucumis melo]
gi|645232023|ref|XP_008222671.1|  3.3e-80   60.71         PREDICTED: protein CMSS1 [Prunus mume]
gi|596287299|ref|XP_007225753.1|  2.1e-79   60.36         hypothetical protein PRUPE_ppa009444mg [Prunus persica]
gi|590655450|ref|XP_007033991.1|  6.9e-78   58.74         Uncharacterized protein isoform 1 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR027417  P-loop_NTPase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G03820.1  CSPI04G03820.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term        Source Description  Alignment
IPR027417  P-loop containing nucleoside triphosphate hydrolase  GENE3D   G3DSA:3.40.50.300  -                   coord: 134..215; score: 1.
None       No IPR available                                     PANTHER  PTHR24030:SF0      PROTEIN CMSS1       coord: 13..257; score: 1.0