CSPI02G18840 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI02G18840
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: SASA domain-containing protein
Location: Chr2: 17282487 .. 17283667 (-)
RNA-Seq Expression: CSPI02G18840
Synteny: CSPI02G18840
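
The span of the gene can be derived from the Location field. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Chr2: 17282487 .. 17283667 (-)
gene_length = feature_length(17282487, 17283667)
print(gene_length)  # 1181
```

Under that assumption the gene spans 1181 bases on the minus strand of Chr2.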
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, polypeptide, three_prime_UTR
ATGACCAACAAGGAGAGGGAAGAGGCAAGGATCAGCCAACAAAGACAAGAGTGGTTGTGTGTTTGTGTTTGTGTTTGTGCAGATAGACACAGAGGAAAGAGCTTTCCTCTTTCTTTTACTTTAACACTTTTTTCTGTGTCTTTCTGTGCAAATACCAGGTTCGAAATGTCTTTGCTGAAACTATCAATTATGCTATGTGCAATGTTATTTGGTCCTTCTCTTTCCAGAGCTGTTTCTCCAAATAACATATTTATTCTTTCCGGTCAGAGCAACATGGCTGGTCGAGGTGGAGTTGAAAAAAATGCAACAGGAAACTTACATTGGGATGGTGTGATCCCACCAGATTCTGAACCCACCCCATGTATTCTACGACTCAACGCTGCACGCCAATGGGAGGAAGCACGAGAGCCTCTCAATTTTGATATCGACGTTAAAAAGGAAAATGGAATTAGTCCAGGAATGGGATTTGCTCATGAAATTTTAAGAAAGGCAGGGCCAAGAGCAGGTGTTGTGGGTTTAGTTCCTACTGCTATAGGTGGCACTGTCATCAGACAATGGATGAAAAATACTACCGATCCTAATGCAACATATTACCAACACTTAGTTGAACGGATTAAAGCTTCGGATAAAGATGGTGGAGTTGTTCGTGCTCTTCTATGGTTCCAAGGAGAAAGCGATGCAGCTGTGAAAGATTATGCTATCAATTATAAAGACAACTTAAAAACTTTAATTAACGACCTTCGCAACGACCTCAAGCCTAGATTTTTACCTGTCATTTTGGTTAAAATAGCCATCTATGACTTTTTTGCAGTAAATGGTACCGATAATTTGTCAACAGTGAGGGCGGCACAAGAAGCAGTTAGCAATGAGGTTCCAGATGTATCGATCATCGATTCATGGAAATTGCCGATGAACTTAACAACACGTGAGGGCTTTAACTTGGATCGTGGTCATTTTAATTCCACAGTTCTTCTTACCGCCGGTAGATGGTTGGCTGATACCTACCTCTCCCGATACAGCCAATTACTCTGATTCTTTATTATTTGTACTTTTAATGAGTTCGATTTTATCGTTCTTTTATTTCATTTATTAAATAGTATTTATGTTTGATATTAATATCCCACATTTTAGATATTCTAAGTACATTTTTTTTATTTACTAACGTAAAACCACCTCCAACC

mRNA sequence

ATGACCAACAAGGAGAGGGAAGAGGCAAGGATCAGCCAACAAAGACAAGAGTGGTTGTGTGTTTGTGTTTGTGTTTGTGCAGATAGACACAGAGGAAAGAGCTTTCCTCTTTCTTTTACTTTAACACTTTTTTCTGTGTCTTTCTGTGCAAATACCAGGTTCGAAATGTCTTTGCTGAAACTATCAATTATGCTATGTGCAATGTTATTTGGTCCTTCTCTTTCCAGAGCTGTTTCTCCAAATAACATATTTATTCTTTCCGGTCAGAGCAACATGGCTGGTCGAGGTGGAGTTGAAAAAAATGCAACAGGAAACTTACATTGGGATGGTGTGATCCCACCAGATTCTGAACCCACCCCATGTATTCTACGACTCAACGCTGCACGCCAATGGGAGGAAGCACGAGAGCCTCTCAATTTTGATATCGACGTTAAAAAGGAAAATGGAATTAGTCCAGGAATGGGATTTGCTCATGAAATTTTAAGAAAGGCAGGGCCAAGAGCAGGTGTTGTGGGTTTAGTTCCTACTGCTATAGGTGGCACTGTCATCAGACAATGGATGAAAAATACTACCGATCCTAATGCAACATATTACCAACACTTAGTTGAACGGATTAAAGCTTCGGATAAAGATGGTGGAGTTGTTCGTGCTCTTCTATGGTTCCAAGGAGAAAGCGATGCAGCTGTGAAAGATTATGCTATCAATTATAAAGACAACTTAAAAACTTTAATTAACGACCTTCGCAACGACCTCAAGCCTAGATTTTTACCTGTCATTTTGGTTAAAATAGCCATCTATGACTTTTTTGCAGTAAATGGTACCGATAATTTGTCAACAGTGAGGGCGGCACAAGAAGCAGTTAGCAATGAGGTTCCAGATGTATCGATCATCGATTCATGGAAATTGCCGATGAACTTAACAACACGTGAGGGCTTTAACTTGGATCGTGGTCATTTTAATTCCACAGTTCTTCTTACCGCCGGTAGATGGTTGGCTGATACCTACCTCTCCCGATACAGCCAATTACTCTGATTCTTTATTATTTGTACTTTTAATGAGTTCGATTTTATCGTTCTTTTATTTCATTTATTAAATAGTATTTATGTTTGATATTAATATCCCACATTTTAGATATTCTAAGTACATTTTTTTTATTTACTAACGTAAAACCACCTCCAACC

Coding sequence (CDS)

ATGACCAACAAGGAGAGGGAAGAGGCAAGGATCAGCCAACAAAGACAAGAGTGGTTGTGTGTTTGTGTTTGTGTTTGTGCAGATAGACACAGAGGAAAGAGCTTTCCTCTTTCTTTTACTTTAACACTTTTTTCTGTGTCTTTCTGTGCAAATACCAGGTTCGAAATGTCTTTGCTGAAACTATCAATTATGCTATGTGCAATGTTATTTGGTCCTTCTCTTTCCAGAGCTGTTTCTCCAAATAACATATTTATTCTTTCCGGTCAGAGCAACATGGCTGGTCGAGGTGGAGTTGAAAAAAATGCAACAGGAAACTTACATTGGGATGGTGTGATCCCACCAGATTCTGAACCCACCCCATGTATTCTACGACTCAACGCTGCACGCCAATGGGAGGAAGCACGAGAGCCTCTCAATTTTGATATCGACGTTAAAAAGGAAAATGGAATTAGTCCAGGAATGGGATTTGCTCATGAAATTTTAAGAAAGGCAGGGCCAAGAGCAGGTGTTGTGGGTTTAGTTCCTACTGCTATAGGTGGCACTGTCATCAGACAATGGATGAAAAATACTACCGATCCTAATGCAACATATTACCAACACTTAGTTGAACGGATTAAAGCTTCGGATAAAGATGGTGGAGTTGTTCGTGCTCTTCTATGGTTCCAAGGAGAAAGCGATGCAGCTGTGAAAGATTATGCTATCAATTATAAAGACAACTTAAAAACTTTAATTAACGACCTTCGCAACGACCTCAAGCCTAGATTTTTACCTGTCATTTTGGTTAAAATAGCCATCTATGACTTTTTTGCAGTAAATGGTACCGATAATTTGTCAACAGTGAGGGCGGCACAAGAAGCAGTTAGCAATGAGGTTCCAGATGTATCGATCATCGATTCATGGAAATTGCCGATGAACTTAACAACACGTGAGGGCTTTAACTTGGATCGTGGTCATTTTAATTCCACAGTTCTTCTTACCGCCGGTAGATGGTTGGCTGATACCTACCTCTCCCGATACAGCCAATTACTCTGA

Protein sequence

MTNKEREEARISQQRQEWLCVCVCVCADRHRGKSFPLSFTLTLFSVSFCANTRFEMSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPDSEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL*
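
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not the pipeline used to build this record: it assumes the standard (nuclear) code applies, and `translate`, `cds_start` and `cds_end` are illustrative names. Only the first 30 and last 9 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGACCAACAAGGAGAGGGAAGAGGCAAGG"  # first 30 bases of the CDS
cds_end = "TTACTCTGA"                         # last 9 bases of the CDS

print(translate(cds_start))  # MTNKEREEAR
print(translate(cds_end))    # LL*
```

The two outputs agree with the beginning (MTNKEREEAR) and the end (LL*) of the protein sequence listed above.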
Homology
BLAST of CSPI02G18840 vs. ExPASy Swiss-Prot
Match: Q8L9J9 (Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g34215 PE=1 SV=2)

HSP 1 Score: 181.0 bits (458), Expect = 2.2e-44
Identity = 106/269 (39.41%), Positives = 151/269 (56.13%), Query Frame = 0

Query: 72  PSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLH-WDGVIPPDSEPTPCILRLNAARQ 131
           P +   + PN IFILSGQSNMAGRGGV K+   N   WD ++PP+  P   ILRL+A  +
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72

Query: 132 WEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNT 191
           WEEA EPL+ DID  K  G+ PGM FA+ +  +    + V+GLVP A GGT I++W    
Sbjct: 73  WEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEW---- 132

Query: 192 TDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRND 251
            +  +  Y+ +V+R + S K GG ++A+LW+QGESD      A +Y +N+  LI +LR+D
Sbjct: 133 -ERGSHLYERMVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHD 192

Query: 252 LKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTRE 311
           L    LP+I V IA       +G   +  VR AQ  +  ++ +V  +D+  LP+      
Sbjct: 193 LNLPSLPIIQVAIA-------SGGGYIDKVREAQ--LGLKLSNVVCVDAKGLPL------ 252

Query: 312 GFNLDRGHFNSTVLLTAGRWLADTYLSRY 340
               D  H  +   +  G  LA  YLS +
Sbjct: 253 --KSDNLHLTTEAQVQLGLSLAQAYLSNF 259
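
The percentages in the HSP header are the identical and positive-scoring position counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this HSP (`percent` is a hypothetical helper name):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    return round(100 * matches / alignment_length, 2)


identity = percent(106, 269)   # identical residues / alignment length
positives = percent(151, 269)  # positive-scoring residues / alignment length
print(identity, positives)     # 39.41 56.13
```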

BLAST of CSPI02G18840 vs. ExPASy TrEMBL
Match: A0A5A7V4V5 (Putative carbohydrate esterase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold501G00580 PE=4 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 6.0e-133
Identity = 236/288 (81.94%), Positives = 259/288 (89.93%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M+L+KLSI+LCAMLFGPSLS AVSP NIFIL GQSNMAGRGGVEKN++G   WDGVIPPD
Sbjct: 4   MALVKLSILLCAMLFGPSLSGAVSPKNIFILGGQSNMAGRGGVEKNSSGKFEWDGVIPPD 63

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLNAARQWE AREPL++DIDV K NGISPGMGFAHE+L KAGPRAGVVGLVP
Sbjct: 64  CKPNPSILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVP 123

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGT IRQW+KN + PNATYYQ+LVERI+ASDK+GGVVRALLWFQGESDAAVK+ AIN
Sbjct: 124 TAIGGTFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAIN 183

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKT I DLR D++PRFLPVI+VKIA+YDF   N TDNLS VRAAQEAVS EVPDVS
Sbjct: 184 YKDNLKTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVS 243

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNL TREGFNLDRGHFN+T+ LTAG+WLAD YLSRYS+LL
Sbjct: 244 IIDSWKLPMNLKTREGFNLDRGHFNTTIELTAGKWLADAYLSRYSRLL 291

BLAST of CSPI02G18840 vs. ExPASy TrEMBL
Match: E5GB85 (SASA domain-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 6.0e-133
Identity = 236/288 (81.94%), Positives = 259/288 (89.93%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M+L+KLSI+LCAMLFGPSLS AVSP NIFIL GQSNMAGRGGVEKN++G   WDGVIPPD
Sbjct: 4   MALVKLSILLCAMLFGPSLSGAVSPQNIFILGGQSNMAGRGGVEKNSSGKFEWDGVIPPD 63

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLNAARQWE AREPL++DIDV K NGISPGMGFAHE+L KAGPRAGVVGLVP
Sbjct: 64  CKPNPSILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVP 123

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGT IRQW+KN + PNATYYQ+LVERI+ASDK+GGVVRALLWFQGESDAAVK+ AIN
Sbjct: 124 TAIGGTFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAIN 183

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKT I DLR D++PRFLPVI+VKIA+YDF   N TDNLS VRAAQEAVS EVPDVS
Sbjct: 184 YKDNLKTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVS 243

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNL TREGFNLDRGHFN+T+ LTAG+WLAD YLSRYS+LL
Sbjct: 244 IIDSWKLPMNLKTREGFNLDRGHFNTTIELTAGKWLADAYLSRYSRLL 291

BLAST of CSPI02G18840 vs. ExPASy TrEMBL
Match: A0A1S4DVE0 (probable carbohydrate esterase At4g34215 OS=Cucumis melo OX=3656 GN=LOC103487864 PE=4 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 6.0e-133
Identity = 236/288 (81.94%), Positives = 259/288 (89.93%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M+L+KLSI+LCAMLFGPSLS AVSP NIFIL GQSNMAGRGGVEKN++G   WDGVIPPD
Sbjct: 1   MALVKLSILLCAMLFGPSLSGAVSPQNIFILGGQSNMAGRGGVEKNSSGKFEWDGVIPPD 60

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLNAARQWE AREPL++DIDV K NGISPGMGFAHE+L KAGPRAGVVGLVP
Sbjct: 61  CKPNPSILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVP 120

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGT IRQW+KN + PNATYYQ+LVERI+ASDK+GGVVRALLWFQGESDAAVK+ AIN
Sbjct: 121 TAIGGTFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAIN 180

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKT I DLR D++PRFLPVI+VKIA+YDF   N TDNLS VRAAQEAVS EVPDVS
Sbjct: 181 YKDNLKTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVS 240

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNL TREGFNLDRGHFN+T+ LTAG+WLAD YLSRYS+LL
Sbjct: 241 IIDSWKLPMNLKTREGFNLDRGHFNTTIELTAGKWLADAYLSRYSRLL 288

BLAST of CSPI02G18840 vs. ExPASy TrEMBL
Match: A0A0A0LNC5 (SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G356040 PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 3.2e-110
Identity = 194/288 (67.36%), Positives = 231/288 (80.21%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M LL+LSI+LC ML+GPSLS A SP NIFIL+GQSNMAGRGGVE NA GNL WDG++PP+
Sbjct: 1   MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPE 60

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLN   QWE AREPL+  ID+K+  GI PG+ FAHE+L KAGP AG VGLVP
Sbjct: 61  CQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVP 120

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
            A GGT+I QW+KN ++P+AT+YQ+ +ERIKASDKDGGVVRAL WFQGESDAA+ D AI 
Sbjct: 121 CARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIR 180

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLK    D+R+D+KPRFLP+I+VKIA+YDFF  + T NL  VR A+EAVS E+PDV 
Sbjct: 181 YKDNLKKFFTDIRDDIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAKEAVSKELPDVV 240

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
            IDS KLP+N TT EG NLD GHFN+T  +T G+WLA+TYLS + QLL
Sbjct: 241 AIDSLKLPINYTTNEGINLDHGHFNTTTEITLGKWLAETYLSHFGQLL 288

BLAST of CSPI02G18840 vs. ExPASy TrEMBL
Match: A0A0A0LKQ6 (SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G356050 PE=4 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 5.1e-108
Identity = 197/197 (100.00%), Positives = 197/197 (100.00%), Query Frame = 0

Query: 147 ENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTDPNATYYQHLVERIK 206
           ENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTDPNATYYQHLVERIK
Sbjct: 30  ENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTDPNATYYQHLVERIK 89

Query: 207 ASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRNDLKPRFLPVILVKIAIY 266
           ASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRNDLKPRFLPVILVKIAIY
Sbjct: 90  ASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRNDLKPRFLPVILVKIAIY 149

Query: 267 DFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTREGFNLDRGHFNSTVLLT 326
           DFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTREGFNLDRGHFNSTVLLT
Sbjct: 150 DFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTREGFNLDRGHFNSTVLLT 209

Query: 327 AGRWLADTYLSRYSQLL 344
           AGRWLADTYLSRYSQLL
Sbjct: 210 AGRWLADTYLSRYSQLL 226

BLAST of CSPI02G18840 vs. NCBI nr
Match: KAE8652072.1 (hypothetical protein Csa_018570 [Cucumis sativus])

HSP 1 Score: 687.2 bits (1772), Expect = 7.3e-194
Identity = 340/343 (99.13%), Positives = 341/343 (99.42%), Query Frame = 0

Query: 1   MTNKEREEARISQQRQEWLCVCVCVCADRHRGKSFPLSFTLTLFSVSFCANTRFEMSLLK 60
           MTNKEREEARISQQRQEWL  CVCVCADRHRGK+FPLSFTLTLFSVSFCANTRFEMSLLK
Sbjct: 1   MTNKEREEARISQQRQEWL--CVCVCADRHRGKNFPLSFTLTLFSVSFCANTRFEMSLLK 60

Query: 61  LSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPDSEPTP 120
           LSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPDSEPTP
Sbjct: 61  LSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPDSEPTP 120

Query: 121 CILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGG 180
           CILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGG
Sbjct: 121 CILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGG 180

Query: 181 TVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNL 240
           TVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNL
Sbjct: 181 TVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNL 240

Query: 241 KTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSW 300
           KTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSW
Sbjct: 241 KTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSW 300

Query: 301 KLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           KLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL
Sbjct: 301 KLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 341

BLAST of CSPI02G18840 vs. NCBI nr
Match: XP_011650181.1 (probable carbohydrate esterase At4g34215 [Cucumis sativus])

HSP 1 Score: 583.6 bits (1503), Expect = 1.1e-162
Identity = 288/288 (100.00%), Positives = 288/288 (100.00%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD
Sbjct: 1   MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 60

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
           SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP
Sbjct: 61  SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 120

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN
Sbjct: 121 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 180

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS
Sbjct: 181 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 240

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL
Sbjct: 241 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 288

BLAST of CSPI02G18840 vs. NCBI nr
Match: ADN33727.1 (hypothetical protein [Cucumis melo subsp. melo])

HSP 1 Score: 483.8 bits (1244), Expect = 1.2e-132
Identity = 236/288 (81.94%), Positives = 259/288 (89.93%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M+L+KLSI+LCAMLFGPSLS AVSP NIFIL GQSNMAGRGGVEKN++G   WDGVIPPD
Sbjct: 4   MALVKLSILLCAMLFGPSLSGAVSPQNIFILGGQSNMAGRGGVEKNSSGKFEWDGVIPPD 63

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLNAARQWE AREPL++DIDV K NGISPGMGFAHE+L KAGPRAGVVGLVP
Sbjct: 64  CKPNPSILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVP 123

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGT IRQW+KN + PNATYYQ+LVERI+ASDK+GGVVRALLWFQGESDAAVK+ AIN
Sbjct: 124 TAIGGTFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAIN 183

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKT I DLR D++PRFLPVI+VKIA+YDF   N TDNLS VRAAQEAVS EVPDVS
Sbjct: 184 YKDNLKTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVS 243

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNL TREGFNLDRGHFN+T+ LTAG+WLAD YLSRYS+LL
Sbjct: 244 IIDSWKLPMNLKTREGFNLDRGHFNTTIELTAGKWLADAYLSRYSRLL 291

BLAST of CSPI02G18840 vs. NCBI nr
Match: XP_016899952.1 (PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo])

HSP 1 Score: 483.8 bits (1244), Expect = 1.2e-132
Identity = 236/288 (81.94%), Positives = 259/288 (89.93%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M+L+KLSI+LCAMLFGPSLS AVSP NIFIL GQSNMAGRGGVEKN++G   WDGVIPPD
Sbjct: 1   MALVKLSILLCAMLFGPSLSGAVSPQNIFILGGQSNMAGRGGVEKNSSGKFEWDGVIPPD 60

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLNAARQWE AREPL++DIDV K NGISPGMGFAHE+L KAGPRAGVVGLVP
Sbjct: 61  CKPNPSILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVP 120

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGT IRQW+KN + PNATYYQ+LVERI+ASDK+GGVVRALLWFQGESDAAVK+ AIN
Sbjct: 121 TAIGGTFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAIN 180

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKT I DLR D++PRFLPVI+VKIA+YDF   N TDNLS VRAAQEAVS EVPDVS
Sbjct: 181 YKDNLKTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVS 240

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNL TREGFNLDRGHFN+T+ LTAG+WLAD YLSRYS+LL
Sbjct: 241 IIDSWKLPMNLKTREGFNLDRGHFNTTIELTAGKWLADAYLSRYSRLL 288

BLAST of CSPI02G18840 vs. NCBI nr
Match: KAA0060925.1 (putative carbohydrate esterase [Cucumis melo var. makuwa])

HSP 1 Score: 483.8 bits (1244), Expect = 1.2e-132
Identity = 236/288 (81.94%), Positives = 259/288 (89.93%), Query Frame = 0

Query: 56  MSLLKLSIMLCAMLFGPSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLHWDGVIPPD 115
           M+L+KLSI+LCAMLFGPSLS AVSP NIFIL GQSNMAGRGGVEKN++G   WDGVIPPD
Sbjct: 4   MALVKLSILLCAMLFGPSLSGAVSPKNIFILGGQSNMAGRGGVEKNSSGKFEWDGVIPPD 63

Query: 116 SEPTPCILRLNAARQWEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVP 175
            +P P ILRLNAARQWE AREPL++DIDV K NGISPGMGFAHE+L KAGPRAGVVGLVP
Sbjct: 64  CKPNPSILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVP 123

Query: 176 TAIGGTVIRQWMKNTTDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAIN 235
           TAIGGT IRQW+KN + PNATYYQ+LVERI+ASDK+GGVVRALLWFQGESDAAVK+ AIN
Sbjct: 124 TAIGGTFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAIN 183

Query: 236 YKDNLKTLINDLRNDLKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVS 295
           YKDNLKT I DLR D++PRFLPVI+VKIA+YDF   N TDNLS VRAAQEAVS EVPDVS
Sbjct: 184 YKDNLKTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVS 243

Query: 296 IIDSWKLPMNLTTREGFNLDRGHFNSTVLLTAGRWLADTYLSRYSQLL 344
           IIDSWKLPMNL TREGFNLDRGHFN+T+ LTAG+WLAD YLSRYS+LL
Sbjct: 244 IIDSWKLPMNLKTREGFNLDRGHFNTTIELTAGKWLADAYLSRYSRLL 291

BLAST of CSPI02G18840 vs. TAIR 10
Match: AT4G34215.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 181.0 bits (458), Expect = 1.6e-45
Identity = 106/269 (39.41%), Positives = 151/269 (56.13%), Query Frame = 0

Query: 72  PSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLH-WDGVIPPDSEPTPCILRLNAARQ 131
           P +   + PN IFILSGQSNMAGRGGV K+   N   WD ++PP+  P   ILRL+A  +
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72

Query: 132 WEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNT 191
           WEEA EPL+ DID  K  G+ PGM FA+ +  +    + V+GLVP A GGT I++W    
Sbjct: 73  WEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEW---- 132

Query: 192 TDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRND 251
            +  +  Y+ +V+R + S K GG ++A+LW+QGESD      A +Y +N+  LI +LR+D
Sbjct: 133 -ERGSHLYERMVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHD 192

Query: 252 LKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTRE 311
           L    LP+I V IA       +G   +  VR AQ  +  ++ +V  +D+  LP+      
Sbjct: 193 LNLPSLPIIQVAIA-------SGGGYIDKVREAQ--LGLKLSNVVCVDAKGLPL------ 252

Query: 312 GFNLDRGHFNSTVLLTAGRWLADTYLSRY 340
               D  H  +   +  G  LA  YLS +
Sbjct: 253 --KSDNLHLTTEAQVQLGLSLAQAYLSNF 259

BLAST of CSPI02G18840 vs. TAIR 10
Match: AT4G34215.2 (Domain of unknown function (DUF303) )

HSP 1 Score: 181.0 bits (458), Expect = 1.6e-45
Identity = 106/269 (39.41%), Positives = 151/269 (56.13%), Query Frame = 0

Query: 72  PSLSRAVSPNNIFILSGQSNMAGRGGVEKNATGNLH-WDGVIPPDSEPTPCILRLNAARQ 131
           P +   + PN IFILSGQSNMAGRGGV K+   N   WD ++PP+  P   ILRL+A  +
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72

Query: 132 WEEAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNT 191
           WEEA EPL+ DID  K  G+ PGM FA+ +  +    + V+GLVP A GGT I++W    
Sbjct: 73  WEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEW---- 132

Query: 192 TDPNATYYQHLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRND 251
            +  +  Y+ +V+R + S K GG ++A+LW+QGESD      A +Y +N+  LI +LR+D
Sbjct: 133 -ERGSHLYERMVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHD 192

Query: 252 LKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTRE 311
           L    LP+I V IA       +G   +  VR AQ  +  ++ +V  +D+  LP+      
Sbjct: 193 LNLPSLPIIQVAIA-------SGGGYIDKVREAQ--LGLKLSNVVCVDAKGLPL------ 252

Query: 312 GFNLDRGHFNSTVLLTAGRWLADTYLSRY 340
               D  H  +   +  G  LA  YLS +
Sbjct: 253 --KSDNLHLTTEAQVQLGLSLAQAYLSNF 259

BLAST of CSPI02G18840 vs. TAIR 10
Match: AT3G53010.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 172.9 bits (437), Expect = 4.3e-43
Identity = 107/267 (40.07%), Positives = 149/267 (55.81%), Query Frame = 0

Query: 75  SRAVSPN-NIFILSGQSNMAGRGGV-EKNATGNLHWDGVIPPDSEPTPCILRLNAARQWE 134
           S+ ++ N +IFIL+GQSNMAGRGGV    AT    WDGVIPP+    P ILRL +  +W+
Sbjct: 22  SQTITRNISIFILAGQSNMAGRGGVYNDTATNTTVWDGVIPPECRSNPSILRLTSKLEWK 81

Query: 135 EAREPLNFDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTD 194
           EA+EPL+ DID+ K NG+ PGM FA+ ++     R G VGLVP +IGGT + QW K    
Sbjct: 82  EAKEPLHVDIDINKTNGVGPGMPFANRVVN----RFGQVGLVPCSIGGTKLSQWQK---- 141

Query: 195 PNATYYQHLVERIKA--SDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRND 254
                Y+  V+R KA  +   GG  RA+LW+QGESD      A  YK  L    +DLRND
Sbjct: 142 -GEFLYEETVKRAKAAMASGGGGSYRAVLWYQGESDTVDMVDASVYKKRLVKFFSDLRND 201

Query: 255 LKPRFLPVILVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPMNLTTRE 314
           L+   LP+I V +      A      L  VR AQ  +  ++ +V  +D+  LP+      
Sbjct: 202 LQHPNLPIIQVAL------ATGAGPYLDAVRKAQ--LKTDLENVYCVDARGLPL------ 261

Query: 315 GFNLDRGHFNSTVLLTAGRWLADTYLS 338
               D  H  ++  +  G  +A+++L+
Sbjct: 262 --EPDGLHLTTSSQVQLGHMIAESFLA 263

The following BLAST results are available for this feature:

BLAST of CSPI02G18840 vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q8L9J9          2.2e-44   39.41         Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g...

BLAST of CSPI02G18840 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5A7V4V5      6.0e-133  81.94         Putative carbohydrate esterase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
E5GB85          6.0e-133  81.94         SASA domain-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A1S4DVE0      6.0e-133  81.94         probable carbohydrate esterase At4g34215 OS=Cucumis melo OX=3656 GN=LOC103487864...
A0A0A0LNC5      3.2e-110  67.36         SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G356040 PE=4 S...
A0A0A0LKQ6      5.1e-108  100.00        SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G356050 PE=4 S...

BLAST of CSPI02G18840 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KAE8652072.1    7.3e-194  99.13         hypothetical protein Csa_018570 [Cucumis sativus]
XP_011650181.1  1.1e-162  100.00        probable carbohydrate esterase At4g34215 [Cucumis sativus]
ADN33727.1      1.2e-132  81.94         hypothetical protein [Cucumis melo subsp. melo]
XP_016899952.1  1.2e-132  81.94         PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo]
KAA0060925.1    1.2e-132  81.94         putative carbohydrate esterase [Cucumis melo var. makuwa]

BLAST of CSPI02G18840 vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT4G34215.1     1.6e-45   39.41         Domain of unknown function (DUF303)
AT4G34215.2     1.6e-45   39.41         Domain of unknown function (DUF303)
AT3G53010.1     4.3e-43   40.07         Domain of unknown function (DUF303)

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25

IPR Term   IPR Description                  Source       Source Term     Source Description                   Alignment
IPR005181  Sialate O-acetylesterase domain  PFAM         PF03629         SASA                                 coord: 81..336; e-value: 1.8E-65; score: 220.7
IPR036514  SGNH hydrolase superfamily       GENE3D       3.40.50.1110    SGNH hydrolase                       coord: 79..340; e-value: 1.1E-56; score: 194.4
None       No IPR available                 PANTHER      PTHR31988:SF19  BNACNNG62850D PROTEIN                coord: 78..339
None       No IPR available                 PANTHER      PTHR31988       ESTERASE, PUTATIVE (DUF303)-RELATED  coord: 78..339
None       No IPR available                 SUPERFAMILY  52266           SGNH hydrolase                       coord: 82..339

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G18840.1  CSPI02G18840.1  mRNA