HG10014165 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10014165
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SASA domain-containing protein
Location: Chr02: 8128888 .. 8130679 (-)
RNA-Seq Expression: HG10014165
Synteny: HG10014165
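The location string can be turned into a genomic span programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for pages of this kind, not stated here); `Location`, `parse_location`, and `span_length` are hypothetical helper names.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "Chr02: 8128888 .. 8130679 (-)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text: str) -> Location:
    """Parse a 'chrom: start .. end (strand)' string into its parts."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Chr02: 8128888 .. 8130679 (-)")
print(loc, span_length(loc))
```

Under that assumption the gene spans 1792 bp on the minus strand of Chr02.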
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTGAGAAAAAGGTATGGTATTTATTTGTTTGTAAAAGAGAGCTTTGAGGATCCCACTGCTGCGTAAGCTAAGGCTGAAGACAGTATGTCATGTTGTTGATATTATCCTTTCCTCGTAACCAACGGTCCCATCTGTTTAATTTTGCTACGTCTTTCCTTTTTTAATTTAAGCCTGCGTTTCCATAATTTTTTACAGTAATTAGAAACATTTGTTTTTTCTCCCCATAATATTTGAGGATTCTAATTTATTTTTTTAACAAAGTGATGCACATACACATTTCTTAATTGCTCTTTAACAGGTCGATTTAAAAAGTTTTATAAATGGGTGTGAAATTGCATTGTCCATTCCATGCGCCAGTTTTTTAAAAGAGGACATAATCACGTTTTTTTTTTTAATCAAATAATTAATATAATAATAAAAAAAAAAAATTTATTTTGAAATTCAAACAGAGCAAATAGGCCAAGTGGAATTTAAATAGTCGATGGACTTTGGTTAGGAATTTTTTTAAGCAGACTTGAGTTGAGAGGGACTAAATTGCCACATTAATTAGTGGGGAAATGGGATAATAAGACGGCAACTAAAGATAAAACTTAAAAGAAAAAGAAAACTTTTGTTTTGAAATTATATATTCTTTCAGTTTCCGCCGGGTAGATTCAGCCACACTCCAAACCAAAATGCTCTACTTGTGCCTATTATTTCTCACGGCCACTCAGATTCCGGCCACCTCTCAACAACCATCACCACCCACCGCCATTTTCCTCCTTGCCGGACAGAGCAACATGGCCGGAAGAGGCGGTGTCACAAACAGCACCGTCACCCACCGCCCCACGTGGGACGGCGTTGTGCCTCCCCAATGCTCACCAACCCCTTCCATCCTCCGCCTTGCCGCGGATCTCACTTGGGTCGAAGCTCGCGAGCCACTCCACGCGGACATTGATTTTCTCAAGACCAACGGGATTGGGCCGGGCATGCCCTTTGCCCACGACATTCTCATGGATAAACCGGGCGGTCAGATGGTGATCGGTCTGGTCCCATGCGCAATCGGCGGGACTTCGATCAAAGAGTGGCAACAGGGATCCAATCTGTACAACCATTTATTGAGCAGAGCCGAAGCATCGGTACTCAGCAGAGGGAAAATTAAAGCGCTTCTGTGGTATCAGGGTGAAAGCGATACTCAGAATGCAGAAGATTCTGAGCTGTACGGTGGCAGATTGAAGAAGTTCTTCACTGACATTCGCTCGGATTTAAAGATTCCATTGCTCCCAATTATTCAGGTTTTTTTGTATTCTAATTTGACCCCTTTCTATTTTAAAATCATTACTTGCGAGACCATTCGAGCAGATTAGGTAATTTGATTAAAATAATTACATAGAAAAATCCAAGAGAAAAGAAATAGCAAGTAATGAATGGGATAGTCGAGTTTGAGTTTGATAAAACGTCAATTTTTTTTTAATATTTTTTTGGTAGGTTGGTATTGCGTCAGGAGAAGGGCCGTATAAAGAAGGAGTAAGAAGGGGGCAATTTGGAATGGATTTAGTGAACGTGATGAGTGTGGACGCATTGGGCCTTTCATTGGAACCAGATGGGCTTCACTTAAACACTCCTTCCCAAGTTCGACTGGGTGGGCTTTTAGCCGATGCGTATCGACGATTTCCATCTCACCCACTGGCTACCCCATTAACAAACGCTGCTCCATTGCCTATAATTGCAACTTACTTATACTTGCTTTCCATTTCTACGATTCTCACATTTCTGTTGCTATTTCTACAACTATTTCTCTTATTATGA

mRNA sequence

ATGATTGAGAAAAAGGTTGGTATTGCGTCAGGAGAAGGGCCGTATAAAGAAGGAGTAAGAAGGGGGCAATTTGGAATGGATTTAGTGAACGTGATGAGTGTGGACGCATTGGGCCTTTCATTGGAACCAGATGGGCTTCACTTAAACACTCCTTCCCAAGTTCGACTGGGTGGGCTTTTAGCCGATGCGTATCGACGATTTCCATCTCACCCACTGGCTACCCCATTAACAAACGCTGCTCCATTGCCTATAATTGCAACTTACTTATACTTGCTTTCCATTTCTACGATTCTCACATTTCTGTTGCTATTTCTACAACTATTTCTCTTATTATGA

Coding sequence (CDS)

ATGATTGAGAAAAAGGTTGGTATTGCGTCAGGAGAAGGGCCGTATAAAGAAGGAGTAAGAAGGGGGCAATTTGGAATGGATTTAGTGAACGTGATGAGTGTGGACGCATTGGGCCTTTCATTGGAACCAGATGGGCTTCACTTAAACACTCCTTCCCAAGTTCGACTGGGTGGGCTTTTAGCCGATGCGTATCGACGATTTCCATCTCACCCACTGGCTACCCCATTAACAAACGCTGCTCCATTGCCTATAATTGCAACTTACTTATACTTGCTTTCCATTTCTACGATTCTCACATTTCTGTTGCTATTTCTACAACTATTTCTCTTATTATGA

Protein sequence

MIEKKVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLLL
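The relationship between the CDS and the protein sequence can be checked by translation. This is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 24 and last 18 nucleotides of the CDS above are used, so the example stays short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Codons with unexpected characters become 'X'; a trailing partial
    codon is ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGATTGAGAAAAAGGTTGGTATT"  # first 24 nt of the CDS
cds_end = "CTATTTCTCTTATTATGA"          # last 18 nt, including the stop codon
print(translate(cds_start))  # MIEKKVGI
print(translate(cds_end))    # LFLLL
```

Both fragments agree with the start and end of the protein sequence listed above.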
Homology
BLAST of HG10014165 vs. NCBI nr
Match: XP_038899610.1 (probable carbohydrate esterase At4g34215 [Benincasa hispida])

HSP 1 Score: 159.1 bits (401), Expect = 2.2e-35
Identity = 85/106 (80.19%), Positives = 98/106 (92.45%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIA+GEGPYKEGVRRGQFG+DLVNVM+VDA+GLSLEPDGLHL TPSQV+LGGLLADAY
Sbjct: 200 QVGIATGEGPYKEGVRRGQFGIDLVNVMTVDAMGLSLEPDGLHLTTPSQVQLGGLLADAY 259

Query: 65  RRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLL 111
           RRFPSHPLATPLTNAA +P+I+T  + LSIS ILT +LLFL+L L+
Sbjct: 260 RRFPSHPLATPLTNAAHIPMIST--FFLSISRILTVVLLFLRLILM 303
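The percentages in an HSP statistics line are the matched counts divided by the alignment length. The sketch below recomputes them from the raw fractions; `hsp_percentages` is a hypothetical helper, and the regular expression assumes the line layout shown in this report.

```python
import re

# Accepts "Positives" as well as the "Postives" spelling seen in some reports.
STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Recompute identity and positives percentages from their fractions."""
    result = {}
    for label, matched, length in STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


line = "Identity = 85/106 (80.19%), Positives = 98/106 (92.45%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 80.19, 'positives': 92.45}
```

Here 85 identical and 98 similar positions out of 106 aligned columns give 80.19% and 92.45%, matching the reported values.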

BLAST of HG10014165 vs. NCBI nr
Match: XP_004141392.1 (probable carbohydrate esterase At4g34215 [Cucumis sativus] >KGN55236.1 hypothetical protein Csa_012338 [Cucumis sativus])

HSP 1 Score: 145.2 bits (365), Expect = 3.4e-31
Identity = 72/86 (83.72%), Positives = 77/86 (89.53%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL T SQVRLGGLLADAY
Sbjct: 201 QVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLHLTTTSQVRLGGLLADAY 260

Query: 65  RRFPSHPLATPLTNAAPLPIIATYLY 91
           RRFPSHPLATPLTNAAP+  I+T  +
Sbjct: 261 RRFPSHPLATPLTNAAPISRISTIFF 286

BLAST of HG10014165 vs. NCBI nr
Match: XP_022140685.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 141.0 bits (354), Expect = 6.3e-30
Identity = 66/77 (85.71%), Positives = 74/77 (96.10%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASG+GPYKEGVRRGQFG++L NVM+VDALGL LEPDGLHLNTP+QV+LGGLLADAY
Sbjct: 26  QVGIASGDGPYKEGVRRGQFGIELRNVMTVDALGLPLEPDGLHLNTPAQVKLGGLLADAY 85

Query: 65  RRFPSHPLATPLTNAAP 82
           RRFPSHPLA+PL NAAP
Sbjct: 86  RRFPSHPLASPLRNAAP 102

BLAST of HG10014165 vs. NCBI nr
Match: XP_008452605.1 (PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo] >KAA0064418.1 putative carbohydrate esterase [Cucumis melo var. makuwa] >TYK20169.1 putative carbohydrate esterase [Cucumis melo var. makuwa])

HSP 1 Score: 140.6 bits (353), Expect = 8.3e-30
Identity = 81/107 (75.70%), Positives = 88/107 (82.24%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL TPSQVRLGGLLA AY
Sbjct: 202 QVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLHLTTPSQVRLGGLLAHAY 261

Query: 65  RRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLLL 112
           RRFPSHPLATPLTNAAP+  I +  +LLSI  I TFL  F+  +  L
Sbjct: 262 RRFPSHPLATPLTNAAPISTIIS-TFLLSIWWIFTFLFPFVVNYFYL 307

BLAST of HG10014165 vs. NCBI nr
Match: XP_022982273.1 (probable carbohydrate esterase At4g34215 [Cucurbita maxima])

HSP 1 Score: 126.3 bits (316), Expect = 1.6e-25
Identity = 69/102 (67.65%), Positives = 84/102 (82.35%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVD--ALGLSLEPDGLHLNTPSQVRLGGLLAD 64
           +VGIASGEGPYKEGVRRGQFG++++NVM+VD  ALGLS EPDGLHLNTPSQV+LGG+LAD
Sbjct: 200 QVGIASGEGPYKEGVRRGQFGIEVMNVMTVDAYALGLSFEPDGLHLNTPSQVKLGGVLAD 259

Query: 65  AYRRFPSHPL-ATPLTNAAPLPIIATYLYLLSISTILTFLLL 104
           AYRRFP HPL A+PL NAA     + Y + +S+   +TF+ L
Sbjct: 260 AYRRFPPHPLAASPLRNAA--STASVYSFCISMFRTMTFVFL 299

BLAST of HG10014165 vs. ExPASy Swiss-Prot
Match: Q8L9J9 (Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g34215 PE=1 SV=2)

HSP 1 Score: 58.5 bits (140), Expect = 5.4e-08
Identity = 33/60 (55.00%), Positives = 40/60 (66.67%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY
Sbjct: 197 QVAIASG-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQLGLSLAQAY 255

BLAST of HG10014165 vs. ExPASy TrEMBL
Match: A0A0A0L4Z7 (SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G641680 PE=4 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 1.6e-31
Identity = 72/86 (83.72%), Positives = 77/86 (89.53%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL T SQVRLGGLLADAY
Sbjct: 201 QVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLHLTTTSQVRLGGLLADAY 260

Query: 65  RRFPSHPLATPLTNAAPLPIIATYLY 91
           RRFPSHPLATPLTNAAP+  I+T  +
Sbjct: 261 RRFPSHPLATPLTNAAPISRISTIFF 286

BLAST of HG10014165 vs. ExPASy TrEMBL
Match: A0A6J1CFT3 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111011284 PE=4 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 3.1e-30
Identity = 66/77 (85.71%), Positives = 74/77 (96.10%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASG+GPYKEGVRRGQFG++L NVM+VDALGL LEPDGLHLNTP+QV+LGGLLADAY
Sbjct: 26  QVGIASGDGPYKEGVRRGQFGIELRNVMTVDALGLPLEPDGLHLNTPAQVKLGGLLADAY 85

Query: 65  RRFPSHPLATPLTNAAP 82
           RRFPSHPLA+PL NAAP
Sbjct: 86  RRFPSHPLASPLRNAAP 102

BLAST of HG10014165 vs. ExPASy TrEMBL
Match: A0A5A7VA07 (Putative carbohydrate esterase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G003010 PE=4 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 4.0e-30
Identity = 81/107 (75.70%), Positives = 88/107 (82.24%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL TPSQVRLGGLLA AY
Sbjct: 202 QVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLHLTTPSQVRLGGLLAHAY 261

Query: 65  RRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLLL 112
           RRFPSHPLATPLTNAAP+  I +  +LLSI  I TFL  F+  +  L
Sbjct: 262 RRFPSHPLATPLTNAAPISTIIS-TFLLSIWWIFTFLFPFVVNYFYL 307

BLAST of HG10014165 vs. ExPASy TrEMBL
Match: A0A1S3BVE3 (probable carbohydrate esterase At4g34215 OS=Cucumis melo OX=3656 GN=LOC103493577 PE=4 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 4.0e-30
Identity = 81/107 (75.70%), Positives = 88/107 (82.24%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +VGIASGEG YKEGVRRGQFG+DLVNVM VDALGL LEPDGLHL TPSQVRLGGLLA AY
Sbjct: 202 QVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLHLTTPSQVRLGGLLAHAY 261

Query: 65  RRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLLL 112
           RRFPSHPLATPLTNAAP+  I +  +LLSI  I TFL  F+  +  L
Sbjct: 262 RRFPSHPLATPLTNAAPISTIIS-TFLLSIWWIFTFLFPFVVNYFYL 307

BLAST of HG10014165 vs. ExPASy TrEMBL
Match: A0A6J1J4F1 (probable carbohydrate esterase At4g34215 OS=Cucurbita maxima OX=3661 GN=LOC111481152 PE=4 SV=1)

HSP 1 Score: 126.3 bits (316), Expect = 7.8e-26
Identity = 69/102 (67.65%), Positives = 84/102 (82.35%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVD--ALGLSLEPDGLHLNTPSQVRLGGLLAD 64
           +VGIASGEGPYKEGVRRGQFG++++NVM+VD  ALGLS EPDGLHLNTPSQV+LGG+LAD
Sbjct: 200 QVGIASGEGPYKEGVRRGQFGIEVMNVMTVDAYALGLSFEPDGLHLNTPSQVKLGGVLAD 259

Query: 65  AYRRFPSHPL-ATPLTNAAPLPIIATYLYLLSISTILTFLLL 104
           AYRRFP HPL A+PL NAA     + Y + +S+   +TF+ L
Sbjct: 260 AYRRFPPHPLAASPLRNAA--STASVYSFCISMFRTMTFVFL 299

BLAST of HG10014165 vs. TAIR 10
Match: AT3G53010.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 73.9 bits (180), Expect = 8.8e-14
Identity = 34/65 (52.31%), Positives = 46/65 (70.77%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +V +A+G GPY + VR+ Q   DL NV  VDA GL LEPDGLHL T SQV+LG ++A+++
Sbjct: 202 QVALATGAGPYLDAVRKAQLKTDLENVYCVDARGLPLEPDGLHLTTSSQVQLGHMIAESF 261

Query: 65  RRFPS 70
              P+
Sbjct: 262 LAIPN 266

BLAST of HG10014165 vs. TAIR 10
Match: AT4G34215.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 58.5 bits (140), Expect = 3.8e-09
Identity = 33/60 (55.00%), Positives = 40/60 (66.67%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY
Sbjct: 197 QVAIASG-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQLGLSLAQAY 255

BLAST of HG10014165 vs. TAIR 10
Match: AT4G34215.2 (Domain of unknown function (DUF303) )

HSP 1 Score: 58.5 bits (140), Expect = 3.8e-09
Identity = 33/60 (55.00%), Positives = 40/60 (66.67%), Query Frame = 0

Query: 5   KVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLLADAY 64
           +V IASG G Y + VR  Q G+ L NV+ VDA GL L+ D LHL T +QV+LG  LA AY
Sbjct: 197 QVAIASG-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQLGLSLAQAY 255

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038899610.1 | 2.2e-35 | 80.19 | probable carbohydrate esterase At4g34215 [Benincasa hispida]
XP_004141392.1 | 3.4e-31 | 83.72 | probable carbohydrate esterase At4g34215 [Cucumis sativus] >KGN55236.1 hypotheti...
XP_022140685.1 | 6.3e-30 | 85.71 | probable carbohydrate esterase At4g34215 [Momordica charantia]
XP_008452605.1 | 8.3e-30 | 75.70 | PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo] >KAA0064418.1...
XP_022982273.1 | 1.6e-25 | 67.65 | probable carbohydrate esterase At4g34215 [Cucurbita maxima]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q8L9J9 | 5.4e-08 | 55.00 | Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0L4Z7 | 1.6e-31 | 83.72 | SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G641680 PE=4 S...
A0A6J1CFT3 | 3.1e-30 | 85.71 | probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A5A7VA07 | 4.0e-30 | 75.70 | Putative carbohydrate esterase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S3BVE3 | 4.0e-30 | 75.70 | probable carbohydrate esterase At4g34215 OS=Cucumis melo OX=3656 GN=LOC103493577...
A0A6J1J4F1 | 7.8e-26 | 67.65 | probable carbohydrate esterase At4g34215 OS=Cucurbita maxima OX=3661 GN=LOC11148...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G53010.1 | 8.8e-14 | 52.31 | Domain of unknown function (DUF303)
AT4G34215.1 | 3.8e-09 | 55.00 | Domain of unknown function (DUF303)
AT4G34215.2 | 3.8e-09 | 55.00 | Domain of unknown function (DUF303)

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR005181 | Sialate O-acetylesterase domain | PFAM | PF03629 | SASA | coord: 6..64; e-value: 1.9E-8; score: 34.2
None | No IPR available | PANTHER | PTHR31988:SF24 | SUBFAMILY NOT NAMED | coord: 5..68
None | No IPR available | PANTHER | PTHR31988 | ESTERASE, PUTATIVE (DUF303)-RELATED | coord: 5..68
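The alignment coordinates can be mapped back onto the protein sequence to see which residues an annotation covers. This is a minimal sketch, assuming the coordinates are 1-based and inclusive; `slice_1based` is a hypothetical helper, and the protein string is the one listed under "Protein sequence".

```python
PROTEIN = (
    "MIEKKVGIASGEGPYKEGVRRGQFGMDLVNVMSVDALGLSLEPDGLHLNTPSQVRLGGLL"
    "ADAYRRFPSHPLATPLTNAAPLPIIATYLYLLSISTILTFLLLFLQLFLLL"
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Pfam PF03629 (SASA) hit reported at coord 6..64.
sasa = slice_1based(PROTEIN, 6, 64)
print(len(sasa), sasa)
print(f"fraction of protein covered: {len(sasa) / len(PROTEIN):.2f}")
```

Under that assumption the SASA hit covers 59 residues, starting at the valine that follows MIEKK.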

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10014165.1 | HG10014165.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane