Sed0003732 (gene) Chayote v1

Overview
NameSed0003732
Typegene
OrganismSechium edule (Chayote v1)
Descriptionhydroxyproline-rich glycoprotein family protein
LocationLG06: 11380451 .. 11381750 (-)
RNA-Seq ExpressionSed0003732
SyntenySed0003732
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATCCATTCCATCATTTTGAAAAACCTCATCCATTCCAATGGAGATTGTGAGTGAACCCCCCACCGTTCCCAAAGCCCCAACTCAGCCCGAAACTCGGGCGATCCCGGAGCTTGAAACGCCCAACAATGGAAGCGCAGATCGAGAGAAGGAGGAGGAAGGAGGCCAAAACGACATGGAAAGTTCAAAATGTGAGTGTAAAACGGCAACCCCAGATGAGAAAAAACAGAGTGCAGTTACAGAAATAGAAACAGGGCAAAGGGTCAAATTTAGAGGCATTGAATTTCCTGTAAATGGAACACCCAATCGCCTTAAACTCCCCAAAGCATTCAAATACCCTGAAAGGTAATTCCTTATGCCTCGTTTGATGATCATTTAAGTCCGTTTCGTAACCATTTGTCTTTCTGTTTTTATTTTTTAAACTTAAACTTATTTTTATTCCAATTTCCTTCCTTCCATATGTATTCCAAAAAATTATGTAATGCTAATCAAATTTTAAAAACAGTTTTTAAAACGCTTTTTTTTCAATGGGATAAAATAGTTATTGCTGGCTTAAATTTCAAAAACAAAAAATTAAGACGAAAATGGAATTTGAGTAAATAGAAGGTTAAATTAAAAAACCTAAAAACAAAAAATAATTAAAATCTGCATTTGAAAGAGGTGGATTTGTTTGTAGGTATTCAAGTCCTACAGATTTGATGATGTCTCCTATCAGCAAAGGCCTTCTTGCCAGAACCAGGAAAGGGGCTCTCCCTTCTAAGGTTATAATCCTATAAACTTGCTTTATTTTGTAGGATGAATTTTCTAAACAAGTTTCAAAGATTTTGAATCTTTTGTGTTTTTTTGGTTTTGTTGATGTGCAGATGCATGAGTTGAGAATTACAGAGATGAGTCTCCAAAGCTGATGTCATGGATTCTCTTTTCTTTTTCCTCTTCAACAATTTGTTTCACTCACATTGTGATGCAATCTCATGCCTTTTTATTATTTTCATTATTTGTGATTATTTCTTTCAATATGTTGTTGTTGGGGTGAGTCAGGTCAATATATTGTGGGGTAAGTTTTTTTCTCAAGTGCTTTTAACAGCAACCAGTTTGTTAACTAACCATGTAACAGTTACTAGCAGCAGATGGTTTTACCAACTTCACAACTTCTTTGGGAATTTGAATTACATAAAGGAGGAAACATCTTGCTATATAGACAATATCTCATATAAATATAAAGTTATGTGGATGTTTGAATGTTTTTTTTATATATATCTGTGACTGTTCGGGCTAACTTGCGCACACCTAAAACATATACCG

mRNA sequence

AATCCATTCCATCATTTTGAAAAACCTCATCCATTCCAATGGAGATTGTGAGTGAACCCCCCACCGTTCCCAAAGCCCCAACTCAGCCCGAAACTCGGGCGATCCCGGAGCTTGAAACGCCCAACAATGGAAGCGCAGATCGAGAGAAGGAGGAGGAAGGAGGCCAAAACGACATGGAAAGTTCAAAATGTGAGTGTAAAACGGCAACCCCAGATGAGAAAAAACAGAGTGCAGTTACAGAAATAGAAACAGGGCAAAGGGTCAAATTTAGAGGCATTGAATTTCCTGTAAATGGAACACCCAATCGCCTTAAACTCCCCAAAGCATTCAAATACCCTGAAAGGTATTCAAGTCCTACAGATTTGATGATGTCTCCTATCAGCAAAGGCCTTCTTGCCAGAACCAGGAAAGGGGCTCTCCCTTCTAAGATGCATGAGTTGAGAATTACAGAGATGAGTCTCCAAAGCTGATGTCATGGATTCTCTTTTCTTTTTCCTCTTCAACAATTTGTTTCACTCACATTGTGATGCAATCTCATGCCTTTTTATTATTTTCATTATTTGTGATTATTTCTTTCAATATGTTGTTGTTGGGGTGAGTCAGGTCAATATATTGTGGGGTAAGTTTTTTTCTCAAGTGCTTTTAACAGCAACCAGTTTGTTAACTAACCATGTAACAGTTACTAGCAGCAGATGGTTTTACCAACTTCACAACTTCTTTGGGAATTTGAATTACATAAAGGAGGAAACATCTTGCTATATAGACAATATCTCATATAAATATAAAGTTATGTGGATGTTTGAATGTTTTTTTTATATATATCTGTGACTGTTCGGGCTAACTTGCGCACACCTAAAACATATACCG

Coding sequence (CDS)

ATGGAGATTGTGAGTGAACCCCCCACCGTTCCCAAAGCCCCAACTCAGCCCGAAACTCGGGCGATCCCGGAGCTTGAAACGCCCAACAATGGAAGCGCAGATCGAGAGAAGGAGGAGGAAGGAGGCCAAAACGACATGGAAAGTTCAAAATGTGAGTGTAAAACGGCAACCCCAGATGAGAAAAAACAGAGTGCAGTTACAGAAATAGAAACAGGGCAAAGGGTCAAATTTAGAGGCATTGAATTTCCTGTAAATGGAACACCCAATCGCCTTAAACTCCCCAAAGCATTCAAATACCCTGAAAGGTATTCAAGTCCTACAGATTTGATGATGTCTCCTATCAGCAAAGGCCTTCTTGCCAGAACCAGGAAAGGGGCTCTCCCTTCTAAGATGCATGAGTTGAGAATTACAGAGATGAGTCTCCAAAGCTGA

Protein sequence

MEIVSEPPTVPKAPTQPETRAIPELETPNNGSADREKEEEGGQNDMESSKCECKTATPDEKKQSAVTEIETGQRVKFRGIEFPVNGTPNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS
Homology
BLAST of Sed0003732 vs. NCBI nr
Match: XP_022998048.1 (uncharacterized protein LOC111492813 [Cucurbita maxima])

HSP 1 Score: 159.8 bits (403), Expect = 1.7e-35
Identity = 100/176 (56.82%), Postives = 114/176 (64.77%), Query Frame = 0

Query: 1   MEIVSEPPTVPKAPTQP---------------ETRAIPELETPNNGSADREKEEEGGQND 60
           MEI +E P +P  PTQ                ET A PELETP NG+ DREK+ +  QN 
Sbjct: 1   MEIATE-PAIPTTPTQSKPVDSEQQSETSGAGETTATPELETPINGTPDREKKGK-AQNQ 60

Query: 61  MESSKCECKTATPDEK---KQSAVTEIE---------------TGQRVKFRGIEFPVNGT 120
           MESSKCECKT TPDEK   KQ  +  +                 G+  +  GI FP NGT
Sbjct: 61  MESSKCECKTPTPDEKTEEKQRILVPVNGLMMDRLKVPKSPNVAGEVKQRGGIAFPKNGT 120

Query: 121 PNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS 144
           PNRLK+PKAFKY ERY+SPTDLMMSPI+KGLLARTRKGA+PSKMHELRI+EMSL S
Sbjct: 121 PNRLKVPKAFKYHERYTSPTDLMMSPITKGLLARTRKGAVPSKMHELRISEMSLHS 174

BLAST of Sed0003732 vs. NCBI nr
Match: KAG6607098.1 (hypothetical protein SDJN03_00440, partial [Cucurbita argyrosperma subsp. sororia] >KAG7036787.1 hypothetical protein SDJN02_00407 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 157.5 bits (397), Expect = 8.4e-35
Identity = 99/176 (56.25%), Postives = 114/176 (64.77%), Query Frame = 0

Query: 1   MEIVSEPPTVPKAPTQP---------------ETRAIPELETPNNGSADREKEEEGGQND 60
           MEI +E PT+P   TQ                ET A PELETP NG+ DREK+ + GQ+ 
Sbjct: 1   MEIATE-PTIPTTSTQSKPVDSEQQSETSGAGETTATPELETPINGTPDREKKGK-GQDQ 60

Query: 61  MESSKCECKTATPDEK---KQSAVTEIE---------------TGQRVKFRGIEFPVNGT 120
           MESSKCECKT TPDEK   KQ  +  +                 G+  +  GI  P NGT
Sbjct: 61  MESSKCECKTPTPDEKTEEKQRILVPVNGLMMDRLKVPKSPNVAGEAKQRGGIALPKNGT 120

Query: 121 PNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS 144
           PNRLK+PKAFKY ERY+SPTDLMMSPI+KGLLARTRKGA+PSKMHELRI+EMSL S
Sbjct: 121 PNRLKVPKAFKYHERYTSPTDLMMSPITKGLLARTRKGAVPSKMHELRISEMSLHS 174

BLAST of Sed0003732 vs. NCBI nr
Match: XP_022948389.1 (uncharacterized protein LOC111452080 [Cucurbita moschata])

HSP 1 Score: 157.1 bits (396), Expect = 1.1e-34
Identity = 99/176 (56.25%), Postives = 114/176 (64.77%), Query Frame = 0

Query: 1   MEIVSEPPTVPKAPTQP---------------ETRAIPELETPNNGSADREKEEEGGQND 60
           MEI +E PT+P  PTQ                ET A  ELETP NG+ DREK+ + GQ+ 
Sbjct: 1   MEIATE-PTIPTTPTQSKPVDSEQQSETSGAGETTATRELETPINGTPDREKKGK-GQDQ 60

Query: 61  MESSKCECKTATPDEK---KQSAVTEIE---------------TGQRVKFRGIEFPVNGT 120
           MESSKCECKT TPDEK   KQ  +  +                 G+  +  GI  P NGT
Sbjct: 61  MESSKCECKTPTPDEKTEEKQRILVPVNGLMMDRLKVPKSPNVAGEAKQRGGIALPKNGT 120

Query: 121 PNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS 144
           PNRLK+PKAFKY ERY+SPTDLMMSPI+KGLLARTRKGA+PSKMHELRI+EMSL S
Sbjct: 121 PNRLKVPKAFKYHERYTSPTDLMMSPITKGLLARTRKGAVPSKMHELRISEMSLHS 174

BLAST of Sed0003732 vs. NCBI nr
Match: XP_023525350.1 (uncharacterized protein LOC111788976 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 156.8 bits (395), Expect = 1.4e-34
Identity = 98/176 (55.68%), Postives = 114/176 (64.77%), Query Frame = 0

Query: 1   MEIVSEPPTVPKAPTQP---------------ETRAIPELETPNNGSADREKEEEGGQND 60
           MEI +E PT+P  PTQ                ET A PELETP +G+ DREK+ + G+N 
Sbjct: 1   MEIATE-PTIPTTPTQSKPVDSEQQSETSGAGETTATPELETPIDGTPDREKKGK-GENQ 60

Query: 61  MESSKCECKTATPDEK---KQSAVTEIE---------------TGQRVKFRGIEFPVNGT 120
            ESSKCECKT TPDEK   KQ  +  +                 G+  +  GI  P NGT
Sbjct: 61  KESSKCECKTPTPDEKTEEKQRILVPVNGLMMDRLKVPKSPNVAGEAKQRGGIALPRNGT 120

Query: 121 PNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS 144
           PNRLK+PKAFKY ERY+SPTDLMMSPI+KGLLARTRKGA+PSKMHELRI+EMSL S
Sbjct: 121 PNRLKVPKAFKYHERYTSPTDLMMSPITKGLLARTRKGAVPSKMHELRISEMSLHS 174

BLAST of Sed0003732 vs. NCBI nr
Match: XP_022155023.1 (uncharacterized protein LOC111022170 [Momordica charantia])

HSP 1 Score: 136.0 bits (341), Expect = 2.6e-28
Identity = 85/154 (55.19%), Postives = 96/154 (62.34%), Query Frame = 0

Query: 7   PPTVPKAPTQPETRAIPELETPNNGSADREKEEEGGQNDMESSKCECKTATPDEK----- 66
           P  + K     ET A PE ETP       +KE+   QN  E S  E KTATP EK     
Sbjct: 50  PAQIGKLTPAGETPATPEFETP-------KKEKARSQNQSEKSNSEMKTATPVEKAEDET 109

Query: 67  --------KQSAVTEIETGQ------RVKFRGIEFPVNGTPNRLKLPKAFKYPERYSSPT 126
                   K  ++  ++  +      +VKF GIE P NGTPNRLK+PKAFKYPERY SPT
Sbjct: 110 EKKRILVPKNGSMDRLKVPKSPNPAGKVKFSGIELPKNGTPNRLKVPKAFKYPERYMSPT 169

Query: 127 DLMMSPISKGLLARTRKGALPSKMHELRITEMSL 142
           DLMMSPISKGLLARTRKGA+PSKMHELRI EMSL
Sbjct: 170 DLMMSPISKGLLARTRKGAVPSKMHELRIREMSL 196

BLAST of Sed0003732 vs. ExPASy TrEMBL
Match: A0A6J1KFQ1 (uncharacterized protein LOC111492813 OS=Cucurbita maxima OX=3661 GN=LOC111492813 PE=4 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 8.2e-36
Identity = 100/176 (56.82%), Postives = 114/176 (64.77%), Query Frame = 0

Query: 1   MEIVSEPPTVPKAPTQP---------------ETRAIPELETPNNGSADREKEEEGGQND 60
           MEI +E P +P  PTQ                ET A PELETP NG+ DREK+ +  QN 
Sbjct: 1   MEIATE-PAIPTTPTQSKPVDSEQQSETSGAGETTATPELETPINGTPDREKKGK-AQNQ 60

Query: 61  MESSKCECKTATPDEK---KQSAVTEIE---------------TGQRVKFRGIEFPVNGT 120
           MESSKCECKT TPDEK   KQ  +  +                 G+  +  GI FP NGT
Sbjct: 61  MESSKCECKTPTPDEKTEEKQRILVPVNGLMMDRLKVPKSPNVAGEVKQRGGIAFPKNGT 120

Query: 121 PNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS 144
           PNRLK+PKAFKY ERY+SPTDLMMSPI+KGLLARTRKGA+PSKMHELRI+EMSL S
Sbjct: 121 PNRLKVPKAFKYHERYTSPTDLMMSPITKGLLARTRKGAVPSKMHELRISEMSLHS 174

BLAST of Sed0003732 vs. ExPASy TrEMBL
Match: A0A6J1G9R7 (uncharacterized protein LOC111452080 OS=Cucurbita moschata OX=3662 GN=LOC111452080 PE=4 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 5.3e-35
Identity = 99/176 (56.25%), Postives = 114/176 (64.77%), Query Frame = 0

Query: 1   MEIVSEPPTVPKAPTQP---------------ETRAIPELETPNNGSADREKEEEGGQND 60
           MEI +E PT+P  PTQ                ET A  ELETP NG+ DREK+ + GQ+ 
Sbjct: 1   MEIATE-PTIPTTPTQSKPVDSEQQSETSGAGETTATRELETPINGTPDREKKGK-GQDQ 60

Query: 61  MESSKCECKTATPDEK---KQSAVTEIE---------------TGQRVKFRGIEFPVNGT 120
           MESSKCECKT TPDEK   KQ  +  +                 G+  +  GI  P NGT
Sbjct: 61  MESSKCECKTPTPDEKTEEKQRILVPVNGLMMDRLKVPKSPNVAGEAKQRGGIALPKNGT 120

Query: 121 PNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGALPSKMHELRITEMSLQS 144
           PNRLK+PKAFKY ERY+SPTDLMMSPI+KGLLARTRKGA+PSKMHELRI+EMSL S
Sbjct: 121 PNRLKVPKAFKYHERYTSPTDLMMSPITKGLLARTRKGAVPSKMHELRISEMSLHS 174

BLAST of Sed0003732 vs. ExPASy TrEMBL
Match: A0A6J1DLV5 (uncharacterized protein LOC111022170 OS=Momordica charantia OX=3673 GN=LOC111022170 PE=4 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 1.3e-28
Identity = 85/154 (55.19%), Postives = 96/154 (62.34%), Query Frame = 0

Query: 7   PPTVPKAPTQPETRAIPELETPNNGSADREKEEEGGQNDMESSKCECKTATPDEK----- 66
           P  + K     ET A PE ETP       +KE+   QN  E S  E KTATP EK     
Sbjct: 50  PAQIGKLTPAGETPATPEFETP-------KKEKARSQNQSEKSNSEMKTATPVEKAEDET 109

Query: 67  --------KQSAVTEIETGQ------RVKFRGIEFPVNGTPNRLKLPKAFKYPERYSSPT 126
                   K  ++  ++  +      +VKF GIE P NGTPNRLK+PKAFKYPERY SPT
Sbjct: 110 EKKRILVPKNGSMDRLKVPKSPNPAGKVKFSGIELPKNGTPNRLKVPKAFKYPERYMSPT 169

Query: 127 DLMMSPISKGLLARTRKGALPSKMHELRITEMSL 142
           DLMMSPISKGLLARTRKGA+PSKMHELRI EMSL
Sbjct: 170 DLMMSPISKGLLARTRKGAVPSKMHELRIREMSL 196

BLAST of Sed0003732 vs. ExPASy TrEMBL
Match: A0A5A7UZK0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold692G00200 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 4.2e-24
Identity = 80/147 (54.42%), Postives = 88/147 (59.86%), Query Frame = 0

Query: 10  VPKAPTQ-----------PETRAIPELETPNNGSADREK--EEEGGQNDMESSKCECKTA 69
           VPK+PT+            +T     +  P NGS DR K  +  G  N       ECKT 
Sbjct: 62  VPKSPTKLNLECKTPTPDEKTDKKERILVPKNGSMDRSKVPKSPGKVN------LECKTP 121

Query: 70  TPDEKKQSAVTEIETGQRVKFRGIEFPVNGTPNRLKLPKAFKYPERYSSPTDLMMSPISK 129
           T               QRVK  GIE P NGTPNRLKLP AFKYPERY SPTDLM+SPISK
Sbjct: 122 T---------------QRVKIGGIELPKNGTPNRLKLPIAFKYPERYKSPTDLMISPISK 181

Query: 130 GLLARTRKGALPSKMHELRITEMSLQS 144
           GLLARTRKGA+PSKMHELR +EMSL S
Sbjct: 182 GLLARTRKGAVPSKMHELRNSEMSLLS 187

BLAST of Sed0003732 vs. ExPASy TrEMBL
Match: A0A1S3CI15 (uncharacterized protein LOC103501189 OS=Cucumis melo OX=3656 GN=LOC103501189 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 4.2e-24
Identity = 80/147 (54.42%), Postives = 88/147 (59.86%), Query Frame = 0

Query: 10  VPKAPTQ-----------PETRAIPELETPNNGSADREK--EEEGGQNDMESSKCECKTA 69
           VPK+PT+            +T     +  P NGS DR K  +  G  N       ECKT 
Sbjct: 62  VPKSPTKLNLECKTPTPDEKTDKKERILVPKNGSMDRSKVPKSPGKVN------LECKTP 121

Query: 70  TPDEKKQSAVTEIETGQRVKFRGIEFPVNGTPNRLKLPKAFKYPERYSSPTDLMMSPISK 129
           T               QRVK  GIE P NGTPNRLKLP AFKYPERY SPTDLM+SPISK
Sbjct: 122 T---------------QRVKIGGIELPKNGTPNRLKLPIAFKYPERYKSPTDLMISPISK 181

Query: 130 GLLARTRKGALPSKMHELRITEMSLQS 144
           GLLARTRKGA+PSKMHELR +EMSL S
Sbjct: 182 GLLARTRKGAVPSKMHELRNSEMSLLS 187

BLAST of Sed0003732 vs. TAIR 10
Match: AT3G02120.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 72.4 bits (176), Expect = 3.3e-13
Identity = 34/57 (59.65%), Postives = 44/57 (77.19%), Query Frame = 0

Query: 86  GTPNRLKLPKAFKYPERYSSPTDLMMSPISKGLLARTRKGA---LPSKMHELRITEM 140
           GTP RL++P AFKYPERY SPTD MMSP++KGLLARTRK +   +P   ++ +I E+
Sbjct: 58  GTPERLRVPIAFKYPERYRSPTDAMMSPVTKGLLARTRKSSGSLIPPSFNQTKIQEL 114

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022998048.11.7e-3556.82uncharacterized protein LOC111492813 [Cucurbita maxima][more]
KAG6607098.18.4e-3556.25hypothetical protein SDJN03_00440, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022948389.11.1e-3456.25uncharacterized protein LOC111452080 [Cucurbita moschata][more]
XP_023525350.11.4e-3455.68uncharacterized protein LOC111788976 [Cucurbita pepo subsp. pepo][more]
XP_022155023.12.6e-2855.19uncharacterized protein LOC111022170 [Momordica charantia][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1KFQ18.2e-3656.82uncharacterized protein LOC111492813 OS=Cucurbita maxima OX=3661 GN=LOC111492813... [more]
A0A6J1G9R75.3e-3556.25uncharacterized protein LOC111452080 OS=Cucurbita moschata OX=3662 GN=LOC1114520... [more]
A0A6J1DLV51.3e-2855.19uncharacterized protein LOC111022170 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
A0A5A7UZK04.2e-2454.42Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3CI154.2e-2454.42uncharacterized protein LOC103501189 OS=Cucumis melo OX=3656 GN=LOC103501189 PE=... [more]
Match NameE-valueIdentityDescription
AT3G02120.13.3e-1359.65hydroxyproline-rich glycoprotein family protein [more]
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 30..66
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..66
NoneNo IPR availablePANTHERPTHR36747HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 42..140

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sed0003732.1Sed0003732.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016740 transferase activity