CmoCh16G006460 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh16G006460
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
LocationCmo_Chr16: 3181973 .. 3188909 (-)
RNA-Seq ExpressionCmoCh16G006460
SyntenyCmoCh16G006460
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGGTTTTGATAGATGGCCTTATAAAGGTTGGTTTTATGGCATATATCTTCATATATTATATTCATTAAAGATCGATCTTTCGATAATCGTTCGATGAATACGAATTTTATAACACAGTGACGGTTTCTTCGAATAATTTTTGGTCGAAAACCCTTTTTTCGAACGCTCACTCTCCAATTTTGGCTGTTAATGATGACACAATGTACTTTAAGCAGTAGATTTTGGTGTGTGGATTTGTTATGATTACCAATTGTAGCAACTGTGGAATTTCACTGTTTTATGTCACATTCTTTACTTCATTAGTAAAATTGTTCTTTCTGTTCTATTTCTCTTGTTTTGAACATAACCTTATGGAATTTGAATGTTGATCAATGATCTTGGAGGGACTGATGATAATAAATTAGCAATAGTTGAAGGATGAAATAGCAAAAAATTACATCTTTTTCAGGTCATTTATGGCAGTGTTTAACACAAAAACACATATATACCATTCTGATTTCAGAAAGAAAGCTTGGTTTTAAGTATTTATTATGTGACAAGTAGTTAGCATTTATGTTCATAGCTATGCTTTTTACTCCTATGTCTTGAATTCTCTTAAAGGGAAGTACTTTTTTTGCTATTATTTTGTGTTCTTATAATTTGAATTGCTAGCTAGAAGCTTAACGATACAATTTTTTGGGGATTTGAATCTCGGATCTGTTGATCGAGAGTCTATGTGAGATCCCACGTCAGTTTGAGAGACGAATGAGTGCCAGCGAGGACGCTGGGCTCATAAGCGGTGGATTGTGAGATCCCACATTAGTTGGAGGGGGAACGAAGCATTCTTTATAAGGGTGTGGAAACTTCTCCTTAGTAGACGCGTTTTAAAAACTTTGAGGGAAAAGTCTAGAGAGGACAATATCTGTTAGCGGTGGGCTTAGGTTGTTACCGTATATATATCTTTAACTCACATATGATCTATTTTAGGTTCATAAAACTCTTTGTTTTGACATAAATTAAAGCTTTTTAGCATGAGAATTCTTAGAAATGAACCTATTCTATATATCTCCCTTGCTTCGATTGTATAAAAATGTTCTTGGAAACGTCTTTGTGGGTCATGTCGTTAGATGTCTGAAAGAGAAATTGTAAAAGTACTTTGAATTTACATATCGTGTTATCGTGACAAGACATGCAGATCTTTTAATACATAGGATTTGTACTAGGTTTTGTTTTTTGTTTTTTGTTTTTTTTTTTTTTGTTTTTTGGTGTTTAGATTCTTAAATTATATGCAATTTGAGTTCTCCCACCTTCCTTTGTGATCTTGTTCTTAAAAAAAAAGTGATATAATAATAAGATTTATCAGTAGTTTCTCAAGAATTTAGGGTTTGAACAAGATGTTAAGACATTGTTTGGACCAAAAAACATAGCAGCTAAGCCACGACAGACAAATAATGTCGTTGCTCAATTAAAATCTAGGATCAAGCAGCATGCTTTGTCTGACACACTACTTATTTTGATGAATTACAAATTAACTTTGATAAATTCTATACAAAAACAGCAATTTTTTTGGTTTTCTTGTTGTTGTGATGTTGTTGATGTTCCTCATGAATATCACTTTGTTCTTGGTGTTGTGCAGTTTTTGGGTATCTAAATGGCAGTGGAAGAAGCAAGTTCTTGTGTTCTTTCTCATAAATTGCCTCACAAGAAGCCAATTTGGTGTTGTTTATAGGTTTTGAGATCTTTTCAAAGGTGTGTATCTAGTTTTTGACATGTTTTTTGTTCTTTGTCGTGTAAAACATTCTAATTTGTGCTTTAAAAACTAGTTTTTGATCTATTAAGAAGGGATATGATGAGCAGCTACATGTTCTTAAATACTTATATGAACAAGAAACTCATAGGGCTTGATTATTTGATAGCATATATATACCAAACCCTAACTTGTTTCATATAGATATATAAACACACATGAAATATTAGAAAAATATGTGTATTATATTTGATGAAAAAAGAGGGTTTGAGAACATGATGGAAATGGAAGGGAATTGGGAGTGGAAAAAGGGTATCCTATTGTTGATATGCATAGAAAAGAGAAGGGGTAGGTGCCCTATAAGTGTGGGGTAGGTTTAGGCATATATAGGGAAAGATAGCATCTAGAATCTAAAATTCATTCACGGAGCGAGATCAAACATCGGTTAGGGAGTAGAACGAAACATTCTTTATAAGGGTATGAAAATCTCTCTCGAGTAGACGCGTTTTAAAAATCTTGAGTAGAAACTTGAAAGAGAAAGTCCAAAGATGACAATATCTGATAGAGGTGGGTTTGGGCTGTTACAAATGGTATCAGAACTAGACACAGGGCGGTGTGCCACCGAGGATGTTGGGTTCCAAAGGGAAGTGGATTGTGAGATCCCACATTGGTTGGAGAGGGGAACGAATCATTCTTCATAAGGGTGTGGAAACCTCTTCCTTGCAAACGCGTTTTAGAAACGTTCAGGGAAAGCCAAAAGAAGACAAGTATCTATTAGCGGTAGGCTTGGGATGTTACAAATGGTATCAGAGCTAGACTTCGGGTGATGTGTCAGCGAAGACGCTAGGCCCTGAAGAGAAGTGGATTATGAGATCCCACATCAGTTGGAGAGGGAAACAAAGTATTTTTTCATAAGGATGTGGAAACCTCTTCCTAGCAAACGGTTTTAGAAACGTTCAGGGAAAGCCCAAAGAAGACAATACCTACCAGCGGTGGGCTTGAACTGTTACAAATGATATCAGAGCCAAACACCAGGTAGTGTGCCAGCAAAAACACTGGACCCCGGAGAAGGGTATGTAAACCTCTCACTAGCAGACATGTTTAAAAACCTCGAGTGGAAGTACGAAAGGGAAAACCCAAAGAAGATAATATCTGTTACGTCATTACACATTCATTCATTCGTTCATTCTCATATTCATTGATTAGTGTTAAATGAGTTGAGGTGAGTGGAGGGCATGCAGTAGGAAGCAGCAGCAGCACGCAGCAGCAACAGCAGCAGCTAAAACAAAGGGCATGCAGTGAACAGTAGCAGAAAGTAGGTAGGGCTGGCTGGTTGGCTGGCTTATACAACTTTATTTTAAAGCTTGTAACGAAATGGGATAGTAGTAGCCCATTTCACCCATATCTCCTTACCTCTCTCATTAATTGTACTGCAACTCCTAAACCAAACAAACCCTTCTGACCCCTTTTTCCCACTCCTTTGGATATTTGTATGTCACCCTCTCCCCTTCTTTCACTTTTTCTATGCTTCACTCCTTATACCCTCTAATTCTCTACGCCTCCTCCATCTCCATGCAATGCTATCAATGCCCCTTTTATGCCTCCTTCTTTGGCAAACAAGAGTTAACCTAGGTAACACCTTCCCAATCATACATACCTCTAGGTCAATGAATACGGGACGATCATGTCTTTCTCTTCTCGTCATAGATCATAATCTAGTGGGTTCTAGTTTTAATATCTTTTCGAGAAATTGAGTTAGCTAATCTTGATATATAGTCAAGAATAGAACAGATCAATTACTTTTTGAGATGGGCTGAATTGTTAGAGTTTTTCATGGCGAATAAGAAAGAAATGAAGATTAAGAGTGTAATGCTTCATACCCAGAATTCAGATCACGATTTGGAATCCGAATTTAAGATTAGGTAATTGCAACATTCATATGCAATATGGTTACGTTACCTACTTATTCAGAAACTTCTAAGAGTGAATATTATCCTCACACTGTCACAGGTTCTTTCTACATGTTTTGTCTCTACTCAACATAAAATTACTCAAAGTAAAGTTGCAACAACTCCAGTAGGTCAGCCCCAACGTAAAATTACTAAGTTGTAACGGCCCAAGCCCCACTGCTAGTAGATGTTGTCCTCTTTAGACTTTCTCTCAAGATTTTTAAAACACGTCTAGTAAGGAGAGGTTTTCATACCTTAAAGAGTGTTTCGTTCTCTTCCCCAACCAATGTGGGATATCACAAAAGTACGCTTAACTTTTGAGTTTTCTAGTTACCTTACTGGTATAGATCGTAACTTTAAACTTTTTAGAACCTTTTTTTAGTTACTTTATTCTTATGATTCCTCTCATTCAGATGTAATGATAGTATATTCGTATCTGACAACAGACCATTGGAAGACCAAGTTGGGTTAATCTATATATATATATGATAAAAATGCATGGTGGTATATTATATATATATGAGAGAGAGAGAGATATGGGCTATGTATGCAATAGGGTGGGGTAGGGGTTAGCATAGGACAATAGGATAGCTTAAAGACAGACATATGGGGTTTAATTAAATATTGGTGCAGAGAGCATTAAGAGTGACAGTCGGCAGCCGATCCCATAAAATTAAAACAACACACATGCCAAACTATATTATAAAATATAATATAATGTATCAATGTATATATAAATTTATCAAAATGTTTAAAGTCCGTACGACTGCCAATCACACACACTTTTTTTTTTTTTTTTCCTCTCTCTCTTCTCTCTCTTCTCTCTCTCCACCCTTGAGAAGCTCCATAGGTCCAATGAATTATGGTAAATTTATAGAATATAGAAAGTAATTTTGTTTAGTGAGAAAATAACGTTACACGAGAATATGAGTAATTGTTGAACTTCGAAACTTTAAACGGAAAGATTACGATAAATTTAATTATATTTAGAATTTTAAAATTTAAACTTCCGACTTTAAGAGCGTAGCATTTCTTTCATCATTGAATTTGGTTCATATTGATATCGGTTTTGTTGAATTAAAAAATAATTATTTGATTTAAATGATTAAATTGTAAGTTAGTATTTTCAAACATAATCATGATTATATAATTTTTATTAGAAAAAATAATAAGAACCTATTAGTAAAAAAATATATATATATATAAATTTATCGACATGTAATTTAATCTCCGTTGTTTTATTTTTATTTTATTTTATGTATTAATATTTATATATTAAATTAGAGGATTCAGTCAAGAAGCCCTTATAGCTCAGTGGTAGAGCGTCAGTCTTGTAAACTGAAGGTCCGTAGTTCGATCCTGCGTGAGGGCAATGCATTATGTATTTTAGGCATTTTCGCGCTAATTCTAAAATGAAATTCTTATTTCCAAAACTAACCAATGAATTTTTGTAATTCCAATTTTACCCCTATTGGCTATTTTGATTTCCCATCATTGGTGTATTTACAAGCCTTCAATACTATATATATTATATATTGACTTACATATTTCATATACTAGTAACTCATATACAAAATATAAAAACTTTATAACACTTGTAGCTTCAATTCTCCTACCATCGACATCGGGTTTTGGTTCTATCTAAAAAAGTCAACAAGCCCTTATTGTTCAGTGGTACAGCGTCAGTCTTGTAAACTGAAGGTTCGTAGTTTAATCCTGCGTGGGGGCAAATATTATTATGTAGTGATTATGTATATATATATATATTACGTCTAGAATTATCGCCCTTGACATTAGGGCGGGTGAGGGCAAAATTGTAAATTCAAGATTGAAAATCCTTGCCGTGTTTGACATTTAAATAAACCTTAAACCTAAAAGTAAGACCCAGAAATAATAAATTAATAATTAAGAGAAATAATTCTCACATCATATTTCCTTTTCTTTATTACGCAATAATCTTTTCTTTTATTTATTTATTATTTATTTTTTTAATTTTCATTTCCGTCTTCTTTCTACTTCTCTTCACTTTCTGACTGCAAATTTTCCTCTTTGGTTCTGTTTTAGAATTTCATCATCTCTCGTGTATGAGGAGCTACGATTCTTTCATGAAACTGATCGGCGATTCTCTAGTTTTGATGAAATCGATCGGCGAATCTCTGGTTTTGAGCAACGATATCGATGAGACGCCGAATGGATGATGATGATGATGCCGTGCGGCTGCCGATTTTATAGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGATATCCAGGTATGGTAATGTTCACATCAGTTCGCCGTCCATTTAATCATTTGGAGTTTTGGATTTTAGTGTTGGTTTATTTATGGATTGAGAATACTGGCTATGTTTTTGGAAATTACTGTTCATTCTTAACTCCGTTAGGTATTAGGATGTAGTTTTTTGTCGAGGGATGATATTAGTGTTGGTTTATTTATGGATTGAGAATACTGGCTATGTTTTTGGAAATTACTGTTCATTCTTAACTCCGTTAGGTATTAGGATGTAGTTTTTTGTCGAGGGATGATATTATGGCGGAATGGAATCTGTATTGTTTGTCGTGGATTTGCGGAATTCTAAATTAGGGTTTTAATTTCCCCCTTTTTCCTGCTCCTTGCTGTTTGTTTGCTTAGGGACTGCGAATCGTTACATGAATACACGGAATCACGAGCTGGCTTCGTTTCTGATTTTCTGCTTTTTCTGCTTTTGGCTTTCTCCTCGCATGATTATTAGATACTTAGGGTTTGGAAGGTACCAAGATTTGTGCATTGGGTTGGACGACTGTGTCAGGAAGCAATGCGCTGTTCTAGCCCAATGTTGTTTGTCTTCTTGTAATAACGCTGCTCTTTTTAACCAGAAAGGCATCTCTCCATTCTTTAAATTAATGTCATTCTCTATTGAGGAAGAGTGTTGCTTTATATATAAAGATTCCAACAAATAGGGCTCTCATTTGTCCAAGCTCTTTCCACTGTTCTGTACTGATGTCATTATCACTTTCCTCGTTTTCTCTTTGTTTGCTTGCCCTCAGAAAAGGAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAGTTGTTGTGCACGCTGTGTGTTGGTACCAGAACCTGGTCCTCCTTCAGCTGAGGCTCATGAAGAAGATTCATTGCACTCACCCGATATTGAGCTTCCACTTGCTGCACCCCTCCCTCTTCCCCTGTATCCTTCCTTCAATCTGA

mRNA sequence

ATGCAGGTTTTGATAGATGGCCTTATAAAGTTTTTGGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGATATCCAGAAAAGGAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAGTTGTTGTGCACGCTGTGTGTTGGTACCAGAACCTGGTCCTCCTTCAGCTGAGGCTCATGAAGAAGATTCATTGCACTCACCCGATATTGAGCTTCCACTTGCTGCACCCCTCCCTCTTCCCCTGTATCCTTCCTTCAATCTGA

Coding sequence (CDS)

ATGCAGGTTTTGATAGATGGCCTTATAAAGTTTTTGGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGATATCCAGAAAAGGAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAGTTGTTGTGCACGCTGTGTGTTGGTACCAGAACCTGGTCCTCCTTCAGCTGAGGCTCATGAAGAAGATTCATTGCACTCACCCGATATTGAGCTTCCACTTGCTGCACCCCTCCCTCTTCCCCTGTATCCTTCCTTCAATCTGA

Protein sequence

MQVLIDGLIKFLATVDHRFPRATDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAVCWYQNLVLLQLRLMKKIHCTHPILSFHLLHPSLFPCILPSI
Homology
BLAST of CmoCh16G006460 vs. ExPASy TrEMBL
Match: A0A6J1C828 (uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC111008285 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.0e-13
Identity = 37/43 (86.05%), Postives = 40/43 (93.02%), Query Frame = 0

Query: 12 LATVDHRFPRATDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAV 55
          +ATVDHRFPRAT +QKRRWGSCWSIYWCFGSLKQRKR + HAV
Sbjct: 30 IATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKR-IGHAV 71

BLAST of CmoCh16G006460 vs. ExPASy TrEMBL
Match: A0A6J1FSP7 (uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111446946 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.0e-13
Identity = 37/43 (86.05%), Postives = 40/43 (93.02%), Query Frame = 0

Query: 12 LATVDHRFPRATDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAV 55
          +ATVDHRFPRAT +QKRRWGSCWSIYWCFGSLKQRKR + HAV
Sbjct: 42 IATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKR-IGHAV 83

BLAST of CmoCh16G006460 vs. ExPASy TrEMBL
Match: A0A6J1FP20 (uncharacterized protein At1g76660-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111446946 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.0e-13
Identity = 37/43 (86.05%), Postives = 40/43 (93.02%), Query Frame = 0

Query: 12 LATVDHRFPRATDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAV 55
          +ATVDHRFPRAT +QKRRWGSCWSIYWCFGSLKQRKR + HAV
Sbjct: 32 IATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKR-IGHAV 73

BLAST of CmoCh16G006460 vs. ExPASy TrEMBL
Match: A0A6J1IUL0 (uncharacterized protein At1g76660-like OS=Cucurbita maxima OX=3661 GN=LOC111480076 PE=4 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 1.1e-12
Identity = 35/43 (81.40%), Postives = 39/43 (90.70%), Query Frame = 0

Query: 12 LATVDHRFPRATDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAV 55
          +ATVDHRFPR T +QKRRWGSCWSIYWCFGSL+QRKR + HAV
Sbjct: 35 IATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKR-IGHAV 76

BLAST of CmoCh16G006460 vs. ExPASy TrEMBL
Match: A0A5D3CYQ2 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 3.2e-12
Identity = 36/43 (83.72%), Postives = 39/43 (90.70%), Query Frame = 0

Query: 12 LATVDHRFPRATDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAV 55
          +ATVDHRFPRAT +QKRRWGSC SIYWCFGSLKQRKR + HAV
Sbjct: 27 IATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKR-IGHAV 68

BLAST of CmoCh16G006460 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 42.4 bits (98), Expect = 2.4e-04
Identity = 15/24 (62.50%), Postives = 18/24 (75.00%), Query Frame = 0

Query: 26 QKRRWGSCWSIYWCFGSLKQRKRV 50
          QK RWG CWS+Y CFG+ K  KR+
Sbjct: 32 QKGRWGKCWSLYSCFGTQKNNKRI 55

BLAST of CmoCh16G006460 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 41.6 bits (96), Expect = 4.2e-04
Identity = 19/32 (59.38%), Postives = 24/32 (75.00%), Query Frame = 0

Query: 23 TDIQKRRWGSCWSIYWCFGSLKQRKRVVVHAV 55
          + +QK+R GS WS+YWCFGS K  KR + HAV
Sbjct: 29 SSVQKKR-GSWWSLYWCFGSKKNNKR-IGHAV 58

BLAST of CmoCh16G006460 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 40.4 bits (93), Expect = 9.3e-04
Identity = 18/46 (39.13%), Postives = 30/46 (65.22%), Query Frame = 0

Query: 5  IDGLIKFLATVDHRFPRATDI-QKRRWGSCWSIYWCFGSLKQRKRV 50
          I+     +A+ D R  +++ I +KR+W + WS+  CFGS +QRKR+
Sbjct: 14 INAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRI 59

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1C8281.0e-1386.05uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC1110... [more]
A0A6J1FSP71.0e-1386.05uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita moschata OX=3662 ... [more]
A0A6J1FP201.0e-1386.05uncharacterized protein At1g76660-like isoform X2 OS=Cucurbita moschata OX=3662 ... [more]
A0A6J1IUL01.1e-1281.40uncharacterized protein At1g76660-like OS=Cucurbita maxima OX=3661 GN=LOC1114800... [more]
A0A5D3CYQ23.2e-1283.72Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT5G52430.12.4e-0462.50hydroxyproline-rich glycoprotein family protein [more]
AT4G25620.14.2e-0459.38hydroxyproline-rich glycoprotein family protein [more]
AT1G63720.19.3e-0439.13BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh16G006460.1CmoCh16G006460.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051260 protein homooligomerization
biological_process GO:0016567 protein ubiquitination