Chy4G069010 (gene) Cucumber (hystrix) v1

Overview
NameChy4G069010
Typegene
OrganismCucumis hystrix (Cucumber (hystrix) v1)
Descriptionhydroxyproline-rich glycoprotein family protein
LocationchrH04: 1627585 .. 1628091 (-)
RNA-Seq ExpressionChy4G069010
SyntenyChy4G069010
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTATGATAATCGCGAAGCACAGGATCCTGCCTCTCTAAGCTTTCCAGTTGGTTTAGTTCTTTTGTTAACGTTTTTGTTTTGTATGTGTTGTTTCTTTTGTTGTTGTCTTCACTGGGAGAAGCTCCGATCCTTTCTTGGCTGCCCCGATCATCTCCACCACCACCATCCTCCCATTCCCCCGCCCCAATCCCCCGCCCCCGATAAAGTTCCGCCTATTCACTCGGTGCGTCCTTTTCTTCATTTCTTCATTATTTTAAATACCATAATCCAATTGTTTTTACCTGGTGTGTTACTCTTACTATATATACGTTTAACTAATCTTTCCTTCTTCGTCTATTATTCTCTTAATAACGTGAAGATATGGAAGGAGAATCGGCCGCAAAGCGTATCGGTGTTGATGCCAGGCGACGAGGTTCCGAGGTTTATAGCAATGGCGTGTCCGGCGTTGGTGGAAATTGTAGTACAAAAACCTTCGCAAAGTATTTCTTCAGATAATCCTTAA

mRNA sequence

ATGGAGTATGATAATCGCGAAGCACAGGATCCTGCCTCTCTAAGCTTTCCAGTTGGTTTAGTTCTTTTGTTAACGTTTTTGTTTTGTATGTGTTGTTTCTTTTGTTGTTGTCTTCACTGGGAGAAGCTCCGATCCTTTCTTGGCTGCCCCGATCATCTCCACCACCACCATCCTCCCATTCCCCCGCCCCAATCCCCCGCCCCCGATAAAGTTCCGCCTATTCACTCGATATGGAAGGAGAATCGGCCGCAAAGCGTATCGGTGTTGATGCCAGGCGACGAGGTTCCGAGGTTTATAGCAATGGCGTGTCCGGCGTTGGTGGAAATTGTAGTACAAAAACCTTCGCAAAGTATTTCTTCAGATAATCCTTAA

Coding sequence (CDS)

ATGGAGTATGATAATCGCGAAGCACAGGATCCTGCCTCTCTAAGCTTTCCAGTTGGTTTAGTTCTTTTGTTAACGTTTTTGTTTTGTATGTGTTGTTTCTTTTGTTGTTGTCTTCACTGGGAGAAGCTCCGATCCTTTCTTGGCTGCCCCGATCATCTCCACCACCACCATCCTCCCATTCCCCCGCCCCAATCCCCCGCCCCCGATAAAGTTCCGCCTATTCACTCGATATGGAAGGAGAATCGGCCGCAAAGCGTATCGGTGTTGATGCCAGGCGACGAGGTTCCGAGGTTTATAGCAATGGCGTGTCCGGCGTTGGTGGAAATTGTAGTACAAAAACCTTCGCAAAGTATTTCTTCAGATAATCCTTAA

Protein sequence

MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPIPPPQSPAPDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACPALVEIVVQKPSQSISSDNP*
Homology
BLAST of Chy4G069010 vs. ExPASy Swiss-Prot
Match: Q9LSK9 (Uncharacterized protein At5g65660 OS=Arabidopsis thaliana OX=3702 GN=At5g65660 PE=2 SV=1)

HSP 1 Score: 57.0 bits (136), Expect = 1.8e-07
Identity = 43/111 (38.74%), Postives = 56/111 (50.45%), Query Frame = 0

Query: 13  SLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPIPPPQSPAPDKVP 72
           SL FP+G  LLL  +F +   F CC HW+K RS       L +  P      +P   K P
Sbjct: 18  SLGFPLGTALLLIIIFSLSGIFSCCYHWDKHRSL---RRSLANGRPSADIESNPYKPK-P 77

Query: 73  PIHSIWKENRPQSVSVLMPGDEVPRFIAMACPAL------VEIVVQKPSQS 118
           P   + K+ +  SV VLMPGD  P+FIA+ CP        + + VQ P QS
Sbjct: 78  PFPEM-KKPQNLSVPVLMPGDNTPKFIALPCPCAPPRPEKLTVDVQTPPQS 123

BLAST of Chy4G069010 vs. ExPASy TrEMBL
Match: A0A5D3E545 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G005320 PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 5.1e-58
Identity = 114/133 (85.71%), Postives = 117/133 (87.97%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPPQSPA----PDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACP-------ALVEI 120
           PPP SPA    PDK  PIH+IWKENRPQSVSVLMPGDEVPRFIA+ACP       ALVEI
Sbjct: 61  PPPHSPAALSPPDKFSPIHTIWKENRPQSVSVLMPGDEVPRFIALACPPCATAAAALVEI 120

Query: 121 VVQKPSQSISSDN 123
           VVQKPSQSIS  +
Sbjct: 121 VVQKPSQSISDSS 133

BLAST of Chy4G069010 vs. ExPASy TrEMBL
Match: A0A1S3CQS4 (uncharacterized protein At5g65660-like OS=Cucumis melo OX=3656 GN=LOC103503633 PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 5.1e-58
Identity = 114/133 (85.71%), Postives = 117/133 (87.97%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPPQSPA----PDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACP-------ALVEI 120
           PPP SPA    PDK  PIH+IWKENRPQSVSVLMPGDEVPRFIA+ACP       ALVEI
Sbjct: 61  PPPHSPAALSPPDKFSPIHTIWKENRPQSVSVLMPGDEVPRFIALACPPCATAAAALVEI 120

Query: 121 VVQKPSQSISSDN 123
           VVQKPSQSIS  +
Sbjct: 121 VVQKPSQSISDSS 133

BLAST of Chy4G069010 vs. ExPASy TrEMBL
Match: A0A6J1F906 (uncharacterized protein At5g65660-like OS=Cucurbita moschata OX=3662 GN=LOC111443410 PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 6.2e-48
Identity = 100/130 (76.92%), Postives = 106/130 (81.54%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYD+REAQD ASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDH  H HP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHADHRHPSI 60

Query: 61  PP--PQSPAPDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACP-----ALVEIVVQK 120
           PP  P    PDK  PIH+IWKENRP+S++VLMPGDEVPRFIAMACP      LVEIV+QK
Sbjct: 61  PPETPAFSPPDKFSPIHTIWKENRPESLTVLMPGDEVPRFIAMACPPCGAAPLVEIVIQK 120

Query: 121 PSQSISSDNP 124
           PSQSIS   P
Sbjct: 121 PSQSISGALP 130

BLAST of Chy4G069010 vs. ExPASy TrEMBL
Match: A0A6J1IEI5 (uncharacterized protein At5g65660-like OS=Cucurbita maxima OX=3661 GN=LOC111476516 PE=4 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 1.8e-47
Identity = 100/130 (76.92%), Postives = 106/130 (81.54%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYD+REAQD ASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDH  H HP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHSDHLHPSI 60

Query: 61  PP--PQSPAPDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACP-----ALVEIVVQK 120
           PP  P    PDK  PIH+IWKENRP+S++VLMPGDEVPRFIAMACP      LVEIV+QK
Sbjct: 61  PPETPAFSPPDKFSPIHTIWKENRPESLTVLMPGDEVPRFIAMACPPCGAAPLVEIVIQK 120

Query: 121 PSQSISSDNP 124
           PSQSIS   P
Sbjct: 121 PSQSISGALP 130

BLAST of Chy4G069010 vs. ExPASy TrEMBL
Match: A0A0A0LE40 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G893400 PE=4 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 5.8e-46
Identity = 91/94 (96.81%), Postives = 91/94 (96.81%), Query Frame = 0

Query: 30  MCCFFCCCLHWEKLRSFLGCPDHLHHHHPPIPPPQSPAPDKVPPIHSIWKENRPQSVSVL 89
           MCCFFCCCLHWEKLRSFLGCPDHLHHHHPPIP PQSPAPDKV PIHSIWKENRPQSVSVL
Sbjct: 1   MCCFFCCCLHWEKLRSFLGCPDHLHHHHPPIPQPQSPAPDKVSPIHSIWKENRPQSVSVL 60

Query: 90  MPGDEVPRFIAMACPALVEIVVQKPSQSISSDNP 124
           MPGDEVPRFIAMACPALVEIVVQKPSQSI SDNP
Sbjct: 61  MPGDEVPRFIAMACPALVEIVVQKPSQSI-SDNP 93

BLAST of Chy4G069010 vs. NCBI nr
Match: XP_008466113.1 (PREDICTED: uncharacterized protein At5g65660-like [Cucumis melo] >KAA0038642.1 uncharacterized protein E6C27_scaffold92G001760 [Cucumis melo var. makuwa] >TYK31243.1 uncharacterized protein E5676_scaffold455G005320 [Cucumis melo var. makuwa])

HSP 1 Score: 233 bits (593), Expect = 1.26e-76
Identity = 114/133 (85.71%), Postives = 117/133 (87.97%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPPQSPA----PDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACP-------ALVEI 120
           PPP SPA    PDK  PIH+IWKENRPQSVSVLMPGDEVPRFIA+ACP       ALVEI
Sbjct: 61  PPPHSPAALSPPDKFSPIHTIWKENRPQSVSVLMPGDEVPRFIALACPPCATAAAALVEI 120

Query: 121 VVQKPSQSISSDN 122
           VVQKPSQSIS  +
Sbjct: 121 VVQKPSQSISDSS 133

BLAST of Chy4G069010 vs. NCBI nr
Match: XP_038899315.1 (uncharacterized protein At5g65660-like [Benincasa hispida])

HSP 1 Score: 213 bits (543), Expect = 4.82e-69
Identity = 108/131 (82.44%), Postives = 114/131 (87.02%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRSFLGCPDHLHHHHPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPPQSPA---PDKVPPIHS-IWKENRPQSVSVLMPGDEVPRFIAMACP-----ALVEIVV 120
           PP Q+ A   PDK+ PIH+ IW+ENRPQS+SVLMPGDEVPRFIAMACP     ALVEIVV
Sbjct: 61  PP-QTAAFSPPDKISPIHTKIWRENRPQSLSVLMPGDEVPRFIAMACPPCAAAALVEIVV 120

Query: 121 QKPSQSISSDN 122
           QKPSQS    +
Sbjct: 121 QKPSQSFPDSS 130

BLAST of Chy4G069010 vs. NCBI nr
Match: KAG6591711.1 (hypothetical protein SDJN03_14057, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024594.1 hypothetical protein SDJN02_13412 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 200 bits (508), Expect = 1.08e-63
Identity = 101/131 (77.10%), Postives = 109/131 (83.21%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYD+REAQD ASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDH  H HP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHADHRHPSI 60

Query: 61  PPPQSPA---PDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACPA-----LVEIVVQ 120
           PP ++PA   PDK  PIH+IWKENRP+S++VLMPGDEVPRFIAMACP      LVEIV+Q
Sbjct: 61  PP-ETPAFSPPDKFSPIHTIWKENRPESLTVLMPGDEVPRFIAMACPPCGAAPLVEIVIQ 120

Query: 121 KPSQSISSDNP 123
           KPSQSIS   P
Sbjct: 121 KPSQSISEALP 130

BLAST of Chy4G069010 vs. NCBI nr
Match: XP_022936981.1 (uncharacterized protein At5g65660-like [Cucurbita moschata])

HSP 1 Score: 200 bits (508), Expect = 1.08e-63
Identity = 101/131 (77.10%), Postives = 109/131 (83.21%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYD+REAQD ASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDH  H HP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHADHRHPSI 60

Query: 61  PPPQSPA---PDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACPA-----LVEIVVQ 120
           PP ++PA   PDK  PIH+IWKENRP+S++VLMPGDEVPRFIAMACP      LVEIV+Q
Sbjct: 61  PP-ETPAFSPPDKFSPIHTIWKENRPESLTVLMPGDEVPRFIAMACPPCGAAPLVEIVIQ 120

Query: 121 KPSQSISSDNP 123
           KPSQSIS   P
Sbjct: 121 KPSQSISGALP 130

BLAST of Chy4G069010 vs. NCBI nr
Match: XP_022975977.1 (uncharacterized protein At5g65660-like [Cucurbita maxima])

HSP 1 Score: 198 bits (504), Expect = 4.38e-63
Identity = 101/131 (77.10%), Postives = 109/131 (83.21%), Query Frame = 0

Query: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60
           MEYD+REAQD ASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDH  H HP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHSDHLHPSI 60

Query: 61  PPPQSPA---PDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACPA-----LVEIVVQ 120
           PP ++PA   PDK  PIH+IWKENRP+S++VLMPGDEVPRFIAMACP      LVEIV+Q
Sbjct: 61  PP-ETPAFSPPDKFSPIHTIWKENRPESLTVLMPGDEVPRFIAMACPPCGAAPLVEIVIQ 120

Query: 121 KPSQSISSDNP 123
           KPSQSIS   P
Sbjct: 121 KPSQSISGALP 130

BLAST of Chy4G069010 vs. TAIR 10
Match: AT5G65660.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 57.0 bits (136), Expect = 1.3e-08
Identity = 43/111 (38.74%), Postives = 56/111 (50.45%), Query Frame = 0

Query: 13  SLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPIPPPQSPAPDKVP 72
           SL FP+G  LLL  +F +   F CC HW+K RS       L +  P      +P   K P
Sbjct: 18  SLGFPLGTALLLIIIFSLSGIFSCCYHWDKHRSL---RRSLANGRPSADIESNPYKPK-P 77

Query: 73  PIHSIWKENRPQSVSVLMPGDEVPRFIAMACPAL------VEIVVQKPSQS 118
           P   + K+ +  SV VLMPGD  P+FIA+ CP        + + VQ P QS
Sbjct: 78  PFPEM-KKPQNLSVPVLMPGDNTPKFIALPCPCAPPRPEKLTVDVQTPPQS 123

BLAST of Chy4G069010 vs. TAIR 10
Match: AT4G28170.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G11120.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 41.2 bits (95), Expect = 7.1e-04
Identity = 25/60 (41.67%), Postives = 30/60 (50.00%), Query Frame = 0

Query: 61  PPPQSPAPDKVPPIHSIWKENRPQSVSVLMPGDEVPRFIAMACPALVEIVVQKPSQSISS 120
           P P  P   K PP  S   +   + +SVLMPG++VP FIA  CP         PS S SS
Sbjct: 92  PTPSPPLDQKFPPFASPKMDVCKREISVLMPGEDVPTFIAQPCP---------PSSSSSS 142

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LSK91.8e-0738.74Uncharacterized protein At5g65660 OS=Arabidopsis thaliana OX=3702 GN=At5g65660 P... [more]
Match NameE-valueIdentityDescription
A0A5D3E5455.1e-5885.71Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3CQS45.1e-5885.71uncharacterized protein At5g65660-like OS=Cucumis melo OX=3656 GN=LOC103503633 P... [more]
A0A6J1F9066.2e-4876.92uncharacterized protein At5g65660-like OS=Cucurbita moschata OX=3662 GN=LOC11144... [more]
A0A6J1IEI51.8e-4776.92uncharacterized protein At5g65660-like OS=Cucurbita maxima OX=3661 GN=LOC1114765... [more]
A0A0A0LE405.8e-4696.81Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G893400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
XP_008466113.11.26e-7685.71PREDICTED: uncharacterized protein At5g65660-like [Cucumis melo] >KAA0038642.1 u... [more]
XP_038899315.14.82e-6982.44uncharacterized protein At5g65660-like [Benincasa hispida][more]
KAG6591711.11.08e-6377.10hypothetical protein SDJN03_14057, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022936981.11.08e-6377.10uncharacterized protein At5g65660-like [Cucurbita moschata][more]
XP_022975977.14.38e-6377.10uncharacterized protein At5g65660-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT5G65660.11.3e-0838.74hydroxyproline-rich glycoprotein family protein [more]
AT4G28170.17.1e-0441.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (hystrix) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR34291:SF7PROTEIN, PUTATIVE-RELATEDcoord: 5..118
IPR037699Uncharacterized protein At5g65660-likePANTHERPTHR34291HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 5..118

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Chy4G069010.1Chy4G069010.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane