Cp4.1LG08g09730 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG08g09730
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Hydroxyproline-rich glycoprotein family protein
Location: Cp4.1LG08: 7616576 .. 7617418 (+)
RNA-Seq Expression: Cp4.1LG08g09730
Synteny: Cp4.1LG08g09730
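
The Location field can be turned into a span length with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, the regular expression assumes exactly the `seqid: start .. end (strand)` layout shown above, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG08: 7616576 .. 7617418 (+)")
length = end - start + 1  # assumes inclusive coordinates, hence the +1
print(seqid, strand, length)
```

Under that assumption the span is 843 bp, which is consistent with the 280-residue protein of this record plus a stop codon (281 × 3 = 843).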
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTCTTCATCGGACAATCAACAATCCAAATCCAAATCCACCGACTCTCAACCTCTTCCGCCGCCCTCCGCCGCACATAACCCACCTCCGATCTACCCTCCTCCCACCATGGGGTACCCTCCAGCCCCACATCCGGGGTACCCTCCAGCGCCAGGGGCTTACCCACCTTACAATGGCTACGCCTACGCCCAAGCCCCTCCTGCCGCCTATTACCACAACAGCCCCCAAAATTACGCGGTGGAGCCGTTTCACGCCGCCTTCATCCGCGGCATTGTCACCGCCTTAATAATTCTGGTGGTTCTAATGATGCTCTCCAGCATAATCACCTGGATCATCCTCCGACCAGAAATCCCAACGTTCAGAGTTGATACATTGGGCGTAACCAATTTTAACATCTCCAAATCGAATTACTCCGGAAACTGGAACGCGACCTTGGTGGTCCAGAATCCCAACAAGAAATTGAACCTGACTTTCAAGCGGATCCAGGGGTTCGTGGGGTATAAGGACAACACGCTGGCAATGTCGTTTGCGGACCCATTTTTTCTTGCCGTGGAGAGGACTAACCTAATGCGGGTGAGATGGACATCGAGTAGCCCTGATGATCCGGGGAATTGGGAGGAGACAGAGGAGAAATTGGGGAAGGAGAAGGCGACGAGGAAAGTTGGTTTCAATTTGAGATTCTTCGTATGGACCACTTTCCAATCTGGGTCTTGGTGGACCAGGCACGTTATTTTGAGAGTCTTTTGTGACGATTTGAAGATCGACTTCGGCACTCCCAACTCCGTTAATGGCTCCTTCTCCGCCCATGGCCACCACATGCATTGCGCGGTTCTCATGTAG

mRNA sequence

ATGGCCTCTTCATCGGACAATCAACAATCCAAATCCAAATCCACCGACTCTCAACCTCTTCCGCCGCCCTCCGCCGCACATAACCCACCTCCGATCTACCCTCCTCCCACCATGGGGTACCCTCCAGCCCCACATCCGGGGTACCCTCCAGCGCCAGGGGCTTACCCACCTTACAATGGCTACGCCTACGCCCAAGCCCCTCCTGCCGCCTATTACCACAACAGCCCCCAAAATTACGCGGTGGAGCCGTTTCACGCCGCCTTCATCCGCGGCATTGTCACCGCCTTAATAATTCTGGTGGTTCTAATGATGCTCTCCAGCATAATCACCTGGATCATCCTCCGACCAGAAATCCCAACGTTCAGAGTTGATACATTGGGCGTAACCAATTTTAACATCTCCAAATCGAATTACTCCGGAAACTGGAACGCGACCTTGGTGGTCCAGAATCCCAACAAGAAATTGAACCTGACTTTCAAGCGGATCCAGGGGTTCGTGGGGTATAAGGACAACACGCTGGCAATGTCGTTTGCGGACCCATTTTTTCTTGCCGTGGAGAGGACTAACCTAATGCGGGTGAGATGGACATCGAGTAGCCCTGATGATCCGGGGAATTGGGAGGAGACAGAGGAGAAATTGGGGAAGGAGAAGGCGACGAGGAAAGTTGGTTTCAATTTGAGATTCTTCGTATGGACCACTTTCCAATCTGGGTCTTGGTGGACCAGGCACGTTATTTTGAGAGTCTTTTGTGACGATTTGAAGATCGACTTCGGCACTCCCAACTCCGTTAATGGCTCCTTCTCCGCCCATGGCCACCACATGCATTGCGCGGTTCTCATGTAG

Coding sequence (CDS)

ATGGCCTCTTCATCGGACAATCAACAATCCAAATCCAAATCCACCGACTCTCAACCTCTTCCGCCGCCCTCCGCCGCACATAACCCACCTCCGATCTACCCTCCTCCCACCATGGGGTACCCTCCAGCCCCACATCCGGGGTACCCTCCAGCGCCAGGGGCTTACCCACCTTACAATGGCTACGCCTACGCCCAAGCCCCTCCTGCCGCCTATTACCACAACAGCCCCCAAAATTACGCGGTGGAGCCGTTTCACGCCGCCTTCATCCGCGGCATTGTCACCGCCTTAATAATTCTGGTGGTTCTAATGATGCTCTCCAGCATAATCACCTGGATCATCCTCCGACCAGAAATCCCAACGTTCAGAGTTGATACATTGGGCGTAACCAATTTTAACATCTCCAAATCGAATTACTCCGGAAACTGGAACGCGACCTTGGTGGTCCAGAATCCCAACAAGAAATTGAACCTGACTTTCAAGCGGATCCAGGGGTTCGTGGGGTATAAGGACAACACGCTGGCAATGTCGTTTGCGGACCCATTTTTTCTTGCCGTGGAGAGGACTAACCTAATGCGGGTGAGATGGACATCGAGTAGCCCTGATGATCCGGGGAATTGGGAGGAGACAGAGGAGAAATTGGGGAAGGAGAAGGCGACGAGGAAAGTTGGTTTCAATTTGAGATTCTTCGTATGGACCACTTTCCAATCTGGGTCTTGGTGGACCAGGCACGTTATTTTGAGAGTCTTTTGTGACGATTTGAAGATCGACTTCGGCACTCCCAACTCCGTTAATGGCTCCTTCTCCGCCCATGGCCACCACATGCATTGCGCGGTTCTCATGTAG

Protein sequence

MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM
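
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative, and only the first and last codons of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCTCTTCATCGGAC"))  # first six codons of the CDS
print(translate("GTTCTCATGTAG"))        # last four codons, ending in the stop codon TAG
```

The two calls print `MASSSD` and `VLM`, matching the start and end of the protein sequence listed above.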
Homology
BLAST of Cp4.1LG08g09730 vs. ExPASy Swiss-Prot
Match: Q9SJ52 (NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 7.5e-06
Identity = 50/214 (23.36%), Positives = 95/214 (44.39%), Query Frame = 0

Query: 54  AYPPYNGYAYAQA--PPA--AYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSII 113
           A  P NG  Y  +  PPA   YY             + F++ I++    L+V++ ++++I
Sbjct: 3   AEQPLNGAFYGPSVPPPAPKGYYRRGHGRGCGCCLLSLFVKVIIS----LIVILGVAALI 62

Query: 114 TWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWN--ATLVVQNPNKKLNLTFKRIQGFVG 173
            W+I+RP    F V    +T F+ +  +    +N   T+ V+NPNK++ L + RI+    
Sbjct: 63  FWLIVRPRAIKFHVTDASLTRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAY 122

Query: 174 YKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLR 233
           Y+    +     PF+   + T ++   +   +       +     L  E+ +      ++
Sbjct: 123 YEGKRFSTITLTPFYQGHKNTTVLTPTFQGQNLVIFNAGQ--SRTLNAERISGVYNIEIK 182

Query: 234 FFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPN 262
           F +   F+ G    R +  +V CDDL++   T N
Sbjct: 183 FRLRVRFKLGDLKFRRIKPKVDCDDLRLPLSTSN 210
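
The percentages on the HSP statistics line are simply the identical and positive (identical or conservatively substituted) positions divided by the alignment length. As a hedged sketch, the hypothetical helper `parse_hsp_stats` below recomputes them from the counts; the regular expression is an assumption about the line layout used in this record and tolerates either spelling of the positives label.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, _, pos, pos_length, _ = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
    }

stats = parse_hsp_stats(
    "Identity = 50/214 (23.36%), Postives = 95/214 (44.39%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Swiss-Prot hit above this reproduces the reported 23.36% identity and 44.39% positives over 214 aligned positions.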

BLAST of Cp4.1LG08g09730 vs. NCBI nr
Match: XP_023539989.1 (uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo] >XP_023539990.1 uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 559 bits (1441), Expect = 6.62e-201
Identity = 280/280 (100.00%), Positives = 280/280 (100.00%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60
           MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG
Sbjct: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60

Query: 61  YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120
           YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPT
Sbjct: 61  YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120

Query: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180
           FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP
Sbjct: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180

Query: 181 FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWW 240
           FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWW
Sbjct: 181 FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWW 240

Query: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM
Sbjct: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280

BLAST of Cp4.1LG08g09730 vs. NCBI nr
Match: KAG7028149.1 (NDR1/HIN1-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 552 bits (1423), Expect = 4.42e-198
Identity = 276/278 (99.28%), Positives = 277/278 (99.64%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60
           MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG
Sbjct: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60

Query: 61  YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120
           YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMML+SIITWIILRPEIPT
Sbjct: 61  YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLTSIITWIILRPEIPT 120

Query: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180
           FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP
Sbjct: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180

Query: 181 FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWW 240
           FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKV FNLRFFVWTTFQSGSWW
Sbjct: 181 FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWW 240

Query: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAV 278
           TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAV
Sbjct: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAV 278

BLAST of Cp4.1LG08g09730 vs. NCBI nr
Match: XP_022941877.1 (uncharacterized protein LOC111447106 [Cucurbita moschata])

HSP 1 Score: 546 bits (1408), Expect = 7.66e-196
Identity = 276/282 (97.87%), Positives = 278/282 (98.58%), Query Frame = 0

Query: 1   MASSSDNQQSKSKS--TDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPY 60
           MASSSDNQQSKSKS  TDSQPLPPPSAAHNP PIYPPPTMGYPPAPHPGYPPAPGAYPPY
Sbjct: 1   MASSSDNQQSKSKSKSTDSQPLPPPSAAHNPAPIYPPPTMGYPPAPHPGYPPAPGAYPPY 60

Query: 61  NGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEI 120
           NGYAYAQAPPAAYYHNSPQNYAVEPFHA+FIRGIVTALIILVVLMML+SIITWIILRPEI
Sbjct: 61  NGYAYAQAPPAAYYHNSPQNYAVEPFHASFIRGIVTALIILVVLMMLTSIITWIILRPEI 120

Query: 121 PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA 180
           PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA
Sbjct: 121 PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA 180

Query: 181 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGS 240
           DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKV FNLRFFVWTTFQSGS
Sbjct: 181 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGS 240

Query: 241 WWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           WWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM
Sbjct: 241 WWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 282

BLAST of Cp4.1LG08g09730 vs. NCBI nr
Match: XP_023005718.1 (uncharacterized protein LOC111498631 [Cucurbita maxima])

HSP 1 Score: 544 bits (1402), Expect = 5.84e-195
Identity = 272/280 (97.14%), Positives = 274/280 (97.86%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60
           MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG
Sbjct: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60

Query: 61  YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120
           YAYAQAPP AYYHNSPQNY VEPFHAA IRGIVTALIILVVLMMLSSIITWIILRPEIPT
Sbjct: 61  YAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120

Query: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180
           FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP
Sbjct: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180

Query: 181 FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWW 240
           FFL VERTNLMRVRWTSSSPDDPG+WEETEEKLGKEKATRKV FNLRFFVWTTFQSGSWW
Sbjct: 181 FFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWW 240

Query: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           TRHVILRVFCDDLKIDFGTPNSVNGSFSA+GHHMHC VLM
Sbjct: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM 280

BLAST of Cp4.1LG08g09730 vs. NCBI nr
Match: KAG6596613.1 (hypothetical protein SDJN03_09793, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 424 bits (1090), Expect = 4.43e-148
Identity = 224/249 (89.96%), Positives = 230/249 (92.37%), Query Frame = 0

Query: 1   MASSSDNQQSKSKS--TDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPY 60
           MASSSDNQQSKSKS  TDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPY
Sbjct: 1   MASSSDNQQSKSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPY 60

Query: 61  NGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEI 120
           NGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMML+SIITWIILRPEI
Sbjct: 61  NGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLTSIITWIILRPEI 120

Query: 121 PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA 180
           PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA
Sbjct: 121 PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA 180

Query: 181 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGS 240
           DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRK            ++  S
Sbjct: 181 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRK-----------NYRLCS 238

Query: 241 WWTRHVILR 247
           W +R+ + R
Sbjct: 241 WVSRNCVWR 238

BLAST of Cp4.1LG08g09730 vs. ExPASy TrEMBL
Match: A0A6J1FNP1 (uncharacterized protein LOC111447106 OS=Cucurbita moschata OX=3662 GN=LOC111447106 PE=4 SV=1)

HSP 1 Score: 546 bits (1408), Expect = 3.71e-196
Identity = 276/282 (97.87%), Positives = 278/282 (98.58%), Query Frame = 0

Query: 1   MASSSDNQQSKSKS--TDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPY 60
           MASSSDNQQSKSKS  TDSQPLPPPSAAHNP PIYPPPTMGYPPAPHPGYPPAPGAYPPY
Sbjct: 1   MASSSDNQQSKSKSKSTDSQPLPPPSAAHNPAPIYPPPTMGYPPAPHPGYPPAPGAYPPY 60

Query: 61  NGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEI 120
           NGYAYAQAPPAAYYHNSPQNYAVEPFHA+FIRGIVTALIILVVLMML+SIITWIILRPEI
Sbjct: 61  NGYAYAQAPPAAYYHNSPQNYAVEPFHASFIRGIVTALIILVVLMMLTSIITWIILRPEI 120

Query: 121 PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA 180
           PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA
Sbjct: 121 PTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFA 180

Query: 181 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGS 240
           DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKV FNLRFFVWTTFQSGS
Sbjct: 181 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGS 240

Query: 241 WWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           WWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM
Sbjct: 241 WWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 282

BLAST of Cp4.1LG08g09730 vs. ExPASy TrEMBL
Match: A0A6J1L2Y7 (uncharacterized protein LOC111498631 OS=Cucurbita maxima OX=3661 GN=LOC111498631 PE=4 SV=1)

HSP 1 Score: 544 bits (1402), Expect = 2.83e-195
Identity = 272/280 (97.14%), Positives = 274/280 (97.86%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60
           MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG
Sbjct: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNG 60

Query: 61  YAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120
           YAYAQAPP AYYHNSPQNY VEPFHAA IRGIVTALIILVVLMMLSSIITWIILRPEIPT
Sbjct: 61  YAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPT 120

Query: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180
           FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP
Sbjct: 121 FRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 180

Query: 181 FFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGSWW 240
           FFL VERTNLMRVRWTSSSPDDPG+WEETEEKLGKEKATRKV FNLRFFVWTTFQSGSWW
Sbjct: 181 FFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWW 240

Query: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           TRHVILRVFCDDLKIDFGTPNSVNGSFSA+GHHMHC VLM
Sbjct: 241 TRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM 280

BLAST of Cp4.1LG08g09730 vs. ExPASy TrEMBL
Match: A0A0A0LGS8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G780530 PE=4 SV=1)

HSP 1 Score: 354 bits (909), Expect = 5.05e-120
Identity = 184/293 (62.80%), Positives = 225/293 (76.79%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT--------------MGYPPAPHP 60
           MASSS++QQS+SK+TD  P  P SA +NPPP+YPPPT              MGYPP P P
Sbjct: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP 60

Query: 61  GYPPAPGAYPPYNGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLS 120
           GYPPAPG YPPYN Y YAQAPPAAYY N+PQNY  +   A F+RGIVTALI+LV +M LS
Sbjct: 61  GYPPAPGNYPPYNTY-YAQAPPAAYY-NNPQNYRAQTVSAGFLRGIVTALILLVAVMTLS 120

Query: 121 SIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFV 180
           SIITWI+LRP+IP F+VD+  V+NFNISK NYSGNWN +L V+NPN KL +  +RIQ FV
Sbjct: 121 SIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFV 180

Query: 181 GYKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNL 240
            YK+NTLAMS+ADPFF+ VE+++ MRV+ TSSSPDDPGNW ETEEK+G+EKA+  V FNL
Sbjct: 181 NYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNL 240

Query: 241 RFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVL 279
           RFF WT F+SGSWWTR ++++VFC+DLK+ F  P + +G + A  H   C+VL
Sbjct: 241 RFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL 291

BLAST of Cp4.1LG08g09730 vs. ExPASy TrEMBL
Match: A0A1S3B6W4 (uncharacterized protein LOC103486674 OS=Cucumis melo OX=3656 GN=LOC103486674 PE=4 SV=1)

HSP 1 Score: 349 bits (895), Expect = 7.01e-118
Identity = 185/295 (62.71%), Positives = 223/295 (75.59%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT---------------MGYPPAPH 60
           MASSS++QQS+SK+TD  P  P SA +NPPP+YPPPT               MGYPPAPH
Sbjct: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPH 60

Query: 61  PGYPPAPGAYPPYNGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMML 120
           P YPPA G YPPYN Y YAQAPPAAYY N+PQNY      A F+RGIV ALI+LV +M L
Sbjct: 61  PRYPPATGNYPPYNAY-YAQAPPAAYY-NNPQNYRAGTISAGFLRGIVAALILLVAIMTL 120

Query: 121 SSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGF 180
           SSIITWIILRPE+P F+VD+  V+NFNISK NYSGNW+A++ VQNPN KLN+  +RIQ F
Sbjct: 121 SSIITWIILRPEVPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSF 180

Query: 181 VGYKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFN 240
           V YK NTLAMS+ADPFFL VE++  M+V+ TSSSPDDPGNW ETEEKLG+E+AT  V FN
Sbjct: 181 VDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFN 240

Query: 241 LRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           LRFF WTTF++GSWWTR V++RV C+D+K+ F  P + +  + A  H   C+VL+
Sbjct: 241 LRFFAWTTFRTGSWWTRRVVMRVSCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV 293

BLAST of Cp4.1LG08g09730 vs. ExPASy TrEMBL
Match: A0A5A7TLT1 (Protein YLS9 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004520 PE=4 SV=1)

HSP 1 Score: 349 bits (895), Expect = 1.03e-113
Identity = 185/295 (62.71%), Positives = 223/295 (75.59%), Query Frame = 0

Query: 1   MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT---------------MGYPPAPH 60
           MASSS++QQS+SK+TD  P  P SA +NPPP+YPPPT               MGYPPAPH
Sbjct: 306 MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPH 365

Query: 61  PGYPPAPGAYPPYNGYAYAQAPPAAYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMML 120
           P YPPA G YPPYN Y YAQAPPAAYY N+PQNY      A F+RGIV ALI+LV +M L
Sbjct: 366 PRYPPATGNYPPYNAY-YAQAPPAAYY-NNPQNYRAGTISAGFLRGIVAALILLVAIMTL 425

Query: 121 SSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGF 180
           SSIITWIILRPE+P F+VD+  V+NFNISK NYSGNW+A++ VQNPN KLN+  +RIQ F
Sbjct: 426 SSIITWIILRPEVPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSF 485

Query: 181 VGYKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFN 240
           V YK NTLAMS+ADPFFL VE++  M+V+ TSSSPDDPGNW ETEEKLG+E+AT  V FN
Sbjct: 486 VDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFN 545

Query: 241 LRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAHGHHMHCAVLM 280
           LRFF WTTF++GSWWTR V++RV C+D+K+ F  P + +  + A  H   C+VL+
Sbjct: 546 LRFFAWTTFRTGSWWTRRVVMRVSCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV 598

BLAST of Cp4.1LG08g09730 vs. TAIR 10
Match: AT3G52460.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 157.9 bits (398), Expect = 1.2e-38
Identity = 104/259 (40.15%), Positives = 146/259 (56.37%), Query Frame = 0

Query: 17  SQPLPPPSAAHNPPPIYP----PPTMGY-----PPAPHPGYPPAPGAYPPYNGYAYAQAP 76
           +QP PPP  +  PPP       PP MGY     PP P+P YP A     PY  Y YAQAP
Sbjct: 26  NQPPPPPPQSQPPPPQTQQQTYPPVMGYPGYHQPPPPYPNYPNA-----PYQQYPYAQAP 85

Query: 77  PAAYYHNS---PQNYAVE-PFHAAFIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRV 136
           PA+YY +S    QN   + P  + F+RGI T LI+LVVL+ +S+ ITW++LRP+IP F V
Sbjct: 86  PASYYGSSYPAQQNPVYQRPASSGFVRGIFTGLIVLVVLLCISTTITWLVLRPQIPLFSV 145

Query: 137 DTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGY-----KDNTLAMSFA 196
           +   V+NFN++   +S  W A L ++N N KL   F RIQG V +     +D  LA +F 
Sbjct: 146 NNFSVSNFNVTGPVFSAQWTANLTIENQNTKLKGYFDRIQGLVYHQNAVGEDEFLATAFF 205

Query: 197 DPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLRFFVWTTFQSGS 256
            P F+  +++ ++    T+   + P       +++ KE+ T  V F+LR  VW TF++  
Sbjct: 206 QPVFVETKKSVVIGETLTAGDKEQPKVPSWVVDEMKKERETGTVTFSLRMAVWVTFKTDG 265

Query: 257 WWTRHVILRVFCDDLKIDF 258
           W  R   L+VFC  LK+ F
Sbjct: 266 WAARESGLKVFCGKLKVGF 279

BLAST of Cp4.1LG08g09730 vs. TAIR 10
Match: AT2G27260.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 80.9 bits (198), Expect = 1.8e-15
Identity = 69/229 (30.13%), Positives = 114/229 (49.78%), Query Frame = 0

Query: 36  PTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPAAYYHNSPQNYAVEPF-HAAFIRGIVT 95
           P  GY P P+P YP      PP NGY    A  A  Y N    YA +P   A  IR +  
Sbjct: 7   PATGY-PYPYP-YPNPQQQQPPTNGYPNPAAGTAYPYQNHNPYYAPQPNPRAVIIRRLFI 66

Query: 96  ALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKK 155
                ++L+ L   I ++I+RP++P   +++L V+NFN+S +  SG W+  L  +NPN K
Sbjct: 67  VFTTFLLLLGLILFIFFLIVRPQLPDVNLNSLSVSNFNVSNNQVSGKWDLQLQFRNPNSK 126

Query: 156 LNLTFKRIQGFVGYKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLG 215
           ++L ++     + Y   +L+ +   PF    +   ++    + S     G      + +G
Sbjct: 127 MSLHYETALCAMYYNRVSLSETRLQPFDQGKKDQTVVNATLSVSGTYVDG---RLVDSIG 186

Query: 216 KEKATR-KVGFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNS 263
           KE++ +  V F+LR   + TF+ G++  R  +  V+CDD+ +  G P S
Sbjct: 187 KERSVKGNVEFDLRMISYVTFRYGAFRRRRYV-TVYCDDVAV--GVPVS 227

BLAST of Cp4.1LG08g09730 vs. TAIR 10
Match: AT5G22870.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 66.2 bits (160), Expect = 4.7e-11
Identity = 41/176 (23.30%), Positives = 84/176 (47.73%), Query Frame = 0

Query: 92  IVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNY-SGNWNATLVVQN 151
           I   ++ L+ +  +  +ITW+  +P+   + V+   V NFN++  N+ S  +  T+   N
Sbjct: 29  IFLVILTLIFMAAVGFLITWLETKPKKLRYTVENASVQNFNLTNDNHMSATFQFTIQSHN 88

Query: 152 PNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETE 211
           PN ++++ +  ++ FV +KD TLA    +PF     R N+ ++  T  + ++    +   
Sbjct: 89  PNHRISVYYSSVEIFVKFKDQTLAFDTVEPFH--QPRMNVKQIDETLIA-ENVAVSKSNG 148

Query: 212 EKLGKEKATRKVGFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGS 267
           + L  + +  K+GF +       F+ G W + H   ++ C  + +    PN    S
Sbjct: 149 KDLRSQNSLGKIGFEVFVKARVRFKVGIWKSSHRTAKIKCSHVTVSLSQPNKSQNS 201

BLAST of Cp4.1LG08g09730 vs. TAIR 10
Match: AT2G35980.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 52.8 bits (125), Expect = 5.3e-07
Identity = 50/214 (23.36%), Positives = 95/214 (44.39%), Query Frame = 0

Query: 54  AYPPYNGYAYAQA--PPA--AYYHNSPQNYAVEPFHAAFIRGIVTALIILVVLMMLSSII 113
           A  P NG  Y  +  PPA   YY             + F++ I++    L+V++ ++++I
Sbjct: 3   AEQPLNGAFYGPSVPPPAPKGYYRRGHGRGCGCCLLSLFVKVIIS----LIVILGVAALI 62

Query: 114 TWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWN--ATLVVQNPNKKLNLTFKRIQGFVG 173
            W+I+RP    F V    +T F+ +  +    +N   T+ V+NPNK++ L + RI+    
Sbjct: 63  FWLIVRPRAIKFHVTDASLTRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAY 122

Query: 174 YKDNTLAMSFADPFFLAVERTNLMRVRWTSSSPDDPGNWEETEEKLGKEKATRKVGFNLR 233
           Y+    +     PF+   + T ++   +   +       +     L  E+ +      ++
Sbjct: 123 YEGKRFSTITLTPFYQGHKNTTVLTPTFQGQNLVIFNAGQ--SRTLNAERISGVYNIEIK 182

Query: 234 FFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPN 262
           F +   F+ G    R +  +V CDDL++   T N
Sbjct: 183 FRLRVRFKLGDLKFRRIKPKVDCDDLRLPLSTSN 210

BLAST of Cp4.1LG08g09730 vs. TAIR 10
Match: AT3G52470.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 51.6 bits (122), Expect = 1.2e-06
Identity = 26/93 (27.96%), Positives = 52/93 (55.91%), Query Frame = 0

Query: 89  IRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSN-YSGNWNATLV 148
           +R +  A+I  +V+++++  + W+ILRP  P F +    V  FN+S+ N  + N+  T+ 
Sbjct: 16  VRKLCAAIIAFIVIVLITIFLVWVILRPTKPRFVLQDATVYAFNLSQPNLLTSNFQVTIA 75

Query: 149 VQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP 181
            +NPN K+ + + R+  +  Y +  + +  A P
Sbjct: 76  SRNPNSKIGIYYDRLHVYATYMNQQITLRTAIP 108

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value    Identity (%)  Description
Q9SJ52          7.5e-06    23.36         NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_023539989.1  6.62e-201  100.00        uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo] >XP_023539990....
KAG7028149.1    4.42e-198  99.28         NDR1/HIN1-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma]
XP_022941877.1  7.66e-196  97.87         uncharacterized protein LOC111447106 [Cucurbita moschata]
XP_023005718.1  5.84e-195  97.14         uncharacterized protein LOC111498631 [Cucurbita maxima]
KAG6596613.1    4.43e-148  89.96         hypothetical protein SDJN03_09793, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1FNP1      3.71e-196  97.87         uncharacterized protein LOC111447106 OS=Cucurbita moschata OX=3662 GN=LOC1114471...
A0A6J1L2Y7      2.83e-195  97.14         uncharacterized protein LOC111498631 OS=Cucurbita maxima OX=3661 GN=LOC111498631...
A0A0A0LGS8      5.05e-120  62.80         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G780530 PE=4 SV=1
A0A1S3B6W4      7.01e-118  62.71         uncharacterized protein LOC103486674 OS=Cucumis melo OX=3656 GN=LOC103486674 PE=...
A0A5A7TLT1      1.03e-113  62.71         Protein YLS9 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004520 ...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT3G52460.1     1.2e-38    40.15         hydroxyproline-rich glycoprotein family protein
AT2G27260.1     1.8e-15    30.13         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT5G22870.1     4.7e-11    23.30         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT2G35980.1     5.3e-07    23.36         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G52470.1     1.2e-06    27.96         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term      Source Description                                                         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 22..39
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 1..39
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 1..21
None      No IPR available  PANTHER      PTHR31852        LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 45..258
None      No IPR available  PANTHER      PTHR31852:SF183  HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN                            coord: 45..258
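
To put the alignment coordinates in context, the fraction of the protein covered by each annotation can be computed directly. This is an illustrative sketch only: the function name `coverage` is hypothetical, the `coord` ranges are assumed to be 1-based, inclusive residue positions, and the protein length of 280 is taken from the protein sequence of this record.

```python
def coverage(start, end, protein_length):
    """Fraction of the protein covered by a 1-based, inclusive residue range."""
    return (end - start + 1) / protein_length

PROTEIN_LENGTH = 280  # residues in the protein sequence of this record

panther = coverage(45, 258, PROTEIN_LENGTH)   # PANTHER PTHR31852 match
disorder = coverage(1, 39, PROTEIN_LENGTH)    # longest mobidb-lite region
print(f"PANTHER: {panther:.1%}, disorder: {disorder:.1%}")
```

Under these assumptions the PANTHER family match spans 214 of 280 residues, and the predicted disordered region is confined to the first 39 residues at the N-terminus.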

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g09730.1  Cp4.1LG08g09730.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane