CSPI03G35560 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G35560
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Location: Chr3: 30999758 .. 31001581 (-)
RNA-Seq Expression: CSPI03G35560
Synteny: CSPI03G35560
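
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the function name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr3: 30999758 .. 31001581 (-)'
    into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Chr3: 30999758 .. 31001581 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature lies on the minus strand of Chr3 and spans 1824 bases.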
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAACCTTCCGTTTTTTTTCTAAGAAAAGCCTTAACGGATCTCCATATTCTAAATCCTCCTCCATTTCTGATTTCTCCTTTACCATTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCCAAATCCTCTCACAGGGAGAGAGAAAAAGCCACAAACAGAGCATCCTTTCCAATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCACCTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCATGGGTACTCTCCGGCGATGGGGTACCCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATACGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCTCAAAACTACAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTATTGGTGGCTGTAATGACTCTGTCCAGCATAATCACATGGATCGTCCTCCGCCCTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGAATGGGAGTCTGACGGTTGAAAATCCGAACCATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATACGTTGGCAATGTCTTACGCGGACCCATTTTTTATAGATGTGGAGAAGAGCAGTCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAAACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAGTGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGTCATGAAAGTGTTTTGTGAAGATTTGAAGCTGGCCTTCACCGGACCCGCCGCCACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAGAAGAATTCTTCGGTAAAATGTTTCTTCTTCTTTTTTTCTCTAACAAACGTTTCTTAAGCTTTCATAGTATTAATTAACTGTACATATTATAGCTTCTTTCTATTTCTTCCAATGTTTTGTAACTTTCTTTTTTTTTTTTCAACAATTAATTTTCATCTAAACTTTTTCCTTTTTTTAACAACAATCGGTAAAAGTCACTATCATTATAAATAATTCCATTTAATTCAATGTTACTTATTAAATTATTATTCACTAAAGTTTGGAGGCTAAATTGCAATTTTTAGGAAAGTTCAATAATTTTTCATCTAAATTTAACAAGTAATCATTTGTTATCTACCCATACCCATACCCATCAATTGATCTTTTGTGTTTAGTTTTAAGTAGTTAATTAATGAAGGAAAAAAAAAAAGATCAAAGTGTTTTAAAAGACTCTCCTAAATTATTACTATCTAACTCATTGACCCATAATCTATAGAAATACACACATTACCATATAGGTATAACTTAACCACCATATGTTTATCTTTCATAAGTATTTTTAAAGTCATACAAACTATTTAGGATGATTTTGTGAATTATTTATAGGAGCGGTATTTCAAATTTCTTGAAAGAAATGAAAATAAGGAGATTTTTTTTTTTTTTTTTTCTTTTGTCAGGAAAGTAGGCAAATTGTGTGTGGGGGCTTGAAGGAAAAGGGGTATAGCAGAGAGATTTGCTTTCATGTTTGGAGATTATGAAACATTATATTCAACTTTAGGGATCTTTTTTTTTTTTTTT

mRNA sequence

TAACCTTCCGTTTTTTTTCTAAGAAAAGCCTTAACGGATCTCCATATTCTAAATCCTCCTCCATTTCTGATTTCTCCTTTACCATTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCCAAATCCTCTCACAGGGAGAGAGAAAAAGCCACAAACAGAGCATCCTTTCCAATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCACCTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCATGGGTACTCTCCGGCGATGGGGTACCCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATACGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCTCAAAACTACAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTATTGGTGGCTGTAATGACTCTGTCCAGCATAATCACATGGATCGTCCTCCGCCCTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGAATGGGAGTCTGACGGTTGAAAATCCGAACCATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATACGTTGGCAATGTCTTACGCGGACCCATTTTTTATAGATGTGGAGAAGAGCAGTCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAAACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAGTGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGTCATGAAAGTGTTTTGTGAAGATTTGAAGCTGGCCTTCACCGGACCCGCCGCCACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAGAAGAATTCTTCGGAAAGTAGGCAAATTGTGTGTGGGGGCTTGAAGGAAAAGGGGTATAGCAGAGAGATTTGCTTTCATGTTTGGAGATTATGAAACATTATATTCAACTTTAGGGATCTTTTTTTTTTTTTTT

Coding sequence (CDS)

ATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCACCTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCATGGGTACTCTCCGGCGATGGGGTACCCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATACGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCTCAAAACTACAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTATTGGTGGCTGTAATGACTCTGTCCAGCATAATCACATGGATCGTCCTCCGCCCTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGAATGGGAGTCTGACGGTTGAAAATCCGAACCATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATACGTTGGCAATGTCTTACGCGGACCCATTTTTTATAGATGTGGAGAAGAGCAGTCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAAACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAGTGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGTCATGAAAGTGTTTTGTGAAGATTTGAAGCTGGCCTTCACCGGACCCGCCGCCACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAG

Protein sequence

MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF*
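
The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. This is an illustrative sketch that assumes the standard genetic code; the names CODON_TABLE and translate are hypothetical and are not part of the database. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven and last six codons of the CDS shown above.
cds_start = "ATGGCTTCCTCATCGGAGGAT"
cds_end = "TGTTCTGTTCTCTTCTAG"
print(translate(cds_start))  # MASSSED
print(translate(cds_end))    # CSVLF*
```

Both fragments agree with the corresponding ends of the protein sequence above, including the terminal stop shown as "*".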
Homology
BLAST of CSPI03G35560 vs. ExPASy Swiss-Prot
Match: Q9SJ52 (NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1)

HSP 1 Score: 49.7 bits (117), Expect = 6.6e-05
Identity = 48/200 (24.00%), Positives = 87/200 (43.50%), Query Frame = 0

Query: 75  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFK 134
           Y    PP A     +    +      L   V  +I L+ ++ ++++I W+++RP+   F 
Sbjct: 12  YGPSVPPPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFH 71

Query: 135 VDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIERIQSFVNYKENTLAMSYADP 194
           V   S++ F+ +  +    +N +LT  V NPN ++ +  +RI++   Y+    +     P
Sbjct: 72  VTDASLTRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTP 131

Query: 195 FFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFR 254
           F+       Q     T  +P   G  L          +  E+ SG  +  ++F     F+
Sbjct: 132 FY-------QGHKNTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFK 191

Query: 255 SGSWWTRRIVMKVFCEDLKL 268
            G    RRI  KV C+DL+L
Sbjct: 192 LGDLKFRRIKPKVDCDDLRL 204
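
The percentage in each HSP summary line is the number of identical positions divided by the alignment length. A minimal sketch for recomputing it from the text follows; the helper name parse_identity is hypothetical and the regular expression only targets the "Identity = a/b" field as it appears on this page.

```python
import re

def parse_identity(line):
    """Return (identical, alignment_length, percent) from an HSP summary line."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no Identity field found")
    identical, length = int(m.group(1)), int(m.group(2))
    return identical, length, 100.0 * identical / length

identical, length, pct = parse_identity("Identity = 48/200 (24.00%)")
print(f"{identical}/{length} = {pct:.2f}%")
```

For the Swiss-Prot hit above this gives 48/200 = 24.00%, matching the value reported in the summary line.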

BLAST of CSPI03G35560 vs. ExPASy TrEMBL
Match: A0A0A0LGS8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G780530 PE=4 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 7.5e-161
Identity = 292/292 (100.00%), Positives = 292/292 (100.00%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP 60
           MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP
Sbjct: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP 60

Query: 61  GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI 120
           GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI
Sbjct: 61  GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI 120

Query: 121 ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY 180
           ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY
Sbjct: 121 ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY 180

Query: 181 KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF 240
           KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF
Sbjct: 181 KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF 240

Query: 241 FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF 293
           FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
Sbjct: 241 FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF 292

BLAST of CSPI03G35560 vs. ExPASy TrEMBL
Match: A0A5A7TLT1 (Protein YLS9 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004520 PE=4 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 7.3e-140
Identity = 253/292 (86.64%), Positives = 274/292 (93.84%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPP 60
           MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P 
Sbjct: 306 MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPH 365

Query: 61  PGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSS 120
           P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAGFLRGIV ALILLVA+MTLSS
Sbjct: 366 PRYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTISAGFLRGIVAALILLVAIMTLSS 425

Query: 121 IITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVN 180
           IITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+
Sbjct: 426 IITWIILRPEVPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVD 485

Query: 181 YKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLR 240
           YK+NTLAMSYADPFF+DVEKS QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLR
Sbjct: 486 YKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLR 545

Query: 241 FFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL 292
           FFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Sbjct: 546 FFAWTTFRTGSWWTRRVVMRVSCEDMKLVFTGPAAGHAVYLADEHSKTCSVL 597

BLAST of CSPI03G35560 vs. ExPASy TrEMBL
Match: A0A1S3B6W4 (uncharacterized protein LOC103486674 OS=Cucumis melo OX=3656 GN=LOC103486674 PE=4 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 7.3e-140
Identity = 253/292 (86.64%), Positives = 274/292 (93.84%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPP 60
           MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P 
Sbjct: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPH 60

Query: 61  PGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSS 120
           P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAGFLRGIV ALILLVA+MTLSS
Sbjct: 61  PRYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTISAGFLRGIVAALILLVAIMTLSS 120

Query: 121 IITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVN 180
           IITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+
Sbjct: 121 IITWIILRPEVPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVD 180

Query: 181 YKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLR 240
           YK+NTLAMSYADPFF+DVEKS QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLR
Sbjct: 181 YKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLR 240

Query: 241 FFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL 292
           FFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Sbjct: 241 FFAWTTFRTGSWWTRRVVMRVSCEDMKLVFTGPAAGHAVYLADEHSKTCSVL 292

BLAST of CSPI03G35560 vs. ExPASy TrEMBL
Match: A0A6J1J6I9 (uncharacterized protein LOC111481675 OS=Cucurbita maxima OX=3661 GN=LOC111481675 PE=4 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 8.7e-109
Identity = 214/304 (70.39%), Positives = 246/304 (80.92%), Query Frame = 0

Query: 1   MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPP 60
           MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGY PAMGYPP 
Sbjct: 65  MASSSVDQQHFQSQSKPTDPPPPLPPSAGNNPPPIYPPPTLGY-PPHAHGYPPAMGYPPA 124

Query: 61  PPPGYPPAPGNYPPYNTY-YAQAPPAAYY-------NNPQNYRAQTVSAGFLRGIVTALI 120
           P PGYPPAPGNYPPYN Y Y QAPPAAYY       NNPQ YR +T  AGFLRGI  AL+
Sbjct: 125 PHPGYPPAPGNYPPYNAYAYTQAPPAAYYNSNNNNNNNPQYYRQETAGAGFLRGIFAALL 184

Query: 121 LLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTV 180
           LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NPNHKL +
Sbjct: 185 LLVVIMTMSSIITWIILRPEIPNFKVDSFSVANFNISKSNYSGIWDVKVTVQNPNHKLNL 244

Query: 181 NIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEK 240
           + ERI+SFV+Y +NT+A S++DPFF+D+EKS QM VK+TSSSPDDPGNW++TEEK+ +E+
Sbjct: 245 HFERIRSFVDYSDNTVATSFSDPFFLDMEKSKQMLVKMTSSSPDDPGNWVQTEEKLERER 304

Query: 241 ASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKT 292
           A+GTVSF LR  AWT FR  SGS WTRR++++VFCEDLKL FTG   T GVY   AH KT
Sbjct: 305 ATGTVSFTLRLLAWTTFRSGSGSGWTRRVILRVFCEDLKLVFTG-HTTDGVYSPGAHPKT 364

BLAST of CSPI03G35560 vs. ExPASy TrEMBL
Match: A0A6J1F415 (uncharacterized protein LOC111442188 OS=Cucurbita moschata OX=3662 GN=LOC111442188 PE=4 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 5.7e-108
Identity = 213/303 (70.30%), Positives = 245/303 (80.86%), Query Frame = 0

Query: 1   MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPP 60
           MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGY PAMGYPP 
Sbjct: 75  MASSSVDQQHFQSQSKPTDPPPPLPPSAGNNPPPIYPPPTLGY-PPHAHGYPPAMGYPPA 134

Query: 61  PPPGYPPAPGNYPPYNTY-YAQAPPAAYY------NNPQNYRAQTVSAGFLRGIVTALIL 120
           P PGYPPAPGNYPPYN Y Y QAPPAAYY      NNPQ YR +T  AGFLRGI  AL+L
Sbjct: 135 PHPGYPPAPGNYPPYNAYAYTQAPPAAYYNSNNNNNNPQYYRQETAGAGFLRGIFAALLL 194

Query: 121 LVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVN 180
           LV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NPNHKL ++
Sbjct: 195 LVVIMTMSSIITWIILRPEIPNFKVDSFSVTNFNISKSNYSGIWDIKVTVQNPNHKLNLH 254

Query: 181 IERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKA 240
            ERI+SFV+Y +NT+A S++DPFF+D+EKS QM+VK+TSSSPDDPGNW +TEEK+ +E+ 
Sbjct: 255 FERIRSFVDYSDNTVATSFSDPFFLDMEKSRQMQVKMTSSSPDDPGNWAQTEEKLERERT 314

Query: 241 SGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTC 292
           +GTVSF LR  AWT FR  SGS WTRR++++VFCEDLKL FTG   T GVY   A SKTC
Sbjct: 315 TGTVSFTLRLLAWTTFRSGSGSGWTRRVILRVFCEDLKLVFTG-HTTDGVYSPGAQSKTC 374

BLAST of CSPI03G35560 vs. NCBI nr
Match: XP_011652032.1 (uncharacterized protein LOC105434983 [Cucumis sativus] >KAE8651112.1 hypothetical protein Csa_002611 [Cucumis sativus])

HSP 1 Score: 576.2 bits (1484), Expect = 1.6e-160
Identity = 292/292 (100.00%), Positives = 292/292 (100.00%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP 60
           MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP
Sbjct: 68  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP 127

Query: 61  GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI 120
           GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI
Sbjct: 128 GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI 187

Query: 121 ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY 180
           ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY
Sbjct: 188 ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY 247

Query: 181 KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF 240
           KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF
Sbjct: 248 KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF 307

Query: 241 FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF 293
           FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
Sbjct: 308 FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF 359

BLAST of CSPI03G35560 vs. NCBI nr
Match: XP_008442912.1 (PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo])

HSP 1 Score: 506.5 bits (1303), Expect = 1.5e-139
Identity = 253/292 (86.64%), Positives = 274/292 (93.84%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPP 60
           MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P 
Sbjct: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPH 60

Query: 61  PGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSS 120
           P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAGFLRGIV ALILLVA+MTLSS
Sbjct: 61  PRYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTISAGFLRGIVAALILLVAIMTLSS 120

Query: 121 IITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVN 180
           IITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+
Sbjct: 121 IITWIILRPEVPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVD 180

Query: 181 YKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLR 240
           YK+NTLAMSYADPFF+DVEKS QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLR
Sbjct: 181 YKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLR 240

Query: 241 FFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL 292
           FFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Sbjct: 241 FFAWTTFRTGSWWTRRVVMRVSCEDMKLVFTGPAAGHAVYLADEHSKTCSVL 292

BLAST of CSPI03G35560 vs. NCBI nr
Match: KAA0043818.1 (protein YLS9 [Cucumis melo var. makuwa] >TYK25314.1 protein YLS9 [Cucumis melo var. makuwa])

HSP 1 Score: 506.5 bits (1303), Expect = 1.5e-139
Identity = 253/292 (86.64%), Positives = 274/292 (93.84%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPP 60
           MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P 
Sbjct: 306 MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPH 365

Query: 61  PGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSS 120
           P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAGFLRGIV ALILLVA+MTLSS
Sbjct: 366 PRYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTISAGFLRGIVAALILLVAIMTLSS 425

Query: 121 IITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVN 180
           IITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+
Sbjct: 426 IITWIILRPEVPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVD 485

Query: 181 YKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLR 240
           YK+NTLAMSYADPFF+DVEKS QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLR
Sbjct: 486 YKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLR 545

Query: 241 FFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL 292
           FFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Sbjct: 546 FFAWTTFRTGSWWTRRVVMRVSCEDMKLVFTGPAAGHAVYLADEHSKTCSVL 597

BLAST of CSPI03G35560 vs. NCBI nr
Match: XP_038905898.1 (uncharacterized protein LOC120091828 [Benincasa hispida])

HSP 1 Score: 464.9 bits (1195), Expect = 5.0e-127
Identity = 231/291 (79.38%), Positives = 259/291 (89.00%), Query Frame = 0

Query: 1   MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPP 60
           MASSS+D QSQSKATDPPP  P SAGNNPPPVYPPPTLGYPPP GH Y PAMGYPP P P
Sbjct: 62  MASSSDDHQSQSKATDPPPMPPPSAGNNPPPVYPPPTLGYPPPQGHCYPPAMGYPPAPHP 121

Query: 61  GYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSI 120
           GYPPAPGNYPPYN YYAQAPPAAYYNN QNYRA+TV+ GFLRGIVTALIL VA+MTLSSI
Sbjct: 122 GYPPAPGNYPPYNPYYAQAPPAAYYNNHQNYRAETVNTGFLRGIVTALILFVAIMTLSSI 181

Query: 121 ITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY 180
           +TWI+LRP+IPVF++DSFSV NFNISK NYSGNW+G++TV+NPNH+L VN+ER+QSFV+Y
Sbjct: 182 LTWIILRPEIPVFRMDSFSVVNFNISKSNYSGNWDGNMTVQNPNHRLNVNVERVQSFVDY 241

Query: 181 KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRF 240
           K+NTLAMSY DPFF+DVEKS QMRVKLTSSSPDDPG+W ETE+K+GQEKA+GTVSFNLRF
Sbjct: 242 KDNTLAMSYGDPFFLDVEKSIQMRVKLTSSSPDDPGSWAETEDKLGQEKATGTVSFNLRF 301

Query: 241 FAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL 292
            AWT FR GSWWTRR+V++VFCEDLKL F GPAA   VY  + + K CSVL
Sbjct: 302 IAWTTFRYGSWWTRRVVIRVFCEDLKLVFAGPAAGKVVYSPNVNPKICSVL 352

BLAST of CSPI03G35560 vs. NCBI nr
Match: XP_022983003.1 (uncharacterized protein LOC111481675 [Cucurbita maxima])

HSP 1 Score: 403.3 bits (1035), Expect = 1.8e-108
Identity = 214/304 (70.39%), Positives = 246/304 (80.92%), Query Frame = 0

Query: 1   MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPP 60
           MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGY PAMGYPP 
Sbjct: 65  MASSSVDQQHFQSQSKPTDPPPPLPPSAGNNPPPIYPPPTLGY-PPHAHGYPPAMGYPPA 124

Query: 61  PPPGYPPAPGNYPPYNTY-YAQAPPAAYY-------NNPQNYRAQTVSAGFLRGIVTALI 120
           P PGYPPAPGNYPPYN Y Y QAPPAAYY       NNPQ YR +T  AGFLRGI  AL+
Sbjct: 125 PHPGYPPAPGNYPPYNAYAYTQAPPAAYYNSNNNNNNNPQYYRQETAGAGFLRGIFAALL 184

Query: 121 LLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTV 180
           LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NPNHKL +
Sbjct: 185 LLVVIMTMSSIITWIILRPEIPNFKVDSFSVANFNISKSNYSGIWDVKVTVQNPNHKLNL 244

Query: 181 NIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEK 240
           + ERI+SFV+Y +NT+A S++DPFF+D+EKS QM VK+TSSSPDDPGNW++TEEK+ +E+
Sbjct: 245 HFERIRSFVDYSDNTVATSFSDPFFLDMEKSKQMLVKMTSSSPDDPGNWVQTEEKLERER 304

Query: 241 ASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKT 292
           A+GTVSF LR  AWT FR  SGS WTRR++++VFCEDLKL FTG   T GVY   AH KT
Sbjct: 305 ATGTVSFTLRLLAWTTFRSGSGSGWTRRVILRVFCEDLKLVFTG-HTTDGVYSPGAHPKT 364

BLAST of CSPI03G35560 vs. TAIR 10
Match: AT3G52460.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 159.5 bits (402), Expect = 4.2e-39
Identity = 120/298 (40.27%), Positives = 168/298 (56.38%), Query Frame = 0

Query: 4   SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP--HGHGYSPAMGYP--PPPP 63
           S  ++++Q K    P  +     N PPP  PPP    PPP      Y P MGYP    PP
Sbjct: 3   SPPEEETQPKPDTGPGQNSERDINQPPP--PPPQSQPPPPQTQQQTYPPVMGYPGYHQPP 62

Query: 64  PGYPPAPGNYP--PYNTY-YAQAPPAAYYNNPQNYRAQ-------TVSAGFLRGIVTALI 123
           P YP    NYP  PY  Y YAQAPPA+YY +  +Y AQ         S+GF+RGI T LI
Sbjct: 63  PPYP----NYPNAPYQQYPYAQAPPASYYGS--SYPAQQNPVYQRPASSGFVRGIFTGLI 122

Query: 124 LLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTV 183
           +LV ++ +S+ ITW+VLRPQIP+F V++FSVSNFN++   +S  W  +LT+EN N KL  
Sbjct: 123 VLVVLLCISTTITWLVLRPQIPLFSVNNFSVSNFNVTGPVFSAQWTANLTIENQNTKLKG 182

Query: 184 NIERIQSFVNY-----KENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDP--GNWLETE 243
             +RIQ  V +     ++  LA ++  P F++ +KS  +   LT+   + P   +W+  E
Sbjct: 183 YFDRIQGLVYHQNAVGEDEFLATAFFQPVFVETKKSVVIGETLTAGDKEQPKVPSWVVDE 242

Query: 244 EKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYL 281
            K  +E+ +GTV+F+LR   W  F++  W  R   +KVFC  LK+ F G +    V L
Sbjct: 243 MK--KERETGTVTFSLRMAVWVTFKTDGWAARESGLKVFCGKLKVGFEGISGNGAVLL 290

BLAST of CSPI03G35560 vs. TAIR 10
Match: AT2G27260.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 75.9 bits (185), Expect = 6.1e-14
Identity = 77/237 (32.49%), Positives = 119/237 (50.21%), Query Frame = 0

Query: 36  PTLGYPPPHGHGYSPAMGYPPPPPPGYP-PAPGNYPPY---NTYYAQAPPAAYYNNPQNY 95
           P  GYP P+ +   P      PP  GYP PA G   PY   N YYA  P         N 
Sbjct: 7   PATGYPYPYPY---PNPQQQQPPTNGYPNPAAGTAYPYQNHNPYYAPQP---------NP 66

Query: 96  RAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYS 155
           RA  +   F+  + T  +LL+ ++     I ++++RPQ+P   ++S SVSNFN+S    S
Sbjct: 67  RAVIIRRLFI--VFTTFLLLLGLIL---FIFFLIVRPQLPDVNLNSLSVSNFNVSNNQVS 126

Query: 156 GNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSS 215
           G W+  L   NPN K++++ E     + Y   +L+ +   PF  D  K  Q  V  T S 
Sbjct: 127 GKWDLQLQFRNPNSKMSLHYETALCAMYYNRVSLSETRLQPF--DQGKKDQTVVNATLSV 186

Query: 216 PDDPGNWLETE--EKVGQEKA-SGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDL 266
               G +++    + +G+E++  G V F+LR  ++  FR G++  RR V  V+C+D+
Sbjct: 187 ---SGTYVDGRLVDSIGKERSVKGNVEFDLRMISYVTFRYGAFRRRRYV-TVYCDDV 220

BLAST of CSPI03G35560 vs. TAIR 10
Match: AT5G22870.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 59.7 bits (143), Expect = 4.6e-09
Identity = 39/173 (22.54%), Positives = 85/173 (49.13%), Query Frame = 0

Query: 104 IVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNY-SGNWNGSLTVEN 163
           I   ++ L+ +  +  +ITW+  +P+   + V++ SV NFN++  N+ S  +  ++   N
Sbjct: 29  IFLVILTLIFMAAVGFLITWLETKPKKLRYTVENASVQNFNLTNDNHMSATFQFTIQSHN 88

Query: 164 PNHKLTVNIERIQSFVNYKENTLAMSYADPFF---IDVEKSSQMRVKLTSSSPDDPGNWL 223
           PNH+++V    ++ FV +K+ TLA    +PF    ++V++  +  +    +     G  L
Sbjct: 89  PNHRISVYYSSVEIFVKFKDQTLAFDTVEPFHQPRMNVKQIDETLIAENVAVSKSNGKDL 148

Query: 224 ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGP 273
            ++  +G+      + F +   A   F+ G W +     K+ C  + ++ + P
Sbjct: 149 RSQNSLGK------IGFEVFVKARVRFKVGIWKSSHRTAKIKCSHVTVSLSQP 195

BLAST of CSPI03G35560 vs. TAIR 10
Match: AT2G35980.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 49.7 bits (117), Expect = 4.7e-06
Identity = 48/200 (24.00%), Positives = 87/200 (43.50%), Query Frame = 0

Query: 75  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFK 134
           Y    PP A     +    +      L   V  +I L+ ++ ++++I W+++RP+   F 
Sbjct: 12  YGPSVPPPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFH 71

Query: 135 VDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIERIQSFVNYKENTLAMSYADP 194
           V   S++ F+ +  +    +N +LT  V NPN ++ +  +RI++   Y+    +     P
Sbjct: 72  VTDASLTRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTP 131

Query: 195 FFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFR 254
           F+       Q     T  +P   G  L          +  E+ SG  +  ++F     F+
Sbjct: 132 FY-------QGHKNTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFK 191

Query: 255 SGSWWTRRIVMKVFCEDLKL 268
            G    RRI  KV C+DL+L
Sbjct: 192 LGDLKFRRIKPKVDCDDLRL 204

The following BLAST results are available for this feature:

BLAST of CSPI03G35560 vs. ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q9SJ52          6.6e-05   24.00     NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1

BLAST of CSPI03G35560 vs. ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A0A0LGS8      7.5e-161  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G780530 PE=4 SV=1
A0A5A7TLT1      7.3e-140  86.64     Protein YLS9 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004520 ...
A0A1S3B6W4      7.3e-140  86.64     uncharacterized protein LOC103486674 OS=Cucumis melo OX=3656 GN=LOC103486674 PE=...
A0A6J1J6I9      8.7e-109  70.39     uncharacterized protein LOC111481675 OS=Cucurbita maxima OX=3661 GN=LOC111481675...
A0A6J1F415      5.7e-108  70.30     uncharacterized protein LOC111442188 OS=Cucurbita moschata OX=3662 GN=LOC1114421...

BLAST of CSPI03G35560 vs. NCBI nr
Match Name      E-value   Identity  Description
XP_011652032.1  1.6e-160  100.00    uncharacterized protein LOC105434983 [Cucumis sativus] >KAE8651112.1 hypothetica...
XP_008442912.1  1.5e-139  86.64     PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]
KAA0043818.1    1.5e-139  86.64     protein YLS9 [Cucumis melo var. makuwa] >TYK25314.1 protein YLS9 [Cucumis melo v...
XP_038905898.1  5.0e-127  79.38     uncharacterized protein LOC120091828 [Benincasa hispida]
XP_022983003.1  1.8e-108  70.39     uncharacterized protein LOC111481675 [Cucurbita maxima]

BLAST of CSPI03G35560 vs. TAIR 10
Match Name      E-value   Identity  Description
AT3G52460.1     4.2e-39   40.27     hydroxyproline-rich glycoprotein family protein
AT2G27260.1     6.1e-14   32.49     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT5G22870.1     4.6e-09   22.54     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT2G35980.1     4.7e-06   24.00     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term      Source Description                                                         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 15..52
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 1..52
None      No IPR available  PANTHER      PTHR31852        LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 56..271
None      No IPR available  PANTHER      PTHR31852:SF183  HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN                            coord: 56..271

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G35560.1  CSPI03G35560.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane