Csa5G056080 (gene) Cucumber (Chinese Long) v2

NameCsa5G056080
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionHydroxyproline-rich glycoprotein-like protein
LocationChr5 : 1816221 .. 1818581 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAATGCCATCGGGAAATGTTGGTGTACCGGATAAAGTTTCGTTTCAGAGTGGTGGTGGAGTTGCAGTGAGTGGTGGTGGTGGCGAGATCCATCAGCACCATCCCCGCCCTTGGTTTCCCGATGAGCGTGATGGGTTTATCTCATGGTTGCGCGGGGAATTTGCTGCCTCGAATGCGATAATTGATGCGCTTTGCCATCATTTACGTGCTGTAGGAGAGCCTGGGGAATATGATATGGTTATTGGATGTATACAGCAACGGCGATGTAATTGGACACCGGTGCTTCATATGCAGCAGTACTTTTCAGTTGCAGAAGTGATGTATGCCCTTCAGCAGGTAACCTCAAGGAGGCAGCAGAGGTATATGGACCCTGTGAAAGTGGGGCCGAAGTTGTATAGAAGACCTGGGCCAGGGTTTAAGCAGCAGCAGGGCCATCGGGCTGAAGCCACAGTCAAGGAAGAGACAATCACTTGTGCAGAGTCATGTAATGGTGGGAACTCTTCAACTTTTGTAAGCTCTAGGAAGGTGGAGCAAGTAAGTAATACGTGTGATGAAAGTAAGGCATCAGGGGAGGATGAAAAATTGAGCGAAAAAGATTCAGGGTCAGCTGTGGACAATAAAGGTAATTTCTTTTTGTTCTCCTATCAATTCTTAATAGTGTGTGAAATTTGTTTTGGGCTAGATTATAAGTTTAATCTATGGACTGTCTGATTCTGGACTTTGAATGTTTAGGACATTTTAGACATTTCTTAAAAAGTTTAATGATCTATAGGAAACAAGCTTTGGAATTGCTGGGACCAAACTTCTTATTTTATCTTGGTTTGTTTAAATTGTACTTTCACATTTTTTCTAACATATATAAACTTTTAACTTTTTTTATTCTGCTGCTTTCATGATACTAGTTATTTTGGTTTGTCAAGGATTGATATTTTCCATTTTTAGTATAGTATGGCAGTATGACTTTGAGTGTAATACATGACCTCGTTAAAATAACGAAATGCAATTGTTTCCTCATCTTATTATCATCTAATAATTCATGGAATAAATAGTATTACGGATTTGGTGTTAACTGTTTAAGGCATTCCGTGCAGTGCATTGATCAAGAAAATTAAACAGAAAGGGATTAATACGTTGATTGTATGATCGGTTTGTTTGTCTTTTGGGTTACACTATAAACTGTTCTCCATTTATAAGATTTTAATGAATTGATTTTATTGTATGAAATGCTTAATTTTTTTATGGGTTATAACTACAAGTATTTTTTGGATGAAAATTAAATTTCAAACTAGGAGCCTAGGAGGGGTTGATAACATTATCAAATTGGAAAATTTAGCGGTTTTCTTGAACCCCTAATAAAATCCTTGTTTCATTGCAATATCAACTTGATATGCTTGGAGGCTATAAGCTTCGGGAAGTCCTATCTTGGTAATGAAAATGACCATGTTTTCGTTCACTTTTGATAGTAGTTCTTTATTCTTTTACCAGCATGTATAAACTATAATGTGCATGTTTCTTGTCATTTCTATTTCACTTGTAGAGATTTGAATTACTTCATAGGTAAATATTCATCAACGAAGAAGAAATTAAGACTATATCTCAATTTTGAAAAACTTGCTTGTTGACTAGATACTCATGGGAAGGACCAAAGTAATTGCAAAACGAAGTCTGCAGAAAATCTAGAAGACAATGCAATTAATAAAGACTCTCAAGTTGAACCTGATGATGGATGCTCTTCAAGCCATAGAGGTGAGTACTTTAATCATTGTTTCAATCTGGAAAACTTAACATCATATTTAGTGGGAATTTGCTCTTTAATCAATTCACTGTCGATATAGTTATAAATCACAATTGACCGTGCTGTAATTGAAAATTAATATTAAATTTTCAGCTTTATCCTACTTATTGTTGTAATGAAATTATGATTTTTTTTACTAAAGAAAATGACTTCACTATAATTATATTAATGACCAACTTTTTCTCTAGATAAGGAGCTGCAGTCTGTTCAAAGCCAGAATGGAAAGCAGTATGCTGCCACAACCCCGAGAACCTTTGTTGCCAGTGAGATGTTTGATGGAAAGATGGTATAATTGTTTGGTTTTATCCTGAATTTGGTTCTTTGATTCTTTTTGAGAAATTCTCTGAAGGAAGGAGTGTTTCATATTGGTAGACTGATCGACTTCAACTTATTTTATTTTTCTTCAAATTCAGGTGAATGTGATGGATGGATTGAAATTATTTGAAGAATTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCGCTGGTTAATGATTTGAGGGCTTCCGGAAAGAGAGGGCAATTTCAAGGCAAGTTCTTGGTCTTTTTTAACTCTGCTTGA

mRNA sequence

ATGGCAATGCCATCGGGAAATGTTGGTGTACCGGATAAAGTTTCGTTTCAGAGTGGTGGTGGAGTTGCAGTGAGTGGTGGTGGTGGCGAGATCCATCAGCACCATCCCCGCCCTTGGTTTCCCGATGAGCGTGATGGGTTTATCTCATGGTTGCGCGGGGAATTTGCTGCCTCGAATGCGATAATTGATGCGCTTTGCCATCATTTACGTGCTGTAGGAGAGCCTGGGGAATATGATATGGTTATTGGATGTATACAGCAACGGCGATGTAATTGGACACCGGTGCTTCATATGCAGCAGTACTTTTCAGTTGCAGAAGTGATGTATGCCCTTCAGCAGGTAACCTCAAGGAGGCAGCAGAGGTATATGGACCCTGTGAAAGTGGGGCCGAAGTTGTATAGAAGACCTGGGCCAGGGTTTAAGCAGCAGCAGGGCCATCGGGCTGAAGCCACAGTCAAGGAAGAGACAATCACTTGTGCAGAGTCATGTAATGGTGGGAACTCTTCAACTTTTGTAAGCTCTAGGAAGGTGGAGCAAGTAAGTAATACGTGTGATGAAAGTAAGGCATCAGGGGAGGATGAAAAATTGAGCGAAAAAGATTCAGGGTCAGCTGTGGACAATAAAGATACTCATGGGAAGGACCAAAGTAATTGCAAAACGAAGTCTGCAGAAAATCTAGAAGACAATGCAATTAATAAAGACTCTCAAGTTGAACCTGATGATGGATGCTCTTCAAGCCATAGAGATAAGGAGCTGCAGTCTGTTCAAAGCCAGAATGGAAAGCAGTATGCTGCCACAACCCCGAGAACCTTTGTTGCCAGTGAGATGTTTGATGGAAAGATGGTGAATGTGATGGATGGATTGAAATTATTTGAAGAATTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCGCTGGTTAATGATTTGAGGGCTTCCGGAAAGAGAGGGCAATTTCAAGGCAAGTTCTTGGTCTTTTTTAACTCTGCTTGA

Coding sequence (CDS)

ATGGCAATGCCATCGGGAAATGTTGGTGTACCGGATAAAGTTTCGTTTCAGAGTGGTGGTGGAGTTGCAGTGAGTGGTGGTGGTGGCGAGATCCATCAGCACCATCCCCGCCCTTGGTTTCCCGATGAGCGTGATGGGTTTATCTCATGGTTGCGCGGGGAATTTGCTGCCTCGAATGCGATAATTGATGCGCTTTGCCATCATTTACGTGCTGTAGGAGAGCCTGGGGAATATGATATGGTTATTGGATGTATACAGCAACGGCGATGTAATTGGACACCGGTGCTTCATATGCAGCAGTACTTTTCAGTTGCAGAAGTGATGTATGCCCTTCAGCAGGTAACCTCAAGGAGGCAGCAGAGGTATATGGACCCTGTGAAAGTGGGGCCGAAGTTGTATAGAAGACCTGGGCCAGGGTTTAAGCAGCAGCAGGGCCATCGGGCTGAAGCCACAGTCAAGGAAGAGACAATCACTTGTGCAGAGTCATGTAATGGTGGGAACTCTTCAACTTTTGTAAGCTCTAGGAAGGTGGAGCAAGTAAGTAATACGTGTGATGAAAGTAAGGCATCAGGGGAGGATGAAAAATTGAGCGAAAAAGATTCAGGGTCAGCTGTGGACAATAAAGATACTCATGGGAAGGACCAAAGTAATTGCAAAACGAAGTCTGCAGAAAATCTAGAAGACAATGCAATTAATAAAGACTCTCAAGTTGAACCTGATGATGGATGCTCTTCAAGCCATAGAGATAAGGAGCTGCAGTCTGTTCAAAGCCAGAATGGAAAGCAGTATGCTGCCACAACCCCGAGAACCTTTGTTGCCAGTGAGATGTTTGATGGAAAGATGGTGAATGTGATGGATGGATTGAAATTATTTGAAGAATTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCGCTGGTTAATGATTTGAGGGCTTCCGGAAAGAGAGGGCAATTTCAAGGCAAGTTCTTGGTCTTTTTTAACTCTGCTTGA

Protein sequence

MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGKFLVFFNSA*
BLAST of Csa5G056080 vs. TrEMBL
Match: A0A0A0KLD4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G056080 PE=4 SV=1)

HSP 1 Score: 674.9 bits (1740), Expect = 5.1e-191
Identity = 330/330 (100.00%), Postives = 330/330 (100.00%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60
           MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA
Sbjct: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60

Query: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120
           IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ
Sbjct: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120

Query: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180
           RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV
Sbjct: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180

Query: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240
           SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD
Sbjct: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240

Query: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300
           DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV
Sbjct: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300

Query: 301 SKLLSLVNDLRASGKRGQFQGKFLVFFNSA 331
           SKLLSLVNDLRASGKRGQFQGKFLVFFNSA
Sbjct: 301 SKLLSLVNDLRASGKRGQFQGKFLVFFNSA 330

BLAST of Csa5G056080 vs. TrEMBL
Match: A0A061EA95_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_011235 PE=4 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 1.4e-87
Identity = 187/337 (55.49%), Postives = 226/337 (67.06%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQS------GGGVAVS------GGGGEIHQHHPRPWFPDERDGFI 60
           MAMPSGNV + DK+ F +      GGG AV       GGGGEIHQHH R W PDERDGFI
Sbjct: 1   MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 61  SWLRGEFAASNAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVM 120
            WLRGEFAASNAIID+LCHHLR VGE GEY+ VI CIQQRRCNW PVLHMQQYFSVAEV 
Sbjct: 61  YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 121 YALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNS 180
           YALQQV  RR+QR+ +  KVG K ++R G GFK   G R E   KE   +  +S   GNS
Sbjct: 121 YALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK---GQRME-VAKEGQNSGVDS--DGNS 180

Query: 181 STFVSSRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLED 240
           +    S + E+ S   +E K+ GE  K+ +K S    D KDT  K  +      AE++ +
Sbjct: 181 TVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAG----DAESVTE 240

Query: 241 NAINKDSQVEPDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGL 300
           +          + GC+SS+++ +L S+Q+QN KQ  A  P+TFV +EMFDGKMVNV+DGL
Sbjct: 241 DV---------NGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGL 300

Query: 301 KLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGKFLV 326
           KL+EEL DD EV  L+SLVNDLRA+GKRGQ QG+  V
Sbjct: 301 KLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYV 318

BLAST of Csa5G056080 vs. TrEMBL
Match: A0A061E8L7_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_011235 PE=4 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 1.2e-86
Identity = 185/332 (55.72%), Postives = 223/332 (67.17%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQS------GGGVAVS------GGGGEIHQHHPRPWFPDERDGFI 60
           MAMPSGNV + DK+ F +      GGG AV       GGGGEIHQHH R W PDERDGFI
Sbjct: 1   MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 61  SWLRGEFAASNAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVM 120
            WLRGEFAASNAIID+LCHHLR VGE GEY+ VI CIQQRRCNW PVLHMQQYFSVAEV 
Sbjct: 61  YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 121 YALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNS 180
           YALQQV  RR+QR+ +  KVG K ++R G GFK   G R E   KE   +  +S   GNS
Sbjct: 121 YALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK---GQRME-VAKEGQNSGVDS--DGNS 180

Query: 181 STFVSSRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLED 240
           +    S + E+ S   +E K+ GE  K+ +K S    D KDT  K  +      AE++ +
Sbjct: 181 TVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAG----DAESVTE 240

Query: 241 NAINKDSQVEPDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGL 300
           +          + GC+SS+++ +L S+Q+QN KQ  A  P+TFV +EMFDGKMVNV+DGL
Sbjct: 241 DV---------NGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGL 300

Query: 301 KLFEELLDDAEVSKLLSLVNDLRASGKRGQFQ 321
           KL+EEL DD EV  L+SLVNDLRA+GKRGQ Q
Sbjct: 301 KLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQ 313

BLAST of Csa5G056080 vs. TrEMBL
Match: A0A0B2R130_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_018833 PE=4 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 6.5e-85
Identity = 174/324 (53.70%), Postives = 222/324 (68.52%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASN 60
           MAMPSGNV + DK+ F SGG  A  G GGEIHQ H+ + WF DERDG I WLR EFAA+N
Sbjct: 1   MAMPSGNVVIQDKMQFPSGGAGA-GGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAAN 60

Query: 61  AIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ 120
           AIID+LCHHLR VG+PGEYDMVIG IQQRRCNW  VL MQQYFSVA+V +ALQQV  RRQ
Sbjct: 61  AIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQ 120

Query: 121 QRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVS--SRKV 180
           QR +DPVKVG K +R+ G G++   G R E  VKE   +  ES N  +++  V+  + K 
Sbjct: 121 QRPLDPVKVGAKEFRKSGSGYR--HGQRFE-PVKEGYNSSVESYNQYDANVTVTGGTEKG 180

Query: 181 EQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQV 240
             V    +E K+ G+ EK+ +K   SA D KD   K Q++   KS  + E +  N +S+ 
Sbjct: 181 TPVVEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRSTEGSLSNLESEA 240

Query: 241 EPDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDD 300
             +D C S+ +  +  SVQ+Q+  Q  +T  +TF+ +EMFDGKMVNV+DGLKL+E+L D 
Sbjct: 241 VVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDS 300

Query: 301 AEVSKLLSLVNDLRASGKRGQFQG 322
            E++ L+SLVNDLR SGK+GQ QG
Sbjct: 301 TEIANLVSLVNDLRVSGKKGQLQG 320

BLAST of Csa5G056080 vs. TrEMBL
Match: K7KQ45_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_05G138600 PE=4 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 6.5e-85
Identity = 174/324 (53.70%), Postives = 222/324 (68.52%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASN 60
           MAMPSGNV + DK+ F SGG  A  G GGEIHQ H+ + WF DERDG I WLR EFAA+N
Sbjct: 1   MAMPSGNVVIQDKMQFPSGGAGA-GGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAAN 60

Query: 61  AIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ 120
           AIID+LCHHLR VG+PGEYDMVIG IQQRRCNW  VL MQQYFSVA+V +ALQQV  RRQ
Sbjct: 61  AIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQ 120

Query: 121 QRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVS--SRKV 180
           QR +DPVKVG K +R+ G G++   G R E  VKE   +  ES N  +++  V+  + K 
Sbjct: 121 QRPLDPVKVGAKEFRKSGSGYR--HGQRFE-PVKEGYNSSVESYNQYDANVTVTGGTEKG 180

Query: 181 EQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQV 240
             V    +E K+ G+ EK+ +K   SA D KD   K Q++   KS  + E +  N +S+ 
Sbjct: 181 TPVVEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKSTRSTEGSLSNLESEA 240

Query: 241 EPDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDD 300
             +D C S+ +  +  SVQ+Q+  Q  +T  +TF+ +EMFDGKMVNV+DGLKL+E+L D 
Sbjct: 241 VVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDS 300

Query: 301 AEVSKLLSLVNDLRASGKRGQFQG 322
            E++ L+SLVNDLR SGK+GQ QG
Sbjct: 301 TEIANLVSLVNDLRVSGKKGQLQG 320

BLAST of Csa5G056080 vs. TAIR10
Match: AT1G14710.1 (AT1G14710.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 203.4 bits (516), Expect = 2.2e-52
Identity = 131/334 (39.22%), Postives = 180/334 (53.89%), Query Frame = 1

Query: 1   MAMPS-GNVGVP-DKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAAS 60
           MAMP  GNV  P +K+ F                   P  W PDERDGFISWLR EFAA+
Sbjct: 1   MAMPPPGNVTTPSEKLQFPP-----------------PANWIPDERDGFISWLRAEFAAA 60

Query: 61  NAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120
           NAIID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y LQQ+  +R
Sbjct: 61  NAIIDSLCQHLQAVGDHNEYESVIGSIHHRRLAWSQVLTMQQFFPVADVSYNLQQIAWKR 120

Query: 121 Q-----QRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVS 180
           Q     QR+ +  +VG    RR GPGF +  G        +       + NG      V+
Sbjct: 121 QQQMPPQRHYNSDQVGKFGARRSGPGFNKHHGGGGGYRGADSMARNGHNFNG------VN 180

Query: 181 SRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINK 240
           S +VE             E+ KL+      +V  +   G ++    +K  + LE++    
Sbjct: 181 SDRVEH-----------REEAKLASDVKALSVAEEKRDGSEKPRSDSKVEKKLEES--ET 240

Query: 241 DSQVEPDDGCSSSHRDKELQSVQSQ--NGKQYAATTPRTFVASEMFDGKMVNVMDGLKLF 300
             ++  +  C+S  +D  L S Q Q  N K+  A+  +TFV  EM+D KMVNV++GLKL+
Sbjct: 241 QEEIVKNHKCNSGSKDNSLISEQKQEENDKECPASMAKTFVVQEMYDAKMVNVVEGLKLY 298

Query: 301 EELLDDAEVSKLLSLVNDLRASGKRGQFQGKFLV 326
           +++LD  EVS+L+SLV +LR +G+RGQ Q +  V
Sbjct: 301 DKMLDANEVSQLVSLVTNLRLAGRRGQLQSEAYV 298

BLAST of Csa5G056080 vs. TAIR10
Match: AT4G02940.1 (AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 97.4 bits (241), Expect = 1.7e-20
Identity = 78/292 (26.71%), Postives = 123/292 (42.12%), Query Frame = 1

Query: 44  RDGFISWLRGEFAASNAIIDALCHHLRAVGEP---GEYDMVIGCIQQRRCNWTPVLHMQQ 103
           +D  ISW RGEFAA+NAIIDA+C HLR   E     EY+ V   I +RR NW PVL MQ+
Sbjct: 47  KDALISWFRGEFAAANAIIDAMCSHLRIAEEAVSGSEYEAVFAAIHRRRLNWIPVLQMQK 106

Query: 104 YFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCA 163
           Y S+AEV   LQ+V +++ +                    +++    AE  +KE   T  
Sbjct: 107 YHSIAEVAIELQKVAAKKAEDLK-----------------QKKTEEEAEEDLKEVVAT-- 166

Query: 164 ESCNGGNSSTFVSSRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKT 223
                          + E+V   C         EK++E D    V++ +          +
Sbjct: 167 ---------------EEEEVKKECFNG------EKVTENDVNGDVEDVEDDSPTSDITDS 226

Query: 224 KSAENLEDNAINKDSQ---VEPDDGCSSSHRDKELQSVQS-QNGKQYAATTPRTFVASEM 283
            S +++    +   +        + C +  R  E++ ++  Q  +Q    T       ++
Sbjct: 227 GSHQDVHQTVVADTAHQIICHSHEDCDA--RSCEIKPIKGFQAKEQVKGHTVNVVKGLKL 282

Query: 284 FDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGKFLVFFN 329
           ++              ELL + E+SKLL  V +LR +G  G+  G+  + FN
Sbjct: 287 YE--------------ELLKEDEISKLLDFVAELREAGINGKLAGESFILFN 282

BLAST of Csa5G056080 vs. TAIR10
Match: AT2G48080.1 (AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 82.4 bits (202), Expect = 5.7e-16
Identity = 38/82 (46.34%), Postives = 54/82 (65.85%), Query Frame = 1

Query: 44  RDGFISWLRGEFAASNAIIDALCHHL-RAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYF 103
           +D  ++W RGEFAA+NAIIDALC HL +A G   +Y+ V+  + +RR NW PVL MQ+Y 
Sbjct: 23  KDAMLTWFRGEFAAANAIIDALCAHLMQASGGSAQYESVMAALHRRRLNWIPVLQMQKYH 82

Query: 104 SVAEVMYALQQVTSRRQQRYMD 125
           S+++V   LQQ  ++    ++D
Sbjct: 83  SISQVTLQLQQHLAKGFHHHLD 104


HSP 2 Score: 48.5 bits (114), Expect = 9.1e-06
Identity = 23/60 (38.33%), Postives = 33/60 (55.00%), Query Frame = 1

Query: 269 RTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGKFLVFFN 328
           + F A E   G   NV+ GLKL++++    ++SKLL  +N LR +G+  Q  G+  V FN
Sbjct: 149 KRFSAKEHVRGHTANVVKGLKLYQDVFTRPQLSKLLDSINQLREAGRNHQLSGETFVLFN 208

BLAST of Csa5G056080 vs. NCBI nr
Match: gi|700194492|gb|KGN49669.1| (hypothetical protein Csa_5G056080 [Cucumis sativus])

HSP 1 Score: 674.9 bits (1740), Expect = 7.3e-191
Identity = 330/330 (100.00%), Postives = 330/330 (100.00%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60
           MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA
Sbjct: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60

Query: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120
           IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ
Sbjct: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120

Query: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180
           RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV
Sbjct: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180

Query: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240
           SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD
Sbjct: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240

Query: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300
           DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV
Sbjct: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300

Query: 301 SKLLSLVNDLRASGKRGQFQGKFLVFFNSA 331
           SKLLSLVNDLRASGKRGQFQGKFLVFFNSA
Sbjct: 301 SKLLSLVNDLRASGKRGQFQGKFLVFFNSA 330

BLAST of Csa5G056080 vs. NCBI nr
Match: gi|449449076|ref|XP_004142291.1| (PREDICTED: uncharacterized protein LOC101210274 isoform X2 [Cucumis sativus])

HSP 1 Score: 658.3 bits (1697), Expect = 7.1e-186
Identity = 322/325 (99.08%), Postives = 323/325 (99.38%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60
           MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA
Sbjct: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60

Query: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120
           IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ
Sbjct: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120

Query: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180
           RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV
Sbjct: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180

Query: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240
           SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD
Sbjct: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240

Query: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300
           DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV
Sbjct: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300

Query: 301 SKLLSLVNDLRASGKRGQFQGKFLV 326
           SKLLSLVNDLRASGKRGQFQG+  V
Sbjct: 301 SKLLSLVNDLRASGKRGQFQGQTYV 325

BLAST of Csa5G056080 vs. NCBI nr
Match: gi|778698245|ref|XP_011654491.1| (PREDICTED: uncharacterized protein LOC101210274 isoform X1 [Cucumis sativus])

HSP 1 Score: 658.3 bits (1697), Expect = 7.1e-186
Identity = 322/325 (99.08%), Postives = 323/325 (99.38%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60
           MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA
Sbjct: 1   MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60

Query: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120
           IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ
Sbjct: 61  IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120

Query: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180
           RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV
Sbjct: 121 RYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQV 180

Query: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240
           SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD
Sbjct: 181 SNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVEPD 240

Query: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300
           DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV
Sbjct: 241 DGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEV 300

Query: 301 SKLLSLVNDLRASGKRGQFQGKFLV 326
           SKLLSLVNDLRASGKRGQFQG+  V
Sbjct: 301 SKLLSLVNDLRASGKRGQFQGQTYV 325

BLAST of Csa5G056080 vs. NCBI nr
Match: gi|659109443|ref|XP_008454723.1| (PREDICTED: uncharacterized protein LOC103495063 isoform X2 [Cucumis melo])

HSP 1 Score: 629.4 bits (1622), Expect = 3.5e-177
Identity = 312/327 (95.41%), Postives = 319/327 (97.55%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGG-VAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAAS 60
           MA+PSGNVGVPDKVSFQSGGG VAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAAS
Sbjct: 1   MALPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHHPRPWFPDERDGFISWLRGEFAAS 60

Query: 61  NAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120
           NA+IDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR
Sbjct: 61  NAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120

Query: 121 QQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVE 180
           QQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSS+FVSSRKVE
Sbjct: 121 QQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSSFVSSRKVE 180

Query: 181 QVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVE 240
           QVSNTCDESKASGEDEKLSEKDSGSA DNKDTHGKDQSN KTK AENLEDNA NKDSQVE
Sbjct: 181 QVSNTCDESKASGEDEKLSEKDSGSAEDNKDTHGKDQSNSKTKCAENLEDNAGNKDSQVE 240

Query: 241 PDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDA 300
           PDDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVA+EMFDGKMVNVMDGLKLFEELLDDA
Sbjct: 241 PDDGCSSSHRDKELQSVQSQNGKQHAATTPRTFVANEMFDGKMVNVMDGLKLFEELLDDA 300

Query: 301 EVSKLLSLVNDLRASGKRGQFQGKFLV 326
           EVSKLLSLVNDLRASGKRGQFQG+  V
Sbjct: 301 EVSKLLSLVNDLRASGKRGQFQGQTYV 327

BLAST of Csa5G056080 vs. NCBI nr
Match: gi|659109441|ref|XP_008454722.1| (PREDICTED: uncharacterized protein LOC103495063 isoform X1 [Cucumis melo])

HSP 1 Score: 629.4 bits (1622), Expect = 3.5e-177
Identity = 312/327 (95.41%), Postives = 319/327 (97.55%), Query Frame = 1

Query: 1   MAMPSGNVGVPDKVSFQSGGG-VAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAAS 60
           MA+PSGNVGVPDKVSFQSGGG VAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAAS
Sbjct: 1   MALPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHHPRPWFPDERDGFISWLRGEFAAS 60

Query: 61  NAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120
           NA+IDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR
Sbjct: 61  NAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRR 120

Query: 121 QQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVE 180
           QQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSS+FVSSRKVE
Sbjct: 121 QQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSSFVSSRKVE 180

Query: 181 QVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAENLEDNAINKDSQVE 240
           QVSNTCDESKASGEDEKLSEKDSGSA DNKDTHGKDQSN KTK AENLEDNA NKDSQVE
Sbjct: 181 QVSNTCDESKASGEDEKLSEKDSGSAEDNKDTHGKDQSNSKTKCAENLEDNAGNKDSQVE 240

Query: 241 PDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDA 300
           PDDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVA+EMFDGKMVNVMDGLKLFEELLDDA
Sbjct: 241 PDDGCSSSHRDKELQSVQSQNGKQHAATTPRTFVANEMFDGKMVNVMDGLKLFEELLDDA 300

Query: 301 EVSKLLSLVNDLRASGKRGQFQGKFLV 326
           EVSKLLSLVNDLRASGKRGQFQG+  V
Sbjct: 301 EVSKLLSLVNDLRASGKRGQFQGQTYV 327

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0KLD4_CUCSA5.1e-191100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_5G056080 PE=4 SV=1[more]
A0A061EA95_THECC1.4e-8755.49Hydroxyproline-rich glycoprotein family protein, putative isoform 2 OS=Theobroma... [more]
A0A061E8L7_THECC1.2e-8655.72Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma... [more]
A0A0B2R130_GLYSO6.5e-8553.70Uncharacterized protein OS=Glycine soja GN=glysoja_018833 PE=4 SV=1[more]
K7KQ45_SOYBN6.5e-8553.70Uncharacterized protein OS=Glycine max GN=GLYMA_05G138600 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G14710.12.2e-5239.22 hydroxyproline-rich glycoprotein family protein[more]
AT4G02940.11.7e-2026.71 oxidoreductase, 2OG-Fe(II) oxygenase family protein[more]
AT2G48080.15.7e-1646.34 oxidoreductase, 2OG-Fe(II) oxygenase family protein[more]
Match NameE-valueIdentityDescription
gi|700194492|gb|KGN49669.1|7.3e-191100.00hypothetical protein Csa_5G056080 [Cucumis sativus][more]
gi|449449076|ref|XP_004142291.1|7.1e-18699.08PREDICTED: uncharacterized protein LOC101210274 isoform X2 [Cucumis sativus][more]
gi|778698245|ref|XP_011654491.1|7.1e-18699.08PREDICTED: uncharacterized protein LOC101210274 isoform X1 [Cucumis sativus][more]
gi|659109443|ref|XP_008454723.1|3.5e-17795.41PREDICTED: uncharacterized protein LOC103495063 isoform X2 [Cucumis melo][more]
gi|659109441|ref|XP_008454722.1|3.5e-17795.41PREDICTED: uncharacterized protein LOC103495063 isoform X1 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU085123cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa5G056080.1Csa5G056080.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU085123CU085123transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31447FAMILY NOT NAMEDcoord: 1..322
score: 1.2E
NoneNo IPR availablePANTHERPTHR31447:SF0HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKE PROTEINcoord: 1..322
score: 1.2E