CsGy5G029010 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy5G029010
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pollen Ole e 1 allergen and extensin family protein
Location: Gy14Chr5: 32436711 .. 32437947 (-)
RNA-Seq Expression: CsGy5G029010
Synteny: CsGy5G029010
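The Location field packs the chromosome, the start and end coordinates, and the strand into one string. A minimal sketch of how it could be split and the genomic span computed is shown here; the function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split a 'SeqID: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }


loc = parse_location("Gy14Chr5: 32436711 .. 32437947 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Gy14Chr5 - 1237
```

The minus strand means the sequences listed for this gene are read from the reverse complement of the chromosome sequence over that interval.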
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTTGGCTTCTCATATTACTTCTTCTCAATTTTAGTTTCTTCGATCTTTCAGAAGCCAGGCACCACAGGAAGCTGCCGTCGGCTGTCGTCGTCGGTACCGTTTTCTGCGACACATGCTATCAAGAGAAGTTTTCCAAGACCAGTCACTTCATTTCAGGTACTTTTACTAAGTGGCTCATGTTAGAATTACATAGTTGTAGAATTCAACTAGTTAAGACGTATATATGTTACTCTTAAAATGTCTTGAGTTTGTTATCTTAAGTTTCGAATGTTCCAGCATGACGTCTTGTTACCCGATCGACGAGTTAGGGTATGAGGTGATGATTTTGGGAGGAATCTGTTTATATACCCTCTAGTACTCCTTTACCCTCAACATATCAAAATTTATCTACGGGTTACTTCTTTTTTCCTTTGTTTGTCGATTTCATAACAGTTGGGTTTGTACTATCTAGGTGCAACGGTTGCTGTCGAATGTGGAAACAAAGGACCAGAACCGAGTTTTAGAGAAGAAGTAAAGACAGACAAAAGAGGGGAATTCAAGGTGAATTTGCCAGTTTTAGTGAGCAAACATGTGAAGAAGATTGAGGAATGTTATGTGGAATTAGTTAAAAGTAGTGAACCATATTGTGATGTAGCTGCAACAGCAAAATCATCTTCCCTTCAACTCAAGTCAAGAAAACAAAACACACACACATTCTCAGCTGGCTTCTTCACTTTCAAGCCTCTAAAACAACCAAACCTTTGCAACCAAAAACCACAAAATCCCAACACATTTGATGACATGAAAGAAATCCAACTCCCCCCACCACCATCATACGACATTCCGAATTTGCCATCTCCTATCCAAATCCCGACCGTACCGAGCGCTCCTCGGATTTACGATAACCTTCCTCCTCTCCCACTGCTTCCTGGACTTCTTCCATTGCCTCAGTTACCTCCACTGCCTCCATTGCCACCACTTCCTACTCTTCCACCTCTCCCTAAGTTTCCAATATTCCCACCAAAAGAGAAAGATGAGAAAAATGCACCAAATGAAACTCCAAACACAAGTGAAAAACTAGACAAGTTTCCAATACCACCCATAAAGCCTTTGAGGAAGCCACATCATTTTGTTCTGCCTCCACAAAGGCTGCACCACCACCCTCGGCTGCCACCTCATGTGGCGGTGATAGGCGGCGAGCCGATACCTAACCTTTCTAAGATCTCCTCATCTCATAAGAAAACTTCTCCTTGA

mRNA sequence

ATGAGTTGGCTTCTCATATTACTTCTTCTCAATTTTAGTTTCTTCGATCTTTCAGAAGCCAGGCACCACAGGAAGCTGCCGTCGGCTGTCGTCGTCGGTACCGTTTTCTGCGACACATGCTATCAAGAGAAGTTTTCCAAGACCAGTCACTTCATTTCAGGTGCAACGGTTGCTGTCGAATGTGGAAACAAAGGACCAGAACCGAGTTTTAGAGAAGAAGTAAAGACAGACAAAAGAGGGGAATTCAAGGTGAATTTGCCAGTTTTAGTGAGCAAACATGTGAAGAAGATTGAGGAATGTTATGTGGAATTAGTTAAAAGTAGTGAACCATATTGTGATGTAGCTGCAACAGCAAAATCATCTTCCCTTCAACTCAAGTCAAGAAAACAAAACACACACACATTCTCAGCTGGCTTCTTCACTTTCAAGCCTCTAAAACAACCAAACCTTTGCAACCAAAAACCACAAAATCCCAACACATTTGATGACATGAAAGAAATCCAACTCCCCCCACCACCATCATACGACATTCCGAATTTGCCATCTCCTATCCAAATCCCGACCGTACCGAGCGCTCCTCGGATTTACGATAACCTTCCTCCTCTCCCACTGCTTCCTGGACTTCTTCCATTGCCTCAGTTACCTCCACTGCCTCCATTGCCACCACTTCCTACTCTTCCACCTCTCCCTAAGTTTCCAATATTCCCACCAAAAGAGAAAGATGAGAAAAATGCACCAAATGAAACTCCAAACACAAGTGAAAAACTAGACAAGTTTCCAATACCACCCATAAAGCCTTTGAGGAAGCCACATCATTTTGTTCTGCCTCCACAAAGGCTGCACCACCACCCTCGGCTGCCACCTCATGTGGCGGTGATAGGCGGCGAGCCGATACCTAACCTTTCTAAGATCTCCTCATCTCATAAGAAAACTTCTCCTTGA

Coding sequence (CDS)

ATGAGTTGGCTTCTCATATTACTTCTTCTCAATTTTAGTTTCTTCGATCTTTCAGAAGCCAGGCACCACAGGAAGCTGCCGTCGGCTGTCGTCGTCGGTACCGTTTTCTGCGACACATGCTATCAAGAGAAGTTTTCCAAGACCAGTCACTTCATTTCAGGTGCAACGGTTGCTGTCGAATGTGGAAACAAAGGACCAGAACCGAGTTTTAGAGAAGAAGTAAAGACAGACAAAAGAGGGGAATTCAAGGTGAATTTGCCAGTTTTAGTGAGCAAACATGTGAAGAAGATTGAGGAATGTTATGTGGAATTAGTTAAAAGTAGTGAACCATATTGTGATGTAGCTGCAACAGCAAAATCATCTTCCCTTCAACTCAAGTCAAGAAAACAAAACACACACACATTCTCAGCTGGCTTCTTCACTTTCAAGCCTCTAAAACAACCAAACCTTTGCAACCAAAAACCACAAAATCCCAACACATTTGATGACATGAAAGAAATCCAACTCCCCCCACCACCATCATACGACATTCCGAATTTGCCATCTCCTATCCAAATCCCGACCGTACCGAGCGCTCCTCGGATTTACGATAACCTTCCTCCTCTCCCACTGCTTCCTGGACTTCTTCCATTGCCTCAGTTACCTCCACTGCCTCCATTGCCACCACTTCCTACTCTTCCACCTCTCCCTAAGTTTCCAATATTCCCACCAAAAGAGAAAGATGAGAAAAATGCACCAAATGAAACTCCAAACACAAGTGAAAAACTAGACAAGTTTCCAATACCACCCATAAAGCCTTTGAGGAAGCCACATCATTTTGTTCTGCCTCCACAAAGGCTGCACCACCACCCTCGGCTGCCACCTCATGTGGCGGTGATAGGCGGCGAGCCGATACCTAACCTTTCTAAGATCTCCTCATCTCATAAGAAAACTTCTCCTTGA

Protein sequence

MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKSSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEKDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPNLSKISSSHKKTSP*
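The protein sequence is the translation of the coding sequence listed above it. The sketch below shows one way to check that relationship with the standard genetic code; the helper name `translate` is hypothetical, and only the first 30 and the last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 9 nt of the CDS shown on this page.
print(translate("ATGAGTTGGCTTCTCATATTACTTCTTCTC"))  # MSWLLILLLL
print(translate("TCTCCTTGA"))                       # SP*
```

Both fragments agree with the start (MSWLLILLLL) and the end (SP*) of the listed protein sequence.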
Homology
BLAST of CsGy5G029010 vs. NCBI nr
Match: XP_011656281.2 (proline-rich protein 4 [Cucumis sativus] >KAE8648949.1 hypothetical protein Csa_008537 [Cucumis sativus])

HSP 1 Score: 621 bits (1601), Expect = 3.63e-224
Identity = 313/313 (100.00%), Positives = 313/313 (100.00%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE
Sbjct: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS
Sbjct: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
           SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL
Sbjct: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180

Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
           PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK
Sbjct: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240

Query: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPN 300
           DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPN
Sbjct: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPN 300

Query: 301 LSKISSSHKKTSP 313
           LSKISSSHKKTSP
Sbjct: 301 LSKISSSHKKTSP 313

BLAST of CsGy5G029010 vs. NCBI nr
Match: KAA0039420.1 (major pollen allergen Lol p 11 [Cucumis melo var. makuwa] >TYK00608.1 major pollen allergen Lol p 11 [Cucumis melo var. makuwa])

HSP 1 Score: 563 bits (1452), Expect = 1.93e-201
Identity = 285/314 (90.76%), Positives = 298/314 (94.90%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           M WLLILLLLNFSFFDLSEARHHRKLPSAV++GTVFCDTC+QEKFSKTSHFISGATVAVE
Sbjct: 1   MIWLLILLLLNFSFFDLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVE 60

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGN+G +PSFREEVKTDKRGEFKVNLPVLVSKHV+KIEECYVE +KSSEPYCDVAATAKS
Sbjct: 61  CGNRGRKPSFREEVKTDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKS 120

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEI-QLPPPPSYDIPN 180
           SSLQLKS+KQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEI QLPPPP +D PN
Sbjct: 121 SSLQLKSKKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLPPPPPFDSPN 180

Query: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
           LPSPIQ PTVP+APRIYDNLPPLPLLPGLLPLP LPPLPPLPPLP LPPLPKFPIFPPK 
Sbjct: 181 LPSPIQNPTVPNAPRIYDNLPPLPLLPGLLPLPPLPPLPPLPPLPPLPPLPKFPIFPPKA 240

Query: 241 KDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIP 300
            DEKNAP ETPNTSEKLDKFPIPPIKPLR+PH+FVLPPQRLHHHP+ PPHVAVIGGEPIP
Sbjct: 241 NDEKNAPIETPNTSEKLDKFPIPPIKPLRRPHYFVLPPQRLHHHPQPPPHVAVIGGEPIP 300

Query: 301 NLSKISSSHKKTSP 313
           NLS ISS  KKTSP
Sbjct: 301 NLSNISSPQKKTSP 314
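Each HSP statistics line reports identical and similar (positive) positions as a count over the alignment length, followed by the percentage. A small sketch of how those percentages can be recomputed from the counts follows; the function name `hsp_percentages` is hypothetical, and the sketch reads only the two count/length ratios, so the spelling of the labels does not matter.

```python
import re


def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from an HSP statistics line.

    Assumption: the first 'n/m' ratio on the line is the identity count and
    the second is the positives count, as in the records on this page.
    """
    ratios = re.findall(r"(\d+)/(\d+)", stats_line)
    if len(ratios) < 2:
        raise ValueError("expected identity and positives ratios")
    (ident, ident_len), (pos, pos_len) = (
        (int(n), int(m)) for n, m in ratios[:2]
    )
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 285/314 (90.76%), Positives = 298/314 (94.90%), Query Frame = 0"
print(hsp_percentages(line))  # (90.76, 94.9)
```

The denominator is the alignment length including gap columns (314 here), not the query length (313), which is why a one-residue gap changes the percentages slightly.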

BLAST of CsGy5G029010 vs. NCBI nr
Match: XP_038889683.1 (pollen-specific leucine-rich repeat extensin-like protein 3 [Benincasa hispida])

HSP 1 Score: 439 bits (1129), Expect = 2.95e-152
Identity = 239/318 (75.16%), Positives = 264/318 (83.02%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           M WL+ILLLLN SFFDLSEARHHRKLPSAVVVGTVFCDTC+QEKFSKTSHFISGATV V+
Sbjct: 5   MVWLIILLLLNLSFFDLSEARHHRKLPSAVVVGTVFCDTCFQEKFSKTSHFISGATVVVK 64

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGN+G  PSFREEVKTDKRGEFKVNLPV VSKHVKKIEECYVEL+KSSEPYC VAATAKS
Sbjct: 65  CGNEGSRPSFREEVKTDKRGEFKVNLPVPVSKHVKKIEECYVELIKSSEPYCAVAATAKS 124

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
           SSLQLKS+KQ THTFSAGFFTFKPLKQPNLC Q+PQN NT+DD K++ L  PP++D P+L
Sbjct: 125 SSLQLKSKKQGTHTFSAGFFTFKPLKQPNLCKQRPQNSNTYDDTKQV-LAAPPTFDYPSL 184

Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
           PSPIQ P VP   RIY+NLPPLPLLPGLLPLP   PLPPLPPLP LPPLP FP+FPPK K
Sbjct: 185 PSPIQNPNVP---RIYENLPPLPLLPGLLPLP---PLPPLPPLPPLPPLPGFPLFPPK-K 244

Query: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHP----RLPPHVAVIG-G 300
           +E NAPNETPNTSEK ++F    + PLR+PH  VLPP RLHHH     RL P    +   
Sbjct: 245 NENNAPNETPNTSEKPNQFHPQTLLPLRRPHFSVLPPHRLHHHQPLLHRLQPAAGGVALS 304

Query: 301 EPIPNLSKISSSHKKTSP 313
            P+P+L +ISS  K+ SP
Sbjct: 305 LPLPDLPEISSPPKQNSP 314

BLAST of CsGy5G029010 vs. NCBI nr
Match: XP_023528636.1 (proline-rich protein 4-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 353 bits (907), Expect = 1.51e-118
Identity = 206/335 (61.49%), Positives = 233/335 (69.55%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           M  LLIL+ LNFS  DLS+ARHH  LPSA VVGTVFCDTC+Q+ FSKTSHFISGATVAVE
Sbjct: 1   MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGN G  PSFR+EVKTDK GEFK+ LPV     V+KIEECYV L++SSEPYC VAA AKS
Sbjct: 61  CGNGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKS 120

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
           SSL+LKSRKQ  H FSAGFFTFKPLKQP LC+    + N FDD K++        D P L
Sbjct: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQV-------VDFPGL 180

Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
           P+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP       FP+FPPK K
Sbjct: 181 PAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK-K 240

Query: 241 DEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHPRL----PPHV 300
           D++N   +TP  S+  D F      PIP +KP R   HFV+PP +L HHP      PP  
Sbjct: 241 DDENV--QTPKISQNPDLFHPQTLLPIPSLKPFRP--HFVMPPHKLRHHPLTHGPTPPSA 300

Query: 301 AVIGGE------------PIPNLSKISSSHKKTSP 313
           A   GE             IPN+ +ISS  K+TSP
Sbjct: 301 AAAAGELAPSPPLPFSLPSIPNMPEISSPPKQTSP 312

BLAST of CsGy5G029010 vs. NCBI nr
Match: XP_022990149.1 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like [Cucurbita maxima])

HSP 1 Score: 343 bits (881), Expect = 1.01e-114
Identity = 199/316 (62.97%), Positives = 228/316 (72.15%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAV--VVGTVFCDTCYQEKFSKTSHFISGATVA 60
           M  LLIL+ LNFSF DLS+ARHH  LPSA   VVGTVFCDTC+Q+ FSKTSHFISGATVA
Sbjct: 1   MICLLILIALNFSFLDLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVA 60

Query: 61  VECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATA 120
           VECGN+G  P+FREEVKTDK GEFK+ LPV     V+K+EECYV L++SSEPYC VAA A
Sbjct: 61  VECGNEGSNPNFREEVKTDKTGEFKIQLPV----SVRKVEECYVRLIRSSEPYCAVAARA 120

Query: 121 KSSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIP 180
           KSSSL+LKS+KQ TH FSAGFFTFKPLKQP LC+    + N FDD K++        D P
Sbjct: 121 KSSSLRLKSKKQGTHVFSAGFFTFKPLKQPKLCSHNSHS-NEFDDTKQV-------VDFP 180

Query: 181 NLPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPK 240
            LP+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP       FP+FPPK
Sbjct: 181 GLPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPI------FPLFPPK 240

Query: 241 EKDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHPRL----PP 300
            KD++N   +TP  S+  D F      PIP +KPLR   HFV+PP +L HHP      PP
Sbjct: 241 -KDDENV--QTPKISQNPDLFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHPLTHGPTPP 293

BLAST of CsGy5G029010 vs. ExPASy TrEMBL
Match: A0A0A0KXM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1)

HSP 1 Score: 572 bits (1475), Expect = 1.74e-205
Identity = 291/296 (98.31%), Positives = 291/296 (98.31%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE
Sbjct: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS
Sbjct: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
           SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL
Sbjct: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180

Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
           PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK
Sbjct: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240

Query: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGE 296
           DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPP     GGE
Sbjct: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPQSG--GGE 294

BLAST of CsGy5G029010 vs. ExPASy TrEMBL
Match: A0A5A7T850 (Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G002260 PE=4 SV=1)

HSP 1 Score: 563 bits (1452), Expect = 9.35e-202
Identity = 285/314 (90.76%), Positives = 298/314 (94.90%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           M WLLILLLLNFSFFDLSEARHHRKLPSAV++GTVFCDTC+QEKFSKTSHFISGATVAVE
Sbjct: 1   MIWLLILLLLNFSFFDLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVE 60

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGN+G +PSFREEVKTDKRGEFKVNLPVLVSKHV+KIEECYVE +KSSEPYCDVAATAKS
Sbjct: 61  CGNRGRKPSFREEVKTDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKS 120

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEI-QLPPPPSYDIPN 180
           SSLQLKS+KQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEI QLPPPP +D PN
Sbjct: 121 SSLQLKSKKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLPPPPPFDSPN 180

Query: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
           LPSPIQ PTVP+APRIYDNLPPLPLLPGLLPLP LPPLPPLPPLP LPPLPKFPIFPPK 
Sbjct: 181 LPSPIQNPTVPNAPRIYDNLPPLPLLPGLLPLPPLPPLPPLPPLPPLPPLPKFPIFPPKA 240

Query: 241 KDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIP 300
            DEKNAP ETPNTSEKLDKFPIPPIKPLR+PH+FVLPPQRLHHHP+ PPHVAVIGGEPIP
Sbjct: 241 NDEKNAPIETPNTSEKLDKFPIPPIKPLRRPHYFVLPPQRLHHHPQPPPHVAVIGGEPIP 300

Query: 301 NLSKISSSHKKTSP 313
           NLS ISS  KKTSP
Sbjct: 301 NLSNISSPQKKTSP 314

BLAST of CsGy5G029010 vs. ExPASy TrEMBL
Match: A0A6J1JRA2 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita maxima OX=3661 GN=LOC111487127 PE=4 SV=1)

HSP 1 Score: 343 bits (881), Expect = 4.91e-115
Identity = 199/316 (62.97%), Positives = 228/316 (72.15%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAV--VVGTVFCDTCYQEKFSKTSHFISGATVA 60
           M  LLIL+ LNFSF DLS+ARHH  LPSA   VVGTVFCDTC+Q+ FSKTSHFISGATVA
Sbjct: 1   MICLLILIALNFSFLDLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVA 60

Query: 61  VECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATA 120
           VECGN+G  P+FREEVKTDK GEFK+ LPV     V+K+EECYV L++SSEPYC VAA A
Sbjct: 61  VECGNEGSNPNFREEVKTDKTGEFKIQLPV----SVRKVEECYVRLIRSSEPYCAVAARA 120

Query: 121 KSSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIP 180
           KSSSL+LKS+KQ TH FSAGFFTFKPLKQP LC+    + N FDD K++        D P
Sbjct: 121 KSSSLRLKSKKQGTHVFSAGFFTFKPLKQPKLCSHNSHS-NEFDDTKQV-------VDFP 180

Query: 181 NLPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPK 240
            LP+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP       FP+FPPK
Sbjct: 181 GLPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPI------FPLFPPK 240

Query: 241 EKDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHPRL----PP 300
            KD++N   +TP  S+  D F      PIP +KPLR   HFV+PP +L HHP      PP
Sbjct: 241 -KDDENV--QTPKISQNPDLFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHPLTHGPTPP 293

BLAST of CsGy5G029010 vs. ExPASy TrEMBL
Match: A0A6J1HD47 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita moschata OX=3662 GN=LOC111463006 PE=4 SV=1)

HSP 1 Score: 339 bits (869), Expect = 5.40e-113
Identity = 202/342 (59.06%), Positives = 232/342 (67.84%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPS-AVVVGTVFCDTCYQEKFSKTSHFISGATVAV 60
           M  LLIL+ LNFSF DLS+ARHH  LPS A VVGTVFCDTC+Q+ FSK+SHFISGATVAV
Sbjct: 1   MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60

Query: 61  ECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAK 120
           ECG+ G  PSFR+EVKTDK GEFK+ LPV     V+KIEECYV L++SSEPYC VAA AK
Sbjct: 61  ECGDGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAK 120

Query: 121 SSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPN 180
           SSSL+LKSRKQ TH FSAGFFTFKPLK P LC+    + + FDD K++        D P 
Sbjct: 121 SSSLKLKSRKQGTHVFSAGFFTFKPLKHPKLCSHNSHS-SEFDDTKQV-------VDFPG 180

Query: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
           LP+PIQ PTVP+ PRIYDNLPPL LLPGL PLPQLPPLPPLPPLP       FP+FPPK 
Sbjct: 181 LPAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK- 240

Query: 241 KDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHPRL------- 300
           KD++N   +TP  S+  D F      PIP +KPLR   HFV+PP +L HHP         
Sbjct: 241 KDDENV--QTPKISQNPDMFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHPLTHGPFSPS 300

Query: 301 -----PPHVAV----------IGGEPIPNLSKISSSHKKTSP 313
                PP  A               PIP + +ISS  K+TSP
Sbjct: 301 FSTPTPPSAAADELAPSPPLPFSLPPIPRMPEISSPPKETSP 319

BLAST of CsGy5G029010 vs. ExPASy TrEMBL
Match: A0A6J1BYU3 (proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 SV=1)

HSP 1 Score: 337 bits (864), Expect = 3.67e-112
Identity = 209/347 (60.23%), Positives = 230/347 (66.28%), Query Frame = 0

Query: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
           M WLLILL+LN SF DLS+ARHH+ LPSAV+VGTVFCDTC QEK SKTS FISGATVAVE
Sbjct: 1   MVWLLILLILNLSFCDLSQARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVE 60

Query: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
           CGN GP+PSFREEVKTDKRGEFKV+LPV VS+HVK IE C V L++SSE YC VAA A S
Sbjct: 61  CGNGGPKPSFREEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATS 120

Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
           SS +LKSR Q TH FSAGFFTFKPLK PN+C QKP + NTF DMK+      P  D P L
Sbjct: 121 SSFELKSRNQGTHFFSAGFFTFKPLKHPNICTQKPYS-NTFHDMKQAL----PMLDYPAL 180

Query: 181 PSPIQIPT-VPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
           P+PI+ PT VP+ PRIYDNLPPLP LP L PLPQLPPLPPLPPLP       FPIFPPK 
Sbjct: 181 PTPIENPTIVPNVPRIYDNLPPLPFLPRLPPLPQLPPLPPLPPLPG------FPIFPPK- 240

Query: 241 KDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLP------------ 300
           K  +NAPN          K  +P  K LR   HFVLPP RL H P  P            
Sbjct: 241 KTVENAPN---------GKTLLPHKKHLRP--HFVLPPHRLQHPPLFPNIPSLLPFEIPP 300

Query: 301 PHVA-----VIGGE----------------PIPNLSKISSSHKKTSP 313
           P VA     V GG                 P+P L  I S  ++TSP
Sbjct: 301 PPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPGLPGIPSPPRQTSP 324

BLAST of CsGy5G029010 vs. TAIR 10
Match: AT5G15780.1 (Pollen Ole e 1 allergen and extensin family protein )

HSP 1 Score: 176.4 bits (446), Expect = 3.6e-44
Identity = 147/391 (37.60%), Positives = 191/391 (48.85%), Query Frame = 0

Query: 3   WLLILLLLNFSF-FDLSEARHH---RKLPSAVVVGTVFCDTCYQEKFSKT-SHFISGATV 62
           W  +++ L  S    LS+ + H   +   SAVVVGTV+CDTC+   FSK+ +H ISGA V
Sbjct: 10  WFSLMIFLGISINGGLSQGQQHVMKKTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGALV 69

Query: 63  AVECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAAT 122
           AVEC ++  +PSFR+EVKTDKRGEFKV LP  VSKHVKKI+ C V+L+ SS+PYC +A++
Sbjct: 70  AVECIDENSKPSFRQEVKTDKRGEFKVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSIASS 129

Query: 123 AKSSSL-QLKSRK--QNTHTFSAGFFTFKPLKQPNLCNQKP------------------- 182
           A SSSL +LKS    +NT  FSAGFFTF+P  QP +C+QKP                   
Sbjct: 130 ATSSSLKRLKSNHHGENTRVFSAGFFTFRPENQPEICSQKPINLRGSKPLLPDPSFPPPL 189

Query: 183 QNPNTFDDMKEIQLPPP------PSYDIPNLPSPIQIPTVPSAPRIYDNL---------- 242
           Q+P     +  + + PP      P   +P+LP P+  P +P  P+   +L          
Sbjct: 190 QDPPNPSPLPNLPIVPPLPNLPVPKLPVPDLPLPLVPPLLPPGPQKSASLHNKKSDSLKD 249

Query: 243 -----------------------PPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFP 302
                                  PP PL+P  +P P LPP P +P  P+LPP+P  P  P
Sbjct: 250 KKTEALKPNFFFPPNPLNPPSIIPPNPLIPS-IPTPTLPPNPLIPSPPSLPPIPLIPT-P 309

Query: 303 PKEKDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLP--------- 309
           P        P     T   +   P  P  P+  P   V PP      P  P         
Sbjct: 310 PTLPTIPLLPTPPTPTLPPIPTIPTLPPLPVLPPVPIVNPPSLPPPPPSFPVPLPPVPGL 369

BLAST of CsGy5G029010 vs. TAIR 10
Match: AT5G13140.1 (Pollen Ole e 1 allergen and extensin family protein )

HSP 1 Score: 48.9 bits (115), Expect = 8.6e-06
Identity = 67/234 (28.63%), Positives = 103/234 (44.02%), Query Frame = 0

Query: 31  VVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVK------TDKRGEFKV 90
           VVG V+CDTC    FS+ S+F+ G  V V C  K   P   EEV       T++ G +K+
Sbjct: 41  VVGVVYCDTCSINTFSRQSYFLQGVEVHVTCRFKASSPKTAEEVNISVNRTTNRSGVYKL 100

Query: 91  NLPVLVSKHVKKIE---------ECYVELVKSSEP---YCDVAA-TAKSSSLQLKSRKQN 150
            +P     HV  I+         +C  +++K+S      C +      ++ + +KS++  
Sbjct: 101 EIP-----HVDGIDCVDGIAISSQCSAKILKTSSDDNGGCSIPVFQTATNEVSIKSKQDR 160

Query: 151 THTFSAGFFTFK-PLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPTVP 210
              +S    ++K P K  +LC    +  +  D+  E +      +  P L +P   P   
Sbjct: 161 VCIYSLSALSYKPPHKNTSLCGNGGKKHHRKDEKVEKKF-RDSKFFWPYL-APYWFPWP- 220

Query: 211 SAPRIYDNLPPLPLLPGL--LPLPQLP---PLPPLP------PLPTLPPLPKFP 234
                Y +LPPLP LP     P P LP   P   LP      P+  +P LP+FP
Sbjct: 221 -----YPDLPPLPTLPPFPSFPFPSLPFGNPNLALPAFDWKNPVTWIPYLPRFP 261

BLAST of CsGy5G029010 vs. TAIR 10
Match: AT2G16630.1 (Pollen Ole e 1 allergen and extensin family protein )

HSP 1 Score: 48.1 bits (113), Expect = 1.5e-05
Identity = 61/221 (27.60%), Positives = 97/221 (43.89%), Query Frame = 0

Query: 29  AVVVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVKTDKRGEFKVNLPV 88
           A V G+VFCD C   + S     +SG  ++V C ++  +     E  T+  G +     V
Sbjct: 28  ATVTGSVFCDQCKDGERSLFDFPVSGIKISVTCADENGQVYMSREETTNWLGGY-----V 87

Query: 89  LVSKHVKKIEECYVEL----VKSSEPYCDVAATAKSSSLQLKSRKQNTHTFSAGFFTFKP 148
           +       +  CY ++    V+     C + A+  +  L+L        TF+A     +P
Sbjct: 88  MRFDGTPDLSNCYAQVSDNGVQQDPSSCSI-ASGPAQKLKLMFSFFGIETFAADALLAQP 147

Query: 149 LKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPTV----------PSAPR 208
           ++  + C + P  P     M   Q+P  P   +P  P P ++P +          P  P 
Sbjct: 148 VQPSSFCPKPPTAP----VMPPPQVPVMPPPQVPVKPHP-KVPVISPDPPATLPPPKVPV 207

Query: 209 IYDNLPPLPLLPGLLPLPQLPPL--PP---LPPLPTLPPLP 231
           I  + PP  L P L+P+  LPP+  PP   LPPLP +PP+P
Sbjct: 208 ISPD-PPTTLPPPLVPVINLPPVTSPPQFKLPPLPQIPPMP 236

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value     Identity (%)  Description
XP_011656281.2   3.63e-224   100.00        proline-rich protein 4 [Cucumis sativus] >KAE8648949.1 hypothetical protein Csa_...
KAA0039420.1     1.93e-201   90.76         major pollen allergen Lol p 11 [Cucumis melo var. makuwa] >TYK00608.1 major poll...
XP_038889683.1   2.95e-152   75.16         pollen-specific leucine-rich repeat extensin-like protein 3 [Benincasa hispida]
XP_023528636.1   1.51e-118   61.49         proline-rich protein 4-like [Cucurbita pepo subsp. pepo]
XP_022990149.1   1.01e-114   62.97         amyloid beta A4 precursor protein-binding family B member 1-interacting protein-...

ExPASy TrEMBL
Match Name       E-value     Identity (%)  Description
A0A0A0KXM2       1.74e-205   98.31         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1
A0A5A7T850       9.35e-202   90.76         Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A6J1JRA2       4.91e-115   62.97         amyloid beta A4 precursor protein-binding family B member 1-interacting protein-...
A0A6J1HD47       5.40e-113   59.06         amyloid beta A4 precursor protein-binding family B member 1-interacting protein-...
A0A6J1BYU3       3.67e-112   60.23         proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 ...

TAIR 10
Match Name       E-value     Identity (%)  Description
AT5G15780.1      3.6e-44     37.60         Pollen Ole e 1 allergen and extensin family protein
AT5G13140.1      8.6e-06     28.63         Pollen Ole e 1 allergen and extensin family protein
AT2G16630.1      1.5e-05     27.60         Pollen Ole e 1 allergen and extensin family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description   Alignment
None      No IPR available  PFAM         PF01190        Pollen_Ole_e_1       coord: 31..119; e-value: 9.0E-20; score: 70.9
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 277..313
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 234..262
None      No IPR available  PANTHER      PTHR47273      EXPRESSED PROTEIN    coord: 1..313
None      No IPR available  PANTHER      PTHR47273:SF4  EXPRESSED PROTEIN    coord: 1..313

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CsGy5G029010.2   CsGy5G029010.2   mRNA