Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAAAAAAAAAGGAAAAAACAGTTGGATCTAAAATAAAAAATGAGTTGGCTTCTCATATTACTTCTTCTCAATTTTAGTTTCTTCGATCTTTCAGAAGCCAGGCACCACAGGAAGCTGCCGTCGGCTGTCGTCGTCGGTACCGTTTTCTGCGACACATGCTATCAAGAGAAGTTTTCCAAGACCAGTCACTTCATTTCAGGTACTTTTACTAAGTGGCTCATGTTAGAATTACATAGTTGTAGAATTCAACTAGTTAAGACGTATATATGTTACTCTTAAAATGTCTTGAGTTTGTTATCTTAAGTTTCGAATGTTCCAGCATGACGTCTTGTTACCCGATCGACGAGTTAGGGTATGAGGTGATGATTTTGGGAGGAATCTGTTTATATACCCTCTAGTACTCCTTTACCCTCAACATATCAAAATTTATCTACGGGTTACTTCTTTTTTCCTTTGTTTGTCGATTTCATAACAGTTGGGTTTGTACTATCTAGGTGCAACGGTTGCTGTCGAATGTGGAAACAAAGGACCAGAACCGAGTTTTAGAGAAGAAGTAAAGACAGACAAAAGAGGGGAATTCAAGGTGAATTTGCCAGTTTTAGTGAGCAAACATGTGAAGAAGATTGAGGAATGTTATGTGGAATTAGTTAAAAGTAGTGAACCATATTGTGATGTAGCTGCAACAGCAAAATCATCTTCCCTTCAACTCAAGTCAAGAAAACAAAACACACACACATTCTCAGCTGGCTTCTTCACTTTCAAGCCTCTAAAACAACCAAACCTTTGCAACCAAAAACCACAAAATCCCAACACATTTGATGACATGAAAGAAATCCAACTCCCCCCACCACCATCATACGACATTCCGAATTTGCCATCTCCTATCCAAATCCCGACCGTACCGAGCGCTCCTCGGATTTACGATAACCTTCCTCCTCTCCCACTGCTTCCTGGACTTCTTCCATTGCCTCAGTTACCTCCACTGCCTCCATTGCCACCACTTCCTACTCTTCCACCTCTCCCTAAGTTTCCAATATTCCCACCAAAAGAGAAAGATGAGAAAAATGCACCAAATGAAACTCCAAACACAAGTGAAAAACTAGACAAGTTTCCAATACCACCCATAAAGCCTTTGAGGAAGCCACATCATTTTGTTCTGCCTCCACAAAGGCTGCACCACCACCCTCGGCTGCCACCTCATGTGGCGGTGATAGGCGGCGAGCCGATACCTAACCTTTCTAAGATCTCCTCATCTCATAAGAAAACTTCTCCTTGAAATATTTCAACGTTTAAACCCAAAGTTAACCTTCAGAATAAGTCGCAGTTGCAAATATATCACAATATTAAAGAATTTGCAAATGTATCAAAATTTAGATCTAACTTTAGGAGTTGATATATCCGTTATAGACTATATTACTAATAAGAGTCTATCGAGGATAAAGTACGTTATCGACATATTTTGCGATATTTGTAATTCTTTAAAAATACTACCCTGCAAATTGCAATTGCTCGTTTTTTTTCTTGTATTCTTGTGCTTATTTTCTAGTAAAAATCAATAATTTGTGAGCTAAATTGGATGGATGTGAGGAAAATTGAAGGTCAGGATTAGGCATAAAAGGGTTGGTGCTGTGATTTGAATGGTTTGATCATAATTATTTATGTTGTTGGG
mRNA sequence
ATGAGTTGGCTTCTCATATTACTTCTTCTCAATTTTAGTTTCTTCGATCTTTCAGAAGCCAGGCACCACAGGAAGCTGCCGTCGGCTGTCGTCGTCGGTACCGTTTTCTGCGACACATGCTATCAAGAGAAGTTTTCCAAGACCAGTCACTTCATTTCAGGTGCAACGGTTGCTGTCGAATGTGGAAACAAAGGACCAGAACCGAGTTTTAGAGAAGAAGTAAAGACAGACAAAAGAGGGGAATTCAAGGTGAATTTGCCAGTTTTAGTGAGCAAACATGTGAAGAAGATTGAGGAATGTTATGTGGAATTAGTTAAAAGTAGTGAACCATATTGTGATGTAGCTGCAACAGCAAAATCATCTTCCCTTCAACTCAAGTCAAGAAAACAAAACACACACACATTCTCAGCTGGCTTCTTCACTTTCAAGCCTCTAAAACAACCAAACCTTTGCAACCAAAAACCACAAAATCCCAACACATTTGATGACATGAAAGAAATCCAACTCCCCCCACCACCATCATACGACATTCCGAATTTGCCATCTCCTATCCAAATCCCGACCGTACCGAGCGCTCCTCGGATTTACGATAACCTTCCTCCTCTCCCACTGCTTCCTGGACTTCTTCCATTGCCTCAGTTACCTCCACTGCCTCCATTGCCACCACTTCCTACTCTTCCACCTCTCCCTAAGTTTCCAATATTCCCACCAAAAGAGAAAGATGAGAAAAATGCACCAAATGAAACTCCAAACACAAGTGAAAAACTAGACAAGTTTCCAATACCACCCATAAAGCCTTTGAGGAAGCCACATCATTTTGTTCTGCCTCCACAAAGGCTGCACCACCACCCTCGGCTGCCACCTCATGTGGCGGTGATAGGCGGCGAGCCGATACCTAACCTTTCTAAGATCTCCTCATCTCATAAGAAAACTTCTCCTTGA
Coding sequence (CDS)
ATGAGTTGGCTTCTCATATTACTTCTTCTCAATTTTAGTTTCTTCGATCTTTCAGAAGCCAGGCACCACAGGAAGCTGCCGTCGGCTGTCGTCGTCGGTACCGTTTTCTGCGACACATGCTATCAAGAGAAGTTTTCCAAGACCAGTCACTTCATTTCAGGTGCAACGGTTGCTGTCGAATGTGGAAACAAAGGACCAGAACCGAGTTTTAGAGAAGAAGTAAAGACAGACAAAAGAGGGGAATTCAAGGTGAATTTGCCAGTTTTAGTGAGCAAACATGTGAAGAAGATTGAGGAATGTTATGTGGAATTAGTTAAAAGTAGTGAACCATATTGTGATGTAGCTGCAACAGCAAAATCATCTTCCCTTCAACTCAAGTCAAGAAAACAAAACACACACACATTCTCAGCTGGCTTCTTCACTTTCAAGCCTCTAAAACAACCAAACCTTTGCAACCAAAAACCACAAAATCCCAACACATTTGATGACATGAAAGAAATCCAACTCCCCCCACCACCATCATACGACATTCCGAATTTGCCATCTCCTATCCAAATCCCGACCGTACCGAGCGCTCCTCGGATTTACGATAACCTTCCTCCTCTCCCACTGCTTCCTGGACTTCTTCCATTGCCTCAGTTACCTCCACTGCCTCCATTGCCACCACTTCCTACTCTTCCACCTCTCCCTAAGTTTCCAATATTCCCACCAAAAGAGAAAGATGAGAAAAATGCACCAAATGAAACTCCAAACACAAGTGAAAAACTAGACAAGTTTCCAATACCACCCATAAAGCCTTTGAGGAAGCCACATCATTTTGTTCTGCCTCCACAAAGGCTGCACCACCACCCTCGGCTGCCACCTCATGTGGCGGTGATAGGCGGCGAGCCGATACCTAACCTTTCTAAGATCTCCTCATCTCATAAGAAAACTTCTCCTTGA
Protein sequence
MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKSSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEKDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPNLSKISSSHKKTSP*
Homology
BLAST of CsaV3_5G038710 vs. NCBI nr
Match:
XP_011656281.2 (proline-rich protein 4 [Cucumis sativus] >KAE8648949.1 hypothetical protein Csa_008537 [Cucumis sativus])
HSP 1 Score: 622.1 bits (1603), Expect = 2.6e-174
Identity = 313/313 (100.00%), Postives = 313/313 (100.00%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE
Sbjct: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS
Sbjct: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL
Sbjct: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK
Sbjct: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
Query: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPN 300
DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPN
Sbjct: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIPN 300
Query: 301 LSKISSSHKKTSP 314
LSKISSSHKKTSP
Sbjct: 301 LSKISSSHKKTSP 313
BLAST of CsaV3_5G038710 vs. NCBI nr
Match:
KAA0039420.1 (major pollen allergen Lol p 11 [Cucumis melo var. makuwa] >TYK00608.1 major pollen allergen Lol p 11 [Cucumis melo var. makuwa])
HSP 1 Score: 564.7 bits (1454), Expect = 5.0e-157
Identity = 285/314 (90.76%), Postives = 298/314 (94.90%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
M WLLILLLLNFSFFDLSEARHHRKLPSAV++GTVFCDTC+QEKFSKTSHFISGATVAVE
Sbjct: 1 MIWLLILLLLNFSFFDLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVE 60
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGN+G +PSFREEVKTDKRGEFKVNLPVLVSKHV+KIEECYVE +KSSEPYCDVAATAKS
Sbjct: 61 CGNRGRKPSFREEVKTDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKS 120
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKE-IQLPPPPSYDIPN 180
SSLQLKS+KQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKE IQLPPPP +D PN
Sbjct: 121 SSLQLKSKKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLPPPPPFDSPN 180
Query: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
LPSPIQ PTVP+APRIYDNLPPLPLLPGLLPLP LPPLPPLPPLP LPPLPKFPIFPPK
Sbjct: 181 LPSPIQNPTVPNAPRIYDNLPPLPLLPGLLPLPPLPPLPPLPPLPPLPPLPKFPIFPPKA 240
Query: 241 KDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIP 300
DEKNAP ETPNTSEKLDKFPIPPIKPLR+PH+FVLPPQRLHHHP+ PPHVAVIGGEPIP
Sbjct: 241 NDEKNAPIETPNTSEKLDKFPIPPIKPLRRPHYFVLPPQRLHHHPQPPPHVAVIGGEPIP 300
Query: 301 NLSKISSSHKKTSP 314
NLS ISS KKTSP
Sbjct: 301 NLSNISSPQKKTSP 314
BLAST of CsaV3_5G038710 vs. NCBI nr
Match:
XP_038889683.1 (pollen-specific leucine-rich repeat extensin-like protein 3 [Benincasa hispida])
HSP 1 Score: 440.7 bits (1132), Expect = 1.1e-119
Identity = 239/318 (75.16%), Postives = 263/318 (82.70%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
M WL+ILLLLN SFFDLSEARHHRKLPSAVVVGTVFCDTC+QEKFSKTSHFISGATV V+
Sbjct: 5 MVWLIILLLLNLSFFDLSEARHHRKLPSAVVVGTVFCDTCFQEKFSKTSHFISGATVVVK 64
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGN+G PSFREEVKTDKRGEFKVNLPV VSKHVKKIEECYVEL+KSSEPYC VAATAKS
Sbjct: 65 CGNEGSRPSFREEVKTDKRGEFKVNLPVPVSKHVKKIEECYVELIKSSEPYCAVAATAKS 124
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
SSLQLKS+KQ THTFSAGFFTFKPLKQPNLC Q+PQN NT+DD K++ L PP++D P+L
Sbjct: 125 SSLQLKSKKQGTHTFSAGFFTFKPLKQPNLCKQRPQNSNTYDDTKQV-LAAPPTFDYPSL 184
Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
PSPIQ P V PRIY+NLPPLPLLPGLLP LPPLPPLPPLP LPPLP FP+FPPK K
Sbjct: 185 PSPIQNPNV---PRIYENLPPLPLLPGLLP---LPPLPPLPPLPPLPPLPGFPLFPPK-K 244
Query: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRL-----PPHVAVIGG 300
+E NAPNETPNTSEK ++F + PLR+PH VLPP RLHHH L P V
Sbjct: 245 NENNAPNETPNTSEKPNQFHPQTLLPLRRPHFSVLPPHRLHHHQPLLHRLQPAAGGVALS 304
Query: 301 EPIPNLSKISSSHKKTSP 314
P+P+L +ISS K+ SP
Sbjct: 305 LPLPDLPEISSPPKQNSP 314
BLAST of CsaV3_5G038710 vs. NCBI nr
Match:
XP_023528636.1 (proline-rich protein 4-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 353.6 bits (906), Expect = 1.8e-93
Identity = 206/335 (61.49%), Postives = 233/335 (69.55%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
M LLIL+ LNFS DLS+ARHH LPSA VVGTVFCDTC+Q+ FSKTSHFISGATVAVE
Sbjct: 1 MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGN G PSFR+EVKTDK GEFK+ LPV V+KIEECYV L++SSEPYC VAA AKS
Sbjct: 61 CGNGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKS 120
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
SSL+LKSRKQ H FSAGFFTFKPLKQP LC+ + N FDD K++ D P L
Sbjct: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQV-------VDFPGL 180
Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
P+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP FP+FPPK K
Sbjct: 181 PAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK-K 240
Query: 241 DEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP----RLPPHV 300
D++N +TP S+ D F PIP +KP R HFV+PP +L HHP PP
Sbjct: 241 DDENV--QTPKISQNPDLFHPQTLLPIPSLKPFRP--HFVMPPHKLRHHPLTHGPTPPSA 300
Query: 301 AVIGGE------------PIPNLSKISSSHKKTSP 314
A GE IPN+ +ISS K+TSP
Sbjct: 301 AAAAGELAPSPPLPFSLPSIPNMPEISSPPKQTSP 312
BLAST of CsaV3_5G038710 vs. NCBI nr
Match:
XP_022990149.1 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like [Cucurbita maxima])
HSP 1 Score: 343.6 bits (880), Expect = 1.8e-90
Identity = 199/316 (62.97%), Postives = 228/316 (72.15%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPS--AVVVGTVFCDTCYQEKFSKTSHFISGATVA 60
M LLIL+ LNFSF DLS+ARHH LPS A VVGTVFCDTC+Q+ FSKTSHFISGATVA
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVA 60
Query: 61 VECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATA 120
VECGN+G P+FREEVKTDK GEFK+ LPV V+K+EECYV L++SSEPYC VAA A
Sbjct: 61 VECGNEGSNPNFREEVKTDKTGEFKIQLPV----SVRKVEECYVRLIRSSEPYCAVAARA 120
Query: 121 KSSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIP 180
KSSSL+LKS+KQ TH FSAGFFTFKPLKQP LC+ + N FDD K++ D P
Sbjct: 121 KSSSLRLKSKKQGTHVFSAGFFTFKPLKQPKLCSHN-SHSNEFDDTKQV-------VDFP 180
Query: 181 NLPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPK 240
LP+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP FP+FPPK
Sbjct: 181 GLPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPI------FPLFPPK 240
Query: 241 EKDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP----RLPP 300
KD++N +TP S+ D F PIP +KPLR HFV+PP +L HHP PP
Sbjct: 241 -KDDENV--QTPKISQNPDLFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHPLTHGPTPP 293
BLAST of CsaV3_5G038710 vs. ExPASy TrEMBL
Match:
A0A0A0KXM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1)
HSP 1 Score: 573.5 bits (1477), Expect = 5.2e-160
Identity = 291/296 (98.31%), Postives = 291/296 (98.31%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE
Sbjct: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS
Sbjct: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL
Sbjct: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
Query: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK
Sbjct: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240
Query: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGE 297
DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPP GGE
Sbjct: 241 DEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPQSG--GGE 294
BLAST of CsaV3_5G038710 vs. ExPASy TrEMBL
Match:
A0A5A7T850 (Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G002260 PE=4 SV=1)
HSP 1 Score: 564.7 bits (1454), Expect = 2.4e-157
Identity = 285/314 (90.76%), Postives = 298/314 (94.90%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
M WLLILLLLNFSFFDLSEARHHRKLPSAV++GTVFCDTC+QEKFSKTSHFISGATVAVE
Sbjct: 1 MIWLLILLLLNFSFFDLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVE 60
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGN+G +PSFREEVKTDKRGEFKVNLPVLVSKHV+KIEECYVE +KSSEPYCDVAATAKS
Sbjct: 61 CGNRGRKPSFREEVKTDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKS 120
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKE-IQLPPPPSYDIPN 180
SSLQLKS+KQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKE IQLPPPP +D PN
Sbjct: 121 SSLQLKSKKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLPPPPPFDSPN 180
Query: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
LPSPIQ PTVP+APRIYDNLPPLPLLPGLLPLP LPPLPPLPPLP LPPLPKFPIFPPK
Sbjct: 181 LPSPIQNPTVPNAPRIYDNLPPLPLLPGLLPLPPLPPLPPLPPLPPLPPLPKFPIFPPKA 240
Query: 241 KDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLPPHVAVIGGEPIP 300
DEKNAP ETPNTSEKLDKFPIPPIKPLR+PH+FVLPPQRLHHHP+ PPHVAVIGGEPIP
Sbjct: 241 NDEKNAPIETPNTSEKLDKFPIPPIKPLRRPHYFVLPPQRLHHHPQPPPHVAVIGGEPIP 300
Query: 301 NLSKISSSHKKTSP 314
NLS ISS KKTSP
Sbjct: 301 NLSNISSPQKKTSP 314
BLAST of CsaV3_5G038710 vs. ExPASy TrEMBL
Match:
A0A6J1JRA2 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita maxima OX=3661 GN=LOC111487127 PE=4 SV=1)
HSP 1 Score: 343.6 bits (880), Expect = 8.8e-91
Identity = 199/316 (62.97%), Postives = 228/316 (72.15%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPS--AVVVGTVFCDTCYQEKFSKTSHFISGATVA 60
M LLIL+ LNFSF DLS+ARHH LPS A VVGTVFCDTC+Q+ FSKTSHFISGATVA
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVA 60
Query: 61 VECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATA 120
VECGN+G P+FREEVKTDK GEFK+ LPV V+K+EECYV L++SSEPYC VAA A
Sbjct: 61 VECGNEGSNPNFREEVKTDKTGEFKIQLPV----SVRKVEECYVRLIRSSEPYCAVAARA 120
Query: 121 KSSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIP 180
KSSSL+LKS+KQ TH FSAGFFTFKPLKQP LC+ + N FDD K++ D P
Sbjct: 121 KSSSLRLKSKKQGTHVFSAGFFTFKPLKQPKLCSHN-SHSNEFDDTKQV-------VDFP 180
Query: 181 NLPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPK 240
LP+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP FP+FPPK
Sbjct: 181 GLPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPI------FPLFPPK 240
Query: 241 EKDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP----RLPP 300
KD++N +TP S+ D F PIP +KPLR HFV+PP +L HHP PP
Sbjct: 241 -KDDENV--QTPKISQNPDLFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHPLTHGPTPP 293
BLAST of CsaV3_5G038710 vs. ExPASy TrEMBL
Match:
A0A6J1HD47 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita moschata OX=3662 GN=LOC111463006 PE=4 SV=1)
HSP 1 Score: 339.0 bits (868), Expect = 2.2e-89
Identity = 202/342 (59.06%), Postives = 232/342 (67.84%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPS-AVVVGTVFCDTCYQEKFSKTSHFISGATVAV 60
M LLIL+ LNFSF DLS+ARHH LPS A VVGTVFCDTC+Q+ FSK+SHFISGATVAV
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60
Query: 61 ECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAK 120
ECG+ G PSFR+EVKTDK GEFK+ LPV V+KIEECYV L++SSEPYC VAA AK
Sbjct: 61 ECGDGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAK 120
Query: 121 SSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPN 180
SSSL+LKSRKQ TH FSAGFFTFKPLK P LC+ + + FDD K++ D P
Sbjct: 121 SSSLKLKSRKQGTHVFSAGFFTFKPLKHPKLCSHN-SHSSEFDDTKQV-------VDFPG 180
Query: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
LP+PIQ PTVP+ PRIYDNLPPL LLPGL PLPQLPPLPPLPPLP FP+FPPK
Sbjct: 181 LPAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK- 240
Query: 241 KDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP--------- 300
KD++N +TP S+ D F PIP +KPLR HFV+PP +L HHP
Sbjct: 241 KDDENV--QTPKISQNPDMFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHPLTHGPFSPS 300
Query: 301 ---RLPPHVAV----------IGGEPIPNLSKISSSHKKTSP 314
PP A PIP + +ISS K+TSP
Sbjct: 301 FSTPTPPSAAADELAPSPPLPFSLPPIPRMPEISSPPKETSP 319
BLAST of CsaV3_5G038710 vs. ExPASy TrEMBL
Match:
A0A6J1BYU3 (proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 SV=1)
HSP 1 Score: 337.4 bits (864), Expect = 6.3e-89
Identity = 209/347 (60.23%), Postives = 230/347 (66.28%), Query Frame = 0
Query: 1 MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60
M WLLILL+LN SF DLS+ARHH+ LPSAV+VGTVFCDTC QEK SKTS FISGATVAVE
Sbjct: 1 MVWLLILLILNLSFCDLSQARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVE 60
Query: 61 CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120
CGN GP+PSFREEVKTDKRGEFKV+LPV VS+HVK IE C V L++SSE YC VAA A S
Sbjct: 61 CGNGGPKPSFREEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATS 120
Query: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180
SS +LKSR Q TH FSAGFFTFKPLK PN+C QKP + NTF DMK+ P D P L
Sbjct: 121 SSFELKSRNQGTHFFSAGFFTFKPLKHPNICTQKPYS-NTFHDMKQAL----PMLDYPAL 180
Query: 181 PSPIQIPT-VPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240
P+PI+ PT VP+ PRIYDNLPPLP LP L PLPQLPPLPP LPPLP FPIFPPK
Sbjct: 181 PTPIENPTIVPNVPRIYDNLPPLPFLPRLPPLPQLPPLPP------LPPLPGFPIFPPK- 240
Query: 241 KDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRL------------P 300
K +NAPN K +P K LR HFVLPP RL H P P
Sbjct: 241 KTVENAPN---------GKTLLPHKKHLRP--HFVLPPHRLQHPPLFPNIPSLLPFEIPP 300
Query: 301 PHV-----AVIGGE----------------PIPNLSKISSSHKKTSP 314
P V AV GG P+P L I S ++TSP
Sbjct: 301 PPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPGLPGIPSPPRQTSP 324
BLAST of CsaV3_5G038710 vs. TAIR 10
Match:
AT5G15780.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 176.4 bits (446), Expect = 3.6e-44
Identity = 147/391 (37.60%), Postives = 191/391 (48.85%), Query Frame = 0
Query: 3 WLLILLLLNFSF-FDLSEARHH---RKLPSAVVVGTVFCDTCYQEKFSKT-SHFISGATV 62
W +++ L S LS+ + H + SAVVVGTV+CDTC+ FSK+ +H ISGA V
Sbjct: 10 WFSLMIFLGISINGGLSQGQQHVMKKTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGALV 69
Query: 63 AVECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAAT 122
AVEC ++ +PSFR+EVKTDKRGEFKV LP VSKHVKKI+ C V+L+ SS+PYC +A++
Sbjct: 70 AVECIDENSKPSFRQEVKTDKRGEFKVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSIASS 129
Query: 123 AKSSSL-QLKSRK--QNTHTFSAGFFTFKPLKQPNLCNQKP------------------- 182
A SSSL +LKS +NT FSAGFFTF+P QP +C+QKP
Sbjct: 130 ATSSSLKRLKSNHHGENTRVFSAGFFTFRPENQPEICSQKPINLRGSKPLLPDPSFPPPL 189
Query: 183 QNPNTFDDMKEIQLPPP------PSYDIPNLPSPIQIPTVPSAPRIYDNL---------- 242
Q+P + + + PP P +P+LP P+ P +P P+ +L
Sbjct: 190 QDPPNPSPLPNLPIVPPLPNLPVPKLPVPDLPLPLVPPLLPPGPQKSASLHNKKSDSLKD 249
Query: 243 -----------------------PPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFP 302
PP PL+P +P P LPP P +P P+LPP+P P P
Sbjct: 250 KKTEALKPNFFFPPNPLNPPSIIPPNPLIPS-IPTPTLPPNPLIPSPPSLPPIPLIPT-P 309
Query: 303 PKEKDEKNAPNETPNTSEKLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLP--------- 309
P P T + P P P+ P V PP P P
Sbjct: 310 PTLPTIPLLPTPPTPTLPPIPTIPTLPPLPVLPPVPIVNPPSLPPPPPSFPVPLPPVPGL 369
BLAST of CsaV3_5G038710 vs. TAIR 10
Match:
AT5G13140.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 48.9 bits (115), Expect = 8.6e-06
Identity = 67/234 (28.63%), Postives = 103/234 (44.02%), Query Frame = 0
Query: 31 VVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVK------TDKRGEFKV 90
VVG V+CDTC FS+ S+F+ G V V C K P EEV T++ G +K+
Sbjct: 41 VVGVVYCDTCSINTFSRQSYFLQGVEVHVTCRFKASSPKTAEEVNISVNRTTNRSGVYKL 100
Query: 91 NLPVLVSKHVKKIE---------ECYVELVKSSEP---YCDVAA-TAKSSSLQLKSRKQN 150
+P HV I+ +C +++K+S C + ++ + +KS++
Sbjct: 101 EIP-----HVDGIDCVDGIAISSQCSAKILKTSSDDNGGCSIPVFQTATNEVSIKSKQDR 160
Query: 151 THTFSAGFFTFK-PLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPTVP 210
+S ++K P K +LC + + D+ E + + P L +P P
Sbjct: 161 VCIYSLSALSYKPPHKNTSLCGNGGKKHHRKDEKVEKKF-RDSKFFWPYL-APYWFPWP- 220
Query: 211 SAPRIYDNLPPLPLLPGL--LPLPQLP---PLPPLP------PLPTLPPLPKFP 234
Y +LPPLP LP P P LP P LP P+ +P LP+FP
Sbjct: 221 -----YPDLPPLPTLPPFPSFPFPSLPFGNPNLALPAFDWKNPVTWIPYLPRFP 261
BLAST of CsaV3_5G038710 vs. TAIR 10
Match:
AT2G16630.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 48.1 bits (113), Expect = 1.5e-05
Identity = 61/221 (27.60%), Postives = 97/221 (43.89%), Query Frame = 0
Query: 29 AVVVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVKTDKRGEFKVNLPV 88
A V G+VFCD C + S +SG ++V C ++ + E T+ G + V
Sbjct: 28 ATVTGSVFCDQCKDGERSLFDFPVSGIKISVTCADENGQVYMSREETTNWLGGY-----V 87
Query: 89 LVSKHVKKIEECYVEL----VKSSEPYCDVAATAKSSSLQLKSRKQNTHTFSAGFFTFKP 148
+ + CY ++ V+ C + A+ + L+L TF+A +P
Sbjct: 88 MRFDGTPDLSNCYAQVSDNGVQQDPSSCSI-ASGPAQKLKLMFSFFGIETFAADALLAQP 147
Query: 149 LKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPTV----------PSAPR 208
++ + C + P P M Q+P P +P P P ++P + P P
Sbjct: 148 VQPSSFCPKPPTAP----VMPPPQVPVMPPPQVPVKPHP-KVPVISPDPPATLPPPKVPV 207
Query: 209 IYDNLPPLPLLPGLLPLPQLPPL--PP---LPPLPTLPPLP 231
I + PP L P L+P+ LPP+ PP LPPLP +PP+P
Sbjct: 208 ISPD-PPTTLPPPLVPVINLPPVTSPPQFKLPPLPQIPPMP 236
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_011656281.2 | 2.6e-174 | 100.00 | proline-rich protein 4 [Cucumis sativus] >KAE8648949.1 hypothetical protein Csa_... | [more] |
KAA0039420.1 | 5.0e-157 | 90.76 | major pollen allergen Lol p 11 [Cucumis melo var. makuwa] >TYK00608.1 major poll... | [more] |
XP_038889683.1 | 1.1e-119 | 75.16 | pollen-specific leucine-rich repeat extensin-like protein 3 [Benincasa hispida] | [more] |
XP_023528636.1 | 1.8e-93 | 61.49 | proline-rich protein 4-like [Cucurbita pepo subsp. pepo] | [more] |
XP_022990149.1 | 1.8e-90 | 62.97 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0KXM2 | 5.2e-160 | 98.31 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1 | [more] |
A0A5A7T850 | 2.4e-157 | 90.76 | Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... | [more] |
A0A6J1JRA2 | 8.8e-91 | 62.97 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
A0A6J1HD47 | 2.2e-89 | 59.06 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
A0A6J1BYU3 | 6.3e-89 | 60.23 | proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 ... | [more] |
Match Name | E-value | Identity | Description | |
AT5G15780.1 | 3.6e-44 | 37.60 | Pollen Ole e 1 allergen and extensin family protein | [more] |
AT5G13140.1 | 8.6e-06 | 28.63 | Pollen Ole e 1 allergen and extensin family protein | [more] |
AT2G16630.1 | 1.5e-05 | 27.60 | Pollen Ole e 1 allergen and extensin family protein | [more] |