Cp4.1LG01g22370 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g22370
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPollen Ole e 1 allergen/extensin
LocationCp4.1LG01 : 20236262 .. 20238178 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCTCACCTCAAACAACATCAACACTGCCATCTAAAAATGATTTGCCTCCTCATTTTAATCGCTCTCAACTTCAGTCTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCGCCGCAGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGACAAGTCACTTCATTTCAGGTACTTTACCAACTAGCTCGTAGAGTTAATATGATGTTGGTTTGTTCATATATTGATGTGTACTATTTAGGCGCGACGGTGGCTGTCGAATGTGGCAATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCATGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACAGCCAAAACTTTGCAGCCATAACTCACATTTTAATGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAACGTTCCTCGGATTTACGATAACCTTCCGCCTCTACCTCTTCTTCCTGGACTTCCTCCATTGCCTCAACTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACCTGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTCAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGCCACCATCCTCTCACGCATGGTCCTACACCGCCCTCAGCCGCTGCCGCGGCGGGCGAGTTGGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCATCCATCCCTAACATGCCCGAGATCTCCTCACCTCCAAAGCAAACTTCTCCTTAAAAAATTATAACGACCCTAGTTTTCGAAGGACCATCGTCACAAAAATAAAACTTTAAATTAGTAATTTTTACTTGAACTGATGTACTTATTTTCTAGTACGAATTAATAATCGTGAAGGAAATTATCAATGAATTTGAGAGTAAATTGTGGGATTTGACGTTTTGGATAAGATTGATCATATAATTATTCAAATGTCAAATAGATTAAATTGGACGATTGATCCAATTGCTTTTGTTTGTTATAGATTTTGTTCTTTTGTGATCAATAATTTGAATACGTTATATTATTTTTCTAAATTTTGTTTAAAAAAAAAAAAAAATCGTTGTAAAAGGATATAAATTCGGACCTTAGAAAAAAAGAAGTGAAATCTTTTCCAAGCAAGAACAAAATTATTTTATCCTTTTTGGCACGAGACAAATTATACATTTGATTGGAGAGGGGATAATAAATGTTGTTATCGGTTGATGAAGATTCAAATCCCGAAAATAAATTTAGGTTGGAATTGGTACCAAATTCTAAATTGGTAGCAGTTCTTGGGGGAGTAGAACATAGTGGAGATTAAATTAGAGCGTTCTATCGAACAAAACCATCGAAGCAAAGGAGTCACCGGGAGCGCGATATAGAGATAAACAGAGGGAAAGAGGGGAAGAGGGCAGTAGCTTTATTTTCGCGGAATTCGTAGCTTTAATTTCATCGCCTTCAATTCCTTTTTCCAATCTAAACGCCCCTTCAAACGCGCAATCTTGATTTAGCTTGAAATCCCAATTATCTGCGTGCGTGATTTTTCCCAAAAAGTGCACGTATATATTTTCCGTATATTCTCCCAATTTCCGTCTTCCAGGTTTCTCTGTCCTCCACCCACCCACCCAC

mRNA sequence

TTCCTCACCTCAAACAACATCAACACTGCCATCTAAAAATGATTTGCCTCCTCATTTTAATCGCTCTCAACTTCAGTCTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCGCCGCAGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGACAAGTCACTTCATTTCAGGCGCGACGGTGGCTGTCGAATGTGGCAATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCATGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACAGCCAAAACTTTGCAGCCATAACTCACATTTTAATGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAACGTTCCTCGGATTTACGATAACCTTCCGCCTCTACCTCTTCTTCCTGGACTTCCTCCATTGCCTCAACTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACCTGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTCAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGCCACCATCCTCTCACGCATGGTCCTACACCGCCCTCAGCCGCTGCCGCGGCGGGCGAGTTGGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCATCCATCCCTAACATGCCCGAGATCTCCTCACCTCCAAAGCAAACTTCTCCTTAAAAAATTATAACGACCCTAGTTTTCGAAGGACCATCGTCACAAAAATAAAACTTTAAATTAGTAATTTTTACTTGAACTGATGTACTTATTTTCTAGTACGAATTAATAATCGTGAAGGAAATTATCAATGAATTTGAGAGTAAATTGTGGGATTTGACGTTTTGGATAAGATTGATCATATAATTATTCAAATGTCAAATAGATTAAATTGGACGATTGATCCAATTGCTTTTGTTTGTTATAGATTTTGTTCTTTTGTGATCAATAATTTGAATACGTTATATTATTTTTCTAAATTTTGTTTAAAAAAAAAAAAAAATCGTTGTAAAAGGATATAAATTCGGACCTTAGAAAAAAAGAAGTGAAATCTTTTCCAAGCAAGAACAAAATTATTTTATCCTTTTTGGCACGAGACAAATTATACATTTGATTGGAGAGGGGATAATAAATGTTGTTATCGGTTGATGAAGATTCAAATCCCGAAAATAAATTTAGGTTGGAATTGGTACCAAATTCTAAATTGGTAGCAGTTCTTGGGGGAGTAGAACATAGTGGAGATTAAATTAGAGCGTTCTATCGAACAAAACCATCGAAGCAAAGGAGTCACCGGGAGCGCGATATAGAGATAAACAGAGGGAAAGAGGGGAAGAGGGCAGTAGCTTTATTTTCGCGGAATTCGTAGCTTTAATTTCATCGCCTTCAATTCCTTTTTCCAATCTAAACGCCCCTTCAAACGCGCAATCTTGATTTAGCTTGAAATCCCAATTATCTGCGTGCGTGATTTTTCCCAAAAAGTGCACGTATATATTTTCCGTATATTCTCCCAATTTCCGTCTTCCAGGTTTCTCTGTCCTCCACCCACCCACCCAC

Coding sequence (CDS)

ATGATTTGCCTCCTCATTTTAATCGCTCTCAACTTCAGTCTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCGCCGCAGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGACAAGTCACTTCATTTCAGGCGCGACGGTGGCTGTCGAATGTGGCAATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCATGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACAGCCAAAACTTTGCAGCCATAACTCACATTTTAATGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAACGTTCCTCGGATTTACGATAACCTTCCGCCTCTACCTCTTCTTCCTGGACTTCCTCCATTGCCTCAACTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACCTGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTCAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGCCACCATCCTCTCACGCATGGTCCTACACCGCCCTCAGCCGCTGCCGCGGCGGGCGAGTTGGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCATCCATCCCTAACATGCCCGAGATCTCCTCACCTCCAAAGCAAACTTCTCCTTAA

Protein sequence

MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPSPPLPFSLPSIPNMPEISSPPKQTSP
BLAST of Cp4.1LG01g22370 vs. TrEMBL
Match: A0A0A0KXM2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G642130 PE=4 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 1.7e-90
Identity = 194/305 (63.61%), Postives = 219/305 (71.80%), Query Frame = 1

Query: 1   MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
           M  LLIL+ LNFS  DLS+ARHH  LPSA VVGTVFCDTC+Q+ FSKTSHFISGATVAVE
Sbjct: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60

Query: 61  CGNGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKS 120
           CGN G  PSFR+EVKTDK GEFK+ LPV     V+KIEECYV L++SSEPYC VAA AKS
Sbjct: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120

Query: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQV-------VDFPGL 180
           SSL+LKSRKQ  H FSAGFFTFKPLKQP LC+    + N FDD K++        D P L
Sbjct: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180

Query: 181 PAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK-K 240
           P+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP       FP+FPPK K
Sbjct: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240

Query: 241 DDENV--QTPKISQNPDLFHPQTLLPIPSLKPFRP--HFVMPPHKLRHHPLTHGPTPPSA 283
           D++N   +TP  S+  D F      PIP +KP R   HFV+PP +L HHP      PP +
Sbjct: 241 DEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP----RLPPQS 295

BLAST of Cp4.1LG01g22370 vs. TrEMBL
Match: B9SIV7_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0790500 PE=4 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 1.2e-59
Identity = 153/333 (45.95%), Postives = 195/333 (58.56%), Query Frame = 1

Query: 17  LSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVKT 76
           LS+A H+  LPSA VVGTV+CDTCF + FSK SHFISGATVAVEC +   N SF  EVKT
Sbjct: 17  LSEASHNKKLPSAVVVGTVYCDTCFHEDFSKNSHFISGATVAVECKD--ENSSFHQEVKT 76

Query: 77  DKTGEFKIQLPVSV----RKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGMHVFS 136
           D+ GEF++ LP SV    ++I++C V+L+ SSEPYCAVA+ A SSSL+LKSRKQG+H+FS
Sbjct: 77  DEHGEFRVHLPFSVGKHVKRIKKCSVKLLSSSEPYCAVASTATSSSLRLKSRKQGLHIFS 136

Query: 137 AGFFTFKPLKQPKLCSHNSHF---NEFDDTK----------------------QVVDFPG 196
           AGFF+FKP KQP LC+         EF+  K                       + + P 
Sbjct: 137 AGFFSFKPQKQPNLCNQKPSIQDSKEFNSKKISSIPTIGAGSIPSVSSPLQDPTIPNLPP 196

Query: 197 LPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQ 256
           +  P+Q+PT+PN+P +  +  PLP LP LPPLPQLPPLPPLP LP FP  P K   E   
Sbjct: 197 VSPPLQDPTIPNLPPVNQHFFPLPFLPQLPPLPQLPPLPPLPGLPKFPPIPGKTTKEVKT 256

Query: 257 TPKI--------SQNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAA 308
           +P+          + PD F P   L  P+  PF+P  ++PP+ L+  PL     PP+   
Sbjct: 257 SPESVKKTPESGEEQPDFFFPTPPLFPPN--PFQPPPILPPNPLQPPPLIPPLLPPNPFQ 316

BLAST of Cp4.1LG01g22370 vs. TrEMBL
Match: A0A067FY85_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019020mg PE=4 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 7.5e-59
Identity = 165/347 (47.55%), Postives = 197/347 (56.77%), Query Frame = 1

Query: 4   LLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGN 63
           L+  ++     L+     H    PSA VVGTV+CDTCFQD FSK SHFISGA+VAVEC +
Sbjct: 8   LIFFLSFIIGNLEAESNHHEKRHPSAVVVGTVYCDTCFQDNFSKASHFISGASVAVECKD 67

Query: 64  GGSNPSFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSL 123
             S PSFR EVKTD+ GEFK+ LP S    V+KI  C V+LI SSEPYC VA+ A SSSL
Sbjct: 68  ETSKPSFRQEVKTDEHGEFKVDLPFSVSKHVKKINRCSVKLINSSEPYCGVASTATSSSL 127

Query: 124 KLKSRKQGMHVFSAGFFTFKPLKQPKLCS------HNSHFNEFDDTKQVVDFPG-LPAPI 183
            LKSRKQG+H+FSAGFFTFKPLKQP LC+      +++  N  + +    D P   P PI
Sbjct: 128 HLKSRKQGIHIFSAGFFTFKPLKQPNLCNQKPSLENSTSLNSEEASLPPFDSPSTFPPPI 187

Query: 184 QNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLF----PPKKDDENVQT 243
           Q+PT+P         PP+P LP LP +PQLPPLP LP LP  P      P KK +E  + 
Sbjct: 188 QDPTMP-------EFPPMPQLPRLPAMPQLPPLPSLPGLPFLPPMPGKTPEKKPEEISRE 247

Query: 244 PKISQN----PDLFHPQTLLPIPSLKPFR---------PHFVMPPHKLRHHPL-THGP-- 303
            K+S+     P LF    L PIP L P           P  V+PP+ L+  PL    P  
Sbjct: 248 TKLSEEKLGPPRLFDIPPLPPIPFLPPISILPPNPLQPPSPVLPPNPLQPPPLFPPNPLL 307

Query: 304 TPPS---AAAAAGELAPSPPLPFSLPSI----PNMPEISSPPKQTSP 313
            PPS          L P PP PF LP      P +P  SS PK  SP
Sbjct: 308 PPPSPLIPLPPVPGLTPPPPFPFPLPPFPPLSPGIPPASSSPKNPSP 347

BLAST of Cp4.1LG01g22370 vs. TrEMBL
Match: V4UWT1_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10012100mg PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 3.7e-58
Identity = 165/347 (47.55%), Postives = 196/347 (56.48%), Query Frame = 1

Query: 4   LLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGN 63
           L+  ++     L+     H    PSA VVGTV+CDTCFQD FSK SHFISGA+VAVEC +
Sbjct: 8   LIFFLSFIIGNLEAESNHHEKRHPSAIVVGTVYCDTCFQDNFSKASHFISGASVAVECKD 67

Query: 64  GGSNPSFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSL 123
             S PSFR EVKTD+ GEFK+ LP S    V+KI  C V+LI SSEPYC VA+ A SSSL
Sbjct: 68  ETSKPSFRQEVKTDEHGEFKVDLPFSVSKHVKKINRCSVKLINSSEPYCGVASTATSSSL 127

Query: 124 KLKSRKQGMHVFSAGFFTFKPLKQPKLCS------HNSHFNEFDDTKQVVDFPG-LPAPI 183
            LKSRKQG+H+FSAGFFTFKPLKQP LC+      +++  N  + +    D P   P PI
Sbjct: 128 HLKSRKQGIHIFSAGFFTFKPLKQPNLCNQKPSLENSTSLNSEEASLPPFDSPSTFPPPI 187

Query: 184 QNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLF----PPKKDDENVQT 243
           Q+PT+P         PP+P LP LP +PQLPPLP LP LP  P      P KK +E  + 
Sbjct: 188 QDPTMP-------EFPPMPQLPRLPAMPQLPPLPSLPGLPFLPPMPGKTPEKKPEEISRE 247

Query: 244 PKISQN----PDLFHPQTLLPIPSLKPFR---------PHFVMPPHKLRHHPL-THGP-- 303
            K S+     P LF    L PIP L P           P  V+PP+ L+  PL    P  
Sbjct: 248 TKPSEEKLGPPRLFDIPPLPPIPFLPPISILPPNPLQPPSPVLPPNPLQPPPLFPPNPLL 307

Query: 304 TPPS---AAAAAGELAPSPPLPFSLPSI----PNMPEISSPPKQTSP 313
            PPS          L P PP PF LP      P +P  SS PK  SP
Sbjct: 308 PPPSPLIPLPPVPGLTPPPPFPFPLPPFPPLSPGIPPASSSPKNPSP 347

BLAST of Cp4.1LG01g22370 vs. TrEMBL
Match: U5FLV7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s13490g PE=4 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 1.1e-57
Identity = 154/320 (48.12%), Postives = 194/320 (60.62%), Query Frame = 1

Query: 4   LLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGN 63
           ++ L++  F+ L  ++A H   LPSA VVGTVFCDTCFQ+ FS+ SHFISGA+VAVEC +
Sbjct: 6   IIFLLSCTFNNLS-AEASHGKKLPSAVVVGTVFCDTCFQEAFSRNSHFISGASVAVECKD 65

Query: 64  GGSNPSFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSL 123
             S P FR+EVKTD+ GEFK+ LP S    V+KI+ C V+L+ SSEP+CAVA+ A SSSL
Sbjct: 66  EESRPGFREEVKTDEHGEFKVHLPFSVSKHVKKIKRCSVKLLSSSEPFCAVASSATSSSL 125

Query: 124 KLKSRKQGMHVFSAGFFTFKPLKQPKLCSH---NSHFNEFDDTK---QVVDFPGLPAPIQ 183
            LKSRKQG H+FS+GFFTFKP KQP LC+      +  EF   K     +D P  P P+Q
Sbjct: 126 HLKSRKQGTHIFSSGFFTFKPEKQPILCNQKPSTENSREFSSRKASLPSIDNPTFPPPLQ 185

Query: 184 NPTVPNVPRIYDN-LPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFP----PKKDDENVQT 243
           +PT P +P +  N LPPLP+LP LPPLPQLPPLPPLP LP+ P  P      K  E+  +
Sbjct: 186 DPTTPYLPPLNQNYLPPLPVLPKLPPLPQLPPLPPLPGLPLLPPIPGNTKKTKTSESFAS 245

Query: 244 PKISQNPDLFHP-QTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPS 303
             +     + HP Q   P P L         PP+  +  PL     PP+      +  PS
Sbjct: 246 TTLPDQKAVHHPNQFSYPTPPL--------FPPNTFQLPPL----FPPNPI----QPPPS 305

Query: 304 PPLPFSLPSIPNMPEISSPP 308
           P  PF  P IP +     PP
Sbjct: 306 PLFPF--PPIPGLTPPPPPP 306

BLAST of Cp4.1LG01g22370 vs. TAIR10
Match: AT5G15780.1 (AT5G15780.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 175.3 bits (443), Expect = 6.1e-44
Identity = 136/322 (42.24%), Postives = 175/322 (54.35%), Query Frame = 1

Query: 17  LSQARHH---NNLPSAAVVGTVFCDTCFQDTFSKT-SHFISGATVAVECGNGGSNPSFRD 76
           LSQ + H       SA VVGTV+CDTCF   FSK+ +H ISGA VAVEC +  S PSFR 
Sbjct: 25  LSQGQQHVMKKTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGALVAVECIDENSKPSFRQ 84

Query: 77  EVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLK-LKSRKQG 136
           EVKTDK GEFK++LP S    V+KI+ C V+L+ SS+PYC++A+ A SSSLK LKS   G
Sbjct: 85  EVKTDKRGEFKVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSIASSATSSSLKRLKSNHHG 144

Query: 137 --MHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYD 196
               VFSAGFFTF+P  QP++CS          +K ++  P  P P+Q+P  P+      
Sbjct: 145 ENTRVFSAGFFTFRPENQPEICSQKP--INLRGSKPLLPDPSFPPPLQDPPNPS------ 204

Query: 197 NLPPLPLLPGLPPLPQLP----PLPPLPPLPVFPLFPP----------KKDDENVQTPKI 256
              PLP LP +PPLP LP    P+P LP   V PL PP          KK D        
Sbjct: 205 ---PLPNLPIVPPLPNLPVPKLPVPDLPLPLVPPLLPPGPQKSASLHNKKSDSLKDKKTE 264

Query: 257 SQNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPT-PPSAAAAAGELAPSPPL 313
           +  P+ F P          P  P  ++PP+ L   P    PT PP+    +    P  PL
Sbjct: 265 ALKPNFFFPP--------NPLNPPSIIPPNPL--IPSIPTPTLPPNPLIPSPPSLPPIPL 324

BLAST of Cp4.1LG01g22370 vs. TAIR10
Match: AT5G13140.1 (AT5G13140.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 59.7 bits (143), Expect = 3.7e-09
Identity = 57/205 (27.80%), Postives = 95/205 (46.34%), Query Frame = 1

Query: 31  VVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVK------TDKTGEFKI 90
           VVG V+CDTC  +TFS+ S+F+ G  V V C    S+P   +EV       T+++G +K+
Sbjct: 41  VVGVVYCDTCSINTFSRQSYFLQGVEVHVTCRFKASSPKTAEEVNISVNRTTNRSGVYKL 100

Query: 91  QLP--------VSVRKIEECYVRLIRSSEP---YCAVAA-RAKSSSLKLKSRKQGMHVFS 150
           ++P          +    +C  +++++S      C++   +  ++ + +KS++  + ++S
Sbjct: 101 EIPHVDGIDCVDGIAISSQCSAKILKTSSDDNGGCSIPVFQTATNEVSIKSKQDRVCIYS 160

Query: 151 AGFFTFK-PLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYDNLPPLP 210
               ++K P K   LC +    +   D K    F               P  Y +LPPLP
Sbjct: 161 LSALSYKPPHKNTSLCGNGGKKHHRKDEKVEKKFRDSKFFWPYLAPYWFPWPYPDLPPLP 220

Query: 211 LLPGLP--PLPQLPPLPPLPPLPVF 215
            LP  P  P P LP   P   LP F
Sbjct: 221 TLPPFPSFPFPSLPFGNPNLALPAF 245

BLAST of Cp4.1LG01g22370 vs. TAIR10
Match: AT5G47635.1 (AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 50.8 bits (120), Expect = 1.7e-06
Identity = 35/114 (30.70%), Postives = 53/114 (46.49%), Query Frame = 1

Query: 26  LPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVKTDKTGEFKIQ 85
           L S  + G++ CDT      S     I GATVA++C  G    S   +  TD+ GEF+I 
Sbjct: 43  LSSVVITGSLLCDTSRPHLHSIP---IPGATVAIKCHTGSKRRSKWIKAVTDELGEFEID 102

Query: 86  LPVSVRKI----EECYVRLIRSSEPY-CAVAARAKSSSLKLKSRKQGMHVFSAG 135
           LP  +  I      C+++ +    PY C   +      +KL S   G  V+++G
Sbjct: 103 LPSQLHAIPHLENTCFIKPVYVPRPYRCYNTSTNIHKPIKLVSSTNGFRVYTSG 153

BLAST of Cp4.1LG01g22370 vs. TAIR10
Match: AT1G29140.1 (AT1G29140.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 49.7 bits (117), Expect = 3.9e-06
Identity = 29/101 (28.71%), Postives = 51/101 (50.50%), Query Frame = 1

Query: 9   ALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVEC-GNGGSN 68
           AL F+ L    A   ++     + G+V+CDTC     ++ S F+ GA V +EC G     
Sbjct: 12  ALCFTTLVHFTAADADDFDKFHIKGSVYCDTCRVQFITRISKFLEGAKVKLECKGRENQT 71

Query: 69  PSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYC 109
            +   E  TD  G +++++ +   + E C + L++S +P C
Sbjct: 72  VTLTKEAVTDNAGNYQMEV-MGDHEEEVCEIVLLQSPDPEC 111

BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match: gi|778708779|ref|XP_011656281.1| (PREDICTED: proline-rich protein 4-like [Cucumis sativus])

HSP 1 Score: 340.9 bits (873), Expect = 2.4e-90
Identity = 194/305 (63.61%), Postives = 219/305 (71.80%), Query Frame = 1

Query: 1   MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
           M  LLIL+ LNFS  DLS+ARHH  LPSA VVGTVFCDTC+Q+ FSKTSHFISGATVAVE
Sbjct: 1   MSWLLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVE 60

Query: 61  CGNGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKS 120
           CGN G  PSFR+EVKTDK GEFK+ LPV     V+KIEECYV L++SSEPYC VAA AKS
Sbjct: 61  CGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKS 120

Query: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQV-------VDFPGL 180
           SSL+LKSRKQ  H FSAGFFTFKPLKQP LC+    + N FDD K++        D P L
Sbjct: 121 SSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNL 180

Query: 181 PAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK-K 240
           P+PIQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP       FP+FPPK K
Sbjct: 181 PSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEK 240

Query: 241 DDENV--QTPKISQNPDLFHPQTLLPIPSLKPFRP--HFVMPPHKLRHHPLTHGPTPPSA 283
           D++N   +TP  S+  D F      PIP +KP R   HFV+PP +L HHP      PP +
Sbjct: 241 DEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP----RLPPQS 295

BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match: gi|645248450|ref|XP_008230302.1| (PREDICTED: proline-rich receptor-like protein kinase PERK2 isoform X2 [Prunus mume])

HSP 1 Score: 243.0 bits (619), Expect = 6.7e-61
Identity = 163/326 (50.00%), Postives = 191/326 (58.59%), Query Frame = 1

Query: 13  SLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRD 72
           SL+  S+A H    PSA VVGTV+CDTCFQ  FS  SHFISGA+V VEC +G S PSF+ 
Sbjct: 15  SLVTPSEASHEKKHPSAVVVGTVYCDTCFQAEFSHASHFISGASVGVECKDGSSKPSFQT 74

Query: 73  EVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGM 132
           EVKTD  G F++QLP S    V+KIE C V+LI SSEPYCAVA+ A SSSL LKSRKQG 
Sbjct: 75  EVKTDSHGVFRVQLPFSVSKHVKKIEGCSVKLISSSEPYCAVASTATSSSLHLKSRKQGT 134

Query: 133 HVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQVVDFP-----GLPAPIQNPTVPNVPR 192
           H+FSAGFFTFKPLKQP LC+   S  N  + + Q + FP       P P QNPT+P    
Sbjct: 135 HIFSAGFFTFKPLKQPSLCNQKPSIQNSKEFSSQKISFPPTDELTFPPPTQNPTIP---- 194

Query: 193 IYDNLPPLPLLPGLPPLPQLPPLPPLPPL--------------PVFPLFPPKKDDENVQT 252
              +LPPLP LP LPPLPQLPPLPPLP L              PV P  P K  +    T
Sbjct: 195 ---DLPPLPTLPYLPPLPQLPPLPPLPGLPGIPIPGIPGIPGIPVLPPIPGKTTEAGQLT 254

Query: 253 PKISQNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPSP 312
            K   +PD F P          PF+P  ++PP     +PL   PTP         L P+P
Sbjct: 255 DKKVAHPDAFFPP--------NPFQPPSILPP-----NPLVPQPTP---------LIPNP 311

BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match: gi|1009127424|ref|XP_015880688.1| (PREDICTED: proline-rich protein 4 [Ziziphus jujuba])

HSP 1 Score: 239.6 bits (610), Expect = 7.4e-60
Identity = 169/345 (48.99%), Postives = 202/345 (58.55%), Query Frame = 1

Query: 1   MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
           M  LL +     +   LS+AR   N  +A VVGTV+CDTCFQ  FSK SHFISGA+VAVE
Sbjct: 1   MFYLLKIFFFILTFTYLSEARPQKNPSTAVVVGTVYCDTCFQQDFSKDSHFISGASVAVE 60

Query: 61  CGNGGSNP-SFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAK 120
           C +G SN  SFR EVKTD  GEFK+QLP S    V+KIE C V+LI SSEPYCAVA+ A 
Sbjct: 61  CKDGTSNETSFRKEVKTDNHGEFKVQLPFSIGKHVKKIEGCSVKLISSSEPYCAVASTAT 120

Query: 121 SSSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCS------HNSHFNEFDDTKQVVDFPGLP 180
            SSL LKSRKQG+H+FSAGFFTFKPLKQP LC+      ++   N    +   VD    P
Sbjct: 121 KSSLHLKSRKQGIHIFSAGFFTFKPLKQPNLCNQKPSIENSKGLNSNKASLPPVDDLSFP 180

Query: 181 APIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDEN-VQT 240
            PIQ+PT+P +P  +  LPPLP LP LPPLP LPPLPPLP LP FP    K + E+ V T
Sbjct: 181 PPIQDPTIPGLPP-FQYLPPLPTLPQLPPLPTLPPLPPLPGLPKFPPAQGKTNTESKVPT 240

Query: 241 PKISQNPDL-----FHPQTLLPIPSL------KPFRPHFVMP----PHKLRHHPLTHGPT 300
            K SQ   L      +P+   P+P +       PF+P  ++P    P  L  +P    P 
Sbjct: 241 EKSSQKSQLSDEKVVNPEFFFPVPPILPPLIPNPFQPPPLIPNPFQPPPLIPNPFQPPPA 300

Query: 301 PPSAAAAAGELAPSPP---LPFSLPSI----PNMPEISSPPKQTS 312
           P         L PSPP   LPF  P I    P +P I   P  +S
Sbjct: 301 PLLPFPPIPGLTPSPPPPSLPFPFPPIIPFPPTIPRIPGTPPASS 344

BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match: gi|659118972|ref|XP_008459405.1| (PREDICTED: major pollen allergen Lol p 11 [Cucumis melo])

HSP 1 Score: 238.8 bits (608), Expect = 1.3e-59
Identity = 120/171 (70.18%), Postives = 135/171 (78.95%), Query Frame = 1

Query: 1   MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
           MI LLIL+ LNFS  DLS+ARHH  LPSA ++GTVFCDTCFQ+ FSKTSHFISGATVAVE
Sbjct: 1   MIWLLILLLLNFSFFDLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVE 60

Query: 61  CGNGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKS 120
           CGN G  PSFR+EVKTDK GEFK+ LPV     V KIEECYV  I+SSEPYC VAA AKS
Sbjct: 61  CGNRGRKPSFREEVKTDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKS 120

Query: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQVVDFP 167
           SSL+LKS+KQ  H FSAGFFTFKPLKQP LC+    + N FDD K+++  P
Sbjct: 121 SSLQLKSKKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLP 171

BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match: gi|255569926|ref|XP_002525926.1| (PREDICTED: proline-rich protein 4 [Ricinus communis])

HSP 1 Score: 238.4 bits (607), Expect = 1.7e-59
Identity = 153/333 (45.95%), Postives = 195/333 (58.56%), Query Frame = 1

Query: 17  LSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVKT 76
           LS+A H+  LPSA VVGTV+CDTCF + FSK SHFISGATVAVEC +   N SF  EVKT
Sbjct: 17  LSEASHNKKLPSAVVVGTVYCDTCFHEDFSKNSHFISGATVAVECKD--ENSSFHQEVKT 76

Query: 77  DKTGEFKIQLPVSV----RKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGMHVFS 136
           D+ GEF++ LP SV    ++I++C V+L+ SSEPYCAVA+ A SSSL+LKSRKQG+H+FS
Sbjct: 77  DEHGEFRVHLPFSVGKHVKRIKKCSVKLLSSSEPYCAVASTATSSSLRLKSRKQGLHIFS 136

Query: 137 AGFFTFKPLKQPKLCSHNSHF---NEFDDTK----------------------QVVDFPG 196
           AGFF+FKP KQP LC+         EF+  K                       + + P 
Sbjct: 137 AGFFSFKPQKQPNLCNQKPSIQDSKEFNSKKISSIPTIGAGSIPSVSSPLQDPTIPNLPP 196

Query: 197 LPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQ 256
           +  P+Q+PT+PN+P +  +  PLP LP LPPLPQLPPLPPLP LP FP  P K   E   
Sbjct: 197 VSPPLQDPTIPNLPPVNQHFFPLPFLPQLPPLPQLPPLPPLPGLPKFPPIPGKTTKEVKT 256

Query: 257 TPKI--------SQNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAA 308
           +P+          + PD F P   L  P+  PF+P  ++PP+ L+  PL     PP+   
Sbjct: 257 SPESVKKTPESGEEQPDFFFPTPPLFPPN--PFQPPPILPPNPLQPPPLIPPLLPPNPFQ 316

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KXM2_CUCSA1.7e-9063.61Uncharacterized protein OS=Cucumis sativus GN=Csa_5G642130 PE=4 SV=1[more]
B9SIV7_RICCO1.2e-5945.95Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0790500 PE=4 SV=1[more]
A0A067FY85_CITSI7.5e-5947.55Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019020mg PE=4 SV=1[more]
V4UWT1_9ROSI3.7e-5847.55Uncharacterized protein OS=Citrus clementina GN=CICLE_v10012100mg PE=4 SV=1[more]
U5FLV7_POPTR1.1e-5748.13Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s13490g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G15780.16.1e-4442.24 Pollen Ole e 1 allergen and extensin family protein[more]
AT5G13140.13.7e-0927.80 Pollen Ole e 1 allergen and extensin family protein[more]
AT5G47635.11.7e-0630.70 Pollen Ole e 1 allergen and extensin family protein[more]
AT1G29140.13.9e-0628.71 Pollen Ole e 1 allergen and extensin family protein[more]
Match NameE-valueIdentityDescription
gi|778708779|ref|XP_011656281.1|2.4e-9063.61PREDICTED: proline-rich protein 4-like [Cucumis sativus][more]
gi|645248450|ref|XP_008230302.1|6.7e-6150.00PREDICTED: proline-rich receptor-like protein kinase PERK2 isoform X2 [Prunus mu... [more]
gi|1009127424|ref|XP_015880688.1|7.4e-6048.99PREDICTED: proline-rich protein 4 [Ziziphus jujuba][more]
gi|659118972|ref|XP_008459405.1|1.3e-5970.18PREDICTED: major pollen allergen Lol p 11 [Cucumis melo][more]
gi|255569926|ref|XP_002525926.1|1.7e-5945.95PREDICTED: proline-rich protein 4 [Ricinus communis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g22370.1Cp4.1LG01g22370.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31614FAMILY NOT NAMEDcoord: 185..220
score: 1.2E-75coord: 1..150
score: 1.2
NoneNo IPR availablePANTHERPTHR31614:SF9SUBFAMILY NOT NAMEDcoord: 185..220
score: 1.2E-75coord: 1..150
score: 1.2
NoneNo IPR availablePFAMPF01190Pollen_Ole_e_Icoord: 31..115
score: 9.0

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG01g22370Cucumber (Chinese Long) v3cpecucB0472
Cp4.1LG01g22370Wax gourdcpewgoB0502
Cp4.1LG01g22370Cucurbita pepo (Zucchini)cpecpeB199
Cp4.1LG01g22370Cucumber (Gy14) v1cgycpeB0255
Cp4.1LG01g22370Cucurbita maxima (Rimu)cmacpeB317
Cp4.1LG01g22370Cucurbita maxima (Rimu)cmacpeB721
Cp4.1LG01g22370Cucurbita moschata (Rifu)cmocpeB279
Cp4.1LG01g22370Wild cucumber (PI 183967)cpecpiB380
Cp4.1LG01g22370Bottle gourd (USVL1VR-Ls)cpelsiB330
Cp4.1LG01g22370Melon (DHL92) v3.6.1cpemedB408
Cp4.1LG01g22370Silver-seed gourdcarcpeB0374