CmoCh04G029450 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G029450
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPollen Ole e 1 allergen and extensin family protein
LocationCmo_Chr04 : 20849123 .. 20851796 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AACACTGCCATCTAAAAATGATTTGCCTCCTCATTCTAATCGCCCTCAATTTCAGTTTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCACGGCCGCGGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGTCAAGTCACTTCATTTCAGGTACTTTACCAACTAGCTTGTAGAATTAATATGATGTTGGTTTGTTCAGATATTGATGCGTACTATTTAGGCGCGACGGTGGCTGTCGAATGTGGCGATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCACGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACACCCAAAACTTTGCAGCCATAACTCACATTCTAGTGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAATGTTCCTCGGATTTACGATAACCTTCCGCCTCTAACTCTTCTTCCTGGACTTCCTCCATTGCCTCAGCTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACATGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTGAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGTCACCATCCTCTCACGCATGGTCCTTTTTCGCCTTCATTCTCAACTCCTACACCGCCCTCGGCCGCCGCCGACGAGTTAGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCACCCATCCCTCGTATGCCCGAGATCTCCTCACCTCCAAAGGAAACTTCTCCTTAAAAAATTGTAACGACCCTTGTTTTCAAATGATTTCAGTAACAAAAATAAAACTTTGAATTAGTAGTTTTTACCTATTCATAGTACGAATTAGTAATTGTGAACGAAATCGATGAACTTGCGAGTAGATGATGGGATTTGACATGAAAGATTGTGTTGTTTTGGATAAGATTGATCGTATGATTATTCAAATCATCAAAGGTTTATCAAATGGATTAAATTGGACGATTGATCCAATTGCTTTTGTTTGTTATGGATTTTGTTCTTTTGTGATCAATAATTTGAATATGTTATTTTATTTTCTAAATCTTGTTTAAACAAAAAAAAATTATTATAAAAGGACATAAATTTCAGACCTTAAAAAAAATGACAATTAAGTAAAATTATAATGATGCTGATATTCAAAATATATATATAAAAAATGTATCGAAGAAAAAAATGTGAAATCTTTTTCCAAGCAAGAACAAAATTATTTTATCCTTTTTAGCAGGAGACAAATTATACATTCGATTCGAGAGAGGATAATAAATGTTTTTATCGGTTCATGAAGATTCAAATCCCGAAAGTAAATTTAGGTTGAAATTGGTACCAAATCCTAAATTTGTAGCAGTTCTTGGGGGAGTAGACACATAGTGGAGATTAAATTAGAGCGTTCTATCGAACAAAACCATCGAAGCAAAGGAGTCACCGGGAGCGCGATAAACAGAGGGGAAGAGGGCAGTAGCTTTATTTTCGCGGAATTCGTAGCTTTAATTTCATCGCCTTCAATTCCTTTTTCCAATCTAAACGCCCCTTCAAACGCGCAATCTTGATTTAGCTTGAAATCCCAATTATCTGCGTGCGTGATTTTTCCCAAAAAGTGCACGTATATATTTTCCGTATATTCTCCCGATTTCCGTCTTCCAGGTTTCTCTGTCCTCCACCCACCCACCCACCCACCCACCGAAGATCGTATTTTCTTTCGGTCTTCTCTTCTCCCTCCACTTTACTGCTCTTTCAAGGTGCGCTTCTATTTTTCTCCTTTTTCGTATGAATAGTTCAAGGGCATCCCGATGATCGCCCCAAATTTGTATTGTTCTGTATTGGGATTCTCAGTTTTTTTCATGATTTGTGCGTATTCATGGTCCGATCGTCCATTTTTTGAGTTTTTTCTTGTGGGATTGTCGTATAATTTCAATAGTTTTTGATGATCATGATCAGCTTGGTATGCTATTGATGTTGTCAAGGTATCTTGTTTCCTCCTCATCGAAGGAAAGAATATGTGTTTTCAGAGGTCACCCAGAAAAAAAATCTCAATTTCTTTAGACGAGAGGGTAATTTTGACGAAGATTATGGCATTTTATTAACCCTTTGATTGATTTCTGGCTGTTTCAACAAATTTTTGTTTCATTATTGTGATGAGTGCAGCAAAGGATTGCTGCCGACAGATTTATGTTCCTTGAAATAATGCCGATTGTTGGTTTAATGAACGATCTTAGTAGAATTTTTTTGATTTGACTTGCTATCTTGGCGATCGCATAGGCATTTATGCATCATAGTTGGTTTAAGGATAATCCCTTGGGATCCTCATTTGATTATCAGTGTTACTGAACTGTGGCTGCTTATTTTACAGGTGAAATTCACAGTTTGA

mRNA sequence

AACACTGCCATCTAAAAATGATTTGCCTCCTCATTCTAATCGCCCTCAATTTCAGTTTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCACGGCCGCGGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGTCAAGTCACTTCATTTCAGGCGCGACGGTGGCTGTCGAATGTGGCGATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCACGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACACCCAAAACTTTGCAGCCATAACTCACATTCTAGTGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAATGTTCCTCGGATTTACGATAACCTTCCGCCTCTAACTCTTCTTCCTGGACTTCCTCCATTGCCTCAGCTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACATGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTGAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGTCACCATCCTCTCACGCATGGTCCTTTTTCGCCTTCATTCTCAACTCCTACACCGCCCTCGGCCGCCGCCGACGAGTTAGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCACCCATCCCTCGTTTCTCTGTCCTCCACCCACCCACCCACCCACCCACCGAAGATCGTATTTTCTTTCGGTCTTCTCTTCTCCCTCCACTTTACTGCTCTTTCAAGGTGAAATTCACAGTTTGA

Coding sequence (CDS)

ATGATTTGCCTCCTCATTCTAATCGCCCTCAATTTCAGTTTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCACGGCCGCGGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGTCAAGTCACTTCATTTCAGGCGCGACGGTGGCTGTCGAATGTGGCGATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCACGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACACCCAAAACTTTGCAGCCATAACTCACATTCTAGTGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAATGTTCCTCGGATTTACGATAACCTTCCGCCTCTAACTCTTCTTCCTGGACTTCCTCCATTGCCTCAGCTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACATGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTGAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGTCACCATCCTCTCACGCATGGTCCTTTTTCGCCTTCATTCTCAACTCCTACACCGCCCTCGGCCGCCGCCGACGAGTTAGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCACCCATCCCTCGTTTCTCTGTCCTCCACCCACCCACCCACCCACCCACCGAAGATCGTATTTTCTTTCGGTCTTCTCTTCTCCCTCCACTTTACTGCTCTTTCAAGGTGAAATTCACAGTTTGA
BLAST of CmoCh04G029450 vs. TrEMBL
Match: A0A0A0KXM2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G642130 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.6e-89
Identity = 189/291 (64.95%), Postives = 216/291 (74.23%), Query Frame = 1

Query: 1   MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60
           M  LLIL+ LNFSF DLS+ARHH  LPS A VVGTVFCDTC+Q+ FSK+SHFISGATVAV
Sbjct: 1   MSWLLILLLLNFSFFDLSEARHHRKLPS-AVVVGTVFCDTCYQEKFSKTSHFISGATVAV 60

Query: 61  ECGDGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAK 120
           ECG+ G  PSFR+EVKTDK GEFK+ LPV     V+KIEECYV L++SSEPYC VAA AK
Sbjct: 61  ECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAK 120

Query: 121 SSSLKLKSRKQGTHVFSAGFFTFKPLKHPKLCSHN-SHSSEFDDTKQV-------VDFPG 180
           SSSL+LKSRKQ TH FSAGFFTFKPLK P LC+    + + FDD K++        D P 
Sbjct: 121 SSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPN 180

Query: 181 LPAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK- 240
           LP+PIQ PTVP+ PRIYDNLPPL LLPGL PLPQLPPLPPLPPLP       FP+FPPK 
Sbjct: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240

Query: 241 KDDENV--QTPKISQNPDMFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHP 269
           KD++N   +TP  S+  D F      PIP +KPLR   HFV+PP +L HHP
Sbjct: 241 KDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP 284

BLAST of CmoCh04G029450 vs. TrEMBL
Match: A0A067FY85_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019020mg PE=4 SV=1)

HSP 1 Score: 237.7 bits (605), Expect = 2.2e-59
Identity = 160/328 (48.78%), Postives = 190/328 (57.93%), Query Frame = 1

Query: 18  SQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEVKT 77
           +++ HH     +A VVGTV+CDTCFQD FSK+SHFISGA+VAVEC D  S PSFR EVKT
Sbjct: 21  AESNHHEKRHPSAVVVGTVYCDTCFQDNFSKASHFISGASVAVECKDETSKPSFRQEVKT 80

Query: 78  DKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGTHVFS 137
           D+ GEFK+ LP S    V+KI  C V+LI SSEPYC VA+ A SSSL LKSRKQG H+FS
Sbjct: 81  DEHGEFKVDLPFSVSKHVKKINRCSVKLINSSEPYCGVASTATSSSLHLKSRKQGIHIFS 140

Query: 138 AGFFTFKPLKHPKLCS-----HNSHSSEFDDTK-QVVDFPG-LPAPIQNPTVPNVPRIYD 197
           AGFFTFKPLK P LC+      NS S   ++      D P   P PIQ+PT+P       
Sbjct: 141 AGFFTFKPLKQPNLCNQKPSLENSTSLNSEEASLPPFDSPSTFPPPIQDPTMP------- 200

Query: 198 NLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLF----PPKKDDENVQTPKISQN----PDM 257
             PP+  LP LP +PQLPPLP LP LP  P      P KK +E  +  K+S+     P +
Sbjct: 201 EFPPMPQLPRLPAMPQLPPLPSLPGLPFLPPMPGKTPEKKPEEISRETKLSEEKLGPPRL 260

Query: 258 FHPQTLLPIPSLKPLR---------PHFVMPPHKLRHHPLTHGPFSPSFSTPTP----PS 314
           F    L PIP L P+          P  V+PP+ L+  PL    F P+   P P    P 
Sbjct: 261 FDIPPLPPIPFLPPISILPPNPLQPPSPVLPPNPLQPPPL----FPPNPLLPPPSPLIPL 320

BLAST of CmoCh04G029450 vs. TrEMBL
Match: B9SIV7_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0790500 PE=4 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 2.8e-59
Identity = 156/345 (45.22%), Postives = 198/345 (57.39%), Query Frame = 1

Query: 6   ILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDG 65
           I+  L  +F  LS+A H+  LPS A VVGTV+CDTCF + FSK+SHFISGATVAVEC D 
Sbjct: 6   IIFFLCSTFNHLSEASHNKKLPS-AVVVGTVYCDTCFHEDFSKNSHFISGATVAVECKD- 65

Query: 66  GSNPSFRDEVKTDKTGEFKIQLPVSV----RKIEECYVRLIRSSEPYCAVAARAKSSSLK 125
             N SF  EVKTD+ GEF++ LP SV    ++I++C V+L+ SSEPYCAVA+ A SSSL+
Sbjct: 66  -ENSSFHQEVKTDEHGEFRVHLPFSVGKHVKRIKKCSVKLLSSSEPYCAVASTATSSSLR 125

Query: 126 LKSRKQGTHVFSAGFFTFKPLKHPKLCSHN---SHSSEFDDTK----------------- 185
           LKSRKQG H+FSAGFF+FKP K P LC+       S EF+  K                 
Sbjct: 126 LKSRKQGLHIFSAGFFSFKPQKQPNLCNQKPSIQDSKEFNSKKISSIPTIGAGSIPSVSS 185

Query: 186 -----QVVDFPGLPAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPVFP 245
                 + + P +  P+Q+PT+PN+P +  +  PL  LP LPPLPQLPPLPPLP LP FP
Sbjct: 186 PLQDPTIPNLPPVSPPLQDPTIPNLPPVNQHFFPLPFLPQLPPLPQLPPLPPLPGLPKFP 245

Query: 246 LFPPKKDDENVQTPKI--------SQNPDMFHPQTLLPIPSLKPLRPHFVMPPHKLRHHP 305
             P K   E   +P+          + PD F P   L  P+  P +P  ++PP+ L+  P
Sbjct: 246 PIPGKTTKEVKTSPESVKKTPESGEEQPDFFFPTPPLFPPN--PFQPPPILPPNPLQPPP 305

Query: 306 LTHGPFSPSFSTPTPPSAAADELAPSPPLPFSLPPIPRFSVLHPP 314
           L   P  P      PP    +   P P   F  PPIP  +   PP
Sbjct: 306 LI-PPLLPPNPFQPPPLFPPNPFQPPPSPSFPFPPIPGLTPSPPP 344

BLAST of CmoCh04G029450 vs. TrEMBL
Match: U5FLV7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s13490g PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 4.8e-59
Identity = 167/346 (48.27%), Postives = 204/346 (58.96%), Query Frame = 1

Query: 6   ILIALNFSFLDLS-QARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGD 65
           I+  L+ +F +LS +A H   LPS A VVGTVFCDTCFQ+ FS++SHFISGA+VAVEC D
Sbjct: 6   IIFLLSCTFNNLSAEASHGKKLPS-AVVVGTVFCDTCFQEAFSRNSHFISGASVAVECKD 65

Query: 66  GGSNPSFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSL 125
             S P FR+EVKTD+ GEFK+ LP S    V+KI+ C V+L+ SSEP+CAVA+ A SSSL
Sbjct: 66  EESRPGFREEVKTDEHGEFKVHLPFSVSKHVKKIKRCSVKLLSSSEPFCAVASSATSSSL 125

Query: 126 KLKSRKQGTHVFSAGFFTFKPLKHPKLCSH---NSHSSEFDDTK---QVVDFPGLPAPIQ 185
            LKSRKQGTH+FS+GFFTFKP K P LC+      +S EF   K     +D P  P P+Q
Sbjct: 126 HLKSRKQGTHIFSSGFFTFKPEKQPILCNQKPSTENSREFSSRKASLPSIDNPTFPPPLQ 185

Query: 186 NPTVPNVPRIYDN-LPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKIS 245
           +PT P +P +  N LPPL +LP LPPLPQLPPLPPLP LP+ P  P      N +  K S
Sbjct: 186 DPTTPYLPPLNQNYLPPLPVLPKLPPLPQLPPLPPLPGLPLLPPIP-----GNTKKTKTS 245

Query: 246 QNPDMFHPQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPFSPSFSTPTPPSAAAD--EL 305
           ++   F   TL               P  K  HHP         FS PTPP    +  +L
Sbjct: 246 ES---FASTTL---------------PDQKAVHHP-------NQFSYPTPPLFPPNTFQL 305

Query: 306 AP-SPPLPFSLPPIPRFSV-----LHPPTHPPTEDRIFFRSSLLPP 332
            P  PP P   PP P F       L PP  PP    +F  + + PP
Sbjct: 306 PPLFPPNPIQPPPSPLFPFPPIPGLTPPPPPP----LFPPNPIQPP 316

BLAST of CmoCh04G029450 vs. TrEMBL
Match: V4UWT1_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10012100mg PE=4 SV=1)

HSP 1 Score: 235.3 bits (599), Expect = 1.1e-58
Identity = 160/328 (48.78%), Postives = 189/328 (57.62%), Query Frame = 1

Query: 18  SQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEVKT 77
           +++ HH     +A VVGTV+CDTCFQD FSK+SHFISGA+VAVEC D  S PSFR EVKT
Sbjct: 21  AESNHHEKRHPSAIVVGTVYCDTCFQDNFSKASHFISGASVAVECKDETSKPSFRQEVKT 80

Query: 78  DKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGTHVFS 137
           D+ GEFK+ LP S    V+KI  C V+LI SSEPYC VA+ A SSSL LKSRKQG H+FS
Sbjct: 81  DEHGEFKVDLPFSVSKHVKKINRCSVKLINSSEPYCGVASTATSSSLHLKSRKQGIHIFS 140

Query: 138 AGFFTFKPLKHPKLCS-----HNSHSSEFDDTK-QVVDFPG-LPAPIQNPTVPNVPRIYD 197
           AGFFTFKPLK P LC+      NS S   ++      D P   P PIQ+PT+P       
Sbjct: 141 AGFFTFKPLKQPNLCNQKPSLENSTSLNSEEASLPPFDSPSTFPPPIQDPTMP------- 200

Query: 198 NLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLF----PPKKDDENVQTPKISQN----PDM 257
             PP+  LP LP +PQLPPLP LP LP  P      P KK +E  +  K S+     P +
Sbjct: 201 EFPPMPQLPRLPAMPQLPPLPSLPGLPFLPPMPGKTPEKKPEEISRETKPSEEKLGPPRL 260

Query: 258 FHPQTLLPIPSLKPLR---------PHFVMPPHKLRHHPLTHGPFSPSFSTPTP----PS 314
           F    L PIP L P+          P  V+PP+ L+  PL    F P+   P P    P 
Sbjct: 261 FDIPPLPPIPFLPPISILPPNPLQPPSPVLPPNPLQPPPL----FPPNPLLPPPSPLIPL 320

BLAST of CmoCh04G029450 vs. TAIR10
Match: AT5G15780.1 (AT5G15780.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 184.5 bits (467), Expect = 1.1e-46
Identity = 144/324 (44.44%), Postives = 175/324 (54.01%), Query Frame = 1

Query: 17  LSQARHH--NNLPSTAAVVGTVFCDTCFQDTFSKS-SHFISGATVAVECGDGGSNPSFRD 76
           LSQ + H      S+A VVGTV+CDTCF   FSKS +H ISGA VAVEC D  S PSFR 
Sbjct: 25  LSQGQQHVMKKTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGALVAVECIDENSKPSFRQ 84

Query: 77  EVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLK-LKSRKQG 136
           EVKTDK GEFK++LP S    V+KI+ C V+L+ SS+PYC++A+ A SSSLK LKS   G
Sbjct: 85  EVKTDKRGEFKVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSIASSATSSSLKRLKSNHHG 144

Query: 137 --THVFSAGFFTFKPLKHPKLCSHNSHSSEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYD 196
             T VFSAGFFTF+P   P++CS          +K ++  P  P P+Q+P  P+      
Sbjct: 145 ENTRVFSAGFFTFRPENQPEICSQK--PINLRGSKPLLPDPSFPPPLQDPPNPS------ 204

Query: 197 NLPPLTLLPGLPPLPQLP----PLPPLPPLPVFPLFPP----------KKDDENVQTPKI 256
              PL  LP +PPLP LP    P+P LP   V PL PP          KK D        
Sbjct: 205 ---PLPNLPIVPPLPNLPVPKLPVPDLPLPLVPPLLPPGPQKSASLHNKKSDSLKDKKTE 264

Query: 257 SQNPDMFHPQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPFSPSFSTPTPPSAAADELA 316
           +  P+ F P          PL P  ++PP+          P  PS  TPT P    + L 
Sbjct: 265 ALKPNFFFPP--------NPLNPPSIIPPN----------PLIPSIPTPTLP---PNPLI 311

BLAST of CmoCh04G029450 vs. TAIR10
Match: AT5G13140.1 (AT5G13140.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 60.5 bits (145), Expect = 2.4e-09
Identity = 64/246 (26.02%), Postives = 105/246 (42.68%), Query Frame = 1

Query: 28  STAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEVK------TDKTG 87
           S   VVG V+CDTC  +TFS+ S+F+ G  V V C    S+P   +EV       T+++G
Sbjct: 37  SRITVVGVVYCDTCSINTFSRQSYFLQGVEVHVTCRFKASSPKTAEEVNISVNRTTNRSG 96

Query: 88  EFKIQLP--------VSVRKIEECYVRLIRSSEP---YCAVAA-RAKSSSLKLKSRKQGT 147
            +K+++P          +    +C  +++++S      C++   +  ++ + +KS++   
Sbjct: 97  VYKLEIPHVDGIDCVDGIAISSQCSAKILKTSSDDNGGCSIPVFQTATNEVSIKSKQDRV 156

Query: 148 HVFSAGFFTFK-PLKHPKLCSHNSHSSEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYDNL 207
            ++S    ++K P K+  LC +        D K    F                R     
Sbjct: 157 CIYSLSALSYKPPHKNTSLCGNGGKKHHRKDEKVEKKF----------------RDSKFF 216

Query: 208 PPLTLLPGLP-PLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFHPQTLLP- 253
            P       P P P LPPLP LPP P FP       + N+  P      D  +P T +P 
Sbjct: 217 WPYLAPYWFPWPYPDLPPLPTLPPFPSFPFPSLPFGNPNLALPAF----DWKNPVTWIPY 262

BLAST of CmoCh04G029450 vs. TAIR10
Match: AT5G47635.1 (AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 48.5 bits (114), Expect = 9.4e-06
Identity = 34/113 (30.09%), Postives = 53/113 (46.90%), Query Frame = 1

Query: 28  STAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEVKTDKTGEFKIQL 87
           S+  + G++ CDT      S     I GATVA++C  G    S   +  TD+ GEF+I L
Sbjct: 44  SSVVITGSLLCDTSRPHLHSIP---IPGATVAIKCHTGSKRRSKWIKAVTDELGEFEIDL 103

Query: 88  PVSVRKI----EECYVRLIRSSEPY-CAVAARAKSSSLKLKSRKQGTHVFSAG 136
           P  +  I      C+++ +    PY C   +      +KL S   G  V+++G
Sbjct: 104 PSQLHAIPHLENTCFIKPVYVPRPYRCYNTSTNIHKPIKLVSSTNGFRVYTSG 153

BLAST of CmoCh04G029450 vs. NCBI nr
Match: gi|778708779|ref|XP_011656281.1| (PREDICTED: proline-rich protein 4-like [Cucumis sativus])

HSP 1 Score: 337.0 bits (863), Expect = 3.8e-89
Identity = 189/291 (64.95%), Postives = 216/291 (74.23%), Query Frame = 1

Query: 1   MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60
           M  LLIL+ LNFSF DLS+ARHH  LPS A VVGTVFCDTC+Q+ FSK+SHFISGATVAV
Sbjct: 1   MSWLLILLLLNFSFFDLSEARHHRKLPS-AVVVGTVFCDTCYQEKFSKTSHFISGATVAV 60

Query: 61  ECGDGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAK 120
           ECG+ G  PSFR+EVKTDK GEFK+ LPV     V+KIEECYV L++SSEPYC VAA AK
Sbjct: 61  ECGNKGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAK 120

Query: 121 SSSLKLKSRKQGTHVFSAGFFTFKPLKHPKLCSHN-SHSSEFDDTKQV-------VDFPG 180
           SSSL+LKSRKQ TH FSAGFFTFKPLK P LC+    + + FDD K++        D P 
Sbjct: 121 SSSLQLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPN 180

Query: 181 LPAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK- 240
           LP+PIQ PTVP+ PRIYDNLPPL LLPGL PLPQLPPLPPLPPLP       FP+FPPK 
Sbjct: 181 LPSPIQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKE 240

Query: 241 KDDENV--QTPKISQNPDMFHPQTLLPIPSLKPLRP--HFVMPPHKLRHHP 269
           KD++N   +TP  S+  D F      PIP +KPLR   HFV+PP +L HHP
Sbjct: 241 KDEKNAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP 284

BLAST of CmoCh04G029450 vs. NCBI nr
Match: gi|1009127424|ref|XP_015880688.1| (PREDICTED: proline-rich protein 4 [Ziziphus jujuba])

HSP 1 Score: 248.8 bits (634), Expect = 1.3e-62
Identity = 175/340 (51.47%), Postives = 202/340 (59.41%), Query Frame = 1

Query: 1   MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60
           M  LL +     +F  LS+AR   N PSTA VVGTV+CDTCFQ  FSK SHFISGA+VAV
Sbjct: 1   MFYLLKIFFFILTFTYLSEARPQKN-PSTAVVVGTVYCDTCFQQDFSKDSHFISGASVAV 60

Query: 61  ECGDGGSNP-SFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARA 120
           EC DG SN  SFR EVKTD  GEFK+QLP S    V+KIE C V+LI SSEPYCAVA+ A
Sbjct: 61  ECKDGTSNETSFRKEVKTDNHGEFKVQLPFSIGKHVKKIEGCSVKLISSSEPYCAVASTA 120

Query: 121 KSSSLKLKSRKQGTHVFSAGFFTFKPLKHPKLCSHN---SHSSEFDDTK---QVVDFPGL 180
             SSL LKSRKQG H+FSAGFFTFKPLK P LC+      +S   +  K     VD    
Sbjct: 121 TKSSLHLKSRKQGIHIFSAGFFTFKPLKQPNLCNQKPSIENSKGLNSNKASLPPVDDLSF 180

Query: 181 PAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDEN-VQ 240
           P PIQ+PT+P +P  +  LPPL  LP LPPLP LPPLPPLP LP FP    K + E+ V 
Sbjct: 181 PPPIQDPTIPGLPP-FQYLPPLPTLPQLPPLPTLPPLPPLPGLPKFPPAQGKTNTESKVP 240

Query: 241 TPKISQNPDM-----FHPQTLLPIPS-LKPLRPHFVMP----PHKLRHHPLTHGPFSPSF 300
           T K SQ   +      +P+   P+P  L PL P+   P    P+  +  PL   PF P  
Sbjct: 241 TEKSSQKSQLSDEKVVNPEFFFPVPPILPPLIPNPFQPPPLIPNPFQPPPLIPNPFQPPP 300

Query: 301 S--TPTPPSAAADELAPSPPLPFSLPPIPRFSVLHPPTHP 317
           +   P PP        P P LPF  PPI  F    PPT P
Sbjct: 301 APLLPFPPIPGLTPSPPPPSLPFPFPPIIPF----PPTIP 334

BLAST of CmoCh04G029450 vs. NCBI nr
Match: gi|747076980|ref|XP_011085586.1| (PREDICTED: proline-rich extensin-like protein EPR1 [Sesamum indicum])

HSP 1 Score: 241.9 bits (616), Expect = 1.6e-60
Identity = 157/331 (47.43%), Postives = 199/331 (60.12%), Query Frame = 1

Query: 7   LIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGG 66
           +I L  +F   S+ +HH   PS A VVGTV+CDTCFQ  F K+SHFISGA+VAVEC    
Sbjct: 7   IIFLCVAFTISSEGKHHKKHPS-AVVVGTVYCDTCFQQDFPKASHFISGASVAVECKTTS 66

Query: 67  SNPSFRDEVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKL 126
           S PSF+  VKTDK GEF++ LP S    V+KI++C V+LI S+EP+CAVAA A SSSL L
Sbjct: 67  SKPSFQQVVKTDKNGEFRVHLPFSVSKHVKKIKKCAVQLISSNEPFCAVAATAASSSLSL 126

Query: 127 KSRKQGTHVFSAGFFTFKPLKHPKLCSHNSHSSEFDDTKQV-------VDFPGLPAPIQN 186
           K+RK GTHVFSAGFFTFKPLK P++C+     S F + +          + P  P P+Q+
Sbjct: 127 KTRKHGTHVFSAGFFTFKPLKQPEICNQKPSISSFKNLESADKPSVLNPNDPLFPPPLQD 186

Query: 187 PTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQN 246
           P+  + P     LPPL  LP LPPLP+LP LPPLP +P  P   PKK   N +  ++S N
Sbjct: 187 PSPSDPPTGRRYLPPLPQLPNLPPLPELPRLPPLPVIPFLPPAEPKKTATNFKASELS-N 246

Query: 247 PDMFHPQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPFSPSFS-------TPTPPSAAA 306
            ++  P++L   P+  PL P  + PP+ L   P    P  PS          P+PP +  
Sbjct: 247 HEINQPKSLFFPPN--PLNPPSLFPPNPLLPPPSLIPPVLPSPPPSIFPPLVPSPPPSVL 306

Query: 307 DELAPSPPLP--FSLPPIPRFSVLHPPTHPP 318
             L PSPP P  F LPPIP  +   PP  PP
Sbjct: 307 PPLFPSPPSPPFFHLPPIPGLTPSPPPPPPP 333

BLAST of CmoCh04G029450 vs. NCBI nr
Match: gi|641853485|gb|KDO72303.1| (hypothetical protein CISIN_1g019020mg [Citrus sinensis])

HSP 1 Score: 237.7 bits (605), Expect = 3.1e-59
Identity = 160/328 (48.78%), Postives = 190/328 (57.93%), Query Frame = 1

Query: 18  SQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEVKT 77
           +++ HH     +A VVGTV+CDTCFQD FSK+SHFISGA+VAVEC D  S PSFR EVKT
Sbjct: 21  AESNHHEKRHPSAVVVGTVYCDTCFQDNFSKASHFISGASVAVECKDETSKPSFRQEVKT 80

Query: 78  DKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGTHVFS 137
           D+ GEFK+ LP S    V+KI  C V+LI SSEPYC VA+ A SSSL LKSRKQG H+FS
Sbjct: 81  DEHGEFKVDLPFSVSKHVKKINRCSVKLINSSEPYCGVASTATSSSLHLKSRKQGIHIFS 140

Query: 138 AGFFTFKPLKHPKLCS-----HNSHSSEFDDTK-QVVDFPG-LPAPIQNPTVPNVPRIYD 197
           AGFFTFKPLK P LC+      NS S   ++      D P   P PIQ+PT+P       
Sbjct: 141 AGFFTFKPLKQPNLCNQKPSLENSTSLNSEEASLPPFDSPSTFPPPIQDPTMP------- 200

Query: 198 NLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLF----PPKKDDENVQTPKISQN----PDM 257
             PP+  LP LP +PQLPPLP LP LP  P      P KK +E  +  K+S+     P +
Sbjct: 201 EFPPMPQLPRLPAMPQLPPLPSLPGLPFLPPMPGKTPEKKPEEISRETKLSEEKLGPPRL 260

Query: 258 FHPQTLLPIPSLKPLR---------PHFVMPPHKLRHHPLTHGPFSPSFSTPTP----PS 314
           F    L PIP L P+          P  V+PP+ L+  PL    F P+   P P    P 
Sbjct: 261 FDIPPLPPIPFLPPISILPPNPLQPPSPVLPPNPLQPPPL----FPPNPLLPPPSPLIPL 320

BLAST of CmoCh04G029450 vs. NCBI nr
Match: gi|255569926|ref|XP_002525926.1| (PREDICTED: proline-rich protein 4 [Ricinus communis])

HSP 1 Score: 237.3 bits (604), Expect = 4.0e-59
Identity = 156/345 (45.22%), Postives = 198/345 (57.39%), Query Frame = 1

Query: 6   ILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDG 65
           I+  L  +F  LS+A H+  LPS A VVGTV+CDTCF + FSK+SHFISGATVAVEC D 
Sbjct: 6   IIFFLCSTFNHLSEASHNKKLPS-AVVVGTVYCDTCFHEDFSKNSHFISGATVAVECKD- 65

Query: 66  GSNPSFRDEVKTDKTGEFKIQLPVSV----RKIEECYVRLIRSSEPYCAVAARAKSSSLK 125
             N SF  EVKTD+ GEF++ LP SV    ++I++C V+L+ SSEPYCAVA+ A SSSL+
Sbjct: 66  -ENSSFHQEVKTDEHGEFRVHLPFSVGKHVKRIKKCSVKLLSSSEPYCAVASTATSSSLR 125

Query: 126 LKSRKQGTHVFSAGFFTFKPLKHPKLCSHN---SHSSEFDDTK----------------- 185
           LKSRKQG H+FSAGFF+FKP K P LC+       S EF+  K                 
Sbjct: 126 LKSRKQGLHIFSAGFFSFKPQKQPNLCNQKPSIQDSKEFNSKKISSIPTIGAGSIPSVSS 185

Query: 186 -----QVVDFPGLPAPIQNPTVPNVPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPVFP 245
                 + + P +  P+Q+PT+PN+P +  +  PL  LP LPPLPQLPPLPPLP LP FP
Sbjct: 186 PLQDPTIPNLPPVSPPLQDPTIPNLPPVNQHFFPLPFLPQLPPLPQLPPLPPLPGLPKFP 245

Query: 246 LFPPKKDDENVQTPKI--------SQNPDMFHPQTLLPIPSLKPLRPHFVMPPHKLRHHP 305
             P K   E   +P+          + PD F P   L  P+  P +P  ++PP+ L+  P
Sbjct: 246 PIPGKTTKEVKTSPESVKKTPESGEEQPDFFFPTPPLFPPN--PFQPPPILPPNPLQPPP 305

Query: 306 LTHGPFSPSFSTPTPPSAAADELAPSPPLPFSLPPIPRFSVLHPP 314
           L   P  P      PP    +   P P   F  PPIP  +   PP
Sbjct: 306 LI-PPLLPPNPFQPPPLFPPNPFQPPPSPSFPFPPIPGLTPSPPP 344

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KXM2_CUCSA2.6e-8964.95Uncharacterized protein OS=Cucumis sativus GN=Csa_5G642130 PE=4 SV=1[more]
A0A067FY85_CITSI2.2e-5948.78Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019020mg PE=4 SV=1[more]
B9SIV7_RICCO2.8e-5945.22Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0790500 PE=4 SV=1[more]
U5FLV7_POPTR4.8e-5948.27Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s13490g PE=4 SV=1[more]
V4UWT1_9ROSI1.1e-5848.78Uncharacterized protein OS=Citrus clementina GN=CICLE_v10012100mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G15780.11.1e-4644.44 Pollen Ole e 1 allergen and extensin family protein[more]
AT5G13140.12.4e-0926.02 Pollen Ole e 1 allergen and extensin family protein[more]
AT5G47635.19.4e-0630.09 Pollen Ole e 1 allergen and extensin family protein[more]
Match NameE-valueIdentityDescription
gi|778708779|ref|XP_011656281.1|3.8e-8964.95PREDICTED: proline-rich protein 4-like [Cucumis sativus][more]
gi|1009127424|ref|XP_015880688.1|1.3e-6251.47PREDICTED: proline-rich protein 4 [Ziziphus jujuba][more]
gi|747076980|ref|XP_011085586.1|1.6e-6047.43PREDICTED: proline-rich extensin-like protein EPR1 [Sesamum indicum][more]
gi|641853485|gb|KDO72303.1|3.1e-5948.78hypothetical protein CISIN_1g019020mg [Citrus sinensis][more]
gi|255569926|ref|XP_002525926.1|4.0e-5945.22PREDICTED: proline-rich protein 4 [Ricinus communis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G029450.1CmoCh04G029450.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31614FAMILY NOT NAMEDcoord: 1..151
score: 1.2E-74coord: 186..221
score: 1.2
NoneNo IPR availablePANTHERPTHR31614:SF9SUBFAMILY NOT NAMEDcoord: 1..151
score: 1.2E-74coord: 186..221
score: 1.2
NoneNo IPR availablePFAMPF01190Pollen_Ole_e_Icoord: 32..116
score: 1.4