CmoCh14G011770 (gene) Cucurbita moschata (Rifu)

NameCmoCh14G011770
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPollen Ole e I family allergen
LocationCmo_Chr14 : 9696813 .. 9698383 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTACTTTCATACACATGACTCATGCTCATGCATGTATATAACCCGTTACGATTGCTGATTCATCGCCAACACGAATGTTCTAAATGAGATACAAACTCTGAACAAAGTGTTTGTGCAGTCATTCTTTTTTGCCTATTTCCTCTACAGTACAAGAAGTGAAGTGAAAGGTTCTAAAAGTGGAAAGAACATTCATCTATGGGCCCAATTTCTGTTAAATGACATAGAAGACAATTAGATGAAAAGAGTTAAAATTGGTATGTGTGGTGACTCTACTTGTCTCGTGAACCAACATGGTCCCATTCCCAATTATGTGAATCATTCCCAATTCTTCTTCTTATTCACTTGTCAGAGTTTCCAGAACACTCCCATATGGGAATTTAGATAGGGATAGAGAGGGCCTTGGTTTGTGAAAGATTGGTATATAAAGGAGAGGAGGGTTAAGCTCTGTATTCACAGTTCCTTGTATGCAATATGAAGATGGGTTGGACTTTGCTTCTCTTTGCTCTGTTTCTGAGCCTCACTTCCCATAATGCCTTCTCTCACTCTCTTCACAGCAACAAGCCTGCCGTTGTTGTCGGCACTGTGTTCTGCGACACTTGTTCCCAACAACGCCTCTCCAAGTCTACCTATTTCATCTCAGGTCCTTTTTATTTCTCTTTTACAATCTTTACTTAGAGACGCATCTAATTTGAGGTTACGTTCTTTGATTGCAGGAGCCATTGTTGCAGTTGAATGCAGAGACCATAAAACCTCTGAAACCAATTTCAAAGAGGAAGTGAAGACCAACAAAAATGGAAATTTCAAAGTAGTGTTGCCATTCTCTGAGCAAGAACACACCAACAAAATCGAAACTTGTTCTGTAAAAATGATCAAAAGCAGTGACCCCTTCTGCTCTGTACCCTCCTCTGCTACTTCCTCTTCTCTGAAACTGAAGAACTCTAAGATCAGAGATGGCACCAGGGTTTTCTCAGCTGGGTATTTCGCTTTCAAGCCATTGAAACAGCCCACTTCGTGCAATCGGAAGAACACCGACATTTTAAAAGCCAAACAAGTTAACCCTCAGCTTCCTTTTCCGCCGATTGTTCCGCCGATTGTTCCGCCGGTTATTCAGCCGCCATCGCTTTTGCCGCCCAACCCCCTGCAGCCAGCGCCGCTGATCCCGAACCCGTTAGAACCGCCGACTCCGGTGATTCCGAACCCATTCCAGCCGCCAGCTCCGTTGATTCCGATCATTCCTATGCCGCCGTTGCCAATTTTAACACCGCCGTCTCCTCCGCCGACGGTGTTGCCGCCGTTTATACCGCCGTTTCTGCCGCCGATCCCTGGTATTCCGTCTGGTCCGCCAAAAGTGAAAAAGATTCCATGAAGGATGATTTATGATTTGAATTATGGTTAATGGGTACAATTTGTAGTGATGTGAGGAGGTGTGATCGTGGAGTAAGTAGAATATCTGAAATAAGTGGAACACAATATTGGGTTTTTTCTAATATTATTTAAACTTCAAATTAGAATATGAAGTATATCCATTCTCATATTCTTCCAACCCAACACAAAATATTCACTTTTCCGA

mRNA sequence

GTTACTTTCATACACATGACTCATGCTCATGCATGTATATAACCCGTTACGATTGCTGATTCATCGCCAACACGAATGTTCTAAATGAGATACAAACTCTGAACAAAGTGTTTGTGCAGTCATTCTTTTTTGCCTATTTCCTCTACAGTACAAGAAGTGAAGTGAAAGGTTCTAAAAGTGGAAAGAACATTCATCTATGGGCCCAATTTCTGTTAAATGACATAGAAGACAATTAGATGAAAAGAGTTAAAATTGGTATGTGTGGTGACTCTACTTGTCTCGTGAACCAACATGGTCCCATTCCCAATTATGTGAATCATTCCCAATTCTTCTTCTTATTCACTTGTCAGAGTTTCCAGAACACTCCCATATGGGAATTTAGATAGGGATAGAGAGGGCCTTGGTTTGTGAAAGATTGGTATATAAAGGAGAGGAGGGTTAAGCTCTGTATTCACAGTTCCTTGTATGCAATATGAAGATGGGTTGGACTTTGCTTCTCTTTGCTCTGTTTCTGAGCCTCACTTCCCATAATGCCTTCTCTCACTCTCTTCACAGCAACAAGCCTGCCGTTGTTGTCGGCACTGTGTTCTGCGACACTTGTTCCCAACAACGCCTCTCCAAGTCTACCTATTTCATCTCAGGAGCCATTGTTGCAGTTGAATGCAGAGACCATAAAACCTCTGAAACCAATTTCAAAGAGGAAGTGAAGACCAACAAAAATGGAAATTTCAAAGTAGTGTTGCCATTCTCTGAGCAAGAACACACCAACAAAATCGAAACTTGTTCTGTAAAAATGATCAAAAGCAGTGACCCCTTCTGCTCTGTACCCTCCTCTGCTACTTCCTCTTCTCTGAAACTGAAGAACTCTAAGATCAGAGATGGCACCAGGGTTTTCTCAGCTGGGTATTTCGCTTTCAAGCCATTGAAACAGCCCACTTCGTGCAATCGGAAGAACACCGACATTTTAAAAGCCAAACAAGTTAACCCTCAGCTTCCTTTTCCGCCGATTGTTCCGCCGATTGTTCCGCCGGTTATTCAGCCGCCATCGCTTTTGCCGCCCAACCCCCTGCAGCCAGCGCCGCTGATCCCGAACCCGTTAGAACCGCCGACTCCGGTGATTCCGAACCCATTCCAGCCGCCAGCTCCGTTGATTCCGATCATTCCTATGCCGCCGTTGCCAATTTTAACACCGCCGTCTCCTCCGCCGACGGTGTTGCCGCCGTTTATACCGCCGTTTCTGCCGCCGATCCCTGGTATTCCGTCTGGTCCGCCAAAAGTGAAAAAGATTCCATGAAGGATGATTTATGATTTGAATTATGGTTAATGGGTACAATTTGTAGTGATGTGAGGAGGTGTGATCGTGGAGTAAGTAGAATATCTGAAATAAGTGGAACACAATATTGGGTTTTTTCTAATATTATTTAAACTTCAAATTAGAATATGAAGTATATCCATTCTCATATTCTTCCAACCCAACACAAAATATTCACTTTTCCGA

Coding sequence (CDS)

ATGAAGATGGGTTGGACTTTGCTTCTCTTTGCTCTGTTTCTGAGCCTCACTTCCCATAATGCCTTCTCTCACTCTCTTCACAGCAACAAGCCTGCCGTTGTTGTCGGCACTGTGTTCTGCGACACTTGTTCCCAACAACGCCTCTCCAAGTCTACCTATTTCATCTCAGGAGCCATTGTTGCAGTTGAATGCAGAGACCATAAAACCTCTGAAACCAATTTCAAAGAGGAAGTGAAGACCAACAAAAATGGAAATTTCAAAGTAGTGTTGCCATTCTCTGAGCAAGAACACACCAACAAAATCGAAACTTGTTCTGTAAAAATGATCAAAAGCAGTGACCCCTTCTGCTCTGTACCCTCCTCTGCTACTTCCTCTTCTCTGAAACTGAAGAACTCTAAGATCAGAGATGGCACCAGGGTTTTCTCAGCTGGGTATTTCGCTTTCAAGCCATTGAAACAGCCCACTTCGTGCAATCGGAAGAACACCGACATTTTAAAAGCCAAACAAGTTAACCCTCAGCTTCCTTTTCCGCCGATTGTTCCGCCGATTGTTCCGCCGGTTATTCAGCCGCCATCGCTTTTGCCGCCCAACCCCCTGCAGCCAGCGCCGCTGATCCCGAACCCGTTAGAACCGCCGACTCCGGTGATTCCGAACCCATTCCAGCCGCCAGCTCCGTTGATTCCGATCATTCCTATGCCGCCGTTGCCAATTTTAACACCGCCGTCTCCTCCGCCGACGGTGTTGCCGCCGTTTATACCGCCGTTTCTGCCGCCGATCCCTGGTATTCCGTCTGGTCCGCCAAAAGTGAAAAAGATTCCATGA
BLAST of CmoCh14G011770 vs. TrEMBL
Match: A0A0A0KW32_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G126430 PE=4 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 2.4e-85
Identity = 187/293 (63.82%), Postives = 219/293 (74.74%), Query Frame = 1

Query: 1   MKMGWTLLLFALFLSLTSHNAFSHSL--HSNK--PAVVVGTVFCDTCSQQRLSKSTYFIS 60
           MK GW LLLF +FL+LT +N FS  L  H+NK  PAVVVGTVFCDTC QQ LS S +FIS
Sbjct: 1   MKKGWILLLFFMFLNLTFYNGFSKPLLLHNNKLQPAVVVGTVFCDTCFQQHLSNSHHFIS 60

Query: 61  GAIVAVECRDHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFC 120
           GA V VECRD KT E +FK++VKTNKNG FKVVLPFS  +H  KIE+CSVK+IKSS+PFC
Sbjct: 61  GARVEVECRDEKTPEASFKQQVKTNKNGKFKVVLPFSIAKHVKKIESCSVKLIKSSEPFC 120

Query: 121 SVPSSATSSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPF 180
           SV SSA+SSSL+LKNSK ++G R+FSAG+F FKPL+QPT CN+K+         NPQLPF
Sbjct: 121 SVASSASSSSLQLKNSKNKNGVRIFSAGFFTFKPLQQPTLCNQKS---------NPQLPF 180

Query: 181 PPIVPPIVPPVIQPPSLLPPNPLQPAPLIPNPLEPPTPVIPNPFQPPAPLIPI------- 240
           PP    +VPPVIQPPS  PPNPLQP PL+PNP +PP P+IPNPFQPPAP+IP        
Sbjct: 181 PP----LVPPVIQPPSFFPPNPLQPTPLVPNPFQPPAPLIPNPFQPPAPVIPNPFQPPTP 240

Query: 241 -------------IPMPPLPILTPPSPPPTVLPP-FIPPFLPPIPGIPSGPPK 269
                        +P+PPLP +TP  PPPT+LPP F+PPFLPPIPGIP GPP+
Sbjct: 241 VIPNPFQPAPATGLPLPPLPFITPSPPPPTLLPPSFLPPFLPPIPGIPPGPPR 280

BLAST of CmoCh14G011770 vs. TrEMBL
Match: A0A0B0MR63_GOSAR (Major pollen allergen Lig v 1 OS=Gossypium arboreum GN=F383_24177 PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 2.0e-52
Identity = 146/275 (53.09%), Postives = 185/275 (67.27%), Query Frame = 1

Query: 4   GWTLLLFALFLSLTSHNAFSHSLHSNKPAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVE 63
           G+   L   F+    ++  SH     + AVVVGTV+CDTC Q  +S+ T+FISGA VAVE
Sbjct: 3   GFFFTLLLCFIIFNCYSEGSHGYQQQQSAVVVGTVYCDTCFQSDISRPTHFISGATVAVE 62

Query: 64  CRDHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVPSSAT 123
           C+D K+S  +F+++VKTN++G FKV LPFS  +H  KIE C VK+IKSS+P+C+V SSAT
Sbjct: 63  CKDGKSSRASFRQQVKTNRHGEFKVHLPFSVSKHVKKIEGCEVKLIKSSEPYCAVASSAT 122

Query: 124 SSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPFPPIVPP- 183
           SSSL LK+ K   GT VFSAG+F FKP KQPT C++K +         P LP  P++PP 
Sbjct: 123 SSSLHLKSMK--QGTHVFSAGFFTFKPFKQPTLCSQKPSVHPNQLLPPPILPPNPLLPPP 182

Query: 184 IVPP-VIQPPSLLPPNPLQ--PAPLIPNPLE-PPTPVIPNPFQ-PPAPLIPIIPMPPLPI 243
           I+PP  + PP  LPPNP Q  PAPL+PNPL+ PP P++P+P Q PPAP  P   +PP+P 
Sbjct: 183 ILPPNPLLPPPALPPNPFQPPPAPLVPNPLQPPPAPLVPDPLQPPPAPAAP--SLPPVPG 242

Query: 244 LTP----PSPPPTV---LPPFIPPFL--PPIPGIP 264
           LTP    PSPPP     LPPF  PFL  PP PG+P
Sbjct: 243 LTPPPSSPSPPPDFPFPLPPF--PFLPVPPFPGVP 271

BLAST of CmoCh14G011770 vs. TrEMBL
Match: A0A0D2PUQ4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G206700 PE=4 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 5.0e-51
Identity = 142/266 (53.38%), Postives = 180/266 (67.67%), Query Frame = 1

Query: 13  FLSLTSHNAFSHSLHSNKPAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVECRDHKTSET 72
           F+    ++  SH     + AVVVGTV+CDTC Q   S+ T+FISGA VAVEC+D K+S  
Sbjct: 12  FIIFNGYSEGSHGYQQQQSAVVVGTVYCDTCFQSDFSRPTHFISGATVAVECKDGKSSRA 71

Query: 73  NFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVPSSATSSSLKLKNS 132
           +F+++VKTN++G FKV LPFS  +H  KIE C VK+IKSS+P+C+V SSATSSSL LK+ 
Sbjct: 72  SFRQQVKTNRHGEFKVHLPFSVSKHVKKIEGCEVKLIKSSEPYCAVASSATSSSLHLKSR 131

Query: 133 KIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPFPPIVPP-IVPP-VIQP 192
           K   GT VFSAG+F FKP KQPT C++K +         P LP  P++PP I+PP  + P
Sbjct: 132 K--QGTHVFSAGFFTFKPFKQPTLCSQKPSVHPNQLLPPPILPPNPLLPPPILPPNPLLP 191

Query: 193 PSLLPPNPLQ--PAPLIPNPLE-PPTPVIPNPFQ-PPAPLIPIIPMPPLPILTP----PS 252
           P +LPPNP Q  PAPL+PNPL+ PP P++P+P Q PP P  P   +PP+P L P    PS
Sbjct: 192 PPVLPPNPFQPPPAPLVPNPLQPPPAPLVPDPLQPPPEPAAP--SLPPVPGLAPPPSSPS 251

Query: 253 PPPTV---LPPFIPPFL--PPIPGIP 264
           PPP     LPPF  PFL  PP PG+P
Sbjct: 252 PPPDFPFPLPPF--PFLPVPPFPGVP 271

BLAST of CmoCh14G011770 vs. TrEMBL
Match: A0A061EFY3_THECC (Pollen Ole e 1 allergen and extensin family protein OS=Theobroma cacao GN=TCM_018997 PE=4 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 3.3e-50
Identity = 154/316 (48.73%), Postives = 189/316 (59.81%), Query Frame = 1

Query: 7   LLLFALFLSLTSHNAFSHSLHSNK-PAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVECR 66
           LLL   F      N FS S H  K  AVVVGTV+CDTC Q+  S+++YFISGA VAVEC+
Sbjct: 7   LLLLGFFF-----NNFSESSHDQKLSAVVVGTVYCDTCFQEEFSRTSYFISGASVAVECK 66

Query: 67  DHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVPSSATSS 126
           D  TS  +F++EVKTN++G FK+ LPFS  +H  KI+ CSVK+I+S +P+C+V S+ATSS
Sbjct: 67  DG-TSRPSFRQEVKTNEHGEFKIHLPFSVSKHVKKIKGCSVKLIRSREPYCAVASTATSS 126

Query: 127 SLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRK----NTDILKAKQVNPQ-------- 186
           SL LK+     GT +FSAG+F FKPLKQPT C++K    N   LK K+   Q        
Sbjct: 127 SLHLKSRM--HGTHIFSAGFFTFKPLKQPTLCSQKPSTQNPKQLKNKEPPAQNVLHPESF 186

Query: 187 LPFPPIVPPI----VPPVIQP-----PSLLPPNPLQ-------------PAPLIPNPLE- 246
           L  P + PP      PP++ P     P LLPPNP+Q             PAPLIPNP + 
Sbjct: 187 LSTPSVFPPDDAVPAPPILTPNPFQLPPLLPPNPIQPPPLLPPNPFQPPPAPLIPNPFQP 246

Query: 247 PPTPVIPNPFQPP--------------APLIPIIPMPPLPILTPPSPPPTVLPPFIP--- 265
           PP P+IPNPFQPP              AP  P+ P PP+P LTPPSPPP   PPF P   
Sbjct: 247 PPAPLIPNPFQPPPAPLFPPNPFRPPRAPPSPLFPFPPIPGLTPPSPPPPP-PPFFPFPF 306

BLAST of CmoCh14G011770 vs. TrEMBL
Match: A0A151UAZ7_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_020696 PE=4 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 6.1e-49
Identity = 147/288 (51.04%), Postives = 187/288 (64.93%), Query Frame = 1

Query: 3   MGWTLLLFALFLSLTSHNAFSHSLHSNKP--AVVVGTVFCDTCSQQRLSKSTYFISGAIV 62
           M W L+L  LFLSLT     S S H   P  AVVVGTV+CDTC QQ  S  +++ISGA V
Sbjct: 1   MSWFLVL--LFLSLTFGAIQSESSHDKMPPSAVVVGTVYCDTCFQQDFSMGSHYISGASV 60

Query: 63  AVECRD-HKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVP 122
           AVEC+D + +S+  F++EVKT+++G FKV LPFS  +H  +I+ C+VK+I SS+P+C+V 
Sbjct: 61  AVECKDGYGSSKPRFRKEVKTDEHGEFKVQLPFSVSKHVKRIKGCTVKLISSSEPYCAVA 120

Query: 123 SSATSSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNT--------DILKAKQVN 182
           S+ATSSSL+LK+ K   G  +FSAG+F+FKPLKQP  CN+K +        D+ K K  +
Sbjct: 121 SAATSSSLRLKSRK--QGLHIFSAGFFSFKPLKQPNLCNQKPSTENIKGLDDVKKQKIAD 180

Query: 183 P-QLPFPP---IVPPIVPPVIQPPSLLPPNPLQPAPLIPNPLEP--PTPVIPNPFQPPA- 242
           P  L FPP     PPIVP   QPP L+ PNPLQP P+IPNPL+P  P+P++PNPFQPP+ 
Sbjct: 181 PNNLIFPPNPLFPPPIVPNPFQPPPLV-PNPLQPPPVIPNPLQPPGPSPLVPNPFQPPSS 240

Query: 243 -PLIPIIPMPPLPILTPPSPPPTVLPPFIPPFLPPIPGIPSGPPKVKK 272
               P+ P P  P  TP S PP    PF PP  PP P  P  P    K
Sbjct: 241 GTSPPLFPFPTEPGSTPSSSPPAFPFPF-PPLFPP-PSSPDTPSTSSK 281

BLAST of CmoCh14G011770 vs. TAIR10
Match: AT5G15780.1 (AT5G15780.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 147.1 bits (370), Expect = 1.6e-35
Identity = 124/316 (39.24%), Postives = 168/316 (53.16%), Query Frame = 1

Query: 5   WTLLLFALFLSLTSHNAFSHSLH-----SNKPAVVVGTVFCDTCSQQRLSKS-TYFISGA 64
           W      +FL ++ +   S         +   AVVVGTV+CDTC     SKS  + ISGA
Sbjct: 8   WFWFSLMIFLGISINGGLSQGQQHVMKKTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGA 67

Query: 65  IVAVECRDHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSV 124
           +VAVEC D + S+ +F++EVKT+K G FKV LPFS  +H  KI+ CSVK++ SS P+CS+
Sbjct: 68  LVAVECID-ENSKPSFRQEVKTDKRGEFKVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSI 127

Query: 125 PSSATSSSLK-LKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPFP 184
            SSATSSSLK LK++   + TRVFSAG+F F+P  QP  C++K  ++  +K + P   FP
Sbjct: 128 ASSATSSSLKRLKSNHHGENTRVFSAGFFTFRPENQPEICSQKPINLRGSKPLLPDPSFP 187

Query: 185 PIVPPIVPPVIQPPSLLPPNPLQPAPLIPNPLEPPTPVIPNPFQPPAPLIPIIPMPPLPI 244
                  PP+  PP+        P+PL   P+ PP P +P P  P    +P +P+P +P 
Sbjct: 188 -------PPLQDPPN--------PSPLPNLPIVPPLPNLPVPKLP----VPDLPLPLVPP 247

Query: 245 LTPPSP----------------------------------PPTVLP--PFIP----PFLP 274
           L PP P                                  PP+++P  P IP    P LP
Sbjct: 248 LLPPGPQKSASLHNKKSDSLKDKKTEALKPNFFFPPNPLNPPSIIPPNPLIPSIPTPTLP 302

BLAST of CmoCh14G011770 vs. TAIR10
Match: AT4G08685.1 (AT4G08685.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 59.7 bits (143), Expect = 3.3e-09
Identity = 34/107 (31.78%), Postives = 61/107 (57.01%), Query Frame = 1

Query: 12  LFLSLTSHNAFSHSLHSNK-PAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVECRDHKTS 71
           L ++L    A + +   NK P VV G V+CDTC     + ++ +ISGA+V +EC+D +T 
Sbjct: 6   LLVALCFLPALAIAARPNKNPFVVRGRVYCDTCLAGFETPASTYISGAVVRLECKDRRTM 65

Query: 72  ETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCS 118
           E  +  E +T+  G++K+++    ++H  +   C   +++SS   CS
Sbjct: 66  ELTYSHEARTDSTGSYKILV---NEDHDEQF--CDAMLVRSSQLRCS 107

BLAST of CmoCh14G011770 vs. TAIR10
Match: AT3G26960.1 (AT3G26960.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 55.5 bits (132), Expect = 6.1e-08
Identity = 49/173 (28.32%), Postives = 79/173 (45.66%), Query Frame = 1

Query: 2   KMGWTLLLFALFLSLTSHNAFS-HSLHSNKP---AVVVGTVFCDTCSQQRLSKSTYFISG 61
           K   T+ LF L   L  ++  S HS    KP     ++G V+CD CS+   SK +YF+SG
Sbjct: 3   KKSLTMFLFILLQFLLVNSLSSKHSSPKPKPDAEITIMGFVYCDVCSKNSFSKHSYFMSG 62

Query: 62  AIVAVECRDHKTSET-----NFKEEVKTNKNGNFKVVLPFSEQEHTNKI-ETCSVKMI-- 121
             V + CR    S T      F     TN+ G +KV +   +    + +  +C   +I  
Sbjct: 63  VEVRIVCRFKAASSTTTETITFSANRTTNEFGLYKVAISSLDCADVDSLASSCQASLIGR 122

Query: 122 -KSSDPFCSVPSSATSSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTS-CNRK 161
              SD  C++P   T++   L  S+ +  + V+      F+P K+  + C +K
Sbjct: 123 KNFSDSSCNIPGYRTTTDQVLFKSQ-QSNSCVYGFNALNFRPFKRDLALCGKK 174

BLAST of CmoCh14G011770 vs. TAIR10
Match: AT1G78040.1 (AT1G78040.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 55.5 bits (132), Expect = 6.1e-08
Identity = 36/127 (28.35%), Postives = 62/127 (48.82%), Query Frame = 1

Query: 33  VVVGTVFCDTCSQQ-RLSKSTYFISGAIVAVECRDHKTSETNFKEEVKTNKNGNFKVVLP 92
           VV G+ +CD C       +S+YFI GA V + C+D KT E  + ++  ++K G +K ++ 
Sbjct: 31  VVQGSTYCDICKFGFETPESSYFIPGATVKLSCKDRKTMEEVYTDKAVSDKEGKYKFIV- 90

Query: 93  FSEQEHTNKIETCSVKMIKSSDPFCSVPSSATSSSLKLKNSKIRDGTRVFSAGYFAFKPL 152
               +H +++  C V ++KSSD  CS  S     S  + N      +++  A    F+  
Sbjct: 91  --HDDHRDQM--CDVLLVKSSDKTCSKISVGREKSRVILNHYSGIASQIRHANNMGFEKE 150

Query: 153 KQPTSCN 159
                C+
Sbjct: 151 VSDVFCS 152

BLAST of CmoCh14G011770 vs. TAIR10
Match: AT5G41050.1 (AT5G41050.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 54.7 bits (130), Expect = 1.0e-07
Identity = 47/162 (29.01%), Postives = 72/162 (44.44%), Query Frame = 1

Query: 7   LLLFALFLSLTSHNAFSHSLHSNKP---AVVVGTVFCDTCSQQRLSKSTYFISGAIVAVE 66
           L++  L L L S N+ S    S KP     V+G V+CD CS    S  +YFI G  V + 
Sbjct: 5   LIMLLLLLQLLSLNSLSLKHSSAKPNGKITVMGLVYCDVCSNNSFSNHSYFIPGVEVRII 64

Query: 67  CRDHKTSE-----TNFKEEVKTNKNGNFKV-------VLPFSEQEHTNKIETCSVKMIKS 126
           CR +  S        F     TN+ G +K+       V   +E +  + + +C   +I  
Sbjct: 65  CRFNSASSRTREMITFSANRTTNELGLYKLDITSLEGVACAAEAKKDSLMASCQASLIGR 124

Query: 127 SDPFCSVPSSATSSSLKLKNSKIRDGTRVFSAGYFAFKPLKQ 154
           S   C+VP   T++   +  SKI     V+      F+PL++
Sbjct: 125 SKDSCNVPGFRTTTEQVVFKSKI-SNLCVYGFTALNFRPLEK 165

BLAST of CmoCh14G011770 vs. NCBI nr
Match: gi|449438733|ref|XP_004137142.1| (PREDICTED: amyloid beta A4 precursor protein-binding family B member 1-interacting protein [Cucumis sativus])

HSP 1 Score: 323.6 bits (828), Expect = 3.4e-85
Identity = 187/293 (63.82%), Postives = 219/293 (74.74%), Query Frame = 1

Query: 1   MKMGWTLLLFALFLSLTSHNAFSHSL--HSNK--PAVVVGTVFCDTCSQQRLSKSTYFIS 60
           MK GW LLLF +FL+LT +N FS  L  H+NK  PAVVVGTVFCDTC QQ LS S +FIS
Sbjct: 1   MKKGWILLLFFMFLNLTFYNGFSKPLLLHNNKLQPAVVVGTVFCDTCFQQHLSNSHHFIS 60

Query: 61  GAIVAVECRDHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFC 120
           GA V VECRD KT E +FK++VKTNKNG FKVVLPFS  +H  KIE+CSVK+IKSS+PFC
Sbjct: 61  GARVEVECRDEKTPEASFKQQVKTNKNGKFKVVLPFSIAKHVKKIESCSVKLIKSSEPFC 120

Query: 121 SVPSSATSSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPF 180
           SV SSA+SSSL+LKNSK ++G R+FSAG+F FKPL+QPT CN+K+         NPQLPF
Sbjct: 121 SVASSASSSSLQLKNSKNKNGVRIFSAGFFTFKPLQQPTLCNQKS---------NPQLPF 180

Query: 181 PPIVPPIVPPVIQPPSLLPPNPLQPAPLIPNPLEPPTPVIPNPFQPPAPLIPI------- 240
           PP    +VPPVIQPPS  PPNPLQP PL+PNP +PP P+IPNPFQPPAP+IP        
Sbjct: 181 PP----LVPPVIQPPSFFPPNPLQPTPLVPNPFQPPAPLIPNPFQPPAPVIPNPFQPPTP 240

Query: 241 -------------IPMPPLPILTPPSPPPTVLPP-FIPPFLPPIPGIPSGPPK 269
                        +P+PPLP +TP  PPPT+LPP F+PPFLPPIPGIP GPP+
Sbjct: 241 VIPNPFQPAPATGLPLPPLPFITPSPPPPTLLPPSFLPPFLPPIPGIPPGPPR 280

BLAST of CmoCh14G011770 vs. NCBI nr
Match: gi|659111055|ref|XP_008455556.1| (PREDICTED: formin-like protein 7 [Cucumis melo])

HSP 1 Score: 313.2 bits (801), Expect = 4.6e-82
Identity = 183/293 (62.46%), Postives = 214/293 (73.04%), Query Frame = 1

Query: 1   MKMGWTLLLFALFLSLTSHNAFSHSL-HSN---KPAVVVGTVFCDTCSQQRLSKSTYFIS 60
           MKMGW LLLF LFL+   +N FS  L H+N   +PAVVVGTVFCDTC QQ LS S +FIS
Sbjct: 1   MKMGWVLLLFFLFLNFNFNNGFSKPLVHNNNKLQPAVVVGTVFCDTCFQQHLSGSHHFIS 60

Query: 61  GAIVAVECRDHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFC 120
           GA + VECRD      +FK++VKTNKNG F+VVLPFS  +H  KIE CSVK+IKSS+PFC
Sbjct: 61  GARIEVECRDENNPIASFKQQVKTNKNGKFRVVLPFSIAKHIKKIERCSVKLIKSSEPFC 120

Query: 121 SVPSSATSSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPF 180
           SV SSA+SS LK KNSK ++G R+FSAG+F FKPL+QPT CN+K+         NPQLPF
Sbjct: 121 SVVSSASSSFLKFKNSKNKNGVRIFSAGFFTFKPLQQPTLCNQKS---------NPQLPF 180

Query: 181 PPIVPPIVPPVIQPPSLLPPNPLQPAPLIPNPLEPPTPVIPNPFQPPAPLIPI------- 240
           PP    +VPPVIQPPS LPPNPLQP PL+PNP +PP P+IPNPFQPPAPLIP        
Sbjct: 181 PP----LVPPVIQPPSFLPPNPLQPTPLVPNPFQPPAPLIPNPFQPPAPLIPNPFQPPAP 240

Query: 241 -------------IPMPPLPILTPPSPPPTVLPP-FIPPFLPPIPGIPSGPPK 269
                        +P+PPLP +TP  PPPT+LPP F+PPFLPPIPGIPSGPP+
Sbjct: 241 VIPNPFQPAPATGLPLPPLPFITPSPPPPTLLPPSFLPPFLPPIPGIPSGPPR 280

BLAST of CmoCh14G011770 vs. NCBI nr
Match: gi|728818493|gb|KHG02822.1| (Major pollen allergen Lig v 1 [Gossypium arboreum])

HSP 1 Score: 214.2 bits (544), Expect = 2.9e-52
Identity = 146/275 (53.09%), Postives = 185/275 (67.27%), Query Frame = 1

Query: 4   GWTLLLFALFLSLTSHNAFSHSLHSNKPAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVE 63
           G+   L   F+    ++  SH     + AVVVGTV+CDTC Q  +S+ T+FISGA VAVE
Sbjct: 3   GFFFTLLLCFIIFNCYSEGSHGYQQQQSAVVVGTVYCDTCFQSDISRPTHFISGATVAVE 62

Query: 64  CRDHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVPSSAT 123
           C+D K+S  +F+++VKTN++G FKV LPFS  +H  KIE C VK+IKSS+P+C+V SSAT
Sbjct: 63  CKDGKSSRASFRQQVKTNRHGEFKVHLPFSVSKHVKKIEGCEVKLIKSSEPYCAVASSAT 122

Query: 124 SSSLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPFPPIVPP- 183
           SSSL LK+ K   GT VFSAG+F FKP KQPT C++K +         P LP  P++PP 
Sbjct: 123 SSSLHLKSMK--QGTHVFSAGFFTFKPFKQPTLCSQKPSVHPNQLLPPPILPPNPLLPPP 182

Query: 184 IVPP-VIQPPSLLPPNPLQ--PAPLIPNPLE-PPTPVIPNPFQ-PPAPLIPIIPMPPLPI 243
           I+PP  + PP  LPPNP Q  PAPL+PNPL+ PP P++P+P Q PPAP  P   +PP+P 
Sbjct: 183 ILPPNPLLPPPALPPNPFQPPPAPLVPNPLQPPPAPLVPDPLQPPPAPAAP--SLPPVPG 242

Query: 244 LTP----PSPPPTV---LPPFIPPFL--PPIPGIP 264
           LTP    PSPPP     LPPF  PFL  PP PG+P
Sbjct: 243 LTPPPSSPSPPPDFPFPLPPF--PFLPVPPFPGVP 271

BLAST of CmoCh14G011770 vs. NCBI nr
Match: gi|823126330|ref|XP_012488255.1| (PREDICTED: sulfated surface glycoprotein 185-like [Gossypium raimondii])

HSP 1 Score: 209.5 bits (532), Expect = 7.2e-51
Identity = 142/266 (53.38%), Postives = 180/266 (67.67%), Query Frame = 1

Query: 13  FLSLTSHNAFSHSLHSNKPAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVECRDHKTSET 72
           F+    ++  SH     + AVVVGTV+CDTC Q   S+ T+FISGA VAVEC+D K+S  
Sbjct: 12  FIIFNGYSEGSHGYQQQQSAVVVGTVYCDTCFQSDFSRPTHFISGATVAVECKDGKSSRA 71

Query: 73  NFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVPSSATSSSLKLKNS 132
           +F+++VKTN++G FKV LPFS  +H  KIE C VK+IKSS+P+C+V SSATSSSL LK+ 
Sbjct: 72  SFRQQVKTNRHGEFKVHLPFSVSKHVKKIEGCEVKLIKSSEPYCAVASSATSSSLHLKSR 131

Query: 133 KIRDGTRVFSAGYFAFKPLKQPTSCNRKNTDILKAKQVNPQLPFPPIVPP-IVPP-VIQP 192
           K   GT VFSAG+F FKP KQPT C++K +         P LP  P++PP I+PP  + P
Sbjct: 132 K--QGTHVFSAGFFTFKPFKQPTLCSQKPSVHPNQLLPPPILPPNPLLPPPILPPNPLLP 191

Query: 193 PSLLPPNPLQ--PAPLIPNPLE-PPTPVIPNPFQ-PPAPLIPIIPMPPLPILTP----PS 252
           P +LPPNP Q  PAPL+PNPL+ PP P++P+P Q PP P  P   +PP+P L P    PS
Sbjct: 192 PPVLPPNPFQPPPAPLVPNPLQPPPAPLVPDPLQPPPEPAAP--SLPPVPGLAPPPSSPS 251

Query: 253 PPPTV---LPPFIPPFL--PPIPGIP 264
           PPP     LPPF  PFL  PP PG+P
Sbjct: 252 PPPDFPFPLPPF--PFLPVPPFPGVP 271

BLAST of CmoCh14G011770 vs. NCBI nr
Match: gi|590651402|ref|XP_007032885.1| (Pollen Ole e 1 allergen and extensin family protein [Theobroma cacao])

HSP 1 Score: 206.8 bits (525), Expect = 4.7e-50
Identity = 154/316 (48.73%), Postives = 189/316 (59.81%), Query Frame = 1

Query: 7   LLLFALFLSLTSHNAFSHSLHSNK-PAVVVGTVFCDTCSQQRLSKSTYFISGAIVAVECR 66
           LLL   F      N FS S H  K  AVVVGTV+CDTC Q+  S+++YFISGA VAVEC+
Sbjct: 7   LLLLGFFF-----NNFSESSHDQKLSAVVVGTVYCDTCFQEEFSRTSYFISGASVAVECK 66

Query: 67  DHKTSETNFKEEVKTNKNGNFKVVLPFSEQEHTNKIETCSVKMIKSSDPFCSVPSSATSS 126
           D  TS  +F++EVKTN++G FK+ LPFS  +H  KI+ CSVK+I+S +P+C+V S+ATSS
Sbjct: 67  DG-TSRPSFRQEVKTNEHGEFKIHLPFSVSKHVKKIKGCSVKLIRSREPYCAVASTATSS 126

Query: 127 SLKLKNSKIRDGTRVFSAGYFAFKPLKQPTSCNRK----NTDILKAKQVNPQ-------- 186
           SL LK+     GT +FSAG+F FKPLKQPT C++K    N   LK K+   Q        
Sbjct: 127 SLHLKSRM--HGTHIFSAGFFTFKPLKQPTLCSQKPSTQNPKQLKNKEPPAQNVLHPESF 186

Query: 187 LPFPPIVPPI----VPPVIQP-----PSLLPPNPLQ-------------PAPLIPNPLE- 246
           L  P + PP      PP++ P     P LLPPNP+Q             PAPLIPNP + 
Sbjct: 187 LSTPSVFPPDDAVPAPPILTPNPFQLPPLLPPNPIQPPPLLPPNPFQPPPAPLIPNPFQP 246

Query: 247 PPTPVIPNPFQPP--------------APLIPIIPMPPLPILTPPSPPPTVLPPFIP--- 265
           PP P+IPNPFQPP              AP  P+ P PP+P LTPPSPPP   PPF P   
Sbjct: 247 PPAPLIPNPFQPPPAPLFPPNPFRPPRAPPSPLFPFPPIPGLTPPSPPPPP-PPFFPFPF 306

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KW32_CUCSA2.4e-8563.82Uncharacterized protein OS=Cucumis sativus GN=Csa_4G126430 PE=4 SV=1[more]
A0A0B0MR63_GOSAR2.0e-5253.09Major pollen allergen Lig v 1 OS=Gossypium arboreum GN=F383_24177 PE=4 SV=1[more]
A0A0D2PUQ4_GOSRA5.0e-5153.38Uncharacterized protein OS=Gossypium raimondii GN=B456_001G206700 PE=4 SV=1[more]
A0A061EFY3_THECC3.3e-5048.73Pollen Ole e 1 allergen and extensin family protein OS=Theobroma cacao GN=TCM_01... [more]
A0A151UAZ7_CAJCA6.1e-4951.04Uncharacterized protein OS=Cajanus cajan GN=KK1_020696 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G15780.11.6e-3539.24 Pollen Ole e 1 allergen and extensin family protein[more]
AT4G08685.13.3e-0931.78 Pollen Ole e 1 allergen and extensin family protein[more]
AT3G26960.16.1e-0828.32 Pollen Ole e 1 allergen and extensin family protein[more]
AT1G78040.16.1e-0828.35 Pollen Ole e 1 allergen and extensin family protein[more]
AT5G41050.11.0e-0729.01 Pollen Ole e 1 allergen and extensin family protein[more]
Match NameE-valueIdentityDescription
gi|449438733|ref|XP_004137142.1|3.4e-8563.82PREDICTED: amyloid beta A4 precursor protein-binding family B member 1-interacti... [more]
gi|659111055|ref|XP_008455556.1|4.6e-8262.46PREDICTED: formin-like protein 7 [Cucumis melo][more]
gi|728818493|gb|KHG02822.1|2.9e-5253.09Major pollen allergen Lig v 1 [Gossypium arboreum][more]
gi|823126330|ref|XP_012488255.1|7.2e-5153.38PREDICTED: sulfated surface glycoprotein 185-like [Gossypium raimondii][more]
gi|590651402|ref|XP_007032885.1|4.7e-5048.73Pollen Ole e 1 allergen and extensin family protein [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0051225 spindle assembly
cellular_component GO:0005575 cellular_component
cellular_component GO:0070652 HAUS complex
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh14G011770.1CmoCh14G011770.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33210FAMILY NOT NAMEDcoord: 1..267
score: 3.3
NoneNo IPR availablePANTHERPTHR33210:SF1SUBFAMILY NOT NAMEDcoord: 1..267
score: 3.3
NoneNo IPR availablePFAMPF01190Pollen_Ole_e_Icoord: 34..123
score: 7.1