CmaCh05G004850 (gene) Cucurbita maxima (Rimu)

Name: CmaCh05G004850
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Pollen Ole e I family allergen
Location: Cma_Chr05 : 2282997 .. 2286110 (+)
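
The Location field packs a sequence ID, a start, an end, and a strand into one string. The sketch below shows one way to split it apart and derive the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts.

    Assumes 1-based, inclusive coordinates (not stated in the record).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Cma_Chr05 : 2282997 .. 2286110 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cma_Chr05 + 3114
```

Under the inclusive-coordinate assumption, the gene spans 3114 bp on the plus strand.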
The following sequences are available for this feature:

Gene sequence (with intron)

GTGTCCAATTCCATACGTTTCAGTATATCATTGGAGGAAAATAGTTTATTATATAGGATCATTCACAGGTTAAAAGAACCCTACACGGCATAATTTCTTACACCGCAATTCCAGGCTTTCCCTCGCAGCATCTGATTAGAGGTTTTCACTTTACTCTTTATTATTCAAACCCTTACTTCTCATTTTCCCTTCATCAGTTCTTTCTTCTTCCATTTCCATTTTTCTTCCAATCGAAGAACGAAAATCCATGGCTCGATTTGCGATTCTCTTTGCTCTCATTTTGCTTCCTGCACTCGCCGTCGCAAGCCGGCCCGTTAGAACTCCGTTCCTTGTTCGTGGTAAAGTGTTCTGTGATACTTGCGTTGCCGGCTTCGAAACCTCCGCTACCACCTACATTCCCGGTATCTGCTCTCTTTCCTCGCTTTTAGCCTACTAGTTGAGAATTGTGGATGTGTTTGTTCTTATGTGCCTCGTGTTCTTGCTAGATCTGTGATGTAGTTAAGAGATTATGTTGATTTGTGGCTTGCTTGCTAGTTTAATGGTGGATCTGGTGCTTGAATCTTGAATGCTGCCTGATGGTATGTGGAGTGAGTTAGGGTTTAATTATAGGGATTATTTTAGTCCAAATTCGCGGTTTGGGTATGATTATTTTTTAATTGAAGCTTAATCTTGGGCTTAGTAATTAGATCTATGTTTGTTAGGTAAGAAGACTTATTTATTTTATATTTCTTTCTATGTCGTGCGTAACTTTTTATATTTAAGCAATGATTTTTCAAATCTGCCCGCTTACATTAGAATTTTTGTATTGATCCTGTATTTAGAACACTTCTCAGAAACTTTTACATACTATGTGTCCGAAATAACATCAACTGCTCGTAATTCTTTTTGCATCCATTGTCTGTGTTAAGAATATGAATAGGATAGAAGATTAAATGTAACAAACTAGCAGTTTAAGGCTTAAGCTTTTAGGTCATGTAGTAATTTAAATTTAAAATGGTTGTGGAGTAAAAGGTCGTGGGTAGGGGTGTACATTCAACCCAATAACCTGGACCAACCCAACCCGAATTATAAGGGTTGGGTTGGATTAGCATTTCAGGTAGGGTTGGGTTCATTTTTCTGAACCCGAACTGAATCGGTTCGGTTTCGGGTTCGTGGATAATTATACATCGATTTGTGTTTATTTGTGAATTTTTAATATATTTTATCTAAGATTGTATTTGAATAATTATAAATTGTGGATAAATAAAACTAAAAAAATCCGATAACACGACAACCCAACCTGAAGGTTGGGTTGGGTTGAGTTGTGAAAATTTTCGGGCTGGGTTGGGTTATCAATCCAACCAATCTGAACTTTTGGGTTGGGTCAAAAAAATCCTTCAACCCAACCCGAACCATGTACATCCCTACTCATGGGTCATTTGTCATGGGCTAATTTTCAAGCCCTTAAAAAGGATGAGAGTGAGGTGGTTGGTATATCAACTTGCTTCCAGATTCATTTTAGTATGTGCAATCGGTTGATTCAAACCTCTAATATCTTTGTCGACCATAAGTGTCTTAACTAGTTTTGTTTGTTAAATCATTACCTGGTTGGTAGATATAATCTTTTGTTATATCCCTAACCCTTCTCTTGATTGGAGTCAGGATATGAATTTAATATACAAAGGCCCACATGCAGGTTATTATACAAAGGCCCACGTGTCTTAACTTAGTGCTTAACATGTGATAATTGTCTAGGTGTGGGTAGTTCACGGATAAAGCCTTTTCACCAATTGTTCAATTTCCTTGTAGGTGCTAAGGTCGGAATTGAATGCAAAGACAGGAACACAATGGAACTTCTATATAAGCATGAGGCCACGACAGACTCCACTGGTTCTTACTCACTCTTAGTCAATGAAGACCACGAGGACCAAGTTTGTGATGCCGTACTAATCAGCAGCCCTCAGAAGGATTGTTCATTAGCAGCTGAAGGCCGTGACCGTGCCCGTGTAATCCTCACCCGC
TACAACGGTATGGCCTCAAACGAACGTTATGTCAATGCCATGGGTTTTGCAAGATATGAACCTATGTCTGGCTGCAACCAAGTCCTCAGTCAGTACCAGGACATCGAAGATTAGAACGGGTTTTAATCCTACAATCGCTTAGTTGTATTTCTTGAGGTATATTTGCCCTGTTCTGTTCATGCTAGTGTTGCCCCACATACTCTTTTATCTGGACATTGAATGGTGTTTGTTATTGCATTCTTAAAGATGATTAATATGTTTTGTTCATTCACTTCATTACTATCATCACCAGGATGATGCCATCAACACACCCTTATTTATAAATACAAACTAGTGAAAATAGCCTTAATGACTTCTAATTTCTAACTGAAATAGGCTACCTGTTGTTATGCCGTAACTTAATTACTTTGTTGTGCAATAATTTTCTCCGGATTATGAATTAGATCTAATAATCCCTGACTTTCTAATTTGAATCTCTATGATTTGATCTCCCATATTTGTTGAAGCAAGCCATGAAGTTCAAGCACTGCACTGGTTAAGCCGAACATGGGTTTTAAAAATCTCGAGGGAAAACTTGAAATGAAAAGTCTAAATAAGACAATATCTGCTAGCGGTGAGCCTGAGCTATTACAAATGGTATCAGAGCTAAACACCGGACGATGTACTAGCAAGGAGGCTGTTATCTAAAGGGGTAGACAGAGGCGGTGTGTCAATAAGGATGTCAGGCTTTCAAAGGGGGTGGATTTGGTGAAGGTCCCACATAGATTGAAGAAAGGAATTAGTGCGAGCTAGGACACTAGGCCTTGAACGGGATGGATTGTGAGATCACACATTGGTTGAAAAGGAGAACGAAAAACTTTTTATAAAAGTGTAGAAACCTCTCTTTAGGAGAGGTGTTTTAAAATCTAGACGGAAACCAAAGAGAAAATATTTGCTAGCAATAAGTCAAAAACCTTGTATTTAATTAGTAAGGGGTGAATTTCCAAATTTCCAACCATTCACACTTAGGATTATGAAATAAAAGAATCAACCAACTTTTAATATTTTGGCTTTAATCTTTAATAAGAAAAAGAAAAAGTTTAAAAAAAATGGTTAATATATTCGAGATAACCAC

mRNA sequence

GTGTCCAATTCCATACGTTTCAGTATATCATTGGAGGAAAATAGTTTATTATATAGGATCATTCACAGGTTAAAAGAACCCTACACGGCATAATTTCTTACACCGCAATTCCAGGCTTTCCCTCGCAGCATCTGATTAGAGGTTTTCACTTTACTCTTTATTATTCAAACCCTTACTTCTCATTTTCCCTTCATCAGTTCTTTCTTCTTCCATTTCCATTTTTCTTCCAATCGAAGAACGAAAATCCATGGCTCGATTTGCGATTCTCTTTGCTCTCATTTTGCTTCCTGCACTCGCCGTCGCAAGCCGGCCCGTTAGAACTCCGTTCCTTGTTCGTGGTAAAGTGTTCTGTGATACTTGCGTTGCCGGCTTCGAAACCTCCGCTACCACCTACATTCCCGGTGCTAAGGTCGGAATTGAATGCAAAGACAGGAACACAATGGAACTTCTATATAAGCATGAGGCCACGACAGACTCCACTGGTTCTTACTCACTCTTAGTCAATGAAGACCACGAGGACCAAGTTTGTGATGCCGTACTAATCAGCAGCCCTCAGAAGGATTGTTCATTAGCAGCTGAAGGCCGTGACCGTGCCCGTGTAATCCTCACCCGCTACAACGGTATGGCCTCAAACGAACGTTATGTCAATGCCATGGGTTTTGCAAGATATGAACCTATGTCTGGCTGCAACCAAGTCCTCAGTCAGTACCAGGACATCGAAGATTAGAACGGGTTTTAATCCTACAATCGCTTAGTTGTATTTCTTGAGGTATATTTGCCCTGTTCTGTTCATGCTAGTGTTGCCCCACATACTCTTTTATCTGGACATTGAATGGTGTTTGTTATTGCATTCTTAAAGATGATTAATATGTTTTGTTCATTCACTTCATTACTATCATCACCAGGATGATGCCATCAACACACCCTTATTTATAAATACAAACTAGTGAAAATAGCCTTAATGACTTCTAATTTCTAACTGAAATAGGCTACCTGTTGTTATGCCGTAACTTAATTACTTTGTTGTGCAATAATTTTCTCCGGATTATGAATTAGATCTAATAATCCCTGACTTTCTAATTTGAATCTCTATGATTTGATCTCCCATATTTGTTGAAGCAAGCCATGAAGTTCAAGCACTGCACTGGTTAAGCCGAACATGGGTTTTAAAAATCTCGAGGGAAAACTTGAAATGAAAAGTCTAAATAAGACAATATCTGCTAGCGGTGAGCCTGAGCTATTACAAATGGTATCAGAGCTAAACACCGGACGATGTACTAGCAAGGAGGCTGTTATCTAAAGGGGTAGACAGAGGCGGTGTGTCAATAAGGATGTCAGGCTTTCAAAGGGGGTGGATTTGGTGAAGGTCCCACATAGATTGAAGAAAGGAATTAGTGCGAGCTAGGACACTAGGCCTTGAACGGGATGGATTGTGAGATCACACATTGGTTGAAAAGGAGAACGAAAAACTTTTTATAAAAGTGTAGAAACCTCTCTTTAGGAGAGGTGTTTTAAAATCTAGACGGAAACCAAAGAGAAAATATTTGCTAGCAATAAGTCAAAAACCTTGTATTTAATTAGTAAGGGGTGAATTTCCAAATTTCCAACCATTCACACTTAGGATTATGAAATAAAAGAATCAACCAACTTTTAATATTTTGGCTTTAATCTTTAATAAGAAAAAGAAAAAGTTTAAAAAAAATGGTTAATATATTCGAGATAACCAC

Coding sequence (CDS)

ATGGCTCGATTTGCGATTCTCTTTGCTCTCATTTTGCTTCCTGCACTCGCCGTCGCAAGCCGGCCCGTTAGAACTCCGTTCCTTGTTCGTGGTAAAGTGTTCTGTGATACTTGCGTTGCCGGCTTCGAAACCTCCGCTACCACCTACATTCCCGGTGCTAAGGTCGGAATTGAATGCAAAGACAGGAACACAATGGAACTTCTATATAAGCATGAGGCCACGACAGACTCCACTGGTTCTTACTCACTCTTAGTCAATGAAGACCACGAGGACCAAGTTTGTGATGCCGTACTAATCAGCAGCCCTCAGAAGGATTGTTCATTAGCAGCTGAAGGCCGTGACCGTGCCCGTGTAATCCTCACCCGCTACAACGGTATGGCCTCAAACGAACGTTATGTCAATGCCATGGGTTTTGCAAGATATGAACCTATGTCTGGCTGCAACCAAGTCCTCAGTCAGTACCAGGACATCGAAGATTAG

Protein sequence

MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKDRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVILTRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED
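
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first 15 and last 18 nucleotides of the CDS only. It assumes the standard genetic code (NCBI translation table 1), and `translate` is a hypothetical helper written for this example.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        b1, b2, b3 = cds[i:i + 3]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)

cds_start = "ATGGCTCGATTTGCG"    # first five codons of the CDS above
cds_end = "CAGGACATCGAAGATTAG"   # last six codons, including the stop

print(translate(cds_start))  # MARFA
print(translate(cds_end))    # QDIED*
```

The two fragments reproduce the first five residues (MARFA) and the last five residues (QDIED) of the protein, followed by the stop codon.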
BLAST of CmaCh05G004850 vs. Swiss-Prot
Match: DFC_ARATH (Protein DOWNSTREAM OF FLC OS=Arabidopsis thaliana GN=DFC PE=2 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 5.9e-29
Identity = 71/151 (47.02%), Positives = 87/151 (57.62%), Query Frame = 1

Query: 4   FAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKDRN 63
           F  L A++ +  L +A+  V TPF + G V+CDTC  GFET AT YI GA+V I CKDR 
Sbjct: 5   FVPLIAVLCVLVLPLAAMAVGTPFHIEGSVYCDTCRFGFETIATQYIRGARVRIVCKDRV 64

Query: 64  TMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVILTRY 123
           T++      A T   G Y + V  D +DQ C A L+ SP   C  A  GR  A VILTR 
Sbjct: 65  TLKSELVGVAVTGPDGKYKVAVRGDRQDQQCLAELVHSPLSRCQEADPGRSTATVILTRS 124

Query: 124 NGMASNERYVNAMGFARYEPMSGCNQVLSQY 155
           NG AS   Y NAMGF R EP+ GC  +  +Y
Sbjct: 125 NGAASTRHYANAMGFFRDEPLRGCAALRKRY 155
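
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the fractions of the HSP above. `hsp_fractions` is a hypothetical helper, and the regular expression only assumes the `Label = a/b` layout seen in this record.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'Label = a/b' pair."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, round(100.0 * matches / length, 2))
    return stats

summary = "Identity = 71/151 (47.02%), Positives = 87/151 (57.62%), Query Frame = 1"
stats = hsp_fractions(summary)
print(stats["Identity"])   # (71, 151, 47.02)
print(stats["Positives"])  # (87, 151, 57.62)
```

The labels are taken exactly as spelled on the line, so the same function works on any of the HSP summaries in this record.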

BLAST of CmaCh05G004850 vs. Swiss-Prot
Match: PSC13_MAIZE (Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 2.1e-26
Identity = 59/155 (38.06%), Positives = 95/155 (61.29%), Query Frame = 1

Query: 5   AILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKDRNT 64
           A++  L ++ + A A  P    ++++G+V+CDTC AGF T+ T YI GAKV +ECK   T
Sbjct: 13  AVILCLCVVLSCAAADDPNLPDYVIQGRVYCDTCRAGFVTNVTEYIAGAKVRLECKHFGT 72

Query: 65  MELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVILTRYN 124
            +L    +  TD+TG+Y++ + + HE+ +C  VL++SP+KDC      RDRA V+LTR  
Sbjct: 73  GKLERAIDGVTDATGTYTIELKDSHEEDICQVVLVASPRKDCDEVQALRDRAGVLLTRNV 132

Query: 125 GMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           G++ + R  N +G+ +  P+  C  +L Q    +D
Sbjct: 133 GISDSLRPANPLGYFKDVPLPVCAALLKQLDSDDD 167

BLAST of CmaCh05G004850 vs. Swiss-Prot
Match: PHLB_PHLPR (Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 4.0e-25
Identity = 58/134 (43.28%), Positives = 81/134 (60.45%), Query Frame = 1

Query: 27  FLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKDRNTMELLYKHEATTDSTGSYSLLVN 86
           F+V G+V+CD C AGFET+ +  + GA V ++C+  N  E   K EATTD  G Y + ++
Sbjct: 6   FVVTGRVYCDPCRAGFETNVSHNVQGATVAVDCRPFNGGESKLKAEATTDGLGWYKIEID 65

Query: 87  EDHEDQVCDAVLISSPQKDCSLAAEGRDRARVILTRYNGMASNE-RYVNAMGFARYEPMS 146
           +DH++++C+ VL  SP   CS   E RDRARV LT  NG+     RY N + F R EP+ 
Sbjct: 66  QDHQEEICEVVLAKSPDTTCSEIEEFRDRARVPLTSNNGIKQQGIRYANPIAFFRKEPLK 125

Query: 147 GCNQVLSQYQDIED 160
            C  +L  Y D+ D
Sbjct: 126 ECGGILQAY-DLRD 138

BLAST of CmaCh05G004850 vs. Swiss-Prot
Match: LOLB_LOLPR (Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 2.6e-24
Identity = 55/129 (42.64%), Positives = 78/129 (60.47%), Query Frame = 1

Query: 27  FLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKDRNTMELLYKHEATTDSTGSYSLLVN 86
           F+V G+V+CD C AGFET+ +  + GA V ++C+  +  E   K EATTD  G Y + ++
Sbjct: 6   FVVTGRVYCDPCRAGFETNVSHNVEGATVAVDCRPFDGGESKLKAEATTDKDGWYKIEID 65

Query: 87  EDHEDQVCDAVLISSPQKDCSLAAEGRDRARVILTRYNGMASNE-RYVNAMGFARYEPMS 146
           +DH++++C+ VL  SP K CS   E RDRARV LT   G+     RY N + F R EP+ 
Sbjct: 66  QDHQEEICEVVLAKSPDKSCSEIEEFRDRARVPLTSNXGIKQQGIRYANPIAFFRKEPLK 125

Query: 147 GCNQVLSQY 155
            C  +L  Y
Sbjct: 126 ECGGILQAY 134

BLAST of CmaCh05G004850 vs. Swiss-Prot
Match: CHE1_CHEAL (Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 3.1e-22
Identity = 61/155 (39.35%), Positives = 87/155 (56.13%), Query Frame = 1

Query: 2   ARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKD 61
           A F ++ AL +L +LA  +      F V+G V+CDTC   F T  +T + GA V +EC++
Sbjct: 6   AVFLLVGALCVL-SLAGVANAAENHFKVQGMVYCDTCRIQFMTRISTIMEGATVKLECRN 65

Query: 62  RNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAE---GRDRARV 121
                  +K EA TD  G YS+ VN D ED +C+  L+ SP  +CS  +     +  A+V
Sbjct: 66  ITAGTQTFKAEAVTDKVGQYSIPVNGDFEDDICEIELVKSPNSECSEVSHDVYAKQSAKV 125

Query: 122 ILTRYNGMASNERYVNAMGFARYEPMSGCNQVLSQ 154
            LT  NG AS+ R  NA+GF R EP+  C +VL +
Sbjct: 126 SLTSNNGEASDIRSANALGFMRKEPLKECPEVLKE 159

BLAST of CmaCh05G004850 vs. TrEMBL
Match: A0A0A0LQA5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G309370 PE=4 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 4.3e-71
Identity = 135/159 (84.91%), Positives = 148/159 (93.08%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MAR  ILFALI+LPALA+ASRPVRTPF+VRGKVFCDTC+AGFETSATTYIPGAKV IECK
Sbjct: 1   MARVIILFALIMLPALALASRPVRTPFVVRGKVFCDTCLAGFETSATTYIPGAKVRIECK 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DRN+MEL Y HEATTDSTGSY+LLVNEDH D++CDAVL+SSPQ+ CS  +EGRDRARVIL
Sbjct: 61  DRNSMELQYTHEATTDSTGSYTLLVNEDHGDELCDAVLVSSPQEKCSSVSEGRDRARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           TRYNG+ASNERYVNAMGFA  EPMSGCNQV+SQYQDIED
Sbjct: 121 TRYNGIASNERYVNAMGFAMDEPMSGCNQVMSQYQDIED 159

BLAST of CmaCh05G004850 vs. TrEMBL
Match: M5XNU1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012675mg PE=4 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 1.1e-53
Identity = 105/159 (66.04%), Positives = 126/159 (79.25%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MA+  +LFAL +LPALAVA+RP+RTPF V GKVFCD C AGFETSATTYIPGA V +EC+
Sbjct: 1   MAKLIVLFALCVLPALAVATRPMRTPFTVEGKVFCDPCRAGFETSATTYIPGATVRLECR 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DR TM++ Y  E  TDSTG+Y + V EDHEDQ CDAVL+SS QKDC+ AA GRDRARVIL
Sbjct: 61  DRKTMDIRYTKEGRTDSTGTYKIPVTEDHEDQFCDAVLVSSSQKDCAAAAPGRDRARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           T YNG+AS  R+ NAMGF + E +SGC Q+L Q Q+ ++
Sbjct: 121 TGYNGIASYNRFANAMGFMKNEAVSGCAQILKQLQEFDE 159

BLAST of CmaCh05G004850 vs. TrEMBL
Match: Q2I307_VITPS (Pollen-specific protein OS=Vitis pseudoreticulata PE=2 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 4.1e-53
Identity = 102/159 (64.15%), Positives = 124/159 (77.99%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           M R  +L AL +LPAL  A RPV  PF+++G+V+CDTC AGFETSATTYI GAKV +ECK
Sbjct: 1   MGRLMLLVALCVLPALVSAGRPVSQPFVLQGRVYCDTCRAGFETSATTYIAGAKVRVECK 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DRN+M+LLY  E  TDSTG+Y ++V EDHEDQ+CDAVL+SSPQ DC+    GRDRA VIL
Sbjct: 61  DRNSMQLLYSIEGITDSTGTYKIMVTEDHEDQLCDAVLVSSPQSDCASVDPGRDRAAVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           TRYNG+ S+ RY N+MGF +  PMS C Q+L QYQ+ ED
Sbjct: 121 TRYNGIVSDNRYANSMGFLKDHPMSECTQLLQQYQEFED 159

BLAST of CmaCh05G004850 vs. TrEMBL
Match: A0A059BKH8_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F00490 PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 1.2e-52
Identity = 104/159 (65.41%), Positives = 126/159 (79.25%), Query Frame = 1

Query: 1   MARFAILFALI-LLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIEC 60
           MAR ++L AL  LLPA+AVA+RP R P  V GKV+CDTC AGFET A+TYI GAKV +EC
Sbjct: 1   MARLSVLLALCCLLPAIAVAARPARNPLTVTGKVYCDTCRAGFETPASTYIAGAKVKVEC 60

Query: 61  KDRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVI 120
           KDR +M+LLY  EATTDSTG+Y L V+EDH+DQ+CDA+L+SSPQ +C   A GRDRARVI
Sbjct: 61  KDRTSMKLLYSQEATTDSTGTYKLFVSEDHQDQLCDAMLLSSPQLNCQKPAAGRDRARVI 120

Query: 121 LTRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIE 159
           LTRYNG+ S+ RY N MGF   +P++GC QVL QYQD +
Sbjct: 121 LTRYNGIVSDTRYANNMGFEMDQPLAGCTQVLQQYQDFD 159

BLAST of CmaCh05G004850 vs. TrEMBL
Match: E2LMG1_MONPE (Uncharacterized protein OS=Moniliophthora perniciosa (strain FA553 / isolate CP02) GN=MPER_07958 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 3.5e-52
Identity = 103/159 (64.78%), Positives = 125/159 (78.62%), Query Frame = 1

Query: 1   MARFAILFALI-LLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIEC 60
           MAR ++L AL  LLPA+A A+RP R P  V GKV+CDTC AGFET A+TYI GAKV +EC
Sbjct: 1   MARLSVLLALCCLLPAIAFAARPARNPLTVTGKVYCDTCRAGFETPASTYIAGAKVKVEC 60

Query: 61  KDRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVI 120
           KDR +M+LLY  EATTDSTG+Y L V+EDH+DQ+CDA+L+SSPQ +C   A GRDRARVI
Sbjct: 61  KDRTSMKLLYSQEATTDSTGTYKLFVSEDHQDQLCDAMLLSSPQPNCQKPAAGRDRARVI 120

Query: 121 LTRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIE 159
           LTRYNG+ S+ RY N MGF   +P++GC QVL QYQD +
Sbjct: 121 LTRYNGIVSDTRYANNMGFEMDQPLAGCTQVLQQYQDFD 159

BLAST of CmaCh05G004850 vs. TAIR10
Match: AT4G08685.1 (AT4G08685.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 201.1 bits (510), Expect = 5.3e-52
Identity = 95/159 (59.75%), Positives = 121/159 (76.10%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           M++  +L AL  LPALA+A+RP + PF+VRG+V+CDTC+AGFET A+TYI GA V +ECK
Sbjct: 1   MSKAVLLVALCFLPALAIAARPNKNPFVVRGRVYCDTCLAGFETPASTYISGAVVRLECK 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DR TMEL Y HEA TDSTGSY +LVNEDH++Q CDA+L+ S Q  CS  + G DRARV L
Sbjct: 61  DRRTMELTYSHEARTDSTGSYKILVNEDHDEQFCDAMLVRSSQLRCSNVSPGHDRARVTL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           TR+NG+AS++R+ N MGF R   M GC  ++  YQ+ ED
Sbjct: 121 TRFNGIASDDRFANNMGFLRDAAMPGCADIMKLYQETED 159

BLAST of CmaCh05G004850 vs. TAIR10
Match: AT1G78040.1 (AT1G78040.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 133.7 bits (335), Expect = 1.0e-31
Identity = 68/163 (41.72%), Positives = 101/163 (61.96%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPV---RTPFLVRGKVFCDTCVAGFETSATTY-IPGAKVG 60
           MA+  +L  L +LPA+A+A+R     +   +V+G  +CD C  GFET  ++Y IPGA V 
Sbjct: 1   MAKLVMLLVLCILPAIAMAARRGNIGKNTMVVQGSTYCDICKFGFETPESSYFIPGATVK 60

Query: 61  IECKDRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRA 120
           + CKDR TME +Y  +A +D  G Y  +V++DH DQ+CD +L+ S  K CS  + GR+++
Sbjct: 61  LSCKDRKTMEEVYTDKAVSDKEGKYKFIVHDDHRDQMCDVLLVKSSDKTCSKISVGREKS 120

Query: 121 RVILTRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           RVIL  Y+G+AS  R+ N MGF +      C+ +  +Y   ED
Sbjct: 121 RVILNHYSGIASQIRHANNMGFEKEVSDVFCSALFQKYMVDED 163

BLAST of CmaCh05G004850 vs. TAIR10
Match: AT5G10130.1 (AT5G10130.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 128.6 bits (322), Expect = 3.3e-30
Identity = 71/151 (47.02%), Positives = 87/151 (57.62%), Query Frame = 1

Query: 4   FAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECKDRN 63
           F  L A++ +  L +A+  V TPF + G V+CDTC  GFET AT YI GA+V I CKDR 
Sbjct: 5   FVPLIAVLCVLVLPLAAMAVGTPFHIEGSVYCDTCRFGFETIATQYIRGARVRIVCKDRV 64

Query: 64  TMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVILTRY 123
           T++      A T   G Y + V  D +DQ C A L+ SP   C  A  GR  A VILTR 
Sbjct: 65  TLKSELVGVAVTGPDGKYKVAVRGDRQDQQCLAELVHSPLSRCQEADPGRSTATVILTRS 124

Query: 124 NGMASNERYVNAMGFARYEPMSGCNQVLSQY 155
           NG AS   Y NAMGF R EP+ GC  +  +Y
Sbjct: 125 NGAASTRHYANAMGFFRDEPLRGCAALRKRY 155

BLAST of CmaCh05G004850 vs. TAIR10
Match: AT1G29140.1 (AT1G29140.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 102.1 bits (253), Expect = 3.3e-22
Identity = 60/168 (35.71%), Positives = 88/168 (52.38%), Query Frame = 1

Query: 1   MARFAILF-----ALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKV 60
           MA+F I+F         L     A       F ++G V+CDTC   F T  + ++ GAKV
Sbjct: 1   MAKFFIVFLASALCFTTLVHFTAADADDFDKFHIKGSVYCDTCRVQFITRISKFLEGAKV 60

Query: 61  GIECKDRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEG--- 120
            +ECK R    +    EA TD+ G+Y + V  DHE++VC+ VL+ SP  +C         
Sbjct: 61  KLECKGRENQTVTLTKEAVTDNAGNYQMEVMGDHEEEVCEIVLLQSPDPECGDVNNQEFL 120

Query: 121 RDRARVILTRYNGMASNE-RYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           R+ AR+ LT  +G+ SNE R +N +GF R  P++ C QV  +   + D
Sbjct: 121 RNAARISLTANDGIVSNETRTINPLGFMRKTPLAECPQVFKELGIVPD 168

BLAST of CmaCh05G004850 vs. TAIR10
Match: AT5G45880.1 (AT5G45880.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 99.4 bits (246), Expect = 2.2e-21
Identity = 60/171 (35.09%), Positives = 89/171 (52.05%), Query Frame = 1

Query: 1   MARFAILFALILLPAL--------AVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPG 60
           MA  AI F+  ++ A+        A A       F ++G V+CDTC   F T  + ++ G
Sbjct: 1   MASKAIFFSFFVVSAVCLSSLAGFAAADADDFDRFQIQGSVYCDTCRVQFVTRLSKFLEG 60

Query: 61  AKVGIECKDRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDC---SLA 120
           AKV +EC+ R    +    EA TD TGSY + V  DHE++VC+ VL+ SP   C   S  
Sbjct: 61  AKVKLECRSRTNGTITLTKEAVTDKTGSYKMEVTGDHEEEVCELVLVQSPDSGCSDVSTE 120

Query: 121 AEGRDRARVILTRYNGMASNE-RYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           A  R+ A++ LT  +G+ S+E R VN +GF    P++ C     +   + D
Sbjct: 121 AYLRNAAKISLTANDGIVSHETRIVNPLGFMVQTPLADCPAAFKELGIVPD 171

BLAST of CmaCh05G004850 vs. NCBI nr
Match: gi|449465966|ref|XP_004150698.1| (PREDICTED: protein DOWNSTREAM OF FLC [Cucumis sativus])

HSP 1 Score: 275.4 bits (703), Expect = 6.2e-71
Identity = 135/159 (84.91%), Positives = 148/159 (93.08%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MAR  ILFALI+LPALA+ASRPVRTPF+VRGKVFCDTC+AGFETSATTYIPGAKV IECK
Sbjct: 1   MARVIILFALIMLPALALASRPVRTPFVVRGKVFCDTCLAGFETSATTYIPGAKVRIECK 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DRN+MEL Y HEATTDSTGSY+LLVNEDH D++CDAVL+SSPQ+ CS  +EGRDRARVIL
Sbjct: 61  DRNSMELQYTHEATTDSTGSYTLLVNEDHGDELCDAVLVSSPQEKCSSVSEGRDRARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           TRYNG+ASNERYVNAMGFA  EPMSGCNQV+SQYQDIED
Sbjct: 121 TRYNGIASNERYVNAMGFAMDEPMSGCNQVMSQYQDIED 159

BLAST of CmaCh05G004850 vs. NCBI nr
Match: gi|659089151|ref|XP_008445353.1| (PREDICTED: protein DOWNSTREAM OF FLC [Cucumis melo])

HSP 1 Score: 273.5 bits (698), Expect = 2.4e-70
Identity = 134/159 (84.28%), Positives = 148/159 (93.08%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MAR  ILFALI+LPALAVASRPVRTPF+VRGKVFCDTC+AGFETSATTYIPGAKV IECK
Sbjct: 1   MARLVILFALIMLPALAVASRPVRTPFVVRGKVFCDTCLAGFETSATTYIPGAKVRIECK 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DRN+ME+ Y HEATTDSTGSY+LLV+EDH D++CDAVL+SSPQ+ CS  AEGRDRARVIL
Sbjct: 61  DRNSMEVRYTHEATTDSTGSYTLLVSEDHGDELCDAVLVSSPQEKCSSVAEGRDRARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           TRYNG+ASN+RYVNAMGFA  EPMSGCNQV+SQYQDIED
Sbjct: 121 TRYNGIASNDRYVNAMGFAIDEPMSGCNQVMSQYQDIED 159

BLAST of CmaCh05G004850 vs. NCBI nr
Match: gi|1009136509|ref|XP_015885563.1| (PREDICTED: pollen-specific protein C13-like [Ziziphus jujuba])

HSP 1 Score: 217.6 bits (553), Expect = 1.5e-53
Identity = 100/159 (62.89%), Positives = 129/159 (81.13%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MA+  +L AL +LPALA+A+RP+R P +V+G+V+CDTC+AGFETSA+ YI GAKV +ECK
Sbjct: 1   MAKLVLLLALCILPALALATRPLRKPLIVQGRVYCDTCLAGFETSASNYIAGAKVRLECK 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           +RNTM+LLY  E TTDSTG+Y + V EDHEDQ+CDA+L+SSPQ DC+  + GR+RARVIL
Sbjct: 61  NRNTMQLLYSKEGTTDSTGTYKITVTEDHEDQLCDALLVSSPQGDCAKVSPGRERARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           T YNGMAS  R+ NAMGF + EP +GC  VL QYQ+ ++
Sbjct: 121 TNYNGMASETRFANAMGFTKEEPAAGCANVLKQYQEFDN 159

BLAST of CmaCh05G004850 vs. NCBI nr
Match: gi|596288682|ref|XP_007225977.1| (hypothetical protein PRUPE_ppa012675mg [Prunus persica])

HSP 1 Score: 217.6 bits (553), Expect = 1.5e-53
Identity = 105/159 (66.04%), Positives = 126/159 (79.25%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MA+  +LFAL +LPALAVA+RP+RTPF V GKVFCD C AGFETSATTYIPGA V +EC+
Sbjct: 1   MAKLIVLFALCVLPALAVATRPMRTPFTVEGKVFCDPCRAGFETSATTYIPGATVRLECR 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DR TM++ Y  E  TDSTG+Y + V EDHEDQ CDAVL+SS QKDC+ AA GRDRARVIL
Sbjct: 61  DRKTMDIRYTKEGRTDSTGTYKIPVTEDHEDQFCDAVLVSSSQKDCAAAAPGRDRARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           T YNG+AS  R+ NAMGF + E +SGC Q+L Q Q+ ++
Sbjct: 121 TGYNGIASYNRFANAMGFMKNEAVSGCAQILKQLQEFDE 159

BLAST of CmaCh05G004850 vs. NCBI nr
Match: gi|645256564|ref|XP_008234009.1| (PREDICTED: pollen-specific protein C13 [Prunus mume])

HSP 1 Score: 216.9 bits (551), Expect = 2.6e-53
Identity = 104/159 (65.41%), Positives = 126/159 (79.25%), Query Frame = 1

Query: 1   MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK 60
           MA+  +LFAL +LPALAVA+RP+RTPF V GKV+CDTC AGFETSATTYIPGA V +EC+
Sbjct: 1   MAKLIVLFALCVLPALAVATRPMRTPFTVEGKVYCDTCRAGFETSATTYIPGATVRLECR 60

Query: 61  DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL 120
           DR TM++ Y  E  TDSTG+Y + V EDHEDQ CDAVL+SS QKDC+ AA GRDRARVIL
Sbjct: 61  DRKTMDIRYTKEGRTDSTGTYKIPVTEDHEDQFCDAVLVSSSQKDCAAAAPGRDRARVIL 120

Query: 121 TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED 160
           T YNG+AS  R+ NAMGF + E + GC Q+L Q Q+ ++
Sbjct: 121 TGYNGIASYNRFANAMGFMKNEAVPGCAQILKQLQEFDE 159

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value   Identity (%)  Description
DFC_ARATH     5.9e-29   47.02         Protein DOWNSTREAM OF FLC OS=Arabidopsis thaliana GN=DFC PE=2 SV=1
PSC13_MAIZE   2.1e-26   38.06         Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1
PHLB_PHLPR    4.0e-25   43.28         Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1
LOLB_LOLPR    2.6e-24   42.64         Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1
CHE1_CHEAL    3.1e-22   39.35         Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1

vs. TrEMBL
Match Name         E-value   Identity (%)  Description
A0A0A0LQA5_CUCSA   4.3e-71   84.91         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G309370 PE=4 SV=1
M5XNU1_PRUPE       1.1e-53   66.04         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012675mg PE=4 SV=1
Q2I307_VITPS       4.1e-53   64.15         Pollen-specific protein OS=Vitis pseudoreticulata PE=2 SV=1
A0A059BKH8_EUCGR   1.2e-52   65.41         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F00490 PE=4 SV=1
E2LMG1_MONPE       3.5e-52   64.78         Uncharacterized protein OS=Moniliophthora perniciosa (strain FA553 / isolate CP0...

vs. TAIR10
Match Name    E-value   Identity (%)  Description
AT4G08685.1   5.3e-52   59.75         Pollen Ole e 1 allergen and extensin family protein
AT1G78040.1   1.0e-31   41.72         Pollen Ole e 1 allergen and extensin family protein
AT5G10130.1   3.3e-30   47.02         Pollen Ole e 1 allergen and extensin family protein
AT1G29140.1   3.3e-22   35.71         Pollen Ole e 1 allergen and extensin family protein
AT5G45880.1   2.2e-21   35.09         Pollen Ole e 1 allergen and extensin family protein

vs. NCBI nr
Match Name                          E-value   Identity (%)  Description
gi|449465966|ref|XP_004150698.1|    6.2e-71   84.91         PREDICTED: protein DOWNSTREAM OF FLC [Cucumis sativus]
gi|659089151|ref|XP_008445353.1|    2.4e-70   84.28         PREDICTED: protein DOWNSTREAM OF FLC [Cucumis melo]
gi|1009136509|ref|XP_015885563.1|   1.5e-53   62.89         PREDICTED: pollen-specific protein C13-like [Ziziphus jujuba]
gi|596288682|ref|XP_007225977.1|    1.5e-53   66.04         hypothetical protein PRUPE_ppa012675mg [Prunus persica]
gi|645256564|ref|XP_008234009.1|    2.6e-53   65.41         PREDICTED: pollen-specific protein C13 [Prunus mume]

The following terms have been associated with this gene:
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0071840      cellular component organization or biogenesis
biological_process  GO:0009987      cellular process
biological_process  GO:0044699      single-organism process
cellular_component  GO:0005615      extracellular space
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh05G004850.1  CmaCh05G004850.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term    Source Description                    Alignment
None      No IPR available  PANTHER  PTHR31614      FAMILY NOT NAMED                      coord: 1..159; score: 1.5
None      No IPR available  PANTHER  PTHR31614:SF5  ALLERGEN-LIKE PROTEIN BRSN20-RELATED  coord: 1..159; score: 1.5
None      No IPR available  PFAM     PF01190        Pollen_Ole_e_I                        coord: 29..111; score: 6.4
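
The PFAM hit PF01190 is reported at coordinates 29..111 of the protein. The sketch below extracts that region from the protein sequence given earlier in this record. It assumes the coordinates are 1-based and inclusive, which the annotation table does not state, and `slice_region` is a hypothetical helper name.

```python
# Protein sequence of CmaCh05G004850, copied from this record.
PROTEIN = (
    "MARFAILFALILLPALAVASRPVRTPFLVRGKVFCDTCVAGFETSATTYIPGAKVGIECK"
    "DRNTMELLYKHEATTDSTGSYSLLVNEDHEDQVCDAVLISSPQKDCSLAAEGRDRARVIL"
    "TRYNGMASNERYVNAMGFARYEPMSGCNQVLSQYQDIED"
)

def slice_region(sequence, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]

domain = slice_region(PROTEIN, 29, 111)
print(len(PROTEIN), len(domain))  # 159 83
print(domain[:10], domain[-6:])   # VRGKVFCDTC CSLAAE
```

Under that assumption the annotated region covers 83 of the 159 residues, from the VRGKVFCDTC motif onwards.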