CmaCh17G010750.1 (mRNA) Cucurbita maxima (Rimu)

NameCmaCh17G010750.1
TypemRNA
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPollen Ole e 1 allergen and extensin family protein, putative
LocationCma_Chr17 : 7648367 .. 7649734 (-)
Sequence length883
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexonthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGTAGCTGGTCTCAAGTTCTTGATAATGGCTTCTCTGCGTGCAGTTTTCTTGTCACTTTTGGTCATTGTTGCTTCAGCTGGTGGCGATGACAATAATGGTGGTGGGTATTATGATCTTATGACACCCATATTGGCTAAGGAACAAAGGCTTCTCTCTACCATGATTGGCATCCAAGGAATTATCCTATACAAATTTGGCTCGACAATTTCTCCTCTTGAAGGTATCTGATTCATTGTTTGGTTTGAACGAATATAACTAAAGTAACTTAAAGTTCTAACTATTTTAAAATCACTCTCACTCTTAGAATATATGCTCTTAGTGAGATTCCACATCGATTGGAGAGGAGAACGAAACATTGCTTATAAGGGTGTGGAAACCTCTCCCAACTAGATGTGTTTTAAAACCTGGAGGGGAAGCCCAGAAGGGAATACCCAAAGAGGACAATATCTGCTAGCTGTGAACTTGGGCTGTTACAAATGGTATCAGAGCCAGATACCGGCCCCCAAGGGGGTGGATTGTGAGATCCCACATAGGTGTGAAACCTCCGCCTATTATACACATTTTAAAACCTTGAGGAAAAACCTGAAAGAGAAGTCCAAAAAGGACAATATCTATTAGCGGTGGATTTGAGCTGTCACAGACTTAGTTATCGAATGAGATGTACGTAGGAGCTTGAGTTTAATTTTGGGTATGTTTCTTATATAGGAGGTTTGGCAAGAATCACATGTAAAGCAGTGGATGAGTATGGCTATGAGGCAGCTTCTTACACATTTTTAAGTGATTCAAGTGATGCAAATGGCTACTTCTTGGCAACACTATCTCCTTCAGAGGTAGATGACAAGAGGGAGTTAAAGGAATGCAAGGCTTTTCTTGAGCTCTCACCATTAGAGAACTGTCAAGCTCCTTCTGACCTCAACAATGGAGTATCTGGTGCTCTTCTCCATTCCTATAAACTTTTGGTCCATAACAAGATGAAACTCTTCTCTGTCGGGCCTTTCCTTTTCACTTGCCAAAGTTAAAGGGGAGATGGAAGGCATGATAGTTAATGATCCCATACGTTTAAATACTTAGTGTTGAAAGTTTACTCATCTTATGATGGAATGGTGGCTAAGTTTGTAATATGAATTGGGTTTGGAGAGATATTATAACGGCTCAAGTCCACCGCTAGCAGATATTGTCCTTTAAGTTTTCCTTTTCAGGCTTTTCTTCAAAAATTTTAAAACGCGTTTTCAAGGGAGAGGTTTCTACACCCTTGTAAGAAATAATTCGTTCCTGATGGAATCTAACAATCCACCCCTTGGGGACCCAACAAGGCTTTGATATCATTCGTTCCCCCCTCCAACCGATATGTGATCTCACAAT

mRNA sequence

ATGGCCGTAGCTGGTCTCAAGTTCTTGATAATGGCTTCTCTGCGTGCAGTTTTCTTGTCACTTTTGGTCATTGTTGCTTCAGCTGGTGGCGATGACAATAATGGTGGTGGGTATTATGATCTTATGACACCCATATTGGCTAAGGAACAAAGGCTTCTCTCTACCATGATTGGCATCCAAGGAATTATCCTATACAAATTTGGCTCGACAATTTCTCCTCTTGAAGGAGGTTTGGCAAGAATCACATGTAAAGCAGTGGATGAGTATGGCTATGAGGCAGCTTCTTACACATTTTTAAGTGATTCAAGTGATGCAAATGGCTACTTCTTGGCAACACTATCTCCTTCAGAGGTAGATGACAAGAGGGAGTTAAAGGAATGCAAGGCTTTTCTTGAGCTCTCACCATTAGAGAACTGTCAAGCTCCTTCTGACCTCAACAATGGAGTATCTGGTGCTCTTCTCCATTCCTATAAACTTTTGGTCCATAACAAGATGAAACTCTTCTCTGTCGGGCCTTTCCTTTTCACTTGCCAAAGTTAAAGGGGAGATGGAAGGCATGATAGTTAATGATCCCATACGTTTAAATACTTAGTGTTGAAAGTTTACTCATCTTATGATGGAATGGTGGCTAAGTTTGTAATATGAATTGGGTTTGGAGAGATATTATAACGGCTCAAGTCCACCGCTAGCAGATATTGTCCTTTAAGTTTTCCTTTTCAGGCTTTTCTTCAAAAATTTTAAAACGCGTTTTCAAGGGAGAGGTTTCTACACCCTTGTAAGAAATAATTCGTTCCTGATGGAATCTAACAATCCACCCCTTGGGGACCCAACAAGGCTTTGATATCATTCGTTCCCCCCTCCAACCGATATGTGATCTCACAAT

Coding sequence (CDS)

ATGGCCGTAGCTGGTCTCAAGTTCTTGATAATGGCTTCTCTGCGTGCAGTTTTCTTGTCACTTTTGGTCATTGTTGCTTCAGCTGGTGGCGATGACAATAATGGTGGTGGGTATTATGATCTTATGACACCCATATTGGCTAAGGAACAAAGGCTTCTCTCTACCATGATTGGCATCCAAGGAATTATCCTATACAAATTTGGCTCGACAATTTCTCCTCTTGAAGGAGGTTTGGCAAGAATCACATGTAAAGCAGTGGATGAGTATGGCTATGAGGCAGCTTCTTACACATTTTTAAGTGATTCAAGTGATGCAAATGGCTACTTCTTGGCAACACTATCTCCTTCAGAGGTAGATGACAAGAGGGAGTTAAAGGAATGCAAGGCTTTTCTTGAGCTCTCACCATTAGAGAACTGTCAAGCTCCTTCTGACCTCAACAATGGAGTATCTGGTGCTCTTCTCCATTCCTATAAACTTTTGGTCCATAACAAGATGAAACTCTTCTCTGTCGGGCCTTTCCTTTTCACTTGCCAAAGTTAA

Protein sequence

MAVAGLKFLIMASLRAVFLSLLVIVASAGGDDNNGGGYYDLMTPILAKEQRLLSTMIGIQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
BLAST of CmaCh17G010750.1 vs. Swiss-Prot
Match: PRP3_ARATH (Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 4.3e-12
Identity = 40/120 (33.33%), Postives = 63/120 (52.50%), Query Frame = 1

Query: 59  IQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEV 118
           + GIIL K G    P+ G   +I C     YG         S+ +D+ GYF  +L+    
Sbjct: 188 VDGIILCKNGYETYPILGAKIQIVCSDPASYGKSNTEVVIYSNPTDSKGYFHLSLTSI-- 247

Query: 119 DDKRELKECKAFLELSPLENCQAPSDLNNGVSGA--LLHSYKLLVHNKMKLFSVGPFLFT 177
              ++L  C+  L LSP+E C+ P+++N G++G    L+ Y+      ++LFSVGPF +T
Sbjct: 248 ---KDLAYCRVKLYLSPVETCKNPTNVNKGLTGVPLALYGYRFYPDKNLELFSVGPFYYT 302

BLAST of CmaCh17G010750.1 vs. Swiss-Prot
Match: PRP1_ARATH (Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1)

HSP 1 Score: 70.5 bits (171), Expect = 2.1e-11
Identity = 41/121 (33.88%), Postives = 58/121 (47.93%), Query Frame = 1

Query: 59  IQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEV 118
           + GIIL K G    P++G  A+I C     Y          SD +D  GYF   L+    
Sbjct: 214 VGGIILCKNGYETYPIQGAKAKIVCSERGSYEKSKNEVVIYSDPTDFKGYFHVVLT---- 273

Query: 119 DDKRELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQ 178
              + L  C+  L  SP+E C+ P+++N G++G     Y       +KLF+VGPF FT  
Sbjct: 274 -HIKNLSNCRVKLYTSPVETCKNPTNVNKGLTGVPFSMYS---DKNLKLFNVGPFYFTAG 326

Query: 179 S 180
           S
Sbjct: 334 S 326

BLAST of CmaCh17G010750.1 vs. TrEMBL
Match: A0A0A0K5H5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G398170 PE=4 SV=1)

HSP 1 Score: 274.6 bits (701), Expect = 8.3e-71
Identity = 143/173 (82.66%), Postives = 154/173 (89.02%), Query Frame = 1

Query: 11  MASLRAVFLSLLVI---VASAGGDDNNGGGYYDLMTPILAKE-QRLLSTMIGIQGIILYK 70
           M SL AVF SLLVI   V SA GD  +GG Y D MTP LAK+ +RLLSTMIGI+GIILYK
Sbjct: 1   MGSLHAVFFSLLVISIIVGSANGDIYDGGSY-DSMTPKLAKDQERLLSTMIGIEGIILYK 60

Query: 71  FGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKE 130
           FGS+ISPL+GGLARITCK VDEYGYEAASYTFLS+SSD NGYFLATLSPSEV+DKRELKE
Sbjct: 61  FGSSISPLQGGLARITCKTVDEYGYEAASYTFLSESSDENGYFLATLSPSEVEDKRELKE 120

Query: 131 CKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS 180
           CKAFLE+SPLENCQ+PSDLNNGVSGALLHSYK LVHN MKLFSVGPFLFTCQ+
Sbjct: 121 CKAFLEVSPLENCQSPSDLNNGVSGALLHSYKFLVHNNMKLFSVGPFLFTCQT 172

BLAST of CmaCh17G010750.1 vs. TrEMBL
Match: B9GSD1_POPTR (Pollen Ole e 1 allergen and extensin family protein OS=Populus trichocarpa GN=POPTR_0002s20300g PE=4 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 5.5e-38
Identity = 87/167 (52.10%), Postives = 116/167 (69.46%), Query Frame = 1

Query: 12  ASLRAVFLSLLVIVASAGGDDNNGGGYYD--LMTPILAKEQRLLSTMIGIQGIILYKFGS 71
           +S  A F+SL ++ A A   D   G + +  L+ P L  +++ LSTMIG+QG++  + G 
Sbjct: 5   SSYFAFFMSLSMVAAIASATDGGYGSHPNPNLVKPKL-NKEKPLSTMIGVQGLVYCRSGP 64

Query: 72  TISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKECKA 131
              PLEG + RITC A D YGYEAA ++FLS+++DA GYF ATLSP E+ D  ++KECKA
Sbjct: 65  KRFPLEGAVIRITCLANDVYGYEAAPFSFLSEATDAKGYFFATLSPYEMQDNLKIKECKA 124

Query: 132 FLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFT 177
           FLELSPLE C+ P+D   G+SGALL SY  L   KMKLF+VGPF++T
Sbjct: 125 FLELSPLETCKIPTDEKQGISGALLASYHYLSDKKMKLFTVGPFVYT 170

BLAST of CmaCh17G010750.1 vs. TrEMBL
Match: B9SAV3_RICCO (Structural constituent of cell wall, putative OS=Ricinus communis GN=RCOM_1179460 PE=4 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 9.3e-38
Identity = 81/134 (60.45%), Postives = 102/134 (76.12%), Query Frame = 1

Query: 44  PILAKEQRLLSTMIGIQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSS 103
           P L KE +LLS+++GIQG+I  K G  + PLEG +ARITC   DEYG+EAA ++ LS ++
Sbjct: 36  PRLEKE-KLLSSLVGIQGLIYCKSGPKLIPLEGAIARITCLTTDEYGHEAAPWSILSGAT 95

Query: 104 DANGYFLATLSPSEVDDKRELKECKAFLELSP-LENCQAPSDLNNGVSGALLHSYKLLVH 163
           DA GYFLATLSPSEV+DK ++KECKAFLE SP LE C  P+D+N G++GA L SY  L H
Sbjct: 96  DAKGYFLATLSPSEVEDKMKIKECKAFLETSPSLETCNVPTDINKGITGAPLASYNFLTH 155

Query: 164 NKMKLFSVGPFLFT 177
             MKLF+VGPF +T
Sbjct: 156 KNMKLFTVGPFFYT 168

BLAST of CmaCh17G010750.1 vs. TrEMBL
Match: A0A0B2RFK1_GLYSO (Pistil-specific extensin-like protein OS=Glycine soja GN=glysoja_037705 PE=4 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 3.5e-37
Identity = 75/129 (58.14%), Postives = 101/129 (78.29%), Query Frame = 1

Query: 48  KEQRLLSTMIGIQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANG 107
           +E++LLS  IGIQGI+  K  S ++PLEG L RI+C+AVDEYG+E   ++FLS+++D+ G
Sbjct: 32  EEEKLLSKTIGIQGIVYCKSASKLTPLEGALTRISCEAVDEYGFETTPFSFLSEATDSKG 91

Query: 108 YFLATLSPSEVDDKRELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKL 167
           YFLATLSP EV+ K  LKEC+AFL+ SPL NC  P+D+N G+SGA+L  ++ L   KMKL
Sbjct: 92  YFLATLSPQEVEGKGVLKECRAFLDASPLNNCSYPTDVNKGISGAVLRFHRFLHDKKMKL 151

Query: 168 FSVGPFLFT 177
           ++VGPFLFT
Sbjct: 152 YTVGPFLFT 160

BLAST of CmaCh17G010750.1 vs. TrEMBL
Match: A0A061DR56_THECC (Pollen Ole e 1 allergen and extensin family protein, putative OS=Theobroma cacao GN=TCM_004708 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 1.0e-36
Identity = 83/162 (51.23%), Postives = 111/162 (68.52%), Query Frame = 1

Query: 17  VFLSLLVIVASAGGDDNNGGGYYDLMTPILAKEQRLLSTMIGIQGIILYKFGSTISPLEG 76
           +F+  L++  +A   D       +L  P + K ++LLSTMIGIQG++  + GS   PLEG
Sbjct: 66  LFMLPLLLPTAALDSDGEYKPNPNLQKPYVEK-EKLLSTMIGIQGLVYCRSGSQFIPLEG 125

Query: 77  GLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKECKAFLELSPL 136
            +ARITC+ VD+YGYE  S++ LS ++DA GYF+AT+SP EV D R L+ECKAFLELSP 
Sbjct: 126 AVARITCQGVDKYGYETESFSILSCATDAKGYFIATVSPYEVKDSRRLRECKAFLELSPS 185

Query: 137 ENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQ 179
           + C  P+D+N G++GA L SY LL    MKLF+VGPF F  Q
Sbjct: 186 DACDVPTDVNQGITGAPLASYHLLHDKNMKLFTVGPFFFIPQ 226

BLAST of CmaCh17G010750.1 vs. TAIR10
Match: AT2G47540.1 (AT2G47540.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 153.7 bits (387), Expect = 1.1e-37
Identity = 74/134 (55.22%), Postives = 103/134 (76.87%), Query Frame = 1

Query: 49  EQRLLSTMIGIQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGY 108
           E  LLS+MIG+QG+I  K GS ++P++G +AR+TC+  DEYGYEA   T LS ++DA GY
Sbjct: 29  EGELLSSMIGVQGLIYCKRGSKLTPIQGAVARVTCERTDEYGYEAEDVTVLSQATDAKGY 88

Query: 109 FLATLSPSEVDDKR---ELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHN-K 168
           FLATLS SEV D +   ++KEC+AFLELSP + C  P+++N G+SGA+L +Y+LL +  K
Sbjct: 89  FLATLSSSEVKDYKKVIKIKECRAFLELSPSDTCSFPTEINRGISGAILQNYRLLENKLK 148

Query: 169 MKLFSVGPFLFTCQ 179
           MKLF+VGPF+F+ +
Sbjct: 149 MKLFTVGPFVFSSE 162

BLAST of CmaCh17G010750.1 vs. TAIR10
Match: AT4G02270.1 (AT4G02270.1 root hair specific 13)

HSP 1 Score: 107.8 bits (268), Expect = 6.8e-24
Identity = 51/122 (41.80%), Postives = 73/122 (59.84%), Query Frame = 1

Query: 57  IGIQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPS 116
           I ++GII  K G    P++G  ARI C  VD YG E    + LS  +DA GYF+AT+ PS
Sbjct: 40  IAVEGIIKCKSGGKTYPIQGATARIACVKVDAYGNELVPISILSSKTDAKGYFIATIFPS 99

Query: 117 EVDDKRELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFT 176
           ++   R + +CK +L  SPL +C  P+D+N GV G  L +Y++L     KL+  GPF +T
Sbjct: 100 QLRAGRTVTKCKTYLYKSPLADCDFPTDVNKGVRGQPLSTYRILQDKSFKLYWAGPFFYT 159

Query: 177 CQ 179
            +
Sbjct: 160 SE 161

BLAST of CmaCh17G010750.1 vs. TAIR10
Match: AT2G47530.1 (AT2G47530.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 78.2 bits (191), Expect = 5.8e-15
Identity = 53/179 (29.61%), Postives = 85/179 (47.49%), Query Frame = 1

Query: 12  ASLRAVFLSLLVIVASAGGDDNNGGGYYDLMTPILAKEQRLLSTM------------IGI 71
           A+   + L+++V+VA+A         YY    P + K     ++             I I
Sbjct: 6   AATNLLLLAMVVVVATAD--------YYAQPQPYVPKPTTTYTSPVKTPYLPKSNPDIAI 65

Query: 72  QGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVD 131
           +G IL K G    P++GG  ++ C  VD YG   A  T  S  +D  GYF   ++     
Sbjct: 66  EGFILCKSGYKTYPIQGGKVKVVCPVVDSYGKLVAKVTISSYPTDLKGYFYF-ITYGLSH 125

Query: 132 DKRELKECKAFLELSPLENCQAPSDLNNGVSGALL--HSYKLLVHNKMKLFSVGPFLFT 177
               +  CK  LE SP+  C+ P+++N GV+GA L   + K L H+ + L+++ PF F+
Sbjct: 126 KVNNISSCKVKLESSPVFTCKTPTNVNKGVTGAPLSPDNSKFLSHDNLTLYTLEPFYFS 175

BLAST of CmaCh17G010750.1 vs. TAIR10
Match: AT3G62680.1 (AT3G62680.1 proline-rich protein 3)

HSP 1 Score: 72.8 bits (177), Expect = 2.4e-13
Identity = 40/120 (33.33%), Postives = 63/120 (52.50%), Query Frame = 1

Query: 59  IQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEV 118
           + GIIL K G    P+ G   +I C     YG         S+ +D+ GYF  +L+    
Sbjct: 188 VDGIILCKNGYETYPILGAKIQIVCSDPASYGKSNTEVVIYSNPTDSKGYFHLSLTSI-- 247

Query: 119 DDKRELKECKAFLELSPLENCQAPSDLNNGVSGA--LLHSYKLLVHNKMKLFSVGPFLFT 177
              ++L  C+  L LSP+E C+ P+++N G++G    L+ Y+      ++LFSVGPF +T
Sbjct: 248 ---KDLAYCRVKLYLSPVETCKNPTNVNKGLTGVPLALYGYRFYPDKNLELFSVGPFYYT 302

BLAST of CmaCh17G010750.1 vs. TAIR10
Match: AT1G54970.1 (AT1G54970.1 proline-rich protein 1)

HSP 1 Score: 70.5 bits (171), Expect = 1.2e-12
Identity = 41/121 (33.88%), Postives = 58/121 (47.93%), Query Frame = 1

Query: 59  IQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEV 118
           + GIIL K G    P++G  A+I C     Y          SD +D  GYF   L+    
Sbjct: 214 VGGIILCKNGYETYPIQGAKAKIVCSERGSYEKSKNEVVIYSDPTDFKGYFHVVLT---- 273

Query: 119 DDKRELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQ 178
              + L  C+  L  SP+E C+ P+++N G++G     Y       +KLF+VGPF FT  
Sbjct: 274 -HIKNLSNCRVKLYTSPVETCKNPTNVNKGLTGVPFSMYS---DKNLKLFNVGPFYFTAG 326

Query: 179 S 180
           S
Sbjct: 334 S 326

BLAST of CmaCh17G010750.1 vs. NCBI nr
Match: gi|778728091|ref|XP_011659366.1| (PREDICTED: proline-rich protein 3-like [Cucumis sativus])

HSP 1 Score: 274.6 bits (701), Expect = 1.2e-70
Identity = 143/173 (82.66%), Postives = 154/173 (89.02%), Query Frame = 1

Query: 11  MASLRAVFLSLLVI---VASAGGDDNNGGGYYDLMTPILAKE-QRLLSTMIGIQGIILYK 70
           M SL AVF SLLVI   V SA GD  +GG Y D MTP LAK+ +RLLSTMIGI+GIILYK
Sbjct: 1   MGSLHAVFFSLLVISIIVGSANGDIYDGGSY-DSMTPKLAKDQERLLSTMIGIEGIILYK 60

Query: 71  FGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKE 130
           FGS+ISPL+GGLARITCK VDEYGYEAASYTFLS+SSD NGYFLATLSPSEV+DKRELKE
Sbjct: 61  FGSSISPLQGGLARITCKTVDEYGYEAASYTFLSESSDENGYFLATLSPSEVEDKRELKE 120

Query: 131 CKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS 180
           CKAFLE+SPLENCQ+PSDLNNGVSGALLHSYK LVHN MKLFSVGPFLFTCQ+
Sbjct: 121 CKAFLEVSPLENCQSPSDLNNGVSGALLHSYKFLVHNNMKLFSVGPFLFTCQT 172

BLAST of CmaCh17G010750.1 vs. NCBI nr
Match: gi|659101333|ref|XP_008451553.1| (PREDICTED: proline-rich protein 3-like [Cucumis melo])

HSP 1 Score: 266.5 bits (680), Expect = 3.3e-68
Identity = 137/172 (79.65%), Postives = 149/172 (86.63%), Query Frame = 1

Query: 11  MASLRAVFLSLLVI--VASAGGDDNNGGGYYDLMTPILAKE-QRLLSTMIGIQGIILYKF 70
           M SL AV LSLLVI  +  +   D   G  YD MT  LAK+ +RLLSTMIGI+GIILYKF
Sbjct: 1   MGSLHAVSLSLLVISIIVGSANSDIYDGASYDFMTSKLAKDQERLLSTMIGIEGIILYKF 60

Query: 71  GSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKEC 130
           GS+ISPL+GGLARITCK VDEYGYEAASYTFLS+SSD NGYFLATLSPSEV+DKRELKEC
Sbjct: 61  GSSISPLQGGLARITCKTVDEYGYEAASYTFLSESSDENGYFLATLSPSEVEDKRELKEC 120

Query: 131 KAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS 180
           KAFLE+SPLENCQ+PSDLNNGVSGALLHSYK LVHN MKLFSVGPFLFTCQ+
Sbjct: 121 KAFLEVSPLENCQSPSDLNNGVSGALLHSYKFLVHNNMKLFSVGPFLFTCQT 172

BLAST of CmaCh17G010750.1 vs. NCBI nr
Match: gi|802689054|ref|XP_012082820.1| (PREDICTED: proline-rich protein 3-like [Jatropha curcas])

HSP 1 Score: 169.9 bits (429), Expect = 4.2e-39
Identity = 88/158 (55.70%), Postives = 112/158 (70.89%), Query Frame = 1

Query: 20  SLLVIVASAGGDDNNGGGYYDLMTPILAKEQRLLSTMIGIQGIILYKFGSTISPLEGGLA 79
           S L+I+ASA  DD  G    +L  P    E+ LLST++GIQG+I  K G+ + PL+G +A
Sbjct: 16  SALLIIASASDDDGYGLKNINLQKP-KPDEEYLLSTLVGIQGLIFCKSGAKLIPLQGAVA 75

Query: 80  RITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKECKAFLELSPL-EN 139
           RITC AVDEYGYE A  + LS ++D  GYFLATLSPSEV+   ++ ECKAFLELSPL + 
Sbjct: 76  RITCLAVDEYGYETAPLSILSGATDVKGYFLATLSPSEVEKNLKITECKAFLELSPLTKT 135

Query: 140 CQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFT 177
           C  PSD+N G++GALL SY+ L    MKLF+VGPF +T
Sbjct: 136 CDVPSDVNKGIAGALLSSYEFLHDKNMKLFTVGPFFYT 172

BLAST of CmaCh17G010750.1 vs. NCBI nr
Match: gi|224068374|ref|XP_002302729.1| (pollen Ole e 1 allergen and extensin family protein [Populus trichocarpa])

HSP 1 Score: 165.6 bits (418), Expect = 7.8e-38
Identity = 87/167 (52.10%), Postives = 116/167 (69.46%), Query Frame = 1

Query: 12  ASLRAVFLSLLVIVASAGGDDNNGGGYYD--LMTPILAKEQRLLSTMIGIQGIILYKFGS 71
           +S  A F+SL ++ A A   D   G + +  L+ P L  +++ LSTMIG+QG++  + G 
Sbjct: 5   SSYFAFFMSLSMVAAIASATDGGYGSHPNPNLVKPKL-NKEKPLSTMIGVQGLVYCRSGP 64

Query: 72  TISPLEGGLARITCKAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVDDKRELKECKA 131
              PLEG + RITC A D YGYEAA ++FLS+++DA GYF ATLSP E+ D  ++KECKA
Sbjct: 65  KRFPLEGAVIRITCLANDVYGYEAAPFSFLSEATDAKGYFFATLSPYEMQDNLKIKECKA 124

Query: 132 FLELSPLENCQAPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFT 177
           FLELSPLE C+ P+D   G+SGALL SY  L   KMKLF+VGPF++T
Sbjct: 125 FLELSPLETCKIPTDEKQGISGALLASYHYLSDKKMKLFTVGPFVYT 170

BLAST of CmaCh17G010750.1 vs. NCBI nr
Match: gi|764535911|ref|XP_011458743.1| (PREDICTED: proline-rich protein 1-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 165.6 bits (418), Expect = 7.8e-38
Identity = 82/137 (59.85%), Postives = 108/137 (78.83%), Query Frame = 1

Query: 44  PILAKEQRLLSTMIGIQGIILYKFGSTISPLEGGLARITCKAVDEYGYEAASYTFLSDSS 103
           P L KE  LLST++GIQG++  K G  ++PLEG +ARITC+AVDEYG++AA  T LSD++
Sbjct: 32  PKLEKEN-LLSTIMGIQGLVYCKSGPKVTPLEGSVARITCEAVDEYGFQAAPITILSDAT 91

Query: 104 DANGYFLATLSPSEVDD-KRELKECKAFLELSPLENCQAPSDLNNGVSGALLHSYKLLVH 163
           D  GYFLATLSP E+ + K++L +CKAFLELSPL++C   +D NNG+SGALL +Y+LL  
Sbjct: 92  DERGYFLATLSPFEIQNKKKKLTQCKAFLELSPLDSCNVLTDANNGISGALLTAYQLLHE 151

Query: 164 NKMKLFSVGPFLFTCQS 180
             MKLF+VGPF+FT +S
Sbjct: 152 KNMKLFTVGPFVFTSES 167

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PRP3_ARATH4.3e-1233.33Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1[more]
PRP1_ARATH2.1e-1133.88Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0K5H5_CUCSA8.3e-7182.66Uncharacterized protein OS=Cucumis sativus GN=Csa_7G398170 PE=4 SV=1[more]
B9GSD1_POPTR5.5e-3852.10Pollen Ole e 1 allergen and extensin family protein OS=Populus trichocarpa GN=PO... [more]
B9SAV3_RICCO9.3e-3860.45Structural constituent of cell wall, putative OS=Ricinus communis GN=RCOM_117946... [more]
A0A0B2RFK1_GLYSO3.5e-3758.14Pistil-specific extensin-like protein OS=Glycine soja GN=glysoja_037705 PE=4 SV=... [more]
A0A061DR56_THECC1.0e-3651.23Pollen Ole e 1 allergen and extensin family protein, putative OS=Theobroma cacao... [more]
Match NameE-valueIdentityDescription
AT2G47540.11.1e-3755.22 Pollen Ole e 1 allergen and extensin family protein[more]
AT4G02270.16.8e-2441.80 root hair specific 13[more]
AT2G47530.15.8e-1529.61 Pollen Ole e 1 allergen and extensin family protein[more]
AT3G62680.12.4e-1333.33 proline-rich protein 3[more]
AT1G54970.11.2e-1233.88 proline-rich protein 1[more]
Match NameE-valueIdentityDescription
gi|778728091|ref|XP_011659366.1|1.2e-7082.66PREDICTED: proline-rich protein 3-like [Cucumis sativus][more]
gi|659101333|ref|XP_008451553.1|3.3e-6879.65PREDICTED: proline-rich protein 3-like [Cucumis melo][more]
gi|802689054|ref|XP_012082820.1|4.2e-3955.70PREDICTED: proline-rich protein 3-like [Jatropha curcas][more]
gi|224068374|ref|XP_002302729.1|7.8e-3852.10pollen Ole e 1 allergen and extensin family protein [Populus trichocarpa][more]
gi|764535911|ref|XP_011458743.1|7.8e-3859.85PREDICTED: proline-rich protein 1-like [Fragaria vesca subsp. vesca][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmaCh17G010750CmaCh17G010750gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmaCh17G010750.1CmaCh17G010750.1-proteinpolypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh17G010750.1.three_prime_UTR.1CmaCh17G010750.1.three_prime_UTR.1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh17G010750.1.CDS.2CmaCh17G010750.1.CDS.2CDS
CmaCh17G010750.1.CDS.1CmaCh17G010750.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh17G010750.1.exon.2CmaCh17G010750.1.exon.2exon
CmaCh17G010750.1.exon.1CmaCh17G010750.1.exon.1exon


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33470FAMILY NOT NAMEDcoord: 48..176
score: 3.5
NoneNo IPR availablePANTHERPTHR33470:SF1POLLEN OLE E 1 ALLERGEN AND EXTENSIN FAMILY PROTEINcoord: 48..176
score: 3.5
NoneNo IPR availablePFAMPF01190Pollen_Ole_e_Icoord: 71..151
score: 3.4