CmoCh11G012590.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh11G012590.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
Description(Expressed protein) (Late embryogenesis abundant hydroxyproline-rich glycoprotein)
LocationCmo_Chr11 : 8041694 .. 8043957 (+)
Sequence length1056
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGTTTGAAAACCGAACCGAAATGTCATCGTCCTCTTCAACTTCTAGGCCCCTCCCGAACGGCACTCTCTCAACCTCCACCCACCGCCACCACCCGCCCTACAGCCCCCGATCCTCCTCCTTCTCCTCCGCCTCCTTCAAGGGCTGCTGCTGCTGCCTCTTCCTCCTCCTCTCCTTCCTCGCCCTCCTCGTCCTCGCCGTCGTCCTCGTCGTCGTCCTCGCCCTCAAGCCCAAGAAGCCGCAGTTTGATCTCCAGCAGGTCGGCGTCCAGTACATGGGCATTACCACTCCCAACACTCCCACTACTCCTATGGCTTCCCTTTCCCTCAACATTCGAATGGTTTTCACTGCCGTTAACCCTAACAAGGTCGGGATCAAGTATAGCGAGTCCCGTTTCACTGTTATGTACCGAGGAATCCCGCTCGGTCGAGCCTCTGTTCCTGGATTTGTTCAAGAACCACATAGCCAGCGCCAGGTCGATACCACCGTCGCCGTCGATCGCGTCAATCTCCTCCAAGCCGACGCTGCTGATTTGATCCGCGACGCCTCCTTGAACGACCGTGTTGAGCTCAGAATACTCGGCGATGTCGCCGCTAAGATCCGCCTCTTATCCTTCAATTCCCCCGGCGTTCAGGTACTATACTATACCATACTATACTTACCTTTTTCTTTCTAGGAGATTCAGATTTAGGGACATGGATGTAATTACCTGATTTCTGGTATACCCTTCTGGTCTTCCGCATTTTAATGCTCTGCCATCCCCTGCCATCTGCCCCCTCTGTTCCTCTTGAATTTATTTCTCTTGTGTTATAAATTGGGCGAGTTTGAACCAGAAATGCTGTCAATGTTTTTCGGACTTTTTAATGTTTCTTTTCCTTAATTCTGAAGAACACATTGTGCATAAGTAGAAAAAGCTTTGAGTTCTTAATAACTTTGATACACATGAACACAGAAAAGAGTAGAAAACTGGTCTGCTTTGTTTTTGGAAATCAAACAATAAGATTAGTAGAAGAAGAGATTGAAGAACAAAGAATTTGGCAAGAACTCTTTTATGTTGTTGACTGGATTAGGAAGGATCTGAACTAAACCCCCCATTTTTTAGCAATTGTTTTCGTTAATTGTTTGTTCATAATTGTTTTCATGTAATATTTTAGTCAAGAAATTGAGCTGTGGAGTCTAAAAGGAGAGATGGAAAGCGACAAGAGAAAGTAGCATCGTTTTCCTTCATATTTGCTTTACTTTCACGCCCATTTTGTAATCTTCCCAATAGCTTTCGCCATAGCTTTTGCTGCAACTCCTTAACGTTATTAGATCAGCAAATCATAGCTTTATCCAATGATAGAACCAATTAGATACTCCGATGCATTCGTCAATCCAAACATAGCTATGTATCGATTAGATCTCGGATGAATTCGAGAGGACCTTCGGTGGTAGCGGTGTTTTTAAGTGAAACGGAGGTTCAAACCATACCTTTGAAGTTATACCCACAAGTGCTATTTCGAGTCAGGGTTTGCCTTCGTTATGTTGTTGCTAGCAAATACCACTATGTTTTGTTTCGATATTCTCCGTGCCCTTTCTTTTCTGATCCTTCGATAAAAAGATTCATACTCGGGTGGACAATTGGATCTCGTCTCTCTGGGAGAAGTAATGGTGACAGCTAATATTAATATCAAAAAGTTGCATTATTTAGGAGATGAAAGGAACTTTTGGATCATTCCTTGCCCCATGTGACTTTTGGATGCATCTCTTCCCATTTTCTATTCTTTTGTTTATAATATTATTCTTTCACTTTCAATTATTTTTACAATGCTTTCCTTTTCTTCTCCTTCTGGGATTGTTCAGGTTTCAGTGGATTGTGCAATTGTGATCAGTCCAAGGAAGCAGTCTCTTACGTACAAGCAATGTGGTTTTGATGGCTTAAGTGTCTGACTCGGGTCGGTTCGTGTTTCAATTATTCCATTTTGGTCCCTATACGATTTACCAATTCAGTTCTCATCCACAACCACCAGTCAAGATAGAAACGGTTGTGGAATTAGGGATTGAATTGAAATGATGAAGATTTTTTGGAAGGACATAATTAACGAGAGGATAGCTACTCGGGAAACGATTTGTAGAGAGAGCGATTACTCGAGAGAGAGAGAGAGAGAGAGAGACTTAGACGAAGAAAATGTAATATTATGAACATTTTTTCACGAGAATATTACGAGAGGCGGTACTCGAGCTTTGAGATGGCAACTTGAAATATAAAATTGATGGGATTTTGGGA

mRNA sequence

GGTTTGAAAACCGAACCGAAATGTCATCGTCCTCTTCAACTTCTAGGCCCCTCCCGAACGGCACTCTCTCAACCTCCACCCACCGCCACCACCCGCCCTACAGCCCCCGATCCTCCTCCTTCTCCTCCGCCTCCTTCAAGGGCTGCTGCTGCTGCCTCTTCCTCCTCCTCTCCTTCCTCGCCCTCCTCGTCCTCGCCGTCGTCCTCGTCGTCGTCCTCGCCCTCAAGCCCAAGAAGCCGCAGTTTGATCTCCAGCAGGTCGGCGTCCAGTACATGGGCATTACCACTCCCAACACTCCCACTACTCCTATGGCTTCCCTTTCCCTCAACATTCGAATGGTTTTCACTGCCGTTAACCCTAACAAGGTCGGGATCAAGTATAGCGAGTCCCGTTTCACTGTTATGTACCGAGGAATCCCGCTCGGTCGAGCCTCTGTTCCTGGATTTGTTCAAGAACCACATAGCCAGCGCCAGGTCGATACCACCGTCGCCGTCGATCGCGTCAATCTCCTCCAAGCCGACGCTGCTGATTTGATCCGCGACGCCTCCTTGAACGACCGTGTTGAGCTCAGAATACTCGGCGATGTCGCCGCTAAGATCCGCCTCTTATCCTTCAATTCCCCCGGCGTTCAGGTTTCAGTGGATTGTGCAATTGTGATCAGTCCAAGGAAGCAGTCTCTTACGTACAAGCAATGTGGTTTTGATGGCTTAAGTGTCTGACTCGGGTCGGTTCGTGTTTCAATTATTCCATTTTGGTCCCTATACGATTTACCAATTCAGTTCTCATCCACAACCACCAGTCAAGATAGAAACGGTTGTGGAATTAGGGATTGAATTGAAATGATGAAGATTTTTTGGAAGGACATAATTAACGAGAGGATAGCTACTCGGGAAACGATTTGTAGAGAGAGCGATTACTCGAGAGAGAGAGAGAGAGAGAGAGACTTAGACGAAGAAAATGTAATATTATGAACATTTTTTCACGAGAATATTACGAGAGGCGGTACTCGAGCTTTGAGATGGCAACTTGAAATATAAAATTGATGGGATTTTGGGA

Coding sequence (CDS)

ATGTCATCGTCCTCTTCAACTTCTAGGCCCCTCCCGAACGGCACTCTCTCAACCTCCACCCACCGCCACCACCCGCCCTACAGCCCCCGATCCTCCTCCTTCTCCTCCGCCTCCTTCAAGGGCTGCTGCTGCTGCCTCTTCCTCCTCCTCTCCTTCCTCGCCCTCCTCGTCCTCGCCGTCGTCCTCGTCGTCGTCCTCGCCCTCAAGCCCAAGAAGCCGCAGTTTGATCTCCAGCAGGTCGGCGTCCAGTACATGGGCATTACCACTCCCAACACTCCCACTACTCCTATGGCTTCCCTTTCCCTCAACATTCGAATGGTTTTCACTGCCGTTAACCCTAACAAGGTCGGGATCAAGTATAGCGAGTCCCGTTTCACTGTTATGTACCGAGGAATCCCGCTCGGTCGAGCCTCTGTTCCTGGATTTGTTCAAGAACCACATAGCCAGCGCCAGGTCGATACCACCGTCGCCGTCGATCGCGTCAATCTCCTCCAAGCCGACGCTGCTGATTTGATCCGCGACGCCTCCTTGAACGACCGTGTTGAGCTCAGAATACTCGGCGATGTCGCCGCTAAGATCCGCCTCTTATCCTTCAATTCCCCCGGCGTTCAGGTTTCAGTGGATTGTGCAATTGTGATCAGTCCAAGGAAGCAGTCTCTTACGTACAAGCAATGTGGTTTTGATGGCTTAAGTGTCTGA
BLAST of CmoCh11G012590.1 vs. TrEMBL
Match: A0A0A0K5H7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G223380 PE=4 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 2.0e-96
Identity = 197/228 (86.40%), Postives = 207/228 (90.79%), Query Frame = 1

Query: 5   SSTSRPLPNGTLSTSTHRHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVLVV 64
           SS+SR +PNG         HP Y+PRSSS SSA+FKGCCCCLFLL SFLALLVLA+VLVV
Sbjct: 2   SSSSRTVPNGV--------HPRYNPRSSS-SSATFKGCCCCLFLLFSFLALLVLAIVLVV 61

Query: 65  VLALKPKKPQFDLQQVGVQYMGITTPNTPTTPMASLSLNIRMVFTAVNPNKVGIKYSESR 124
           VLALKPKKPQFDLQQV VQY+GIT PN PTT  ASLSLNIRM+FTAVNPNKVGIKY ESR
Sbjct: 62  VLALKPKKPQFDLQQVKVQYVGITNPN-PTT--ASLSLNIRMIFTAVNPNKVGIKYEESR 121

Query: 125 FTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRVELR 184
           FTVMYRGIPLGRASVPGF Q+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELR
Sbjct: 122 FTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELR 181

Query: 185 ILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           ILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGL+V
Sbjct: 182 ILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLNV 217

BLAST of CmoCh11G012590.1 vs. TrEMBL
Match: A0A061DZ99_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 OS=Theobroma cacao GN=TCM_006430 PE=4 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 9.7e-88
Identity = 179/224 (79.91%), Postives = 193/224 (86.16%), Query Frame = 1

Query: 22  RHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVLVVVLALKPKKPQFDLQQVG 81
           RHHP Y PRSSS SSASFKGCCCCLFLL SFLALLVLAVVL++VLA+KPKKPQFDLQQVG
Sbjct: 39  RHHP-YYPRSSS-SSASFKGCCCCLFLLFSFLALLVLAVVLIIVLAVKPKKPQFDLQQVG 98

Query: 82  VQYMGITTPN-------------TPTTPMASLSLNIRMVFTAVNPNKVGIKYSESRFTVM 141
           VQYMGI+T N             TPTT  ASLSL I M+FTAVNPNKVGIKY ESRFTVM
Sbjct: 99  VQYMGISTSNPSAFDGAAAAVTTTPTT--ASLSLTIHMLFTAVNPNKVGIKYGESRFTVM 158

Query: 142 YRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRVELRILGD 201
           YRGIPLG+A+VPGF QE HS R V+ T+AVDR NL+QADAADLIRDASLNDRVELR+LGD
Sbjct: 159 YRGIPLGKAAVPGFFQEAHSTRNVEATIAVDRANLMQADAADLIRDASLNDRVELRVLGD 218

Query: 202 VAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           V AKIR+L F+SPGVQVS+DCAIVISPRKQSLTYKQCGFDGLSV
Sbjct: 219 VGAKIRVLDFDSPGVQVSIDCAIVISPRKQSLTYKQCGFDGLSV 258

BLAST of CmoCh11G012590.1 vs. TrEMBL
Match: F6H1R4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0055g00370 PE=4 SV=1)

HSP 1 Score: 326.2 bits (835), Expect = 3.1e-86
Identity = 182/239 (76.15%), Postives = 194/239 (81.17%), Query Frame = 1

Query: 9   RPLPNGTLSTSTHRHHPP------YSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVL 68
           RP PN       H HHP       Y   S S SSASFKGCCCCLFLL SFLALLVLAVVL
Sbjct: 14  RPPPN-------HHHHPHSQHHSHYQSPSYSPSSASFKGCCCCLFLLFSFLALLVLAVVL 73

Query: 69  VVVLALKPKKPQFDLQQVGVQYMGIT---------TPNTPTTPMASLSLNIRMVFTAVNP 128
           ++VLA+KPKKPQFDLQQVGVQYMGIT         +P TPT+  ASLSLNI+M+FTAVNP
Sbjct: 74  IIVLAVKPKKPQFDLQQVGVQYMGITANPSSTVAGSPPTPTS--ASLSLNIKMLFTAVNP 133

Query: 129 NKVGIKYSESRFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIR 188
           NKVGIKY ESRFTVMYRGIPLG+  VPGF Q  HS RQV+TTVAVDR NLLQADAADLI+
Sbjct: 134 NKVGIKYGESRFTVMYRGIPLGKGVVPGFYQPAHSVRQVETTVAVDRANLLQADAADLIK 193

Query: 189 DASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           DASLNDRVELRILG+V AKIR+L F SPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV
Sbjct: 194 DASLNDRVELRILGEVGAKIRVLDFTSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 243

BLAST of CmoCh11G012590.1 vs. TrEMBL
Match: M5Y567_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010161mg PE=4 SV=1)

HSP 1 Score: 325.5 bits (833), Expect = 5.3e-86
Identity = 178/236 (75.42%), Postives = 198/236 (83.90%), Query Frame = 1

Query: 10  PLPNGTLSTSTHRHHPPYSPR--SSSFSSASFKGCCCCLFLLLSFLALLVLAVVLVVVLA 69
           P P  + S   + +H PY P   SSS SSASFKGCCCCLFLL SFLALLVLAVVLV++LA
Sbjct: 26  PRPPPSSSNPHNSNHHPYYPTTSSSSSSSASFKGCCCCLFLLFSFLALLVLAVVLVIILA 85

Query: 70  LKPKKPQFDLQQVGVQYMGITTPN-TPTTPM----------ASLSLNIRMVFTAVNPNKV 129
           +KPKKPQFDLQQVGVQYMGI +PN TP              ASLSL+IRM+F+AVNPNKV
Sbjct: 86  VKPKKPQFDLQQVGVQYMGINSPNPTPAAAATADPNQNPTSASLSLSIRMLFSAVNPNKV 145

Query: 130 GIKYSESRFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDAS 189
           GI+Y ESRFTVMYRGIPLG+ASVPGF Q+ H+ RQV  T++VDRVNLLQADAADLIRDAS
Sbjct: 146 GIRYGESRFTVMYRGIPLGKASVPGFFQDAHTVRQVVATISVDRVNLLQADAADLIRDAS 205

Query: 190 LNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           LNDRVELR+LGDV AKIR+L+F+SPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV
Sbjct: 206 LNDRVELRVLGDVGAKIRVLNFDSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 261

BLAST of CmoCh11G012590.1 vs. TrEMBL
Match: M1CX10_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400029777 PE=4 SV=1)

HSP 1 Score: 322.0 bits (824), Expect = 5.9e-85
Identity = 174/229 (75.98%), Postives = 190/229 (82.97%), Query Frame = 1

Query: 7   TSRPLPNGTLSTSTHRHHPPYSPR--SSSFSSASFKGCCCCLFLLLSFLALLVLAVVLVV 66
           TSR   NG   T    HHP Y P   SSS S ASFKGCCCCLFLL SFL LL+LAV+LV+
Sbjct: 2   TSRLTQNGINGT---HHHPQYYPHPTSSSSSKASFKGCCCCLFLLFSFLLLLILAVILVI 61

Query: 67  VLALKPKKPQFDLQQVGVQYMGIT-TPNTPTTPMASLSLNIRMVFTAVNPNKVGIKYSES 126
           VLA+KPKKPQFDLQQVGVQY+GIT  P T  T  AS+SLNIRMVFTA N NKVGIKY ES
Sbjct: 62  VLAVKPKKPQFDLQQVGVQYVGITPNPATIATSSASVSLNIRMVFTAFNDNKVGIKYGES 121

Query: 127 RFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRVEL 186
           RFT+MYRGIPLGR SVP F Q  HS ++V+TT+ VDRVNLLQADAADLIRDA+LNDRVEL
Sbjct: 122 RFTIMYRGIPLGRGSVPAFYQPAHSVKRVETTIVVDRVNLLQADAADLIRDAALNDRVEL 181

Query: 187 RILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           R+LGDV AKIR+L F SPGV+VSVDCAIVISPRKQ+LTYKQCGFDGLSV
Sbjct: 182 RVLGDVGAKIRILGFTSPGVEVSVDCAIVISPRKQALTYKQCGFDGLSV 227

BLAST of CmoCh11G012590.1 vs. TAIR10
Match: AT2G01080.1 (AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 289.7 bits (740), Expect = 1.6e-78
Identity = 158/233 (67.81%), Postives = 187/233 (80.26%), Query Frame = 1

Query: 1   MSSSSSTSRPLPNGTLSTSTHRHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAV 60
           M    S+SR   NG    + ++   PY    SS SSAS KGCCCCLFLL +FLALLVLAV
Sbjct: 1   MPPPPSSSRAGLNGDPIAAQNQQ--PYYRSYSSSSSASLKGCCCCLFLLFAFLALLVLAV 60

Query: 61  VLVVVLALKPKKPQFDLQQVGVQYMGITTPNTPTTPM-ASLSLNIRMVFTAVNPNKVGIK 120
           VL+V+LA+KPKKPQFDLQQV V YMGI+ P+    P  ASLSL IRM+FTAVNPNKVGI+
Sbjct: 61  VLIVILAVKPKKPQFDLQQVAVVYMGISNPSAVLDPTTASLSLTIRMLFTAVNPNKVGIR 120

Query: 121 YSESRFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLND 180
           Y ES FTVMY+G+PLGRA+VPGF Q+ HS + V+ T++VDRVNL+QA AADL+RDASLND
Sbjct: 121 YGESSFTVMYKGMPLGRATVPGFYQDAHSTKNVEATISVDRVNLMQAHAADLVRDASLND 180

Query: 181 RVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           RVEL + GDV AKIR+++F+SPGVQVSV+C I ISPRKQ+L YKQCGFDGLSV
Sbjct: 181 RVELTVRGDVGAKIRVMNFDSPGVQVSVNCGIGISPRKQALIYKQCGFDGLSV 231

BLAST of CmoCh11G012590.1 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 50.4 bits (119), Expect = 1.7e-06
Identity = 48/216 (22.22%), Postives = 88/216 (40.74%), Query Frame = 1

Query: 25  PPYSPRSSSFSSASF-----------KGCCCCLFLLLSFLALLVLAVVLVVVLALKPKKP 84
           PP  P +SS  + S            + C  C+   +  + L+ + +V++     KPK+P
Sbjct: 23  PPPKPNASSMETQSANTGTAKKLRRKRNCKICICFTILLILLIAIVIVILAFTLFKPKRP 82

Query: 85  QFDLQQVGVQYMGITTPNTPTTPMASLSLNIRMVFTAVNPNKVGIKYSESRFTVMYRGIP 144
              +  V V  +  +        + +L+LN+ +  +  NPN++G  Y  S   + YRG  
Sbjct: 83  TTTIDSVTVDRLQASVNPLLLKVLLNLTLNVDL--SLKNPNRIGFSYDSSSALLNYRGQV 142

Query: 145 LGRASVPG--FVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRVELRILGDVAA 204
           +G A +P             +  T+  DR+         L+ D  +   + L     V  
Sbjct: 143 IGEAPLPANRIAARKTVPLNITLTLMADRL----LSETQLLSDV-MAGVIPLNTFVKVTG 202

Query: 205 KIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGF 228
           K+ +L      VQ S  C + IS   +++T + C +
Sbjct: 203 KVTVLKIFKIKVQSSSSCDLSISVSDRNVTSQHCKY 231

BLAST of CmoCh11G012590.1 vs. NCBI nr
Match: gi|659092735|ref|XP_008447191.1| (PREDICTED: uncharacterized protein LOC103489700 isoform X2 [Cucumis melo])

HSP 1 Score: 365.2 bits (936), Expect = 8.7e-98
Identity = 200/231 (86.58%), Postives = 210/231 (90.91%), Query Frame = 1

Query: 2   SSSSSTSRPLPNGTLSTSTHRHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVV 61
           S+ SS+SR +PNG         HP Y+PRSSS SSA+FKGCCCCLFLL SFLALLVLA+V
Sbjct: 52  SAMSSSSRTVPNGV--------HPRYNPRSSS-SSATFKGCCCCLFLLFSFLALLVLAIV 111

Query: 62  LVVVLALKPKKPQFDLQQVGVQYMGITTPNTPTTPMASLSLNIRMVFTAVNPNKVGIKYS 121
           LVVVLALKPKKPQFDLQQVGVQYMGIT PN PTT  ASLSLNIRM+FTAVNPNKVGIKY 
Sbjct: 112 LVVVLALKPKKPQFDLQQVGVQYMGITNPN-PTT--ASLSLNIRMIFTAVNPNKVGIKYE 171

Query: 122 ESRFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRV 181
           ESRFTVMYRGIPLGRASVPGF Q+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRV
Sbjct: 172 ESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRV 231

Query: 182 ELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           ELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGL+V
Sbjct: 232 ELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLNV 270

BLAST of CmoCh11G012590.1 vs. NCBI nr
Match: gi|449444084|ref|XP_004139805.1| (PREDICTED: uncharacterized protein LOC101207234 [Cucumis sativus])

HSP 1 Score: 360.1 bits (923), Expect = 2.8e-96
Identity = 197/228 (86.40%), Postives = 207/228 (90.79%), Query Frame = 1

Query: 5   SSTSRPLPNGTLSTSTHRHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVLVV 64
           SS+SR +PNG         HP Y+PRSSS SSA+FKGCCCCLFLL SFLALLVLA+VLVV
Sbjct: 2   SSSSRTVPNGV--------HPRYNPRSSS-SSATFKGCCCCLFLLFSFLALLVLAIVLVV 61

Query: 65  VLALKPKKPQFDLQQVGVQYMGITTPNTPTTPMASLSLNIRMVFTAVNPNKVGIKYSESR 124
           VLALKPKKPQFDLQQV VQY+GIT PN PTT  ASLSLNIRM+FTAVNPNKVGIKY ESR
Sbjct: 62  VLALKPKKPQFDLQQVKVQYVGITNPN-PTT--ASLSLNIRMIFTAVNPNKVGIKYEESR 121

Query: 125 FTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRVELR 184
           FTVMYRGIPLGRASVPGF Q+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELR
Sbjct: 122 FTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELR 181

Query: 185 ILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           ILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGL+V
Sbjct: 182 ILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLNV 217

BLAST of CmoCh11G012590.1 vs. NCBI nr
Match: gi|590683364|ref|XP_007041580.1| (Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 [Theobroma cacao])

HSP 1 Score: 331.3 bits (848), Expect = 1.4e-87
Identity = 179/224 (79.91%), Postives = 193/224 (86.16%), Query Frame = 1

Query: 22  RHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVLVVVLALKPKKPQFDLQQVG 81
           RHHP Y PRSSS SSASFKGCCCCLFLL SFLALLVLAVVL++VLA+KPKKPQFDLQQVG
Sbjct: 39  RHHP-YYPRSSS-SSASFKGCCCCLFLLFSFLALLVLAVVLIIVLAVKPKKPQFDLQQVG 98

Query: 82  VQYMGITTPN-------------TPTTPMASLSLNIRMVFTAVNPNKVGIKYSESRFTVM 141
           VQYMGI+T N             TPTT  ASLSL I M+FTAVNPNKVGIKY ESRFTVM
Sbjct: 99  VQYMGISTSNPSAFDGAAAAVTTTPTT--ASLSLTIHMLFTAVNPNKVGIKYGESRFTVM 158

Query: 142 YRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDASLNDRVELRILGD 201
           YRGIPLG+A+VPGF QE HS R V+ T+AVDR NL+QADAADLIRDASLNDRVELR+LGD
Sbjct: 159 YRGIPLGKAAVPGFFQEAHSTRNVEATIAVDRANLMQADAADLIRDASLNDRVELRVLGD 218

Query: 202 VAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           V AKIR+L F+SPGVQVS+DCAIVISPRKQSLTYKQCGFDGLSV
Sbjct: 219 VGAKIRVLDFDSPGVQVSIDCAIVISPRKQSLTYKQCGFDGLSV 258

BLAST of CmoCh11G012590.1 vs. NCBI nr
Match: gi|658008149|ref|XP_008339262.1| (PREDICTED: uncharacterized protein LOC103402303 [Malus domestica])

HSP 1 Score: 328.6 bits (841), Expect = 9.1e-87
Identity = 182/237 (76.79%), Postives = 202/237 (85.23%), Query Frame = 1

Query: 3   SSSSTSRPLPNGTLSTSTHRHHPPYSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVL 62
           S SS+S P      ++S HRH+ P +  SSS SSASFKGCCCCLFLL SFLALLVLAVVL
Sbjct: 37  SPSSSSHPH-----NSSNHRHYYPTTSSSSS-SSASFKGCCCCLFLLFSFLALLVLAVVL 96

Query: 63  VVVLALKPKKPQFDLQQVGVQYMGITTPN-TPTTP------MASLSLNIRMVFTAVNPNK 122
           V+VLALKPKKPQFDLQQVGVQYMGI +PN TPT         ASLSLNIRM+F+A NPNK
Sbjct: 97  VIVLALKPKKPQFDLQQVGVQYMGINSPNPTPTADPNQNPTSASLSLNIRMLFSAANPNK 156

Query: 123 VGIKYSESRFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIRDA 182
           VGIKY ESRFTVMYRGIPLG+AS+PGF Q+ H+ RQV  T+AVDRVNLLQADA DL+RDA
Sbjct: 157 VGIKYGESRFTVMYRGIPLGKASIPGFYQDAHTVRQVVATIAVDRVNLLQADAXDLVRDA 216

Query: 183 SLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           SLNDRVELR+LGDV AKIR+L+F+SPGVQVSVDCAIVISPRKQSL+YKQCGFDGLSV
Sbjct: 217 SLNDRVELRVLGDVGAKIRVLNFDSPGVQVSVDCAIVISPRKQSLSYKQCGFDGLSV 267

BLAST of CmoCh11G012590.1 vs. NCBI nr
Match: gi|225447781|ref|XP_002265790.1| (PREDICTED: uncharacterized protein LOC100267543 [Vitis vinifera])

HSP 1 Score: 326.2 bits (835), Expect = 4.5e-86
Identity = 182/239 (76.15%), Postives = 194/239 (81.17%), Query Frame = 1

Query: 9   RPLPNGTLSTSTHRHHPP------YSPRSSSFSSASFKGCCCCLFLLLSFLALLVLAVVL 68
           RP PN       H HHP       Y   S S SSASFKGCCCCLFLL SFLALLVLAVVL
Sbjct: 14  RPPPN-------HHHHPHSQHHSHYQSPSYSPSSASFKGCCCCLFLLFSFLALLVLAVVL 73

Query: 69  VVVLALKPKKPQFDLQQVGVQYMGIT---------TPNTPTTPMASLSLNIRMVFTAVNP 128
           ++VLA+KPKKPQFDLQQVGVQYMGIT         +P TPT+  ASLSLNI+M+FTAVNP
Sbjct: 74  IIVLAVKPKKPQFDLQQVGVQYMGITANPSSTVAGSPPTPTS--ASLSLNIKMLFTAVNP 133

Query: 129 NKVGIKYSESRFTVMYRGIPLGRASVPGFVQEPHSQRQVDTTVAVDRVNLLQADAADLIR 188
           NKVGIKY ESRFTVMYRGIPLG+  VPGF Q  HS RQV+TTVAVDR NLLQADAADLI+
Sbjct: 134 NKVGIKYGESRFTVMYRGIPLGKGVVPGFYQPAHSVRQVETTVAVDRANLLQADAADLIK 193

Query: 189 DASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 233
           DASLNDRVELRILG+V AKIR+L F SPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV
Sbjct: 194 DASLNDRVELRILGEVGAKIRVLDFTSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 243

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0K5H7_CUCSA2.0e-9686.40Uncharacterized protein OS=Cucumis sativus GN=Csa_7G223380 PE=4 SV=1[more]
A0A061DZ99_THECC9.7e-8879.91Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 OS... [more]
F6H1R4_VITVI3.1e-8676.15Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0055g00370 PE=4 SV=... [more]
M5Y567_PRUPE5.3e-8675.42Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010161mg PE=4 SV=1[more]
M1CX10_SOLTU5.9e-8575.98Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400029777 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G01080.11.6e-7867.81 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT3G54200.11.7e-0622.22 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
Match NameE-valueIdentityDescription
gi|659092735|ref|XP_008447191.1|8.7e-9886.58PREDICTED: uncharacterized protein LOC103489700 isoform X2 [Cucumis melo][more]
gi|449444084|ref|XP_004139805.1|2.8e-9686.40PREDICTED: uncharacterized protein LOC101207234 [Cucumis sativus][more]
gi|590683364|ref|XP_007041580.1|1.4e-8779.91Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 [T... [more]
gi|658008149|ref|XP_008339262.1|9.1e-8776.79PREDICTED: uncharacterized protein LOC103402303 [Malus domestica][more]
gi|225447781|ref|XP_002265790.1|4.5e-8676.15PREDICTED: uncharacterized protein LOC100267543 [Vitis vinifera][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005886 plasma membrane
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh11G012590CmoCh11G012590gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh11G012590.1CmoCh11G012590.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh11G012590.1.exon.1CmoCh11G012590.1.exon.1exon
CmoCh11G012590.1.exon.2CmoCh11G012590.1.exon.2exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh11G012590.1.five_prime_UTR.1CmoCh11G012590.1.five_prime_UTR.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh11G012590.1.CDS.1CmoCh11G012590.1.CDS.1CDS
CmoCh11G012590.1.CDS.2CmoCh11G012590.1.CDS.2CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh11G012590.1.three_prime_UTR.1CmoCh11G012590.1.three_prime_UTR.1three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA-14PFAMPF03168LEA_2coord: 110..209
score: 1.2
NoneNo IPR availablePANTHERPTHR31234FAMILY NOT NAMEDcoord: 4..230
score: 1.1E
NoneNo IPR availablePANTHERPTHR31234:SF8EXPRESSED PROTEINcoord: 4..230
score: 1.1E
NoneNo IPR availableunknownSSF117070LEA14-likecoord: 54..189
score: 4.0