CSPI03G18650 (gene) Wild cucumber (PI 183967)

Name: CSPI03G18650
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Late embryogenesis abundant protein
Location: Chr3 : 14272404 .. 14275865 (+)
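The location value can be split into chromosome, start, end, and strand programmatically. The following is a minimal sketch, not part of the database record; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr3 : 14272404 .. 14275865 (+)'.

    Returns (chromosome, start, end, strand). The layout is taken from
    the record above; other layouts raise ValueError.
    """
    match = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 14272404 .. 14275865 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene span works out to 3462 bp.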
The following sequences are available for this feature:

Gene sequence (with intron)

TACATGAGGGTGATAAAAAGAGAAATTGGATATAAATGATGAATAAATATTTACCAAATTTCTATTCTCGTAATTCTTTAGAAAATCTTCACACACCTTGTTATGTCGATATTTAGATTTAAAAAGAAAACAAAAAGAAAGGTAAAAGGAAAACCCCTTCGTAAGGCTTAGTACAGAAAATGAAAATCTAAATTTAGATACAAACTAATACTTCAAAACTTATATGGTTTTGGACTAAATTGTGGTGGTAAATTTTATTCTTAAAGGTAATTATGAATAATATGATAATGAAAAAAAAAATAAAGAAAGAAAGAAAGGAAAGGAAATAGGGAGAAGAGTCGGAAACTTTTCCTTTTGGCACATTGTAACTTTAGTTAAGTATTAAGCAAAGTAGCCAAAGCTCACACACACTATTCTAACTCTCACTTCAAAGAAGAATAAAAGGAAAAAAAAAACACACACACACAAAAACATAGAGAATCGATCATGGAGGAAATGACGTCACGACCTCAACTGAACCCTCGCAACACGCAACCTCCTCTACCACCCCCACCGTCACGACGACCCGACAACAACCATCGCCCTCCTCTCCCGCCACCACCCTCAAGAGCTCCCTTCAATCTTCAAACCAATCCCCGTTCTCCTCCATTCCCATCAACCACACCTAATTCCAACACTCGCAACACTCGCTATCCATCTCCGCCATCTCCGCCGTCCTCTCGACGGCAACATTTCGGTTACGGAGCGGCATCCTCATCACCTTCCTTACGAGGCTGCTGCTGTTGCCTCTGCCTCCTCTTCTCATTCATCGCTCTCCTCGCGGTCGCCATCGTCCTCGTCATTGTCCTAGCTGTCAAACCTAAAAAGCCTCAGTTCGACCTCCAACGAGTCGGTGTTCAATACATGGGAATAACCGCTCCAAATCTCTTCTCATTGTCATCCTCCGACACCGAGACTGCTGCAACAACTTCAACGACATCCGCATCGTTGTCGCTCAACATTCGATTGCTGTTCACGGCGGTGAATCCGAACAAAGTAGGAATAAAATACGGGGATTCGAGGTTTACGGTGATGTACAGAGGGATACCGCTAGGGAAAGCGATAGTACCTGGATTTTACCAAGAGGCACATAGTGAAAGAGAGGTGGAGGCGACGATAGCCGTAGATCGAGTGAATTTGCTTCAGGCGGACGCGGCCGATCTGATAAGAGACGCTTCGTTGAACGATCGAGTTGAACTGAGGGTTTTGGGTGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCGGGCGTCCAGGTCAGTTTCCCACCTCTTTTTTCTTCTTCTTCTTAATTACCATTTTCACCTTGCTTTTATTTTTTTTCTTTTTAAAAATTACTCATTTAATTTTATTCGCTTTTGCTTTCAAGATTATGCATATGATGGAAGGCTTTCATTTTTTAGATGTTTGATTGGATTTTGGATTCTTTGTAATGAATTGATAGATTTTTGGTGAAAATAAAAGAATAGATGTGTGTAAGGAATCATAATTACATCTACTTACTCCAACAAATTAGAGCGCTTAAATTTTTCAAATAAAAAATATATAACAAAACTCAGAATTTATAGGGAAAAAATCTTCATTTTTATAGATTTATTACTTAAAATTAGTATTATTTTGACTATTTTTCATAACTTAAATTTAATAAACTTCTTTTTAAAAAGAATTTTTGACGTGGGAGAATCTAATCTCCAACGTGAAGATAAATAATATAAACCTTATCCCAGTGAAAATACCCATTATCCTAGTGGAAAGAAATAGCACGAGAAAAACTTAGATCAACCCATACTTCTGTTCGGATTCGTTATATGTATATTTAAATGAAATATTTGTTGAGGGTTGTTTCTATACACTGAGATGGAGGGACTAAATTGTTACCACTATAAATTTATATATATTTTTTTAAAAAAATCCAAATTTAATTAGCATAACTGAAAAAGCAATTCTCATTCCC
AAAAAAAATAGTACAATTAAATGAAATGAAGAATGCAATTATAAAAAAGAAGAAGAAGAAGAAGAAAAAAAAAGAAAGTGAAAAGTGAAAGAGTAAAAGAGTAAAAGAGTAAAAGAGTAAAAGAAAGCGTTGTTTAAGGAGGAAAGGGTGGGGGTTAGTTTGGTTATAGTTTTAAGGCGGCGATTTAGGGATTTAGGGGTGGGGGCCATGCCGACCGAACAAAACTACTTTTGTTTTTCAACCCTTTCACATTTATGAACTACTTTATCCCTTTCACCCATCATCTCTTCCCTCTATTCTAATCTTATTTCTTCATATCTCACAATAAACCTTTCACAAAATATTTACAATTTACATTCAAACAACTATTTCATTATTTTCATACAAGAAGACACAATATTTGAATATCTTTTCAATTTTATAGTAACACATTAATGGGTAAATAACATTTTCTTGACAAAATTAATAATAGGGATTAAATTCCCCAAAGTATATATATCAAATCATAATACACTTGAAAGTTGAATAACTAAACCTGACATTTTGAAGATTAGAGACCAAGAAAGAACCCAAGTAAATGTTTGAGGAAGAGATTTAATTTAAACCTTTAACTAACATGCTAATCAATATATTCATCATAGCTTGACACGTTGATCATTTCACTCACTATTGTAACCTTTTCTTTTCTTTTTTTAATTCATTTATTACTCTTAGAAAAAGATTTCTAAAGATAACCAAACTATGCCATACTCTATTACTCTTGTATGTTAAATGCTCCACCCATATCATTGTATTATATCTACCCAATATAATTCATGCTTATGTTTTGATCCGTGCCTAACAAAATATTGTTGTGTTTTGACGAGATGCTAATCGTAGTTGAATAAGATACTATATAAGAAGTTGAATAGTAGGAGATTATCTTACTAATCAATGTATACCAAAACATATTGTTGTTACTTGTTTTGACAACGACAAAGGAACTTAAAATGATCGATAAATGACAGGTGTCAGTGGATTGCTCAATAGTGATAAGTCCAAGGAATCAGTCATTGACTTCAAAGCAATGTGGATTTGATGGGTTCAGTTTATGATTCTTAGCCGCCCATACTTTTCTTTTTTCTATTCTAACCTAAGTTTCTTCTTCTTCCTCACTCTATTTCTGCTATTTTATACTCAACAGAGATTGGAATGATGAGGGAGAAAAAGAAGAAAAAGAAAGATGTACAAAATAAAAGAGAACAAGTAGGTGTGTGTGTGTGTGTAAAAGTAATTAACAAGCAATCCTTCTGTTCAGTACTTTCACATGGAAATGAGTCATCCAAAAGCTAGCAATGGCTTTCTTTTTCTTTGTCCTTGACTCCTTCTGATTCTTAATATTAAAAAAACAAAAGGAATGGTGAGAATCAGAAAACTTTATTCTTTTATATTTTTGGGAAGTTTTGTATTTCTTCACGTTGTC

mRNA sequence

ATGGAGGAAATGACGTCACGACCTCAACTGAACCCTCGCAACACGCAACCTCCTCTACCACCCCCACCGTCACGACGACCCGACAACAACCATCGCCCTCCTCTCCCGCCACCACCCTCAAGAGCTCCCTTCAATCTTCAAACCAATCCCCGTTCTCCTCCATTCCCATCAACCACACCTAATTCCAACACTCGCAACACTCGCTATCCATCTCCGCCATCTCCGCCGTCCTCTCGACGGCAACATTTCGGTTACGGAGCGGCATCCTCATCACCTTCCTTACGAGGCTGCTGCTGTTGCCTCTGCCTCCTCTTCTCATTCATCGCTCTCCTCGCGGTCGCCATCGTCCTCGTCATTGTCCTAGCTGTCAAACCTAAAAAGCCTCAGTTCGACCTCCAACGAGTCGGTGTTCAATACATGGGAATAACCGCTCCAAATCTCTTCTCATTGTCATCCTCCGACACCGAGACTGCTGCAACAACTTCAACGACATCCGCATCGTTGTCGCTCAACATTCGATTGCTGTTCACGGCGGTGAATCCGAACAAAGTAGGAATAAAATACGGGGATTCGAGGTTTACGGTGATGTACAGAGGGATACCGCTAGGGAAAGCGATAGTACCTGGATTTTACCAAGAGGCACATAGTGAAAGAGAGGTGGAGGCGACGATAGCCGTAGATCGAGTGAATTTGCTTCAGGCGGACGCGGCCGATCTGATAAGAGACGCTTCGTTGAACGATCGAGTTGAACTGAGGGTTTTGGGTGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCGGGCGTCCAGGTGTCAGTGGATTGCTCAATAGTGATAAGTCCAAGGAATCAGTCATTGACTTCAAAGCAATGTGGATTTGATGGGTTCAGTTTATGA

Coding sequence (CDS)

ATGGAGGAAATGACGTCACGACCTCAACTGAACCCTCGCAACACGCAACCTCCTCTACCACCCCCACCGTCACGACGACCCGACAACAACCATCGCCCTCCTCTCCCGCCACCACCCTCAAGAGCTCCCTTCAATCTTCAAACCAATCCCCGTTCTCCTCCATTCCCATCAACCACACCTAATTCCAACACTCGCAACACTCGCTATCCATCTCCGCCATCTCCGCCGTCCTCTCGACGGCAACATTTCGGTTACGGAGCGGCATCCTCATCACCTTCCTTACGAGGCTGCTGCTGTTGCCTCTGCCTCCTCTTCTCATTCATCGCTCTCCTCGCGGTCGCCATCGTCCTCGTCATTGTCCTAGCTGTCAAACCTAAAAAGCCTCAGTTCGACCTCCAACGAGTCGGTGTTCAATACATGGGAATAACCGCTCCAAATCTCTTCTCATTGTCATCCTCCGACACCGAGACTGCTGCAACAACTTCAACGACATCCGCATCGTTGTCGCTCAACATTCGATTGCTGTTCACGGCGGTGAATCCGAACAAAGTAGGAATAAAATACGGGGATTCGAGGTTTACGGTGATGTACAGAGGGATACCGCTAGGGAAAGCGATAGTACCTGGATTTTACCAAGAGGCACATAGTGAAAGAGAGGTGGAGGCGACGATAGCCGTAGATCGAGTGAATTTGCTTCAGGCGGACGCGGCCGATCTGATAAGAGACGCTTCGTTGAACGATCGAGTTGAACTGAGGGTTTTGGGTGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCGGGCGTCCAGGTGTCAGTGGATTGCTCAATAGTGATAAGTCCAAGGAATCAGTCATTGACTTCAAAGCAATGTGGATTTGATGGGTTCAGTTTATGA
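The coding sequence can be translated to check it against the protein used as the BLAST query. The sketch below is illustrative only; it assumes the standard genetic code and uses a hypothetical helper named `translate` rather than any particular library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Unknown codons become 'X'. With to_stop=True, translation ends at
    the first stop codon, which is not included in the result.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 42 nucleotides of the CDS above.
print(translate("ATGGAGGAAATGACGTCACGACCTCAACTGAACCCTCGCAAC"))
```

The first 14 codons give MEEMTSRPQLNPRN, which agrees with the start of the query sequence in the BLAST alignments.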
BLAST of CSPI03G18650 vs. TrEMBL
Match: A0A0A0L9B0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G209460 PE=4 SV=1)

HSP 1 Score: 565.5 bits (1456), Expect = 4.0e-158
Identity = 300/300 (100.00%), Positives = 300/300 (100.00%), Query Frame = 1

Query: 1   MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP 60
           MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP
Sbjct: 1   MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP 60

Query: 61  NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV 120
           NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV
Sbjct: 61  NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV 120

Query: 121 LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN 180
           LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN
Sbjct: 121 LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN 180

Query: 181 PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI 240
           PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI
Sbjct: 181 PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI 240

Query: 241 RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 300
           RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL
Sbjct: 241 RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 300

BLAST of CSPI03G18650 vs. TrEMBL
Match: B9SPL8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1183370 PE=4 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 2.5e-83
Identity = 180/266 (67.67%), Positives = 206/266 (77.44%), Query Frame = 1

Query: 37  PPPSRAPFNLQTNPRSPPFPSTTPNSNTRNTRYPSPPSPPSSRRQHFG--YGAASSSPSL 96
           PPP  AP   Q +  +    +T+ N   R T+ P P        QH    Y  +SSS SL
Sbjct: 2   PPPGPAPQQQQHHHPT----TTSQNGEHRPTQRPPPQQQQHHHHQHHHPYYPTSSSSASL 61

Query: 97  RGCCCCLCLLFSFIALLAVAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD 156
           +GCCCCL LLFSF+ALL +AI L+I+LAVKPKKPQFDLQ+VGVQYMGI+A N    +S D
Sbjct: 62  KGCCCCLFLLFSFLALLVLAIFLIIILAVKPKKPQFDLQQVGVQYMGISASN--PTASLD 121

Query: 157 TETAATTSTTSASLSLNIRLLFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEA 216
             T   T  T+ASLSL I +LFTAVNPNKVGIKYG+SRFTVMY GIPLGKA VPGFYQEA
Sbjct: 122 PTTTVATGPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYHGIPLGKASVPGFYQEA 181

Query: 217 HSEREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVS 276
           HSER+VEATI+VDR +L+QA+AADLIRDASLNDRVELRVLGEVGA+IRVLDFDSPGVQVS
Sbjct: 182 HSERQVEATISVDRYSLMQANAADLIRDASLNDRVELRVLGEVGAKIRVLDFDSPGVQVS 241

Query: 277 VDCSIVISPRNQSLTSKQCGFDGFSL 301
           V+C+I ISPR QSLT K CGFDG S+
Sbjct: 242 VNCAIAISPRKQSLTYKDCGFDGLSV 261
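The identity figure in an HSP header is the number of identical aligned positions divided by the alignment length. The following sketch shows that arithmetic on the final alignment block above; the function names `count_identities` and `percent` are hypothetical, and gap characters (`-`) are assumed never to count as identities.

```python
def count_identities(query, subject):
    """Return (identical_positions, alignment_length) for two aligned rows.

    Both rows must have the same length; '-' marks a gap.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)

def percent(count, total):
    # Percentage rounded to two decimals, as in the HSP header lines.
    return round(100.0 * count / total, 2)

# Last alignment block of the B9SPL8_RICCO hit (query 277..301).
identical, length = count_identities(
    "VDCSIVISPRNQSLTSKQCGFDGFSL",
    "VNCAIAISPRKQSLTYKDCGFDGLSV",
)
print(identical, length, percent(identical, length))

# Whole-HSP counts taken from the header of the same hit.
print(percent(180, 266), percent(206, 266))
```

For this block, 18 of 26 positions are identical; the whole-HSP counts 180/266 and 206/266 reproduce the reported 67.67% and 77.44%.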

BLAST of CSPI03G18650 vs. TrEMBL
Match: A0A061DZ99_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 OS=Theobroma cacao GN=TCM_006430 PE=4 SV=1)

HSP 1 Score: 316.2 bits (809), Expect = 4.2e-83
Identity = 177/259 (68.34%), Positives = 203/259 (78.38%), Query Frame = 1

Query: 43  PFNLQTNPRSPPFPSTTPNSNTRNTRYPSPPSPPSSRRQHFGYG-AASSSPSLRGCCCCL 102
           P N+  N   P      P +N  +        PP  +R H  Y  ++SSS S +GCCCCL
Sbjct: 3   PTNMSPN-HQPHAREMRPTANGEHHHRGLTAPPPRPQRHHPYYPRSSSSSASFKGCCCCL 62

Query: 103 CLLFSFIALLAVAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATT 162
            LLFSF+ALL +A+VL+IVLAVKPKKPQFDLQ+VGVQYMGI+  N  +   +    A TT
Sbjct: 63  FLLFSFLALLVLAVVLIIVLAVKPKKPQFDLQQVGVQYMGISTSNPSAFDGA--AAAVTT 122

Query: 163 STTSASLSLNIRLLFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVE 222
           + T+ASLSL I +LFTAVNPNKVGIKYG+SRFTVMYRGIPLGKA VPGF+QEAHS R VE
Sbjct: 123 TPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYRGIPLGKAAVPGFFQEAHSTRNVE 182

Query: 223 ATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVI 282
           ATIAVDR NL+QADAADLIRDASLNDRVELRVLG+VGA+IRVLDFDSPGVQVS+DC+IVI
Sbjct: 183 ATIAVDRANLMQADAADLIRDASLNDRVELRVLGDVGAKIRVLDFDSPGVQVSIDCAIVI 242

Query: 283 SPRNQSLTSKQCGFDGFSL 301
           SPR QSLT KQCGFDG S+
Sbjct: 243 SPRKQSLTYKQCGFDGLSV 258

BLAST of CSPI03G18650 vs. TrEMBL
Match: A0A067LHD5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16906 PE=4 SV=1)

HSP 1 Score: 315.1 bits (806), Expect = 9.4e-83
Identity = 181/269 (67.29%), Positives = 210/269 (78.07%), Query Frame = 1

Query: 33  PPLPPPPSRAPFNLQTNPRSPPFPSTTPNSNTRNTRYPSPPSPPSSRRQHFGYGAASSSP 92
           PP P P S A  N    P  PP P++  + N  +  +P  P+            ++SSS 
Sbjct: 2   PPPPQPQSSASLNGDHRPPRPP-PASNTHQNHHHHHHPYYPT-----------SSSSSSA 61

Query: 93  SLRGCCCCLCLLFSFIALLAVAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLS- 152
           SL+GCCCCL LLFSF+ALL +AI L+I+LAVKPKKPQFDLQ+VGVQYMGI+A N  S   
Sbjct: 62  SLKGCCCCLFLLFSFLALLVLAIFLIIILAVKPKKPQFDLQQVGVQYMGISASNPASFDP 121

Query: 153 SSDTETAATTSTTSASLSLNIRLLFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFY 212
           S+ T T   T  T+ASLSL I +LFTAVNPNKVGIKYG+SRFTVMY GIPLGKA VPGFY
Sbjct: 122 STTTTTTVATGPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYHGIPLGKASVPGFY 181

Query: 213 QEAHSEREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGV 272
           QEAHSER+VEATI+VDR +L+QA+AA+LIRDASLNDRVELRVLGEVGA+IRVLDFDSPGV
Sbjct: 182 QEAHSERQVEATISVDRYSLMQANAAELIRDASLNDRVELRVLGEVGAKIRVLDFDSPGV 241

Query: 273 QVSVDCSIVISPRNQSLTSKQCGFDGFSL 301
           QVSV+C+IVISPR QSLT KQCGFDG S+
Sbjct: 242 QVSVNCAIVISPRKQSLTYKQCGFDGLSV 258

BLAST of CSPI03G18650 vs. TrEMBL
Match: A0A067EH55_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g023930mg PE=4 SV=1)

HSP 1 Score: 315.1 bits (806), Expect = 9.4e-83
Identity = 174/264 (65.91%), Positives = 202/264 (76.52%), Query Frame = 1

Query: 48  TNPRSPPFPSTTPNSNTRNTRYPSPPSPPS-----------SRRQHFGYGAASSSPSLRG 107
           + P+ PP   T PN    + R P PP PP               Q++   ++SSS S RG
Sbjct: 18  SQPKMPP--QTQPNGTHHHQRRPHPPPPPPLQPQSQYHHHHDHHQYYPTTSSSSSASFRG 77

Query: 108 CCCCLCLLFSFIALLAVAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTE 167
           CCCCL LLFSFIALL +A+VL++ LAVKPKKPQFDLQ+VGVQYMGI+ PN     +S  +
Sbjct: 78  CCCCLFLLFSFIALLILAVVLIVFLAVKPKKPQFDLQQVGVQYMGISTPN----PTSSVD 137

Query: 168 TAATTSTTSASLSLNIRLLFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHS 227
            + T + TSASLSL I LLFTA NPNKVGIKYG+S+FTVMYRGIPLGKA VPGFYQ AHS
Sbjct: 138 PSTTIAATSASLSLTIHLLFTAANPNKVGIKYGESKFTVMYRGIPLGKASVPGFYQGAHS 197

Query: 228 EREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVD 287
            R VEATIAVDR NL+QADAA LI+DASLNDRVELRVLG+V A+IRV++FDSPGVQVSVD
Sbjct: 198 VRNVEATIAVDRANLMQADAASLIKDASLNDRVELRVLGDVSAKIRVMNFDSPGVQVSVD 257

Query: 288 CSIVISPRNQSLTSKQCGFDGFSL 301
           C+IVISPR QSLT KQCGFDG ++
Sbjct: 258 CAIVISPRKQSLTYKQCGFDGLTV 275

BLAST of CSPI03G18650 vs. TAIR10
Match: AT2G01080.1 (AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 284.3 bits (726), Expect = 9.0e-77
Identity = 152/242 (62.81%), Positives = 187/242 (77.27%), Query Frame = 1

Query: 72  PPSPPSSR-------------RQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLV 131
           PP P SSR             + ++   ++SSS SL+GCCCCL LLF+F+ALL +A+VL+
Sbjct: 2   PPPPSSSRAGLNGDPIAAQNQQPYYRSYSSSSSASLKGCCCCLFLLFAFLALLVLAVVLI 61

Query: 132 IVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTA 191
           ++LAVKPKKPQFDLQ+V V YMGI+ P+            A    T+ASLSL IR+LFTA
Sbjct: 62  VILAVKPKKPQFDLQQVAVVYMGISNPS------------AVLDPTTASLSLTIRMLFTA 121

Query: 192 VNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAAD 251
           VNPNKVGI+YG+S FTVMY+G+PLG+A VPGFYQ+AHS + VEATI+VDRVNL+QA AAD
Sbjct: 122 VNPNKVGIRYGESSFTVMYKGMPLGRATVPGFYQDAHSTKNVEATISVDRVNLMQAHAAD 181

Query: 252 LIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGF 301
           L+RDASLNDRVEL V G+VGA+IRV++FDSPGVQVSV+C I ISPR Q+L  KQCGFDG 
Sbjct: 182 LVRDASLNDRVELTVRGDVGAKIRVMNFDSPGVQVSVNCGIGISPRKQALIYKQCGFDGL 231

BLAST of CSPI03G18650 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 56.6 bits (135), Expect = 3.0e-08
Identity = 54/226 (23.89%), Positives = 91/226 (40.27%), Query Frame = 1

Query: 70  PSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIVLAVKPKKPQ 129
           P  P+  S   Q    G A      R C  C+C     I L+A+ IV++     KPK+P 
Sbjct: 24  PPKPNASSMETQSANTGTAKKLRRKRNCKICICFTILLILLIAIVIVILAFTLFKPKRPT 83

Query: 130 FDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVNPNKVGIKYG 189
             +  V V  +               + +         L+L + +  +  NPN++G  Y 
Sbjct: 84  TTIDSVTVDRL---------------QASVNPLLLKVLLNLTLNVDLSLKNPNRIGFSYD 143

Query: 190 DSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLIRDASLNDRV 249
            S   + YRG  +G+A +P     A     +  T+ +    LL      L+ D  +   +
Sbjct: 144 SSSALLNYRGQVIGEAPLPANRIAARKTVPLNITLTLMADRLL--SETQLLSDV-MAGVI 203

Query: 250 ELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGF 296
            L    +V  ++ VL      VQ S  C + IS  ++++TS+ C +
Sbjct: 204 PLNTFVKVTGKVTVLKIFKIKVQSSSSCDLSISVSDRNVTSQHCKY 231

BLAST of CSPI03G18650 vs. NCBI nr
Match: gi|449446081|ref|XP_004140800.1| (PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis sativus])

HSP 1 Score: 565.5 bits (1456), Expect = 5.7e-158
Identity = 300/300 (100.00%), Positives = 300/300 (100.00%), Query Frame = 1

Query: 1   MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP 60
           MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP
Sbjct: 1   MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP 60

Query: 61  NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV 120
           NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV
Sbjct: 61  NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV 120

Query: 121 LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN 180
           LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN
Sbjct: 121 LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN 180

Query: 181 PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI 240
           PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI
Sbjct: 181 PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI 240

Query: 241 RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 300
           RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL
Sbjct: 241 RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 300

BLAST of CSPI03G18650 vs. NCBI nr
Match: gi|659112197|ref|XP_008456109.1| (PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis melo])

HSP 1 Score: 543.9 bits (1400), Expect = 1.8e-151
Identity = 288/300 (96.00%), Positives = 292/300 (97.33%), Query Frame = 1

Query: 1   MEEMTSRPQLNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTTP 60
           MEEMTSRPQLNPR+TQPPLPPPPSRRP NNH  PLPPPPSRAPFNL +NPRSPPFPST P
Sbjct: 1   MEEMTSRPQLNPRSTQPPLPPPPSRRPHNNHHRPLPPPPSRAPFNLHSNPRSPPFPSTAP 60

Query: 61  NSNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLVIV 120
           N NTRNTRYPSPPSPPSSRRQHFGYGAASSSPS RGCCCCLCLLFSFIALLA+AI+LVIV
Sbjct: 61  NPNTRNTRYPSPPSPPSSRRQHFGYGAASSSPSFRGCCCCLCLLFSFIALLAIAIILVIV 120

Query: 121 LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTAVN 180
           LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD ETAATTSTTSASLSLNIRLLFTAVN
Sbjct: 121 LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTSTTSASLSLNIRLLFTAVN 180

Query: 181 PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI 240
           PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI
Sbjct: 181 PNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAADLI 240

Query: 241 RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 300
           RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL
Sbjct: 241 RDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 300

BLAST of CSPI03G18650 vs. NCBI nr
Match: gi|743928961|ref|XP_011008700.1| (PREDICTED: uncharacterized protein LOC105114011 [Populus euphratica])

HSP 1 Score: 321.6 bits (823), Expect = 1.4e-84
Identity = 174/242 (71.90%), Positives = 201/242 (83.06%), Query Frame = 1

Query: 62  SNTRNTRYPSPPSPPSSRRQH---FGYGAASSSPSLRGCCCCLCLLFSFIALLAVAIVLV 121
           SN  N     P  P S+++QH   +   A+SSS S +GCCCCL LLFSF+ALL +A+ LV
Sbjct: 8   SNGENPASQRPQPPHSNQQQHHQPYYSSASSSSASFKGCCCCLFLLFSFLALLILAVFLV 67

Query: 122 IVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTSTTSASLSLNIRLLFTA 181
           I+LAVKPKKPQFDLQ+VGVQYMGITA N    +S D  TA  T+  +ASLSL I +LFTA
Sbjct: 68  IILAVKPKKPQFDLQQVGVQYMGITASN--PTASMDPTTATATTPATASLSLTIHMLFTA 127

Query: 182 VNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQADAAD 241
           VNPNKVGIKYG+S F+VMYRGIPLGKA+VPGFYQEAHS+R+VEATI+VDR +L+QADA+D
Sbjct: 128 VNPNKVGIKYGESSFSVMYRGIPLGKALVPGFYQEAHSQRQVEATISVDRYSLMQADASD 187

Query: 242 LIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGF 301
           LIRDASLNDRVELRVLGEVGA+IRVLD DSPGVQVSVDC+IVISPR QSLT KQCGFDG 
Sbjct: 188 LIRDASLNDRVELRVLGEVGAKIRVLDLDSPGVQVSVDCAIVISPRKQSLTYKQCGFDGL 247

BLAST of CSPI03G18650 vs. NCBI nr
Match: gi|255574042|ref|XP_002527937.1| (PREDICTED: uncharacterized protein LOC8273514 [Ricinus communis])

HSP 1 Score: 317.0 bits (811), Expect = 3.5e-83
Identity = 180/266 (67.67%), Positives = 206/266 (77.44%), Query Frame = 1

Query: 37  PPPSRAPFNLQTNPRSPPFPSTTPNSNTRNTRYPSPPSPPSSRRQHFG--YGAASSSPSL 96
           PPP  AP   Q +  +    +T+ N   R T+ P P        QH    Y  +SSS SL
Sbjct: 2   PPPGPAPQQQQHHHPT----TTSQNGEHRPTQRPPPQQQQHHHHQHHHPYYPTSSSSASL 61

Query: 97  RGCCCCLCLLFSFIALLAVAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD 156
           +GCCCCL LLFSF+ALL +AI L+I+LAVKPKKPQFDLQ+VGVQYMGI+A N    +S D
Sbjct: 62  KGCCCCLFLLFSFLALLVLAIFLIIILAVKPKKPQFDLQQVGVQYMGISASN--PTASLD 121

Query: 157 TETAATTSTTSASLSLNIRLLFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEA 216
             T   T  T+ASLSL I +LFTAVNPNKVGIKYG+SRFTVMY GIPLGKA VPGFYQEA
Sbjct: 122 PTTTVATGPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYHGIPLGKASVPGFYQEA 181

Query: 217 HSEREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVS 276
           HSER+VEATI+VDR +L+QA+AADLIRDASLNDRVELRVLGEVGA+IRVLDFDSPGVQVS
Sbjct: 182 HSERQVEATISVDRYSLMQANAADLIRDASLNDRVELRVLGEVGAKIRVLDFDSPGVQVS 241

Query: 277 VDCSIVISPRNQSLTSKQCGFDGFSL 301
           V+C+I ISPR QSLT K CGFDG S+
Sbjct: 242 VNCAIAISPRKQSLTYKDCGFDGLSV 261

BLAST of CSPI03G18650 vs. NCBI nr
Match: gi|590683364|ref|XP_007041580.1| (Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 [Theobroma cacao])

HSP 1 Score: 316.2 bits (809), Expect = 6.0e-83
Identity = 177/259 (68.34%), Positives = 203/259 (78.38%), Query Frame = 1

Query: 43  PFNLQTNPRSPPFPSTTPNSNTRNTRYPSPPSPPSSRRQHFGYG-AASSSPSLRGCCCCL 102
           P N+  N   P      P +N  +        PP  +R H  Y  ++SSS S +GCCCCL
Sbjct: 3   PTNMSPN-HQPHAREMRPTANGEHHHRGLTAPPPRPQRHHPYYPRSSSSSASFKGCCCCL 62

Query: 103 CLLFSFIALLAVAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATT 162
            LLFSF+ALL +A+VL+IVLAVKPKKPQFDLQ+VGVQYMGI+  N  +   +    A TT
Sbjct: 63  FLLFSFLALLVLAVVLIIVLAVKPKKPQFDLQQVGVQYMGISTSNPSAFDGA--AAAVTT 122

Query: 163 STTSASLSLNIRLLFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVE 222
           + T+ASLSL I +LFTAVNPNKVGIKYG+SRFTVMYRGIPLGKA VPGF+QEAHS R VE
Sbjct: 123 TPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYRGIPLGKAAVPGFFQEAHSTRNVE 182

Query: 223 ATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVI 282
           ATIAVDR NL+QADAADLIRDASLNDRVELRVLG+VGA+IRVLDFDSPGVQVS+DC+IVI
Sbjct: 183 ATIAVDRANLMQADAADLIRDASLNDRVELRVLGDVGAKIRVLDFDSPGVQVSIDCAIVI 242

Query: 283 SPRNQSLTSKQCGFDGFSL 301
           SPR QSLT KQCGFDG S+
Sbjct: 243 SPRKQSLTYKQCGFDGLSV 258

The following BLAST results are available for this feature:
Match Name                        E-value   Identity  Description
A0A0A0L9B0_CUCSA                  4.0e-158  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G209460 PE=4 SV=1
B9SPL8_RICCO                      2.5e-83   67.67     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1183370 PE=4 SV=1
A0A061DZ99_THECC                  4.2e-83   68.34     Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 OS...
A0A067LHD5_JATCU                  9.4e-83   67.29     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16906 PE=4 SV=1
A0A067EH55_CITSI                  9.4e-83   65.91     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g023930mg PE=4 SV=1

Match Name                        E-value   Identity  Description
AT2G01080.1                       9.0e-77   62.81     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT3G54200.1                       3.0e-08   23.89     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

Match Name                        E-value   Identity  Description
gi|449446081|ref|XP_004140800.1|  5.7e-158  100.00    PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis sativus]
gi|659112197|ref|XP_008456109.1|  1.8e-151  96.00     PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis melo]
gi|743928961|ref|XP_011008700.1|  1.4e-84   71.90     PREDICTED: uncharacterized protein LOC105114011 [Populus euphratica]
gi|255574042|ref|XP_002527937.1|  3.5e-83   67.67     PREDICTED: uncharacterized protein LOC8273514 [Ricinus communis]
gi|590683364|ref|XP_007041580.1|  6.0e-83   68.34     Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 [T...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0016310      phosphorylation
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016301      kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G18650.1  CSPI03G18650.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                              Source   Source Term    Source Description  Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168        LEA_2               coord: 178..277, score: 4.4
None       No IPR available                             PANTHER  PTHR31234      FAMILY NOT NAMED    coord: 6..38, score: 6.6E-132; coord: 56..298, score: 6.6E
None       No IPR available                             PANTHER  PTHR31234:SF8  EXPRESSED PROTEIN   coord: 6..38, score: 6.6E-132; coord: 56..298, score: 6.6E
None       No IPR available                             unknown  SSF117070      LEA14-like          coord: 169..257, score: 2.8