CSPI04G06460 (gene) Wild cucumber (PI 183967)

Name: CSPI04G06460
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein
Location: Chr4 : 4492220 .. 4493123 (-)
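
The location gives a start and an end coordinate on Chr4 plus a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state), the span of the gene feature can be computed as below. The function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = feature_length(4492220, 4493123)
print(gene_length)  # 904
```

The strand symbol (-) does not change the length; it only indicates that the gene is read from the reverse strand.
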
The following sequences are available for this feature:

Gene sequence (with intron)

AAACAAACCTCACTTTTCAAATTTAGAACCCAAAAAATATCTATTAAAATTATTCATTATTTATTTAATATTCATTTTTTCTAAAAAAACTTAATTTTATTTGCACAATTTCCACACTTACTTACTTTCTCTATTCATTTTCTCTTCCATCATTTTATTTTGTATCCAAACACACCGCTATTCTCTTTCCTCTCTTCCAAATTCAGATCCCGTTCCACCTCCGCCGTAAGCACCGCCATGGGCAATCCCCACGGCCACGGCGATCACCCGCCATCTGGCCGCACCAACTTGGCGTCCTGTGTAGTTGCCACAGTCTTCTTGATCTTCCTCATCATCGTCATCCTTATCGTCTTCTTCACTGTCTTCAAGCCTCAGGATCCAAAGATCGCCGTTTCCGCCGTCCAGTTGCCGTCCTTCTCCGTCGCCAACGGCACTATCAATTTCACTTTCTCACAGTACGTCTCCGTCAAAAACCCTAACAAAGCCTCTTTCTCTCACTACGACAGTTCCCTCCAACTCCTCTACTCCGGTTCTCAAATTGGATTTATGTTCATTCCGGCCGGTAAAATCGACGCCGGTCAGACGCAGTACATGGCAGCTACCTTCTCTGTCCAGTCATTCCCGTTGGCTGCTCCAGTCGCCTCCGTTGGAGCTGGACCTACCTTCTCGGAGGGAATGAACGGGTACAGAGTCGGACCGATACTGGAGATTGAATCTAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCACGTGGAAGCCACGTCGAGCTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCATTGCTAATTCTTCTTCTGGAAAATTTGGTCAGTTGGGTGTTGAGATTCTCATACTCTTTTAAGTGTAAAAA

mRNA sequence

ATGGGCAATCCCCACGGCCACGGCGATCACCCGCCATCTGGCCGCACCAACTTGGCGTCCTGTGTAGTTGCCACAGTCTTCTTGATCTTCCTCATCATCGTCATCCTTATCGTCTTCTTCACTGTCTTCAAGCCTCAGGATCCAAAGATCGCCGTTTCCGCCGTCCAGTTGCCGTCCTTCTCCGTCGCCAACGGCACTATCAATTTCACTTTCTCACAGTACGTCTCCGTCAAAAACCCTAACAAAGCCTCTTTCTCTCACTACGACAGTTCCCTCCAACTCCTCTACTCCGGTTCTCAAATTGGATTTATGTTCATTCCGGCCGGTAAAATCGACGCCGGTCAGACGCAGTACATGGCAGCTACCTTCTCTGTCCAGTCATTCCCGTTGGCTGCTCCAGTCGCCTCCGTTGGAGCTGGACCTACCTTCTCGGAGGGAATGAACGGGTACAGAGTCGGACCGATACTGGAGATTGAATCTAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCACGTGGAAGCCACGTCGAGCTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCATTGCTAA

Coding sequence (CDS)

ATGGGCAATCCCCACGGCCACGGCGATCACCCGCCATCTGGCCGCACCAACTTGGCGTCCTGTGTAGTTGCCACAGTCTTCTTGATCTTCCTCATCATCGTCATCCTTATCGTCTTCTTCACTGTCTTCAAGCCTCAGGATCCAAAGATCGCCGTTTCCGCCGTCCAGTTGCCGTCCTTCTCCGTCGCCAACGGCACTATCAATTTCACTTTCTCACAGTACGTCTCCGTCAAAAACCCTAACAAAGCCTCTTTCTCTCACTACGACAGTTCCCTCCAACTCCTCTACTCCGGTTCTCAAATTGGATTTATGTTCATTCCGGCCGGTAAAATCGACGCCGGTCAGACGCAGTACATGGCAGCTACCTTCTCTGTCCAGTCATTCCCGTTGGCTGCTCCAGTCGCCTCCGTTGGAGCTGGACCTACCTTCTCGGAGGGAATGAACGGGTACAGAGTCGGACCGATACTGGAGATTGAATCTAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCACGTGGAAGCCACGTCGAGCTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCATTGCTAA
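
The coding sequence can be checked against the protein shown in the BLAST alignments by translating it. The following is a minimal sketch that assumes the standard genetic code; the helper `translate` and the table construction are illustrative and are not part of the database. Only the first 42 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 42 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGGCAATCCCCACGGCCACGGCGATCACCCGCCATCTGGC"
cds_end = "GGTTTCCATTGCTAA"
print(translate(cds_start))  # MGNPHGHGDHPPSG
print(translate(cds_end))    # GFHC*
```

Both fragments agree with the start (MGNPHGHGDHPPSG) and end (GFHC) of the query protein in the alignments that follow.
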
BLAST of CSPI04G06460 vs. TrEMBL
Match: A0A0A0KX26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G052640 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 8.1e-107
Identity = 200/200 (100.00%), Positives = 200/200 (100.00%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF
Sbjct: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA
Sbjct: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120

Query: 121 ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 180
           ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA
Sbjct: 121 ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 180

Query: 181 TSSCRVAIAVSDGSVLGFHC 201
           TSSCRVAIAVSDGSVLGFHC
Sbjct: 181 TSSCRVAIAVSDGSVLGFHC 200

BLAST of CSPI04G06460 vs. TrEMBL
Match: A0A067JZ66_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20441 PE=4 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 2.0e-81
Identity = 157/200 (78.50%), Positives = 179/200 (89.50%), Query Frame = 1

Query: 11  PPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGTINFT 70
           PPSGRTNLASC+VAT+FLIF+II+ILIVFFTVFKP+DPKI+V+AVQLPSFSV+N T+NFT
Sbjct: 7   PPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVNFT 66

Query: 71  FSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQSFPL 130
           FSQYVSVKNPNKASFSHYDS+LQLLYSGSQ+GFMFIPAGKIDAG+TQYMAATF+VQSFPL
Sbjct: 67  FSQYVSVKNPNKASFSHYDSTLQLLYSGSQVGFMFIPAGKIDAGRTQYMAATFAVQSFPL 126

Query: 131 -AAPVASVGAGPTFSEGM---------NGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 190
            ++P A+V  GPTF+ G+          GYRVGP +EIES++ MAGRVRVLH FTHHVEA
Sbjct: 127 SSSPDAAVNVGPTFAGGVLPGGYPSVNGGYRVGPTMEIESRIHMAGRVRVLHIFTHHVEA 186

Query: 191 TSSCRVAIAVSDGSVLGFHC 201
            + CRVAIAVSDGSVLGFHC
Sbjct: 187 KAGCRVAIAVSDGSVLGFHC 206
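
The Identity and Positives figures in each HSP header are a count divided by the alignment length. As a minimal sketch (the function names are illustrative and not part of BLAST), the percentage can be reproduced from the counts, and identities can be counted directly for a pair of aligned rows. The last alignment block of the Jatropha curcas hit is used as input. Positives also include conservative substitutions, marked "+" in the middle line; counting them requires a scoring matrix and is not reproduced here.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


def count_identities(query_row: str, sbjct_row: str) -> int:
    """Count columns where both aligned rows carry the same residue."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    return sum(
        1 for q, s in zip(query_row, sbjct_row) if q == s and q != "-"
    )


# Header counts of the Jatropha curcas HSP: 157/200 and 179/200.
print(percent(157, 200))  # 78.50
print(percent(179, 200))  # 89.50

# Final alignment block of the same HSP (20 columns).
query_row = "TSSCRVAIAVSDGSVLGFHC"
sbjct_row = "KAGCRVAIAVSDGSVLGFHC"
print(count_identities(query_row, sbjct_row))  # 17
```
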

BLAST of CSPI04G06460 vs. TrEMBL
Match: B9GXJ6_POPTR (Proline-rich family protein OS=Populus trichocarpa GN=POPTR_0003s14110g PE=4 SV=2)

HSP 1 Score: 305.4 bits (781), Expect = 5.0e-80
Identity = 156/211 (73.93%), Positives = 179/211 (84.83%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           M  PH     PPSGRTNLASC+VAT+FLIFL+I+ILIVFFTVFKP+DPKI+V++VQLPSF
Sbjct: 1   MSKPH----RPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SV+N T+NFTFSQYVSVKNPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKIDAG+TQYMA
Sbjct: 61  SVSNNTVNFTFSQYVSVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMA 120

Query: 121 ATFSVQSFPL-AAPVASVGAGPTFSEG----------MNGYRVGPILEIESKMDMAGRVR 180
           ATFSV+SFPL A+P A+V  GP F++G           NGYRVGP +EIES++ MAGRVR
Sbjct: 121 ATFSVESFPLSASPDAAVNVGPAFNDGGFGGGGQTGFNNGYRVGPTMEIESRIQMAGRVR 180

Query: 181 VLHFFTHHVEATSSCRVAIAVSDGSVLGFHC 201
           VLHFFTHH+E    CRV IAVSDGSVLGFHC
Sbjct: 181 VLHFFTHHLETKVGCRVVIAVSDGSVLGFHC 207

BLAST of CSPI04G06460 vs. TrEMBL
Match: B9RCU1_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1692280 PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 7.2e-79
Identity = 153/209 (73.21%), Positives = 179/209 (85.65%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           M  PH     PPSGRTNLASC+VAT+FLIF+II+ILIVFFTVFKP+DPKI+V+AVQLPSF
Sbjct: 1   MSKPH----RPPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SV+N T+NFTFSQYVSVKNPN+A+FSHYDS+LQLLYSGSQ+GFMFIPAGKI++G+TQYMA
Sbjct: 61  SVSNNTVNFTFSQYVSVKNPNRATFSHYDSTLQLLYSGSQVGFMFIPAGKIESGRTQYMA 120

Query: 121 ATFSVQSFPL-AAPVASVGAGPTFS--------EGMNGYRVGPILEIESKMDMAGRVRVL 180
           ATF+VQSFPL ++P A+V  GP F+           NG+RVGP +EIES++ M GRVRVL
Sbjct: 121 ATFAVQSFPLSSSPDAAVNVGPAFTGSGFPGVPGSSNGFRVGPTMEIESRIQMVGRVRVL 180

Query: 181 HFFTHHVEATSSCRVAIAVSDGSVLGFHC 201
           H FTHHVEA + CRVAIAVSDGSVLGFHC
Sbjct: 181 HIFTHHVEAKAECRVAIAVSDGSVLGFHC 205

BLAST of CSPI04G06460 vs. TrEMBL
Match: F6HU27_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0025g01600 PE=4 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 1.6e-78
Identity = 146/190 (76.84%), Positives = 175/190 (92.11%), Query Frame = 1

Query: 11  PPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGTINFT 70
           PPSGRTNLASCVVAT+FLIF+IIV+LIVFF+VFKP++P I+V+AVQLPSF+++NGT+NFT
Sbjct: 7   PPSGRTNLASCVVATIFLIFIIIVVLIVFFSVFKPKEPIISVNAVQLPSFAISNGTVNFT 66

Query: 71  FSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQSFPL 130
           FSQYVSVKNPNKA FSHYDS+LQLLY G+Q+GFMFIPAGKI +G+TQYMAATF+V+SFPL
Sbjct: 67  FSQYVSVKNPNKAEFSHYDSTLQLLYGGNQVGFMFIPAGKIGSGRTQYMAATFAVESFPL 126

Query: 131 AAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRVAIAV 190
            A   SV  GPT ++G+ G+R+GP LEIES+M+MAGRVRVLHFFTHHV+A + CRV+IAV
Sbjct: 127 GAVPESV--GPTITDGLGGFRIGPNLEIESRMEMAGRVRVLHFFTHHVDARAVCRVSIAV 186

Query: 191 SDGSVLGFHC 201
           SDGSVLGFHC
Sbjct: 187 SDGSVLGFHC 194

BLAST of CSPI04G06460 vs. TAIR10
Match: AT4G23930.1 (AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 206.8 bits (525), Expect = 1.2e-53
Identity = 102/185 (55.14%), Positives = 146/185 (78.92%), Query Frame = 1

Query: 16  TNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGTINFTFSQYV 75
           +NLASC VAT+F++FLII  L V+ TVF+P+DP+I+V++V++PSFSVAN +++FTFSQ+ 
Sbjct: 6   SNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSFTFSQFS 65

Query: 76  SVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQSFPLAAPVA 135
           +V+NPN+A+FSHY++ +QL Y G++IG+ F+PAG+I++G+T+ M ATFSVQSFPLAA  +
Sbjct: 66  AVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAASS 125

Query: 136 SVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRVAIAVSDGSV 195
           S  +   F    N  R G  +EIESK++MAGRVRVL  FTH + A  +CR+AI+ SDGS+
Sbjct: 126 SQISAAQF---QNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSI 185

Query: 196 LGFHC 201
           +   C
Sbjct: 186 VAVRC 187

BLAST of CSPI04G06460 vs. TAIR10
Match: AT1G64450.1 (AT1G64450.1 Glycine-rich protein family)

HSP 1 Score: 190.7 bits (483), Expect = 9.0e-49
Identity = 94/136 (69.12%), Positives = 118/136 (86.76%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           M  PH       SGRTNLASC VATVFL+ L++V+L+V+FTVFKP+DPKI+V+AVQLPSF
Sbjct: 1   MAKPHDR--RRSSGRTNLASCAVATVFLLILLVVLLVVYFTVFKPKDPKISVNAVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           +V+N T NF+FSQYV+V+NPN+A FSHYDSS+QLLYSG+Q+GFMFIPAGKID+G+ QYMA
Sbjct: 61  AVSNNTANFSFSQYVAVRNPNRAVFSHYDSSIQLLYSGNQVGFMFIPAGKIDSGRIQYMA 120

Query: 121 ATFSVQSFPLAAPVAS 137
           ATF+V SFP++ P +S
Sbjct: 121 ATFTVHSFPISPPSSS 134

BLAST of CSPI04G06460 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 80.1 bits (196), Expect = 1.7e-15
Identity = 60/193 (31.09%), Positives = 101/193 (52.33%), Query Frame = 1

Query: 15  RTNLASCVVATVFLIFLI-IVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGTI------ 74
           + N   C+  T+ LI LI IVI+I+ FT+FKP+ P   + +V +     +   +      
Sbjct: 48  KRNCKICICFTILLILLIAIVIVILAFTLFKPKRPTTTIDSVTVDRLQASVNPLLLKVLL 107

Query: 75  NFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQS 134
           N T +  +S+KNPN+  FS+  SS  L Y G  IG   +PA +I A +T  +  T ++ +
Sbjct: 108 NLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAARKTVPLNITLTLMA 167

Query: 135 FPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRVA 194
             L +    +      S+ M G     ++ + + + + G+V VL  F   V+++SSC ++
Sbjct: 168 DRLLSETQLL------SDVMAG-----VIPLNTFVKVTGKVTVLKIFKIKVQSSSSCDLS 227

Query: 195 IAVSDGSVLGFHC 201
           I+VSD +V   HC
Sbjct: 228 ISVSDRNVTSQHC 229

BLAST of CSPI04G06460 vs. TAIR10
Match: AT2G46150.1 (AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 61.6 bits (148), Expect = 6.3e-10
Identity = 49/198 (24.75%), Positives = 87/198 (43.94%), Query Frame = 1

Query: 10  HPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGT--- 69
           H    R   + CV AT  ++  I++ L+  FTVF+ +DP I ++ V +       GT   
Sbjct: 30  HRSRNRIKCSICVTATSLILTTIVLTLV--FTVFRVKDPIIKMNGVMVNGLDSVTGTNQV 89

Query: 70  ----INFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAAT 129
                N +    VSVKNPN ASF + +++  + Y G+ +G      GK    +T  M  T
Sbjct: 90  QLLGTNISMIVDVSVKNPNTASFKYSNTTTDIYYKGTLVGEAHGLPGKARPHRTSRMNVT 149

Query: 130 FSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATS 189
             +    L   ++  G G   S          ++ + S   + G+V+++     HV    
Sbjct: 150 VDIM---LDRILSDPGLGREISR-------SGLVNVWSYTRVGGKVKIMGIVKKHVTVKM 209

Query: 190 SCRVAIAVSDGSVLGFHC 201
           +C +A+ ++  ++    C
Sbjct: 210 NCTMAVNITGQAIQDVDC 215

BLAST of CSPI04G06460 vs. NCBI nr
Match: gi|449462527|ref|XP_004148992.1| (PREDICTED: uncharacterized protein LOC101209064 [Cucumis sativus])

HSP 1 Score: 394.4 bits (1012), Expect = 1.2e-106
Identity = 200/200 (100.00%), Positives = 200/200 (100.00%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF
Sbjct: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA
Sbjct: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120

Query: 121 ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 180
           ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA
Sbjct: 121 ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 180

Query: 181 TSSCRVAIAVSDGSVLGFHC 201
           TSSCRVAIAVSDGSVLGFHC
Sbjct: 181 TSSCRVAIAVSDGSVLGFHC 200

BLAST of CSPI04G06460 vs. NCBI nr
Match: gi|659102111|ref|XP_008451958.1| (PREDICTED: uncharacterized protein LOC103493106 [Cucumis melo])

HSP 1 Score: 368.6 bits (945), Expect = 6.8e-99
Identity = 189/200 (94.50%), Positives = 194/200 (97.00%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           MGNPH  G+ PPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF
Sbjct: 1   MGNPHIPGEDPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SV NGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKI+AGQTQYMA
Sbjct: 61  SVTNGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIEAGQTQYMA 120

Query: 121 ATFSVQSFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 180
           ATFSVQSFPLA+PVA+VGAGPTFS GMNGYRVGPILEIESKMDMAGRVRVL+FFTHHVEA
Sbjct: 121 ATFSVQSFPLASPVAAVGAGPTFSGGMNGYRVGPILEIESKMDMAGRVRVLNFFTHHVEA 180

Query: 181 TSSCRVAIAVSDGSVLGFHC 201
            SSCRVAIAVSDGSVLGFHC
Sbjct: 181 ISSCRVAIAVSDGSVLGFHC 200

BLAST of CSPI04G06460 vs. NCBI nr
Match: gi|802733826|ref|XP_012086703.1| (PREDICTED: uncharacterized protein LOC105645659 [Jatropha curcas])

HSP 1 Score: 310.1 bits (793), Expect = 2.9e-81
Identity = 157/200 (78.50%), Positives = 179/200 (89.50%), Query Frame = 1

Query: 11  PPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGTINFT 70
           PPSGRTNLASC+VAT+FLIF+II+ILIVFFTVFKP+DPKI+V+AVQLPSFSV+N T+NFT
Sbjct: 7   PPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVNFT 66

Query: 71  FSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQSFPL 130
           FSQYVSVKNPNKASFSHYDS+LQLLYSGSQ+GFMFIPAGKIDAG+TQYMAATF+VQSFPL
Sbjct: 67  FSQYVSVKNPNKASFSHYDSTLQLLYSGSQVGFMFIPAGKIDAGRTQYMAATFAVQSFPL 126

Query: 131 -AAPVASVGAGPTFSEGM---------NGYRVGPILEIESKMDMAGRVRVLHFFTHHVEA 190
            ++P A+V  GPTF+ G+          GYRVGP +EIES++ MAGRVRVLH FTHHVEA
Sbjct: 127 SSSPDAAVNVGPTFAGGVLPGGYPSVNGGYRVGPTMEIESRIHMAGRVRVLHIFTHHVEA 186

Query: 191 TSSCRVAIAVSDGSVLGFHC 201
            + CRVAIAVSDGSVLGFHC
Sbjct: 187 KAGCRVAIAVSDGSVLGFHC 206

BLAST of CSPI04G06460 vs. NCBI nr
Match: gi|566162577|ref|XP_002304562.2| (proline-rich family protein [Populus trichocarpa])

HSP 1 Score: 305.4 bits (781), Expect = 7.1e-80
Identity = 156/211 (73.93%), Positives = 179/211 (84.83%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           M  PH     PPSGRTNLASC+VAT+FLIFL+I+ILIVFFTVFKP+DPKI+V++VQLPSF
Sbjct: 1   MSKPH----RPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SV+N T+NFTFSQYVSVKNPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKIDAG+TQYMA
Sbjct: 61  SVSNNTVNFTFSQYVSVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMA 120

Query: 121 ATFSVQSFPL-AAPVASVGAGPTFSEG----------MNGYRVGPILEIESKMDMAGRVR 180
           ATFSV+SFPL A+P A+V  GP F++G           NGYRVGP +EIES++ MAGRVR
Sbjct: 121 ATFSVESFPLSASPDAAVNVGPAFNDGGFGGGGQTGFNNGYRVGPTMEIESRIQMAGRVR 180

Query: 181 VLHFFTHHVEATSSCRVAIAVSDGSVLGFHC 201
           VLHFFTHH+E    CRV IAVSDGSVLGFHC
Sbjct: 181 VLHFFTHHLETKVGCRVVIAVSDGSVLGFHC 207

BLAST of CSPI04G06460 vs. NCBI nr
Match: gi|743825315|ref|XP_011022490.1| (PREDICTED: uncharacterized protein LOC105124255 [Populus euphratica])

HSP 1 Score: 303.9 bits (777), Expect = 2.1e-79
Identity = 155/211 (73.46%), Positives = 179/211 (84.83%), Query Frame = 1

Query: 1   MGNPHGHGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSF 60
           M  PH     PPSGRTNLASC+VAT+FLIFL+I+ILIVFFTVFKP+DPKI+V++VQLPSF
Sbjct: 1   MSKPH----RPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSF 60

Query: 61  SVANGTINFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMA 120
           SV+N T+NFTFSQYV+VKNPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKIDAG+TQYMA
Sbjct: 61  SVSNNTVNFTFSQYVAVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMA 120

Query: 121 ATFSVQSFPL-AAPVASVGAGPTFSEG----------MNGYRVGPILEIESKMDMAGRVR 180
           ATFSV+SFPL A+P A+V  GP F++G           NGYRVGP +EIES++ MAGRVR
Sbjct: 121 ATFSVESFPLSASPDAAVNVGPAFNDGGFGGGGQPGFNNGYRVGPTMEIESRIHMAGRVR 180

Query: 181 VLHFFTHHVEATSSCRVAIAVSDGSVLGFHC 201
           VLHFFTHH+E    CRV IAVSDGSVLGFHC
Sbjct: 181 VLHFFTHHLETKVGCRVVIAVSDGSVLGFHC 207

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KX26_CUCSA  8.1e-107  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_4G052640 PE=4 SV=1
A0A067JZ66_JATCU  2.0e-81   78.50     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20441 PE=4 SV=1
B9GXJ6_POPTR      5.0e-80   73.93     Proline-rich family protein OS=Populus trichocarpa GN=POPTR_0003s14110g PE=4 SV=...
B9RCU1_RICCO      7.2e-79   73.21     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1692280 PE=4 SV=1
F6HU27_VITVI      1.6e-78   76.84     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0025g01600 PE=4 SV=...

Match Name   E-value  Identity  Description
AT4G23930.1  1.2e-53  55.14     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G64450.1  9.0e-49  69.12     Glycine-rich protein family
AT3G54200.1  1.7e-15  31.09     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT2G46150.1  6.3e-10  24.75     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

Match Name                        E-value   Identity  Description
gi|449462527|ref|XP_004148992.1|  1.2e-106  100.00    PREDICTED: uncharacterized protein LOC101209064 [Cucumis sativus]
gi|659102111|ref|XP_008451958.1|  6.8e-99   94.50     PREDICTED: uncharacterized protein LOC103493106 [Cucumis melo]
gi|802733826|ref|XP_012086703.1|  2.9e-81   78.50     PREDICTED: uncharacterized protein LOC105645659 [Jatropha curcas]
gi|566162577|ref|XP_002304562.2|  7.1e-80   73.93     proline-rich family protein [Populus trichocarpa]
gi|743825315|ref|XP_011022490.1|  2.1e-79   73.46     PREDICTED: uncharacterized protein LOC105124255 [Populus euphratica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G06460.1  CSPI04G06460.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                              Source   Source Term     Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168         LEA_2                                                         coord: 75..184, score: 7.0
None       No IPR available                             PANTHER  PTHR31852       FAMILY NOT NAMED                                              coord: 11..200, score: 9.3E
None       No IPR available                             PANTHER  PTHR31852:SF22  LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 11..200, score: 9.3E
None       No IPR available                             unknown  SSF117070       LEA14-like                                                    coord: 30..143, score: 3.0
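
The alignment coordinates in the table above locate each annotated region on the 200-residue protein. As a minimal sketch, assuming the coordinates are 1-based, inclusive amino-acid positions (this page does not state the convention), the length of each region and the fraction of the protein it covers can be computed as follows. The names `REGIONS`, `span_length` and `PROTEIN_LENGTH` are illustrative.

```python
PROTEIN_LENGTH = 200  # residues, from the full-length BLAST self-hit

# (source term, start, end) copied from the InterPro annotation table.
REGIONS = [
    ("PF03168", 75, 184),
    ("PTHR31852", 11, 200),
    ("PTHR31852:SF22", 11, 200),
    ("SSF117070", 30, 143),
]


def span_length(start: int, end: int) -> int:
    """Number of residues in a region, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


for term, start, end in REGIONS:
    length = span_length(start, end)
    coverage = 100 * length / PROTEIN_LENGTH
    print(f"{term}: {length} residues ({coverage:.1f}% of the protein)")
```

Under that assumption the Pfam LEA_2 match (PF03168) spans 110 residues, slightly more than half of the protein.
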

The following gene(s) are paralogous to this gene:

None