Cp4.1LG01g00540 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g00540
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein
Location: Cp4.1LG01 : 3538009 .. 3538605 (+)
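
As a quick consistency check, the length implied by the Location field can be computed from its coordinates. The sketch below assumes the `seqid : start .. end (strand)` layout shown above and inclusive, 1-based coordinates; the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its parts (hypothetical helper)."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are inclusive and 1-based, so length = end - start + 1.
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01 : 3538009 .. 3538605 (+)")
print(loc["length"])  # 597
```

Under that assumption the feature spans 597 nt, a multiple of three, which is what a 198-residue protein plus one stop codon would require.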
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGAAACACCACGACCACCACCCGCCGTCAGGTCGCACGAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTTGTCATCATCGTCGTCCTCATCGTCTTTTTCACCGTCTTTAAGCCTCAGGATCCAAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACCGTTAATTTCACTTTCTCTCAGTACGTCTCTGTTAGAAACCCTAACAAAGCCTCTTTCTCTCACTATGACAGCTCACTCCAGCTCCTCTACTCTGGTTCCCAAATTGGATTCATGTTCATTCCTGCCGGGAAAATCGACTCCGGCCAGACGCAGTACATGGCAGCGACCTTCTCCGTCCAGTCATTCCCCTTGGCCGCTCCGGCCACTGCCATCGGAGCGGGACCGACTTACTCGGAGGGAATGAACGGGTACAGAATCGGACCGACACTGGAGATTGAATCGAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCACTGCTAA

mRNA sequence

ATGAGGAAACACCACGACCACCACCCGCCGTCAGGTCGCACGAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTTGTCATCATCGTCGTCCTCATCGTCTTTTTCACCGTCTTTAAGCCTCAGGATCCAAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACCGTTAATTTCACTTTCTCTCAGTACGTCTCTGTTAGAAACCCTAACAAAGCCTCTTTCTCTCACTATGACAGCTCACTCCAGCTCCTCTACTCTGGTTCCCAAATTGGATTCATGTTCATTCCTGCCGGGAAAATCGACTCCGGCCAGACGCAGTACATGGCAGCGACCTTCTCCGTCCAGTCATTCCCCTTGGCCGCTCCGGCCACTGCCATCGGAGCGGGACCGACTTACTCGGAGGGAATGAACGGGTACAGAATCGGACCGACACTGGAGATTGAATCGAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCACTGCTAA

Coding sequence (CDS)

ATGAGGAAACACCACGACCACCACCCGCCGTCAGGTCGCACGAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTTGTCATCATCGTCGTCCTCATCGTCTTTTTCACCGTCTTTAAGCCTCAGGATCCAAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACCGTTAATTTCACTTTCTCTCAGTACGTCTCTGTTAGAAACCCTAACAAAGCCTCTTTCTCTCACTATGACAGCTCACTCCAGCTCCTCTACTCTGGTTCCCAAATTGGATTCATGTTCATTCCTGCCGGGAAAATCGACTCCGGCCAGACGCAGTACATGGCAGCGACCTTCTCCGTCCAGTCATTCCCCTTGGCCGCTCCGGCCACTGCCATCGGAGCGGGACCGACTTACTCGGAGGGAATGAACGGGTACAGAATCGGACCGACACTGGAGATTGAATCGAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCACTGCTAA

Protein sequence

MRKHHDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPLAAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
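
The relationship between the coding sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1; `translate` is a hypothetical helper written for illustration, not the pipeline that produced this record. Only the first 30 nt and last 15 nt of the CDS are used as inputs here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGAGGAAACACCACGACCACCACCCGCCG"))  # MRKHHDHHPP
print(translate("GGTTTCCACTGCTAA"))                 # GFHC
```

The two fragments reproduce the first ten and last four residues of the protein sequence, and the final codon `TAA` is the stop.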
BLAST of Cp4.1LG01g00540 vs. TrEMBL
Match: A0A0A0KX26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G052640 PE=4 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 6.3e-96
Identity = 177/194 (91.24%), Positives = 188/194 (96.91%), Query Frame = 1

Query: 5   HDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 64
           H  HPPSGRTNLASC+VAT+FLIF+IIV+LIVFFTVFKPQDPKIAVSAVQLPSFSVANGT
Sbjct: 7   HGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 66

Query: 65  VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQ 124
           +NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKID+GQTQYMAATFSVQ
Sbjct: 67  INFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQ 126

Query: 125 SFPLAAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRV 184
           SFPLAAP  ++GAGPT+SEGMNGYR+GP LEIESKMDMAGRVRVLHFFTHHVEAT SCRV
Sbjct: 127 SFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRV 186

Query: 185 AIAVSDGSVLGFHC 199
           AIAVSDGSVLGFHC
Sbjct: 187 AIAVSDGSVLGFHC 200
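
The percentages in an HSP summary line are simply the identical and positive counts divided by the alignment length. The sketch below recomputes them from the summary line of the hit above; `hsp_stats` is a hypothetical helper, and its pattern for the positives field is deliberately loose so that spelling variants of that label still match.

```python
import re

def hsp_stats(summary_line):
    """Recompute percent identity and positives from an HSP summary line (hypothetical helper)."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", summary_line)
    pos = re.search(r"Pos\w*\s*=\s*(\d+)/(\d+)", summary_line)
    if ident is None or pos is None:
        raise ValueError("line does not look like an HSP summary")
    n_ident, length = int(ident.group(1)), int(ident.group(2))
    n_pos = int(pos.group(1))
    return {
        "aligned_length": length,
        "pct_identity": round(100 * n_ident / length, 2),
        "pct_positives": round(100 * n_pos / length, 2),
    }

stats = hsp_stats("Identity = 177/194 (91.24%), Positives = 188/194 (96.91%), Query Frame = 1")
print(stats["pct_identity"], stats["pct_positives"])  # 91.24 96.91
```

Note that the denominator is the alignment length (194 columns here), not the full query length, so hits with gaps or partial coverage are not directly comparable by percentage alone.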

BLAST of Cp4.1LG01g00540 vs. TrEMBL
Match: A0A067JZ66_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20441 PE=4 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 5.8e-81
Identity = 156/200 (78.00%), Positives = 178/200 (89.00%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVNFT
Sbjct: 7   PPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVNFT 66

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKASFSHYDS+LQLLYSGSQ+GFMFIPAGKID+G+TQYMAATF+VQSFPL
Sbjct: 67  FSQYVSVKNPNKASFSHYDSTLQLLYSGSQVGFMFIPAGKIDAGRTQYMAATFAVQSFPL 126

Query: 129 -AAPATAIGAGPTYSEGM---------NGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEA 188
            ++P  A+  GPT++ G+          GYR+GPT+EIES++ MAGRVRVLH FTHHVEA
Sbjct: 127 SSSPDAAVNVGPTFAGGVLPGGYPSVNGGYRVGPTMEIESRIHMAGRVRVLHIFTHHVEA 186

Query: 189 TLSCRVAIAVSDGSVLGFHC 199
              CRVAIAVSDGSVLGFHC
Sbjct: 187 KAGCRVAIAVSDGSVLGFHC 206

BLAST of Cp4.1LG01g00540 vs. TrEMBL
Match: B9GXJ6_POPTR (Proline-rich family protein OS=Populus trichocarpa GN=POPTR_0003s14110g PE=4 SV=2)

HSP 1 Score: 304.3 bits (778), Expect = 1.1e-79
Identity = 151/203 (74.38%), Positives = 178/203 (87.68%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIF++I++LIVFFTVFKP+DPKI+V++VQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKID+G+TQYMAATFSV+SF
Sbjct: 65  FTFSQYVSVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMAATFSVESF 124

Query: 127 PL-AAPATAIGAGPTYSEG----------MNGYRIGPTLEIESKMDMAGRVRVLHFFTHH 186
           PL A+P  A+  GP +++G           NGYR+GPT+EIES++ MAGRVRVLHFFTHH
Sbjct: 125 PLSASPDAAVNVGPAFNDGGFGGGGQTGFNNGYRVGPTMEIESRIQMAGRVRVLHFFTHH 184

Query: 187 VEATLSCRVAIAVSDGSVLGFHC 199
           +E  + CRV IAVSDGSVLGFHC
Sbjct: 185 LETKVGCRVVIAVSDGSVLGFHC 207

BLAST of Cp4.1LG01g00540 vs. TrEMBL
Match: B9RCU1_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1692280 PE=4 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 2.4e-79
Identity = 152/201 (75.62%), Positives = 176/201 (87.56%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A+FSHYDS+LQLLYSGSQ+GFMFIPAGKI+SG+TQYMAATF+VQSF
Sbjct: 65  FTFSQYVSVKNPNRATFSHYDSTLQLLYSGSQVGFMFIPAGKIESGRTQYMAATFAVQSF 124

Query: 127 PL-AAPATAIGAGPTYS--------EGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVE 186
           PL ++P  A+  GP ++           NG+R+GPT+EIES++ M GRVRVLH FTHHVE
Sbjct: 125 PLSSSPDAAVNVGPAFTGSGFPGVPGSSNGFRVGPTMEIESRIQMVGRVRVLHIFTHHVE 184

Query: 187 ATLSCRVAIAVSDGSVLGFHC 199
           A   CRVAIAVSDGSVLGFHC
Sbjct: 185 AKAECRVAIAVSDGSVLGFHC 205

BLAST of Cp4.1LG01g00540 vs. TrEMBL
Match: F6HU27_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0025g01600 PE=4 SV=1)

HSP 1 Score: 299.7 bits (766), Expect = 2.7e-78
Identity = 147/190 (77.37%), Positives = 174/190 (91.58%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASC+VATIFLIF+IIVVLIVFF+VFKP++P I+V+AVQLPSF+++NGTVNFT
Sbjct: 7   PPSGRTNLASCVVATIFLIFIIIVVLIVFFSVFKPKEPIISVNAVQLPSFAISNGTVNFT 66

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKA FSHYDS+LQLLY G+Q+GFMFIPAGKI SG+TQYMAATF+V+SFPL
Sbjct: 67  FSQYVSVKNPNKAEFSHYDSTLQLLYGGNQVGFMFIPAGKIGSGRTQYMAATFAVESFPL 126

Query: 129 AAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAV 188
            A   ++  GPT ++G+ G+RIGP LEIES+M+MAGRVRVLHFFTHHV+A   CRV+IAV
Sbjct: 127 GAVPESV--GPTITDGLGGFRIGPNLEIESRMEMAGRVRVLHFFTHHVDARAVCRVSIAV 186

Query: 189 SDGSVLGFHC 199
           SDGSVLGFHC
Sbjct: 187 SDGSVLGFHC 194

BLAST of Cp4.1LG01g00540 vs. TAIR10
Match: AT4G23930.1 (AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 208.4 bits (529), Expect = 4.1e-54
Identity = 104/185 (56.22%), Positives = 148/185 (80.00%), Query Frame = 1

Query: 14  TNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYV 73
           +NLASC VAT+F++F+II  L V+ TVF+P+DP+I+V++V++PSFSVAN +V+FTFSQ+ 
Sbjct: 6   SNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSFTFSQFS 65

Query: 74  SVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPLAAPAT 133
           +VRNPN+A+FSHY++ +QL Y G++IG+ F+PAG+I+SG+T+ M ATFSVQSFPLAA   
Sbjct: 66  AVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAA--- 125

Query: 134 AIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSV 193
           A  +  + ++  N  R G T+EIESK++MAGRVRVL  FTH + A  +CR+AI+ SDGS+
Sbjct: 126 ASSSQISAAQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSI 185

Query: 194 LGFHC 199
           +   C
Sbjct: 186 VAVRC 187

BLAST of Cp4.1LG01g00540 vs. TAIR10
Match: AT1G64450.1 (AT1G64450.1 Glycine-rich protein family)

HSP 1 Score: 196.8 bits (499), Expect = 1.2e-50
Identity = 94/134 (70.15%), Positives = 120/134 (89.55%), Query Frame = 1

Query: 1   MRKHHDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSV 60
           M K HD    SGRTNLASC VAT+FL+ +++V+L+V+FTVFKP+DPKI+V+AVQLPSF+V
Sbjct: 1   MAKPHDRRRSSGRTNLASCAVATVFLLILLVVLLVVYFTVFKPKDPKISVNAVQLPSFAV 60

Query: 61  ANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAAT 120
           +N T NF+FSQYV+VRNPN+A FSHYDSS+QLLYSG+Q+GFMFIPAGKIDSG+ QYMAAT
Sbjct: 61  SNNTANFSFSQYVAVRNPNRAVFSHYDSSIQLLYSGNQVGFMFIPAGKIDSGRIQYMAAT 120

Query: 121 FSVQSFPLAAPATA 135
           F+V SFP++ P+++
Sbjct: 121 FTVHSFPISPPSSS 134

BLAST of Cp4.1LG01g00540 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 72.4 bits (176), Expect = 3.5e-13
Identity = 57/193 (29.53%), Positives = 99/193 (51.30%), Query Frame = 1

Query: 13  RTNLASCIVATIFLIFVI-IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTV------ 72
           + N   CI  TI LI +I IV++I+ FT+FKP+ P   + +V +     +   +      
Sbjct: 48  KRNCKICICFTILLILLIAIVIVILAFTLFKPKRPTTTIDSVTVDRLQASVNPLLLKVLL 107

Query: 73  NFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQS 132
           N T +  +S++NPN+  FS+  SS  L Y G  IG   +PA +I + +T  +  T ++ +
Sbjct: 108 NLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAARKTVPLNITLTLMA 167

Query: 133 FPLAAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVA 192
             L +    +      S+ M G      + + + + + G+V VL  F   V+++ SC ++
Sbjct: 168 DRLLSETQLL------SDVMAG-----VIPLNTFVKVTGKVTVLKIFKIKVQSSSSCDLS 227

Query: 193 IAVSDGSVLGFHC 199
           I+VSD +V   HC
Sbjct: 228 ISVSDRNVTSQHC 229

BLAST of Cp4.1LG01g00540 vs. TAIR10
Match: AT2G46150.1 (AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 57.8 bits (138), Expect = 9.0e-09
Identity = 47/198 (23.74%), Positives = 86/198 (43.43%), Query Frame = 1

Query: 8   HPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT--- 67
           H    R   + C+ AT  ++  I++ L+  FTVF+ +DP I ++ V +       GT   
Sbjct: 30  HRSRNRIKCSICVTATSLILTTIVLTLV--FTVFRVKDPIIKMNGVMVNGLDSVTGTNQV 89

Query: 68  ----VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAAT 127
                N +    VSV+NPN ASF + +++  + Y G+ +G      GK    +T  M  T
Sbjct: 90  QLLGTNISMIVDVSVKNPNTASFKYSNTTTDIYYKGTLVGEAHGLPGKARPHRTSRMNVT 149

Query: 128 FSVQSFPLAAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATL 187
             +    L    +  G G   S           + + S   + G+V+++     HV   +
Sbjct: 150 VDIM---LDRILSDPGLGREISR-------SGLVNVWSYTRVGGKVKIMGIVKKHVTVKM 209

Query: 188 SCRVAIAVSDGSVLGFHC 199
           +C +A+ ++  ++    C
Sbjct: 210 NCTMAVNITGQAIQDVDC 215

BLAST of Cp4.1LG01g00540 vs. NCBI nr
Match: gi|449462527|ref|XP_004148992.1| (PREDICTED: uncharacterized protein LOC101209064 [Cucumis sativus])

HSP 1 Score: 358.2 bits (918), Expect = 9.1e-96
Identity = 177/194 (91.24%), Positives = 188/194 (96.91%), Query Frame = 1

Query: 5   HDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 64
           H  HPPSGRTNLASC+VAT+FLIF+IIV+LIVFFTVFKPQDPKIAVSAVQLPSFSVANGT
Sbjct: 7   HGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 66

Query: 65  VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQ 124
           +NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKID+GQTQYMAATFSVQ
Sbjct: 67  INFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQ 126

Query: 125 SFPLAAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRV 184
           SFPLAAP  ++GAGPT+SEGMNGYR+GP LEIESKMDMAGRVRVLHFFTHHVEAT SCRV
Sbjct: 127 SFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRV 186

Query: 185 AIAVSDGSVLGFHC 199
           AIAVSDGSVLGFHC
Sbjct: 187 AIAVSDGSVLGFHC 200

BLAST of Cp4.1LG01g00540 vs. NCBI nr
Match: gi|659102111|ref|XP_008451958.1| (PREDICTED: uncharacterized protein LOC103493106 [Cucumis melo])

HSP 1 Score: 342.8 bits (878), Expect = 4.0e-91
Identity = 170/190 (89.47%), Positives = 183/190 (96.32%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASC+VAT+FLIF+IIV+LIVFFTVFKPQDPKIAVSAVQLPSFSV NGT+NFT
Sbjct: 11  PPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVTNGTINFT 70

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKI++GQTQYMAATFSVQSFPL
Sbjct: 71  FSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIEAGQTQYMAATFSVQSFPL 130

Query: 129 AAPATAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAV 188
           A+P  A+GAGPT+S GMNGYR+GP LEIESKMDMAGRVRVL+FFTHHVEA  SCRVAIAV
Sbjct: 131 ASPVAAVGAGPTFSGGMNGYRVGPILEIESKMDMAGRVRVLNFFTHHVEAISSCRVAIAV 190

Query: 189 SDGSVLGFHC 199
           SDGSVLGFHC
Sbjct: 191 SDGSVLGFHC 200

BLAST of Cp4.1LG01g00540 vs. NCBI nr
Match: gi|802733826|ref|XP_012086703.1| (PREDICTED: uncharacterized protein LOC105645659 [Jatropha curcas])

HSP 1 Score: 308.5 bits (789), Expect = 8.3e-81
Identity = 156/200 (78.00%), Positives = 178/200 (89.00%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVNFT
Sbjct: 7   PPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVNFT 66

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKASFSHYDS+LQLLYSGSQ+GFMFIPAGKID+G+TQYMAATF+VQSFPL
Sbjct: 67  FSQYVSVKNPNKASFSHYDSTLQLLYSGSQVGFMFIPAGKIDAGRTQYMAATFAVQSFPL 126

Query: 129 -AAPATAIGAGPTYSEGM---------NGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEA 188
            ++P  A+  GPT++ G+          GYR+GPT+EIES++ MAGRVRVLH FTHHVEA
Sbjct: 127 SSSPDAAVNVGPTFAGGVLPGGYPSVNGGYRVGPTMEIESRIHMAGRVRVLHIFTHHVEA 186

Query: 189 TLSCRVAIAVSDGSVLGFHC 199
              CRVAIAVSDGSVLGFHC
Sbjct: 187 KAGCRVAIAVSDGSVLGFHC 206

BLAST of Cp4.1LG01g00540 vs. NCBI nr
Match: gi|566162577|ref|XP_002304562.2| (proline-rich family protein [Populus trichocarpa])

HSP 1 Score: 304.3 bits (778), Expect = 1.6e-79
Identity = 151/203 (74.38%), Positives = 178/203 (87.68%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIF++I++LIVFFTVFKP+DPKI+V++VQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKID+G+TQYMAATFSV+SF
Sbjct: 65  FTFSQYVSVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMAATFSVESF 124

Query: 127 PL-AAPATAIGAGPTYSEG----------MNGYRIGPTLEIESKMDMAGRVRVLHFFTHH 186
           PL A+P  A+  GP +++G           NGYR+GPT+EIES++ MAGRVRVLHFFTHH
Sbjct: 125 PLSASPDAAVNVGPAFNDGGFGGGGQTGFNNGYRVGPTMEIESRIQMAGRVRVLHFFTHH 184

Query: 187 VEATLSCRVAIAVSDGSVLGFHC 199
           +E  + CRV IAVSDGSVLGFHC
Sbjct: 185 LETKVGCRVVIAVSDGSVLGFHC 207

BLAST of Cp4.1LG01g00540 vs. NCBI nr
Match: gi|255537817|ref|XP_002509975.1| (PREDICTED: uncharacterized protein LOC8288025 [Ricinus communis])

HSP 1 Score: 303.1 bits (775), Expect = 3.5e-79
Identity = 152/201 (75.62%), Positives = 176/201 (87.56%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A+FSHYDS+LQLLYSGSQ+GFMFIPAGKI+SG+TQYMAATF+VQSF
Sbjct: 65  FTFSQYVSVKNPNRATFSHYDSTLQLLYSGSQVGFMFIPAGKIESGRTQYMAATFAVQSF 124

Query: 127 PL-AAPATAIGAGPTYS--------EGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVE 186
           PL ++P  A+  GP ++           NG+R+GPT+EIES++ M GRVRVLH FTHHVE
Sbjct: 125 PLSSSPDAAVNVGPAFTGSGFPGVPGSSNGFRVGPTMEIESRIQMVGRVRVLHIFTHHVE 184

Query: 187 ATLSCRVAIAVSDGSVLGFHC 199
           A   CRVAIAVSDGSVLGFHC
Sbjct: 185 AKAECRVAIAVSDGSVLGFHC 205

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name         E-value   Identity (%)   Description
A0A0A0KX26_CUCSA   6.3e-96   91.24          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G052640 PE=4 SV=1
A0A067JZ66_JATCU   5.8e-81   78.00          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20441 PE=4 SV=1
B9GXJ6_POPTR       1.1e-79   74.38          Proline-rich family protein OS=Populus trichocarpa GN=POPTR_0003s14110g PE=4 SV=...
B9RCU1_RICCO       2.4e-79   75.62          Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1692280 PE=4 SV=1
F6HU27_VITVI       2.7e-78   77.37          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0025g01600 PE=4 SV=...

vs. TAIR10
Match Name         E-value   Identity (%)   Description
AT4G23930.1        4.1e-54   56.22          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G64450.1        1.2e-50   70.15          Glycine-rich protein family
AT3G54200.1        3.5e-13   29.53          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT2G46150.1        9.0e-09   23.74          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

vs. NCBI nr
Match Name                         E-value   Identity (%)   Description
gi|449462527|ref|XP_004148992.1|   9.1e-96   91.24          PREDICTED: uncharacterized protein LOC101209064 [Cucumis sativus]
gi|659102111|ref|XP_008451958.1|   4.0e-91   89.47          PREDICTED: uncharacterized protein LOC103493106 [Cucumis melo]
gi|802733826|ref|XP_012086703.1|   8.3e-81   78.00          PREDICTED: uncharacterized protein LOC105645659 [Jatropha curcas]
gi|566162577|ref|XP_002304562.2|   1.6e-79   74.38          proline-rich family protein [Populus trichocarpa]
gi|255537817|ref|XP_002509975.1|   3.5e-79   75.62          PREDICTED: uncharacterized protein LOC8288025 [Ricinus communis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004864   LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0016021       integral component of membrane
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g00540.1   Cp4.1LG01g00540.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                               Source    Source Term      Source Description                                             Alignment
IPR004864   Late embryogenesis abundant protein, LEA-14   PFAM      PF03168          LEA_2                                                          coord: 73..182; score: 1.7
None        No IPR available                              PANTHER   PTHR31852        FAMILY NOT NAMED                                               coord: 7..198; score: 1.3E
None        No IPR available                              PANTHER   PTHR31852:SF22   LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN   coord: 7..198; score: 1.3E
None        No IPR available                              unknown   SSF117070        LEA14-like                                                     coord: 28..147; score: 8.3

The following gene(s) are paralogous to this gene:

None