CmaCh04G006810 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G006810
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein
Location: Cma_Chr04 : 3465559 .. 3467306 (+)
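
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Pattern for a location string such as "Cma_Chr04 : 3465559 .. 3467306 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (sequence id, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr04 : 3465559 .. 3467306 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1748 bases on the forward strand of Cma_Chr04.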
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGAAACATCACGACCACCACCCGCCATCAGGTCGCACAAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTTGTCATCATCGTCGTCCTCATCGTCTTTTTCACCGTCTTTAAGCCTCAGGATCCAAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACCGTTAATTTCACTTTCTCTCAGTACGTCTCTGTCAGAAACCCTAACAAAGCCTCTTTCTCTCACTATGACAGCTCACTCCAGCTCCTCTACTCTGGTTCCCAAATTGGATTCATGTTCATTCCTGCCGGGAAAATCGACTCCGGCCAGACGCAGTACATGGCAGCGACCTTCTCTGTCCAGTCATTCCCCTTGGCCGCTCCGGTCACTGCCATCGGAGCGGGACCGACTTACTCGGAGGGAATGAACGGGTACAGAATCGGACCGACACTGGAGATTGAATCGAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCACTGCTAATTCTTCTTCTTCTTGTTTAGTTCCGGAAAATTTGGTCAGTTAAGTGTTTAAATTCTCATACTCAAAGTGAAAAGAAGAAAAAAAAAAACTCCAAAAAGATTTTGAGTTCTATCCATGAGCCATTTGTGGAATTGAATTGTAGAGTGATCTGATCTTAGATTTTGCTTTGGCTTAAGCTCTGTTTCTGCACTGAAATTCATCAGAAATCGCTTTTATTTTTGTGGATTTGTGAGTTTTTTTTATTAGGGTAATGAACCTAATTTTGATTCCTAAGGATAAATCTATGGCCATAAAGGAATGTTGATGTGAACAAAGGATGATTGCAATTGAAACAAAGAACAAAGTGGGATTGTACTCATTTTCTTTGGACTTTTTTTTATTATATTTTTTTCAGTTAAACAGATTGGTCACTAAGCAAAGTTTATCATTAATTCAAAGTTGCATCTGGCTAAGGATTTGAATACTGACGTTAGGTAGTTGGATTTATTTGGGTCTTCATCTTAGTCCTTCTTGAATTAAAATTTTTATTGTTTTGAATTATTTCATTTTCTCTGGACCCAGACATTTTGTACTGACCTAAAAATACGAAATAAAAATATAATTTAAATAAGTACTTGATGCCATGTAGAGTTTCAAGAATTAGGGAATTGGTGAATTATTATTTTCTAAATAATGATAAGTAATAATTAAATAAATATATTTCTAATGTGAATCTTTGAAACTAAAATAGAATAAAATCAAACATGAAATTCTTTTTCAATGAGTAGAGGCTTTTCTTTAGTTGAATCAATGTGGGTAGACTACTAAACTTTAAACTTTATAATTAGTTATTAACAAGAGCTTATCTATTTGATTATGCTTAGTAAAATAAACGATAATCTAAGGCTAAGATTCTAATCTTTAAAAAGTTTCAATTTTGTTTACTGAAATTCAGATTCGTTTGAACTTATCTTCACTGTGGAGAAGGTGAGTGAGAGAAAATTGAATGAGTGGGGACTGTTTGGTCAGAGTAAATAAAAGTTTCAAATTTGGCTGCAACTGTGGGTGGAAATCGAGACGGCAATAGAGAGAAAAGCTCTCCGACCAAAGGGGTGGCGGAGACGACGGTGGGCGGCGGCGATGAGAGAGGAGAAGGGAAGAGATGGGTGTAG

mRNA sequence

ATGAGGAAACATCACGACCACCACCCGCCATCAGGTCGCACAAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTTGTCATCATCGTCGTCCTCATCGTCTTTTTCACCGTCTTTAAGCCTCAGGATCCAAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACCGTTAATTTCACTTTCTCTCAGTACGTCTCTGTCAGAAACCCTAACAAAGCCTCTTTCTCTCACTATGACAGCTCACTCCAGCTCCTCTACTCTGGTTCCCAAATTGGATTCATGTTCATTCCTGCCGGGAAAATCGACTCCGGCCAGACGCAGTACATGGCAGCGACCTTCTCTGTCCAGTCATTCCCCTTGGCCGCTCCGGTCACTGCCATCGGAGCGGGACCGACTTACTCGGAGGGAATGAACGGGTACAGAATCGGACCGACACTGGAGATTGAATCGAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTTTAAACAGATTGGTCACTAAGCAAAATTCGTTTGAACTTATCTTCACTGTGGAGAAGGTGAGTGAGAGAAAATTGAATGAGTGGGGACTGTTTGGTCAGATTTCAAATTTGGCTGCAACTGTGGGTGGAAATCGAGACGGCAATAGAGAGAAAAGCTCTCCGACCAAAGGGGTGGCGGAGACGACGGTGGGCGGCGGCGATGAGAGAGGAGAAGGGAAGAGATGGGTGTAG

Coding sequence (CDS)

ATGAGGAAACATCACGACCACCACCCGCCATCAGGTCGCACAAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTTGTCATCATCGTCGTCCTCATCGTCTTTTTCACCGTCTTTAAGCCTCAGGATCCAAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACCGTTAATTTCACTTTCTCTCAGTACGTCTCTGTCAGAAACCCTAACAAAGCCTCTTTCTCTCACTATGACAGCTCACTCCAGCTCCTCTACTCTGGTTCCCAAATTGGATTCATGTTCATTCCTGCCGGGAAAATCGACTCCGGCCAGACGCAGTACATGGCAGCGACCTTCTCTGTCCAGTCATTCCCCTTGGCCGCTCCGGTCACTGCCATCGGAGCGGGACCGACTTACTCGGAGGGAATGAACGGGTACAGAATCGGACCGACACTGGAGATTGAATCGAAGATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTTTAAACAGATTGGTCACTAAGCAAAATTCGTTTGAACTTATCTTCACTGTGGAGAAGGTGAGTGAGAGAAAATTGAATGAGTGGGGACTGTTTGGTCAGATTTCAAATTTGGCTGCAACTGTGGGTGGAAATCGAGACGGCAATAGAGAGAAAAGCTCTCCGACCAAAGGGGTGGCGGAGACGACGGTGGGCGGCGGCGATGAGAGAGGAGAAGGGAAGAGATGGGTGTAG

Protein sequence

MRKHHDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPLAAPVTAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIALNRLVTKQNSFELIFTVEKVSERKLNEWGLFGQISNLAATVGGNRDGNREKSSPTKGVAETTVGGGDERGEGKRWV
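
The protein sequence follows from the CDS by codon-by-codon translation. The sketch below assumes the standard genetic code and checks only the first ten codons and the last six codons copied from the CDS above; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest), matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGAGGAAACATCACGACCACCACCCGCCA"  # first 10 codons of the CDS
cds_end = "GGGAAGAGATGGGTGTAG"                # last 6 codons of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MRKHHDHHPP` and `GKRWV*`, which agree with the start and end of the listed protein sequence; the trailing `*` is the TAG stop codon, which is not shown in the protein.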
BLAST of CmaCh04G006810 vs. TrEMBL
Match: A0A0A0KX26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G052640 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.0e-89
Identity = 167/185 (90.27%), Positives = 180/185 (97.30%), Query Frame = 1

Query: 5   HDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 64
           H  HPPSGRTNLASC+VAT+FLIF+IIV+LIVFFTVFKPQDPKIAVSAVQLPSFSVANGT
Sbjct: 7   HGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 66

Query: 65  VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQ 124
           +NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKID+GQTQYMAATFSVQ
Sbjct: 67  INFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQ 126

Query: 125 SFPLAAPVTAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRV 184
           SFPLAAPV ++GAGPT+SEGMNGYR+GP LEIESKMDMAGRVRVLHFFTHHVEAT SCRV
Sbjct: 127 SFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRV 186

Query: 185 AIALN 190
           AIA++
Sbjct: 187 AIAVS 191
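
The percentages on each HSP statistics line are the matched-position counts divided by the alignment length. As a rough sketch of that arithmetic, assuming the `label = matches/length` layout seen in this report (`parse_fraction` and `percent` are hypothetical names):

```python
import re


def parse_fraction(stat_line: str, label: str) -> tuple[int, int]:
    """Extract (matches, alignment length) for a label such as 'Identity'."""
    match = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", stat_line)
    if match is None:
        raise ValueError(f"no {label!r} field in: {stat_line!r}")
    return int(match.group(1)), int(match.group(2))


def percent(matches: int, length: int) -> float:
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 167/185 (90.27%), Query Frame = 1"
matches, length = parse_fraction(line, "Identity")
print(percent(matches, length))  # identical residues over aligned positions
print(percent(180, length))      # conservative substitutions included
```

For the Cucumis sativus hit above, 167 identical residues over 185 aligned positions gives 90.27%, and 180 of 185 gives 97.30% (printed as 97.3).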

BLAST of CmaCh04G006810 vs. TrEMBL
Match: A0A067JZ66_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20441 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 9.1e-74
Identity = 145/191 (75.92%), Positives = 169/191 (88.48%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVNFT
Sbjct: 7   PPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVNFT 66

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKASFSHYDS+LQLLYSGSQ+GFMFIPAGKID+G+TQYMAATF+VQSFPL
Sbjct: 67  FSQYVSVKNPNKASFSHYDSTLQLLYSGSQVGFMFIPAGKIDAGRTQYMAATFAVQSFPL 126

Query: 129 -AAPVTAIGAGPTYSEGM---------NGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEA 188
            ++P  A+  GPT++ G+          GYR+GPT+EIES++ MAGRVRVLH FTHHVEA
Sbjct: 127 SSSPDAAVNVGPTFAGGVLPGGYPSVNGGYRVGPTMEIESRIHMAGRVRVLHIFTHHVEA 186

Query: 189 TLSCRVAIALN 190
              CRVAIA++
Sbjct: 187 KAGCRVAIAVS 197

BLAST of CmaCh04G006810 vs. TrEMBL
Match: B9GXJ6_POPTR (Proline-rich family protein OS=Populus trichocarpa GN=POPTR_0003s14110g PE=4 SV=2)

HSP 1 Score: 281.2 bits (718), Expect = 1.3e-72
Identity = 140/194 (72.16%), Positives = 169/194 (87.11%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIF++I++LIVFFTVFKP+DPKI+V++VQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKID+G+TQYMAATFSV+SF
Sbjct: 65  FTFSQYVSVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMAATFSVESF 124

Query: 127 PL-AAPVTAIGAGPTYSEG----------MNGYRIGPTLEIESKMDMAGRVRVLHFFTHH 186
           PL A+P  A+  GP +++G           NGYR+GPT+EIES++ MAGRVRVLHFFTHH
Sbjct: 125 PLSASPDAAVNVGPAFNDGGFGGGGQTGFNNGYRVGPTMEIESRIQMAGRVRVLHFFTHH 184

Query: 187 VEATLSCRVAIALN 190
           +E  + CRV IA++
Sbjct: 185 LETKVGCRVVIAVS 198

BLAST of CmaCh04G006810 vs. TrEMBL
Match: B9RCU1_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1692280 PE=4 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 2.9e-72
Identity = 141/192 (73.44%), Positives = 167/192 (86.98%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A+FSHYDS+LQLLYSGSQ+GFMFIPAGKI+SG+TQYMAATF+VQSF
Sbjct: 65  FTFSQYVSVKNPNRATFSHYDSTLQLLYSGSQVGFMFIPAGKIESGRTQYMAATFAVQSF 124

Query: 127 PL-AAPVTAIGAGPTYS--------EGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVE 186
           PL ++P  A+  GP ++           NG+R+GPT+EIES++ M GRVRVLH FTHHVE
Sbjct: 125 PLSSSPDAAVNVGPAFTGSGFPGVPGSSNGFRVGPTMEIESRIQMVGRVRVLHIFTHHVE 184

Query: 187 ATLSCRVAIALN 190
           A   CRVAIA++
Sbjct: 185 AKAECRVAIAVS 196

BLAST of CmaCh04G006810 vs. TrEMBL
Match: I1KPK7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G023500 PE=4 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 2.5e-71
Identity = 141/185 (76.22%), Positives = 163/185 (88.11%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASC+VATIFLIF++IV+LIV++T+FKPQDPKIAV+AVQLPSFSVANGTVNFT
Sbjct: 18  PPSGRTNLASCVVATIFLIFIVIVILIVYYTIFKPQDPKIAVNAVQLPSFSVANGTVNFT 77

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQY SVRNPN+A+FSHYDSSLQL+YSGSQ+GFMFIPAG+ID+G+TQYMAATFSVQSFPL
Sbjct: 78  FSQYASVRNPNRAAFSHYDSSLQLIYSGSQVGFMFIPAGEIDAGRTQYMAATFSVQSFPL 137

Query: 129 AAPVTAIGAGPTYSEGMN-----GYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCR 188
           +AP      GPT + G       G R+ PTLEIESK++MAGRV+VLHFFTHHV A   CR
Sbjct: 138 SAPPR---MGPTLANGDGVGFNYGLRVEPTLEIESKLEMAGRVKVLHFFTHHVYAKAGCR 197

BLAST of CmaCh04G006810 vs. TAIR10
Match: AT4G23930.1 (AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 196.4 bits (498), Expect = 2.1e-50
Identity = 99/174 (56.90%), Positives = 141/174 (81.03%), Query Frame = 1

Query: 14  TNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYV 73
           +NLASC VAT+F++F+II  L V+ TVF+P+DP+I+V++V++PSFSVAN +V+FTFSQ+ 
Sbjct: 6   SNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSFTFSQFS 65

Query: 74  SVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPLAAPVT 133
           +VRNPN+A+FSHY++ +QL Y G++IG+ F+PAG+I+SG+T+ M ATFSVQSFPLAA   
Sbjct: 66  AVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAA--- 125

Query: 134 AIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIA 188
           A  +  + ++  N  R G T+EIESK++MAGRVRVL  FTH + A  +CR+AI+
Sbjct: 126 ASSSQISAAQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAIS 176

BLAST of CmaCh04G006810 vs. TAIR10
Match: AT1G64450.1 (AT1G64450.1 Glycine-rich protein family)

HSP 1 Score: 196.1 bits (497), Expect = 2.8e-50
Identity = 94/131 (71.76%), Positives = 117/131 (89.31%), Query Frame = 1

Query: 1   MRKHHDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSV 60
           M K HD    SGRTNLASC VAT+FL+ +++V+L+V+FTVFKP+DPKI+V+AVQLPSF+V
Sbjct: 1   MAKPHDRRRSSGRTNLASCAVATVFLLILLVVLLVVYFTVFKPKDPKISVNAVQLPSFAV 60

Query: 61  ANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAAT 120
           +N T NF+FSQYV+VRNPN+A FSHYDSS+QLLYSG+Q+GFMFIPAGKIDSG+ QYMAAT
Sbjct: 61  SNNTANFSFSQYVAVRNPNRAVFSHYDSSIQLLYSGNQVGFMFIPAGKIDSGRIQYMAAT 120

Query: 121 FSVQSFPLAAP 132
           F+V SFP++ P
Sbjct: 121 FTVHSFPISPP 131

BLAST of CmaCh04G006810 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 63.2 bits (152), Expect = 2.8e-10
Identity = 55/192 (28.65%), Positives = 99/192 (51.56%), Query Frame = 1

Query: 13  RTNLASCIVATIFLIFVI-IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVA------NGTV 72
           + N   CI  TI LI +I IV++I+ FT+FKP+ P   + +V +     +         +
Sbjct: 48  KRNCKICICFTILLILLIAIVIVILAFTLFKPKRPTTTIDSVTVDRLQASVNPLLLKVLL 107

Query: 73  NFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQS 132
           N T +  +S++NPN+  FS+  SS  L Y G  IG   +PA +I + +T  +  T ++ +
Sbjct: 108 NLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAARKTVPLNITLTLMA 167

Query: 133 FPLAAPVTAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVA 192
             L +    +      S+ M G      + + + + + G+V VL  F   V+++ SC ++
Sbjct: 168 DRLLSETQLL------SDVMAG-----VIPLNTFVKVTGKVTVLKIFKIKVQSSSSCDLS 227

Query: 193 IAL-NRLVTKQN 197
           I++ +R VT Q+
Sbjct: 228 ISVSDRNVTSQH 228

BLAST of CmaCh04G006810 vs. TAIR10
Match: AT2G46150.1 (AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 55.8 bits (133), Expect = 4.5e-08
Identity = 46/186 (24.73%), Positives = 82/186 (44.09%), Query Frame = 1

Query: 8   HPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT--- 67
           H    R   + C+ AT  ++  I++ L+  FTVF+ +DP I ++ V +       GT   
Sbjct: 30  HRSRNRIKCSICVTATSLILTTIVLTLV--FTVFRVKDPIIKMNGVMVNGLDSVTGTNQV 89

Query: 68  ----VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAAT 127
                N +    VSV+NPN ASF + +++  + Y G+ +G      GK    +T  M  T
Sbjct: 90  QLLGTNISMIVDVSVKNPNTASFKYSNTTTDIYYKGTLVGEAHGLPGKARPHRTSRMNVT 149

Query: 128 FSVQSFPLAAPVTAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATL 187
             +    L   ++  G G   S           + + S   + G+V+++     HV   +
Sbjct: 150 VDIM---LDRILSDPGLGREISR-------SGLVNVWSYTRVGGKVKIMGIVKKHVTVKM 203

BLAST of CmaCh04G006810 vs. NCBI nr
Match: gi|449462527|ref|XP_004148992.1| (PREDICTED: uncharacterized protein LOC101209064 [Cucumis sativus])

HSP 1 Score: 337.0 bits (863), Expect = 2.9e-89
Identity = 167/185 (90.27%), Positives = 180/185 (97.30%), Query Frame = 1

Query: 5   HDHHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 64
           H  HPPSGRTNLASC+VAT+FLIF+IIV+LIVFFTVFKPQDPKIAVSAVQLPSFSVANGT
Sbjct: 7   HGDHPPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVANGT 66

Query: 65  VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQ 124
           +NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKID+GQTQYMAATFSVQ
Sbjct: 67  INFTFSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQTQYMAATFSVQ 126

Query: 125 SFPLAAPVTAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRV 184
           SFPLAAPV ++GAGPT+SEGMNGYR+GP LEIESKMDMAGRVRVLHFFTHHVEAT SCRV
Sbjct: 127 SFPLAAPVASVGAGPTFSEGMNGYRVGPILEIESKMDMAGRVRVLHFFTHHVEATSSCRV 186

Query: 185 AIALN 190
           AIA++
Sbjct: 187 AIAVS 191

BLAST of CmaCh04G006810 vs. NCBI nr
Match: gi|659102111|ref|XP_008451958.1| (PREDICTED: uncharacterized protein LOC103493106 [Cucumis melo])

HSP 1 Score: 321.2 bits (822), Expect = 1.6e-84
Identity = 160/181 (88.40%), Positives = 175/181 (96.69%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASC+VAT+FLIF+IIV+LIVFFTVFKPQDPKIAVSAVQLPSFSV NGT+NFT
Sbjct: 11  PPSGRTNLASCVVATVFLIFLIIVILIVFFTVFKPQDPKIAVSAVQLPSFSVTNGTINFT 70

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKI++GQTQYMAATFSVQSFPL
Sbjct: 71  FSQYVSVKNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIEAGQTQYMAATFSVQSFPL 130

Query: 129 AAPVTAIGAGPTYSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAL 188
           A+PV A+GAGPT+S GMNGYR+GP LEIESKMDMAGRVRVL+FFTHHVEA  SCRVAIA+
Sbjct: 131 ASPVAAVGAGPTFSGGMNGYRVGPILEIESKMDMAGRVRVLNFFTHHVEAISSCRVAIAV 190

Query: 189 N 190
           +
Sbjct: 191 S 191

BLAST of CmaCh04G006810 vs. NCBI nr
Match: gi|802733826|ref|XP_012086703.1| (PREDICTED: uncharacterized protein LOC105645659 [Jatropha curcas])

HSP 1 Score: 285.0 bits (728), Expect = 1.3e-73
Identity = 145/191 (75.92%), Positives = 169/191 (88.48%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASCIVATIFLIFVII++LIVFFTVFKP+DPKI+V+AVQLPSFSV+N TVNFT
Sbjct: 7   PPSGRTNLASCIVATIFLIFVIIIILIVFFTVFKPKDPKISVNAVQLPSFSVSNNTVNFT 66

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQYVSV+NPNKASFSHYDS+LQLLYSGSQ+GFMFIPAGKID+G+TQYMAATF+VQSFPL
Sbjct: 67  FSQYVSVKNPNKASFSHYDSTLQLLYSGSQVGFMFIPAGKIDAGRTQYMAATFAVQSFPL 126

Query: 129 -AAPVTAIGAGPTYSEGM---------NGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEA 188
            ++P  A+  GPT++ G+          GYR+GPT+EIES++ MAGRVRVLH FTHHVEA
Sbjct: 127 SSSPDAAVNVGPTFAGGVLPGGYPSVNGGYRVGPTMEIESRIHMAGRVRVLHIFTHHVEA 186

Query: 189 TLSCRVAIALN 190
              CRVAIA++
Sbjct: 187 KAGCRVAIAVS 197

BLAST of CmaCh04G006810 vs. NCBI nr
Match: gi|566162577|ref|XP_002304562.2| (proline-rich family protein [Populus trichocarpa])

HSP 1 Score: 281.2 bits (718), Expect = 1.9e-72
Identity = 140/194 (72.16%), Positives = 169/194 (87.11%), Query Frame = 1

Query: 7   HHPPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVN 66
           H PPSGRTNLASCIVATIFLIF++I++LIVFFTVFKP+DPKI+V++VQLPSFSV+N TVN
Sbjct: 5   HRPPSGRTNLASCIVATIFLIFLVIIILIVFFTVFKPKDPKISVNSVQLPSFSVSNNTVN 64

Query: 67  FTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSF 126
           FTFSQYVSV+NPN+A FSH+DS+LQLLYSGSQIGFMFIPAGKID+G+TQYMAATFSV+SF
Sbjct: 65  FTFSQYVSVKNPNRAVFSHFDSTLQLLYSGSQIGFMFIPAGKIDAGRTQYMAATFSVESF 124

Query: 127 PL-AAPVTAIGAGPTYSEG----------MNGYRIGPTLEIESKMDMAGRVRVLHFFTHH 186
           PL A+P  A+  GP +++G           NGYR+GPT+EIES++ MAGRVRVLHFFTHH
Sbjct: 125 PLSASPDAAVNVGPAFNDGGFGGGGQTGFNNGYRVGPTMEIESRIQMAGRVRVLHFFTHH 184

Query: 187 VEATLSCRVAIALN 190
           +E  + CRV IA++
Sbjct: 185 LETKVGCRVVIAVS 198

BLAST of CmaCh04G006810 vs. NCBI nr
Match: gi|1012093032|ref|XP_015955318.1| (PREDICTED: proline-rich receptor-like protein kinase PERK3 [Arachis duranensis])

HSP 1 Score: 280.4 bits (716), Expect = 3.2e-72
Identity = 143/185 (77.30%), Positives = 163/185 (88.11%), Query Frame = 1

Query: 9   PPSGRTNLASCIVATIFLIFVIIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFT 68
           PPSGRTNLASC+VATIFLIF+IIV+LIV+FTVFKPQDPKIAVSAVQLPSFSV NGTVNFT
Sbjct: 76  PPSGRTNLASCVVATIFLIFIIIVILIVYFTVFKPQDPKIAVSAVQLPSFSVVNGTVNFT 135

Query: 69  FSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDSGQTQYMAATFSVQSFPL 128
           FSQY SVRNPN+A+FSHYDSSLQLLYSG+Q+GFMFIPAG+ID+G+T+YMAATFSVQSFPL
Sbjct: 136 FSQYASVRNPNRAAFSHYDSSLQLLYSGTQVGFMFIPAGEIDAGRTKYMAATFSVQSFPL 195

Query: 129 AAPVTAIGAGPTYSEGMN-----GYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCR 188
           AAP  A+   PT   G       G R+ PT+EIESK++MAGRVRVLHFF+H V+AT  CR
Sbjct: 196 AAPPVAMTGMPTVMNGGGVGFNYGMRVQPTMEIESKLEMAGRVRVLHFFSHRVQATAGCR 255

The following BLAST results are available for this feature:

TrEMBL
Match Name | E-value | Identity | Description
A0A0A0KX26_CUCSA | 2.0e-89 | 90.27 | Uncharacterized protein OS=Cucumis sativus GN=Csa_4G052640 PE=4 SV=1
A0A067JZ66_JATCU | 9.1e-74 | 75.92 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20441 PE=4 SV=1
B9GXJ6_POPTR | 1.3e-72 | 72.16 | Proline-rich family protein OS=Populus trichocarpa GN=POPTR_0003s14110g PE=4 SV=...
B9RCU1_RICCO | 2.9e-72 | 73.44 | Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1692280 PE=4 SV=1
I1KPK7_SOYBN | 2.5e-71 | 76.22 | Uncharacterized protein OS=Glycine max GN=GLYMA_08G023500 PE=4 SV=1

TAIR10
Match Name | E-value | Identity | Description
AT4G23930.1 | 2.1e-50 | 56.90 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G64450.1 | 2.8e-50 | 71.76 | Glycine-rich protein family
AT3G54200.1 | 2.8e-10 | 28.65 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT2G46150.1 | 4.5e-08 | 24.73 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

NCBI nr
Match Name | E-value | Identity | Description
gi|449462527|ref|XP_004148992.1| | 2.9e-89 | 90.27 | PREDICTED: uncharacterized protein LOC101209064 [Cucumis sativus]
gi|659102111|ref|XP_008451958.1| | 1.6e-84 | 88.40 | PREDICTED: uncharacterized protein LOC103493106 [Cucumis melo]
gi|802733826|ref|XP_012086703.1| | 1.3e-73 | 75.92 | PREDICTED: uncharacterized protein LOC105645659 [Jatropha curcas]
gi|566162577|ref|XP_002304562.2| | 1.9e-72 | 72.16 | proline-rich family protein [Populus trichocarpa]
gi|1012093032|ref|XP_015955318.1| | 3.2e-72 | 77.30 | PREDICTED: proline-rich receptor-like protein kinase PERK3 [Arachis duranensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR004864 | LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0003674 | molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh04G006810.1 | CmaCh04G006810.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004864 | Late embryogenesis abundant protein, LEA-14 | PFAM | PF03168 | LEA_2 | coord: 73..182, score: 3.0
None | No IPR available | PANTHER | PTHR31852 | FAMILY NOT NAMED | coord: 7..189, score: 2.4E
None | No IPR available | PANTHER | PTHR31852:SF22 | LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN | coord: 7..189, score: 2.4E
None | No IPR available | unknown | SSF117070 | LEA14-like | coord: 28..147, score: 1.9

The following gene(s) are paralogous to this gene:

None