Cla97C02G045060 (gene) Watermelon (97103) v2

Name: Cla97C02G045060
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Late embryogenesis abundant protein, LEA-14
Location: Cla97Chr02 : 33195615 .. 33196187 (-)
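
The location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with a coding sequence whose length is a multiple of three; `parse_location` is a hypothetical helper written for this layout, not part of any database API.

```python
import re

LOCATION = "Cla97Chr02 : 33195615 .. 33196187 (-)"

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location(LOCATION)
length = end - start + 1  # assumption: both endpoints are included
print(chrom, strand, length)
```

Under that assumption the gene spans 573 bp on the minus strand of Cla97Chr02, i.e. 191 codons including the stop codon.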
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGCTCCCAGTACAAAATTACTGCGAAACATTTGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCCCAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTACTCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTTGAGAATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGGACTGAGAAAATGAACCTAACGCTGACGATGATGGCGGACCGGATGCTGGCGAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTAAACTTCCGATCAGTACTTTCGCTCGGTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAATGCCAATACCGGACGAAGCTCTGA

mRNA sequence

ATGGCCGCTCCCAGTACAAAATTACTGCGAAACATTTGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCCCAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTACTCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTTGAGAATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGGACTGAGAAAATGAACCTAACGCTGACGATGATGGCGGACCGGATGCTGGCGAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTAAACTTCCGATCAGTACTTTCGCTCGGTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAATGCCAATACCGGACGAAGCTCTGA

Coding sequence (CDS)

ATGGCCGCTCCCAGTACAAAATTACTGCGAAACATTTGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCCCAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTACTCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTTGAGAATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGGACTGAGAAAATGAACCTAACGCTGACGATGATGGCGGACCGGATGCTGGCGAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTAAACTTCCGATCAGTACTTTCGCTCGGTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAATGCCAATACCGGACGAAGCTCTGA

Protein sequence

MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTLTMMADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
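
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. This is a minimal sketch: `translate` is a hypothetical helper, the codon table is built from the conventional TCAG ordering, and only the first seven and last five codons of the CDS are translated here rather than the whole sequence.

```python
BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCGCTCCCAGTACAAAA"))  # first seven codons of the CDS
print(translate("CGGACGAAGCTCTGA"))        # last five codons, ending in TGA
```

The two fragments translate to MAAPSTK and RTKL, which match the start and end of the protein sequence listed above.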
BLAST of Cla97C02G045060 vs. NCBI nr
Match: XP_011658424.1 (PREDICTED: uncharacterized protein LOC105435999 [Cucumis sativus] >KGN47220.1 hypothetical protein Csa_6G212910 [Cucumis sativus])

HSP 1 Score: 247.7 bits (631), Expect = 3.2e-62
Identity = 130/189 (68.78%), Positives = 146/189 (77.25%), Query Frame = 0

Query: 2   AAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSLXXXXX 61
           AAP++KLL NIC+ + L            AFTVFKPK+PII VDSVSLLDLNVS+     
Sbjct: 3   AAPASKLLPNICLTLFLSLILLLLFSLILAFTVFKPKQPIIVVDSVSLLDLNVSITDGVH 62

Query: 62  XXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTLTMM 121
                 VDL+V+NPNKV FEYS+STAVV YRGE+VGEAPIP GRL  +GTEKMNLTLT+M
Sbjct: 63  LSLSLNVDLTVQNPNKVGFEYSESTAVVIYRGEKVGEAPIPGGRLPGKGTEKMNLTLTIM 122

Query: 122 ADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGD 181
            DRML KSEVFSDVVSG+LPISTFARL GKVKV+ V KIHVVAS+SCDL ID+ N S GD
Sbjct: 123 GDRMLGKSEVFSDVVSGQLPISTFARLPGKVKVMNVLKIHVVASTSCDLIIDVKNESFGD 182

Query: 182 QQCQYRTKL 191
           Q CQYRT L
Sbjct: 183 QLCQYRTTL 191
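
The percentages on an HSP statistics line are simply the match counts divided by the alignment length. As a sketch, the hypothetical helper `hsp_percentages` below recomputes them from a string that mirrors the statistics line of the hit above; the regular expression is an assumption about that line's layout, not a general BLAST parser.

```python
import re

STATS = "Identity = 130/189 (68.78%), Positives = 146/189 (77.25%), Query Frame = 0"

def hsp_percentages(line):
    """Return {label: percent} recomputed from each 'label = matches/length' pair."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", line)
    return {label: round(100 * int(n) / int(d), 2) for label, n, d in pairs}

print(hsp_percentages(STATS))
```

Identity counts exact residue matches; positives additionally count substitutions marked with `+` in the alignment midline, so the second percentage is never lower than the first.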

BLAST of Cla97C02G045060 vs. NCBI nr
Match: XP_008465308.1 (PREDICTED: uncharacterized protein LOC103502964 [Cucumis melo])

HSP 1 Score: 244.6 bits (623), Expect = 2.7e-61
Identity = 139/190 (73.16%), Positives = 161/190 (84.74%), Query Frame = 0

Query: 1   MAAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSLXXXX 60
           MAAP++KLLRN CI ++ XXXXXXXXXXXX FTVFKP+RPII VDSVSLLDLNV+L    
Sbjct: 1   MAAPASKLLRNFCITLVXXXXXXXXXXXXXXFTVFKPQRPIIVVDSVSLLDLNVALTDGV 60

Query: 61  XXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTLTM 120
                  VDL+VENPNKVAFEYS+STAVV YRGE+VGEAPIP GRL  +GT+KMNLTLT+
Sbjct: 61  DLNLSINVDLTVENPNKVAFEYSKSTAVVIYRGEKVGEAPIPGGRLPGKGTKKMNLTLTI 120

Query: 121 MADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIG 180
           M +RML +SEVFSDVVSG+L IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS G
Sbjct: 121 MGERMLGRSEVFSDVVSGQLSISTLARLAGKVKVMGVVKIHVVASTSCDLIIDVKNGSFG 180

Query: 181 DQQCQYRTKL 191
           DQ CQ+RT++
Sbjct: 181 DQLCQFRTRV 190

BLAST of Cla97C02G045060 vs. NCBI nr
Match: XP_023512272.1 (uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 236.5 bits (602), Expect = 7.5e-59
Identity = 132/189 (69.84%), Positives = 145/189 (76.72%), Query Frame = 0

Query: 1   MAAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSL---X 60
           MAAPS K LR+ICI +LL            AFT FKPKRP IAVDSVSLLDLN+SL    
Sbjct: 1   MAAPSRK-LRSICIPVLLSVTLLVISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAAR 60

Query: 61  XXXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLT 120
                     +DLSVENPNKVAFEYS STAVVSYRGEE+GEAPIPAGRL A+ TEKMNLT
Sbjct: 61  LSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPAGRLPADRTEKMNLT 120

Query: 121 LTMMADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNG 180
           LTMMADR+LAKSE+FSD +SG++PI+ F RLSG VKVIGVFKIHVVASSSCD TI I N 
Sbjct: 121 LTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDFTIGIGNR 180

Query: 181 SIGDQQCQY 187
           SI DQ+C Y
Sbjct: 181 SIKDQKCHY 188

BLAST of Cla97C02G045060 vs. NCBI nr
Match: XP_022944105.1 (uncharacterized protein LOC111448649 [Cucurbita moschata])

HSP 1 Score: 233.4 bits (594), Expect = 6.3e-58
Identity = 130/189 (68.78%), Positives = 145/189 (76.72%), Query Frame = 0

Query: 1   MAAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSL---X 60
           MAAPS K LR+ICI +LL            AFT FKPKRP IAVDSVSLLDLN+SL    
Sbjct: 1   MAAPSRK-LRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAAR 60

Query: 61  XXXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLT 120
                     +DLS+ENPNKVAFEYS +TAVVSYRGEE+GEAPIPAG L A+ TEKMNLT
Sbjct: 61  LSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPAGWLPADRTEKMNLT 120

Query: 121 LTMMADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNG 180
           LTMMADR+LAKSE+FSD +SG++PI+ F RLSG VKVIGVFKIHVVASSSCDLTI I N 
Sbjct: 121 LTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNR 180

Query: 181 SIGDQQCQY 187
           SI DQ+C Y
Sbjct: 181 SIEDQKCHY 188

BLAST of Cla97C02G045060 vs. NCBI nr
Match: XP_022986213.1 (uncharacterized protein LOC111484029 [Cucurbita maxima])

HSP 1 Score: 231.5 bits (589), Expect = 2.4e-57
Identity = 130/189 (68.78%), Positives = 143/189 (75.66%), Query Frame = 0

Query: 1   MAAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSL---X 60
           M APS K LR+ICI +LL            AFT FKPKRP IAVDSVSLLDLN+SL    
Sbjct: 1   MVAPSRK-LRSICIPVLLSVTLLVISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAAR 60

Query: 61  XXXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLT 120
                     +DLSVENPNKVAFEYS STAVVSYRGEE+GE PIPAGRL A+ TEKMNLT
Sbjct: 61  LSVDLNLFLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEVPIPAGRLLADRTEKMNLT 120

Query: 121 LTMMADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNG 180
           L MMADR+LAKSE+FSD +SG++PI+ F RLSG VKVIGVFKIHVVASSSCDLTI I N 
Sbjct: 121 LKMMADRLLAKSELFSDAMSGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNR 180

Query: 181 SIGDQQCQY 187
           SI DQ+C Y
Sbjct: 181 SIEDQKCHY 188

BLAST of Cla97C02G045060 vs. TrEMBL
Match: tr|A0A0A0KBX8|A0A0A0KBX8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G212910 PE=4 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 2.1e-62
Identity = 130/189 (68.78%), Positives = 146/189 (77.25%), Query Frame = 0

Query: 2   AAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSLXXXXX 61
           AAP++KLL NIC+ + L            AFTVFKPK+PII VDSVSLLDLNVS+     
Sbjct: 3   AAPASKLLPNICLTLFLSLILLLLFSLILAFTVFKPKQPIIVVDSVSLLDLNVSITDGVH 62

Query: 62  XXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTLTMM 121
                 VDL+V+NPNKV FEYS+STAVV YRGE+VGEAPIP GRL  +GTEKMNLTLT+M
Sbjct: 63  LSLSLNVDLTVQNPNKVGFEYSESTAVVIYRGEKVGEAPIPGGRLPGKGTEKMNLTLTIM 122

Query: 122 ADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGD 181
            DRML KSEVFSDVVSG+LPISTFARL GKVKV+ V KIHVVAS+SCDL ID+ N S GD
Sbjct: 123 GDRMLGKSEVFSDVVSGQLPISTFARLPGKVKVMNVLKIHVVASTSCDLIIDVKNESFGD 182

Query: 182 QQCQYRTKL 191
           Q CQYRT L
Sbjct: 183 QLCQYRTTL 191

BLAST of Cla97C02G045060 vs. TrEMBL
Match: tr|A0A1S3CNL0|A0A1S3CNL0_CUCME (uncharacterized protein LOC103502964 OS=Cucumis melo OX=3656 GN=LOC103502964 PE=4 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 1.8e-61
Identity = 139/190 (73.16%), Positives = 161/190 (84.74%), Query Frame = 0

Query: 1   MAAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSLXXXX 60
           MAAP++KLLRN CI ++ XXXXXXXXXXXX FTVFKP+RPII VDSVSLLDLNV+L    
Sbjct: 1   MAAPASKLLRNFCITLVXXXXXXXXXXXXXXFTVFKPQRPIIVVDSVSLLDLNVALTDGV 60

Query: 61  XXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTLTM 120
                  VDL+VENPNKVAFEYS+STAVV YRGE+VGEAPIP GRL  +GT+KMNLTLT+
Sbjct: 61  DLNLSINVDLTVENPNKVAFEYSKSTAVVIYRGEKVGEAPIPGGRLPGKGTKKMNLTLTI 120

Query: 121 MADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIG 180
           M +RML +SEVFSDVVSG+L IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS G
Sbjct: 121 MGERMLGRSEVFSDVVSGQLSISTLARLAGKVKVMGVVKIHVVASTSCDLIIDVKNGSFG 180

Query: 181 DQQCQYRTKL 191
           DQ CQ+RT++
Sbjct: 181 DQLCQFRTRV 190

BLAST of Cla97C02G045060 vs. TrEMBL
Match: tr|A0A2P5ESV2|A0A2P5ESV2_9ROSA (Late embryogenesis abundant protein OS=Trema orientalis OX=63057 GN=TorRG33x02_156750 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 2.9e-43
Identity = 102/192 (53.12%), Positives = 141/192 (73.44%), Query Frame = 0

Query: 2   AAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSL---XX 61
           A P  K  R++CI     XXXXXXXXXXX  TVFKPKRP+  VDSVSL DLN  L     
Sbjct: 24  AIPRPKRRRSLCIGTCAVXXXXXXXXXXXXLTVFKPKRPVTTVDSVSLKDLNADLDIRRL 83

Query: 62  XXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTL 121
                    VDLS++NPNKV F+Y  S+AV+ YRG++VG+  IP G +SA+ T+++N+TL
Sbjct: 84  RVDLNVTLDVDLSIKNPNKVGFKYRNSSAVLIYRGDQVGDVAIPGGEISADETKRVNVTL 143

Query: 122 TMMADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGS 181
           T+MADR+L++S+V+SDV +G LP+ST  R+SG+V ++G+FKIHVV+++ CD T++++N S
Sbjct: 144 TLMADRLLSQSQVYSDVFAGALPLSTHTRVSGRVTILGIFKIHVVSTTWCDFTVNVSNRS 203

Query: 182 IGDQQCQYRTKL 191
           + DQ C Y+TKL
Sbjct: 204 VSDQSCTYKTKL 215

BLAST of Cla97C02G045060 vs. TrEMBL
Match: tr|A0A2P5BG70|A0A2P5BG70_PARAD (Late embryogenesis abundant protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_242160 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 1.9e-42
Identity = 100/192 (52.08%), Positives = 140/192 (72.92%), Query Frame = 0

Query: 2   AAPSTKLLRNICIAILLXXXXXXXXXXXXAFTVFKPKRPIIAVDSVSLLDLNVSL---XX 61
           A P  K  R++CI      XXXXXXXXXX  TVFKPKRP+  VDSVSL DL+  L     
Sbjct: 24  AIPRPKRRRSLCIGTCAVIXXXXXXXXXXXLTVFKPKRPVTTVDSVSLKDLHADLDIRRL 83

Query: 62  XXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTEKMNLTL 121
                    VDLS++NPNKV F+Y  S+AV+ YRG++VG+  IP G++SA+ T ++N+TL
Sbjct: 84  RVDLNVTLDVDLSIKNPNKVGFKYRNSSAVLIYRGDQVGDVAIPGGQMSADETRRVNVTL 143

Query: 122 TMMADRMLAKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGS 181
            +MADR+L++S+V+SDV +G LP+ST AR+SG+V ++G+FKIHVV+++ CD T++++N S
Sbjct: 144 ILMADRLLSQSQVYSDVFAGALPLSTHARVSGRVTILGIFKIHVVSTTWCDFTVNVSNRS 203

Query: 182 IGDQQCQYRTKL 191
           + DQ C Y+TKL
Sbjct: 204 VSDQSCTYKTKL 215

BLAST of Cla97C02G045060 vs. TrEMBL
Match: tr|A0A2P5ACQ8|A0A2P5ACQ8_9ROSA (Late embryogenesis abundant protein OS=Trema orientalis OX=63057 GN=TorRG33x02_353600 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 4.2e-42
Identity = 85/161 (52.80%), Positives = 123/161 (76.40%), Query Frame = 0

Query: 33  TVFKPKRPIIAVDSVSLLDLNVSL---XXXXXXXXXXXVDLSVENPNKVAFEYSQSTAVV 92
           TVFKPKRP+  VDSVSL DLN  L              VDLS++NPNKV+F+Y  S+AV+
Sbjct: 55  TVFKPKRPVTTVDSVSLKDLNADLDIRRLRVDLNVTLDVDLSIKNPNKVSFKYRNSSAVL 114

Query: 93  SYRGEEVGEAPIPAGRLSAEGTEKMNLTLTMMADRMLAKSEVFSDVVSGKLPISTFARLS 152
            YRG++VG+  IP G +SA+ T+++N+TLT+MADR+L++S+V+SDV +G LP+ST  R+S
Sbjct: 115 IYRGDQVGDVAIPGGEISADETKRVNVTLTLMADRLLSQSQVYSDVFAGALPLSTHTRVS 174

Query: 153 GKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL 191
           G+V ++G+FKIHVV+++ CD T++++N S+ DQ C Y+TKL
Sbjct: 175 GRVTILGIFKIHVVSTTWCDFTVNVSNRSVSDQSCTYKTKL 215

BLAST of Cla97C02G045060 vs. Swiss-Prot
Match: sp|Q6DST1|Y1465_ARATH (Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana OX=3702 GN=At1g64065 PE=2 SV=1)

HSP 1 Score: 46.2 bits (108), Expect = 4.7e-04
Identity = 35/119 (29.41%), Positives = 62/119 (52.10%), Query Frame = 0

Query: 69  DLSVENPNKVAFEYSQSTAVVSYRGE-EVGEAPIPAGRLSAEGTEKM-NLTLTMMADRML 128
           D+S+ N N  AFE+  ST  V Y     VGE  I   R+ A  T ++  + + + + R+L
Sbjct: 97  DISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLL 156

Query: 129 AKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQ 186
              ++  D+  G L + + A + G++KV+G  K   V+  SC + +++T   I +  C+
Sbjct: 157 DTKDLDKDLRLGFLELRSVAEVRGRIKVLG-RKRWKVSVMSCTMRLNLTGRFIQNLLCE 214

BLAST of Cla97C02G045060 vs. TAIR10
Match: AT3G54200.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 154.8 bits (390), Expect = 5.2e-38
Identity = 73/161 (45.34%), Positives = 117/161 (72.67%), Query Frame = 0

Query: 33  TVFKPKRPIIAVDSVSLLDLNVS---LXXXXXXXXXXXVDLSVENPNKVAFEYSQSTAVV 92
           T+FKPKRP   +DSV++  L  S   L           VDLS++NPN++ F Y  S+A++
Sbjct: 75  TLFKPKRPTTTIDSVTVDRLQASVNPLLLKVLLNLTLNVDLSLKNPNRIGFSYDSSSALL 134

Query: 93  SYRGEEVGEAPIPAGRLSAEGTEKMNLTLTMMADRMLAKSEVFSDVVSGKLPISTFARLS 152
           +YRG+ +GEAP+PA R++A  T  +N+TLT+MADR+L+++++ SDV++G +P++TF +++
Sbjct: 135 NYRGQVIGEAPLPANRIAARKTVPLNITLTLMADRLLSETQLLSDVMAGVIPLNTFVKVT 194

Query: 153 GKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL 191
           GKV V+ +FKI V +SSSCDL+I +++ ++  Q C+Y TKL
Sbjct: 195 GKVTVLKIFKIKVQSSSSCDLSISVSDRNVTSQHCKYSTKL 235

BLAST of Cla97C02G045060 vs. TAIR10
Match: AT2G46150.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 93.2 bits (230), Expect = 1.8e-19
Identity = 54/164 (32.93%), Positives = 92/164 (56.10%), Query Frame = 0

Query: 32  FTVFKPKRPIIAVDSVSLLDLN----VSLXXXXXXXXXXXVDLSVENPNKVAFEYSQSTA 91
           FTVF+ K PII ++ V +  L+     +            VD+SV+NPN  +F+YS +T 
Sbjct: 58  FTVFRVKDPIIKMNGVMVNGLDSVTGTNQVQLLGTNISMIVDVSVKNPNTASFKYSNTTT 117

Query: 92  VVSYRGEEVGEAPIPAGRLSAEGTEKMNLTLTMMADRMLAKSEVFSDVV-SGKLPISTFA 151
            + Y+G  VGEA    G+     T +MN+T+ +M DR+L+   +  ++  SG + + ++ 
Sbjct: 118 DIYYKGTLVGEAHGLPGKARPHRTSRMNVTVDIMLDRILSDPGLGREISRSGLVNVWSYT 177

Query: 152 RLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL 191
           R+ GKVK++G+ K HV    +C + ++IT  +I D  C+ +  L
Sbjct: 178 RVGGKVKIMGIVKKHVTVKMNCTMAVNITGQAIQDVDCKKKIDL 221

BLAST of Cla97C02G045060 vs. TAIR10
Match: AT3G05975.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 88.6 bits (218), Expect = 4.6e-18
Identity = 50/160 (31.25%), Positives = 86/160 (53.75%), Query Frame = 0

Query: 34  VFKPKRPIIAV--DSVSLLDLNVSLXXXXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSY 93
           VFKPK PI+     +V  +  N+SL           +++ ++NPN   FEY     +V Y
Sbjct: 30  VFKPKHPILQTVSSTVDGISTNISLPYEVQLNFTLTLEMLLKNPNVADFEYKTVENLVYY 89

Query: 94  RGEEVGEAPIPAGRLSAEGTEKMNLTLTMMADRMLAK-SEVFSDVVSGKLPISTFARLSG 153
           R   VG   +P+  L A+G+  +   L +  D+ +A   ++  DV+ GK+ + T A++ G
Sbjct: 90  RDTLVGNLTLPSSTLPAKGSVLLPCPLFLQLDKFVANLGDIVQDVLHGKIVMETRAKMPG 149

Query: 154 KVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL 191
           K+ ++G+FKI + + S C+L +   +  + DQ C  +TKL
Sbjct: 150 KITLLGIFKIPLDSISHCNLVLGFPSMVVEDQVCDLKTKL 189

BLAST of Cla97C02G045060 vs. TAIR10
Match: AT4G23930.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 62.4 bits (150), Expect = 3.5e-10
Identity = 42/160 (26.25%), Positives = 77/160 (48.12%), Query Frame = 0

Query: 33  TVFKPKRPIIAVDSVSLLDLNVSLXXXXXXXXXXXVDLSVENPNKVAFEYSQSTAVVSYR 92
           TVF+P+ P I+V SV +   +V+               +V NPN+ AF +  +   + Y 
Sbjct: 31  TVFRPRDPEISVTSVKVPSFSVANSSVSFTFSQFS---AVRNPNRAAFSHYNNVIQLFYY 90

Query: 93  GEEVGEAPIPAGRLSAEGTEKMNLTLTMMADRMLAKSE--------VFSDVVSGKLPIST 152
           G  +G   +PAG + +  T++M  T ++ +  + A S           SD     + I +
Sbjct: 91  GNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAASSSQISAAQFQNSDRSGSTVEIES 150

Query: 153 FARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC 185
              ++G+V+V+G+F   + A  +C + I  ++GSI   +C
Sbjct: 151 KLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSIVAVRC 187

BLAST of Cla97C02G045060 vs. TAIR10
Match: AT1G64065.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 46.2 bits (108), Expect = 2.6e-05
Identity = 35/119 (29.41%), Positives = 62/119 (52.10%), Query Frame = 0

Query: 69  DLSVENPNKVAFEYSQSTAVVSYRGE-EVGEAPIPAGRLSAEGTEKM-NLTLTMMADRML 128
           D+S+ N N  AFE+  ST  V Y     VGE  I   R+ A  T ++  + + + + R+L
Sbjct: 97  DISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLL 156

Query: 129 AKSEVFSDVVSGKLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQ 186
              ++  D+  G L + + A + G++KV+G  K   V+  SC + +++T   I +  C+
Sbjct: 157 DTKDLDKDLRLGFLELRSVAEVRGRIKVLG-RKRWKVSVMSCTMRLNLTGRFIQNLLCE 214

The following BLAST results are available for this feature:

Match Name      E-value  Identity (%)  Description
XP_011658424.1  3.2e-62  68.78         PREDICTED: uncharacterized protein LOC105435999 [Cucumis sativus] >KGN47220.1 hy...
XP_008465308.1  2.7e-61  73.16         PREDICTED: uncharacterized protein LOC103502964 [Cucumis melo]
XP_023512272.1  7.5e-59  69.84         uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo]
XP_022944105.1  6.3e-58  68.78         uncharacterized protein LOC111448649 [Cucurbita moschata]
XP_022986213.1  2.4e-57  68.78         uncharacterized protein LOC111484029 [Cucurbita maxima]

Match Name                      E-value  Identity (%)  Description
tr|A0A0A0KBX8|A0A0A0KBX8_CUCSA  2.1e-62  68.78         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G212910 PE=4 SV=1
tr|A0A1S3CNL0|A0A1S3CNL0_CUCME  1.8e-61  73.16         uncharacterized protein LOC103502964 OS=Cucumis melo OX=3656 GN=LOC103502964 PE=...
tr|A0A2P5ESV2|A0A2P5ESV2_9ROSA  2.9e-43  53.13         Late embryogenesis abundant protein OS=Trema orientalis OX=63057 GN=TorRG33x02_1...
tr|A0A2P5BG70|A0A2P5BG70_PARAD  1.9e-42  52.08         Late embryogenesis abundant protein OS=Parasponia andersonii OX=3476 GN=PanWU01x...
tr|A0A2P5ACQ8|A0A2P5ACQ8_9ROSA  4.2e-42  52.80         Late embryogenesis abundant protein OS=Trema orientalis OX=63057 GN=TorRG33x02_3...

Match Name             E-value  Identity (%)  Description
sp|Q6DST1|Y1465_ARATH  4.7e-04  29.41         Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana OX=3702 GN...

Match Name   E-value  Identity (%)  Description
AT3G54200.1  5.2e-38  45.34         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT2G46150.1  1.8e-19  32.93         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G05975.1  4.6e-18  31.25         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT4G23930.1  3.5e-10  26.25         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT1G64065.1  2.6e-05  29.41         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
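
Summary rows like these are often filtered by E-value before further analysis. The sketch below uses the five TAIR10 hits listed above; the cutoff of 1e-10 and the helper name `significant` are illustrative assumptions, not thresholds used by the database.

```python
# (match name, E-value, percent identity) for the TAIR10 hits
HITS = [
    ("AT3G54200.1", 5.2e-38, 45.34),
    ("AT2G46150.1", 1.8e-19, 32.93),
    ("AT3G05975.1", 4.6e-18, 31.25),
    ("AT4G23930.1", 3.5e-10, 26.25),
    ("AT1G64065.1", 2.6e-05, 29.41),
]

def significant(hits, max_evalue=1e-10):
    """Keep hits whose E-value is at or below the cutoff, smallest E-value first."""
    kept = [h for h in hits if h[1] <= max_evalue]
    return sorted(kept, key=lambda h: h[1])

for name, evalue, identity in significant(HITS):
    print(f"{name}\t{evalue:.1e}\t{identity:.2f}")
```

With this assumed cutoff, AT4G23930.1 (3.5e-10) and AT1G64065.1 (2.6e-05) are dropped and the three strongest Arabidopsis hits remain.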
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
cellular_component  GO:0046658      anchored component of plasma membrane
cellular_component  GO:0009506      plasmodesma
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G045060.1  Cla97C02G045060.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                      Source       Source Term         Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA_2 subgroup  PFAM         PF03168             LEA_2                                                         coord: 70..162; e-value: 3.4E-12; score: 46.7
None       No IPR available                                     GENE3D       G3DSA:2.60.40.1820                                                                coord: 32..174; e-value: 3.1E-5; score: 25.6
None       No IPR available                                     PANTHER      PTHR31852:SF43      LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 10..189
None       No IPR available                                     PANTHER      PTHR31852           FAMILY NOT NAMED                                              coord: 10..189
None       No IPR available                                     SUPERFAMILY  SSF117070           LEA14-like                                                    coord: 23..126
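
The `coord` ranges can be compared to see how the domain predictions nest within the protein. This is a minimal sketch that assumes the coordinates are 1-based, inclusive residue positions; `span_length` and `contains` are hypothetical helpers introduced only for illustration.

```python
# (source, source term, start, end) taken from the coord fields above
DOMAINS = [
    ("PFAM", "PF03168", 70, 162),
    ("GENE3D", "G3DSA:2.60.40.1820", 32, 174),
    ("PANTHER", "PTHR31852:SF43", 10, 189),
    ("PANTHER", "PTHR31852", 10, 189),
    ("SUPERFAMILY", "SSF117070", 23, 126),
]

def span_length(domain):
    """Number of residues covered, counting both endpoints."""
    return domain[3] - domain[2] + 1

def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[2] <= inner[2] and inner[3] <= outer[3]

pfam, gene3d, panther = DOMAINS[0], DOMAINS[1], DOMAINS[2]
for d in DOMAINS:
    print(d[0], d[1], span_length(d))
print(contains(gene3d, pfam), contains(panther, gene3d))
```

Under these assumptions the Pfam LEA_2 match (93 residues) sits inside the Gene3D match (143 residues), which in turn sits inside the PANTHER family match (180 residues).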