Cla97C03G050810 (gene) Watermelon (97103) v2

NameCla97C03G050810
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionLate embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
LocationCla97Chr03 : 76713 .. 77450 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATAATAATGTTGCTGCTAATCAACTTGATCGACTTCCAAGCCAAAACAAAGGATCACGCCGGGTGGCGTTCTCCGATTCCCTTCCTAAACACCGACCTACATCCAACGATAGTAACTCTGAGCATTGCAACAAATGTTGTTGTCCTCGTTTATTTGCTTGTTGCGCTTGGATTTGTCTTGGGGTTTTCGGAATTGTTGTTGCCATTCTTATCCTTGGCGTAATATTTATGTCCTTCCTTCAATCAGGATTGCCAGAAATCACCGTAAGAATGTTGAACTTGTCCAAATTCGAGATTGGAAATTCCACAAATCAGAATAATGTTGTGTTAAATGCAAAAGTAGATATGTCAATCGAGATGAGGAACAAGAACGAAAAAATAGAGTTGAGTTATAGCAATATTGTGGTGAATTTGGCGTCAGAAGATGTGAAATTGGGAAGGAGTGTGATTCCCGGTTTCTCTCACGATCCTGGAAATACCACATACTTTAATGTAACTATGAATGTTGTGGGAGATTCCACAGATAAAGACAATGTATTGCAACTAGAAGATGATCGAAAAAGGGTGCAAATGAATGTGCATGTGACAATGGAGGCTACAGTTGATTTTCATGTTGGGATATTCAAATTGAACAAGGTGCCAATCCATGTAGCATGTGATTTTCGACAGTTCCTTCTTTTATATCGAATAAACGAGCCCCCATGTAATATTAGAATGTTTCCCAACCTCAGGTAA

mRNA sequence

ATGAATAATAATGTTGCTGCTAATCAACTTGATCGACTTCCAAGCCAAAACAAAGGATCACGCCGGGTGGCGTTCTCCGATTCCCTTCCTAAACACCGACCTACATCCAACGATAGTAACTCTGAGCATTGCAACAAATGTTGTTGTCCTCGTTTATTTGCTTGTTGCGCTTGGATTTGTCTTGGGGTTTTCGGAATTGTTGTTGCCATTCTTATCCTTGGCGTAATATTTATGTCCTTCCTTCAATCAGGATTGCCAGAAATCACCGTAAGAATGTTGAACTTGTCCAAATTCGAGATTGGAAATTCCACAAATCAGAATAATGTTGTGTTAAATGCAAAAGTAGATATGTCAATCGAGATGAGGAACAAGAACGAAAAAATAGAGTTGAGTTATAGCAATATTGTGGTGAATTTGGCGTCAGAAGATGTGAAATTGGGAAGGAGTGTGATTCCCGGTTTCTCTCACGATCCTGGAAATACCACATACTTTAATGTAACTATGAATGTTGTGGGAGATTCCACAGATAAAGACAATGTATTGCAACTAGAAGATGATCGAAAAAGGGTGCAAATGAATGTGCATGTGACAATGGAGGCTACAGTTGATTTTCATGTTGGGATATTCAAATTGAACAAGGTGCCAATCCATGTAGCATGTGATTTTCGACAGTTCCTTCTTTTATATCGAATAAACGAGCCCCCATGTAATATTAGAATGTTTCCCAACCTCAGGTAA

Coding sequence (CDS)

ATGAATAATAATGTTGCTGCTAATCAACTTGATCGACTTCCAAGCCAAAACAAAGGATCACGCCGGGTGGCGTTCTCCGATTCCCTTCCTAAACACCGACCTACATCCAACGATAGTAACTCTGAGCATTGCAACAAATGTTGTTGTCCTCGTTTATTTGCTTGTTGCGCTTGGATTTGTCTTGGGGTTTTCGGAATTGTTGTTGCCATTCTTATCCTTGGCGTAATATTTATGTCCTTCCTTCAATCAGGATTGCCAGAAATCACCGTAAGAATGTTGAACTTGTCCAAATTCGAGATTGGAAATTCCACAAATCAGAATAATGTTGTGTTAAATGCAAAAGTAGATATGTCAATCGAGATGAGGAACAAGAACGAAAAAATAGAGTTGAGTTATAGCAATATTGTGGTGAATTTGGCGTCAGAAGATGTGAAATTGGGAAGGAGTGTGATTCCCGGTTTCTCTCACGATCCTGGAAATACCACATACTTTAATGTAACTATGAATGTTGTGGGAGATTCCACAGATAAAGACAATGTATTGCAACTAGAAGATGATCGAAAAAGGGTGCAAATGAATGTGCATGTGACAATGGAGGCTACAGTTGATTTTCATGTTGGGATATTCAAATTGAACAAGGTGCCAATCCATGTAGCATGTGATTTTCGACAGTTCCTTCTTTTATATCGAATAAACGAGCCCCCATGTAATATTAGAATGTTTCCCAACCTCAGGTAA

Protein sequence

MNNNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWICLGVFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQNNVVLNAKVDMSIEMRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNVLQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRMFPNLR
BLAST of Cla97C03G050810 vs. NCBI nr
Match: XP_008457557.1 (PREDICTED: uncharacterized protein LOC103497223 [Cucumis melo])

HSP 1 Score: 385.6 bits (989), Expect = 1.3e-103
Identity = 201/245 (82.04%), Postives = 215/245 (87.76%), Query Frame = 0

Query: 3   NNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWICLG 62
           NN+ AN+LDRLPSQNKGSRRVAFSDSLPKHR    D NSE  NK CCPRLFACCAWIC+G
Sbjct: 2   NNIGANRLDRLPSQNKGSRRVAFSDSLPKHRAAFGDGNSERHNK-CCPRLFACCAWICVG 61

Query: 63  VFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQ--NNVVLNAKVDMSIE 122
           +FGIVVAILILGVIF+SFLQSGLPEITVRMLNLS FEI NSTNQ  NN +LNAK+DMSIE
Sbjct: 62  IFGIVVAILILGVIFVSFLQSGLPEITVRMLNLSNFEIKNSTNQNDNNALLNAKLDMSIE 121

Query: 123 MRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNV 182
           MRNKNEKIELSYS+IVVNL SEDVKLGRSVIP FSH PGNTTY NVTMNV   STDKDN+
Sbjct: 122 MRNKNEKIELSYSSIVVNLVSEDVKLGRSVIPSFSHSPGNTTYLNVTMNVERVSTDKDNL 181

Query: 183 LQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRM 242
            QLEDDRK+VQM+V V MEA V FHVGIF L  VPIHVACDF+Q LL+YRINEPPCNIRM
Sbjct: 182 SQLEDDRKKVQMDVQVKMEAKVGFHVGIFNLKNVPIHVACDFQQTLLVYRINEPPCNIRM 241

Query: 243 FPNLR 246
           FPN+R
Sbjct: 242 FPNIR 245

BLAST of Cla97C03G050810 vs. NCBI nr
Match: XP_022949169.1 (uncharacterized protein LOC111452600 [Cucurbita moschata] >XP_023522969.1 uncharacterized protein LOC111787033 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 293.1 bits (749), Expect = 8.7e-76
Identity = 153/241 (63.49%), Postives = 190/241 (78.84%), Query Frame = 0

Query: 2   NNNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWICL 61
           +NN  A QLDR+PS +KG+RRVAFSDSLPKHR     S S    K     L A CAWICL
Sbjct: 3   DNNHVAIQLDRVPSTDKGARRVAFSDSLPKHR-----STSLRATKFVFSHLLAFCAWICL 62

Query: 62  GVFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQNNVVLNAKVDMSIEM 121
            VFGI + +LILGVIF+SFLQSGLPEITV+ML+LSK +I NSTNQN  VLN KV M+I++
Sbjct: 63  AVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNSTNQNVAVLNTKVRMAIDI 122

Query: 122 RNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNVL 181
           +NKNEK+ELSYS++ + L SE+++LGR+VIP FS +PGNTT  NVT+NV  DS D+D++ 
Sbjct: 123 KNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSIS 182

Query: 182 QLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRMF 241
            LEDDRK+ Q+ V +TM  +V FH+GIFKLNKVPIHV C+F+Q+LLLYR+ EPPC+I MF
Sbjct: 183 LLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMF 238

Query: 242 P 243
           P
Sbjct: 243 P 238

BLAST of Cla97C03G050810 vs. NCBI nr
Match: XP_022998792.1 (uncharacterized protein LOC111493353 [Cucurbita maxima])

HSP 1 Score: 291.2 bits (744), Expect = 3.3e-75
Identity = 153/241 (63.49%), Postives = 189/241 (78.42%), Query Frame = 0

Query: 2   NNNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWICL 61
           +NN  A QLDR+PS +KG+RRVAFSDSLPKHR     S S    K     LFA CAWICL
Sbjct: 3   DNNHVAIQLDRVPSTDKGARRVAFSDSLPKHR-----SASLRATKFVFSHLFAFCAWICL 62

Query: 62  GVFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQNNVVLNAKVDMSIEM 121
            VFGI + +LILGVIF+SFLQS LPEITV+ML+LSK +I NSTNQN  VLN KV M+I++
Sbjct: 63  AVFGIAITLLILGVIFVSFLQSSLPEITVKMLDLSKIQIQNSTNQNVAVLNTKVRMAIDI 122

Query: 122 RNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNVL 181
           RNKNEK+ELSYS++ + L SE+++LGR+VIP FS +PGNTT  NVT+ V  DS D+D++ 
Sbjct: 123 RNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLKVDRDSIDRDSIS 182

Query: 182 QLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRMF 241
            LEDDRK+ Q+ V +TM  +V FH+GIFKLNKVPIHV C+F+Q+LLLYR+ EPPC+I MF
Sbjct: 183 LLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMF 238

Query: 242 P 243
           P
Sbjct: 243 P 238

BLAST of Cla97C03G050810 vs. NCBI nr
Match: XP_022158431.1 (uncharacterized protein LOC111024923 [Momordica charantia])

HSP 1 Score: 268.1 bits (684), Expect = 3.0e-68
Identity = 148/245 (60.41%), Postives = 186/245 (75.92%), Query Frame = 0

Query: 1   MNNNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWIC 60
           MNNN          +++KG RRV FS+SLP HR TS DS ++     C  RLFA C  IC
Sbjct: 1   MNNNY---------NEDKGERRVYFSESLPTHRATS-DSGTK-----CRRRLFAYCGRIC 60

Query: 61  LGVFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQ--NNVVLNAKVDMS 120
           +G FGI++A+LI+ VIFMSFLQSGLPEI+++ L LSKFEI +STNQ  NN VL+A+VD+S
Sbjct: 61  IGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDIS 120

Query: 121 IEMRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKD 180
           + +RNKN+KIELSY +IVVN+AS+DVKLG+SVI GFSH PGNTTY NVT NVVGD  D++
Sbjct: 121 MTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRE 180

Query: 181 NVLQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVAC-DFRQFLLLYRINEPPCN 240
           N L++++++KRV+M   V MEA + FH GIF + KVPIHV C D +QFLL+ RI E  CN
Sbjct: 181 NALEIQEEKKRVEMVAQVRMEAIIGFHAGIFSIEKVPIHVRCDDVQQFLLVNRIKEASCN 230

Query: 241 IRMFP 243
           IRMFP
Sbjct: 241 IRMFP 230

BLAST of Cla97C03G050810 vs. NCBI nr
Match: XP_023521220.1 (uncharacterized protein LOC111784941 [Cucurbita pepo subsp. pepo] >XP_023524547.1 uncharacterized protein LOC111788445 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 169.9 bits (429), Expect = 1.1e-38
Identity = 78/126 (61.90%), Postives = 105/126 (83.33%), Query Frame = 0

Query: 117 MSIEMRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTD 176
           M+I+++NKNEK+ELSYS++ + L SE+++LGR+VIP FS +PGNTT  NVT+NV  DS D
Sbjct: 1   MAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSID 60

Query: 177 KDNVLQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPC 236
           +D++  LEDDRK+ Q+ V +TM  +V FH+GIFKLNKVPIHV C+F+Q+LLLYR+ EPPC
Sbjct: 61  RDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPC 120

Query: 237 NIRMFP 243
           +I MFP
Sbjct: 121 SITMFP 126

BLAST of Cla97C03G050810 vs. TrEMBL
Match: tr|A0A1S3C5S1|A0A1S3C5S1_CUCME (uncharacterized protein LOC103497223 OS=Cucumis melo OX=3656 GN=LOC103497223 PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 8.5e-104
Identity = 201/245 (82.04%), Postives = 215/245 (87.76%), Query Frame = 0

Query: 3   NNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWICLG 62
           NN+ AN+LDRLPSQNKGSRRVAFSDSLPKHR    D NSE  NK CCPRLFACCAWIC+G
Sbjct: 2   NNIGANRLDRLPSQNKGSRRVAFSDSLPKHRAAFGDGNSERHNK-CCPRLFACCAWICVG 61

Query: 63  VFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQ--NNVVLNAKVDMSIE 122
           +FGIVVAILILGVIF+SFLQSGLPEITVRMLNLS FEI NSTNQ  NN +LNAK+DMSIE
Sbjct: 62  IFGIVVAILILGVIFVSFLQSGLPEITVRMLNLSNFEIKNSTNQNDNNALLNAKLDMSIE 121

Query: 123 MRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNV 182
           MRNKNEKIELSYS+IVVNL SEDVKLGRSVIP FSH PGNTTY NVTMNV   STDKDN+
Sbjct: 122 MRNKNEKIELSYSSIVVNLVSEDVKLGRSVIPSFSHSPGNTTYLNVTMNVERVSTDKDNL 181

Query: 183 LQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRM 242
            QLEDDRK+VQM+V V MEA V FHVGIF L  VPIHVACDF+Q LL+YRINEPPCNIRM
Sbjct: 182 SQLEDDRKKVQMDVQVKMEAKVGFHVGIFNLKNVPIHVACDFQQTLLVYRINEPPCNIRM 241

Query: 243 FPNLR 246
           FPN+R
Sbjct: 242 FPNIR 245

BLAST of Cla97C03G050810 vs. TrEMBL
Match: tr|A0A1S2Y8L5|A0A1S2Y8L5_CICAR (uncharacterized protein LOC101510711 OS=Cicer arietinum OX=3827 GN=LOC101510711 PE=4 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.3e-24
Identity = 76/238 (31.93%), Postives = 129/238 (54.20%), Query Frame = 0

Query: 15  SQNKGSRRVAFSDSLPKHRPTSN-----------DSNSEHCNKCCCPRLFACCAWICLGV 74
           S  K  RRVAF      H P ++           D + EH + CC     ACCAW CL +
Sbjct: 18  SWQKSGRRVAFEVPSNHHHPNNSPSLNDIDTSIYDFDREHYHPCC----LACCAWSCLVM 77

Query: 75  FGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQNNVVLNAKVDMSIEMRN 134
           F  V+  L+ G+ +++FLQ+G+P++ VR  N+ KF++ NS +QN   ++A + + +   N
Sbjct: 78  FIFVITFLVFGISYLAFLQAGMPKVNVRTFNIIKFQVDNS-SQN---MDASISLGLRFSN 137

Query: 135 KNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNVLQL 194
           KNE+++L Y  + +++ S+ V LG++ + GFS  P N T  ++TM +   S +K +  +L
Sbjct: 138 KNEELKLLYGPLFLDVTSDGVLLGKTKLNGFSQMPRNDTDLDMTMTMNHASVNKYDADEL 197

Query: 195 EDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRMF 242
           + D    +M   V +   +  H+G  K+  VP   +C+  + + +     P CNI+MF
Sbjct: 198 KSDIMANEMVFDVFVSGNIGVHIGSLKMINVPFLTSCEQIKRMDVDFGRRPGCNIKMF 247

BLAST of Cla97C03G050810 vs. TrEMBL
Match: tr|A0A2P6QKY3|A0A2P6QKY3_ROSCH (Uncharacterized protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr5g0073561 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 3.9e-24
Identity = 79/233 (33.91%), Postives = 125/233 (53.65%), Query Frame = 0

Query: 15  SQNKGSRRVAFSD-SLPKHRPTS-----NDSNSEHCNKCCCPRLFACCAWICLGVFGIVV 74
           S    +R VAFS+ S P  +P        D + E   K   PR +ACCAW C+ +F  V+
Sbjct: 21  SSRASARHVAFSETSTPSTKPDGAFLPPPDLDGER-PKRFRPRAYACCAWGCMFIFAFVL 80

Query: 75  AILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQNNVVLNAKVDMSIEMRNKNEKI 134
             LILG +F+S   S LPEI VR  N ++ + GN+ N+  V L  KVD+ +E  NKNEK 
Sbjct: 81  LALILGFVFISIFHSYLPEIKVRRFNATRIDFGNAQNKQKVSLKGKVDLLVEFNNKNEKT 140

Query: 135 ELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNVLQLEDDRK 194
           EL +    V+ ++  V LG++    F+    +T   N T+ V     DK++   L+ D +
Sbjct: 141 ELKFGLFKVSASASHVDLGKTEFQPFTQPKKSTKSLNATIGVNHPGVDKEDADLLKQDIQ 200

Query: 195 RVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRMF 242
             ++++ +T+  +V F +    +NK+PI  +CD +Q  + +  N   C+ R+F
Sbjct: 201 NHEVDLTLTLVGSVSFPLSGIMMNKIPIMASCDCKQTEVDFG-NNAKCDYRIF 251

BLAST of Cla97C03G050810 vs. TrEMBL
Match: tr|W9RIU1|W9RIU1_9ROSA (Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_018810 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 6.6e-24
Identity = 80/246 (32.52%), Postives = 133/246 (54.07%), Query Frame = 0

Query: 1   MNNNVAANQLDRLP--SQNKGSRRVAFSDSLPKH-RPTSNDSNSEHCNKCCCPRLFACCA 60
           M ++  A  L R P  +Q+  +R V FSD  PKH RPT    N ++C   C     + CA
Sbjct: 1   MADSRPAPPLPRPPHRTQSTRTRHVCFSDLPPKHNRPTLPLHNGKNCRPFC----LSLCA 60

Query: 61  WICLGVFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGN-STNQNNVVLNAKVD 120
           W CL +F +V+ +L++G++F SFL + LP+I VR L  ++ E+ N  + +    +NA ++
Sbjct: 61  WACLSIFALVLLVLLVGILFASFLHTALPDIAVRGLKFARLEVQNPKSGKLTGSINANLE 120

Query: 121 MSIEMRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTD 180
           + +E  N+NEKI +S+  + V  +SE + LG + I  FS  P NTT   V+  V     D
Sbjct: 121 LLVEFSNQNEKIAVSFGRLEVEASSERINLGETEIGAFSQRPRNTTELRVSTAVEKPGVD 180

Query: 181 KDNVLQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPC 240
            ++  ++  D    +    VT+   V FH+G   ++ VP+ ++C+  Q   +    +  C
Sbjct: 181 GEDAEEMVSDLNDSEAVFDVTLRGNVGFHIGGIDIDGVPVLISCN-PQKTEVDHGKKSRC 240

Query: 241 NIRMFP 243
           + R+FP
Sbjct: 241 SARIFP 241

BLAST of Cla97C03G050810 vs. TrEMBL
Match: tr|A0A1Q3ASG3|A0A1Q3ASG3_CEPFO (Uncharacterized protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_02137 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 7.3e-23
Identity = 70/237 (29.54%), Postives = 130/237 (54.85%), Query Frame = 0

Query: 10  LDRLPSQNK---GSRRVAFSDSLPKHRPTSNDSNSEHCNKCCCPRLFACCAWICLGVFGI 69
           L + PS+++     R VAF +  PK++   ++  S+  +  C P  F CCAW CL +  +
Sbjct: 15  LPKPPSRDQCKSSQRHVAFCEIPPKYQVRDDERCSDEESSRCRPCFFTCCAWTCLALCTL 74

Query: 70  VVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNST--NQNNVVLNAKVDMSIEMRNK 129
            + +LI+GV F + +QS LPEI +  ++  K +  NS+  +   + LNA  D+ +++ NK
Sbjct: 75  SLILLIVGVSFYAMVQSWLPEIRMEKISFIKTDFVNSSPGSPKMLALNAATDVVLQVSNK 134

Query: 130 NEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDSTDKDNVLQLE 189
           NE   L +  + V+++ E++KLGR+ +PGFS  P N+T   +   V     DKD+  +++
Sbjct: 135 NENAGLVFGPLAVDVSWEEIKLGRANVPGFSLHPKNSTTLTLHSGVATSQVDKDDEGEIK 194

Query: 190 DDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPPCNIRMF 242
              K   M + V +   + F++G  ++N +P+ + C       +    +P C++++F
Sbjct: 195 TSLKNRDMMIDVFLTGLIGFNLGGLRMNGLPLQIVCQDINLSDVDLGRQPACDVKIF 251

BLAST of Cla97C03G050810 vs. TAIR10
Match: AT2G30505.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 58.2 bits (139), Expect = 8.5e-09
Identity = 57/246 (23.17%), Postives = 107/246 (43.50%), Query Frame = 0

Query: 1   MNNNVAANQLDRLPSQNKGSRRVAFSDSLPKHRPTSNDSNSEHCNK-----CCCPRLFAC 60
           + N   A+   R PS    SR   F +     R   + S  +H  +     C       C
Sbjct: 80  VENPKKASSFKRPPS---ASRLSGFREEEEADRSRKSGSFVDHIGQEDKRICASGCFRKC 139

Query: 61  CAWICLGVFGIVVAILILGVIFMSFLQSGLPEITVRMLNLSKFEIGNSTNQNNVVLNAKV 120
           CA                     S ++S LP++ V  L  S+ +I  S+   ++++NA +
Sbjct: 140 CACXXXXXXXXXXXXXXXXXSANSSIKSILPQVLVTNLKFSRLDIAKSS--TDLLMNANL 199

Query: 121 DMSIEMRNKNEKIELSYSNIVVNLASEDVKLGRSVIPGFSHDPGNTTYFNVTMNVVGDST 180
           +  +++ N N+K  L YS +  +++SE++ LG+  + GF  DPGN T   +   +     
Sbjct: 200 NTVLQLSNNNDKTVLYYSPMKADISSENINLGKKTLSGFKQDPGNVTSLKILTRLRKSKV 259

Query: 181 DKDNVLQLEDDRKRVQMNVHVTMEATVDFHVGIFKLNKVPIHVACDFRQFLLLYRINEPP 240
              +   L +  K ++  V V +   +      FK++ +PI +AC+  +   +    +P 
Sbjct: 260 YDVDATLLTNKEKTLEALVDVFLRGKLSVDWLGFKVH-IPIVIACESVKQSDVINGLKPA 319

Query: 241 CNIRMF 242
           C++R+F
Sbjct: 320 CDVRIF 319

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008457557.11.3e-10382.04PREDICTED: uncharacterized protein LOC103497223 [Cucumis melo][more]
XP_022949169.18.7e-7663.49uncharacterized protein LOC111452600 [Cucurbita moschata] >XP_023522969.1 unchar... [more]
XP_022998792.13.3e-7563.49uncharacterized protein LOC111493353 [Cucurbita maxima][more]
XP_022158431.13.0e-6860.41uncharacterized protein LOC111024923 [Momordica charantia][more]
XP_023521220.11.1e-3861.90uncharacterized protein LOC111784941 [Cucurbita pepo subsp. pepo] >XP_023524547.... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3C5S1|A0A1S3C5S1_CUCME8.5e-10482.04uncharacterized protein LOC103497223 OS=Cucumis melo OX=3656 GN=LOC103497223 PE=... [more]
tr|A0A1S2Y8L5|A0A1S2Y8L5_CICAR1.3e-2431.93uncharacterized protein LOC101510711 OS=Cicer arietinum OX=3827 GN=LOC101510711 ... [more]
tr|A0A2P6QKY3|A0A2P6QKY3_ROSCH3.9e-2433.91Uncharacterized protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr5g0073561 PE=4... [more]
tr|W9RIU1|W9RIU1_9ROSA6.6e-2432.52Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_018810 PE=4 SV=1[more]
tr|A0A1Q3ASG3|A0A1Q3ASG3_CEPFO7.3e-2329.54Uncharacterized protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_02137 PE=4... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT2G30505.18.5e-0923.17Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C03G050810.1Cla97C03G050810.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA_2 subgroupPFAMPF03168LEA_2coord: 119..220
e-value: 5.3E-12
score: 46.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 17..43
NoneNo IPR availablePANTHERPTHR31234:SF7LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEINcoord: 38..242
NoneNo IPR availablePANTHERPTHR31234FAMILY NOT NAMEDcoord: 38..242

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C03G050810Cla008093Watermelon (97103) v1wmwmbB124
Cla97C03G050810MELO3C020481Melon (DHL92) v3.5.1mewmbB153
Cla97C03G050810ClCG03G000030Watermelon (Charleston Gray)wcgwmbB186
Cla97C03G050810MELO3C020481.2Melon (DHL92) v3.6.1medwmbB142
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C03G050810Silver-seed gourdcarwmbB0545
Cla97C03G050810Cucurbita moschata (Rifu)cmowmbB449
Cla97C03G050810Cucurbita maxima (Rimu)cmawmbB225
Cla97C03G050810Cucurbita maxima (Rimu)cmawmbB468
Cla97C03G050810Cucurbita moschata (Rifu)cmowmbB209
Cla97C03G050810Wild cucumber (PI 183967)cpiwmbB049
Cla97C03G050810Bottle gourd (USVL1VR-Ls)lsiwmbB177
Cla97C03G050810Wax gourdwgowmbB470