ClCG01G011580 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G011580
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: LEA_2 domain-containing protein
Location: CG_Chr01: 19239656 .. 19245630 (+)
RNA-Seq Expression: ClCG01G011580
Synteny: ClCG01G011580
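
The Location value above can be split into its parts and converted to a span length. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the GFF3 convention); the helper name `parse_location` is hypothetical and not part of the database.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CG_Chr01: 19239656 .. 19245630 (+)")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # CG_Chr01 + 5975
```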
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATCGAAAATCGACTTCCAAATTCTATGTTTCTCTTGGGGGTAACCTCAATACGTAGGGATTGAAGAAGCAGTCCATTATTTCTAGATCAAGCATAGAAGTTGAATATCGTTTTCTTGCTTTGGTTGCAACTGAAATGGTAAGGATTTGTGTAAGCCCCGCATCTATTTTTATGAGAACGCAAAGTAAGAAGGGGCCGAGAGATGGGTGAATGTGTAAAAGTAGTTGGCTGGCTTGCGTTTGGAGTGGGAAAATGGCTAAGTGTAAGCTATGCGTTGATGACAACCTGTTGTGGAGGAAATAATTTTGGTGTAAACGCAAAGGTGAGGGAAGCTTATCATGCGTTGAAGTCGTGATGTGTTGAGGATAATAAGGCGTGGAGGATCACTTGGTACCGAGGATCATTATGCGTTAAACGCATTGAAGGCAGTGTGTCGGTGGAAGTAAAGGCCACGAGGGGTTTGGGAGCTGTTATGCGGTGAGTCCATTATGCGGTGAATGCTACCATGCGTCGAAGGCGAGTGTGTGTTGGACAAGGAGTGTTGGCTGAGAGAAGGTAATGCGTATGTTAATTATTTGGCGGGAGAGTTTGAAATGAATTATCATCAGAGGCCGTGTGCGTTGCTGAAGCTGAGCATAGGAGTATGCGTCGAACGGATTATAATATAGAGTTGGAATGGGGAACAAGTCGGTGATATATATATATATATATATGTGGACTGAGAATTCAATTGTTCTGCTTGCATTTCAGAGCTCGAGTTGGGTCCATATTAAGGCGAAACTCCGAAGAGAGGTAGAAGAAGATGTTCATGCTGATCTGAAGCAGAAATTTCATTGAATTCGTGGAAGAAACTCGGGTTTGACAGAGGTTTCGGCTGCTATGATCTCTTCAGAGTGATGCTCGGTTTTGGAGGAAGAACAAATTGAAGAGACGAAGGTTAGGGCTGAAACTTTGAGGTTAGTAAGTTAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAGCTGAGCATAGGAGGTATGCGTCGAACGGATTATAATATAGAGTTGGAATGGGGAACAAGTCGGTGATATATATATATATATATATGTGGACTGAGAATTCAATTGTTCTGCTTGCATTTCAGAGCTCGAGTTGGGTCCATATTAAGGCGAAACTCCGAAGAGAGGTAGAAGAAGATGTTCATGCTGATCTGAAGCAGAAATTTCATTGAATTCGTGGAAGAAACTCGGGTTTGACAGAGGTTTCGGCTGCTATGATCTCTTCAGAGTGATGCTCGGTTTTGGAGGAAGAACAAATTGAAGAGACGAAGGTTAGGGCTGAAACTTTGAGGTTAGTAAGTTAACTCGAATTCTAGCAGAATTCATGAAGGATGTTTGAGCTGAAATTGTCTAATTGTGTGTCAAACTCAGGAAGAAGAGACGGAAGTTGAACTGTCGGATGATTAGAGGCAACTTGGAGTTGTCGAAGGTTATGTCCTAAACGGTGAGGAGGTTCTTCATGAAATTTGGAGGTGAGATGTAGGACCTGTATTTTAACAAGTTTATAGAAGGAAGAAAGTACTGGAATTTGTTGAAACGAAATGTTTAACATGTTGAAAAAGGAAGAAGAAAGGAAGAAGGAAACGGTTCTGTCACAGCTTGAGTATCGCTGAGTGATGAGAGATTGAGCAGCAGAGGAAGTAACGCTGCGACGCAAAGATAGGCTTGCGTCTCACAATTCTCGCTAGAGTATCGCTGGGTACCGCTCCTCAAGGTAGTTTCGCTGGTTTTAAACAACGCAGTGTCGCTAGTACAGTTTGAATATCTTTTGAGTAACGCTGGTAGCTTATTAATTCCCATAGAGTAACGATGCTACTGTATGTTAAAGAAAAGGGGAGTTTATGAAAACTAAACTTGAGGATTGTGTATTTTCAGGCCAAGAAGAGCTCGGGGAGGCTTATAACCCATTTAGAGCCGGGACCGATTGTGAGTGACTGTTTGTTATATCTTGTGTTAAACTTAGTACGCATAGTGATGAGGAAACGTAATGCATGGTAGTTTGCTTTATGTATGTGATGTGTTAGTGATTTAAAAGAATACTATTACAAATGTCTAAGCATATAAGTGGCTGAGGGACCTTATGGTGAAAGCTACTTGTGAGTAAGCGTGATATTAAGTGCCGAGGGATTTGAGAAGGCACTTGAATAACAGTAGAGAATGGACTACAATGCTCAGGGACTGTAAGTGAAGGCATGTAGTATCCTATCACAACTAGCTCGTGCCGAGGGATTCTGTGTGAAGGCACTTGAGCGTAGATAGAGTTTGTTACCTGTGCTGAGGAATTTCATGTGAAGGCACATGGTGTTGTAGTTAAACATGAACAAGTGCTGAGGGACG
GGAAGTGAAGGCACTAAATAGATAGGAAACTTGTGAAACATGAGAATTGATAGATATACATGCGATGGTTACTACCTTACAAACTGAATTAAACTAATATGATTTGTCTTAGTAGATTTAGTCACTCACTGAGCCTTTTGCTCATCCAGTTTGTTGTTGTTTGCCGTTTCAGGTAGCGAGCGTGTCCGGGACGCCTAGCCTACTGAAGAATCTCGTCTGGGCCTGTCTAGAAGCGAACCTCTGGGATAGTTGTAAATACTTAGCTTGTTCGTATATCTTGTAATGAACATATTTTATGAGGGTAGAGAGGGGGACGTGTTGTACTTATTATGTGAATATACAAACATGTTGATTGAAGTTTGCACGTTTTTCTATGATGAGATTATGAAAAGTTGATTGTCTTGCTATTCTTTTATGGTGTTTATCTTATAGAGTCTTATGTTGGGAAATAGGATCATGCTCTTTCCTAGGTATAGAGTAAATCTGGGTTGGGGTGTGACAATTTGTTCTTTACTTAATGATCTAAGAATCTCTTTTGCTAATATTCCTGTTTTATGGTGTAATAATCTCAGTGTTGTTCATCGTAGTGCTAATCCTATCTTATATTCCAAAACTAAGTATGTTGAGCTTGACATTTATTATGTTCGAGATCTTGTGTTTAAGAAGCATGTCAATATTCGTCATCTCCCTACCTCAGAACAAATTGCTAATGTGTTTATAAAACCTCTATCCACTTCAAGTTTTTTGAAGCTAAAGAGTAAATTGAATGTTGTTGTAGCAGCCAATATAGGTTTGCCCGGGGGGGATTGAATTTATTATCATTGTAAAGGCCCAAGGCCTAGTTTCCTCTTCTAGGAGGCCTTCAATTCGGTTACTATCGCGATGTAATCCAGTATGTGTGAGTATAGTGAGCTGTTTTGTAAGAGTTTCTTTTCAAGCTTTGTTATTTATATGACATTTCATCTTCCATTAATAAATAACAAAACATAGTGTTTTCTCTCGCATGGCTATGGCTCCAAATTATCTCATAATTATTTTTTTAAAAAAATTATTTTAAGTTTGGTAATTGTGAAAAGTAGAGAAAGATAAAATTAAATGATAAATGTAAAAAGTAAATATAGAGGGAAATAAGTGAAAAAAAAATGGAAAAAAAAATTACCACAAAAATGTGCAAAATTTAACGATAACTACCACCACTTTTGCAATTTGCTCTATTTACTACCAAATAAAAAGTAAAAATTAGAATTAGAAACATAAAAAACCAATCTTAGATTAAGCTGGTCATTCTCATAGTTGTGTGTGTTCCCCTCACAGATATCTCGTGCCATGGCTAACTCCTCCATCGGCGGCTGGCCGACGCATCCTCAACCCCAAACCCATCCCCATCGCCACAACTCCTCGCCGTGCCTCCGAGCCTTCGCCGCCGGCATGGTTCTCCTCCTTTCCATCGCTCTCATCATCTACACCGTCCAATATTTCATCTTCCGCCCCATCCTCCCTATTCTCCGGGTCGACACACTTCAACTCGCCAAATTTCTCTGCCGCCGCCCCGTCCCTTATCTCCTCATGGGTCGTTGGATTTTCCGTCAACAACCCCAACAAGAAGCTCGCCATCTCATTCCAAAACCTTGAGTCCTCCATTTACTACAAAGATAACATTATCGCTCAAGCCCGAATTCACCGCTTCCTCCTCCACCGAAGGAACTCGACGGCCGTTGTCACTCCCTTCATCACCGACTCGCCCGTCGATGAGTCGGTTTTGAACGACATTAAAGGAGACTTAGCGTGTGGAGCAATTAATTTCAATGTTGTAGTTCTTGGCTATGCCGAGTTCCAAATCGGTGTGTGGCGGTGGAGGGGCAACAATTTTCGGGTTCTTTGCAGCGATTTGTCTGTCGGATTGTTGTCGCCGCTGAGTCCCGGCGGCGGGTCCGGCCAGTTGGTTGGTGGCTCAAGGCAATGCCAGCTACGATGA

mRNA sequence

ATGATCGAAAATCGACTTCCAAATTCTATATATCTCGTGCCATGGCTAACTCCTCCATCGGCGGCTGGCCGACGCATCCTCAACCCCAAACCCATCCCCATCGCCACAACTCCTCGCCGTGCCTCCGAGCCTTCGCCGCCGGCATGGTTCTCCTCCTTTCCATCGCTCTCATCATCTACACCGTCCAATATTTCATCTTCCGCCCCATCCTCCCTATTCTCCGGGTCGACACACTTCAACTCGCCAAATTTCTCTGCCGCCGCCCCGTCCCTTATCTCCTCATGGGTCGTTGGATTTTCCGTCAACAACCCCAACAAGAAGCTCGCCATCTCATTCCAAAACCTTGAGTCCTCCATTTACTACAAAGATAACATTATCGCTCAAGCCCGAATTCACCGCTTCCTCCTCCACCGAAGGAACTCGACGGCCGTTGTCACTCCCTTCATCACCGACTCGCCCGTCGATGAGTCGGTTTTGAACGACATTAAAGGAGACTTAGCGTGTGGAGCAATTAATTTCAATGTTGTAGTTCTTGGCTATGCCGAGTTCCAAATCGGTGTGTGGCGGTGGAGGGGCAACAATTTTCGGGTTCTTTGCAGCGATTTGTCTGTCGGATTGTTGTCGCCGCTGAGTCCCGGCGGCGGGTCCGGCCAGTTGGTTGGTGGCTCAAGGCAATGCCAGCTACGATGA

Coding sequence (CDS)

ATGATCGAAAATCGACTTCCAAATTCTATATATCTCGTGCCATGGCTAACTCCTCCATCGGCGGCTGGCCGACGCATCCTCAACCCCAAACCCATCCCCATCGCCACAACTCCTCGCCGTGCCTCCGAGCCTTCGCCGCCGGCATGGTTCTCCTCCTTTCCATCGCTCTCATCATCTACACCGTCCAATATTTCATCTTCCGCCCCATCCTCCCTATTCTCCGGGTCGACACACTTCAACTCGCCAAATTTCTCTGCCGCCGCCCCGTCCCTTATCTCCTCATGGGTCGTTGGATTTTCCGTCAACAACCCCAACAAGAAGCTCGCCATCTCATTCCAAAACCTTGAGTCCTCCATTTACTACAAAGATAACATTATCGCTCAAGCCCGAATTCACCGCTTCCTCCTCCACCGAAGGAACTCGACGGCCGTTGTCACTCCCTTCATCACCGACTCGCCCGTCGATGAGTCGGTTTTGAACGACATTAAAGGAGACTTAGCGTGTGGAGCAATTAATTTCAATGTTGTAGTTCTTGGCTATGCCGAGTTCCAAATCGGTGTGTGGCGGTGGAGGGGCAACAATTTTCGGGTTCTTTGCAGCGATTTGTCTGTCGGATTGTTGTCGCCGCTGAGTCCCGGCGGCGGGTCCGGCCAGTTGGTTGGTGGCTCAAGGCAATGCCAGCTACGATGA

Protein sequence

MIENRLPNSIYLVPWLTPPSAAGRRILNPKPIPIATTPRRASEPSPPAWFSSFPSLSSSTPSNISSSAPSSLFSGSTHFNSPNFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNSTAVVTPFITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSDLSVGLLSPLSPGGGSGQLVGGSRQCQLR
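
The protein sequence is the frame-0 translation of the CDS. The sketch below illustrates this on the first 27 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is illustrative and is not the pipeline that produced this annotation.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nucleotides of the CDS listed above.
cds_start = "ATGATCGAAAATCGACTTCCAAATTCT"
print(translate(cds_start))  # MIENRLPNS
```

Passing the full CDS string to `translate` should reproduce the protein sequence above, ending at the terminal TGA stop codon.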
Homology
BLAST of ClCG01G011580 vs. NCBI nr
Match: XP_008463664.1 (PREDICTED: uncharacterized protein LOC103501757 [Cucumis melo] >KAA0035579.1 protein YLS9 isoform X2 [Cucumis melo var. makuwa] >TYK30956.1 protein YLS9 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 212.2 bits (539), Expect = 4.6e-51
Identity = 108/147 (73.47%), Positives = 120/147 (81.63%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+AA +   SW+VGFS+NNPNKKLAISFQNL+SSIYYKDNIIAQARI RFLL  RNST
Sbjct: 70  NFSSAA-AAAPSWIVGFSINNPNKKLAISFQNLDSSIYYKDNIIAQARIRRFLLRPRNST 129

Query: 143 AVVTPFITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSDL 202
            +V PFI  S VDESVLNDI GDLA G INF VVVLGYA FQI +W+WRG N +V+CSDL
Sbjct: 130 TLVIPFIAVSLVDESVLNDINGDLARGTINFTVVVLGYANFQISLWQWRGTNIQVVCSDL 189

Query: 203 SVGLLSPLSPGGGSGQLVGGSRQCQLR 230
           SVG   P S  G SGQLVGGS+QCQL+
Sbjct: 190 SVGFSWPPSLAGRSGQLVGGSKQCQLQ 215

BLAST of ClCG01G011580 vs. NCBI nr
Match: XP_018857049.1 (uncharacterized protein At1g08160-like [Juglans regia] >KAF5447433.1 hypothetical protein F2P56_032987 [Juglans regia])

HSP 1 Score: 131.7 bits (330), Expect = 7.9e-27
Identity = 70/147 (47.62%), Positives = 96/147 (65.31%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+A+ SL  +W V FSV NPNKKL+IS++ + SS++YK   I+  R+  F L +RN T
Sbjct: 100 NFSSASSSLTGNWNVRFSVYNPNKKLSISYEEVLSSLFYKKEFISNTRMPPFKLGKRNQT 159

Query: 143 AVVTPF-ITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSD 202
            +   F   D+ VD  V+NDI GD A G ++FNV+V  + +F+ G WR R  + RVLC  
Sbjct: 160 VLDVSFSAADTYVDRWVVNDINGDRARGTVSFNVLVKAWVQFRAGAWRPRNRSIRVLCEG 219

Query: 203 LSVGLLSPLSPGGGSGQLVGGSRQCQL 229
           L+VGL S  S   GSG LVGG+R C++
Sbjct: 220 LAVGLSSNSS---GSGMLVGGARDCRV 243

BLAST of ClCG01G011580 vs. NCBI nr
Match: XP_040991250.1 (uncharacterized protein At1g08160-like [Juglans microcarpa x Juglans regia])

HSP 1 Score: 127.1 bits (318), Expect = 2.0e-25
Identity = 68/147 (46.26%), Positives = 95/147 (64.63%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+A+ +L  +W V FSV NPNKKL+IS++ + SS++YK   I+  R+  F L +RN T
Sbjct: 100 NFSSASSTLTGNWNVRFSVYNPNKKLSISYEEVLSSLFYKSEFISHTRMPPFKLGKRNQT 159

Query: 143 AVVTPF-ITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSD 202
            +   F   D+ VD  V+NDI  D A G ++FNV+V  + +F+ G WR R  + RVLC  
Sbjct: 160 VLDVSFSAADTYVDRWVVNDIYWDRARGTVSFNVLVKAWVQFRAGAWRPRNRSIRVLCEG 219

Query: 203 LSVGLLSPLSPGGGSGQLVGGSRQCQL 229
           L+VGL S  S   GSG LVGG+R C++
Sbjct: 220 LAVGLSSNSS---GSGMLVGGARDCRV 243

BLAST of ClCG01G011580 vs. NCBI nr
Match: XP_042956465.1 (uncharacterized protein At1g08160 [Carya illinoinensis] >KAG6632036.1 hypothetical protein CIPAW_13G131000 [Carya illinoinensis] >KAG6682257.1 hypothetical protein I3842_13G129700 [Carya illinoinensis] >KAG7950481.1 hypothetical protein I3843_13G114700 [Carya illinoinensis])

HSP 1 Score: 126.7 bits (317), Expect = 2.5e-25
Identity = 67/147 (45.58%), Positives = 94/147 (63.95%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+A+ SL  +W V FSV NPNKKL+IS++ + +S++YK   I+  ++  F L +RN T
Sbjct: 100 NFSSASSSLTGNWNVRFSVYNPNKKLSISYEEVLASLFYKSEFISNTQMPPFKLRKRNQT 159

Query: 143 AVVTPF-ITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSD 202
            +   F   D+ V   V+NDI GD A G ++FNV+V  + +F+ G WR R  + RVLC  
Sbjct: 160 VLDVSFSAADAYVARWVVNDINGDRARGTVSFNVMVKAWVQFRAGAWRPRNRSIRVLCEG 219

Query: 203 LSVGLLSPLSPGGGSGQLVGGSRQCQL 229
           L+VGL S  S   GSG LVGG R C++
Sbjct: 220 LAVGLSSNSS---GSGMLVGGERDCRV 243

BLAST of ClCG01G011580 vs. NCBI nr
Match: KAG2674349.1 (hypothetical protein I3760_13G129800 [Carya illinoinensis])

HSP 1 Score: 126.3 bits (316), Expect = 3.3e-25
Identity = 66/147 (44.90%), Positives = 94/147 (63.95%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+A+ SL  +W V FSV NPNKKL+IS++ + +S++YK   I+  ++  F L +RN T
Sbjct: 100 NFSSASSSLTGNWNVRFSVYNPNKKLSISYEEVLASLFYKSEFISNTQMPPFKLRKRNQT 159

Query: 143 AVVTPF-ITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSD 202
            +   F   D+ V   V+NDI GD A G ++FNV+V  + +F+ G WR R  + RVLC  
Sbjct: 160 VLDVSFSAADAYVARWVVNDINGDRARGTVSFNVMVKAWVQFRAGAWRPRNRSIRVLCEG 219

Query: 203 LSVGLLSPLSPGGGSGQLVGGSRQCQL 229
           L++GL S  S   GSG LVGG R C++
Sbjct: 220 LAIGLSSNSS---GSGMLVGGERDCRV 243

BLAST of ClCG01G011580 vs. ExPASy Swiss-Prot
Match: Q9SRN1 (NDR1/HIN1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=NHL2 PE=2 SV=1)

HSP 1 Score: 46.6 bits (109), Expect = 4.4e-04
Identity = 30/123 (24.39%), Positives = 53/123 (43.09%), Query Frame = 0

Query: 97  VGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNSTAVVTPFITDSPV-- 156
           + F++ NPN+++ + +     S YY D     A +  F    +N+T ++T     + V  
Sbjct: 108 LNFTIRNPNQRVGVYYDEFSVSGYYGDQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVL 167

Query: 157 DESVLNDIKGDLACGAINFNVVVLGYAEFQ---IGVWRWRGNNFRVLCSDLSVGLLSPLS 215
            +    D+K D   G    N  +     F+   I  W+ +    ++ C DL + L S  S
Sbjct: 168 GDGARTDLKDDEKSGIYRINAKLRLSVRFKFWFIKSWKLKP---KIKCDDLKIPLGSSNS 227

BLAST of ClCG01G011580 vs. ExPASy TrEMBL
Match: A0A5D3E5N0 (Protein YLS9 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G001820 PE=4 SV=1)

HSP 1 Score: 212.2 bits (539), Expect = 2.2e-51
Identity = 108/147 (73.47%), Positives = 120/147 (81.63%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+AA +   SW+VGFS+NNPNKKLAISFQNL+SSIYYKDNIIAQARI RFLL  RNST
Sbjct: 70  NFSSAA-AAAPSWIVGFSINNPNKKLAISFQNLDSSIYYKDNIIAQARIRRFLLRPRNST 129

Query: 143 AVVTPFITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSDL 202
            +V PFI  S VDESVLNDI GDLA G INF VVVLGYA FQI +W+WRG N +V+CSDL
Sbjct: 130 TLVIPFIAVSLVDESVLNDINGDLARGTINFTVVVLGYANFQISLWQWRGTNIQVVCSDL 189

Query: 203 SVGLLSPLSPGGGSGQLVGGSRQCQLR 230
           SVG   P S  G SGQLVGGS+QCQL+
Sbjct: 190 SVGFSWPPSLAGRSGQLVGGSKQCQLQ 215

BLAST of ClCG01G011580 vs. ExPASy TrEMBL
Match: A0A1S3CJS5 (uncharacterized protein LOC103501757 OS=Cucumis melo OX=3656 GN=LOC103501757 PE=4 SV=1)

HSP 1 Score: 212.2 bits (539), Expect = 2.2e-51
Identity = 108/147 (73.47%), Positives = 120/147 (81.63%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+AA +   SW+VGFS+NNPNKKLAISFQNL+SSIYYKDNIIAQARI RFLL  RNST
Sbjct: 70  NFSSAA-AAAPSWIVGFSINNPNKKLAISFQNLDSSIYYKDNIIAQARIRRFLLRPRNST 129

Query: 143 AVVTPFITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSDL 202
            +V PFI  S VDESVLNDI GDLA G INF VVVLGYA FQI +W+WRG N +V+CSDL
Sbjct: 130 TLVIPFIAVSLVDESVLNDINGDLARGTINFTVVVLGYANFQISLWQWRGTNIQVVCSDL 189

Query: 203 SVGLLSPLSPGGGSGQLVGGSRQCQLR 230
           SVG   P S  G SGQLVGGS+QCQL+
Sbjct: 190 SVGFSWPPSLAGRSGQLVGGSKQCQLQ 215

BLAST of ClCG01G011580 vs. ExPASy TrEMBL
Match: A0A0A0LT76 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G169920 PE=4 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 1.4e-45
Identity = 103/146 (70.55%), Positives = 112/146 (76.71%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFSA A +   SWVVGFS+NNPNKKLAISF+NLESSIYYKDNIIAQAR  RFLL  RNST
Sbjct: 70  NFSATAAA--PSWVVGFSINNPNKKLAISFRNLESSIYYKDNIIAQARTRRFLLPPRNST 129

Query: 143 AVVTPFITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSDL 202
            +V+PFI D  VDESVLNDI GDL  G I+F VVVLGYA  +IGVWR  G + RV+CSDL
Sbjct: 130 TLVSPFIADLLVDESVLNDIHGDLERGTIDFTVVVLGYANVEIGVWRPIGTDIRVVCSDL 189

Query: 203 SVGLLSPLSPGGGSGQLVGGSRQCQL 229
           SV    P    G SGQLVGGSRQC L
Sbjct: 190 SVKFSWPPGLSGRSGQLVGGSRQCHL 213

BLAST of ClCG01G011580 vs. ExPASy TrEMBL
Match: A0A2I4HLK5 (uncharacterized protein At1g08160-like OS=Juglans regia OX=51240 GN=LOC109019244 PE=4 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 3.8e-27
Identity = 70/147 (47.62%), Positives = 96/147 (65.31%), Query Frame = 0

Query: 83  NFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNST 142
           NFS+A+ SL  +W V FSV NPNKKL+IS++ + SS++YK   I+  R+  F L +RN T
Sbjct: 100 NFSSASSSLTGNWNVRFSVYNPNKKLSISYEEVLSSLFYKKEFISNTRMPPFKLGKRNQT 159

Query: 143 AVVTPF-ITDSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCSD 202
            +   F   D+ VD  V+NDI GD A G ++FNV+V  + +F+ G WR R  + RVLC  
Sbjct: 160 VLDVSFSAADTYVDRWVVNDINGDRARGTVSFNVLVKAWVQFRAGAWRPRNRSIRVLCEG 219

Query: 203 LSVGLLSPLSPGGGSGQLVGGSRQCQL 229
           L+VGL S  S   GSG LVGG+R C++
Sbjct: 220 LAVGLSSNSS---GSGMLVGGARDCRV 243

BLAST of ClCG01G011580 vs. ExPASy TrEMBL
Match: A0A6J1K154 (NDR1/HIN1-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111490764 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 1.2e-23
Identity = 65/151 (43.05%), Positives = 96/151 (63.58%), Query Frame = 0

Query: 79  FNSPNFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHR 138
           F   NFSAA+ SL ++W VGFSV NPNKK++IS+  ++S+++YK+ I+++ R+  F+  +
Sbjct: 97  FQVTNFSAASQSLSAAWFVGFSVFNPNKKMSISYDFVDSTVFYKNQILSETRVPPFIQDK 156

Query: 139 RNSTAVVTPFIT-DSPVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRV 198
           R  T V   F + ++ +D S +NDI  D   GA+ F+V +     F+ G WR R    RV
Sbjct: 157 RTHTVVNASFSSLNAYIDASSVNDINDDRRRGAVKFDVGLSARVGFRAGWWRARRRLLRV 216

Query: 199 LCSDLSVGLLSPLSPGGGSGQLVGGSRQCQL 229
           LC DLSVGL   LS    SG+L+G SR C++
Sbjct: 217 LCEDLSVGL--SLSNSSASGKLLGESRSCRV 245

BLAST of ClCG01G011580 vs. TAIR 10
Match: AT4G05220.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 47.0 bits (110), Expect = 2.4e-05
Identity = 26/116 (22.41%), Positives = 55/116 (47.41%), Query Frame = 0

Query: 89  PSLISSWVVGFSVN--NPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNSTAVVT 148
           P+ + +  + F+V   NPN+ + + F ++E SIYYKD  +    +      +  +T +VT
Sbjct: 88  PTGVENARIAFNVTILNPNQHMGVYFDSMEGSIYYKDQRVGLIPLLNPFFQQPTNTTIVT 147

Query: 149 PFITDS--PVDESVLNDIKGDLACGAINFNVVVLGYAEFQIGVWRWRGNNFRVLCS 201
             +T +   V+ +   +   D A G + F + ++    F++  W  + +     C+
Sbjct: 148 GTLTGASLTVNSNRWTEFSNDRAQGTVGFRLDIVSTIRFKLHRWISKHHRMHANCN 203

BLAST of ClCG01G011580 vs. TAIR 10
Match: AT3G11650.1 (NDR1/HIN1-like 2 )

HSP 1 Score: 46.6 bits (109), Expect = 3.1e-05
Identity = 30/123 (24.39%), Positives = 53/123 (43.09%), Query Frame = 0

Query: 97  VGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFLLHRRNSTAVVTPFITDSPV-- 156
           + F++ NPN+++ + +     S YY D     A +  F    +N+T ++T     + V  
Sbjct: 108 LNFTIRNPNQRVGVYYDEFSVSGYYGDQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVL 167

Query: 157 DESVLNDIKGDLACGAINFNVVVLGYAEFQ---IGVWRWRGNNFRVLCSDLSVGLLSPLS 215
            +    D+K D   G    N  +     F+   I  W+ +    ++ C DL + L S  S
Sbjct: 168 GDGARTDLKDDEKSGIYRINAKLRLSVRFKFWFIKSWKLKP---KIKCDDLKIPLGSSNS 227

BLAST of ClCG01G011580 vs. TAIR 10
Match: AT4G26820.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 42.0 bits (97), Expect = 7.7e-04
Identity = 30/134 (22.39%), Positives = 58/134 (43.28%), Query Frame = 0

Query: 76  STHFNSPNFSAAAPSLISSWVVGFSVNNPNKKLAISFQNLESSIYYKDNIIAQARIHRFL 135
           + H  S + S A  +L   W   FS+ NPN+KL ++++N    + ++  +++ AR   F 
Sbjct: 79  TVHVQSMHISFANHNL-PVWSATFSIKNPNEKLHVTYENPSVWLVHRGKLVSTARADSFW 138

Query: 136 LHRRNSTAVVTPFITDSPVDESVLNDIKGDLAC--GAINFNVVVLGYAEFQIGVWR-WRG 195
                   V+        +DE    +++ ++A   G +  ++V  G   F  G    W  
Sbjct: 139 QKGGEKNEVIVKRNETKVIDEEAAWEMEDEVAVTGGVVGLDMVFSGRVGFYPGTSALWGE 198

Query: 196 NNFRVLCSDLSVGL 207
                +C ++S  L
Sbjct: 199 QYMSAVCENVSAKL 211
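
The percentages on each HSP statistics line are the stated match counts divided by the alignment length. As a rough sketch, the fractions can be parsed and recomputed as follows; the helper name `hsp_fractions` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches "Label = matches/length (pct%)" pairs on an HSP statistics line.
PAIR = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_fractions(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    out = {}
    for label, n, total, pct in PAIR.findall(line):
        n, total = int(n), int(total)
        out[label] = (n, total, float(pct), round(100 * n / total, 2))
    return out

# Statistics line of the top hit (Cucumis melo) from the results above.
line = "Identity = 108/147 (73.47%), Positives = 120/147 (81.63%), Query Frame = 0"
for label, (n, total, reported, recomputed) in hsp_fractions(line).items():
    print(label, n, total, reported, recomputed)
```

The pattern is label-agnostic, so it picks up both fraction fields on the line and skips the `Query Frame` field, which has no fraction.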

The following BLAST results are available for this feature:

NCBI nr
Match Name	E-value	Identity (%)	Description
XP_008463664.1	4.6e-51	73.47	PREDICTED: uncharacterized protein LOC103501757 [Cucumis melo] >KAA0035579.1 pro...
XP_018857049.1	7.9e-27	47.62	uncharacterized protein At1g08160-like [Juglans regia] >KAF5447433.1 hypothetica...
XP_040991250.1	2.0e-25	46.26	uncharacterized protein At1g08160-like [Juglans microcarpa x Juglans regia]
XP_042956465.1	2.5e-25	45.58	uncharacterized protein At1g08160 [Carya illinoinensis] >KAG6632036.1 hypothetic...
KAG2674349.1	3.3e-25	44.90	hypothetical protein I3760_13G129800 [Carya illinoinensis]

ExPASy Swiss-Prot
Match Name	E-value	Identity (%)	Description
Q9SRN1	4.4e-04	24.39	NDR1/HIN1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=NHL2 PE=2 SV=1

ExPASy TrEMBL
Match Name	E-value	Identity (%)	Description
A0A5D3E5N0	2.2e-51	73.47	Protein YLS9 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CJS5	2.2e-51	73.47	uncharacterized protein LOC103501757 OS=Cucumis melo OX=3656 GN=LOC103501757 PE=...
A0A0A0LT76	1.4e-45	70.55	LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G169920 PE=4 ...
A0A2I4HLK5	3.8e-27	47.62	uncharacterized protein At1g08160-like OS=Juglans regia OX=51240 GN=LOC109019244...
A0A6J1K154	1.2e-23	43.05	NDR1/HIN1-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111490764 PE=4 SV=1

TAIR 10
Match Name	E-value	Identity (%)	Description
AT4G05220.1	2.4e-05	22.41	Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G11650.1	3.1e-05	24.39	NDR1/HIN1-like 2
AT4G26820.1	7.7e-04	22.39	unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004864	Late embryogenesis abundant protein, LEA_2 subgroup	PFAM	PF03168	LEA_2	coord: 100..186; e-value: 6.5E-6; score: 26.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 27..47
None	No IPR available	PANTHER	PTHR31852	LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY	coord: 82..227
None	No IPR available	PANTHER	PTHR31852:SF2	GRPE-LIKE PROTEIN	coord: 82..227

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG01G011580.1	ClCG01G011580.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
cellular_component	GO:0016021	integral component of membrane