Clc10G01940 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc10G01940
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: LSD1 zinc finger family protein
Location: ClcChr10: 1803918 .. 1809626 (-)
RNA-Seq Expression: Clc10G01940
Synteny: Clc10G01940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCATTTCCTCCAGTTACTTGTATGACTACACTACCCCAAAACTTCGAGTTTGGATATAATTTGAATTGTTTTCCTGTATAGATCTGAGAATAACTAACAAATTTCTAATGTGAGATGGTTATTTTTGTTTGTATATGAGCACTAGGGAGAGAAAACTCTCAACATGAACAATGAAGTTTAGGGAAAGAATTTGACATTCATATAATTAAAGTTAGACCCTTTTCTTAGTTTTATTGTATCTTTTGCTCTTATGTCTCATTGTATTGAGCTTTTGTCTCTTTCATTTCATCAATGAAGGTTTATGTTTCTGTTTCGGAAAAAAAAAAGTTAGATCCTATTCTTAGAAAAATGACCAACTAATGAATTGATTTCACGTCATAATACTATCTTTTAAGAACTGAAACTTGTTGGGCAATAGGTAGTCAAAGCCAAATAGTGTGCTCTGGATGTAAACATCTGTTGATGTATCCTGCCGGAGCAACCTCCATTTGCTGTGCTCTTTGCCATGTTGTATCTCCTGTGCCAACCTCTGGTTTGTTTCTATATCCTTCCATTTTTCTTTTGATCTTTTTTGAAATTATGCTTCGATCATAATTCAAATCGAAATCTTTTTTCTCAAATTCTAGTGCAATTGGAATAGTTAAAAAGGATTGTGATTTTATCTATTCCAACTCTAATTCCCACCCCTCCTTTCACCTTTTTTGTTCTGACAAGTGAAAACATCTTCTAAAAATAAGGAATCCTCTCATTAATTTTTTATGCTATAGTTAATGCCAAATCGCTCTCCCTTTTCATCACTTTTAGCTTTTCAATGGGAGAAATCTTTTGTAATTGTCCCTTTTGGAGTTTTTCCCTTCTATTGTAATTTCATTCTCAAACTGTTTTGAAGTGATAAGCTCAAACCCAGTTCGAATTACTTGGGAAGTTGAACTCTGACTATTTTTGCATCTATCATCAATCTTATTAGTTGTTGTCTCCACCGGTTGTAGAGTAAAGAAAGGGCGGAAGTAATGAGGTTGTTACTAAATCTTGTAGAGTCAGGTGGTTGAGTTAGCCTAGATATCCAGAATTATCAACAAAACAAGTTAACTAATGATGATTTGCAAGTTCTTATATTGTTCGTATATATGCTTGGAATTGTGTATAGTTAGAATATGATTGGTTTGGAATATGACCCTTAAGCAGGCTTAACGATGGCTCAGCTGGTATGCAGTGGCTGCCACACCCTGCTCATGTACAGGCGTGGGGCGACAAGTGTACAATGTTCTTGTTGTCGCACTGTAAATGCAGCCTCCAAAGGTAAATTGTGACAACTAAAGGTTTACATATGTATTTTTTCTTTAGTGAAGAGCAATGCTTTTGTTTGATTTACCTTAAAACCACTCATCAAGAAGCCTATTTGTGATCATAGGAATAGTTACAATTAAGTTTTACCCATGCTTTATGCTTTTTGTGTTATCATAGGAATAGTTTTAATTTCTTTGCATCCAAATGGACTATCATGGAGCTACAGTAGCACGCCTTCGAAACCTTAGATTTACCTTAAAACCACTCATCAAGAAGCCTATTTGTGATATCTTTGACAAGGCACCAAGATAAAATACACTTTCACTAAATTAAAGACCATCTGTAAAGAGGCCAATTTGTAGTGAACCAGCAAATGTCTTGAAGCTATTATTTGATGCTCCAAATGACCAAACATGGAGATCTCTGTTTTTTGGACTCCGTTGCTCTTCAAGATAATAAAAAATTACTCTCCCTGTCCATGGATATAATAGCTAACACATTGTTAGTGAACTACGTAAATCTATGCGTCGGTTTTTTTTTGTTTGCTTCTTTGCTTGTCGTCTGTTTTTGTCATAACATGTAGAATAGTTCAAAAAAAAAAAAAACACGTAGAATAGTTCACTTATTGTAGAATCACATAAGAATAATAGGGTGCGTCAAACACTCAATGTGCTGCACATGTAGCTCAAATGTGTTAAATGGGCCCAAAT
AGATGAAACTTTGGAATTACAATTTGGAACTATAGACAATTATTTTTATCATTAAGAGATGCAAGGTATAATCAAGCAGGGAACTGAATTCTAAGCATTAAATTTCATTCTTGATTATTAAGAAAATCCTTTGTTGTTGAGTCCTTTTAGCTAGAGATAGGGTAAATCCAGGAAAGAGTTTTAATAAATATATATCCTAAGTAAAGAGTATATTGTATCAAATTATTTGTGTACATAAAATCTTATGATGGGGAAGGGGAATGTTTTAAAATTTGGTCCTCCAATTAGTACTACTGATGGTTTAATCTAATCTCCTCAAAGGCTTATTATGAAATGTTGGATTTTGAATGGTTGCAAACAGCAAATCAGATGGCTCACATCAACTGTGGGAACTGTAGGGTGCTGCTGATGTACCAATGTGGGGCACATTCTGTTAAATGTACACTTTGCAATTTTGTGACTTCAGTTGGGATTTTGAGGTTTAATTAGTTCACAATTTTCCCTTTTTGTCTCGAACTTCACAGCACTTGGATCAATGAAGATGAACCCTAAAAGTATCAACACTTTCTATTGGTTCCTAATGAAAAAGGAATCTCTTTTAATTAGGATCAAGGTTGTTTGTACCCAATGGTCTATTCTAATTTGAGATTTTCTTTTCTCCGTAAACCAGTGTTTTTTTTTTTACTCTAATAACTATGGGACGACCTCTCACTTCAAAAGAGGGAGCTACCTAATTGTAAAAGAGAGCTTTAATGTTTTTTTTTTTTTTCATGTCTCTAAATTTATGGGACTATTTAGCATGCTAAGTTGAGTTCATATGTTTGTGTAGTTTATATATCTATAGAATTCATATGTTTGTATTTTGGGTGTTGAGTTCATATGTCAAGTATATCTGATATAACTTTTTACATAAATAATTTTACACACACACAGATATATATATTTAATAATTATAGATGACACTTTGTTTTATTGTTTTTTAAACATACATGCATTGTTTGGAGTTTTGAGAGATGGGGTTTAGGGTTTACTTTTGGATAATGTATTTCAATTTTATTTATTTATTTAAGTAATCATTTTACAAATTTGTACATGTACATTTCTAGTTAAGTTATTTTTATTTTAACACTTTTGATAAAAACTTTTTAAAATAAAAACATTTATGTGTTTGACTATTTTTTTTCAAAAGTGTTTTTAACATACAAGTTTGATTTTTTACATCATAAAAGTGATTTTATTAATGTTAAATTGTTTATAAAGAATGATTATAAAATACTATACACTAATAAAAAAAATTGGAAATTCAACGTTTTGATGGAATTTAAGTACTTTGCGTCAACAAAAGTGTTTTTATTAACAATTTGTGCATTAACTTAAATATAAACGTTTATTGATGTTTTATATGTATTTTTAAATTAATAGTTAACTTTAAGTTTTAAATTAAAATTTTATTAATAGTTTTATGTCACAAGCGTCAACAAAACAAAAGATTAGTGTATCAACAAAACATAAATTTGTATTAGTTATAATTTAAAATAATGGTCATTAGAGTTAGCAGAGTGGCTGCCATTGTCAGTGGTCGTCAGTGGTAGTTCGATGTTGCTAGAATTGGATGCAGGCAGTAATTGTCAGAGTTGGCTGTTGGAGTGGTCGTCTTATCTCATCTCTTCACTCCAAATCCCTCTTGTTCTGTTTTACCTCTTTTTTCTATATAACATTTGTATTTATGCCTTTAGATTTTTGTTTAAATGTCCTTCACTCATTTTCTTCATTATCTCCCTAATTTCGAACATTTTAAACCCTAAAATAGTTATAAGTTGGAGTTTGTTATTCTATTTTCTAAAAAGAAAAATAGTCATGAATAAGCATTTACTGCTCTTGTTTTAGAAAAATAGGAGAACAAAAAGAAAAAGAAAAAATGGTTATGAAGTTGAAATAAATTTTAAACTAAAATGTGCCTGGTTTAATTAATTAAACTTTTCAGAGTCAAATGTTGAGAC
ATGAAAATAAAAATTTCAAACTTTATGAATATGCAATAAATCTTATAAATCAAACCTTCACTAAAATAAATATCGTTATAAATAAAAAAAATAAAAATTATTAAGTTGTTTTACTTTCAAAACTAAGTAATAAATTTTGTTATTACTATCCATATTCATGTTGGAGTTAATAAATGATATTATCATACATATATATACATCCACACATACCCATGAATATTTCCATCTTTATTAGAAGTAAAACTATATAATTTAACTCAAAATATGTCCGTGGAGATTCATGGAAAGGGCCGAACTATAGATAATTAATTATTACTCGATTGAACGATGTACCAATAGTAGTAGAAGAAATATATTACAAAATTTTCTTGTTATGTAATTTATGAAAGAAAGTGGATATATGTTGAGAATTATATGGTTAATAAAAAGGATAGAAGTTTATAAATTCTCCTTCCTTAATTTTATAAGATAAAAAGAGAAATGACAGGTGGGACAATAAGGGCATGCATGGGTGACAAATGGCATCCAACGTGTAATGTTTTTTGAAGATTTAAGATTTCTTGATTCACTATATATATATATATATATATATATGTTTGTGTCTTGTCGTTTTTGTTTCTGAATGATCTAAAAATTGGGAATAATGCAATCTGCAATGGAGAAGCTGAGTAATATGGGAAGTGTTGCTAAAGAGAAGCTCAAAATTTGTAGAGCCAAACTCCACGAGAAGGTGTGAATCTTTTATCTACATATATCTCCATTGTTATCATTAGTGTTAAATTTCAGTCGGTATATTAATATTTTCATCCTTATTTTTGCTATATTGTAGAAAAATTTATAAACATAACAAATAAACAAAAACTAATATAAATAACAGAGTAATTTATTTATATTATAATATAAATAAAGATATTTTAACTTATTTAAAAAATTTAAACTTAAATCTTTAAATTTTTATTTTTATAAATTTTTTCGAAAATATTTATTGAAATCAACATTTTACCAATAATTCTATCAACATTTTCGTAAAATTTAGATCTTATTCAACATATAAATGGCCATTTTCTTCCCAAATACATATGTCTCTGAATATTTATAATGTATTTATAATGTATGTGTCTAATAAGCCATTTAATTAATATGTTTCTAAATTTGTAATTTTGTATTTAATTAGTTCATTAACCTTTACATTTATGTTTAATAGGTGGACATAGGTGTCTTTAATTGAGAGGTATTTGAATTTTTTCAATATTGACCTTCTAAAATCTAAAATCTATGTTAAATATAGGTCTTTAGTTTCTAATTATTGTTTTTAGGTTCTTGATCTAACATTTTTAAAATAACGATGGTCTGCCTTTTAAACACAATATAAACATTTTCAATATAATTTAAAACAATAGATAATTAATGTGTAAATGAAGGAACAGGTGGAGAAAGCGTCGGTGAAAACAGCGGAGGAGAGGAAGATTGTGGAGGAGAGAAGAAAGGCGGCAACGGCGGAGGCAAAGCGGGAGCTACACGAGGCCAAAGCCAGACATGCTGCTCAAAAGCTAAGGAATAGGAAGTCAAAACAAGTACTTGGCGGCCATTTACACCATCACCACCCTCCGGTCGAGGGTGGCGCCGCCGCCACACATCTCGGCGGAGTAAATGTTCCGGCTTATCCTATAGTCAGCCCGGATGGGTACTTTCCCGGACATAAAATTTAA

mRNA sequence

ATGCCATTTCCTCCAGTTACTTGTAGTCAAAGCCAAATAGTGTGCTCTGGATGTAAACATCTGTTGATGTATCCTGCCGGAGCAACCTCCATTTGCTGTGCTCTTTGCCATGTTGTATCTCCTGTGCCAACCTCTGGCTTAACGATGGCTCAGCTGGTATGCAGTGGCTGCCACACCCTGCTCATGTACAGGCGTGGGGCGACAAGTGTACAATGTTCTTGTTGTCGCACTGTAAATGCAGCCTCCAAAGCAAATCAGATGGCTCACATCAACTGTGGGAACTGTAGGGTGCTGCTGATGTACCAATCACTTGGATCAATGAAGATGAACCCTAAAAGTATCAACACTTTCTATTGGTTCCTAATGAAAAAGGAATCTCTTTTAATTAGGATCAAGGAACAGGTGGAGAAAGCGTCGGTGAAAACAGCGGAGGAGAGGAAGATTGTGGAGGAGAGAAGAAAGGCGGCAACGGCGGAGGCAAAGCGGGAGCTACACGAGGCCAAAGCCAGACATGCTGCTCAAAAGCTAAGGAATAGGAAGTCAAAACAAGTACTTGGCGGCCATTTACACCATCACCACCCTCCGGTCGAGGGTGGCGCCGCCGCCACACATCTCGGCGGAGTAAATGTTCCGGCTTATCCTATAGTCAGCCCGGATGGGTACTTTCCCGGACATAAAATTTAA

Coding sequence (CDS)

ATGCCATTTCCTCCAGTTACTTGTAGTCAAAGCCAAATAGTGTGCTCTGGATGTAAACATCTGTTGATGTATCCTGCCGGAGCAACCTCCATTTGCTGTGCTCTTTGCCATGTTGTATCTCCTGTGCCAACCTCTGGCTTAACGATGGCTCAGCTGGTATGCAGTGGCTGCCACACCCTGCTCATGTACAGGCGTGGGGCGACAAGTGTACAATGTTCTTGTTGTCGCACTGTAAATGCAGCCTCCAAAGCAAATCAGATGGCTCACATCAACTGTGGGAACTGTAGGGTGCTGCTGATGTACCAATCACTTGGATCAATGAAGATGAACCCTAAAAGTATCAACACTTTCTATTGGTTCCTAATGAAAAAGGAATCTCTTTTAATTAGGATCAAGGAACAGGTGGAGAAAGCGTCGGTGAAAACAGCGGAGGAGAGGAAGATTGTGGAGGAGAGAAGAAAGGCGGCAACGGCGGAGGCAAAGCGGGAGCTACACGAGGCCAAAGCCAGACATGCTGCTCAAAAGCTAAGGAATAGGAAGTCAAAACAAGTACTTGGCGGCCATTTACACCATCACCACCCTCCGGTCGAGGGTGGCGCCGCCGCCACACATCTCGGCGGAGTAAATGTTCCGGCTTATCCTATAGTCAGCCCGGATGGGTACTTTCCCGGACATAAAATTTAA

Protein sequence

MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQSLGSMKMNPKSINTFYWFLMKKESLLIRIKEQVEKASVKTAEERKIVEERRKAATAEAKRELHEAKARHAAQKLRNRKSKQVLGGHLHHHHPPVEGGAAATHLGGVNVPAYPIVSPDGYFPGHKI
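
The protein sequence above corresponds to a translation of the coding sequence. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame 0; the function name `translate` and the short test fragments are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCCATTTCCTCCAGTTACTTGT"))  # prints MPFPPVTC
```

Passing the full CDS string from the section above to `translate` should reproduce the listed protein sequence, provided the assumptions about the genetic code and reading frame hold.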
Homology
BLAST of Clc10G01940 vs. NCBI nr
Match: XP_031738784.1 (protein LSD1 [Cucumis sativus])

HSP 1 Score: 193.0 bits (489), Expect = 2.9e-45
Identity = 89/102 (87.25%), Positives = 98/102 (96.08%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFPPVTC Q+QI+CSGCK+LL+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TL
Sbjct: 1   MPFPPVTCCQNQIMCSGCKNLLIYPAGATSICCALCHAVTPVPTSGLTMARLVCSGCYTL 60

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           LMY RGA SVQCSCCRT+NAAS+ANQMAHINCGNCRVLLMYQ
Sbjct: 61  LMYSRGAKSVQCSCCRTINAASEANQMAHINCGNCRVLLMYQ 102

BLAST of Clc10G01940 vs. NCBI nr
Match: KAE8650032.1 (hypothetical protein Csa_010851 [Cucumis sativus])

HSP 1 Score: 193.0 bits (489), Expect = 2.9e-45
Identity = 89/102 (87.25%), Positives = 98/102 (96.08%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFPPVTC Q+QI+CSGCK+LL+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TL
Sbjct: 378 MPFPPVTCCQNQIMCSGCKNLLIYPAGATSICCALCHAVTPVPTSGLTMARLVCSGCYTL 437

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           LMY RGA SVQCSCCRT+NAAS+ANQMAHINCGNCRVLLMYQ
Sbjct: 438 LMYSRGAKSVQCSCCRTINAASEANQMAHINCGNCRVLLMYQ 479

BLAST of Clc10G01940 vs. NCBI nr
Match: XP_038903355.1 (protein LSD1-like [Benincasa hispida])

HSP 1 Score: 192.2 bits (487), Expect = 4.9e-45
Identity = 90/102 (88.24%), Positives = 96/102 (94.12%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFPPVTC ++ IVCSGCK LL+YPAGAT ICCALCH VSPVPTSGLTMA+LVCSGCHTL
Sbjct: 1   MPFPPVTCCENHIVCSGCKILLIYPAGATFICCALCHTVSPVPTSGLTMARLVCSGCHTL 60

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           LMY RGATSVQCSCCRTVNAAS+ANQMAHINCGNCRV+LMYQ
Sbjct: 61  LMYSRGATSVQCSCCRTVNAASEANQMAHINCGNCRVVLMYQ 102

BLAST of Clc10G01940 vs. NCBI nr
Match: XP_008448703.1 (PREDICTED: protein LOL1 isoform X1 [Cucumis melo])

HSP 1 Score: 190.7 bits (483), Expect = 1.4e-44
Identity = 94/126 (74.60%), Positives = 106/126 (84.13%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TL
Sbjct: 51  MPFPSVTCCQNQIMCSGCKNMLIYPAGATSICCALCHAVTPVPTSGLTMARLVCSGCYTL 110

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ-SLGSMKMNPKSINTFYW 120
           LMY RGATSVQCSCCRT+NAASKANQMAH NCGNCRVLLMYQ    S+K    +  T   
Sbjct: 111 LMYSRGATSVQCSCCRTINAASKANQMAHTNCGNCRVLLMYQCEAHSVKCTLCNFVTSVG 170

Query: 121 FLMKKE 126
            L K+E
Sbjct: 171 ILRKRE 176

BLAST of Clc10G01940 vs. NCBI nr
Match: XP_008448702.2 (PREDICTED: protein LSD1 isoform X2 [Cucumis melo] >XP_016900659.1 PREDICTED: protein LSD1 isoform X2 [Cucumis melo] >XP_016900660.1 PREDICTED: protein LSD1 isoform X2 [Cucumis melo])

HSP 1 Score: 181.4 bits (459), Expect = 8.6e-42
Identity = 94/139 (67.63%), Positives = 106/139 (76.26%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TL
Sbjct: 1   MPFPSVTCCQNQIMCSGCKNMLIYPAGATSICCALCHAVTPVPTSGLTMARLVCSGCYTL 60

Query: 61  LMYRRGATSVQCSCCRTVNAASK-------------ANQMAHINCGNCRVLLMYQ-SLGS 120
           LMY RGATSVQCSCCRT+NAASK             ANQMAH NCGNCRVLLMYQ    S
Sbjct: 61  LMYSRGATSVQCSCCRTINAASKESYQIDFELWLQTANQMAHTNCGNCRVLLMYQCEAHS 120

Query: 121 MKMNPKSINTFYWFLMKKE 126
           +K    +  T    L K+E
Sbjct: 121 VKCTLCNFVTSVGILRKRE 139

BLAST of Clc10G01940 vs. ExPASy Swiss-Prot
Match: Q0J7V9 (Protein LSD1 OS=Oryza sativa subsp. japonica OX=39947 GN=LSD1 PE=2 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 1.2e-38
Identity = 74/102 (72.55%), Positives = 86/102 (84.31%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           +PF P   +QSQ+VCSGC++LLMYPAGATS+CCA+C  V+ VP  G  MAQLVC GCHTL
Sbjct: 54  VPFTPPNGAQSQLVCSGCRNLLMYPAGATSVCCAVCSTVTAVPAPGTEMAQLVCGGCHTL 113

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           LMY RGATSVQCSCC TVN A +ANQ+AH+NCGNCR+LLMYQ
Sbjct: 114 LMYIRGATSVQCSCCHTVNLAMEANQVAHVNCGNCRMLLMYQ 155

BLAST of Clc10G01940 vs. ExPASy Swiss-Prot
Match: Q93ZB1 (Protein LOL1 OS=Arabidopsis thaliana OX=3702 GN=LOL1 PE=2 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 4.3e-36
Identity = 70/96 (72.92%), Positives = 81/96 (84.38%), Query Frame = 0

Query: 7   TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRG 66
           T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RG
Sbjct: 29  TSGQSQLVCSGCRNLLMYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRG 88

Query: 67  ATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           ATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Sbjct: 89  ATSVQCSCCHTVNLALEANQVAHVNCGNCMMLLMYQ 124

BLAST of Clc10G01940 vs. ExPASy Swiss-Prot
Match: Q2QMB3 (Protein LOL2 OS=Oryza sativa subsp. japonica OX=39947 GN=LOL2 PE=2 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 8.2e-27
Identity = 54/93 (58.06%), Positives = 70/93 (75.27%), Query Frame = 0

Query: 10  QSQIVCSGCKHLLMYPAGATSICCALCHVV-SPVPTSGLTMAQLVCSGCHTLLMYRRGAT 69
           QSQIVC GC+++L+YP GA S+CCA+CH V S  P+ G+ +A L+C GC TLLMY R AT
Sbjct: 2   QSQIVCHGCRNILLYPRGAPSVCCAVCHAVSSTAPSPGMDIAHLICGGCRTLLMYTRNAT 61

Query: 70  SVQCSCCRTVNAASKANQMAHINCGNCRVLLMY 102
           SV+CSCC TVN     + +AH+NCG C+ +LMY
Sbjct: 62  SVRCSCCDTVNLVRPVSSIAHLNCGQCQTVLMY 94

BLAST of Clc10G01940 vs. ExPASy Swiss-Prot
Match: Q6ASS2 (Protein LOL3 OS=Oryza sativa subsp. japonica OX=39947 GN=LOL3 PE=2 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 1.3e-24
Identity = 52/94 (55.32%), Positives = 65/94 (69.15%), Query Frame = 0

Query: 10  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGA 69
           QSQIVC GC+ +L YP+GA S+CCALC  ++  P P   + MA L+C GC TLLMY R A
Sbjct: 2   QSQIVCHGCRSVLRYPSGAPSVCCALCQAITTVPPPAPVMEMAHLICGGCRTLLMYTRNA 61

Query: 70  TSVQCSCCRTVNAASKANQMAHINCGNCRVLLMY 102
            +V+CSCC TVN     N +AH++CG CR  LMY
Sbjct: 62  DTVRCSCCSTVNLVRPVNNIAHVSCGQCRTTLMY 95

BLAST of Clc10G01940 vs. ExPASy Swiss-Prot
Match: P94077 (Protein LSD1 OS=Arabidopsis thaliana OX=3702 GN=LSD1 PE=1 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 7.9e-22
Identity = 51/103 (49.51%), Positives = 71/103 (68.93%), Query Frame = 0

Query: 10  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGA 69
           Q Q+VC GC++LLMYP GA+++ CALC+ ++  P P     MA ++C GC T+LMY RGA
Sbjct: 7   QDQLVCHGCRNLLMYPRGASNVRCALCNTINMVPPPPPPHDMAHIICGGCRTMLMYTRGA 66

Query: 70  TSVQCSCCRTVN---------AASKANQMAHINCGNCRVLLMY 102
           +SV+CSCC+T N         A + ++Q+A INCG+CR  LMY
Sbjct: 67  SSVRCSCCQTTNLVPAHSNQVAHAPSSQVAQINCGHCRTTLMY 109

BLAST of Clc10G01940 vs. ExPASy TrEMBL
Match: A0A1S3BKC0 (protein LOL1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490794 PE=4 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 6.9e-45
Identity = 94/126 (74.60%), Positives = 106/126 (84.13%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TL
Sbjct: 51  MPFPSVTCCQNQIMCSGCKNMLIYPAGATSICCALCHAVTPVPTSGLTMARLVCSGCYTL 110

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ-SLGSMKMNPKSINTFYW 120
           LMY RGATSVQCSCCRT+NAASKANQMAH NCGNCRVLLMYQ    S+K    +  T   
Sbjct: 111 LMYSRGATSVQCSCCRTINAASKANQMAHTNCGNCRVLLMYQCEAHSVKCTLCNFVTSVG 170

Query: 121 FLMKKE 126
            L K+E
Sbjct: 171 ILRKRE 176

BLAST of Clc10G01940 vs. ExPASy TrEMBL
Match: A0A1S3BJQ2 (protein LSD1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490794 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 4.2e-42
Identity = 94/139 (67.63%), Positives = 106/139 (76.26%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MPFP VTC Q+QI+CSGCK++L+YPAGATSICCALCH V+PVPTSGLTMA+LVCSGC+TL
Sbjct: 1   MPFPSVTCCQNQIMCSGCKNMLIYPAGATSICCALCHAVTPVPTSGLTMARLVCSGCYTL 60

Query: 61  LMYRRGATSVQCSCCRTVNAASK-------------ANQMAHINCGNCRVLLMYQ-SLGS 120
           LMY RGATSVQCSCCRT+NAASK             ANQMAH NCGNCRVLLMYQ    S
Sbjct: 61  LMYSRGATSVQCSCCRTINAASKESYQIDFELWLQTANQMAHTNCGNCRVLLMYQCEAHS 120

Query: 121 MKMNPKSINTFYWFLMKKE 126
           +K    +  T    L K+E
Sbjct: 121 VKCTLCNFVTSVGILRKRE 139

BLAST of Clc10G01940 vs. ExPASy TrEMBL
Match: A0A6J1CWA3 (protein LSD1-like OS=Momordica charantia OX=3673 GN=LOC111014817 PE=4 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 3.9e-40
Identity = 81/102 (79.41%), Positives = 90/102 (88.24%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           MP PPVTC QS++VCSGCK+LL+YP GATSICC LCH VSPVPT GL MA+LVC GCHTL
Sbjct: 1   MPIPPVTCCQSRMVCSGCKNLLIYPVGATSICCTLCHSVSPVPTPGLEMARLVCKGCHTL 60

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           L++ RGATSVQCSCCRTVN+ASKANQ A INC NCR+LLMYQ
Sbjct: 61  LLFSRGATSVQCSCCRTVNSASKANQTAEINCRNCRMLLMYQ 102

BLAST of Clc10G01940 vs. ExPASy TrEMBL
Match: A0A0A9NHW2 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 3.4e-36
Identity = 74/102 (72.55%), Positives = 86/102 (84.31%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           +PF P   SQSQ+VC+GC++LLMYPAGATS+CCA+C  V+ VP  G  MAQLVC GCHTL
Sbjct: 13  VPFTPPNGSQSQLVCTGCRNLLMYPAGATSVCCAICSTVTSVPAPGTEMAQLVCGGCHTL 72

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           LMY RGATSVQCSCC TVN A +ANQ+AH+NCGNCR+LLMYQ
Sbjct: 73  LMYIRGATSVQCSCCHTVNLAMEANQVAHVNCGNCRMLLMYQ 114

BLAST of Clc10G01940 vs. ExPASy TrEMBL
Match: A0A0E0EI65 (Uncharacterized protein OS=Oryza meridionalis OX=40149 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 4.5e-36
Identity = 74/102 (72.55%), Positives = 86/102 (84.31%), Query Frame = 0

Query: 1   MPFPPVTCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTL 60
           +PF P   +QSQ+VCSGC++LLMYPAGATS+CCA+C  V+ VP  G  MAQLVC GCHTL
Sbjct: 13  VPFTPPNGAQSQLVCSGCRNLLMYPAGATSVCCAVCSTVTAVPAPGTEMAQLVCGGCHTL 72

Query: 61  LMYRRGATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           LMY RGATSVQCSCC TVN A +ANQ+AH+NCGNCR+LLMYQ
Sbjct: 73  LMYIRGATSVQCSCCHTVNLAMEANQVAHVNCGNCRMLLMYQ 114

BLAST of Clc10G01940 vs. TAIR 10
Match: AT1G32540.2 (lsd one like 1 )

HSP 1 Score: 152.9 bits (385), Expect = 3.1e-37
Identity = 70/96 (72.92%), Positives = 81/96 (84.38%), Query Frame = 0

Query: 7   TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRG 66
           T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RG
Sbjct: 29  TSGQSQLVCSGCRNLLMYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRG 88

Query: 67  ATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           ATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Sbjct: 89  ATSVQCSCCHTVNLALEANQVAHVNCGNCMMLLMYQ 124

BLAST of Clc10G01940 vs. TAIR 10
Match: AT1G32540.1 (lsd one like 1 )

HSP 1 Score: 152.9 bits (385), Expect = 3.1e-37
Identity = 70/96 (72.92%), Positives = 81/96 (84.38%), Query Frame = 0

Query: 7   TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRG 66
           T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RG
Sbjct: 62  TSGQSQLVCSGCRNLLMYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRG 121

Query: 67  ATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           ATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Sbjct: 122 ATSVQCSCCHTVNLALEANQVAHVNCGNCMMLLMYQ 157

BLAST of Clc10G01940 vs. TAIR 10
Match: AT1G32540.3 (lsd one like 1 )

HSP 1 Score: 152.9 bits (385), Expect = 3.1e-37
Identity = 70/96 (72.92%), Positives = 81/96 (84.38%), Query Frame = 0

Query: 7   TCSQSQIVCSGCKHLLMYPAGATSICCALCHVVSPVPTSGLTMAQLVCSGCHTLLMYRRG 66
           T  QSQ+VCSGC++LLMYP GATS+CCA+C+ V+ VP  G  MAQLVC GCHTLLMY RG
Sbjct: 29  TSGQSQLVCSGCRNLLMYPVGATSVCCAVCNAVTAVPPPGTEMAQLVCGGCHTLLMYIRG 88

Query: 67  ATSVQCSCCRTVNAASKANQMAHINCGNCRVLLMYQ 103
           ATSVQCSCC TVN A +ANQ+AH+NCGNC +LLMYQ
Sbjct: 89  ATSVQCSCCHTVNLALEANQVAHVNCGNCMMLLMYQ 124

BLAST of Clc10G01940 vs. TAIR 10
Match: AT4G20380.2 (LSD1 zinc finger family protein )

HSP 1 Score: 105.5 bits (262), Expect = 5.6e-23
Identity = 51/103 (49.51%), Positives = 71/103 (68.93%), Query Frame = 0

Query: 10  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGA 69
           Q Q+VC GC++LLMYP GA+++ CALC+ ++  P P     MA ++C GC T+LMY RGA
Sbjct: 7   QDQLVCHGCRNLLMYPRGASNVRCALCNTINMVPPPPPPHDMAHIICGGCRTMLMYTRGA 66

Query: 70  TSVQCSCCRTVN---------AASKANQMAHINCGNCRVLLMY 102
           +SV+CSCC+T N         A + ++Q+A INCG+CR  LMY
Sbjct: 67  SSVRCSCCQTTNLVPAHSNQVAHAPSSQVAQINCGHCRTTLMY 109

BLAST of Clc10G01940 vs. TAIR 10
Match: AT4G20380.1 (LSD1 zinc finger family protein )

HSP 1 Score: 105.5 bits (262), Expect = 5.6e-23
Identity = 51/103 (49.51%), Positives = 71/103 (68.93%), Query Frame = 0

Query: 10  QSQIVCSGCKHLLMYPAGATSICCALCHVVS--PVPTSGLTMAQLVCSGCHTLLMYRRGA 69
           Q Q+VC GC++LLMYP GA+++ CALC+ ++  P P     MA ++C GC T+LMY RGA
Sbjct: 2   QDQLVCHGCRNLLMYPRGASNVRCALCNTINMVPPPPPPHDMAHIICGGCRTMLMYTRGA 61

Query: 70  TSVQCSCCRTVN---------AASKANQMAHINCGNCRVLLMY 102
           +SV+CSCC+T N         A + ++Q+A INCG+CR  LMY
Sbjct: 62  SSVRCSCCQTTNLVPAHSNQVAHAPSSQVAQINCGHCRTTLMY 104

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_031738784.1 | 2.9e-45 | 87.25 | protein LSD1 [Cucumis sativus]
KAE8650032.1 | 2.9e-45 | 87.25 | hypothetical protein Csa_010851 [Cucumis sativus]
XP_038903355.1 | 4.9e-45 | 88.24 | protein LSD1-like [Benincasa hispida]
XP_008448703.1 | 1.4e-44 | 74.60 | PREDICTED: protein LOL1 isoform X1 [Cucumis melo]
XP_008448702.2 | 8.6e-42 | 67.63 | PREDICTED: protein LSD1 isoform X2 [Cucumis melo] >XP_016900659.1 PREDICTED: pro...

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q0J7V9 | 1.2e-38 | 72.55 | Protein LSD1 OS=Oryza sativa subsp. japonica OX=39947 GN=LSD1 PE=2 SV=1
Q93ZB1 | 4.3e-36 | 72.92 | Protein LOL1 OS=Arabidopsis thaliana OX=3702 GN=LOL1 PE=2 SV=1
Q2QMB3 | 8.2e-27 | 58.06 | Protein LOL2 OS=Oryza sativa subsp. japonica OX=39947 GN=LOL2 PE=2 SV=1
Q6ASS2 | 1.3e-24 | 55.32 | Protein LOL3 OS=Oryza sativa subsp. japonica OX=39947 GN=LOL3 PE=2 SV=1
P94077 | 7.9e-22 | 49.51 | Protein LSD1 OS=Arabidopsis thaliana OX=3702 GN=LSD1 PE=1 SV=1

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3BKC0 | 6.9e-45 | 74.60 | protein LOL1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490794 PE=4 SV=1
A0A1S3BJQ2 | 4.2e-42 | 67.63 | protein LSD1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490794 PE=4 SV=1
A0A6J1CWA3 | 3.9e-40 | 79.41 | protein LSD1-like OS=Momordica charantia OX=3673 GN=LOC111014817 PE=4 SV=1
A0A0A9NHW2 | 3.4e-36 | 72.55 | Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A0E0EI65 | 4.5e-36 | 72.55 | Uncharacterized protein OS=Oryza meridionalis OX=40149 PE=4 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G32540.2 | 3.1e-37 | 72.92 | lsd one like 1
AT1G32540.1 | 3.1e-37 | 72.92 | lsd one like 1
AT1G32540.3 | 3.1e-37 | 72.92 | lsd one like 1
AT4G20380.2 | 5.6e-23 | 49.51 | LSD1 zinc finger family protein
AT4G20380.1 | 5.6e-23 | 49.51 | LSD1 zinc finger family protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 141..168
None | No IPR available | PANTHER | PTHR31747:SF1 | PROTEIN LOL1 | coord: 6..103
IPR005735 | Zinc finger, LSD1-type | TIGRFAM | TIGR01053 | TIGR01053 | coord: 51..79; e-value: 2.0E-12; score: 44.5
IPR005735 | Zinc finger, LSD1-type | TIGRFAM | TIGR01053 | TIGR01053 | coord: 12..39; e-value: 3.1E-11; score: 40.7
IPR005735 | Zinc finger, LSD1-type | PFAM | PF06943 | zf-LSD1 | coord: 54..78; e-value: 1.7E-11; score: 43.8
IPR005735 | Zinc finger, LSD1-type | PFAM | PF06943 | zf-LSD1 | coord: 15..39; e-value: 8.4E-10; score: 38.3
IPR005513 | Late embryogenesis abundant protein, LEA_1 subgroup | PFAM | PF03760 | LEA_1 | coord: 131..175; e-value: 1.1E-6; score: 29.2
IPR040319 | LSD1-like | PANTHER | PTHR31747 | PROTEIN LSD1 | coord: 6..103

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc10G01940.1 | Clc10G01940.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009793 | embryo development ending in seed dormancy
biological_process | GO:0034051 | negative regulation of plant-type hypersensitive response
biological_process | GO:0045595 | regulation of cell differentiation
cellular_component | GO:0005634 | nucleus