Cla019912 (gene) Watermelon (97103) v1

Name: Cla019912
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Protein ALWAYS EARLY 2 (AHRD V1 *--- ALY2_ARATH)
Location: Chr2: 25844922..25848307 (-)
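As a rough check on the location field, the genomic span of the gene can be derived from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` and `span_length` are illustrative helper names, not part of any database API.

```python
import re

# Location string as shown in the record. The 1-based, inclusive
# coordinate convention is an assumption, not stated by the source.
LOCATION = "Chr2 : 25844922 .. 25848307 (-)"

def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of an inclusive interval."""
    return end - start + 1

chrom, start, end, strand = parse_location(LOCATION)
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 3386 bp on the minus strand of Chr2.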
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTTTCAATCCAATGGATAACTTTCCAGAAGCTTTTAGACGTCAGTGCTGTTCCATCAACACAGCACCTCTCGAATACAAAGACCTACAACAAAATAGCCATCCAAATGTAAGTAGAGAATTGGAAAAAAAATCCAGCCCACACACCACTGATACATTGGTAACACACTAGAGTTCTAAGTATTTTCTAAATCAAAATTACTCTGCATGTTATTAACGTTTTGGATAATAAGATATTTATGCTATCTTCCTGATTTTAATTCTACAATTGTTTGCTGATTTGATGGCAAGTAATTAAATTTTTTTTTTCTTTTCCGATGGATAGATACATATTTCTTGTGAGTTTAATTTGATTTTGGATCATTAGTTTGACAATTGACAAGGCCAGAGTGATACATTTAAAAGCTAGACAATACTTAGATCCTTAGCATGTCTTATTCTCAACAATTGGACTTTTTTGTAACTTTATATCTTTGTTGTTAGTTATGTATGCATAGGTTCCTTTTTCTTTTTAGGTCACATTTTTGTATTACTCATTTGTTTATGCTTCTCAGAGCTCACCACATGTTTGCCCAACATTCAAATATTGCCTCTTAAATTGCTAATTGACCCAAAAGCTTAAGCTTATGGGTGAAGGCAAATTTAATATCATATCATCTAACACTCCCCCTCATTTGTGGGCTTGAAATATGAAGAAAGCCTAACAAGTGGAAACCAATTTTAATTGGGGAGGAAACAACTATGCAGGGGTTTGAACATAGAACCTCCTCAACCAACTTGCTTTGATACCATGTTAAATTGTTGATTGACCAAAAAACTTAACCTTATGGGTGAAGGAAAATTTAATATCGTATCATCTAACATTTTGTGTTTTTAAAGCTTAAGTAGTTTAGACGGCCACATCCATGTTGCATAACTATATAAGCGTGCTTTTATGTGTGCCCAAGAGGTGGTCGGTTGGTGTTGTTCTTCTAGGCTCCTAGCTATTACAGGAAGCTAAACCCCAGTTAAGCCTTGTAGCATTCTAATGGTAGAAGAGATAGGCGCCTCTTGTCAACCAATGAGACTTAAGTGATGTCATTTTATGTTGAGGCTTAAGTTTAAAAACATGATTTTTATATATTTTTTTTTTTTGAAAAAGGAAACGAGCCTCTTCGTTATAAGATAATGAAATGAGACATACAAGATAATAGTAGATACAATAAACCGAGACCAAAGGATCAGTGCACCCGGGCATCTCAACTAGGTTGACACCCCCATAACACTCTCATCATATCCAATACATGTGTACAAAAGTGACAACGAACAAAATGAGCAATAAAAACAACCAAACTAGCAAAAATACATCAAAACAAACAATCTATCCCAAAATACAACATTGGGGAATATAAAAGCCTTCCAATTGATGCAGATATCTTGAATGGAGAAGTCTTCAAAGGTCTTGGAAAGAGAGCACCATGATGAAACATTTAACCTATAAAAAAACATGATTTTTTTATTTTAATTTTTAAATATTTTGTTTATGGCAAAAAGACTATCCAATAGCCTACATTTGGCTTAAAGTTTTATTTTGGAGAATTGATTGCATTTGTCCTACATTAGTTGGTAATTAAGAAAGAATATGCCAAGGGAAGTGAATAAGGAGATAATTGCATGCATATAGGAATAGGGCCTTCGGGCCCACAAATTCACCAGGAATGACCAAACAATTATGTAAGAGATTCCTTTAAATAAGACTTTGAAGCGAGCTAAGCTTGAAAAATTGGGTTCAAAAACTGAAGTTGGATAATATGCATGTTTAGGTTGCAATCTGTCCAATTAACTAGAAACCCAAACTAATCAAAAACACCTATAGCAAAAGAACTTTAAAGTGGCCCAGGAAATAGTTATTTATACTCTGTACCTTCACAACTTATTAGCCATCTCCATGCGCCTTCCGTTAATATACATTTGTGTGTTGAATCTGTTATTTGCTTCAACTTCAAACAAACATCTTAA
TTGATTATGTGCATCTGTAGGTTCCTTCCACCACGTTTAACCTGAAGCAGCATAATACTTTCTCTGGGAACTCATTGCCTCTGTGGCTGATGCCTCCTGCCAATACCAGAGCACTTAGTAGCATCCCTTGTTCTTCAAATGTTTCTCAAGGATCGGGATGTGGGGCAGTTGATATTGTCAAAGGTTCGAGGGAAAAGGCACAATTGATGGTAAATGTTGCTATTGAGGTACTCACCCTCTTTCTCTTGCACATAACATCATGGATATGCAGTGTAATTGGATAAGAACTACACCTTTTACATGACTTTGTGGTGGACAACTGGTGAAACATAATGAATACTGGATTATAAGAAACCATGAAGCTTGGTTGAGAGGCAGTTTTGAACTGGATAGTGGATAGTTTTCAGATGATATTGATGGCTTTTCTCTTCCCATTTTCTACTTGAAATAATATTAGTATTGAGATAATTGTGGAGATGTACGACAAAATAAGAATTTGAATATGAGGGTGGGCGCTTTACTGACCAATTTATTTACTGGTGGTATCTGTTTTTCTAATTTGAAATTGATCTAAAACTACTCTAGCGAGTTTAGGGTTTTGTTCTTAGGCTGATTGGTACCAATGAACGCCATAATGTATTTCATGTTTTCTTTTTGAGTTTTTGTGCATATAATATCTCAGGTTGAATGCTTAGCTTCCCTCGTCTCCTTATTTCTTTGCAACTTCAACTATTATTACGTTTTCTTTTTGTGCATGAAATATCTCAGGTTGAGTGTTGGGTGCTTTGCTTTCCTGATCTCGTCATTTTCCATTTTTCCTTGCAACTTCAGTGAACAAGTTAGATCTTTAAAAAATTGATTGGCAATTTCCTTGAATTCTACGCTTTTGGTTTTCTTCACTCTTTTCTTAGTGTTGTAGTTATCGGAATGCTAGTGCTCAAGTAGTTGTCAAAGTGAGATTCCATGATACCATTAATTTTCAGGTCTTGTTGAGCAAGAACGATGGTGATGATCCTCTTACAAGTATTTGTGGTGCCTTGCATTCTTTTGATGATCAGATTTCGTCGTTTGAGGTTCAAAAACCTTCAAGCATGTCTCAAGATATGAACGATAGCCTAGGAGCCCACTTTAATCAGTTGTTCCCGTCAAAACACCTTTCTAGTGGTGCTCTATCTAGTCTGAGATCAAGACATTCCAATAGAGATTATGGAGGAATTCCGTCAAATCTAATCACTTCATGTGTGGCTACTTTGCTCATGATACAGGTAATTGTTAGTTTCTGAGTTAACTATGATTCAGGGTAAAAAACTGTTTTTTTTTCCCCTTATTGTGACAATACTGATATAGGTTATTCTTATTTTCCTGCCATTACATGCATCTAGATGA

mRNA sequence

ATGCCTTTCAATCCAATGGATAACTTTCCAGAAGCTTTTAGACGTCAGTGCTGTTCCATCAACACAGCACCTCTCGAATACAAAGACCTACAACAAAATAGCCATCCAAATGTTCCTTCCACCACGTTTAACCTGAAGCAGCATAATACTTTCTCTGGGAACTCATTGCCTCTGTGGCTGATGCCTCCTGCCAATACCAGAGCACTTAGTAGCATCCCTTGTTCTTCAAATGTTTCTCAAGGATCGGGATGTGGGGCAGTTGATATTGTCAAAGGTTCGAGGGAAAAGGCACAATTGATGGTAAATGTTGCTATTGAGGTCTTGTTGAGCAAGAACGATGGTGATGATCCTCTTACAAGTATTTGTGGTGCCTTGCATTCTTTTGATGATCAGATTTCGTCGTTTGAGGTTCAAAAACCTTCAAGCATGTCTCAAGATATGAACGATAGCCTAGGAGCCCACTTTAATCAGTTGTTCCCGTCAAAACACCTTTCTAGTGGTGCTCTATCTAGTCTGAGATCAAGACATTCCAATAGAGATTATGGAGGAATTCCGTCAAATCTAATCACTTCATGTGTGGCTACTTTGCTCATGATACAGGTTATTCTTATTTTCCTGCCATTACATGCATCTAGATGA

Coding sequence (CDS)

ATGCCTTTCAATCCAATGGATAACTTTCCAGAAGCTTTTAGACGTCAGTGCTGTTCCATCAACACAGCACCTCTCGAATACAAAGACCTACAACAAAATAGCCATCCAAATGTTCCTTCCACCACGTTTAACCTGAAGCAGCATAATACTTTCTCTGGGAACTCATTGCCTCTGTGGCTGATGCCTCCTGCCAATACCAGAGCACTTAGTAGCATCCCTTGTTCTTCAAATGTTTCTCAAGGATCGGGATGTGGGGCAGTTGATATTGTCAAAGGTTCGAGGGAAAAGGCACAATTGATGGTAAATGTTGCTATTGAGGTCTTGTTGAGCAAGAACGATGGTGATGATCCTCTTACAAGTATTTGTGGTGCCTTGCATTCTTTTGATGATCAGATTTCGTCGTTTGAGGTTCAAAAACCTTCAAGCATGTCTCAAGATATGAACGATAGCCTAGGAGCCCACTTTAATCAGTTGTTCCCGTCAAAACACCTTTCTAGTGGTGCTCTATCTAGTCTGAGATCAAGACATTCCAATAGAGATTATGGAGGAATTCCGTCAAATCTAATCACTTCATGTGTGGCTACTTTGCTCATGATACAGGTTATTCTTATTTTCCTGCCATTACATGCATCTAGATGA

Protein sequence

MPFNPMDNFPEAFRRQCCSINTAPLEYKDLQQNSHPNVPSTTFNLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSNVSQGSGCGAVDIVKGSREKAQLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQLFPSKHLSSGALSSLRSRHSNRDYGGIPSNLITSCVATLLMIQVILIFLPLHASR
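The protein sequence is the codon-by-codon translation of the CDS. A minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is an illustrative helper, and only the first 24 nucleotides of the CDS are used here:

```python
# Standard genetic code (NCBI translation table 1), written in the
# conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[p:p + 3]] for p in range(0, usable, 3))

# First 24 nucleotides of the CDS listed above.
cds_start = "ATGCCTTTCAATCCAATGGATAAC"
print(translate(cds_start))  # MPFNPMDN
```

The result matches the first eight residues of the protein sequence above.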
BLAST of Cla019912 vs. Swiss-Prot
Match: ALY2_ARATH (Protein ALWAYS EARLY 2 OS=Arabidopsis thaliana GN=ALY2 PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 1.3e-07
Identity = 42/117 (35.90%), Positives = 62/117 (52.99%), Query Frame = 1

Query: 87  VDIVKGSREKAQLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMS-- 146
           ++IVKGS+ +AQ MV+ AI+   S  +G+D  T I  AL    + +   ++ + S +   
Sbjct: 887 LEIVKGSKTRAQAMVDAAIKAASSVKEGEDVNTMIQEAL----ELVGKNQLLRSSMVKHH 946

Query: 147 QDMNDSLGAHFNQLFPSKHLSSGALSSLRSRHSNRDYGGIPSNLITSCVATLLMIQV 202
           + +N S+  H N   PS      A + L S+  +     +PS LITSCVAT LMIQ+
Sbjct: 947 EHVNGSIEHHHNPS-PSNGSEPVANNDLNSQDGSEKNAQMPSELITSCVATWLMIQM 998
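The percentages in an HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them for the Swiss-Prot hit; `hsp_percentages` is an illustrative helper, and the regular expression is only an assumption about how such a line might be parsed.

```python
import re

# HSP summary values from the Swiss-Prot hit above.
HSP_LINE = "Identity = 42/117 (35.90%), Positives = 62/117 (52.99%), Query Frame = 1"

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the match counts."""
    # The optional 'i' tolerates a dropped letter in "Positives".
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return 100 * ident / ident_len, 100 * pos / pos_len

identity, positives = hsp_percentages(HSP_LINE)
print(f"{identity:.2f} {positives:.2f}")  # 35.90 52.99
```

Identity counts exact residue matches, while positives also counts the conservative substitutions marked with `+` in the alignment midline, so the second figure is never lower than the first.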

BLAST of Cla019912 vs. TrEMBL
Match: A0A0A0LLU7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G030020 PE=4 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 1.9e-85
Identity = 165/203 (81.28%), Positives = 176/203 (86.70%), Query Frame = 1

Query: 1   MPFNPMDNFPEAFRRQCCSINTAPLEYKDLQQNSHPNVPSTTFNLKQHNTFSGNSLPLWL 60
           MPFNPMDNFPE FRRQ CSIN APLEYK+LQ+N+HPNVPSTTFNLKQHNTFSGNSL    
Sbjct: 692 MPFNPMDNFPETFRRQICSINRAPLEYKELQRNNHPNVPSTTFNLKQHNTFSGNSLA--- 751

Query: 61  MPPANTRALSSIPCSSNVSQGSGCGAVDIVKGSREKAQLMVNVAIEVLLSKNDGDDPLTS 120
             PAN RAL SIPCS NVSQGSG GAVDIV+GSREKAQ+MVNVAIEVLLSKNDGDDPLT 
Sbjct: 752 --PANARALGSIPCSLNVSQGSGRGAVDIVQGSREKAQMMVNVAIEVLLSKNDGDDPLTI 811

Query: 121 ICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQLFPSKHLSSGALSSLRSRHSNRD 180
           I GALHS D+Q SSF+VQKPSSMSQ+M D LGAH  +LFPSKHLS+  LSSLRSRH NRD
Sbjct: 812 IYGALHSSDNQNSSFKVQKPSSMSQNMKDCLGAHVKELFPSKHLSTADLSSLRSRHFNRD 871

Query: 181 YGGIPSNLITSCVATLLMIQVIL 204
           Y GIPSNLITSCVATLLMIQ  +
Sbjct: 872 YRGIPSNLITSCVATLLMIQACI 889

BLAST of Cla019912 vs. TrEMBL
Match: A0A067JU30_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23338 PE=4 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 3.3e-18
Identity = 63/166 (37.95%), Positives = 92/166 (55.42%), Query Frame = 1

Query: 40   STTFNLKQHNTFSGNSLPLWLMPPANTRALSSIPC--SSNVSQGSGCGAVDIVKGSREKA 99
            S   NL+QH+TFSGN+LP WL PPAN      +P    S +SQ SG   V+IV+ SR KA
Sbjct: 893  SALVNLRQHHTFSGNTLPPWLKPPANISLPGGLPGLHDSFISQESGSAVVEIVRSSRHKA 952

Query: 100  QLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQ 159
              M++ A++ + S  +G+D    I  AL S D +  + E +     S +  + + +H NQ
Sbjct: 953  HTMIDAAVQAISSMKEGEDAFMKIGEALDSIDKRQLASESKVQVIRSPEQVNGILSHQNQ 1012

Query: 160  LF--PSKHLSSGALSSLRSR-HSNRDYGGIPSNLITSCVATLLMIQ 201
                 S   ++   S  +S+ ++ +    IPS LI SCVATLLM+Q
Sbjct: 1013 FISRTSDPQANSTTSDPKSQDNAEKVETAIPSELIKSCVATLLMLQ 1058

BLAST of Cla019912 vs. TrEMBL
Match: W9SEH2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006676 PE=4 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 3.3e-18
Identity = 67/172 (38.95%), Positives = 95/172 (55.23%), Query Frame = 1

Query: 35  HPNVPSTTFNLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSN---VSQGSGCGAVDIVK 94
           +  V S   +L+Q N++ GN+L  WL  PAN    S +P S +   + Q SG   ++IVK
Sbjct: 766 YATVSSALLDLRQRNSYRGNALLPWLKAPANIGVHSVLPGSLDSFSIPQDSGSSVIEIVK 825

Query: 95  GSREKAQLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSL 154
           GS  KAQ MV+ AI+   S+ +G+D    I  AL S D+ ++S      +     +N +L
Sbjct: 826 GSTVKAQAMVDAAIQAFSSRGEGEDAYAKIREALDSMDNSLTSDSRVSMNRTQDQVNGNL 885

Query: 155 GAHFNQLFPSKHLSSGAL--SSLRSR-HSNRDYGGIPSNLITSCVATLLMIQ 201
           G H NQ   S      A+  S+L SR  S ++   +PS +ITSCVATLLMIQ
Sbjct: 886 G-HRNQQLSSTSEPVHAVDSSALNSRTDSEKNEAQVPSEVITSCVATLLMIQ 936

BLAST of Cla019912 vs. TrEMBL
Match: W9R5N4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001430 PE=4 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 3.3e-18
Identity = 67/172 (38.95%), Positives = 95/172 (55.23%), Query Frame = 1

Query: 35  HPNVPSTTFNLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSN---VSQGSGCGAVDIVK 94
           +  V S   +L+Q N++ GN+L  WL  PAN    S +P S +   + Q SG   ++IVK
Sbjct: 753 YATVSSALLDLRQRNSYPGNALLPWLKAPANIGVHSVLPGSLDSFSIPQDSGSSVIEIVK 812

Query: 95  GSREKAQLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSL 154
           GS  KAQ MV+ AI+   S+ +G+D    I  AL S D+ ++S      +     +N +L
Sbjct: 813 GSTVKAQAMVDAAIQAFSSRGEGEDAYAKIREALDSMDNSLTSDSRVSMNRTQDQVNGNL 872

Query: 155 GAHFNQLFPSKHLSSGAL--SSLRSR-HSNRDYGGIPSNLITSCVATLLMIQ 201
           G H NQ   S      A+  S+L SR  S ++   +PS +ITSCVATLLMIQ
Sbjct: 873 G-HRNQQLSSTSEPVHAVDSSALNSRTDSEKNEAQVPSEVITSCVATLLMIQ 923

BLAST of Cla019912 vs. TrEMBL
Match: M5WJB8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000472mg PE=4 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 3.1e-16
Identity = 64/188 (34.04%), Positives = 100/188 (53.19%), Query Frame = 1

Query: 21   NTAPLEYKDLQ--QNSHPNVPSTTFNLKQHNTFSGNSLPLWLMPPANTRALSSIPCS--S 80
            N+     KD +  +  +  V S   NL+Q NT+  NSLP WL  PAN+     +P S  S
Sbjct: 905  NSGECSLKDSEPFKKHYATVSSALLNLRQRNTYPANSLPPWLKQPANSTIYGGLPSSFDS 964

Query: 81   NVSQGSGCGAVDIVKGSREKAQLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFE 140
            ++SQ SG    +IV+ SR KA +MVN AI+ + S+  G+D    I  AL S D+Q    +
Sbjct: 965  SISQESGSSVAEIVEVSRSKAHMMVNAAIQAMSSRKGGEDAYVRIREALDSIDNQHLPSD 1024

Query: 141  VQKPSSMSQD-MNDSLGAHFNQLFPS---KHLSSGALSSLRSRHSNRDYGGIPSNLITSC 200
             +   + SQ+ +N +LG H NQL  S    + +S +     +  + +    + S++I++C
Sbjct: 1025 SRLSLNRSQEQVNGNLG-HRNQLISSTSDPNFTSDSPGPKPNTDTEKTEAQVLSDIISAC 1084

BLAST of Cla019912 vs. NCBI nr
Match: gi|778666893|ref|XP_011648836.1| (PREDICTED: protein ALWAYS EARLY 3-like isoform X3 [Cucumis sativus])

HSP 1 Score: 323.6 bits (828), Expect = 2.7e-85
Identity = 165/203 (81.28%), Positives = 176/203 (86.70%), Query Frame = 1

Query: 1   MPFNPMDNFPEAFRRQCCSINTAPLEYKDLQQNSHPNVPSTTFNLKQHNTFSGNSLPLWL 60
           MPFNPMDNFPE FRRQ CSIN APLEYK+LQ+N+HPNVPSTTFNLKQHNTFSGNSL    
Sbjct: 692 MPFNPMDNFPETFRRQICSINRAPLEYKELQRNNHPNVPSTTFNLKQHNTFSGNSLA--- 751

Query: 61  MPPANTRALSSIPCSSNVSQGSGCGAVDIVKGSREKAQLMVNVAIEVLLSKNDGDDPLTS 120
             PAN RAL SIPCS NVSQGSG GAVDIV+GSREKAQ+MVNVAIEVLLSKNDGDDPLT 
Sbjct: 752 --PANARALGSIPCSLNVSQGSGRGAVDIVQGSREKAQMMVNVAIEVLLSKNDGDDPLTI 811

Query: 121 ICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQLFPSKHLSSGALSSLRSRHSNRD 180
           I GALHS D+Q SSF+VQKPSSMSQ+M D LGAH  +LFPSKHLS+  LSSLRSRH NRD
Sbjct: 812 IYGALHSSDNQNSSFKVQKPSSMSQNMKDCLGAHVKELFPSKHLSTADLSSLRSRHFNRD 871

Query: 181 YGGIPSNLITSCVATLLMIQVIL 204
           Y GIPSNLITSCVATLLMIQ  +
Sbjct: 872 YRGIPSNLITSCVATLLMIQACI 889

BLAST of Cla019912 vs. NCBI nr
Match: gi|659071560|ref|XP_008460621.1| (PREDICTED: protein ALWAYS EARLY 2-like [Cucumis melo])

HSP 1 Score: 313.2 bits (801), Expect = 3.6e-82
Identity = 163/220 (74.09%), Positives = 174/220 (79.09%), Query Frame = 1

Query: 1    MPFNPMDNFPEAFRRQCCSINTAPLEYKDLQQNSHPNV-----------------PSTTF 60
            MPFNPMDNFPE FRRQ CSIN APL YK+L++N+HPNV                 PSTTF
Sbjct: 809  MPFNPMDNFPETFRRQICSINRAPLAYKELRRNNHPNVSRELEKRSSPLTTDTSVPSTTF 868

Query: 61   NLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSNVSQGSGCGAVDIVKGSREKAQLMVNV 120
            NL+QHNTFSGNSL      PANTRAL SIPCS NVSQ SGCGAVDIVKGSREKAQ+MVNV
Sbjct: 869  NLQQHNTFSGNSLA-----PANTRALGSIPCSLNVSQRSGCGAVDIVKGSREKAQMMVNV 928

Query: 121  AIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQLFPSKH 180
            AIEV LSKNDGDDPLT IC ALH FD+Q SSF+VQKP S  QD  DSLGAH N+LFPSKH
Sbjct: 929  AIEVWLSKNDGDDPLTIICDALHCFDNQNSSFKVQKPLSTLQDTKDSLGAHINELFPSKH 988

Query: 181  LSSGALSSLRSRHSNRDYGGIPSNLITSCVATLLMIQVIL 204
            LS+  LSSLRSRH NRDYGGIPSNLITSCVATLLMIQ  +
Sbjct: 989  LSTADLSSLRSRHFNRDYGGIPSNLITSCVATLLMIQACI 1023

BLAST of Cla019912 vs. NCBI nr
Match: gi|778666891|ref|XP_011648835.1| (PREDICTED: protein ALWAYS EARLY 2-like isoform X2 [Cucumis sativus])

HSP 1 Score: 312.8 bits (800), Expect = 4.7e-82
Identity = 165/220 (75.00%), Positives = 176/220 (80.00%), Query Frame = 1

Query: 1   MPFNPMDNFPEAFRRQCCSINTAPLEYKDLQQNSHPNV-----------------PSTTF 60
           MPFNPMDNFPE FRRQ CSIN APLEYK+LQ+N+HPNV                 PSTTF
Sbjct: 690 MPFNPMDNFPETFRRQICSINRAPLEYKELQRNNHPNVSRELEKRSSPLTTDTSVPSTTF 749

Query: 61  NLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSNVSQGSGCGAVDIVKGSREKAQLMVNV 120
           NLKQHNTFSGNSL      PAN RAL SIPCS NVSQGSG GAVDIV+GSREKAQ+MVNV
Sbjct: 750 NLKQHNTFSGNSLA-----PANARALGSIPCSLNVSQGSGRGAVDIVQGSREKAQMMVNV 809

Query: 121 AIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQLFPSKH 180
           AIEVLLSKNDGDDPLT I GALHS D+Q SSF+VQKPSSMSQ+M D LGAH  +LFPSKH
Sbjct: 810 AIEVLLSKNDGDDPLTIIYGALHSSDNQNSSFKVQKPSSMSQNMKDCLGAHVKELFPSKH 869

Query: 181 LSSGALSSLRSRHSNRDYGGIPSNLITSCVATLLMIQVIL 204
           LS+  LSSLRSRH NRDY GIPSNLITSCVATLLMIQ  +
Sbjct: 870 LSTADLSSLRSRHFNRDYRGIPSNLITSCVATLLMIQACI 904

BLAST of Cla019912 vs. NCBI nr
Match: gi|778666888|ref|XP_011648834.1| (PREDICTED: protein ALWAYS EARLY 2-like isoform X1 [Cucumis sativus])

HSP 1 Score: 312.8 bits (800), Expect = 4.7e-82
Identity = 165/220 (75.00%), Positives = 176/220 (80.00%), Query Frame = 1

Query: 1   MPFNPMDNFPEAFRRQCCSINTAPLEYKDLQQNSHPNV-----------------PSTTF 60
           MPFNPMDNFPE FRRQ CSIN APLEYK+LQ+N+HPNV                 PSTTF
Sbjct: 692 MPFNPMDNFPETFRRQICSINRAPLEYKELQRNNHPNVSRELEKRSSPLTTDTSVPSTTF 751

Query: 61  NLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSNVSQGSGCGAVDIVKGSREKAQLMVNV 120
           NLKQHNTFSGNSL      PAN RAL SIPCS NVSQGSG GAVDIV+GSREKAQ+MVNV
Sbjct: 752 NLKQHNTFSGNSLA-----PANARALGSIPCSLNVSQGSGRGAVDIVQGSREKAQMMVNV 811

Query: 121 AIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSLGAHFNQLFPSKH 180
           AIEVLLSKNDGDDPLT I GALHS D+Q SSF+VQKPSSMSQ+M D LGAH  +LFPSKH
Sbjct: 812 AIEVLLSKNDGDDPLTIIYGALHSSDNQNSSFKVQKPSSMSQNMKDCLGAHVKELFPSKH 871

Query: 181 LSSGALSSLRSRHSNRDYGGIPSNLITSCVATLLMIQVIL 204
           LS+  LSSLRSRH NRDY GIPSNLITSCVATLLMIQ  +
Sbjct: 872 LSTADLSSLRSRHFNRDYRGIPSNLITSCVATLLMIQACI 906

BLAST of Cla019912 vs. NCBI nr
Match: gi|703131672|ref|XP_010104934.1| (hypothetical protein L484_006676 [Morus notabilis])

HSP 1 Score: 100.1 bits (248), Expect = 4.8e-18
Identity = 67/172 (38.95%), Positives = 95/172 (55.23%), Query Frame = 1

Query: 35  HPNVPSTTFNLKQHNTFSGNSLPLWLMPPANTRALSSIPCSSN---VSQGSGCGAVDIVK 94
           +  V S   +L+Q N++ GN+L  WL  PAN    S +P S +   + Q SG   ++IVK
Sbjct: 766 YATVSSALLDLRQRNSYRGNALLPWLKAPANIGVHSVLPGSLDSFSIPQDSGSSVIEIVK 825

Query: 95  GSREKAQLMVNVAIEVLLSKNDGDDPLTSICGALHSFDDQISSFEVQKPSSMSQDMNDSL 154
           GS  KAQ MV+ AI+   S+ +G+D    I  AL S D+ ++S      +     +N +L
Sbjct: 826 GSTVKAQAMVDAAIQAFSSRGEGEDAYAKIREALDSMDNSLTSDSRVSMNRTQDQVNGNL 885

Query: 155 GAHFNQLFPSKHLSSGAL--SSLRSR-HSNRDYGGIPSNLITSCVATLLMIQ 201
           G H NQ   S      A+  S+L SR  S ++   +PS +ITSCVATLLMIQ
Sbjct: 886 G-HRNQQLSSTSEPVHAVDSSALNSRTDSEKNEAQVPSEVITSCVATLLMIQ 936

The following BLAST results are available for this feature:

Swiss-Prot
Match Name                        E-value  Identity (%)  Description
ALY2_ARATH                        1.3e-07  35.90         Protein ALWAYS EARLY 2 OS=Arabidopsis thaliana GN=ALY2 PE=1 SV=1

TrEMBL
Match Name                        E-value  Identity (%)  Description
A0A0A0LLU7_CUCSA                  1.9e-85  81.28         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G030020 PE=4 SV=1
A0A067JU30_JATCU                  3.3e-18  37.95         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23338 PE=4 SV=1
W9SEH2_9ROSA                      3.3e-18  38.95         Uncharacterized protein OS=Morus notabilis GN=L484_006676 PE=4 SV=1
W9R5N4_9ROSA                      3.3e-18  38.95         Uncharacterized protein OS=Morus notabilis GN=L484_001430 PE=4 SV=1
M5WJB8_PRUPE                      3.1e-16  34.04         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000472mg PE=4 SV=1

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|778666893|ref|XP_011648836.1|  2.7e-85  81.28         PREDICTED: protein ALWAYS EARLY 3-like isoform X3 [Cucumis sativus]
gi|659071560|ref|XP_008460621.1|  3.6e-82  74.09         PREDICTED: protein ALWAYS EARLY 2-like [Cucumis melo]
gi|778666891|ref|XP_011648835.1|  4.7e-82  75.00         PREDICTED: protein ALWAYS EARLY 2-like isoform X2 [Cucumis sativus]
gi|778666888|ref|XP_011648834.1|  4.7e-82  75.00         PREDICTED: protein ALWAYS EARLY 2-like isoform X1 [Cucumis sativus]
gi|703131672|ref|XP_010104934.1|  4.8e-18  38.95         hypothetical protein L484_006676 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR010561   LIN-9/ALY1
Vocabulary: Biological Process
Term        Definition
GO:0006351  transcription, DNA-templated
GO:0007049  cell cycle
Vocabulary: Cellular Component
Term        Definition
GO:0017053  transcriptional repressor complex
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007049 cell cycle
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0051726 regulation of cell cycle
biological_process GO:0000003 reproduction
cellular_component GO:0017053 transcriptional repressor complex
cellular_component GO:0005634 nucleus
cellular_component GO:0005654 nucleoplasm
molecular_function GO:0003677 DNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla019912     Cla019912.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                     Source   Source Term    Source Description              Alignment
IPR010561  Protein LIN-9/Protein ALWAYS EARLY  PANTHER  PTHR21689      LIN-9                           coord: 37..202; score: 1.1
None       No IPR available                    PANTHER  PTHR21689:SF3  PROTEIN ALWAYS EARLY 1-RELATED  coord: 37..202; score: 1.1