ClCG01G014820 (gene) Watermelon (Charleston Gray)

Name: ClCG01G014820
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: DNA-binding protein RHL1
Location: CG_Chr01 : 29160066 .. 29170713 (-)
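
The location line can be turned into a gene length and a strand-aware sequence with a few lines of Python. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive (the usual convention in GFF-style annotation, but not stated on this page), and the helper names `span_length` and `reverse_complement` are hypothetical. The sequences listed on this page appear to be in transcript orientation already, since the gene sequence starts with the same bases as the mRNA, so reverse-complementing would only be needed when extracting the region from the forward strand of the chromosome.

```python
# Assumption: coordinates are 1-based and inclusive.
GENE_START = 29160066
GENE_END = 29170713
STRAND = "-"


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based interval."""
    return end - start + 1


# Lookup table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (A/C/G/T only; other symbols pass through)."""
    return seq.translate(_COMPLEMENT)[::-1]


gene_length = span_length(GENE_START, GENE_END)
print(f"CG_Chr01:{GENE_START}-{GENE_END} ({STRAND}) spans {gene_length} bp")
```

For a minus-strand feature, a slice of the forward chromosome sequence would be passed through `reverse_complement` before comparison with the sequences below.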
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAGAGTCAAGGATCCCCAAGAGTGAACGGCAGTCCGTAGCTGACTATCGGCATCAGAAATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATCCGGAGATTGCAGCACGAAAGCGGCTTAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGTTCGTCTGTTCTTCCATGCTCTAAACGCCTTTCTGTTTTTCTCTTCTTTGAAAGTGTCATACAGTCTCCTTCACTTTTTTTTTTTTTTTTTTTTTCATACTGTTTCGTTGTCGTTTTTAGAATGATTCTTGTAGCCTGGGAATTAGCAATTATTTGCCATTTTCATCCACTCCTAAAGATTTGATAGATAAACCCCAATATAAAACTCCAGAGTTTTTCCATGTTCCTTGTATACCTACTCCTTGCCTTCTGGATTCTACAAAATAGAGGAGTTATACTTTTCGCAGGGAATACCCGCGATAAACTTGGAGGGATAGGTGTCCAGTGTTCTGAACTCAGAAAAGACTATGAGAGTATAGGATGAGAAAATTAGAGTTGCATCGATGAGATAATAGTAACTGAACGATCGGAAATGTATTATGATTTAAAACGATGGTTTCTAGAGGAGGGTGATTTGGATGCTGGTGTTACAAAATTGAAGATGTGGGCCGGGCATATTCTGTAAAAGATTTGAACCAGTATACTAGCGAAGAAAGCATTGGCAGGGATCAATAAGCTGCAACTAGGCCTCTATCATGAAGGAGTTTGGTGTACGTTCTTGCGGGGTTGAATGGCTGGTGTTTGAAGGGTAAACCAAGATATTTGTGGAGGTGTGCTGCTCAGGCCTTTTTGTGGGAGATTTGGCTGGAAAGGAACTGTAGAAACTTCGAAGATAAAGCTGCGAATTTTACTTCTACTTGGTGGTGTCTTAGGCTCTTTATCATTAGATAACATAGGGATCATCTCAAAACCAATTGGAAATGAGAGGAGTAGTTTATCTATCTTATTAAGTGTGTGAGGTCCCTTATTTTTTTGATGTGGGATCCTCAACTTGCCCCTCAAGATGGTGCCTCTTCGGATTCACCATTTTTTTTATCAGATCACAGTTTCTTTTTATTGGGACCGAATACCTGTTTGAGCTTTTTGGACTCTAATGCCATATTAGATAATGTGGGGTTCATCTCAAAACCAATTGGCAATGAGAGGGAGTAGCCCATTTTTCTTATTAAGAGTTTGAGATCTCACAACATTTGCAACATCATTCTTTGTATTAGTTCCTTCTCCAGATACTTCATGATTGGAAGGCTTTTCTGCTTAGGTTTTGATGGGAGGAGGGATCCCTCTACTCCCTCTAGGCTGTGATCTTTTTTTTCTTTTTCTTTTTTCTTTTTTTGGGTGCCTTTCAAACTTTTCTCAAGCATTGTAATATGTCATCGCTGTGGGACTTGCAATGTGCACTGTACTTGAAGGATTTCTGTGAAGCAAGATCAAAAGCTACTTGTTGGTAAAATTATCATTAGAAGTACATATAGTTAAGTAATCAGGAAACCGTTTCGTGGCATAACAAATTGTAATCCACAATTTAATCTATTAAAGATTCATGAGCTTCAAGAGAAAGTGGATTATTGGAATTTGCAAATTCCACTTAACGAATTTTGTCGTTCCCCAAGGATTGAAAAACCTAAATGGTGATAAATCTCTTTATTATTGAATTGAAACATAATGGTTATGGGCAACTGCTGAGCAATATTATTTCAATTATTGATCAGTAAAGGGTTTTTCTTTCTTTTTTTTTTTT
TCTTTTTTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATATTTTTGAGTAATTTTGATTAGTCAATTCTGCATAATTAAAGACAAGCTTTTAATCACAAAGAATGTGAAGTTTGTATTGTAATAGATAAGGCATGATTAATATTGGAGGATTTTGGTTTATTTGGAGTTCATGTAGTTGTTATCAGTATCTATCAATTATTGTATTTTGGGTTATTATTGTTATTATCATTATTATTATTTTTAAATGTCAAAAAAAAAAAAAAAAGAACTCCATGAAATTAATTTACTATTTGAGTCCACAGCTTTGAAACAAGTTAAATATTGGCTTATTATTTATTTCTACTTGGAAAATGAATGCAAAAGTATATTGTTGGGTATGCGCAAGCTAGCATGGATGCTCACAGATATATACATTTATAAAGTATATAAAAGCCTGATTTGAAGTGTAAATTGCCTTTCTGAAGTTTCCAGAAATATCCTTTTTTTTGTTCTCCCTTTCTCTCTCTCGAATGTTGATGCAGATTTTATGTGGTCATGAAGCATCCAGTGGTAGAATTGTTCTGATTACTATTTCTTTTGCTTTTCAGGGGCGTATGAAGTTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTCTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGGTTTGTCCTTTTGCTTTTCCTTTTGTCCTCCTTTTTTGGCAACTTCATTTCTCTTCATTTTCATCAAGTGCTTCAATGAAAGTGTCCCCTCAACTTGTTAGATCTGTAGCACAATAAATGATAGTTGTAGCTGGTTTTATCAATATGATGTGAATGCTTCTTTATGTACTACTTTTAATTTTTTTTCTTTTCCAGTTAAGCCGTCATTCTCTGTGATTTTTCTTTCAAAGATAAGGCAAAAATTCTCTGGTGCAATGCAAACTGGGCTCTCCTTTGGGGAGCTTGGTTAAAGGAATAAAAGTAAAGAACCTTTCATGGGAAAGATCAAAATTTTCAAGGACTTTCAGAAAGTTATTGTTTGTGGCCTCCCCTTGGAGTACCTTATTTAGTTCTTTATGTATCTACACTTCTATTCATTCTATTATTGCCATTGGAGACTGGAGAGTGTTTTAACTTCTTGGCTTTTCTGAGTCTTTTCTCACATGTAACTTTTTCATGTCAATAAAATGCATTTTTTCTCTTTTTTCTTTTTGTTTAAAAAAATGTTTATCCTTTTAGTTTGTTAGCACTCTTGTTTAGACCTAATGAATAAGCTTTTCATTTGGAATAGTTTTGTTTTTAGATAGTTTCTTAATACAGATAAGATTGTGGGGTGCTTTTTTTCCCCTCCCTTTTAGTAAGGAGGTTCTTAACCTTTATATAACTTTGCCGTGAATTCACTAATGCATCAGATTAATCAGCTGACTTTAGTTTCCCTTAAAACCCTGGTTAATCTCAATGCGGAAATTCTTATCTTAATCTTGTTATAAAAGTCAATACAGACGGTATCACAGTGAACAAACAAAGTAGAACTGCAATGAAGCACGGTGCCTAAAATGCTGATGCTCAAGGTGCAAGACTTCTGTTAGTATTAGGTCTTTTGTCCAAGCAGACAAGTAGTCAAGCATTGGCCACAGCTTGTGAACATCTAGCAAGATCTCTTCTTAAAATGATAGGTTTGCCCGATAAAAGTGTGGCAACGACAATCATACCCCACCAGAAAAAGACAAAAATGGAAAAAAACAACAGTTTGAGTTATCCTGGTGTTTTGGTTAAAGAGGCCTCGTGTAGATACTATATGGTGTCATTTTCACCCTGTTGGTATTTACTATCTTCAGATTAAGAGCATTTGTTCAGATTGACGTTCTTAATTTAATCATTGTACTGTCCTAAACACTTCTTTCTTGGTATTTAAG
AAATATGCCCTCGAAGCATGTACATATGATTCAACGATTCTTTTCTCCGACTGCAGATTGTCTTTTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTTGCCTTGATTTTCCTAAAGAATTGACTACGGTAAGTTTCTTGATAATTTTTAGGTCCTTGAAATGCATTTTTTTTTTTGTTTTCTTGAGTATCCCTATATTTCCATGTAAAATTTCTGCAAGCATCTCATTTCTCAAGGTCATTTATGGTGCAAAATAATGGAAATGCAGCTCTTATGAGAGCCACAGACTAATTTATACTGTGAAAGGACTATAATTTTTGTTGTCTAAACTTCTTAATTTTCCTCGTGTGATCATTTAGCAAAAAGAGACGTAACTTAACTTGGCAGTAATGTTACTTCTTACTTTTTTAAATAGTCGTTTAGTTATTTTTAATTTGGAACCCTGCTTGGGATCTTCATGATGCTGATTTTATTACTACACGAAGCCTACTACTACTGATTCTAACTTTTCTATTATGTTTCTATAATGAAGTTTTCCGTTGGGCTGGTCGATATCATTGCTCTAGGTTACTGTTTGATCCATATTTGTGTACTTTTTCCTGTTCCGTGGATTTCCCTTTCTTATTATATTTTTCTCCTTTAGTACACTCTTTTACCACTTCTAATTAGCTAGAATAGTTGTATAAATTCTAGTGCCGTATCTTTCTTGGGGGGCTTATTTATGAAGGATTCTATATGGTTGTTATTTTGGGGCTGCACTGTTCTATTTGGGGCTGGTGCTGTTTTTCCTGTCTTTCTGTTGTACTTACATTTTCCGAATTGAAGCATCTTGAGAGAGAGAGAGAGAGAGCGTTTGCTAACTCCTAGCCGTTATGTGTTAGGGAAAATGTGGAGAATATGACTTTAACAGTGGTGCTGGTGTTGCTAGTACGAGTGGTGGTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAAGTATTCAAGTATTACCCTTTGCATATACTGCTATTATGTAATTATTATCTTATTATCAAAAAGGCAGAAAAAAATAGGGTTTCTTGTCAAATAATATTTTGCATCAAATGTCTTATTTCATCGGCAATCTCCATACTACGATTGGTTGAAGATTTTACTGCATGACCATAATTATAGACACATATTTGATACTTGATTTATGATCTTAAAAGTTAGAACTTACAAAATCTGTGTTTTTCTAAGAGTTCACATGTGTTTAGCTTGTATTGGGTCTTCGTAGGTGGCCTTTGTTTCAAAGAGAGCATTCTGTGAGTTTGTGGTGTTTACTGTTTTCTGAGAGGTTAGGTTGGGAAGGAATCGTCTTCAACAAATGGAGAACTCTTGAAAGGAAGTGTGGGATCTTATTATTTCGTAGCCACCTAGAAGGCTTCTGATGTGTCCTTAAGTTGAGATTGAGCTTGGAAATTAAGAAACTAGCGATTGAAACGAATTACCATATCAAAGGAAAGTACAAAAGGTTTACCAAGCATTTGAGAGGCTTACATTTCTCTCGATACTATTTCCAGAATATGTCTATCCCCCACTTCTCCCAATCTATATTATTTATAAACACTATTCCCTAAAAAAAGCTCATTAACCAATTATTAACATACAATTACTAATTTTCCTAACAGCTTCTTTGTTTAAATATTTTTCCAATTATTTATTTTCTACGATCTTAGGTAATCGGGAAGCCTTTTTGTAACTTTTACTTTCAGGAGGCTTTTGCTTTGGGTTGTGAATTTAGTTTATGAATGAAATTAGTTTCCGTAAAAAGAAATGTCCTCTTAAGTTCTTCACATTCTTGGAGAAAGTGCCACCAGTTTGTACCAT
TACGGTATTGTTATATCTGTATTACGGTCTTCTATTATTCCTAAACATAGTAGGATCAATTTACCGTATGGAAATTGGCGGAAGCAAAGTAGATGTTGGGAAGGCTATCAGTAGAAATACCATCTAGATTGACGGCATTGCCCATCTCAAGTAATGACAATAGACTGTAAGTAGCGATATTTTGAATTAATGAAACCAGCACCGCATTAGTCCTGGATGTTTCTCTAGTACAACTACCTTAAGCCCATCCAAAATTCCAGATTCTTTCACTTGATCAGCTTGGCCGTGCTAGTTGCTATACAGCACATTGGTTTGTATTTATAATCCTAATAAGGCTGTACAAATATGAAACAACCAACGTAACGTTGAGTCCATCTTTTGTCATGTTGGTAGACAAATAGTCTTTCAACAATTTCAGATCAAATGTGAGTTAATTGCAACTTCTTTATTCTCATATATTACTATTCACGTTGAGGCTTATTTCAGTTAATGAATGTTTTTTTTTCGTGATCTGTCTAGCAAAATGATTTTATTCTTTAATTTATTTCCTATACTTGTTAAGTTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCTGAAGGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAGTGAAAATATCCTGTTTATGATTCTTGAATAGTTTGTTTTAATTCATTCTCTGCAGCTTTCTATTATTGTTGATTCAGTCTTTAAATTTATCTGCGGCTGTATGCTCGCTCCCTTAATTGATTGCTACAGACAGAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAAATTAGACCTCCTGTTCTTGAAGGAAATCAGACATCAATTTCTAAGGGGGAAAAAAGTTCTCGGGCTACAGGAAATGCTCAGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGGTAATATCTGAAGACCTTTACTTAATATGTAGAATCAGGTTGCTGGTATGGAATCCATGTACTGCTACATTTGTGGGTTTATTAAAACTACTAATTATTTTTCTTGATGCTAAATGAAAAGGAAAGGATCATTAAAGCCCAATGAAAGTACAAGCAAGAGCTGTTAGAACCTAGCTATCGAGGAACCAAAACAAAAATACTCCCCCTCAAACTATAAAACACTTGGTGAAGCAATGGAAAGGCATTTACCTTTCCCTTGGTTGTTAACAATGATAATAATGTTTTAAAAAGTCAGCACAAAAGTGATCGTAATTCCAGAGCAAAGTATTAAAATGTGTTAGTTACTTACCTCTAAACCCTCTGTAGTCTTATCCTTAAATCCTCCTATTACATTCCAAAAATGAACTGCTTCATGCCAAAGAATCCTCCCTCTCCCCATGACTGGAAGATTACTAAGCATTTCCTCCATACTGCTCCAATTAAGGAATGGTCATCTATCGTTGGAAGCAGAAGAAGGCCTCCTTCCCAGAGAACTGAGCGAAGATATGAGCCTTGTGCATTACAGCTTTAAAGATTATGAAAATTTTAATAAATGACGTGATGGATTCCTTACCTGATAAATTTTTCAGTGGAGGACAGATTATGAGATAACATTCTTTTAGAGCACACAATAAGCATAAAACAAGAAACTCTTAAAAAGAAAAGACCATCCAAGATTTTCCGAAAACATGATTGTTTCTTTCCATCCAAATGACCAACAAAGTACTGAAACTGCAAATTCCAGAAAGCTCTCGATCTCGTGCTAGTAACTCCTCCAAAATTGTCTTATCCGAGTAATGAACTTTATATTTTTAAGGCCTACATATATTTGAAATGAATGCTTACTGTTTTCAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACACCCAAAGGTTCTAATATTTTTCATTGATTTTATCACGATGGTCACGAGAATACTACAGTAACATCAAATCTGCAAAATTGCT
TTCCTACAGTTTCTGCCCAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGATTGACCAGGTGACAGATCTTCATTTTTAAATAGCACCTTTTCTCCTCGTAATGACTGTAATGGATTGTCATGTATTTTCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCGGGGACAAGATGATGGTCAATCTTTCATGCTCTCTCCATTTACTACTGCTAGTTATTTCTCTAACAGGCTTAGATATGACAATATTGGCTTACTAAGAATTTGTCACCTCTCTTTTACGTTGGGTTATCATGGAAATTCTAATAGGATTGCTCAAGAAATTCTATTAGATCTACTACAACTCTTGTAGTAGATCTAATAGAATATTGATTCTCATTTCCACAAGATAGATGTGCATTTTATTTGGACTAATTTTCTGTGCTCTATATTTGTAGTCAGTTTATGCCAATTTGAGTTCTGGAATTACCTAGTGTGTAGCTTTATAGTCGCCTTCAGTCGACTGTGTTTATATGGTATGTTTACATGGTCGGTTGATGCATATTAGCTTGAATTTGCAGGAGAAGTCCAGAAGAAGGATACAGAATATGAGGTACTTGTGCATTTTTACAAATTGTCCTTTCAATAGATTTGTTGCTTTATTAGCTTCAGAATTTCTTATGTGCACTAATCCAACTTGGAGTGTACATGCTTCCCTAAGTTATTATCTAACCGGACAAGTTGCCATTATAACTGCAATAACCAGAAGTATCTTAATAGCAATCAGCAGAGGGCATGTTTGGAATAACAAGGGCTTAAAAATGTGTTTTTGAACACTTGAAAGTCATTCCAAAAGACTCTTAGCTCCTGGTGAAAAGGTTTCATCACAAATAATTCTTTTTGCTTCTCAAGTATCAAAATTTTACGTTTTTTTTCTTTTAATTAAGATTTTATTATTTATTTTTATCAATGAAATCTTACGTGTTAACACACGAGGTTTCTAAAATGATCAGCGTTAGTTTTTGCTTGGGGATACTAACAAAAAGCCTTAATTAATTTTATGTTGTTAATTATAAAAGGGCACCTGGCATCCTTCAAACAACTTCAACCAGCCATTCCTTTTCATGTCTTATTGCCTTATCATGTTACAAGTGAGGGTTTTTAGAATTTAGTTTAAAATGTTATTGGTTTCTGTATTTTGGGTCTCATCCAAATTTAGTACTTGTACTTTCAAATGAAATTCCAAATTTAGTAGTTATGTACCCTCAAATGTTTTAATAAATATTAAAATTAGTTATTATTGTTAGTTTGAAGTTGATTTGTATTGAAATTGGCTATATAATAATAATGATAATAATATCTTTGCAAGGAAGGAGACCATAGTATCTTTGCAAAGAAGGAGGCCATGTGAATATGTTTCCAAAATTTATAGAGGAAAGACTAATAACTAACAGTAGGACTAATTTTAAGATTTATTAGACATTTCTTAGTAGGAAGATTAAAATGGAATTGGACCTGAAATACAAGATCAAATAGTATTATAACCTAAAATTTAATATGTTCTGAACTTGCAACTCCATGTAGTCTATTTCCATTTACTACTCACAAATCACGATGGATTGCTGAATGAAATGGTTGACGGTACATGTCATTGTATGTTAAGTACGTTTTATCTATATCTTGTAGGTTGAAGATGAGATTGAAGAATCGTCAAGTTCTCAAGAGGTGAGTGTTCACTGCCTATTTTATAAACCTAAATACTCCCATGATCTTTTAGAATATTGGACTGGAAGTTAGTGGGCCCTAGGTTTGTACTCATTTTGGAGTCTGATGGAAATATTTGTCTGAACTACTCTTTTGATGTTACGTATAGTTAATTTCTCGCTTGGCTTAGTCCCTACCCTTTATAAACTTCTCACGTCGTGCGTGCCTCACACCATGTTTGTAGGAGAGTTGTCATTTGTATACCGTAATTATTTATTTCATGTAAAGTTAAGAACAAAACAGGTTCTCCATTTGCCT
AATTGTAGTTCTTGATCATGATCACCAAAAAGCGTTCACATTGTCATAAACTTTTCACATGCAGGTGATTGAATTAAAATGTTTGAATAATACAGCAATTAAACTTTAAGTGGGAGTGAAAAGATGAAATCAAAGTGAAATGATCATTTAGGGACTAGATGAATAGATTAACTAAAACATATTCCTTTAGTTTCTTTCATAAATATTGGTTAGGAGCTTGCACATTCTTGCTCATGATTTTGTGTCATCTAGGACACTGATGAAGATTGGACAAGTTGAGGTTATTACATTCTACCAATGCTAAGCCACAGCAGCAGCTAGGATCGCATTGCAGGGGCTATATTCAGAATGCTATGCTCTACCAGTTTCTTTGATGCTGCTGCCTAATTGAAATCAAAGAGATTTAACCATAAAATGATATTGAAGTTAAGCTTTTATCCTAAAATTAGGCAATGTAAAATTATGTTTTGATATCAGAGAGATCTACTCTTACACTTCTTAGGGACCAGCCCAGATTTTCAGCTTGAAGGTAAAAATTATTCACAAGATTTTATGCAGCCTATAAACCATGGAGTGTCCATGCCTTTGGATAATATGGTCCAATTAGTTACTATTGTTTCTAGACAATATGGTAACTAATCTAAGTAT

mRNA sequence

AAAAGAGTCAAGGATCCCCAAGAGTGAACGGCAGTCCGTAGCTGACTATCGGCATCAGAAATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATCCGGAGATTGCAGCACGAAAGCGGCTTAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAGTTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTCTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTTGCCTTGATTTTCCTAAAGAATTGACTACGGGAAAATGTGGAGAATATGACTTTAACAGTGGTGCTGGTGTTGCTAGTACGAGTGGTGGTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAAGTGGCCTTTGTTTCAAAGAGAGCATTCTAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCTGAAGGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAAATTAGACCTCCTGTTCTTGAAGGAAATCAGACATCAATTTCTAAGGGGGAAAAAAGTTCTCGGGCTACAGGAAATGCTCAGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACACCCAAAGTTTCTGCCCAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGATTGACCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCGGGGACAAGATGATGGAGAAGTCCAGAAGAAGGATACAGAATATGAGGTTGAAGATGAGATTGAAGAATCGTCAAGTTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGAGGTTATTACATTCTACCAATGCTAAGCCACAGCAGCAGCTAGGATCGCATTGCAGGGGCTATATTCAGAATGCTATGCTCTACCAGTTTCTTTGATGCTGCTGCCTAATTGAAATCAAAGAGATTTAACCATAAAATGATATTGAAGTTAAGCTTTTATCCTAAAATTAGGCAATGTAAAATTATGTTTTGATATCAGAGAGATCTACTCTTACACTTCTTAGGGACCAGCCCAGATTTTCAGCTTGAAGGTAAAAATTATTCACAAGATTTTATGCAGCCTATAAACCATGGAGTGTCCATGCCTTTGGATAATATGGTCCAATTAGTTACTATTGTTTCTAGACAATATGGTAACTAATCTAAGTAT

Coding sequence (CDS)

ATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATCCGGAGATTGCAGCACGAAAGCGGCTTAAGAAGCTCGCATTCTCCAATCACATACTTTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAGTTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTCTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTTGCCTTGATTTTCCTAAAGAATTGACTACGGGAAAATGTGGAGAATATGACTTTAACAGTGGTGCTGGTGTTGCTAGTACGAGTGGTGGTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAAGTGGCCTTTGTTTCAAAGAGAGCATTCTAG

Protein sequence

MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKVAFVSKRAF
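
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and uses only the first 30 and last 9 bases of the CDS to stay short; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
# Standard genetic code, built from the conventional TCAG-ordered amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Raises KeyError on codons containing symbols other than A/C/G/T.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 9 bases of the CDS listed above.
CDS_START = "ATGGCGCGAGGATCATCGTCTTCAAAGAGG"
CDS_END = "GCATTCTAG"

print(translate(CDS_START))  # matches the first ten residues of the protein
print(translate(CDS_END))    # matches the last two residues, then the stop
```

Passing the full CDS string to `translate` should reproduce the complete protein sequence if the annotation is internally consistent.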
BLAST of ClCG01G014820 vs. Swiss-Prot
Match: RHL1_ARATH (DNA-binding protein RHL1 OS=Arabidopsis thaliana GN=RHL1 PE=1 SV=1)

HSP 1 Score: 274.6 bits (701), Expect = 1.1e-72
Identity = 146/257 (56.81%), Positives = 179/257 (69.65%), Query Frame = 1

Query: 5   SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDI 64
           +SSSK+  +KG  + D E   RKRLK LA  N +LS++ AK  + L PS  VLKHHG DI
Sbjct: 4   ASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHGTDI 63

Query: 65  VKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNK 124
           ++KSQRKNRFLFSF GLLAP+S   IG+L  L TKNP+LYL+FPQGRMKLFGTI+YPKN+
Sbjct: 64  IRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYPKNR 123

Query: 125 YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFN 184
           YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEA LDFPKEL   +  E+DF 
Sbjct: 124 YLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQAENTEFDFQ 183

Query: 185 SGAG-------VASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENV--T 244
            GAG       +AS   G+  T +    V N+ +      S  GE  DD + +   V  T
Sbjct: 184 GGAGGAASVKKLASPEIGSQPTETDSPEVDNEDV-----LSEDGEFLDDKIQVTPPVQLT 243

Query: 245 NSIKTTPVRHSERSAGK 251
             ++ TPVR S+R++GK
Sbjct: 244 PPVQVTPVRQSQRNSGK 255
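
The identity and positives percentages in an HSP header are simple ratios over the alignment length (which includes gap columns). A minimal sketch, using the counts reported for the RHL1_ARATH hit; the function name `percent` is a hypothetical helper:

```python
def percent(count: int, alignment_length: int) -> float:
    """Percentage of alignment columns, rounded to two decimals as in the report."""
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the Swiss-Prot HSP header (RHL1_ARATH).
ALIGNMENT_LENGTH = 257
IDENTICAL = 146
POSITIVE = 179

identity_pct = percent(IDENTICAL, ALIGNMENT_LENGTH)
positives_pct = percent(POSITIVE, ALIGNMENT_LENGTH)
print(f"Identity = {identity_pct}%, Positives = {positives_pct}%")
```

The same counts over a longer alignment give a lower percentage, which is why the TAIR10 hit AT1G48380.2 reports 146/288 (50.69%) for what looks like the same set of identical residues.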

BLAST of ClCG01G014820 vs. TrEMBL
Match: A0A0A0LVZ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043140 PE=4 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 1.5e-124
Identity = 226/251 (90.04%), Positives = 236/251 (94.02%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
           MARGSSS K+DEAKGEI+PEIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1   MARGSSS-KKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60

Query: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
           DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120

Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYD 180
           N+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPK+LT G+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180

Query: 181 FNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
           FN GAGV STSG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTP
Sbjct: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240

Query: 241 VRHSERSAGKV 252
           VRHSERSA KV
Sbjct: 241 VRHSERSARKV 250

BLAST of ClCG01G014820 vs. TrEMBL
Match: F6I6C9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g00750 PE=4 SV=1)

HSP 1 Score: 320.5 bits (820), Expect = 1.9e-84
Identity = 162/245 (66.12%), Positives = 193/245 (78.78%), Query Frame = 1

Query: 8   SKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKK 67
           SK++E  G  E++PE   RKR KKLAFS ++LS+T +K  + LSPS TV+KHHGKDI+KK
Sbjct: 5   SKKNENGGVSELNPEAEERKRRKKLAFSKNLLSDTPSKAFSALSPSKTVIKHHGKDILKK 64

Query: 68  SQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNKYLT 127
           SQRKNRFLFSF GLLAP++GGKIGELKDLGTKNPILYLDFPQG+MKLFGTI+YPKN+YLT
Sbjct: 65  SQRKNRFLFSFPGLLAPIAGGKIGELKDLGTKNPILYLDFPQGQMKLFGTIVYPKNRYLT 124

Query: 128 LQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGA 187
           L FSRGGKNVMCEDYFDNMIVFSDAWWIG K+ENPEEA L+FPKEL+ G+  EYDF  GA
Sbjct: 125 LHFSRGGKNVMCEDYFDNMIVFSDAWWIGRKEENPEEARLEFPKELSEGQSVEYDFKGGA 184

Query: 188 GVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSE 247
           G+AS S   GV     + V+ +   P +E+   GE  D L D+ E     ++ TPVRHS+
Sbjct: 185 GMASDS-KQGVNKPEMKYVEPQSPKPELEDDLSGE--DSLKDVVEMTPKDVEVTPVRHSQ 244

Query: 248 RSAGK 251
           R+AGK
Sbjct: 245 RTAGK 246

BLAST of ClCG01G014820 vs. TrEMBL
Match: A0A061EYH8_THECC (Root hair initiation protein root hairless 1, putative isoform 3 OS=Theobroma cacao GN=TCM_025225 PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 1.5e-81
Identity = 163/252 (64.68%), Positives = 186/252 (73.81%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAY--LSPSATVLKHH 60
           M R SSS K   A+    PE   RKRLKKLA  N++LS+T A P++Y  LSPS  V+KHH
Sbjct: 1   MVRTSSSKKPPIAE---TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHH 60

Query: 61  GKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMY 120
           GKDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+LG+KNPILYLDFPQG+MKLFGTI+Y
Sbjct: 61  GKDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVY 120

Query: 121 PKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGE 180
           PKN+YLTL FSRGGKNVMCEDYFDNMIVFSDAWWIG KDENPEEA LDFPKEL  G+  E
Sbjct: 121 PKNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQME 180

Query: 181 YDFNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKT 240
           YDF          GGAGV S +KQ      I      S   E GD L D + ++T  ++ 
Sbjct: 181 YDF---------KGGAGVESVNKQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEV 240

Query: 241 TPVRHSERSAGK 251
           TP RHS R+AGK
Sbjct: 241 TPTRHSARNAGK 240

BLAST of ClCG01G014820 vs. TrEMBL
Match: A0A061EZJ8_THECC (Root hair initiation protein root hairless 1, putative isoform 4 OS=Theobroma cacao GN=TCM_025225 PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 1.5e-81
Identity = 163/252 (64.68%), Positives = 186/252 (73.81%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAY--LSPSATVLKHH 60
           M R SSS K   A+    PE   RKRLKKLA  N++LS+T A P++Y  LSPS  V+KHH
Sbjct: 1   MVRTSSSKKPPIAE---TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHH 60

Query: 61  GKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMY 120
           GKDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+LG+KNPILYLDFPQG+MKLFGTI+Y
Sbjct: 61  GKDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVY 120

Query: 121 PKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGE 180
           PKN+YLTL FSRGGKNVMCEDYFDNMIVFSDAWWIG KDENPEEA LDFPKEL  G+  E
Sbjct: 121 PKNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQME 180

Query: 181 YDFNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKT 240
           YDF          GGAGV S +KQ      I      S   E GD L D + ++T  ++ 
Sbjct: 181 YDF---------KGGAGVESVNKQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEV 240

Query: 241 TPVRHSERSAGK 251
           TP RHS R+AGK
Sbjct: 241 TPTRHSARNAGK 240

BLAST of ClCG01G014820 vs. TrEMBL
Match: A0A061EXP0_THECC (Root hair initiation protein root hairless 1, putative isoform 2 OS=Theobroma cacao GN=TCM_025225 PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 1.5e-81
Identity = 163/252 (64.68%), Positives = 186/252 (73.81%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAY--LSPSATVLKHH 60
           M R SSS K   A+    PE   RKRLKKLA  N++LS+T A P++Y  LSPS  V+KHH
Sbjct: 1   MVRTSSSKKPPIAE---TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHH 60

Query: 61  GKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMY 120
           GKDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+LG+KNPILYLDFPQG+MKLFGTI+Y
Sbjct: 61  GKDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVY 120

Query: 121 PKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGE 180
           PKN+YLTL FSRGGKNVMCEDYFDNMIVFSDAWWIG KDENPEEA LDFPKEL  G+  E
Sbjct: 121 PKNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQME 180

Query: 181 YDFNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKT 240
           YDF          GGAGV S +KQ      I      S   E GD L D + ++T  ++ 
Sbjct: 181 YDF---------KGGAGVESVNKQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEV 240

Query: 241 TPVRHSERSAGK 251
           TP RHS R+AGK
Sbjct: 241 TPTRHSARNAGK 240

BLAST of ClCG01G014820 vs. TAIR10
Match: AT1G48380.2 (AT1G48380.2 root hair initiation protein root hairless 1 (RHL1))

HSP 1 Score: 258.5 bits (659), Expect = 4.5e-69
Identity = 146/288 (50.69%), Positives = 179/288 (62.15%), Query Frame = 1

Query: 5   SSSSKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDI 64
           +SSSK+  +KG  + D E   RKRLK LA  N +LS++ AK  + L PS  VLKHHG DI
Sbjct: 4   ASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHGTDI 63

Query: 65  VKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNK 124
           ++KSQRKNRFLFSF GLLAP+S   IG+L  L TKNP+LYL+FPQGRMKLFGTI+YPKN+
Sbjct: 64  IRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYPKNR 123

Query: 125 YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELT---------- 184
           YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEA LDFPKEL           
Sbjct: 124 YLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQVDTFHLFLH 183

Query: 185 ---------------------TGKCGEYDFNSGAG-------VASTSGGAGVTSSSKQSV 244
                                  +  E+DF  GAG       +AS   G+  T +    V
Sbjct: 184 FLFKTMVATEMFNMIRRILWFQAENTEFDFQGGAGGAASVKKLASPEIGSQPTETDSPEV 243

Query: 245 KNKGINPAVENSFKGEDGDDLLDLEENV--TNSIKTTPVRHSERSAGK 251
            N+ +      S  GE  DD + +   V  T  ++ TPVR S+R++GK
Sbjct: 244 DNEDV-----LSEDGEFLDDKIQVTPPVQLTPPVQVTPVRQSQRNSGK 286

BLAST of ClCG01G014820 vs. NCBI nr
Match: gi|449439513|ref|XP_004137530.1| (PREDICTED: DNA-binding protein RHL1 [Cucumis sativus])

HSP 1 Score: 453.8 bits (1166), Expect = 2.1e-124
Identity = 226/251 (90.04%), Positives = 236/251 (94.02%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
           MARGSSS K+DEAKGEI+PEIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1   MARGSSS-KKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60

Query: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
           DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120

Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYD 180
           N+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPK+LT G+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180

Query: 181 FNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
           FN GAGV STSG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTP
Sbjct: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240

Query: 241 VRHSERSAGKV 252
           VRHSERSA KV
Sbjct: 241 VRHSERSARKV 250

BLAST of ClCG01G014820 vs. NCBI nr
Match: gi|659066963|ref|XP_008467323.1| (PREDICTED: DNA-binding protein RHL1 [Cucumis melo])

HSP 1 Score: 436.8 bits (1122), Expect = 2.6e-119
Identity = 217/251 (86.45%), Positives = 229/251 (91.24%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
           MARGSSSSK+DEAKGEI+PEI  RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGK
Sbjct: 1   MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60

Query: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
           DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120

Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYD 180
           N+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEACLDFPKELT G+CGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180

Query: 181 FNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
           FN         GGAGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT P
Sbjct: 181 FN---------GGAGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240

Query: 241 VRHSERSAGKV 252
           VRHSERSA KV
Sbjct: 241 VRHSERSARKV 242

BLAST of ClCG01G014820 vs. NCBI nr
Match: gi|225454536|ref|XP_002281960.1| (PREDICTED: DNA-binding protein RHL1 [Vitis vinifera])

HSP 1 Score: 320.5 bits (820), Expect = 2.8e-84
Identity = 162/245 (66.12%), Positives = 193/245 (78.78%), Query Frame = 1

Query: 8   SKRDEAKG--EIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKK 67
           SK++E  G  E++PE   RKR KKLAFS ++LS+T +K  + LSPS TV+KHHGKDI+KK
Sbjct: 5   SKKNENGGVSELNPEAEERKRRKKLAFSKNLLSDTPSKAFSALSPSKTVIKHHGKDILKK 64

Query: 68  SQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNKYLT 127
           SQRKNRFLFSF GLLAP++GGKIGELKDLGTKNPILYLDFPQG+MKLFGTI+YPKN+YLT
Sbjct: 65  SQRKNRFLFSFPGLLAPIAGGKIGELKDLGTKNPILYLDFPQGQMKLFGTIVYPKNRYLT 124

Query: 128 LQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYDFNSGA 187
           L FSRGGKNVMCEDYFDNMIVFSDAWWIG K+ENPEEA L+FPKEL+ G+  EYDF  GA
Sbjct: 125 LHFSRGGKNVMCEDYFDNMIVFSDAWWIGRKEENPEEARLEFPKELSEGQSVEYDFKGGA 184

Query: 188 GVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSE 247
           G+AS S   GV     + V+ +   P +E+   GE  D L D+ E     ++ TPVRHS+
Sbjct: 185 GMASDS-KQGVNKPEMKYVEPQSPKPELEDDLSGE--DSLKDVVEMTPKDVEVTPVRHSQ 244

Query: 248 RSAGK 251
           R+AGK
Sbjct: 245 RTAGK 246

BLAST of ClCG01G014820 vs. NCBI nr
Match: gi|645270176|ref|XP_008240338.1| (PREDICTED: DNA-binding protein RHL1 [Prunus mume])

HSP 1 Score: 315.5 bits (807), Expect = 8.9e-83
Identity = 163/250 (65.20%), Positives = 196/250 (78.40%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
           MAR SSS K+ + + + +PE+  RKRLK LAFSN++LSE  AKP A L+PS TV+KHHGK
Sbjct: 1   MARTSSSKKKRKDEEDPNPEVTQRKRLKALAFSNNLLSEVPAKPHAPLTPSNTVVKHHGK 60

Query: 61  DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
           DI+KKSQRKNRFLFSF GLLAP+ GGKIGELKDLGTKNP+LYLDFPQGRMKLFGTI++PK
Sbjct: 61  DILKKSQRKNRFLFSFPGLLAPIGGGKIGELKDLGTKNPVLYLDFPQGRMKLFGTIVFPK 120

Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGEYD 180
           N+YLT+QF RGGK+VMCEDYFDNMIVFSDAWWIGT+ ENP+EA LDFPKELT G+  EYD
Sbjct: 121 NRYLTMQFPRGGKSVMCEDYFDNMIVFSDAWWIGTQAENPKEAQLDFPKELTEGQHAEYD 180

Query: 181 FNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
           F          GGAG TS++KQS  +K     VE+S   +  D++ D  +   + ++ TP
Sbjct: 181 F---------KGGAGSTSANKQS-DHKNETTYVEHSPNVKVEDNVSD--DGNKDLMRATP 238

Query: 241 VRHSERSAGK 251
           VRHS R+AGK
Sbjct: 241 VRHSARTAGK 238

BLAST of ClCG01G014820 vs. NCBI nr
Match: gi|590638291|ref|XP_007029352.1| (Root hair initiation protein root hairless 1, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 310.8 bits (795), Expect = 2.2e-81
Identity = 163/252 (64.68%), Positives = 186/252 (73.81%), Query Frame = 1

Query: 1   MARGSSSSKRDEAKGEIDPEIAARKRLKKLAFSNHILSETQAKPQAY--LSPSATVLKHH 60
           M R SSS K   A+    PE   RKRLKKLA  N++LS+T A P++Y  LSPS  V+KHH
Sbjct: 1   MVRTSSSKKPPIAE---TPEATERKRLKKLALKNNLLSDTPATPKSYVPLSPSKLVMKHH 60

Query: 61  GKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMY 120
           GKDI++KSQRKNRFLFSF GLLAP+SGGKIGELK+LG+KNPILYLDFPQG+MKLFGTI+Y
Sbjct: 61  GKDILRKSQRKNRFLFSFPGLLAPISGGKIGELKNLGSKNPILYLDFPQGQMKLFGTIVY 120

Query: 121 PKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEACLDFPKELTTGKCGE 180
           PKN+YLTL FSRGGKNVMCEDYFDNMIVFSDAWWIG KDENPEEA LDFPKEL  G+  E
Sbjct: 121 PKNRYLTLLFSRGGKNVMCEDYFDNMIVFSDAWWIGKKDENPEEARLDFPKELCQGQQME 180

Query: 181 YDFNSGAGVASTSGGAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKT 240
           YDF          GGAGV S +KQ      I      S   E GD L D + ++T  ++ 
Sbjct: 181 YDF---------KGGAGVESVNKQDTPRTEIKQVEIESLDNESGDALSDDDNDLTAKMEV 240

Query: 241 TPVRHSERSAGK 251
           TP RHS R+AGK
Sbjct: 241 TPTRHSARNAGK 240

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value    Identity (%)    Description
RHL1_ARATH    1.1e-72    56.81    DNA-binding protein RHL1 OS=Arabidopsis thaliana GN=RHL1 PE=1 SV=1

TrEMBL
Match Name    E-value    Identity (%)    Description
A0A0A0LVZ6_CUCSA    1.5e-124    90.04    Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043140 PE=4 SV=1
F6I6C9_VITVI    1.9e-84    66.12    Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g00750 PE=4 SV=...
A0A061EYH8_THECC    1.5e-81    64.68    Root hair initiation protein root hairless 1, putative isoform 3 OS=Theobroma ca...
A0A061EZJ8_THECC    1.5e-81    64.68    Root hair initiation protein root hairless 1, putative isoform 4 OS=Theobroma ca...
A0A061EXP0_THECC    1.5e-81    64.68    Root hair initiation protein root hairless 1, putative isoform 2 OS=Theobroma ca...

TAIR10
Match Name    E-value    Identity (%)    Description
AT1G48380.2    4.5e-69    50.69    root hair initiation protein root hairless 1 (RHL1)

NCBI nr
Match Name    E-value    Identity (%)    Description
gi|449439513|ref|XP_004137530.1|    2.1e-124    90.04    PREDICTED: DNA-binding protein RHL1 [Cucumis sativus]
gi|659066963|ref|XP_008467323.1|    2.6e-119    86.45    PREDICTED: DNA-binding protein RHL1 [Cucumis melo]
gi|225454536|ref|XP_002281960.1|    2.8e-84    66.12    PREDICTED: DNA-binding protein RHL1 [Vitis vinifera]
gi|645270176|ref|XP_008240338.1|    8.9e-83    65.20    PREDICTED: DNA-binding protein RHL1 [Prunus mume]
gi|590638291|ref|XP_007029352.1|    2.2e-81    64.68    Root hair initiation protein root hairless 1, putative isoform 1 [Theobroma caca...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0042023        DNA endoreduplication
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003677        DNA binding
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG01G014820.1    ClCG01G014820.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
None    No IPR available    PANTHER    PTHR35698    FAMILY NOT NAMED    coord: 3..250; score: 6.5