Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTTTCTGCCAAATATAATGAAATGAGAAATTTAATTAAAGGAGAAGAAAAGAGTCAAGGATCCCCAAGAGTGAACTGCAGTCCGAAGCTGACTACCAGCATCAGAAATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATTCGGAGATTGCAGCACGAAAGCGGCTTAAGAAACTCGCATTCTCCAATCACATACTTTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGTTCGTCTGTTCTTCCATGCTCTAAACGCCTTTCTGTTTTTCTCTGCTTTGAAAGTGTCATACAGTCTTCTTCACTTCTTTTTTTTTTTCATACTGTTTCGTTGTCGTTTTTAGAATGATTCTTTGTAGCCTGGGAATTAGCAATTATTTGCCATTTTCATCCACTCCTAAAGATTTGATAGATAAACCCCAATATAACACTCCAGAGTTTTTCCATGTTCCTTGTATACCTACTTCTTGCCTTCTGGATTCTACAATATAGAGGAGTTATAATTTTCGCAGGGAATACCCGCGATAAACTTGGAGGGATAGGTGTCCAGTGTTCTGAACTCAGAAAAGACTAGGAGAGTATAGGATGAGAAAATTAGAGTTGCATCTATGAGATAATAGTAACTGAACAATCGGAAATGTATTATGATTTATGAACGATGGTTTCTAGAGGAGGGTGATTTGGATGCTGGTGTTACAAAATTGAAGATGTGGGCCGGGCATATTCTGTAAAAGATTTGAACCAGTATACTAGCGGAGAAAGCATTGGCAGGGATCAATAAGCTGCAACTAGGCCTTTATCATGAAGGAGTTTGGTGTCGTTCTTGCAGGGTTGAATGGGTGGTGTTTGAAGGGTAAACCAAGATATTTGTGGAGGTGTGCTGCTCAGGCCTTTTTGTGGGAGATTTGGCTGGAAAAGAACTGTAGAAACTTCGAAGATAAAGCTGCGAATATTACTTCTACTTGGTGGTGTCTTAGGCTCTTTATTATTAGATAACATGGGGATCATCTCAAAACCAATTGGCAATGAGAGGAGTAGTTTATCTATCTTATTAAGTGTGTGAGGTCCCTTATTTTTTTGATGTGGGATCCTCAACTTGCCCCTCAAGATGGTGCCTCCGGATTCACCATTTTTTTATCAGATCACAGTTTTTTTTTATTGGGACCGAATACCCGTTTGAGCTTTTTGGACTCTAATGCCATATTAGATGATATGGGGTTCATCTCAAAACCAATTGGCAAGGAGAGGAGTAGCCCATTTTTCTTATTAAGAGTTTGAGATCTCACAACATTTGCAACATCATTCTTTGTAATAGTTCCTTCTCCAGATACTTCATGATTGGAAGGCTTTTCTTCTTAGATTTTGATGGGAGGAGGGATACCTCTACTCCCTCTAGGCTGTGATTTTTTTTTTTCTTTTTTGGGTGCCTTTCAAACTTTTCTCGAGCATTGTAATATGTCATCACTGTGGGACTTGCAATGTGCACTGCACTTGAAGGACTTCTGTGAAGCAAGATCAAAAGCTACTTGTTGGTAAAATTATCATTAGAAGTACATATAGTTAAGTAATCAGGAAACCTTTTCGTGGCATAACAAATTGTAATCCACAATTTAATCTATTAAAGATTCATGAGCTTCAAGAGAAGGTGGATTATTGGAATTTTCAAATTCAACTTAACGAATTTTGTCGTTCCCCAAGGATTGAAAAACCTAAATGGTGATAAATCTCTTTATTATTGAATTGAAACATAATGGTTATGGGCAACTGCTGAGTAATATCATTTCAATTATTGACCAGTAAAGGGTTTTTCTTTCCTTTTTTCTTTTTTTTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGGGAGGGAGAGAGAGAGAGAGAAGTATTTTTGAGTAATTTTGATTAGTTAATTCTGCATAATTAAAGACAAGCTTTCAATCACAAAGAATGTGAAGTTTGTATTGTAATAGATAAGGCATGATTAATATTGGAGGATTTTGGTTTATTTGGAGTTCATGTAGTTGTTATCAGTATCTATCAATTATTGTATTTTGGGTTATTATTGTTATTATCATTATTATTTTTTTTAAATGTCAAAAAAAAAAAAAAAAAAAAAAAAGAACTCCATGAAATTAACTTACTATTTGATTAATTATTATTTTTTGCTAAACTCATATTTCATATGATGTATCATTTCTCATAAAATGAGATGCTTTGCTTGGTATTTGAGTCCACAGGTTTGAAACAAGTTAAATATTGGCTTATTATTTATTTCTACTTGGAAAATGAATGGAAAAGTATATTGTTGGGTATGCGCAAGCTATCATGGATGCTCACAGATGTATACATTTATAAAGTATATAATAGCCTGATTTCAAGTGTAAATTGCCTTTCTGAAGTTTCCAGAAATATCCTTTTTTTGTTCTCCCTTTCTCTCTCTCGAATCCAGTGGTAGAATTGTTCTGACTACTATTTCTTTTGCTTTTCAGGGGCGTATGAAATTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTTTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGGTTTGTCCTTTTGCTTTTCCTTTTGTCCTCCTTTTTTGGCAACTTCATTTCTCTTCATTTTCATCAAGTGCTTCAATGAAAGTGTCCCCCCAACTTGTTAGATCTATAGCACAATAAATGATAGTTGTAGCCGGTTTTATCAATATGATGTGAATGCTTCTTTATGTACTACTTTTAATTTTTTTTCTTTTCCAGTTAAGCCGTCATTCTCTGTGATTTTTCTTTCAAAGATAAGGCAAAAATTCTCTGGTGCAATGCAAACTGGGCTCTCCTTTGGGGAGCTTGGTTAAAGGAATAAAAGTAAAGAACCTTTCATGGGAAAGATCAAAATTTTCAAGGACTTTCAGAAAGTTATTGTTTGTGGACTCCCCTTGGAGTACCTTATTTAGTTCTTTATGTATCTACTCTTCTATTCATTCTATTATTGCCATTGGAGACTGGAGAGTGTTTTAACTTCTTGGCTTTTCTGAGTCTTTTCTCACATGTAACTTTTTCTCGTCAATAAAATGCATTTTTTCTCTCTTTTCTTTTTGTTTAAAAAAATGTTTATCTTTTAGTTTGTTAGCACTCTTGTTTAGACCTAATGAATAAGCTTTTCATTTGGAATAGTTTTGTTTTTAGATAGTTTCTTAATACAGATAAGATTGTGGGGTGCCTTTTTTCCCCTCCCTTTTAGTAAGGAGCTTCTTAACCTTTATATAACTTTTGCATCAGATTAATCAGCTGACTTTATTTTCCCTTAAAACCCTGGTTAATCTCAATGCGGAAATTCTTATCTTAATCTTGTTATAAAAGTCAATACAGATGGACAGTGAATAAACAAAGTAGAACTGTAATGAAGCACGGTGCCTAAAATGCTGATGCTCAAGGTGCAAGACTTCTGTTAGTATTAGGTCTTTTGTCCGAGCAGACAAGTAGTCAAGCATTGGCCACAGCTTGTGAACATCTAGCAAGATCTCTTCTTAAAATGATAGGTTTGCCCGATAAAAGTGTGGCAACGACAATCATACCCCACCAGAAAAAGACAAAAATGGAAAAAAACAACAGTTTGAGTTATCCTGGTGTTTTGGTTAAAGAGGCCTCGTGTAGATACTATATGGTGTCATTTTCACCCTGTTGGTATTTACTATCTTCAGATTAAGAGCATTTGTTCAGATTGGCGTTCTTAATTTAATCATTGTACTGTCCTAAACACTTCTTTCTTGGTGTTTAAGAAATATGCCCTCGAAGCATGTACATACGATTCAACGATTCTTTTCTCCGACTGCAGATTGTCTTTTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTCGCCTTGATTTTCCTAAAGAATTGACTACGGTAAGTTTCTTGATAATTTTTAGGTCCTTGAAATGCATTTTTTTTTTGTTTTCTCGAGTATCCCTATATTTCCATGTAAAATTTCTGCAAGCATCTCATTTCTCAAGGTCATTTATGGTGCAAAATATTGGAAATGCAGCTCTTATGAGAGCCACAGACTAATTTATACTGTGAAAGGACTATAATTTTTGTTGTCCAAACTTCTTAATTTTCCTCGTGTGATCATTTAGCAAAAAAAAAGTAACTTAACTTGGCAGTAATGTTACTTCTTACTATTTTAAATAGTTGTTTAGTTATTTTTAAATTGGAACCCTGCTTGGGATCTTCATGATGCTGATTTTATTACTACACGAAGCCTACTACTACTGATTCTAATTTTTCTATTATGTTTCTATATTGAAGTTTTCCGTTGGGCTGGTCGATATCATTGCTCAAGGTTACTGTTTGATCCATATTTGTGTACATTTTCCTGTTCTGTGGATTTCCCTTTCTTATTATATTTTTCTCCTTTAGTACACTCTTTTACCACTTGTAATTAGCTAGAATAGTTGTATAAATTCTAGTGCCGTATTTTTCTTCGGGGGCTTATATATGAAGGTTTCTATATGGTTGTTATTTTGGGACTGCACTGTTCTATTTGGGGCTGGTGCTGTTTTTCCTGTCTTTCTGTTGTACTTACATTTTCCGAATTGAAGCATCTTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGCGTTTGCTAACTCCTAGCCGTTATGCGTCAGGGAAAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTGCTAGTACGAGTGGTGCTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAAGTATTCAAGTATTACCCTTTGCATATAATGCTATTATGTAATTATTATCTTATTATCAAAAAGGCAGAAAAAAATAGGGTTTCTTGTCAAATAATATTTTGCATCAAATGTCTTATTTCATCGGCAATCTCCATACTCGATTGGTTGAAGATTTTACTGCATGACCATAATTATAGACACATATTTGATACTTGATTTATGATCTTAAAAGTTAGAACTTCAAAATCTGTGTTTTTCTAAGAGTTCACATGTGTTTAGCTTGTATTGGGTCTTCGTAGGTGGCCTCTGTTTCAAAGAGAGCATTCTGTGAGTTTGTGGTGTTTACTGTTTTCTGAGAGGTTAGGTTGGGAAGGAATCGTCTTCAACAAATGGAGAACTCTTGAAAGGAACTGTGGGATCTTATTATTTCGTAGCCACCTAGAAGGCTTCTGATGTGTCCTTAAGTTGAGATGAGCTGGGAAATTAAGAAACTAGCGATTGAAACGAATTACCATATTAAAGGAAAGTACAAAAGGTTTACCAAGCATTTGAGAGGCTTACATTTCTCTCGATGCTATTTCCAGAATATGTCTATCCCCCACTTCTCCCAATCTATATTATTTATAAACACTATTCCCTAAAAAAAGCTCATTAACCAATTATTAACATACAATTACTAATTTTCCCAACAGCTTCTTTGTTTAAATATTTTTCCAATTATTTATTTTCTACGATCTTAGGTAATTGGGGAGCCTTTTTGTAACTTTTACTTTCAGGAGGCTTTTGCTTTGGGTTGTGAATTTAGTTTATGAATGAAATTAGTTTCCGTAAAAAGAAATGTCCTCTTAAGTTCTTCACATTCTTGGAGAAAGTGCCACCAGTTTGTACCATTACGGTATTGTTATATCTGTATTACGGTCTTCTATTATTCCTAAACATAGTAGGATCAATTTACCGTATGGAAATTGGCGGAAGCAAAGTAGATGTTAGGAAGGCTATCAGTAGAAATACCATCTAGATTGACGGCATTGCCCATCTCAAGTAATGACAATAGACTGTGAGTAGCGATATTTTGACTTAATGAAACCAGCACCGCATTAGTCCTGGATGTTTTTCTAGTACAACTACCTTAAGCCCATCCATAATTCCAGATTCTTTCACTTGATCAGCTAGGCCGTGCTAGTTGCTATACAGCACATTGGTTTGTATTTATAATCCTAATAAGGCTCTACAAATATGAAACAACCAACGTAACGTTGAGTCCATCTTTTATCATGTTGGTAGACAAATAGTCTTTCAACAATTTCAGATCAAATGTGAGTTAATTGCAACTTCTTTATTCTCATATATTACTCTTCACGTTGAGGCTTATTTCAGTTAATGCAAATGTTTTTTTTTCGTGATCTGTCTAGCAAAATGATTTTATTATTTAATTTATTTCCTATACTTGTTAAGTTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCTGAAGGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAGTGAAAATATCCTGTTTATGATTCTTGAATAGTTTGTTTTAATTCTTTCTCTGCAGCTTTCTATTATTGTTGATTCAGTCTTTAAATTTGTCTGCGGCTGTATGCTCGCTCCCTTAATTGATTGCTACAGACAGAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAGATTAGACCTCCTGTTCTTGAAGGAAATCAGACATCAATTTCTAAGGGGAAAAAAAGTTCTCGGGCTACAGGAAATGCTCGGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGGTAATATCTGAAGACCTTACTTAATATGTGGAAACAGGTTACTGGTATGGAATCCATGTACTGCTACATTTGGGGGTTTATTAAAACTACTAATTATTTTTCTTGATGCTAAATGAAAAGGAAAGGATCATTAAAGCCCAATGAAAGTACAAGCAAGAGCTGTTAGAACCTAGCTATTGAGGAACCAAAACAAAAATACTCCCCCTCAAACTATAAAACACTTGGTGAAGCAATGGAAAGGCATTTACCTTTCCCTTGGTTGTTAACAATGATAATAATGTTTTAAAAAGTCAGCACAAAAGTGATTGTAATTCCAGAGCAAAGTATTAAAATGTGTTAGTTAATTACCTCTAAACCCTCTGTAGTCTTATCCTTAAATTCTCCTATTACATTCCAAAAATGAACTGCTTCATGCCAAAGAATCCTCCCTCTCCCCATGACTGGAAGATTACTAAGCATTTCCTCCATACTGCTCCAATTAAGGAATGGTCATCTATCGTTGGAAGCAGAAGGTCTCCTTCCCAGAGAACTGAGCGAAGATATGCGCCTTGTGCATTACAGCTTTAATGACGTGATGGATTCCTTACCTGATAAATTTTTCAGTGGAAGACAGATTATGAGATAACATTCTTTTAGAGCATAAAACAAGAAACTCTTAAAAAGAAAAGACCATCCAAGATATTCCAAAAACATGATTGTTTCTTTCCATCCAAATGACCCAACAAAGTACTGAAACTGCAAATTCCAGAAAGCTCTCGATCTTGTGCTAGTAACTCCTCCAAAATTGTCTTAACCGAGTAATGAATTTTATATTTTTAAGGCCTACATATATTTGAAATGAATGCTTACTGTTTTCAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACGCCCAAAGGTTCTAATATTTTTCATTGATTTTATCACGATGGTCACGAGAATACTACAGTAACATCAAATCTGCAAAATTGCTTTCCTACAGTTTCTGCCCAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGGTTGACCAGGTGACAGATCTTCATTTTTAAATAGCACTTTTTCTCCTCGTAATGACTGTAATGGATTGTCATGTATTTTCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCAGGGACAAGATGATGGTCAATCTTTCATGCTCTCTCCATTTACTTACTGCTAGTTATTTCTCTAACAGGCTTAGATATGACAATATTGGCTTACTAAGAATTTGTCACCTCTCTTTTACGTTGGGTTCTCATGGAAATTCTAATAGGATTGCTCAAGAAATCATACATCTCTTGTAGTAGATCCAATAGAATATTGATTCTCATTTCCACAAGATAGATGTGCATTTTATTTGGACTAATTTTCTGTGCTCTATATTTGTAGTCAGTTTATGCCAATTTGAGTTCTGGAATTACCTAGTGTGTAGCTTTATAGTCGCCTTCAGTCGACTGTGTTTATATGGTATATTTACATGGTCGGTTGATGCATATTAGCTTGAATTTGCAGGAGGAGAAGTCCAGAGGAAGGATACAGAATATGAGGTACTTGTGCTATTTTTACAAATTGTCCTTTCAAAAGATTTGTTGCTTTATTAGCTTCAGAATTTCTTATGTGCACTAATCCAACTTGGAGTGTACATGCTTCCCTAAGTTATTATCTAACCGGACAAGTTGCCATTATAACTGCAATAACCAGAAGTATCTTAATAGCAATCAGCTTAGGGCATGTTTGGAATAACAAGGGCTTAAAAATGTGTTTTTGAACACTTGAAAGTCATTCCAAAAGACTCTTAGCTCCTGGTGAAAAGGTTTCATCACAAATAATTCTTTTTGCTTCTCAAGTATCAAAATTTTACGTTTTTTTTCTTTTAATTAAGATTTTGTTATTTATTTTTATCAATGAAATCTTACGTGTTAACACACGAGGTTTCTAAAATGATCAGCGTTAGTTTTTGCTTGGGGATACTAACAAAAAGCCTTCATTAATTTTATGTTGTTAATTATAAAAGGGCACCTGGCATCCTTCAAACAACTTCAACCAGCCATTCCTTTTCATGTCTTATTGCCTTATCATGTTACAAGTGAGGGTTTTTAGAATTTAGTTTAAAATGTTATTGGTTTCTAGATTTTGGGTCTCATCATCCAAATTTAGTCCTTGTACTTTCAAATGCAATGCCAAATTTAGTAGTTATGTACCCTCAAATGTTTTAATAAATATTAAAATTAGTTATATTATTGTTAGTTTGAAGTTGATTTGTATTGAAATTGGCTATATAATAATAATGATAATAATATCTTTGCAAGAAAGGAGACCATAATATCTTTGCAAAGAAGGAGGCTATGTGAATATGTTTTCAAAATTTATAGAGGAAAGACTAATAACTAGCAGGAGGCCTAATTTTAAGATTTATTGGACATTTCTTAGTAGGAAGATTAAAATGAAATACAAGATCAAATAGTATTATAACCTAAAATTTAATATGTTCTGAACTTGCAACTCCATGTAGTCTATTTCCATTTACTACTCACAAATCACGATGGATTGCTGAATGAAATGGTTGACGGTACATGTCATTGTATGTTAAGTACGTTTTATCTATATCTTGTAGGTTGAAGATGAGATTGAAGAATTGTCAAGTTCTCAAGAGGTGAGTGTTCACTGCCTTTTTATAAACCTAAATACTCCCATGATTTTTTAGAATATTGGATTGGAAGTTAGTGGGCCCTAGGTTTGTACTCATTTTGGAGTCTGATGAAATATTTGTCTGAACTACTCTTTTGATGTTACATATAGTTAATTTCTCGCTTGGCTTAGTCCCTACCCTTTATAAACTTCTTACGTCGTGCGTGCCTCGCACCATGTTTGTAGGAAAGTTGTCATTTGTATACCGTAATTATTTATTTCATGTGAAGTTAAGAACAAAACAGGTTCTCCATTTGCCTAATTGTAGTTCTTGATCATGATCACGAAAAAGTGTTCGCATTGTCATAAACTTTTCACATGCAGGTGATTGAATTAAAATGTTTCAATAATACAGCAATTAAACTTTAAGTGGGAGTGAAAAGATGAAATCAAAGTGAAATGATCATTTAGGGACTAGATGAATAGTTTAACTAAAACATATTCCTTTTAGTTTCTTTCATAAATATTGGTTAGGAGCTTGCACATTCTTACTCATGATTTTGTTTGTCATTTAGGACACTGATGAAGATTGGACAAGTTGAGGTTATTACATTCTAACAATTCTAAGCCGCAGCAGCAGCAAGGATCGCATTGCAGGGGCTATATTCAGAATGCTATGCTCCCAGTTTCTTTGATGCTGCTGCCTAATTGAAATCAAAGAGATTTAACCATAAAATGATATTGAAGTTAAGCTTTTATCCTAAAGTTAGGCAATGTAAAATTATGTTTTGGTATCAGAGAGATCTACTCTTACACTTCTTAGGGACCAGCCCAGATTTTCAGCTTGAAGGTAAAAATTATTCACAAGATTTTATGCAGCTTATAAACTATGGACTGTCCATGCCTTTGGATAATGTGGTCCAATTAGTTACTAATGTTTCTAGACAATATGGTAACTAATCTAAGTATAG
mRNA sequence
ATTTTCTGCCAAATATAATGAAATGAGAAATTTAATTAAAGGAGAAGAAAAGAGTCAAGGATCCCCAAGAGTGAACTGCAGTCCGAAGCTGACTACCAGCATCAGAAATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATTCGGAGATTGCAGCACGAAAGCGGCTTAAGAAACTCGCATTCTCCAATCACATACTTTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAATTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTTTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTCGCCTTGATTTTCCTAAAGAATTGACTACGGGAAAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTGCTAGTACGAGTGGTGCTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAAGTATTCAATTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCTGAAGGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAGATTAGACCTCCTGTTCTTGAAGGAAATCAGACATCAATTTCTAAGGGGAAAAAAAGTTCTCGGGCTACAGGAAATGCTCGGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACGCCCAAAGTTTCTGCCCAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGGTTGACCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCAGGGACAAGATGATGGAGGAGAAGTCCAGAGGAAGGATACAGAATATGAGGTTGAAGATGAGATTGAAGAATTGTCAAGTTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGAGGTTATTACATTCTAACAATTCTAAGCCGCAGCAGCAGCAAGGATCGCATTGCAGGGGCTATATTCAGAATGCTATGCTCCCAGTTTCTTTGATGCTGCTGCCTAATTGAAATCAAAGAGATTTAACCATAAAATGATATTGAAGTTAAGCTTTTATCCTAAAGTTAGGCAATGTAAAATTATGTTTTGGTATCAGAGAGATCTACTCTTACACTTCTTAGGGACCAGCCCAGATTTTCAGCTTGAAGGTAAAAATTATTCACAAGATTTTATGCAGCTTATAAACTATGGACTGTCCATGCCTTTGGATAATGTGGTCCAATTAGTTACTAATGTTTCTAGACAATATGGTAACTAATCTAAGTATAG
Coding sequence (CDS)
ATGGCGCGAGGATCATCGTCTTCAAAGAGGGACGAAGCAAAAGGAGAAATCGATTCGGAGATTGCAGCACGAAAGCGGCTTAAGAAACTCGCATTCTCCAATCACATACTTTCAGAGACCCAGGCAAAGCCTCAGGCGTATCTGAGCCCTTCAGCGACGGTTCTGAAGCACCATGGCAAAGACATTGTCAAGAAATCTCAGCGAAAGAACAGGTTCCTCTTCTCCTTTTCAGGCTTGCTCGCTCCCGTCAGTGGAGGCAAGATTGGCGAGCTCAAAGATTTGGGAACCAAGAATCCTATTCTCTATCTCGATTTTCCTCAGGGGCGTATGAAATTGTTTGGAACTATTATGTATCCGAAGAACAAATATTTGACTTTGCAGTTCTCTAGAGGTGGAAAGAATGTGATGTGTGAAGATTATTTTGATAATATGATTGTCTTTTCTGATGCATGGTGGATTGGAACTAAAGATGAAAATCCAGAGGAGGCTCGCCTTGATTTTCCTAAAGAATTGACTACGGGAAAATGTGGAGAATATGACTTTAACGGTGGTGCTGGTGTTGCTAGTACGAGTGGTGCTGCTGGTGTTACCAGTTCAAGTAAGCAGAGTGTTAAAAATAAGGGAATCAATCCTGCTGTAGAAAATTCCTTTAAAGGAGAAGATGGAGATGATTTACTGGACCTTGAAGAAAATGTGACAAATTCAATAAAGACTACGCCAGTTAGACATTCTGAAAGATCTGCCGGAAAAGTATTCAATTTTGCAGAGGCTTCTTCTGAGGATGAGTCTGCTGGCACCTACGATGATTTGTCTGAAGGAGAAGAAAAGAATATTCTCATACACGAACCTTCAATTGGAGATCATGCTAAAGATCTCTCTGTTGAGTCTATAGATGAAGATGCTGTGGAGATTAGACCTCCTGTTCTTGAAGGAAATCAGACATCAATTTCTAAGGGGAAAAAAAGTTCTCGGGCTACAGGAAATGCTCGGAGTGATGCTCGTGGACTTGTCCAGCCTACTCTACTTAGTTTGTTCAAGAAAGTGGAGGAGAAGAGGACACCAAGAAGTTCAAAGAGGTCTTCAACGCCCAAAGTTTCTGCCCAAAAGATGCAGCTGTCTGGTTCAAAGCGAAAGGTTGACCAGGATGAAGGATCAAAAAAGAGGAGGGCTGTCCAGGGACAAGATGATGGAGGAGAAGTCCAGAGGAAGGATACAGAATATGAGGTTGAAGATGAGATTGAAGAATTGTCAAGTTCTCAAGAGGACACTGATGAAGATTGGACAAGTTGA
Protein sequence
MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYDFNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTPVRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRSSKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEELSSSQEDTDEDWTS
Homology
BLAST of CcUC01G014520 vs. NCBI nr
Match:
XP_004137530.1 (DNA-binding protein RHL1 isoform X2 [Cucumis sativus] >KGN64211.1 hypothetical protein Csa_014154 [Cucumis sativus])
HSP 1 Score: 706.1 bits (1821), Expect = 1.9e-199
Identity = 377/433 (87.07%), Postives = 400/433 (92.38%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARG SSSK+DEAKGEI+ EIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEA LDFPK+LT G+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGGAGV STSG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTP
Sbjct: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVES 300
VRHSERSA KVFNFAEASSEDESAGT DLSEGEEKNI+IHEPSIGDHA +D+SVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+I+PP LEGNQTSISK KKS RA G+A+SD RGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEEL 420
SKRSS PKVS QKMQLSGSK+K+DQDEGSKKRR V+GQ GG+ Q+KDTEYEVEDEIE+L
Sbjct: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ--GGKAQKKDTEYEVEDEIEDL 420
Query: 421 SSSQEDTDEDWTS 430
SSSQEDTDEDWTS
Sbjct: 421 SSSQEDTDEDWTS 430
BLAST of CcUC01G014520 vs. NCBI nr
Match:
XP_038893754.1 (DNA-binding protein RHL1 isoform X1 [Benincasa hispida])
HSP 1 Score: 702.2 bits (1811), Expect = 2.8e-198
Identity = 374/433 (86.37%), Postives = 394/433 (90.99%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSKRDEAKGEID IAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGEIDPGIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLRTKNPILYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEA LDFP ELTTG+CGE D
Sbjct: 121 NRYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEAHLDFPIELTTGQCGECD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGG AGVT SKQSV+ KGINPAVENS KGE GDDL+DL++NVTNSIKTTP
Sbjct: 181 FNGG---------AGVTGLSKQSVQKKGINPAVENSLKGEHGDDLVDLKDNVTNSIKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAK----DLSVES 300
VRHSERSA KVFNFAE SSEDES TY DLSEGEEKNI+IHEPSIGDHA+ DLSV+S
Sbjct: 241 VRHSERSARKVFNFAEVSSEDESTSTYADLSEGEEKNIVIHEPSIGDHAREKTEDLSVDS 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRS 360
+DEDA EIRPP LEGNQTSIS KKSS A G+A+SD RGLVQPTLLSLFKKVEEKRT RS
Sbjct: 301 MDEDAGEIRPPFLEGNQTSISTEKKSSLAKGSAQSDTRGLVQPTLLSLFKKVEEKRTSRS 360
Query: 361 SKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEEL 420
SKRSSTPKVS QKMQLSGSKRK+DQDEG +KRRAV+GQDDGG++Q+KDTEYEV+D+IEEL
Sbjct: 361 SKRSSTPKVSVQKMQLSGSKRKIDQDEGLRKRRAVRGQDDGGKIQKKDTEYEVKDDIEEL 420
Query: 421 SSSQEDTDEDWTS 430
SSSQEDTDEDWTS
Sbjct: 421 SSSQEDTDEDWTS 424
BLAST of CcUC01G014520 vs. NCBI nr
Match:
XP_031744387.1 (DNA-binding protein RHL1 isoform X1 [Cucumis sativus])
HSP 1 Score: 699.9 bits (1805), Expect = 1.4e-197
Identity = 377/438 (86.07%), Postives = 400/438 (91.32%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARG SSSK+DEAKGEI+ EIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEA LDFPK+LT G+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGGAGV STSG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTP
Sbjct: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVES 300
VRHSERSA KVFNFAEASSEDESAGT DLSEGEEKNI+IHEPSIGDHA +D+SVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+I+PP LEGNQTSISK KKS RA G+A+SD RGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEY-----EVED 420
SKRSS PKVS QKMQLSGSK+K+DQDEGSKKRR V+GQ GG+ Q+KDTEY EVED
Sbjct: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ--GGKAQKKDTEYELQQLEVED 420
Query: 421 EIEELSSSQEDTDEDWTS 430
EIE+LSSSQEDTDEDWTS
Sbjct: 421 EIEDLSSSQEDTDEDWTS 435
BLAST of CcUC01G014520 vs. NCBI nr
Match:
XP_008467323.1 (PREDICTED: DNA-binding protein RHL1 isoform X2 [Cucumis melo])
HSP 1 Score: 689.5 bits (1778), Expect = 1.8e-194
Identity = 369/433 (85.22%), Postives = 392/433 (90.53%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSK+DEAKGEI+ EI RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEA LDFPKELT G+CGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT P
Sbjct: 181 FNGG---------AGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVES 300
VRHSERSA KVFNFAEASSEDES GT DLSEGEEKNI+IHEPSIGDHA +D+SVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAVEI+P LEGNQTSISK KK+SRA G+A+SD RGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEEL 420
SKRSS PKVS QKMQLSGSK+K+DQDEGSKKRRAV+GQ GG+ QRKDTEYEVEDEIEEL
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQ--GGKAQRKDTEYEVEDEIEEL 420
Query: 421 SSSQEDTDEDWTS 430
SSSQEDTDEDWTS
Sbjct: 421 SSSQEDTDEDWTS 422
BLAST of CcUC01G014520 vs. NCBI nr
Match:
XP_023519239.1 (DNA-binding protein RHL1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 671.4 bits (1731), Expect = 5.2e-189
Identity = 361/434 (83.18%), Postives = 390/434 (89.86%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSKRDEAKG +D EIAARKRLKKLAF+N+ILSETQAKPQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGAMDPEIAARKRLKKLAFTNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGR+KLFGTI+YPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRLKLFGTIVYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNVMCED FDNMIVFSDAWWIGTKDENPEE RLDFPKE+T G+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVMCEDCFDNMIVFSDAWWIGTKDENPEEDRLDFPKEMTMGQCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGGAGVAST SKQSV+ KGIN A ENS K E GDDL+DLE+N+TNS+KTTP
Sbjct: 181 FNGGAGVAST---------SKQSVQKKGINRAEENSLKEEHGDDLVDLEDNMTNSMKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKD----LSVES 300
VRHSERSAGKVFNFA+A S++ESAGTY D SEGEEKNI+IHEPSIGDHA + +SV+S
Sbjct: 241 VRHSERSAGKVFNFAQAFSKEESAGTYADFSEGEEKNIVIHEPSIGDHASEKTEVVSVDS 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARG-LVQPTLLSLFKKVEEKRTPR 360
D+DAVE RP LEGN+T ISK K SRA GNA+S RG LVQPTL SLFKKVEEKRTPR
Sbjct: 301 EDKDAVE-RPRFLEGNRTPISKSKNGSRAKGNAQSGNRGLLVQPTLPSLFKKVEEKRTPR 360
Query: 361 SSKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEE 420
SSKRSSTPKVSAQKMQLSGSK+K+DQDEG KKRR VQGQDDGG+++RKDTEYE ED+IEE
Sbjct: 361 SSKRSSTPKVSAQKMQLSGSKQKIDQDEGLKKRRVVQGQDDGGKIRRKDTEYEDEDDIEE 420
Query: 421 LSSSQEDTDEDWTS 430
LSSSQEDTDEDWTS
Sbjct: 421 LSSSQEDTDEDWTS 424
BLAST of CcUC01G014520 vs. ExPASy Swiss-Prot
Match:
O81242 (DNA-binding protein RHL1 OS=Arabidopsis thaliana OX=3702 GN=RHL1 PE=1 SV=1)
HSP 1 Score: 306.2 bits (783), Expect = 5.8e-82
Identity = 185/369 (50.14%), Postives = 236/369 (63.96%), Query Frame = 0
Query: 5 SSSSKRDEAKG--EIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDI 64
+SSSK+ +KG + D+E RKRLK LA N +LS++ AK + L PS VLKHHG DI
Sbjct: 4 ASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHGTDI 63
Query: 65 VKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNK 124
++KSQRKNRFLFSF GLLAP+S IG+L L TKNP+LYL+FPQGRMKLFGTI+YPKN+
Sbjct: 64 IRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYPKNR 123
Query: 125 YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYDFN 184
YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEARLDFPKEL + E+DF
Sbjct: 124 YLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQAENTEFDFQ 183
Query: 185 GGAGVASTSGAAGVTSSSKQSVKNKGI---NPAVEN-SFKGEDGDDLLDLEE-----NVT 244
GGAG GAA V + + ++ +P V+N EDG+ L D + +T
Sbjct: 184 GGAG-----GAASVKKLASPEIGSQPTETDSPEVDNEDVLSEDGEFLDDKIQVTPPVQLT 243
Query: 245 NSIKTTPVRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLS 304
++ TPVR S+R++GK FNFAE SSE S + + S+ +EK +L E S +
Sbjct: 244 PPVQVTPVRQSQRNSGKKFNFAETSSEASSGESEGNTSDEDEKPLLEPESSTRSREESQD 303
Query: 305 VESIDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRT 363
I A ++ P L + + K K S LVQ TL +LFKK EEK
Sbjct: 304 GNGITASASKL-PEELPAKREKL-KSKDSK------------LVQATLSNLFKKAEEKTA 353
BLAST of CcUC01G014520 vs. ExPASy TrEMBL
Match:
A0A0A0LVZ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043140 PE=4 SV=1)
HSP 1 Score: 706.1 bits (1821), Expect = 9.2e-200
Identity = 377/433 (87.07%), Postives = 400/433 (92.38%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARG SSSK+DEAKGEI+ EIA RKRLKKLAFSNHILSETQA+PQAYLSPSATVLKHHGK
Sbjct: 1 MARG-SSSKKDEAKGEINPEIAERKRLKKLAFSNHILSETQARPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLSTKNPILYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNV CED FDNMIVFSDAWWIGTKDENPEEA LDFPK+LT G+CGEYD
Sbjct: 121 NRYLTLQFSRGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKDLTMGQCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGGAGV STSG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNSIKTTP
Sbjct: 181 FNGGAGVTSTSGVAGVTSTSKQSVQRKGINPAAENSFKGEHGDDLVGLEASVTNSIKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVES 300
VRHSERSA KVFNFAEASSEDESAGT DLSEGEEKNI+IHEPSIGDHA +D+SVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESAGTDADLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAV+I+PP LEGNQTSISK KKS RA G+A+SD RGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVKIKPPFLEGNQTSISKEKKSFRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEEL 420
SKRSS PKVS QKMQLSGSK+K+DQDEGSKKRR V+GQ GG+ Q+KDTEYEVEDEIE+L
Sbjct: 361 SKRSSAPKVSTQKMQLSGSKQKIDQDEGSKKRRVVRGQ--GGKAQKKDTEYEVEDEIEDL 420
Query: 421 SSSQEDTDEDWTS 430
SSSQEDTDEDWTS
Sbjct: 421 SSSQEDTDEDWTS 430
BLAST of CcUC01G014520 vs. ExPASy TrEMBL
Match:
A0A1S3CTA5 (DNA-binding protein RHL1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103504698 PE=4 SV=1)
HSP 1 Score: 689.5 bits (1778), Expect = 8.9e-195
Identity = 369/433 (85.22%), Postives = 392/433 (90.53%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSK+DEAKGEI+ EI RKRLKKLAFSN+ILSETQAKPQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKKDEAKGEINPEIGERKRLKKLAFSNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDL TKNP+LYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLATKNPVLYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFS+GGKNV CED FDNMIVFSDAWWIGTKDENPEEA LDFPKELT G+CGEYD
Sbjct: 121 NRYLTLQFSKGGKNVTCEDCFDNMIVFSDAWWIGTKDENPEEACLDFPKELTLGQCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGG AGVTS+SKQSV+ KGINPA ENSFKGE GDDL+ LE +VTNS+KT P
Sbjct: 181 FNGG---------AGVTSTSKQSVQKKGINPATENSFKGEHGDDLVGLEASVTNSVKTMP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVES 300
VRHSERSA KVFNFAEASSEDES GT DLSEGEEKNI+IHEPSIGDHA +D+SVES
Sbjct: 241 VRHSERSARKVFNFAEASSEDESTGTDTDLSEGEEKNIVIHEPSIGDHASEKTEDISVES 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRTPRS 360
IDEDAVEI+P LEGNQTSISK KK+SRA G+A+SD RGLVQPTLLSLFKKVEEKRTPRS
Sbjct: 301 IDEDAVEIKPSFLEGNQTSISKEKKNSRAKGSAQSDTRGLVQPTLLSLFKKVEEKRTPRS 360
Query: 361 SKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEEL 420
SKRSS PKVS QKMQLSGSK+K+DQDEGSKKRRAV+GQ GG+ QRKDTEYEVEDEIEEL
Sbjct: 361 SKRSSVPKVSTQKMQLSGSKQKIDQDEGSKKRRAVRGQ--GGKAQRKDTEYEVEDEIEEL 420
Query: 421 SSSQEDTDEDWTS 430
SSSQEDTDEDWTS
Sbjct: 421 SSSQEDTDEDWTS 422
BLAST of CcUC01G014520 vs. ExPASy TrEMBL
Match:
A0A6J1E8G5 (DNA-binding protein RHL1-like OS=Cucurbita moschata OX=3662 GN=LOC111431599 PE=4 SV=1)
HSP 1 Score: 670.2 bits (1728), Expect = 5.6e-189
Identity = 361/434 (83.18%), Postives = 389/434 (89.63%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSKRDEAKGE++ EIAARKRLKKLAF+N+ILSETQAKPQAY SPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGEMEPEIAARKRLKKLAFTNNILSETQAKPQAYPSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGR+KLFGTI+YPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRLKLFGTIVYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNVMCED FDNMIVFSDAWWIGTKDENPEE RLDFPKE+T GKCGEYD
Sbjct: 121 NRYLTLQFSRGGKNVMCEDCFDNMIVFSDAWWIGTKDENPEEDRLDFPKEMTMGKCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGGAGVAST SKQSV+ KGIN A E S KGE GDDL+DLE+N+TNS+KTTP
Sbjct: 181 FNGGAGVAST---------SKQSVQKKGINRAEEKSLKGEHGDDLVDLEDNMTNSMKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKD----LSVES 300
VRHSERSAGKVFNFA+A S++ESAGTY D SEGEEKNI+IHEPSIGDHA + +SV+S
Sbjct: 241 VRHSERSAGKVFNFAQAFSKEESAGTYADFSEGEEKNIVIHEPSIGDHASEKTEVVSVDS 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARG-LVQPTLLSLFKKVEEKRTPR 360
D+DAVE RP LEGN+T ISK K SRA GNA+S RG LVQPTL SLFKKVEEKRTPR
Sbjct: 301 EDKDAVE-RPRFLEGNKTPISKSKNGSRAKGNAQSGNRGLLVQPTLPSLFKKVEEKRTPR 360
Query: 361 SSKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEE 420
SSKRSSTPKVSAQKMQLSGSK+K+DQDEG KKRR VQGQDDGG+ +RKDTEYE ED+IEE
Sbjct: 361 SSKRSSTPKVSAQKMQLSGSKQKIDQDEGLKKRRVVQGQDDGGKFRRKDTEYEDEDDIEE 420
Query: 421 LSSSQEDTDEDWTS 430
LSSSQEDTDEDWTS
Sbjct: 421 LSSSQEDTDEDWTS 424
BLAST of CcUC01G014520 vs. ExPASy TrEMBL
Match:
A0A6J1KHM2 (DNA-binding protein RHL1-like OS=Cucurbita maxima OX=3661 GN=LOC111495823 PE=4 SV=1)
HSP 1 Score: 656.4 bits (1692), Expect = 8.4e-185
Identity = 356/434 (82.03%), Postives = 384/434 (88.48%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSKRDEAKGE+D EIAARKRLKKLAF+N+ILSETQAKPQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKRDEAKGEMDPEIAARKRLKKLAFTNNILSETQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGR+KLFGTI+YPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRLKLFGTIVYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNVMCED FDNMIVFSDAWWIGTKDENPEE RLDFPKE+T GKCGEYD
Sbjct: 121 NRYLTLQFSRGGKNVMCEDCFDNMIVFSDAWWIGTKDENPEEDRLDFPKEMTMGKCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGGAGVAST SKQSV+ KGI+ A ENS K E GDDL+DLE+N+TNS+KTTP
Sbjct: 181 FNGGAGVAST---------SKQSVQKKGIDRAEENSLKEEHGDDLVDLEDNMTNSMKTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKD----LSVES 300
VRHSERS GKVFNFA+A S++ESAGT D SEGEEKNI+I+EPSIGDHA + +SV+S
Sbjct: 241 VRHSERSGGKVFNFAQAFSKEESAGTLADFSEGEEKNIVIYEPSIGDHASEKTEVVSVDS 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARG-LVQPTLLSLFKKVEEKRTPR 360
D+DAVE RP LEGN+T ISK K SRA GNA+S RG LVQPTL SLFKKVEEKRTPR
Sbjct: 301 EDKDAVE-RPRFLEGNKTPISKSKNGSRAKGNAQSGNRGLLVQPTLPSLFKKVEEKRTPR 360
Query: 361 SSKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEE 420
SSKRSSTPKVSAQK QLSGSK+K+DQDEG KKR VQGQDDGG+ RKDTEYE ED+IEE
Sbjct: 361 SSKRSSTPKVSAQKKQLSGSKQKIDQDEGLKKRGVVQGQDDGGKFGRKDTEYEDEDDIEE 420
Query: 421 LSSSQEDTDEDWTS 430
L SSQEDTDEDWTS
Sbjct: 421 LLSSQEDTDEDWTS 424
BLAST of CcUC01G014520 vs. ExPASy TrEMBL
Match:
A0A6J1C822 (DNA-binding protein RHL1 OS=Momordica charantia OX=3673 GN=LOC111008835 PE=4 SV=1)
HSP 1 Score: 655.6 bits (1690), Expect = 1.4e-184
Identity = 355/435 (81.61%), Postives = 381/435 (87.59%), Query Frame = 0
Query: 1 MARGSSSSKRDEAKGEIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGK 60
MARGSSSSKR+ GE+D E A RKRLKKLAFSN++LS+TQAKPQAYLSPSATVLKHHGK
Sbjct: 1 MARGSSSSKREATNGELDPEAATRKRLKKLAFSNNVLSQTQAKPQAYLSPSATVLKHHGK 60
Query: 61 DIVKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
DIVKKSQRKNRFLFSFSGLL PVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK
Sbjct: 61 DIVKKSQRKNRFLFSFSGLLTPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPK 120
Query: 121 NKYLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYD 180
N+YLTLQFSRGGKNV CEDYFD+M+VFSDAWWIGTKDENPEEARLD PKELTTGKCGEYD
Sbjct: 121 NRYLTLQFSRGGKNVTCEDYFDSMVVFSDAWWIGTKDENPEEARLDIPKELTTGKCGEYD 180
Query: 181 FNGGAGVASTSGAAGVTSSSKQSVKNKGINPAVENSFKGEDGDDLLDLEENVTNSIKTTP 240
FNGG AGVTS+SK SV+ KGIN A E+S KGE GD L DLE+N+ NSI TTP
Sbjct: 181 FNGG---------AGVTSTSKHSVQKKGINHAEEHSHKGERGDGLADLEDNMINSINTTP 240
Query: 241 VRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHA----KDLSVES 300
VRHSERSAGKVF FAEASSEDES G+YDDLSEGEEKNI+IHEPSIGDHA +DLSV++
Sbjct: 241 VRHSERSAGKVFKFAEASSEDESTGSYDDLSEGEEKNIVIHEPSIGDHASEKTEDLSVDA 300
Query: 301 IDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGL-VQPTLLSLFKKVEEKRTPR 360
DED + RPP LEGNQ ISK KK SRA GN S+ RGL VQ TL SLFKKVEEKRTPR
Sbjct: 301 KDEDTMG-RPPFLEGNQVLISKPKKVSRAKGNVESNNRGLFVQSTLPSLFKKVEEKRTPR 360
Query: 361 SSKRSSTPKVSAQKMQLSGSKRKVDQDEGSKKRRAVQGQDDGGEVQRKDTEYEVEDEIEE 420
SSKRSS PKVSA+KMQLSGSKRK++QDEGSKKRRAV+GQDDGG+V RKD EYEVED+IEE
Sbjct: 361 SSKRSSAPKVSAEKMQLSGSKRKIEQDEGSKKRRAVRGQDDGGKVPRKDAEYEVEDDIEE 420
Query: 421 LSSSQ-EDTDEDWTS 430
LSSSQ EDTDEDWTS
Sbjct: 421 LSSSQEEDTDEDWTS 425
BLAST of CcUC01G014520 vs. TAIR 10
Match:
AT1G48380.1 (root hair initiation protein root hairless 1 (RHL1) )
HSP 1 Score: 306.2 bits (783), Expect = 4.1e-83
Identity = 185/369 (50.14%), Postives = 236/369 (63.96%), Query Frame = 0
Query: 5 SSSSKRDEAKG--EIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDI 64
+SSSK+ +KG + D+E RKRLK LA N +LS++ AK + L PS VLKHHG DI
Sbjct: 4 ASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHGTDI 63
Query: 65 VKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNK 124
++KSQRKNRFLFSF GLLAP+S IG+L L TKNP+LYL+FPQGRMKLFGTI+YPKN+
Sbjct: 64 IRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYPKNR 123
Query: 125 YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELTTGKCGEYDFN 184
YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEARLDFPKEL + E+DF
Sbjct: 124 YLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQAENTEFDFQ 183
Query: 185 GGAGVASTSGAAGVTSSSKQSVKNKGI---NPAVEN-SFKGEDGDDLLDLEE-----NVT 244
GGAG GAA V + + ++ +P V+N EDG+ L D + +T
Sbjct: 184 GGAG-----GAASVKKLASPEIGSQPTETDSPEVDNEDVLSEDGEFLDDKIQVTPPVQLT 243
Query: 245 NSIKTTPVRHSERSAGKVFNFAEASSEDESAGTYDDLSEGEEKNILIHEPSIGDHAKDLS 304
++ TPVR S+R++GK FNFAE SSE S + + S+ +EK +L E S +
Sbjct: 244 PPVQVTPVRQSQRNSGKKFNFAETSSEASSGESEGNTSDEDEKPLLEPESSTRSREESQD 303
Query: 305 VESIDEDAVEIRPPVLEGNQTSISKGKKSSRATGNARSDARGLVQPTLLSLFKKVEEKRT 363
I A ++ P L + + K K S LVQ TL +LFKK EEK
Sbjct: 304 GNGITASASKL-PEELPAKREKL-KSKDSK------------LVQATLSNLFKKAEEKTA 353
BLAST of CcUC01G014520 vs. TAIR 10
Match:
AT1G48380.2 (root hair initiation protein root hairless 1 (RHL1) )
HSP 1 Score: 290.0 bits (741), Expect = 3.0e-78
Identity = 185/400 (46.25%), Postives = 236/400 (59.00%), Query Frame = 0
Query: 5 SSSSKRDEAKG--EIDSEIAARKRLKKLAFSNHILSETQAKPQAYLSPSATVLKHHGKDI 64
+SSSK+ +KG + D+E RKRLK LA N +LS++ AK + L PS VLKHHG DI
Sbjct: 4 ASSSKKGGSKGGDKDDAESKQRKRLKTLALDNQLLSDSPAKSHSSLKPSKQVLKHHGTDI 63
Query: 65 VKKSQRKNRFLFSFSGLLAPVSGGKIGELKDLGTKNPILYLDFPQGRMKLFGTIMYPKNK 124
++KSQRKNRFLFSF GLLAP+S IG+L L TKNP+LYL+FPQGRMKLFGTI+YPKN+
Sbjct: 64 IRKSQRKNRFLFSFPGLLAPISAATIGDLDRLSTKNPVLYLNFPQGRMKLFGTILYPKNR 123
Query: 125 YLTLQFSRGGKNVMCEDYFDNMIVFSDAWWIGTKDENPEEARLDFPKELT---------- 184
YLTLQFSRGGKNV+C+DYFDNMIVFS++WWIGTK+ENPEEARLDFPKEL
Sbjct: 124 YLTLQFSRGGKNVLCDDYFDNMIVFSESWWIGTKEENPEEARLDFPKELAQVDTFHLFLH 183
Query: 185 ---------------------TGKCGEYDFNGGAGVASTSGAAGVTSSSKQSVKNKGI-- 244
+ E+DF GGAG GAA V + + ++
Sbjct: 184 FLFKTMVATEMFNMIRRILWFQAENTEFDFQGGAG-----GAASVKKLASPEIGSQPTET 243
Query: 245 -NPAVEN-SFKGEDGDDLLDLEE-----NVTNSIKTTPVRHSERSAGKVFNFAEASSEDE 304
+P V+N EDG+ L D + +T ++ TPVR S+R++GK FNFAE SSE
Sbjct: 244 DSPEVDNEDVLSEDGEFLDDKIQVTPPVQLTPPVQVTPVRQSQRNSGKKFNFAETSSEAS 303
Query: 305 SAGTYDDLSEGEEKNILIHEPSIGDHAKDLSVESIDEDAVEIRPPVLEGNQTSISKGKKS 363
S + + S+ +EK +L E S + I A ++ P L + + K K S
Sbjct: 304 SGESEGNTSDEDEKPLLEPESSTRSREESQDGNGITASASKL-PEELPAKREKL-KSKDS 363
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_004137530.1 | 1.9e-199 | 87.07 | DNA-binding protein RHL1 isoform X2 [Cucumis sativus] >KGN64211.1 hypothetical p... | [more] |
XP_038893754.1 | 2.8e-198 | 86.37 | DNA-binding protein RHL1 isoform X1 [Benincasa hispida] | [more] |
XP_031744387.1 | 1.4e-197 | 86.07 | DNA-binding protein RHL1 isoform X1 [Cucumis sativus] | [more] |
XP_008467323.1 | 1.8e-194 | 85.22 | PREDICTED: DNA-binding protein RHL1 isoform X2 [Cucumis melo] | [more] |
XP_023519239.1 | 5.2e-189 | 83.18 | DNA-binding protein RHL1-like [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
O81242 | 5.8e-82 | 50.14 | DNA-binding protein RHL1 OS=Arabidopsis thaliana OX=3702 GN=RHL1 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LVZ6 | 9.2e-200 | 87.07 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043140 PE=4 SV=1 | [more] |
A0A1S3CTA5 | 8.9e-195 | 85.22 | DNA-binding protein RHL1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103504698 PE=4... | [more] |
A0A6J1E8G5 | 5.6e-189 | 83.18 | DNA-binding protein RHL1-like OS=Cucurbita moschata OX=3662 GN=LOC111431599 PE=4... | [more] |
A0A6J1KHM2 | 8.4e-185 | 82.03 | DNA-binding protein RHL1-like OS=Cucurbita maxima OX=3661 GN=LOC111495823 PE=4 S... | [more] |
A0A6J1C822 | 1.4e-184 | 81.61 | DNA-binding protein RHL1 OS=Momordica charantia OX=3673 GN=LOC111008835 PE=4 SV=... | [more] |
Match Name | E-value | Identity | Description | |
AT1G48380.1 | 4.1e-83 | 50.14 | root hair initiation protein root hairless 1 (RHL1) | [more] |
AT1G48380.2 | 3.0e-78 | 46.25 | root hair initiation protein root hairless 1 (RHL1) | [more] |