Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CCGTAAATGTATAGTCCTTGATAAATCAAACTGGAAGCCGACTAGGGTTCGAACCTCCCGAAGCTAACAAAATTTGGAATTTACGAAGCATTGCGAAGCAAGGAGAGTGGCAATGGAAGAGCTTCCACATCAAGCAGAAGCATCCATGGGCACTAGCTGCAAGAAGGGGAAGAAGAAATCAGTGAGTCTGGAGGAACCGCAGAAAAGAGCTAAGAAGAAAGGCGGGGCTACTTCAGTCAACGAAGTGCAGCCTACAGGTCGTTTAGATGACTCCCGGGTGAAGGTTTCAGAGTTTGATCATTGTGTTGAAAATCATTTTAGAGCCATTGATGCGATTGCCGAGCTCTATGGCGAGGCAGAAAATGGCGAAGGCGGAGTTGATGAAAGTGATTTTCAGCGGTTTTCATCCTCCACAACTTTCTTAAGGTACACATGGTTGCATGTTTTCCGGTAGATACAACAAATACATGTTTCCTGTAGCTTAAGGATCATCGCATTGCTTACACCGATTAGTTCTGGAATGCTTGTTAAATGCTGAAGATCTTGAAGTTCGTGCAAGGCATGCAATGTATTGAATATTGTGCAGCTTCTTACGATTTTTGTTCTTGCACAATTTCTCGGAAGACGATGACATGAATTTCTTCCTGCATTATCTGTCACTGCTGGGAATCTCCAACTTAATTTTTTTCGAATTATACTTCACTAGGGAATGGAAGTTCTACAATTATGAGCCGAAAACCGTCAAGTTTACAAGCGATTCGAGAGTTCCTGAGGGTAAGGATGCTGATATCACGATGGAATTGCCACAGTTTTCTTCTGCGGCTGTTCTAAAGGTGCTGGGTTGGCTGCCATGTTAAACTTGTTGGCGTTTTCTTTCTAATTTTATCCCATCAACCAAACCCACACTCGCGCGCGCGCGCGCGCACACACACTGTTTTCTTCTGTTTAGGTGACGAGCTGAAACTGTTATGCGGCAGTAAGAATTAACACAAGTCTTAATTTAACACGCAGAATGGAGCACCGCCCGGAGCCACTGCATCTCTGGACTTTAGGTAAGGAACAAGCAAACTCTGTATGAGTGTCAAGGTCGTCCAGTTTAACACTTGGGCAAAGGATGCATAATGAGCTGCCTCTTAAGATATTTTAGGAATTTGTATGTCGCATCTAATGCGAAAGCCGGCATACATGGAAATTTATTATTTGGCACTTGCCAGTTATTATCTTACCCTCTAGAAACCCATGATGTGATTAAAATACGTAATGTTGCTTATGGCACCACTTTGATTTTGGATTGCTTCTGCTGTTGGTTATTGTAGTTTTCGTTGGTGGGCTAGTGTTGTCAGTTGGGCATGAATTGAAGTTACTTTTTTGATGTCTGGTCACTGGTGTGTAATTCTATCTTCCGTAGTTGATATGTTCATTCAATCATTCATTTTTATTGATTTTTTTCTCCAGAAACTTTATCATGCATGTTGGTGGGCCTGTTTGGGCCATAGATTGGTGTCCTCTAGTTCATGAAAGGACCGACTCCCTTATCAAATGTGAGGTATCCCTCCTCTTCTATTGGTCTTTTTCAAAAGCAAAACTGTTATGGACGCATTAAAATTTAAGTGTAAAATTTTGTTTTATTCGTCTTCACCTATGACTTTCATTTTCATTCACTTAGTTAGGCACTGCTTTCTTTTTGTTCACACTTTCTCTTTATTTATAAGTTCTCATTCATATTTATAACAATATATATCTCAACTCGTATGGTAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACACGATGGGCATCCCGCTCAGTGGAAGAGGAATGGTGCAGATATGGTGCTTAGTGCATGGCACCGAAAGCCATGAATCAGAAACGACCAGTGCAACAGAGTGCAAGGATTCAGACTTATCTCAACCAAAGAGGCCTAGAGGAAGACCCCCAGGGCGCAAGAAAAATGGGGCGTCGGCCTTACCATCTCAACCAAAGAGACCTAGAGGAAGGCCTAAAAAGAAACAAGAAGAACCTAATGATGATAACAAGGTTGCCAGTTACCAACTTGTTCAGCCCCTTTCTGTTGAATACCCAGATGTTTCATCCAACTTGCTTGAGATTGATGACGTCTCCCACAATTCTGAAAAACCTGTATCACTGGAAAACAGTGTTGAAAGAGGGAGCAGTACCATTGAAGAAATTTCTACGTGCAATTCTGAAGATGAAGTTCCTGTGCAGAAGAGAAGAGTGAGAAGAAATGCTGATACTAAGAATCATGTTGATGATGTGGGAACGTTATCACTTATAGAGAATCGAGAAGATGGATCTAATGCTACAAATCATGAGGCAAATGAGAATGTTACAAGTGAGTATTCTGGAGAAGACACTCGATTATGTAAGAACATTTCAGAGAAAGCTATTTTAGACACTGGCTCAACTGGATTTTCTATTCCGGAGACTGTTGCTTTGCCTAGACTAGTATTGTGCTTAGCTCACAATGGAAAGGTAGCGTGGGATTTGAAATGGAAGCCAACTAATGCGCGTACTACCAAGTGCAAGCAAAGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTGTAATTAAACTGCTTTAACTGTCTTCTGACATCTATGCTCAGTTACTATATGTTTAATTTCTTTTTGTCCATTTTTATCCAAAACTTTTTTGTTTATTTAATTTATTGTGTTTCCTGCGTATATTTGGTTATTCTCTTGGTCTTACTGAGAGCTTTTCATGCTTGCTTAAGATTTAAAATAAGTTTATAGATATGGAAGTATGTTGGATAATCTCCTCGACGTTTCCTTTTGAGATAAATTATAATTTGTGTTGAATATGATACAGATTTTTACTGCTAAACCAAGAATCAGTTTCCATGGAATTGCTGACATAGTGGATCTGATTTTTTCAACATGACTAGGTTGCTAGGCTGAATGACGTTTTATGTGAGCATGCCATAACAGGAAAACTACTTTTGAAGTGTGTAGTTTTGGGATAAATTGTATAAATACAATTACACTACATACAATCTTCAAATTAAGCTCTACAAAGTTTCTGGTGAATGAAAAGTACGTGCCTTTAGTTTTTTCATTTTCTATAGATATGATTTCTTTGTAGAGTTTGCTTTGGATTTTTTTTTTTTTTTTTAACTTCAAAAAATTAGTTCATAATTCTAGTTTTCATGTTTAATTCGAGCAGCACAATTGGTTTAACTGACATGATGAACACACCATACTAGGTTCTACAACAAAATTATATCCCTCGAAGTGTATGTCTTATTCTAGTTTAATTGCTGTTCTTGCATCTTTGGTTGTTCTAAGGTTTTGACTTTTAATGTTTACTACGTGCCTCCCCAATGATTAACAATATTAGTTCTATATTAGGAACGACACTATCTGACTATCCAATGAACTAATTGCAGTTGGGAGGTTCCTTTTCCCCATGTAGTGAAGGCCATCTATTCTAAACTCAATGGGGAGGGTACAGATCCCCGCTTTGTGAAGTTGAAGCCTACTTTCAGATGCTCGATGTTGAGAAGTGCAGATACACAGAGGTATTCGATTTTTATTTTTAACATAATGACATGACAACTCTTCAAGTTTAACATGATACAGTCTTTGCGAAAAGAAAAACCAATGTGTTTAGTATTGCTATAAATTGAAGAGTAGAATAGGCTGCCTTAGAGATGGTCATTATTCTCTGCATAATTGCACATTACACCATCATCATATCTGGAAAACTTTTGAATGAAGTTCTTGGCATTGGCTATTGTTAGAAGGAATGGTATTATGAAGTACGTCTAAGACTATTCAGTTCATTCTTGTTGAATCCAGGAATTAATCTTGGTTCCGCTTAATTTTGATATTTGGAGATTCTTACACTTTTTCCTTGCTGATCAACCACTCCTACTCGATACTTGATCTCGCAAAGGAAGAATTATATTATTCTGATTGGATTAAATTTAAGTGATGAAGCAAATATTTTCTTCATTTTCTTTTCAAGTATATTCTGGGTTATTACGAGTGTTATTCTTTTGGTAAAATTGCATCTTGTGTTAGCTATTTCTCACGAGTTTTACTTTTTCCCCAACTTTCTTTGTTACTTCTAATTCATAGTCTATTGTTGGTGCTGCAGCATCCCTCTGACAGTGGAATGGTCGCCAACACCTCCTTATGATTATCTACTCGCTGGATGTCATGATGGAACGGTTATTATTTATTTTCCTTTCCTTTTATCGCTGTTCATAGCAAAAACAAAAAACGTAACGTTCTTTATGTGGTTCTCCTTGCATTTTGGAAGAGTTAGTTCTCCTTGCATTTTTGAATTGTTTTCTAACTTTTGTTTGTTATTGATACAACTTTTTACTGACTCAATTCCTGATCCAGTTTTGGGTGGTCCTGTCCCTTTTTGGTTTGAATTTTAATTTTATGATGACTAAAACATAATTCATCCATTCAGGTCGCCTTGTGGAAGTTTTCTGCAAGTAGTACCGCTGAAGGTTGTTTTACTTCTCTAAATTTGTGTTTCTAATTCCTTCTCTTGATTGTATTTTAACAATTACAAGCAATAATGACATCATTCTCATATTTTTTTGTTCAGATACGAGGCCTTTACTTCGTTTTAGTGCCGATACAGTTCCAATAAGAGCAGTTGCATGGGCACCAAGTGAAAGGTTTGTAGTGGTAGAGTTCAAGTTGAACTTCTTTATCCTTTTATACTACTGCCGCCACACCCCCAAAGAAAAGAGGATAAAGATTGAAGAAAAAAAGGGAAGTGTCATTTCAGTTGAAATTTTGTTACTGTTAATTCACAGTAATCGGTAAATTTTACCTTCTTAAAAAATGGTCACAAGCCATGTCTCCCACCCTAATGCTGCTTATTTACATGATGGGGAAAACATCTTTACCTTTGATGACAAAAGTAAGAACTGGACTATAACTACAAAATGGGTCAGGCAATCATCAAGTACAAAGTGCGCCCCGACACTCGTGGATATAAAAATAAAATCTAAGATGATGAGTCTAAATTTTCTTTTACTAACATTTTTATACCTTCTGATTAAAATTATAAATATTCATACATTTTCTTATTATGTGCAGTGAGCCCGAAAGTGAAAATGTGATACTAATTGCTAGTCATGGAGGCATAAAATTTTGGGACCTAAGGTTGGTGGAGTTATTTCTTTCATTTTATTTTTCATTATTTTTTTTGTTTGTTTTTGGATTAAAGGAAGTTAGTTAATTTTTTTGGGACAAAGAACTATATATATATAACCAAACTTTTTATTAGACATAATGAAAAAGAAGTCCACTGGACAAGGTGAATGCTAACTATACAAGAAAGGCATCTTATCCAAAAAAAAATAACGCCTGGTGGAGAAAAGCTCCTATCCAAGGAATTTCCAAAATCATTTCCAATCGACAATAAGATATAAGACACAAATGTGAAAGTCTAGATAACTTTTACACCATGATAAGGCAAGAAACGATAATGAATAGAAAAATCTATCGAAGCATCTTTTTGTTATTCTTTTTCTTCCATGTGGTCTGTAGGAAGGCTCTACTGAAGTTGTTTCTTAGGATTGAAAGAATTTTATTAATTTAATTGGAGAAACTTTTTTAATAGTATTAGGAGCCCATGCGATGCACTGTCTTCCACTTCCCAATTCAAGCATCCTTTAAAAGGCTTAGTTTAATAATAAACACACTGAAGTTTGAGGCAATATCAAGATGTTCCAATCGAAAATGACAAAACCAACCCTGTTCCATTCTCTTAGACATACCCGAGGGAAGATTTCCTTGAAACCGTTCTTCTCCAACAATGGAGTTACTTCTTGACCATTGCAATACAGAGAAACTTCCTTACTGGTGGACTGTAGGAAGGCTCAAAATGAAGTTGTTTCGTAGGATTGAAGGAATTTTATTAATTCAATTGAATCAACTTTTTAAATACTATTTCCTGCCCATGCGATGCACTCTGTCTTCCATGTCCCAATTGAAGCTTCATTGGTTTAATAACCAACAAATTCAAATATCTTGAGGTAATACCAAGATGTACCAATAAAAAATGACAAAACCAACACCCTGTTCCATTCTCTTTAGAAATACCTGGTTCCTTGAAACCACTCTTCTCCAACAATGGAGTAACTTCTTTACCATTGCACTTCTGTTTTTGGGGAGTCTGTACAGCGGAGAGCTTATAAGCAAGATGCCTAGATATGACACTCCGACAAAGCTGCTGTGTGCTTTGTGAACGATCACTTGTTTCAATTCTCTCTTTGTTATTTCATTTACTCTGTGCTCATTGCATATGATTTAATATTTTTTGGTTTTCAGTTCATTTTATTGTTTCATTTCAATTCGGTGACAATTTGATATTAATCATTACAATATTATATCAGAGATCCTTTCCGTCCCTTGTGGGACCTTCATCCGGCACCAAGGATAATATATAGTTTGGATTGGCTTCCTAATCCTAGGTATATATTTTATCTTGATAGAACGCAGATTAGGCACATTAAGTTACTGTTTTACTAATTCTCTTTTATTTATGACCCTATTCTTGTTAGATGCGTTTTTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGATGTTCCAGTAACTGGTCAACCCTTCACAGCAATAAAACAAAAAGGTTTACACACTTACTGTTGTTCACCATTTGCTATCTGGAGTATTCAAGTGTCAAGGCAGACAGGTATATCTGTAATTCTTGGTTTGTTAGCTTTGGTTTATTTCCCTTTTTAATCTACTTTTTAGGCTTCCTTCAAGTCATTAAAAACACATTCTCATTGATGTGGTTATATGGCTTGTCTTGATTAATTTTCAGCCATATAGAAATAAGCAACAAATAGTACCAACAGGGGATTTGTGGATCGTAGTTCTTTCTTTTTCTGTATCAAATAATTTTTTTTGGTAACAAAATGTAGAAGCCTTTTCTCGGTTACTGTCGGCTTTAACATCCAATTTGGTTTTACTTTTAAATTTCAACACAATCAAATGAACTATTTGATTCTTATCATGTACATTTAAGATATAGCAACTGGTTTTTTCAGGCATGGTTGCTTACTGCGGTGCTGACGGAGCTGTTGTCCGTTTCCAGGTAAGTTCAACTTCCCAATAGATTTCACTATGGAGATGAAAGTTTGCCTCTTCGGCACGTGCCACTCATTTGGTTTCAGTTTTTTATAATCTTAGCTTCTTGTAGTTCTGTTATTTTCTATAATCAACTTGCATTATTGCTGAGTATTTATGTATTTGTGAATATATAATAAACTACTTTTCCATTGATGAGAGCAAGGGATTTTGTTAGGATCACACAACAACTCACACACTCGATTTAGATGAACACAAATAATAAGAGAGACAAAATGTAAAGATAAATATTGGCACTTGGTTGTATCCTCCAACTGTAGGGCCTCATCAAAAGACTCTGTTCCCCTTCATTAGTCAGCAATAAATACTATAATGAAGGTGAATATTTATGTGGTGCTCTGATGGTTCTGGATGATCTCCTCAACACTTGGTCATGTGTCACTTGCTCCACCTATGGTTCATCAACAACCATCTCAGGAATTTATTGAGTTTCTGTAGCGACTTCACTAGGTGAATTTTTTTGCAACTCAATTTCAACTCCCATTTGCTTCGTAGTCTCATCACCTTTCTTCTTTTTGTTCTTGTACATGACATTTTCATCAAATGTCACGTCACAATGTCTTAAAATTTTTCTGTGTCATCCCAAAACCTATACCTGAACATGTCAAAACCATAGCCTATGAAGTAGCACTTCACAACTTTAACATCACTCCCTGAACGTATGCCGTGCAACCAAAAGTTCTCAAGTGAGAGTACTTGAATCCTTTTCCTGTCCATACCTTTTCTACCAACTTGAACTATAAGGGCACTGACGGTCCACTATTGATCAAGTAGGTTGTTGTATTCACAACATAAGCCAAGAATGTCTTTGACATTCCAGAATGCATCGTCATACTCTTTGCCCCCTCATTTAATGTTCTGTTCATTCTTTCAGTGACACCATTTGTCTTGCCTTACCAGGAACTGTCTTGATTAATATGATTCCTTGAGTTGTACAGAACACTTTAAATTCCGACTTGTTGTACTCTCTTTCATTGTCAGACCTAAACACGTAATCTTCAAACCCATCTGACTTTCAACTTCAACCTTCCATCTGTTGAAGGTGGCAAACACATCTGATTTGTGTTTCAGAAAATAAAACCATACATTCCTGTTGAAATCATCAAATGAAAGCAACGTAAAACTTAGATCCACCAATTGATGGAGTTGGAGCTGGAGCTGGAGATGGTCCCTAAACATATGTATGCACTGTTTCTAACCTTTCTTTCTTTGGTTCTCTGACAACCTTTGTGAAACTAACTCATTTTTGTTTGTCCATAATGCAACTTTCACAAAGATCTATATCAACAGTTTTTTCTAAAACTCCTATCGCAACCAGCATCTTCATTCCTTTGATCACGTGCTACCACAATTATACCCTTCACAATCTTTCACGAACTTCTTCCAATCTTTGTTGCAATACCTATGCTGTCTAGCTGACCAATAGATATCAGATTCTTCTTGAGACTAGAAATATTTCTGACATCCTTCGATATCTAATGGACATCCTTCGATATTTGATGATTTCTTGCTGGAGTTTGTATGCAGATATCTCCCTTTCCTTTAACCTCAAAGAATTTATTGTTGGCAAGATACACCTTTCCGAAATTTCAAGATTTGAAATTTTGAAACAGTTCCTTGTTCGGAGACAAATGAAAAATGCACTTGAATAAAAAATCTAAGATTCGGTTGGACTATCCACACTAAGGATTAGAGTATCCTCAGTATCTTATGTTGAATATGCAGAATCATGAGCATCTCCAAATTTGTGATTCAACTTCTTCGGTTTTGTACAATCTATTCGAAAATGCCTTGCAATATGTTGAATATGATTTTTTTTTTAAATTTTATTACAATTCTAACACGCTATGTTTGGTCTGTTGGAAGATTTTCCTCGATTCTGTGATTTTGATTGGCCATGTATATTTGGGCCTTTCGAGTTACTTCTTACCCTTCAATCAACGTTGAAAGTATTGCCCAATAGATCCTCAATTTCTTGTTTGCAAATACTTTTGTTGAAACTGAAATCGGATCCATCAAACTTCTCGTTCCCAATCTTTGAGCTTTCCATCTTCAGATCATGGTGAATATCTCCTAGCTCGCTTTGATATCAGTTATTAGGATAGCAGCAACTCACATAAATGAACACAAATAACAAGGGAGAGAGAATGCAAGAGAAATATTGGCTAGAGGTTTATAGAAGTACATCTTCAAGGCACTATCGATGGGGTGTTTATTTTATATATATATATAGTCATTTATTTGCATAGCGATTACCTTGTCACTTACTATATATAGCCATTTATTTGCGCAGCGATTACCTTGTCACTTACTATATATAGCCATTCATAAGTTATAAATAAGCATTAGACCAGCAACAATGATAAAGAGTTACTTGTCCATACAAATAGCTGAGGAGTTCGTATATGCTTACTTTCTCTGTATGCGCCACAGATTGAATTCGAAGATAATTGATACATACATTTAACACATTTTCATGTTTAATTTGTCAGCTTACTACGAAAGCAGTGGACAAAGAAAATTCACGCAACCGCACCCCACATTTTGTATGCGAATACTTAACCGAGGAGCAATCAATTATTACAATCCACTCTCCAGCATCAGATGTTCCAATCCCTTTGAAAAAGCTATCCAACAAATCTGAACAGCCATTGTCCATGCGAGCTATTCTATCTGATTCCATGCAGCCAAATGAAGGAAATGATAAAAGTGCCACAACTTCAGCATTGGAAAATGAATCAGCCCTTTGCTATGATGACGATGTCGACGTTGAATCTGGGTCTGAGGATACGCCGATGTCTATTCAGAATAAAAACCAAACTCAATCAAAGAGCAAGAAGAAGGGAGTGGTCAACCAAGAATTGGAACATAGCCATGAGCCTAGTGATTCACAGACAGATGATGACGTAGTGCCTGGTTTGGGGGAGCACTTCGAAAATTTCCCCCCCAAATCAGTTGCATTGCATAGACTGAGATGGAACATGAACATTGGGAGTGAAAGATGGCTGTCCTATGGGGGAGCAGCTGGAATTCTACGCTGTCAGGAGATTGTGCTGTCTGCCCTCGATAAGAAGTTGATGGCGAAGAAATGA
mRNA sequence
CCGTAAATGTATAGTCCTTGATAAATCAAACTGGAAGCCGACTAGGGTTCGAACCTCCCGAAGCTAACAAAATTTGGAATTTACGAAGCATTGCGAAGCAAGGAGAGTGGCAATGGAAGAGCTTCCACATCAAGCAGAAGCATCCATGGGCACTAGCTGCAAGAAGGGGAAGAAGAAATCAGTGAGTCTGGAGGAACCGCAGAAAAGAGCTAAGAAGAAAGGCGGGGCTACTTCAGTCAACGAAGTGCAGCCTACAGGTCGTTTAGATGACTCCCGGGTGAAGGTTTCAGAGTTTGATCATTGTGTTGAAAATCATTTTAGAGCCATTGATGCGATTGCCGAGCTCTATGGCGAGGCAGAAAATGGCGAAGGCGGAGTTGATGAAAGTGATTTTCAGCGGTTTTCATCCTCCACAACTTTCTTAAGGGAATGGAAGTTCTACAATTATGAGCCGAAAACCGTCAAGTTTACAAGCGATTCGAGAGTTCCTGAGGGTAAGGATGCTGATATCACGATGGAATTGCCACAGTTTTCTTCTGCGGCTGTTCTAAAGAATGGAGCACCGCCCGGAGCCACTGCATCTCTGGACTTTAGAAACTTTATCATGCATGTTGGTGGGCCTGTTTGGGCCATAGATTGGTGTCCTCTAGTTCATGAAAGGACCGACTCCCTTATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACACGATGGGCATCCCGCTCAGTGGAAGAGGAATGGTGCAGATATGGTGCTTAGTGCATGGCACCGAAAGCCATGAATCAGAAACGACCAGTGCAACAGAGTGCAAGGATTCAGACTTATCTCAACCAAAGAGGCCTAGAGGAAGACCCCCAGGGCGCAAGAAAAATGGGGCGTCGGCCTTACCATCTCAACCAAAGAGACCTAGAGGAAGGCCTAAAAAGAAACAAGAAGAACCTAATGATGATAACAAGGTTGCCAGTTACCAACTTGTTCAGCCCCTTTCTGTTGAATACCCAGATGTTTCATCCAACTTGCTTGAGATTGATGACGTCTCCCACAATTCTGAAAAACCTGTATCACTGGAAAACAGTGTTGAAAGAGGGAGCAGTACCATTGAAGAAATTTCTACGTGCAATTCTGAAGATGAAGTTCCTGTGCAGAAGAGAAGAGTGAGAAGAAATGCTGATACTAAGAATCATGTTGATGATGTGGGAACGTTATCACTTATAGAGAATCGAGAAGATGGATCTAATGCTACAAATCATGAGGCAAATGAGAATGTTACAAGTGAGTATTCTGGAGAAGACACTCGATTATGTAAGAACATTTCAGAGAAAGCTATTTTAGACACTGGCTCAACTGGATTTTCTATTCCGGAGACTGTTGCTTTGCCTAGACTAGTATTGTGCTTAGCTCACAATGGAAAGGTAGCGTGGGATTTGAAATGGAAGCCAACTAATGCGCGTACTACCAAGTGCAAGCAAAGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTTTGGGAGGTTCCTTTTCCCCATGTAGTGAAGGCCATCTATTCTAAACTCAATGGGGAGGGTACAGATCCCCGCTTTGTGAAGTTGAAGCCTACTTTCAGATGCTCGATGTTGAGAAGTGCAGATACACAGAGCATCCCTCTGACAGTGGAATGGTCGCCAACACCTCCTTATGATTATCTACTCGCTGGATGTCATGATGGAACGGTCGCCTTGTGGAAGTTTTCTGCAAGTAGTACCGCTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCCGATACAGTTCCAATAAGAGCAGTTGCATGGGCACCAAGTGAAAGTGAGCCCGAAAGTGAAAATGTGATACTAATTGCTAGTCATGGAGGCATAAAATTTTGGGACCTAAGAGATCCTTTCCGTCCCTTGTGGGACCTTCATCCGGCACCAAGGATAATATATAGTTTGGATTGGCTTCCTAATCCTAGATGCGTTTTTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGATGTTCCAGTAACTGGTCAACCCTTCACAGCAATAAAACAAAAAGGTTTACACACTTACTGTTGTTCACCATTTGCTATCTGGAGTATTCAAGTGTCAAGGCAGACAGGCATGGTTGCTTACTGCGGTGCTGACGGAGCTGTTGTCCGTTTCCAGCTTACTACGAAAGCAGTGGACAAAGAAAATTCACGCAACCGCACCCCACATTTTGTATGCGAATACTTAACCGAGGAGCAATCAATTATTACAATCCACTCTCCAGCATCAGATGTTCCAATCCCTTTGAAAAAGCTATCCAACAAATCTGAACAGCCATTGTCCATGCGAGCTATTCTATCTGATTCCATGCAGCCAAATGAAGGAAATGATAAAAGTGCCACAACTTCAGCATTGGAAAATGAATCAGCCCTTTGCTATGATGACGATGTCGACGTTGAATCTGGGTCTGAGGATACGCCGATGTCTATTCAGAATAAAAACCAAACTCAATCAAAGAGCAAGAAGAAGGGAGTGGTCAACCAAGAATTGGAACATAGCCATGAGCCTAGTGATTCACAGACAGATGATGACGTAGTGCCTGGTTTGGGGGAGCACTTCGAAAATTTCCCCCCCAAATCAGTTGCATTGCATAGACTGAGATGGAACATGAACATTGGGAGTGAAAGATGGCTGTCCTATGGGGGAGCAGCTGGAATTCTACGCTGTCAGGAGATTGTGCTGTCTGCCCTCGATAAGAAGTTGATGGCGAAGAAATGA
Coding sequence (CDS)
ATGGAAGAGCTTCCACATCAAGCAGAAGCATCCATGGGCACTAGCTGCAAGAAGGGGAAGAAGAAATCAGTGAGTCTGGAGGAACCGCAGAAAAGAGCTAAGAAGAAAGGCGGGGCTACTTCAGTCAACGAAGTGCAGCCTACAGGTCGTTTAGATGACTCCCGGGTGAAGGTTTCAGAGTTTGATCATTGTGTTGAAAATCATTTTAGAGCCATTGATGCGATTGCCGAGCTCTATGGCGAGGCAGAAAATGGCGAAGGCGGAGTTGATGAAAGTGATTTTCAGCGGTTTTCATCCTCCACAACTTTCTTAAGGGAATGGAAGTTCTACAATTATGAGCCGAAAACCGTCAAGTTTACAAGCGATTCGAGAGTTCCTGAGGGTAAGGATGCTGATATCACGATGGAATTGCCACAGTTTTCTTCTGCGGCTGTTCTAAAGAATGGAGCACCGCCCGGAGCCACTGCATCTCTGGACTTTAGAAACTTTATCATGCATGTTGGTGGGCCTGTTTGGGCCATAGATTGGTGTCCTCTAGTTCATGAAAGGACCGACTCCCTTATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACACGATGGGCATCCCGCTCAGTGGAAGAGGAATGGTGCAGATATGGTGCTTAGTGCATGGCACCGAAAGCCATGAATCAGAAACGACCAGTGCAACAGAGTGCAAGGATTCAGACTTATCTCAACCAAAGAGGCCTAGAGGAAGACCCCCAGGGCGCAAGAAAAATGGGGCGTCGGCCTTACCATCTCAACCAAAGAGACCTAGAGGAAGGCCTAAAAAGAAACAAGAAGAACCTAATGATGATAACAAGGTTGCCAGTTACCAACTTGTTCAGCCCCTTTCTGTTGAATACCCAGATGTTTCATCCAACTTGCTTGAGATTGATGACGTCTCCCACAATTCTGAAAAACCTGTATCACTGGAAAACAGTGTTGAAAGAGGGAGCAGTACCATTGAAGAAATTTCTACGTGCAATTCTGAAGATGAAGTTCCTGTGCAGAAGAGAAGAGTGAGAAGAAATGCTGATACTAAGAATCATGTTGATGATGTGGGAACGTTATCACTTATAGAGAATCGAGAAGATGGATCTAATGCTACAAATCATGAGGCAAATGAGAATGTTACAAGTGAGTATTCTGGAGAAGACACTCGATTATGTAAGAACATTTCAGAGAAAGCTATTTTAGACACTGGCTCAACTGGATTTTCTATTCCGGAGACTGTTGCTTTGCCTAGACTAGTATTGTGCTTAGCTCACAATGGAAAGGTAGCGTGGGATTTGAAATGGAAGCCAACTAATGCGCGTACTACCAAGTGCAAGCAAAGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTTTGGGAGGTTCCTTTTCCCCATGTAGTGAAGGCCATCTATTCTAAACTCAATGGGGAGGGTACAGATCCCCGCTTTGTGAAGTTGAAGCCTACTTTCAGATGCTCGATGTTGAGAAGTGCAGATACACAGAGCATCCCTCTGACAGTGGAATGGTCGCCAACACCTCCTTATGATTATCTACTCGCTGGATGTCATGATGGAACGGTCGCCTTGTGGAAGTTTTCTGCAAGTAGTACCGCTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCCGATACAGTTCCAATAAGAGCAGTTGCATGGGCACCAAGTGAAAGTGAGCCCGAAAGTGAAAATGTGATACTAATTGCTAGTCATGGAGGCATAAAATTTTGGGACCTAAGAGATCCTTTCCGTCCCTTGTGGGACCTTCATCCGGCACCAAGGATAATATATAGTTTGGATTGGCTTCCTAATCCTAGATGCGTTTTTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGATGTTCCAGTAACTGGTCAACCCTTCACAGCAATAAAACAAAAAGGTTTACACACTTACTGTTGTTCACCATTTGCTATCTGGAGTATTCAAGTGTCAAGGCAGACAGGCATGGTTGCTTACTGCGGTGCTGACGGAGCTGTTGTCCGTTTCCAGCTTACTACGAAAGCAGTGGACAAAGAAAATTCACGCAACCGCACCCCACATTTTGTATGCGAATACTTAACCGAGGAGCAATCAATTATTACAATCCACTCTCCAGCATCAGATGTTCCAATCCCTTTGAAAAAGCTATCCAACAAATCTGAACAGCCATTGTCCATGCGAGCTATTCTATCTGATTCCATGCAGCCAAATGAAGGAAATGATAAAAGTGCCACAACTTCAGCATTGGAAAATGAATCAGCCCTTTGCTATGATGACGATGTCGACGTTGAATCTGGGTCTGAGGATACGCCGATGTCTATTCAGAATAAAAACCAAACTCAATCAAAGAGCAAGAAGAAGGGAGTGGTCAACCAAGAATTGGAACATAGCCATGAGCCTAGTGATTCACAGACAGATGATGACGTAGTGCCTGGTTTGGGGGAGCACTTCGAAAATTTCCCCCCCAAATCAGTTGCATTGCATAGACTGAGATGGAACATGAACATTGGGAGTGAAAGATGGCTGTCCTATGGGGGAGCAGCTGGAATTCTACGCTGTCAGGAGATTGTGCTGTCTGCCCTCGATAAGAAGTTGATGGCGAAGAAATGA
Protein sequence
MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFTSDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECKDSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSVEYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNADTKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGFSIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCHDGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSALENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDDVVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAKK
Homology
BLAST of CmoCh14G002880 vs. ExPASy Swiss-Prot
Match:
Q8WUA4 (General transcription factor 3C polypeptide 2 OS=Homo sapiens OX=9606 GN=GTF3C2 PE=1 SV=2)
HSP 1 Score: 74.7 bits (182), Expect = 5.9e-12
Identity = 48/187 (25.67%), Postives = 83/187 (44.39%), Query Frame = 0
Query: 443 WDLKWKPTNA-------RTTKCKQRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKLNGEG 502
WDLK+ P+ A R R+G LA+ +G + ++ +P P +A+ ++ +
Sbjct: 471 WDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKVLLFSLPHP---EALLAQQPPDA 530
Query: 503 TDPRFVKLK--PTFRCSMLRSADTQSIP--LTVEWSPTPPYDYLLAGCHDGTVALWKFSA 562
P K++ T + +++ D L++ W PT P+ +L AG ++G V W
Sbjct: 531 VKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTRPHQHLAAGYYNGMVVFWNLPT 590
Query: 563 SSTAEDTR---------PLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFWDL 610
+S + R P F A +R + W + S ++ S IKFWDL
Sbjct: 591 NSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWC----KANSHFLVSAGSDRKIKFWDL 650
BLAST of CmoCh14G002880 vs. ExPASy Swiss-Prot
Match:
Q8BL74 (General transcription factor 3C polypeptide 2 OS=Mus musculus OX=10090 GN=Gtf3c2 PE=2 SV=2)
HSP 1 Score: 74.7 bits (182), Expect = 5.9e-12
Identity = 48/187 (25.67%), Postives = 84/187 (44.92%), Query Frame = 0
Query: 443 WDLKWKPTNA-------RTTKCKQRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKLNGEG 502
WDLK+ P+ A R R+G LA+ +G + ++ +P P +A+ ++ +
Sbjct: 467 WDLKFCPSGAWEHPETLRKAPLLPRLGLLALACSDGKVLLFSLPHP---EALLAQQPPDA 526
Query: 503 TDPRFVKLK--PTFRCSMLRSADTQSIP--LTVEWSPTPPYDYLLAGCHDGTVALWKFSA 562
P K++ T + ++++D L++ W PT P+ +L AG ++G V W
Sbjct: 527 MKPAIYKVQCLATLQVGSVQASDPSECGQCLSLAWMPTRPHHHLAAGYYNGMVVFWNLPT 586
Query: 563 SSTAEDTR---------PLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFWDL 610
+S + R P F A +R + W + S ++ S IKFWDL
Sbjct: 587 NSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTIQWC----KANSHFLVSAGSDRKIKFWDL 646
BLAST of CmoCh14G002880 vs. ExPASy Swiss-Prot
Match:
Q5RDC3 (General transcription factor 3C polypeptide 2 OS=Pongo abelii OX=9601 GN=GTF3C2 PE=2 SV=1)
HSP 1 Score: 74.7 bits (182), Expect = 5.9e-12
Identity = 50/190 (26.32%), Postives = 84/190 (44.21%), Query Frame = 0
Query: 443 WDLKWKPTNA-------RTTKCKQRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKLNGEG 502
WDLK+ P+ A R R+G LA+ +G + ++ +P P +A+ ++ +
Sbjct: 471 WDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKVLLFSLPHP---EALLAQQPPDA 530
Query: 503 TDPRFVKLK--PTFRCSMLRSADTQSIP--LTVEWSPTPPYDYLLAGCHDGTVALWKFSA 562
P K++ T + +++ D L++ W PT P+ +L AG ++G V W
Sbjct: 531 VKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTRPHQHLAAGYYNGMVVFWNLPT 590
Query: 563 SSTAEDTR---------PLLRFSADTVPIRAVAWAPSESEPESENVILIASHGG---IKF 610
+S + R P F A +R + W + S +AS G IKF
Sbjct: 591 NSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWCKANSH-------FLASAGSDRKIKF 650
BLAST of CmoCh14G002880 vs. ExPASy TrEMBL
Match:
A0A6J1F7U5 (uncharacterized protein LOC111441649 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441649 PE=4 SV=1)
HSP 1 Score: 1817.0 bits (4705), Expect = 0.0e+00
Identity = 901/901 (100.00%), Postives = 901/901 (100.00%), Query Frame = 0
Query: 1 MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE 60
MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE
Sbjct: 1 MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE 60
Query: 61 FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT 120
FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT
Sbjct: 61 FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT 120
Query: 121 SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV 180
SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV
Sbjct: 121 SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV 180
Query: 181 HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK 240
HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK
Sbjct: 181 HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK 240
Query: 241 DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 300
DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV
Sbjct: 241 DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 300
Query: 301 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 360
EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD
Sbjct: 301 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 360
Query: 361 TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF 420
TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF
Sbjct: 361 TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF 420
Query: 421 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 480
SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF
Sbjct: 421 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 480
Query: 481 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 540
PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH
Sbjct: 481 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 540
Query: 541 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 600
DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW
Sbjct: 541 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 600
Query: 601 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 660
DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT
Sbjct: 601 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 660
Query: 661 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 720
AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF
Sbjct: 661 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 720
Query: 721 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 780
VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA
Sbjct: 721 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 780
Query: 781 LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD 840
LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD
Sbjct: 781 LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD 840
Query: 841 VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK 900
VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK
Sbjct: 841 VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK 900
Query: 901 K 902
K
Sbjct: 901 K 901
BLAST of CmoCh14G002880 vs. ExPASy TrEMBL
Match:
A0A6J1F1Y4 (uncharacterized protein LOC111441649 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441649 PE=4 SV=1)
HSP 1 Score: 1741.1 bits (4508), Expect = 0.0e+00
Identity = 869/901 (96.45%), Postives = 869/901 (96.45%), Query Frame = 0
Query: 1 MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE 60
MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE
Sbjct: 1 MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE 60
Query: 61 FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT 120
FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT
Sbjct: 61 FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT 120
Query: 121 SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV 180
SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV
Sbjct: 121 SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV 180
Query: 181 HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK 240
HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK
Sbjct: 181 HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK 240
Query: 241 DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 300
DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV
Sbjct: 241 DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 300
Query: 301 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 360
EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD
Sbjct: 301 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 360
Query: 361 TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF 420
TKNHVDDVGT LCKNISEKAILDTGSTGF
Sbjct: 361 TKNHVDDVGT--------------------------------LCKNISEKAILDTGSTGF 420
Query: 421 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 480
SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF
Sbjct: 421 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 480
Query: 481 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 540
PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH
Sbjct: 481 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 540
Query: 541 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 600
DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW
Sbjct: 541 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 600
Query: 601 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 660
DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT
Sbjct: 601 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 660
Query: 661 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 720
AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF
Sbjct: 661 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 720
Query: 721 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 780
VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA
Sbjct: 721 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 780
Query: 781 LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD 840
LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD
Sbjct: 781 LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD 840
Query: 841 VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK 900
VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK
Sbjct: 841 VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK 869
Query: 901 K 902
K
Sbjct: 901 K 869
BLAST of CmoCh14G002880 vs. ExPASy TrEMBL
Match:
A0A6J1J0H6 (uncharacterized protein LOC111481574 OS=Cucurbita maxima OX=3661 GN=LOC111481574 PE=4 SV=1)
HSP 1 Score: 1692.6 bits (4382), Expect = 0.0e+00
Identity = 848/901 (94.12%), Postives = 854/901 (94.78%), Query Frame = 0
Query: 1 MEELPHQAEASMGTSCKKGKKKSVSLEEPQKRAKKKGGATSVNEVQPTGRLDDSRVKVSE 60
MEELPHQAEASMGTSCKKGKKKSVSLEEP KRAKKK GATSVNEVQPTGRLDD RVKVSE
Sbjct: 1 MEELPHQAEASMGTSCKKGKKKSVSLEEPLKRAKKKAGATSVNEVQPTGRLDDFRVKVSE 60
Query: 61 FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT 120
FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT
Sbjct: 61 FDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFT 120
Query: 121 SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLV 180
SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGAT SLDFRNFIMHVGGPVWAIDWCPLV
Sbjct: 121 SDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATTSLDFRNFIMHVGGPVWAIDWCPLV 180
Query: 181 HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECK 240
HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETT+ATECK
Sbjct: 181 HERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTNATECK 240
Query: 241 DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 300
SDLSQPKRPRGRPPGRKKNGASAL SQ KRPRGRPKKKQEEPN DN+VASYQLVQPLSV
Sbjct: 241 ASDLSQPKRPRGRPPGRKKNGASALSSQQKRPRGRPKKKQEEPN-DNEVASYQLVQPLSV 300
Query: 301 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 360
EYPDVSSNLLEIDDV HNSEK VSLENSVERGSSTIEEISTCNSEDEVPVQKRR RRNAD
Sbjct: 301 EYPDVSSNLLEIDDVPHNSEKLVSLENSVERGSSTIEEISTCNSEDEVPVQKRRERRNAD 360
Query: 361 TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF 420
TKNHVDDVGT LCKNISE AILDTGSTGF
Sbjct: 361 TKNHVDDVGT--------------------------------LCKNISENAILDTGSTGF 420
Query: 421 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 480
SIPE+VALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWE+PF
Sbjct: 421 SIPESVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEIPF 480
Query: 481 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 540
PHVVKAIYS LNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH
Sbjct: 481 PHVVKAIYSNLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 540
Query: 541 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 600
DGTVALWKFSA+STAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW
Sbjct: 541 DGTVALWKFSANSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 600
Query: 601 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 660
DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT
Sbjct: 601 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 660
Query: 661 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 720
AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF
Sbjct: 661 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 720
Query: 721 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 780
VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA
Sbjct: 721 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 780
Query: 781 LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD 840
LENESALCYDDDV VESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD
Sbjct: 781 LENESALCYDDDVGVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSDSQTDDD 840
Query: 841 VVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQEIVLSALDKKLMAK 900
VVPGLG+HFENFPPKSVALHRLRWNMNIGSERWL YGGAAGILRCQEIVLSALDKKLMAK
Sbjct: 841 VVPGLGDHFENFPPKSVALHRLRWNMNIGSERWLCYGGAAGILRCQEIVLSALDKKLMAK 868
Query: 901 K 902
K
Sbjct: 901 K 868
BLAST of CmoCh14G002880 vs. ExPASy TrEMBL
Match:
A0A5D3DPQ1 (DNA binding protein, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003700 PE=4 SV=1)
HSP 1 Score: 1330.9 bits (3443), Expect = 0.0e+00
Identity = 686/918 (74.73%), Postives = 751/918 (81.81%), Query Frame = 0
Query: 18 KGKKKSVSLE--EPQKRAKKK------------GGATSVNEVQPTGRLDD--SRVKVSEF 77
KGKKK + E EP+KRAKKK +TSVNE Q T RL+D +VKVSEF
Sbjct: 42 KGKKKPPAKEKKEPEKRAKKKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEF 101
Query: 78 DHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFTS 137
D CVENHFRA+DAI EL EAE G+GG+DESD QRFSSST FLREW+FYNYE KT+KF +
Sbjct: 102 DPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFAN 161
Query: 138 DSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLVH 197
DS PEGKDADIT+ LPQFSSAAVLK GAPPGA+ SLDFRNF MHVGGPVWAIDWCP VH
Sbjct: 162 DSTGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVH 221
Query: 198 ERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECKD 257
RT+SLIKCEFIAVSAHPPGSSYH MGIPL+GRGMVQIWCLVHGTE++E
Sbjct: 222 GRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGE---PP 281
Query: 258 SDL-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 317
SDL SQPK+PRGRPPGRKK AS LPS PKRPRGRPKK+Q+E + D K + QLVQ S+
Sbjct: 282 SDLSSQPKKPRGRPPGRKKKEASGLPSPPKRPRGRPKKEQKE-STDKKGDNCQLVQEFSM 341
Query: 318 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 377
E P SS+LLEID V N+E V LEN+VER ST++E+STCNSEDEVP +KRRVRR
Sbjct: 342 ENPVGSSSLLEIDGVPKNTENFVLLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVK 401
Query: 378 TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF 437
++N VDDVG SL E +EDGS A NHEA+ENV SEYSGED LCK+ISE +LD S F
Sbjct: 402 SRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEF 461
Query: 438 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 497
SIPE+VALPR+VLCLAHNGKVAWDLKWKP NA T CK RMGYLAVLLGNGSLEVWEVPF
Sbjct: 462 SIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPF 521
Query: 498 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 557
PH VK IYSK NGEGTDPRFVKLKP FRCS LR+A+TQSIPLTVEWS PPYDYLLAGCH
Sbjct: 522 PHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCH 581
Query: 558 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 617
DGTVALWKFSA+S+ EDTRPLLRFSADTVPIRAVAWAPSES ES NVIL A HGG+KFW
Sbjct: 582 DGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFW 641
Query: 618 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFT 677
DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAA DVP TGQPFT
Sbjct: 642 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAANDVPATGQPFT 701
Query: 678 AIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHF 737
AIKQKGLHTY CS +AIWSIQVSRQTGMVAYCGADGAVVRFQLTTKA DKENSR+RTPH+
Sbjct: 702 AIKQKGLHTYICSSYAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAADKENSRHRTPHY 761
Query: 738 VCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSA 797
VCEYLTEE+SIIT SP +VPIPLKKLSNKSE PLSMRAILSDSMQ NEGN K+AT S
Sbjct: 762 VCEYLTEEESIITFRSPPPNVPIPLKKLSNKSEHPLSMRAILSDSMQSNEGNHKTATAST 821
Query: 798 LENESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSD------ 857
LENE+++C D DV VESGSEDTP+S + KN+TQ K KKKGV N ELE + EP D
Sbjct: 822 LENEASICSDVDVGVESGSEDTPLSTKKKNRTQPKCKKKGVENLELECNVEPKDDAHIDA 881
Query: 858 -----------SQTDDDVVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGIL 902
++ D DVVP G+HFEN PPKSVA+HR+RWNMN+GSE+WL YGGA+GIL
Sbjct: 882 DVEAQTDAVLEARMDADVVPSSGDHFENLPPKSVAMHRVRWNMNMGSEKWLCYGGASGIL 941
BLAST of CmoCh14G002880 vs. ExPASy TrEMBL
Match:
A0A0A0LGM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G775290 PE=4 SV=1)
HSP 1 Score: 1328.5 bits (3437), Expect = 0.0e+00
Identity = 688/916 (75.11%), Postives = 749/916 (81.77%), Query Frame = 0
Query: 18 KGKKKSVSLE--EPQKRAKKK----------GGATSVNEVQPTGRLDD--SRVKVSEFDH 77
KGKKK + E E +KRAKKK +T VN+ Q T RLDD VKVSEFD
Sbjct: 43 KGKKKPPAKEKKELEKRAKKKTPVTATVVTATTSTEVNKHQSTARLDDVVPEVKVSEFDP 102
Query: 78 CVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFTSDS 137
CVENHFRA+DAI EL EAE+G+GG+DESD QRFSSST FLREW+FYNYEPKT+KF +DS
Sbjct: 103 CVENHFRAMDAIVELCCEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFANDS 162
Query: 138 RVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLVHER 197
R PEGKDADIT++LPQFSSAAVLK GAPPGA+ SLDFRNF MHVGGPVWAIDWCP VHER
Sbjct: 163 RGPEGKDADITIDLPQFSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHER 222
Query: 198 TDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECKDSD 257
T+SLIKCEFIAVSAHPPGSSYH MGIPL+GRGMVQIWCLVHGTES+E SD
Sbjct: 223 TNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTESYEPIDVGE---PPSD 282
Query: 258 L-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSVEY 317
L SQPKRPRGRPPGRK+ GAS LPSQPKRPRGRPKK+Q+E ND K + QLVQ S+E
Sbjct: 283 LSSQPKRPRGRPPGRKEKGASVLPSQPKRPRGRPKKEQKESNDKKKGDNCQLVQEFSMEN 342
Query: 318 PDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNADTK 377
P SSNLLEID V N+E V LEN+VER SST++E+STC+SEDEVP +KRRVRR +
Sbjct: 343 PVGSSNLLEIDGVPKNTENFVLLENNVERESSTLQEVSTCHSEDEVPAKKRRVRRKVKPR 402
Query: 378 NHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGFSI 437
N VDDVG LSL E +EDGS A NHEANENV SEYSGED LCK+ISE +LD S FSI
Sbjct: 403 NLVDDVGVLSLAEYQEDGSIANNHEANENVKSEYSGEDNLLCKDISENVVLDASSIEFSI 462
Query: 438 PETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPFPH 497
PE+VALPR+VLCLAHNGKVAWDLKWKP NA T CK RMGYLAVLLGNGSLEVWEVPFPH
Sbjct: 463 PESVALPRVVLCLAHNGKVAWDLKWKPMNACTDNCKHRMGYLAVLLGNGSLEVWEVPFPH 522
Query: 498 VVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCHDG 557
VKAIYSK NGEGTDPRF+KLKP FRCS LR+ +TQSIPLTVEWS TPPYDYLLAGCHDG
Sbjct: 523 AVKAIYSKFNGEGTDPRFMKLKPIFRCSRLRTTNTQSIPLTVEWSRTPPYDYLLAGCHDG 582
Query: 558 TVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFWDL 617
TVALWKFSA+S+ EDTRPLLRFSADTVPIRAVAWAPSES+ ES NVIL A HGG+KFWDL
Sbjct: 583 TVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESDLESANVILTAGHGGLKFWDL 642
Query: 618 RDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAI 677
RDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAA DVP TG+PFTAI
Sbjct: 643 RDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAANDVPATGRPFTAI 702
Query: 678 KQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVDKENSRNRTPHFVC 737
KQKGLHTY CS +AIWSIQVSRQTGMVAYCGADGAVVRFQLTTKA DKENSR+RTPH+VC
Sbjct: 703 KQKGLHTYICSSYAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAADKENSRHRTPHYVC 762
Query: 738 EYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPNEGNDKSATTSALE 797
EYLTEE+SIIT SP +VPIPLKKLSNKSE PLSMRAILSDS+Q NE DK+AT S LE
Sbjct: 763 EYLTEEESIITFRSPPPNVPIPLKKLSNKSEHPLSMRAILSDSVQSNE--DKTATASTLE 822
Query: 798 NESALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEHSHEPSD-------- 857
NE+ +C D DV VESGSEDT + KN+TQ K K+GV ELE S EP D
Sbjct: 823 NEATICSDVDVRVESGSEDTLTPTKKKNRTQPKC-KEGVEKLELECSDEPKDDAHMDADV 882
Query: 858 ---------SQTDDDVVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRC 902
+Q D D +P G+HFEN PPKSVA+HR+RWNMNIGSE WL YGGAAGILRC
Sbjct: 883 DAQTDAVLEAQMDADALPTSGDHFENLPPKSVAMHRVRWNMNIGSEEWLCYGGAAGILRC 942
BLAST of CmoCh14G002880 vs. TAIR 10
Match:
AT1G19485.1 (Transducin/WD40 repeat-like superfamily protein )
HSP 1 Score: 692.2 bits (1785), Expect = 5.6e-199
Identity = 398/866 (45.96%), Postives = 528/866 (60.97%), Query Frame = 0
Query: 51 LDDSRVKVSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFY 110
+D +S FD+ E+H +A+++I +L GEA +DE+D SSS TFLREW+ Y
Sbjct: 1 MDGEECNISLFDYSAESHLKAVESITDLCGEA---NADIDENDINILSSSVTFLREWRHY 60
Query: 111 NYEPKTVKFTSDSRVPEGKDADITMELPQFSSAAV--LKNGAPPGATASLDFRNFIMHVG 170
N+EPK+ F +++ + LPQFSSA +K +++ ++F+MHVG
Sbjct: 61 NFEPKSFAFYNEAEKNHQPKDINSQTLPQFSSARAPKVKIHDDESSSSGEISKDFVMHVG 120
Query: 171 GPVWAIDWCPLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTES 230
G VWA++WCP VH D+ KCEF+AV+ HPP S H +GIPL GRG++QIWC+++ T
Sbjct: 121 GSVWAMEWCPRVHGNPDAQAKCEFLAVATHPPDSYSHKIGIPLIGRGIIQIWCIINATCK 180
Query: 231 HESETTSATECK------------DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRP 290
+S S K ++ ++PK+PRGRP +K+ ++PK+PRGRP
Sbjct: 181 KDSGQVSDKGKKLTGKSRKQPSGETTETTEPKKPRGRP---RKHPVET--TEPKKPRGRP 240
Query: 291 KKKQ--EEPNDDNKVASYQLVQPLSVEYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSS 350
+KK E P + + Y V+ LSV YP+ ++++ + E PV+ GS
Sbjct: 241 RKKSTAELPVELDDDVLY--VEALSVRYPE--NSVVPATPLRILRETPVTETKVNNEGSG 300
Query: 351 TIEEISTCNSEDEVPVQKRRVRRNADTKNHVDDVGTLSLIENREDGSNATNHEANENVTS 410
+ +S+ N+ ++PV+++R + TK+ ++ T ++E EA NV S
Sbjct: 301 QV--LSSDNANIKLPVRRKRQK----TKS-TEESCTPMILE---------YSEAVGNVPS 360
Query: 411 EYSGEDTRLCKNISEKAILDTGSTGFSIPETVALPRLVLCLAHNGKVAWDLKWKPTNART 470
+ S ISE + VALPR+VLCLAHNGKV WD+KW+P+ A
Sbjct: 361 KPS-------SGISE--------------DIVALPRVVLCLAHNGKVVWDMKWRPSYAGD 420
Query: 471 TKCKQRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRS 530
+ K MGYLAVLLGNGSLEVW+VP P A+Y TDPRFVKL P F+CS L+
Sbjct: 421 SLNKHSMGYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKC 480
Query: 531 ADTQSIPLTVEWSPTPPYDYLLAGCHDGTVALWKFSASSTAEDTRPLLRFSADTVPIRAV 590
DT+SIPLTVEWS D+LLAGCHDGTVALWKFS + ++EDTRPLL FSADT PIRAV
Sbjct: 481 GDTKSIPLTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSEDTRPLLFFSADTAPIRAV 540
Query: 591 AWAPSESEPESENVILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLS 650
AWAP ES+ ES N++ A H G+KFWDLRDPFRPLWDLHP PR IYSLDWL +P CV LS
Sbjct: 541 AWAPGESDQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLS 600
Query: 651 FDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGA 710
FDDGTLR+LSL+K AYDVP TG+P+ KQ+GL Y CS F IWSIQVSR TG+ AYC A
Sbjct: 601 FDDGTLRILSLVKVAYDVPATGRPYPNTKQQGLSVYNCSTFPIWSIQVSRLTGIAAYCTA 660
Query: 711 DGAVVRFQLTTKAVDKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKK-LSNKSE 770
DG++ F+LTTKAV+K+ +RNRTPH++C LT + S +HSP D+PI LKK + E
Sbjct: 661 DGSIFHFELTTKAVEKD-TRNRTPHYLCGQLTMKDSTFIVHSPVPDIPIVLKKPVGETGE 720
Query: 771 QPLSMRAILSDSMQPNEGNDKSATTSALENESALCYDDDVDVESGSEDTPMSIQNKNQTQ 830
+ +R++L NE + A+ + A + +D +ES SE T N +
Sbjct: 721 KQRCLRSLL------NESPSRYASNVSDVQPLAFAHVEDPGLESESEGT-----NNKAAK 780
Query: 831 SKSK--KKGVVNQELEHSHEPSDSQTDDDVVPGL---------GEHFENFPPKSVALHRL 889
SK+K K +E E+S + D G G E FPPK VA+HR+
Sbjct: 781 SKAKKGKNNARAEEDENSRALVCVKEDGGEEEGRRKAASNNSNGMKAEGFPPKMVAMHRV 805
BLAST of CmoCh14G002880 vs. TAIR 10
Match:
AT1G19485.2 (Transducin/WD40 repeat-like superfamily protein )
HSP 1 Score: 692.2 bits (1785), Expect = 5.6e-199
Identity = 398/866 (45.96%), Postives = 528/866 (60.97%), Query Frame = 0
Query: 51 LDDSRVKVSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFY 110
+D +S FD+ E+H +A+++I +L GEA +DE+D SSS TFLREW+ Y
Sbjct: 1 MDGEECNISLFDYSAESHLKAVESITDLCGEA---NADIDENDINILSSSVTFLREWRHY 60
Query: 111 NYEPKTVKFTSDSRVPEGKDADITMELPQFSSAAV--LKNGAPPGATASLDFRNFIMHVG 170
N+EPK+ F +++ + LPQFSSA +K +++ ++F+MHVG
Sbjct: 61 NFEPKSFAFYNEAEKNHQPKDINSQTLPQFSSARAPKVKIHDDESSSSGEISKDFVMHVG 120
Query: 171 GPVWAIDWCPLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTES 230
G VWA++WCP VH D+ KCEF+AV+ HPP S H +GIPL GRG++QIWC+++ T
Sbjct: 121 GSVWAMEWCPRVHGNPDAQAKCEFLAVATHPPDSYSHKIGIPLIGRGIIQIWCIINATCK 180
Query: 231 HESETTSATECK------------DSDLSQPKRPRGRPPGRKKNGASALPSQPKRPRGRP 290
+S S K ++ ++PK+PRGRP +K+ ++PK+PRGRP
Sbjct: 181 KDSGQVSDKGKKLTGKSRKQPSGETTETTEPKKPRGRP---RKHPVET--TEPKKPRGRP 240
Query: 291 KKKQ--EEPNDDNKVASYQLVQPLSVEYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSS 350
+KK E P + + Y V+ LSV YP+ ++++ + E PV+ GS
Sbjct: 241 RKKSTAELPVELDDDVLY--VEALSVRYPE--NSVVPATPLRILRETPVTETKVNNEGSG 300
Query: 351 TIEEISTCNSEDEVPVQKRRVRRNADTKNHVDDVGTLSLIENREDGSNATNHEANENVTS 410
+ +S+ N+ ++PV+++R + TK+ ++ T ++E EA NV S
Sbjct: 301 QV--LSSDNANIKLPVRRKRQK----TKS-TEESCTPMILE---------YSEAVGNVPS 360
Query: 411 EYSGEDTRLCKNISEKAILDTGSTGFSIPETVALPRLVLCLAHNGKVAWDLKWKPTNART 470
+ S ISE + VALPR+VLCLAHNGKV WD+KW+P+ A
Sbjct: 361 KPS-------SGISE--------------DIVALPRVVLCLAHNGKVVWDMKWRPSYAGD 420
Query: 471 TKCKQRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRS 530
+ K MGYLAVLLGNGSLEVW+VP P A+Y TDPRFVKL P F+CS L+
Sbjct: 421 SLNKHSMGYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKC 480
Query: 531 ADTQSIPLTVEWSPTPPYDYLLAGCHDGTVALWKFSASSTAEDTRPLLRFSADTVPIRAV 590
DT+SIPLTVEWS D+LLAGCHDGTVALWKFS + ++EDTRPLL FSADT PIRAV
Sbjct: 481 GDTKSIPLTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSEDTRPLLFFSADTAPIRAV 540
Query: 591 AWAPSESEPESENVILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLS 650
AWAP ES+ ES N++ A H G+KFWDLRDPFRPLWDLHP PR IYSLDWL +P CV LS
Sbjct: 541 AWAPGESDQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLS 600
Query: 651 FDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGA 710
FDDGTLR+LSL+K AYDVP TG+P+ KQ+GL Y CS F IWSIQVSR TG+ AYC A
Sbjct: 601 FDDGTLRILSLVKVAYDVPATGRPYPNTKQQGLSVYNCSTFPIWSIQVSRLTGIAAYCTA 660
Query: 711 DGAVVRFQLTTKAVDKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKK-LSNKSE 770
DG++ F+LTTKAV+K+ +RNRTPH++C LT + S +HSP D+PI LKK + E
Sbjct: 661 DGSIFHFELTTKAVEKD-TRNRTPHYLCGQLTMKDSTFIVHSPVPDIPIVLKKPVGETGE 720
Query: 771 QPLSMRAILSDSMQPNEGNDKSATTSALENESALCYDDDVDVESGSEDTPMSIQNKNQTQ 830
+ +R++L NE + A+ + A + +D +ES SE T N +
Sbjct: 721 KQRCLRSLL------NESPSRYASNVSDVQPLAFAHVEDPGLESESEGT-----NNKAAK 780
Query: 831 SKSK--KKGVVNQELEHSHEPSDSQTDDDVVPGL---------GEHFENFPPKSVALHRL 889
SK+K K +E E+S + D G G E FPPK VA+HR+
Sbjct: 781 SKAKKGKNNARAEEDENSRALVCVKEDGGEEEGRRKAASNNSNGMKAEGFPPKMVAMHRV 805
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q8WUA4 | 5.9e-12 | 25.67 | General transcription factor 3C polypeptide 2 OS=Homo sapiens OX=9606 GN=GTF3C2 ... | [more] |
Q8BL74 | 5.9e-12 | 25.67 | General transcription factor 3C polypeptide 2 OS=Mus musculus OX=10090 GN=Gtf3c2... | [more] |
Q5RDC3 | 5.9e-12 | 26.32 | General transcription factor 3C polypeptide 2 OS=Pongo abelii OX=9601 GN=GTF3C2 ... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1F7U5 | 0.0e+00 | 100.00 | uncharacterized protein LOC111441649 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1F1Y4 | 0.0e+00 | 96.45 | uncharacterized protein LOC111441649 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1J0H6 | 0.0e+00 | 94.12 | uncharacterized protein LOC111481574 OS=Cucurbita maxima OX=3661 GN=LOC111481574... | [more] |
A0A5D3DPQ1 | 0.0e+00 | 74.73 | DNA binding protein, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 G... | [more] |
A0A0A0LGM2 | 0.0e+00 | 75.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G775290 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT1G19485.1 | 5.6e-199 | 45.96 | Transducin/WD40 repeat-like superfamily protein | [more] |
AT1G19485.2 | 5.6e-199 | 45.96 | Transducin/WD40 repeat-like superfamily protein | [more] |