Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATCCGCCCGCCGTTTCTCAATTGAACATCGTCACTCTCGTCTCCTGTTTGTCTCTGCGACGCGATAAACAAAGAAAGGCCGATGAATTGATCGCAGTGACCGCCGGTGAGTACCTTCCCCGTATCGATCATGACCCAGAACCAGCTTATCGACTCCCTTACATCCCATATCTCTCTCTACCACTCTACATCTGGTAATTTCAACCGTGATCCTAATCCCAATCCCAGGTCCTCGATCCTGAAATGGTTCTCTTCTCTCAGCGTCCACCAACGCCAAGCTCACCTCACGGTCGTTGATTTCAAATTCGTCCAAGTCCTCATCCAAATGGTGGCAGAAGTTCGGAAACGAGGACACGGTTTCTTCATCCTCCTGCCTGACATTCCCTCCTGCGACCCTCTGCACCTACCCAGCTTATGCTTTAAGAAGTCCCGCGGGCTCTTGTCTCGTGTCTCCGAGTCCAGCGTGTCCGAAAGGATGATTTTCGAGTCCAGTCGACTATTCGGTTCCAGGGAAGGCGATAAGCTCGAGGAGTGTTCTTGTTCGTTAAAGAACATCGATTCTTTAACTGTAAGCGAGGATTTCGTCTCAAACGTGGACAAATTTGTCGAGGCAATGGACGGAGTTTCAAATGGGGCGTTTTTGAGAGGTGAAGGGGGTGACATGGCGTCCAATTGGGCTGAGTTAAATTGGTTAAAAGCGAAAGGATATTACAGTATCGAGGCCTTTGTGGCAAACAAGTTGGAGGTGGCTTTGAGACTCTCATGGATGAGCTTGAATAATGGAAAAAAAAGATCGGTAAAGTTCAAAGAAAAGGCTAGCGCAATTGGCATGGCGACAAACGTGTTTTGGAGGAAGAAGGGATGCGTGGACTGGTGGGATAAATTGGATGCTTCGTCAAAGGAAAAAATATTGACAGCAATTCTGGGAAAATCAGCAAAAAGTTTGGTAATGCTGGGAACATTAACATGCTGTCTATTTTGTGTTCTCATTTAGTTGAAGTAACAAATTATTTAAGAATAATCGGGTTGCTGGTGCTTAATCCTGCATCTCTTACTTAATTCTGCTGAAGTTTCTGTACCTTTTATTTTATACATTTTTTAGTCCATACTCCATACATTTAGGTCGACCATGTGGATTGAGATTGTTGGATCAAGCTATGTCGCGCAAAGGATTTGGGTATAAATGGAGATATTGGATTTCTAGTTGTCTTAGGACAACTAGATACTCTATCCTTGTTAATAGTAAGTTGAGAGGTAGCATTCTTGCACTGTCCCCTTCGTCTTCTTACTGGTTGGTGTGTAATCCGAGAACAATCGTATCGAGGGAGGTGGAGAGCGACATAATTGAAGGGTTTCAGGTAGGGTGGGATAATATTTCGTTGTTTCATTTTTAGTTTTTTGACGATACTATCTTCTTTTGCTGGGTGACTTGTGAGGTAGGTTCCTTTCCTTCCTTTCTACTTTTTTGGATCCAATTGTGGAAAAGTTTCAAAAACGTCTGCCTTCTCTCAACAAAAGTTTTTTTTTCCAAGGGAGGTCGATTCACTTTGATACAATCAGTTTTCAGCTCTTTTCTCCTTTTGTTAAGGGTTCCTGTGTCAGCTAGTAAGTCTTTGGAGAAGCATATAAGAGACTTCTTATGGGACAGGTTTGATGAGGGGAAGAGTCTCTTGAGTCTGCTTGGATGTGGTGACCAAGCCGTTGGATCTCATGGTTCTAGGTATATGTAATTTAAGGGCACAAAACGAGGTATTGTTGGCTAAATGATTGTGGCAATGTCCTCAATATTATGACACCTTATGACATAAGGTTATCGCGAGTAAATATGATTGTTATTTTGATTGGATCCCCCGCCATGGTTTCAAAAGGTACTACTAGAAATTTATTTGTGATGGGATTTTGATATGAACTAACAGACATCAATCATACAATATATTTCAATGAAGATGTGAAGAAAATGCTTTTGAAATTATCAAAAGATATTTCAATTCATGTCACATTGAAAAAGGATGAAACTTCATTGTTCTAAATTTTTGGTGGATTACAATTTAGAGTAATTTAGAGTTATTGTATTTCATTAATATATTTTAAGACTTTTCATTTACTTTAAAGTTACATTTACATTGTGGCTAATTATAGTCATAAAGTATACATAATGGCTAATTATGGTTATAAAGTATGAATGTTACACTCTTGAAATGTCTCATGTAAGGTCAAGAGTTTCATTTGAGGCTATATAAAGCCATGTATGTTGTATTTGTAAGGAGACTTGGAAAATGTAGTAAAAGAAGCATTTGTGCTTTCTTTAGCCAATGACTAAGTTTCTTTGAATTTTGTGTAGTTTAGGTTGCATGTAGAATCGTTCAAGCCCGACATGATCAATCTTGCTTGTAGAATGATTCGAATCTCAAACAAGTGTTCTTGTCTTGAGATATTCGATCAACAAGGTAATTTGAATCTTACTCCCCTTGTAGTGATTCCTTATCATTTGGCATCAGAACCTTTTCATTGGGTTGATTTATTTGATCAACAAGAATTATTCAAGCTAGACATGATCGATTTTTCTTGTGGAGTGATTCGAATCTCAAACAAGTGTCCTTGCCTTGAGATATTCGATCAACAAGGTAATTCGAATCTTACTCCCCTTGTAGTGATTCCTTATCATTTGGCATCTGAACCTTTTCATTGGGTTGATTTCCTTATCATTTGGTATCAGAGCCTTCTCTTGGATTTATTCCATATCATTTGGTATCAGAGCTTTCCATTGGGCTAATTTCATATCATTCCTTTCCATTGGGCTAATTTCATATCATTTGGTTTGGGCTGATTTCATATAATTTAGTATGCTTTCTCTTGGGCGGATTCGTGTTTGGTAGCTGATGGACTGGATACTTATTTATGGGAGGATAAATGGTTAGGGGAATAAAGCTCTTTGCTCTTCGTTTCATCGTTTATACTATTGTCTTCTATGAGAAATTATTAGGTGGCTTCTATCCTTCCCATGTATGGACATTTTTTTTAAAGGGCATCATGTAACAGCTCTAACCCACCCCAAACCCACCGCTAACAGATATTGTTCGCTTTGGCCCATTACGTATCGTCATCAGCCTTACGATTTTAAAACGCATATACTAGAGAATGGTTTCCACACCCTTATAAGGAATGTTTCGTTTCCCTCTCTAACTGATGTGGGATCTTAGAATCCACCCCCTTGGGGGCCCAGCATCCTCGCTGACACACAACTCGGTGTTTGACTCTAATGCCATTTGTAATAGCCCAGGCCCACCGTTAGCAGATATTGTGGAGAGGGGAACGACAAAGGTGTGGAAACTCTCCCTAGTAGACGCGTTTTAAAATCGTGAGGCTGACGGCGATAAGTAACGGGCAAAAGCGGACAATATCTGCTGGTGTTGGGCTTGGGCTGTTACTAATGGTATCAAAGCCAAACATCGGGCGGTGTGCTAGGGAGGACGTTGGCCCCCCGGGGGGGGGGGGCGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACGATAAGCGTGTGGAAACCTCTCCTTAATAGATGTATTTTAAAACCGTGAGACAGACGACGATACGTAATGAGCCGAAGTGGACAATATCTGGGCTTGAGTTGTTACAGATGGTATCAGAGTCAGACACCGGGCGGTGCGTCTGCGAGGACGTTAGGCCCATAAGGGGGGTGGATAGCAAGATCCCACATCGGTTGGAGAGTGGAACGAAGCACTCCTTATATGGATGTGGAAACCTCTCCCTAGTATATGCATCTTAAAACCAAGAGGCTGACGACGATACGTAACAGGCCGAAGTGGATAATATTTGTTAGCGGTGGGTTTGAGCTGTTTCAAAAATCCAAAGAGGATAATATTTGCTAGTGGTGGGTTTGGACCGTTACAAATGGTATCAGAGCCAGACATCGGGCGTGCCAGCGAGGACACTGGGCCTCCAGGGGGGTGGATTGTGAGATCTCACATCGATTGGAGAGGGGAATGAAGCATTCCTTATAAGAGCATGGAAACCTCTCCCTAACAAACGGGTTTTAAAAACTGTAAGATTGACAACGATGGGCTTGGGTTGTTACACATTCAGCTTTCTCTTGGCCCGTCTTAGAGGACGTAAGTTCTTCATATTTGATATAGCTAGATTTGTCGCCTTTTTATCATTTGTTGGGCTGTTGTTTTGCGGTTGCTTTGTCTTTACTCTTTAGGCCGTTAATTTTGCATTTTTTACACATTGTGGCTTTTAGTTAGTGTGGCTGCCTGTTTAGTTGTTAGTATAAATATTATTTTCTGTACTGAAGGCATAACCTTCTAACAAAAATTCAGAATATTGGTTCACCATTATTCTATGTCTGTCCTTTGACCAGGATCGTATTCTGTGATTTACCAACTTTTGTGCTATTTTACGGGAGTTTTTGGATTGAGAGGAAAAATAGAAAGTTTACGGGAGTTTTTTGGCATAACCTTCCAAGAATGTTTTCGAGGAAAGTCCAATCAACCTTGTCGAAGGCCTTTTCCATGTCAAGTTTAATGACTACACCTTTTTGTTTCTTTCTTTCCCATTCATCAATGAGTTCGTTGGTCATGAGGATGCATCAAGAATTTGTTGCCCCTCAATAAAAGCTGTTTGTTGCCCAATTATTGTGAAGGGGAGAACCTTTTTAAGGCGTTCTGATAATACCCTTGCTATGATTTTATATAGACCGGTCGTGAGGCTTATGGGACGAAAGTCTGCAACAGTGCGAGCATCTAGTTTCTTTGAGATCAAACATATGTATGTTTCATTCAGGTTGCCATTAATAACTCCGGTTCGAAAAAAATCATGGAATACTCTCACGATATCAGCTCTCATAAAGTTCCAACATTTCTGAAAGAATTTTGAGGTAAAACCATCCGGTCCTGGAGTTTTGTCAGAGCCCAGATTCTGAATCGCCTTCCCCACCTCTTCTTCAGTGAAAGCGACCTTGTGGGAGGCAGCTTGCTGTTGATCAATAGGACTCCAGTCAAGTGAGGAAATTCGCAAAGGGCATTGTCCTTTGTGTATAAGGCGTGTAAAAGGACACAAATTCTAATACAATTTTGTCTTCGTTTACTAAACTTATACCTTGTGTGGATAGAATTCCCATGATGGTACTCTTTCGTTTCTTTGTGGCCATGATACGATGGAAAAATTTGGAGTTTATGTCACCTTCCTCTAACCATCGTTTTTTGCATTTTTGTCTCCAAGATTGCTCTTCATTTACTACCAACGAAATAAGTTCTGCTTTCATGAGCTTTTTCTTTGTGTTGAACTAGTGTGATGGAGCCAGTTTCCTCCATCTGATCAAGGAGGGCAATTTCGGTTAGCAATTGGTTCTTTTTTGTCGAAATACAACCAAAAACCTCCTTATTCCATTTCTTTAGGACTCCTTTTAGTCTTTTAAGTTTGTTGATGAAACCGTGCCCTGGCCATCCATGGAGGGACGTATTCTTCCCCCAGTATTCTACCATTGGCAAAAATTCAGAATGGTTCAGCCACATATTCTCAGAACGGAAGGGTGAAGGGCCCCATTTACAGCAACCCATGGAAAGTAGAATGGGATAATGATAAACATGTCCATGGAAAGTAGAGGGGCCCCGTTTGGAGAGGGAGAATTTCACCACCAACCATCTTTAACTTGTAGGGGATGCTTTCTTTTTATTAGATTACAGCGGTGAAAAGCTTGGAACATGTTTCTTGTTCAAAAAAAAAAAAAAAAAAACAAAGAGTTATCTCGAAATAACTCTTGCAGTGACTTTATTTTCTAGTTAAGAAACATTATTTCATTACTAGAAAAAATTGTGTAGTCTTGAAGGAGAAGGTAGGTTATTTTTTTCAGATCCAAAAATTGTTTACGGCTGGGAAACCCAAGTCTTTATATTTCTAATCTAGATTTTGAGAAATTGAGAATTAGTAACTAACCTCGGTAACTAATGAACTGAGTTGGCTGACAAAATACTAAAACTAATTTATTAATGGACCTGCTAATTAAGATAATTACAAAGATACCCCTAACAAGCTTACATCGTTCTTCTTTCCTGGAAAATTTGACTCACTTTTTTTTCGGGGTAAGAAATTGAGATTTCAGTAGTTTGAGATCTATCATCATGTTTCAGCCTAATTTCCATTCCTTTTTTTTGTTTTCAGATACATGAGATTCTGAGGTGGACTAGTGGACTTGCGGAGCATGAGATGGGGCTCTTTAGTGCGGAATGGAATAGACCGTTTAGGTACAATTGTACTATATCTCAACCAAGGTCCATGTTAACATCCCAAGCGGACCTGCATATTGACTTCAACATAATTCCAGCTGCCCATTCTGGAAAACCTTATTTGTTAACCAACATCTTTAGAAATTTGCTTGTGCTTCAGGACATTGTTACGATGGTAACATCGTGTCTTCATGATGAATACTACAAGACTAATCTATTTTATAGCACTTTGGGTTCTATCTGCGCCATCCCTGATTGTATATTAAGAAAATTGCGGGAACTTCTTATGTTTACTTCACTTGATTGCACAAAACTTGAACTTCTAGGAGACGGGACTAGTAAGTCCCTGCCTAGTAAATTAAGAGAGGATCTAGGTGCTTCCCGTCGAAGGAAAAAGGGAAAGAGCCGGAAGTCGCAGAATCCTGTGCTGAGGGCATGCGCGGATGATTTATCATGCAATAAATTTCTGAAGGTAAAATTAGAAGCATAAGAAATGTTATATTATTGATTTATCCGCAGGATATGAATTTAATTTTTAGATGTAGCCTCAGGAATTTGACAAGGAGTGTGCTCATAAAGGGAGAGAAGATATAGCAGAATCCACAACTATGTCGATTATGTCGAAGAGAAATGAGACTTGTAGAGAAATTTCATCTGATGTATCTAAAACGGTTGATTTGGTTTTTATTTATTATTTTATTAGTTTTTATATGTATCATATATCTAATGGCTTGAATGCACTGTAACCAGTAGCCTGTCACCAGGTACATGACGATAACACGAGTGTTGGAAAAGATCAAGGCACTGCAAGGAGGAAGAAAAAACACAAGAGTAAAAACTCTTGTGGGAACAGCAGATTAGTTGAAATAAAACCTTCTGTTGGGCCAGCCGTTAAATTTTCCTCTCCTTTTAGTTCTCAGGATCAGGTAGCAGAGTTGGATAATATAATCAGAAAACCTTCCATCTCAAGTATCAAGAATGATAGTTCAAATAATTATGAGAGTTCAACATTAAACTCAAGTCCTCTAGTTCCCTCTATCGAACCTAATAGCGAGTATGACAGTAGCCAAAATATTGAAGTATATGAAGTTTCTGGGTTAGCAAAATCTGTCTGCCAAATTGGTCCTGGAGAATCTCAGTTCCCAAAAGGAATAATTGAAAACCAACGCTTATCATCTACTTTGGAAACTTCTACATCTTTTATGGATTGTAGTGTAGTACCTTCTCATTTGCCTTCATTAAAGCTAAAGAATATCGTCAAAAGTGATGTTAATGTGAAGGGTTCTGTGCAAACTTACGAATTAAGAGATAAATCATCTTTGTTGGATAAGCTTCCAAGAACCATTGATGTAAAGGAGAAAGTATGCTTATCTCGACATCAGCTTAGTGGTGATGCTTGTAATACTAAGGCCTTGAATTCCTTGAAACATTCTCCCTATGAATGGCATGGTGTAGCTTCTTTGTATATCCCACCATTCAATTCACATCTCCCACCTGCTACTGATAGACTACATTTAGATGTTGGTCATAATTGGCACAACCATTTCCGTCGGTCCTTTGCACCTGCAATGCATCAATCAAGAAATTCTTCTGTTAAAGGTGTTTGTAATCCAGTTATGACTCGACCAGTGTTAATGAGTCTAGATTGGCCCCCAGTCTTACGGAGTGCTTCTGGCCTGGCTTCAACAATGATGTCAAATCATGATATTGGGTTTCTTACTAGGAGACAATCTTCTTTTTGTCAGGGGTTCCCCACTAACAGCAATCAAATTAGCACGGAAGATGAGTACTCTGGTAATCTCACTGATTTTCCTGATTTGTCAAACAATCAAGATCTAGCAGAGGAGTGTGATGGAAACTGGATATCGGAGGAAGAATTGGAAATGCATGCAGTTTCTGGGATAGACTATAATCAGTATTTTGGTGGTGGTGTAATGTACTGGAACCCTTCTGATCATCATGGGACAGGGTTCTCTCGACCTCCTTCTCTGAGTTCTGATGATAGCTCATGGGCTTGGCGTGAAGCTGACATGAACAGGACTGTTGATGATATGGTTGCTTTCTCTTCTTCTTACAGTAATGGGTTGACTTCCCCAACTTCTACTTCATTTTGTTCTCCTTCTGATCCAGTGGGTTCTGGAAAGCAGGCTCTTGGTTATGTGGTTCAAGGGTCTGATCTACCTAACAACATGCTTCATTCCTCACCAACTATGAAAGACACGGTGACAGAGGAGGATGCTCCTAGATCTTTGCCAAATTTGCCCAGTGATGTTGAAGGGAAGACAGGCGACTCACATTCATTTCCAATCTTGCGCCCTATTGTTGTTCCAAGTATGTCAAGGGAAAGATCAAGATCTGAGTTCTGCCATGGTCGTGATCATAAAAGCCCATGTATCCCTCCCACTAGGAGAGAGCAATCTCGAGTAAAGCGTCCACCATCTCCAGTAGTTCTTTGTGTTCCACGGGCGCCAATACCACCTCCACCTTCTCCTGTAAGTGATTCCAGGAAGCAGAGAGGGTTTCCAACTGTTAGATCTGGTAGCTCAAGTCCAAGGCATTGGGGTGTGAAGGGTTGGTATCCTGATGGAACTAATTTGGAAGAAGCATGCTTGCGTATTGATGGTGCTGAAGTAATATGGCCTAATTGGAGAAATAAAAGTAAATCTAATTGCTCGACAGTTCAACCTTTATCATTAATAGCAATGTCCCAGATAGCTATCGATCAGGAACGTGTGAGTGTGCAACCACAATCCAAATATTTGAGCTATGTACCGGGTCTGAAGTTTCCTTGTAATTAATTTTCTTTGCAGCTAGATGTTGCATTTCCTCTCTTTCCACCTACTAGTGGTCGCTCTGTAAAAAAGGAATCTCTTTCTTTGATCCATAGCCGCCTACATGATGAGATCGACTCTTTCTGCAAGCATGTAAGAGCCTTCATTTTTGCTATTATTTTAGGATTCCTAATTTAATTTATTTAGACGAATGTTGTCACTGAGATACGCTGTTCATGGGATATATGTCAAATGGTGGCATACTAGGTTGCTGCAGAAAACATGGCTAAGAAGCCTTACATCACTTGGGCTGTTAAACGGGTCACACGGTCCCTTCAAGTCTTATGGCCCAGGTCTAGGACAAACATTTTTGGTTCAAATGCAACTGGTTTGTCCCTCCCCACGAGTGATGTGGATCTTGTGGTTTGTCTGCCTCCAGTGAGAAATTTGGTAAGTTCATTACTATAATTGGATCCATCACGCAGCAGTCCTGCTTGCTTTAATCAAGACTTGTTGCTGTTATCTTTTAGGAACCTATTAAAGAAGCTGGGATCTTAGAGGGACGTAATGGTATCAAAGAGACCTGCCTCCAGGTATGTTTATTCTCATCTCAAGGAATTAAATATAAGATCGTAGCTCATTATTTTTTATGCTTACTCCAGGTACTGCTGATTATTCCTTTTAACGAATATATTTTTATTGTTAAAAAAATGAAAAATAGAAAAGAAAAAGAACTATCAGTTAGTAAATCCTATCTTAGTTGTTCGTAAAGAATTAGTTGTGTCAATATGATGATTGTTTATTCGTCTTGTTGCATTGGGATTTCTAATACCCTAGAGATGTTCACTTTTTTCCTTCGTTGATTCAGATTAGTTGAAGAGGTTTTTTTGTTCTCTCCTTGACATGAGGTTGGAATTGATCCCTAGTCTTCCAGTCTCTTGTCATTTTGATTCTTCTTTATCGTATGTTTCTTATCAATTGAACGAAAGGAAGTCAATTGTCTTATTATCTAAACCACGTTGAAGTTCCATGATCAAAAAACTTGATTAATGAATGAAAGGATAATAAGGTGATGATAAAGTGTCGTGTTTATTTGAGCTTTCAAAATAAGTACTGAATAAATCTCCGCATTACATGAAATATATGCCCACGGTGCCTAAAATTCAGAAGAGAACAATAAGAATAAAAGAAAAGGAAGGAAGCTCCTTTCCTTTTATCTCTCATTTAAAGAATTCTTGGTAGGTCACTAGTGAAGTGTATTATTGTGTTCGTTTATCCTTCTTTCTTGTTGATACAAAGCAGGTTAGATATGATTTTAGGTAGTACATATAGAAAAATTAATTATAACACTAACAAATATATATTGATTTAAAAGTAGATCAAAATCTTACTGTCTCGCTGTTCATGGAAATATGCTGTCGTTGAAATATGACTGTTTAAGAAAATGTGATGTTAAAGGAGATATTTAGATTCTTTTATTCGATGAATTTTATGTAACTGTAAAATGATACTAGATAGGATGCACGATCACTCAAAGGAAAAGGTTGGCTCCTTAGCCCTCACTTTGGAGCTGTGGAGGTTGGTTGTTGAATGATGTTAATATCCTCCGTAACATTTGCAATGCTATTACGAATGTATCTACAGTTGATCTCTGATTCCATGGATTCGACTTCGCCAATAAATGAACATTGATGCCTTAGCATCAATACCTGTAGTGATTCCATGGATTGATCTTTGTGGTTTGCACCTCCTTTGCTGTGTCTTATAAGTCCGTCTTTACTCTTTGCTCACTGAGACGCTAACTAGTTGGACCAAATTTTAACTGATTTTGACCATTAAACTCATGGGGTTTATTTCATTTGTAAATGACATACTGCCCCAATGTCGAGTTTTGTCGGCAATGTAATCTGCTCCACAAAAGACTAAATAGCCACCAACCAGGATACCCTATTAAAACAAGAATTATAAATTTATAAAGATGCGTTTGATTGCAGGAGCTGTTCAGTTGTAGCTTTACCACAAAAAAGTACATTCTCAATCTCACGAGTCACATTCATATTATTCCTTTCTCTCTTTGTTACAATTCGCTTGGCTGAGGTTATGATTTTATATTTGGAAGTTTTGAAGTTTAATAAGTATTTTGGTCTTTCAGCATGCTGCCAGATATCTTTCCAATCAGGAATGGGTAAAAAGTGATTCTTTAAAGACGGTGGAAAATACTGCTGTAACTGCCTTCTCTATCAACTATTTAGTTAATTCACGTTGTCAATGATAGATATTATTCCGTACATTTGCTCATCAGTTTCTTTTTGTGTCAGATACCTATTATCATGCTTGTTGTTGAAGTTCCCCATGATCTCATTATTTTGTCCACGTCAAATATGCAATCACCTAAGGAGGAATCCTCTGCTGTATCTGGAAAACAAGATGTCAACATTCTCAATGATATGGCTGGTCTAGAAGATTCTGCGTTGCCAAAATGTTTGGAGGTGAATTATGATACCTCTATTGGCACCAAGTCAGTTCGTATTGACATCAGTTTCAAGACTCCATCACATACAGGACTTCAAACTTCTGAGCTGGTAAGCATAGTACACATCACTTTGAATTTTATCATAATTGTTTATGAATTTCTGAACTTTCCATTTGAAGTCATTTTTTCTTAAAAAGAAAACAAGAAAAAAAATAGAGAACTTGGCATCCATACTGCTCATTAATGAAATTGAAAGTTTGTGTCCTGATTTTCTCAAAGAAGCAGGATGCAGCTGAGCCAATAATAATTATAATTATAATTATAAAGCCTTCAGAAATCTTATCTCTAGTGGAATCTATCGTTTCTTATCCTCAACTAAAAGAACCTTTGGCTGGGAGTTAAACTGACAGGTTTTTACTAGAATCGACGAATGATATTTTCCCATAGGAGTTCTTACATATGGAATTTTATTTAGTAATATCATTATGAGATGGACTTCTCGTAGATCCACCATAATCTTCACGATTAGCAATTCCCTTTTATCGCTTAAGGAGGCAGACATACGCTTATTCAGTCAACTCTTTCTAGTATGCCTATTTACTTTACTATCTTTGTTCAAAAGGCCTACCAAGATTACAAAAGAAATTGAAAAGATCATTAGGGTTTTTCTTCGGGAAGGAGCTAAAGGAGATGCACAATATTAGTTGGTGGGACAAAACTCAATTTCCTACTCTTATGTGTGGCTAGGGCATTGGAAATTTCAAATAACGAAATGAGTCCGTCTTATCTAAATGGATTTGGAGATATCTTTGTGAGGAAGGGGCTATTTGTCAGAAAATTATGAATAACAATTTGGCCCACATTGGCCATCATGTTCTGGTTTAGGCTCTTTCAAGGCTTCTTGGAGGGAAATCTCAGTTTGGTAAAATCACATGTTCGAGGAGTATTGGGCAATGATAATCACATCTCCTTTTTGGCATGGCGTTTGGGTTATGGTATTGCCTTGGTTGCCATGTTTCTAAACCTCTATAGGCTTTCTAATGATATTGATGCAACCATGGATGATCTCTGGAATTTGAAAAATAAGGATCGGGATTTGGGTCTTAGATAGTGTCTTAAAGATGATGAGTTTGTTGAAGTGCCTCTCTTTCTTATCTTTTACCATCTATTTCCTTGACTGATATGAATGATTCATGGAAATGGAATTGAGATTCATCTATAAACTTTACAGTGAGCTCCATGATGGATATTTTTTGTGAAACCTCAAGGCCCTTTTAATAAATCCTTATTTGCTGCAATCTGAATGGATTTCTATTCAAGGAAGATATAGGTTTTTCTTTGGGAGCTTAGTCATTGTGGTATTAATACAGCATATATCTCCAAAGACAAATGCCTTTTCTTTTTTTCTCACCTTCATGTGGATTATGTGTAAAGCCAACTTGGAAACTCATTGCCATTTATTTGGCTCTTGCACTTCGCTCTATGATATTGGAATTTTGTTTTGGGCAGTTTTGGTTGGAATATGATTATGCTACGTTCAAGATTTATTTCCCTTGATTGTGTGGGGCATCCATTTAAAGGAGATGCAAAAGCTTTATGGTTTGCCTTTGACCGTTCATTCTTTTGGTCTCTATGGTGTGAAGGAATGGAAGGATCTTTACTCGATCATCTTTTGAGGGCTCCATGGATTTAGTTTTATTTAATGTTGTTTATTGGTGCAAATGTTCTTTTCCTTTTATAGACTATAGCCTTTCTTCTTCAACACGTAGTTGGAGAATTTTTTTGTAATCCACAAAGGGTGGTTTGGGATTTTTTTTTTTTTTCCTTCTTCATCATTTCATTTATAAATGAAATCGTTCTGTTTCTATATGAAGAAAAGAGGACTTCTCGTTTGTGAAACGTTTTTTTAAATTAATTGATTGATTTTTTTAATGTAGACTTTTTCTTTGAACCTTTATTAGCTTGGATGAGGCATATTTGGTTGTGGATATTTTTGTCTTCCATCCCTCTTCTTTAGATGTGCACTCTCATTTTCTTAGTTTTTCCTCTTTTGCCTTGCGAGTAATTTCTTCTCTCAATTCAATGTATTATAAAGTGAGCAGAAAAGAGGAGAATATTATGCTGTTGGAAAATCGAACTTTCCTCTTCAAATGAGAAATTTCGCGTTTACAAGTGAAAAGTTGCAAGGAAATGTTCAAAGAACAAAGTGTTGCATACAAGGATCAGCTCCCTCATGGAGCTTTCGCAAAAATTCACCCCAGTTTGAATTGATTAGGAAGGAATTATAGCTTTTCGATTCATTATCTCCAGAGATTTAGAAAAATAAATGAAATTTAGCTAACTCAAGCATACCCTCCACTCTTTCATCGTTTTATTGACAATTGTGTTATTTCTTTCCATCCAAATGCTCAAAAGAACTGGTTTGGTAGCATTTTCTTTAAGTATCTTATCTTTCCTCTTGATTGGGTGATGACAACAAAGCAGCTGTGACAAATTATCTCGGAAATAAATATTCTCCGTTTGGTCTGCTTTTTTACAACTTTATTCTCAGCAAAGTTTCACAAGATTGCTGTTTGTATACAAATATTATCTTGTAATTTGTATTTGATTTTATGCAATGTCAACTGCAGGTTAAGGAGCTGACTGAACAATTTCCAGCTACTATACCTTTGGCTTTGGTACTGAAGAAATTTTTGGCAGATCGTAGTCTTGATCAGTCCTATTCTGGCGGTTTAAGTTCTTATTGTTTGGTGCGTTGTTCCAACCTCTTACTGTAG
mRNA sequence
AAAATCCGCCCGCCGTTTCTCAATTGAACATCGTCACTCTCGTCTCCTGTTTGTCTCTGCGACGCGATAAACAAAGAAAGGCCGATGAATTGATCGCAGTGACCGCCGGTGAGTACCTTCCCCGTATCGATCATGACCCAGAACCAGCTTATCGACTCCCTTACATCCCATATCTCTCTCTACCACTCTACATCTGGTAATTTCAACCGTGATCCTAATCCCAATCCCAGGTCCTCGATCCTGAAATGGTTCTCTTCTCTCAGCGTCCACCAACGCCAAGCTCACCTCACGGTCGTTGATTTCAAATTCGTCCAAGTCCTCATCCAAATGGTGGCAGAAGTTCGGAAACGAGGACACGGTTTCTTCATCCTCCTGCCTGACATTCCCTCCTGCGACCCTCTGCACCTACCCAGCTTATGCTTTAAGAAGTCCCGCGGGCTCTTGTCTCGTGTCTCCGAGTCCAGCGTGTCCGAAAGGATGATTTTCGAGTCCAGTCGACTATTCGGTTCCAGGGAAGGCGATAAGCTCGAGGAGTGTTCTTGTTCGTTAAAGAACATCGATTCTTTAACTGTAAGCGAGGATTTCGTCTCAAACGTGGACAAATTTGTCGAGGCAATGGACGGAGTTTCAAATGGGGCGTTTTTGAGAGGTGAAGGGGGTGACATGGCGTCCAATTGGGCTGAGTTAAATTGGTTAAAAGCGAAAGGATATTACAGTATCGAGGCCTTTGTGGCAAACAAGTTGGAGGTGGCTTTGAGACTCTCATGGATGAGCTTGAATAATGGAAAAAAAAGATCGGTAAAGTTCAAAGAAAAGGCTAGCGCAATTGGCATGGCGACAAACGTGTTTTGGAGGAAGAAGGGATGCGTGGACTGGTGGGATAAATTGGATGCTTCGTCAAAGGAAAAAATATTGACAGCAATTCTGGGAAAATCAGCAAAAAGTTTGATACATGAGATTCTGAGGTGGACTAGTGGACTTGCGGAGCATGAGATGGGGCTCTTTAGTGCGGAATGGAATAGACCGTTTAGGTACAATTGTACTATATCTCAACCAAGGTCCATGTTAACATCCCAAGCGGACCTGCATATTGACTTCAACATAATTCCAGCTGCCCATTCTGGAAAACCTTATTTGTTAACCAACATCTTTAGAAATTTGCTTGTGCTTCAGGACATTGTTACGATGGTAACATCGTGTCTTCATGATGAATACTACAAGACTAATCTATTTTATAGCACTTTGGGTTCTATCTGCGCCATCCCTGATTGTATATTAAGAAAATTGCGGGAACTTCTTATGTTTACTTCACTTGATTGCACAAAACTTGAACTTCTAGGAGACGGGACTAGTAAGTCCCTGCCTAGTAAATTAAGAGAGGATCTAGGTGCTTCCCGTCGAAGGAAAAAGGGAAAGAGCCGGAAGTCGCAGAATCCTGTGCTGAGGGCATGCGCGGATGATTTATCATGCAATAAATTTCTGAAGGTACATGACGATAACACGAGTGTTGGAAAAGATCAAGGCACTGCAAGGAGGAAGAAAAAACACAAGAGTAAAAACTCTTGTGGGAACAGCAGATTAGTTGAAATAAAACCTTCTGTTGGGCCAGCCGTTAAATTTTCCTCTCCTTTTAGTTCTCAGGATCAGGTAGCAGAGTTGGATAATATAATCAGAAAACCTTCCATCTCAAGTATCAAGAATGATAGTTCAAATAATTATGAGAGTTCAACATTAAACTCAAGTCCTCTAGTTCCCTCTATCGAACCTAATAGCGAGTATGACAGTAGCCAAAATATTGAAGTATATGAAGTTTCTGGGTTAGCAAAATCTGTCTGCCAAATTGGTCCTGGAGAATCTCAGTTCCCAAAAGGAATAATTGAAAACCAACGCTTATCATCTACTTTGGAAACTTCTACATCTTTTATGGATTGTAGTGTAGTACCTTCTCATTTGCCTTCATTAAAGCTAAAGAATATCGTCAAAAGTGATGTTAATGTGAAGGGTTCTGTGCAAACTTACGAATTAAGAGATAAATCATCTTTGTTGGATAAGCTTCCAAGAACCATTGATGTAAAGGAGAAAGTATGCTTATCTCGACATCAGCTTAGTGGTGATGCTTGTAATACTAAGGCCTTGAATTCCTTGAAACATTCTCCCTATGAATGGCATGGTGTAGCTTCTTTGTATATCCCACCATTCAATTCACATCTCCCACCTGCTACTGATAGACTACATTTAGATGTTGGTCATAATTGGCACAACCATTTCCGTCGGTCCTTTGCACCTGCAATGCATCAATCAAGAAATTCTTCTGTTAAAGGTGTTTGTAATCCAGTTATGACTCGACCAGTGTTAATGAGTCTAGATTGGCCCCCAGTCTTACGGAGTGCTTCTGGCCTGGCTTCAACAATGATGTCAAATCATGATATTGGGTTTCTTACTAGGAGACAATCTTCTTTTTGTCAGGGGTTCCCCACTAACAGCAATCAAATTAGCACGGAAGATGAGTACTCTGGTAATCTCACTGATTTTCCTGATTTGTCAAACAATCAAGATCTAGCAGAGGAGTGTGATGGAAACTGGATATCGGAGGAAGAATTGGAAATGCATGCAGTTTCTGGGATAGACTATAATCAGTATTTTGGTGGTGGTGTAATGTACTGGAACCCTTCTGATCATCATGGGACAGGGTTCTCTCGACCTCCTTCTCTGAGTTCTGATGATAGCTCATGGGCTTGGCGTGAAGCTGACATGAACAGGACTGTTGATGATATGGTTGCTTTCTCTTCTTCTTACAGTAATGGGTTGACTTCCCCAACTTCTACTTCATTTTGTTCTCCTTCTGATCCAGTGGGTTCTGGAAAGCAGGCTCTTGGTTATGTGGTTCAAGGGTCTGATCTACCTAACAACATGCTTCATTCCTCACCAACTATGAAAGACACGGTGACAGAGGAGGATGCTCCTAGATCTTTGCCAAATTTGCCCAGTGATGTTGAAGGGAAGACAGGCGACTCACATTCATTTCCAATCTTGCGCCCTATTGTTGTTCCAAGTATGTCAAGGGAAAGATCAAGATCTGAGTTCTGCCATGGTCGTGATCATAAAAGCCCATGTATCCCTCCCACTAGGAGAGAGCAATCTCGAGTAAAGCGTCCACCATCTCCAGTAGTTCTTTGTGTTCCACGGGCGCCAATACCACCTCCACCTTCTCCTGTAAGTGATTCCAGGAAGCAGAGAGGGTTTCCAACTGTTAGATCTGGTAGCTCAAGTCCAAGGCATTGGGGTGTGAAGGGTTGGTATCCTGATGGAACTAATTTGGAAGAAGCATGCTTGCGTATTGATGGTGCTGAAGTAATATGGCCTAATTGGAGAAATAAAAGTAAATCTAATTGCTCGACAGTTCAACCTTTATCATTAATAGCAATGTCCCAGATAGCTATCGATCAGGAACGTCTAGATGTTGCATTTCCTCTCTTTCCACCTACTAGTGGTCGCTCTGTAAAAAAGGAATCTCTTTCTTTGATCCATAGCCGCCTACATGATGAGATCGACTCTTTCTGCAAGCATGTTGCTGCAGAAAACATGGCTAAGAAGCCTTACATCACTTGGGCTGTTAAACGGGTCACACGGTCCCTTCAAGTCTTATGGCCCAGGTCTAGGACAAACATTTTTGGTTCAAATGCAACTGGTTTGTCCCTCCCCACGAGTGATGTGGATCTTGTGGTTTGTCTGCCTCCAGTGAGAAATTTGGAACCTATTAAAGAAGCTGGGATCTTAGAGGGACGTAATGGTATCAAAGAGACCTGCCTCCAGCATGCTGCCAGATATCTTTCCAATCAGGAATGGGTAAAAAGTGATTCTTTAAAGACGGTGGAAAATACTGCTATACCTATTATCATGCTTGTTGTTGAAGTTCCCCATGATCTCATTATTTTGTCCACGTCAAATATGCAATCACCTAAGGAGGAATCCTCTGCTGTATCTGGAAAACAAGATGTCAACATTCTCAATGATATGGCTGGTCTAGAAGATTCTGCGTTGCCAAAATGTTTGGAGGTGAATTATGATACCTCTATTGGCACCAAGTCAGTTCGTATTGACATCAGTTTCAAGACTCCATCACATACAGGACTTCAAACTTCTGAGCTGGTTAAGGAGCTGACTGAACAATTTCCAGCTACTATACCTTTGGCTTTGGTACTGAAGAAATTTTTGGCAGATCGTAGTCTTGATCAGTCCTATTCTGGCGGTTTAAGTTCTTATTGTTTGGTGCGTTGTTCCAACCTCTTACTGTAG
Coding sequence (CDS)
ATGACCCAGAACCAGCTTATCGACTCCCTTACATCCCATATCTCTCTCTACCACTCTACATCTGGTAATTTCAACCGTGATCCTAATCCCAATCCCAGGTCCTCGATCCTGAAATGGTTCTCTTCTCTCAGCGTCCACCAACGCCAAGCTCACCTCACGGTCGTTGATTTCAAATTCGTCCAAGTCCTCATCCAAATGGTGGCAGAAGTTCGGAAACGAGGACACGGTTTCTTCATCCTCCTGCCTGACATTCCCTCCTGCGACCCTCTGCACCTACCCAGCTTATGCTTTAAGAAGTCCCGCGGGCTCTTGTCTCGTGTCTCCGAGTCCAGCGTGTCCGAAAGGATGATTTTCGAGTCCAGTCGACTATTCGGTTCCAGGGAAGGCGATAAGCTCGAGGAGTGTTCTTGTTCGTTAAAGAACATCGATTCTTTAACTGTAAGCGAGGATTTCGTCTCAAACGTGGACAAATTTGTCGAGGCAATGGACGGAGTTTCAAATGGGGCGTTTTTGAGAGGTGAAGGGGGTGACATGGCGTCCAATTGGGCTGAGTTAAATTGGTTAAAAGCGAAAGGATATTACAGTATCGAGGCCTTTGTGGCAAACAAGTTGGAGGTGGCTTTGAGACTCTCATGGATGAGCTTGAATAATGGAAAAAAAAGATCGGTAAAGTTCAAAGAAAAGGCTAGCGCAATTGGCATGGCGACAAACGTGTTTTGGAGGAAGAAGGGATGCGTGGACTGGTGGGATAAATTGGATGCTTCGTCAAAGGAAAAAATATTGACAGCAATTCTGGGAAAATCAGCAAAAAGTTTGATACATGAGATTCTGAGGTGGACTAGTGGACTTGCGGAGCATGAGATGGGGCTCTTTAGTGCGGAATGGAATAGACCGTTTAGGTACAATTGTACTATATCTCAACCAAGGTCCATGTTAACATCCCAAGCGGACCTGCATATTGACTTCAACATAATTCCAGCTGCCCATTCTGGAAAACCTTATTTGTTAACCAACATCTTTAGAAATTTGCTTGTGCTTCAGGACATTGTTACGATGGTAACATCGTGTCTTCATGATGAATACTACAAGACTAATCTATTTTATAGCACTTTGGGTTCTATCTGCGCCATCCCTGATTGTATATTAAGAAAATTGCGGGAACTTCTTATGTTTACTTCACTTGATTGCACAAAACTTGAACTTCTAGGAGACGGGACTAGTAAGTCCCTGCCTAGTAAATTAAGAGAGGATCTAGGTGCTTCCCGTCGAAGGAAAAAGGGAAAGAGCCGGAAGTCGCAGAATCCTGTGCTGAGGGCATGCGCGGATGATTTATCATGCAATAAATTTCTGAAGGTACATGACGATAACACGAGTGTTGGAAAAGATCAAGGCACTGCAAGGAGGAAGAAAAAACACAAGAGTAAAAACTCTTGTGGGAACAGCAGATTAGTTGAAATAAAACCTTCTGTTGGGCCAGCCGTTAAATTTTCCTCTCCTTTTAGTTCTCAGGATCAGGTAGCAGAGTTGGATAATATAATCAGAAAACCTTCCATCTCAAGTATCAAGAATGATAGTTCAAATAATTATGAGAGTTCAACATTAAACTCAAGTCCTCTAGTTCCCTCTATCGAACCTAATAGCGAGTATGACAGTAGCCAAAATATTGAAGTATATGAAGTTTCTGGGTTAGCAAAATCTGTCTGCCAAATTGGTCCTGGAGAATCTCAGTTCCCAAAAGGAATAATTGAAAACCAACGCTTATCATCTACTTTGGAAACTTCTACATCTTTTATGGATTGTAGTGTAGTACCTTCTCATTTGCCTTCATTAAAGCTAAAGAATATCGTCAAAAGTGATGTTAATGTGAAGGGTTCTGTGCAAACTTACGAATTAAGAGATAAATCATCTTTGTTGGATAAGCTTCCAAGAACCATTGATGTAAAGGAGAAAGTATGCTTATCTCGACATCAGCTTAGTGGTGATGCTTGTAATACTAAGGCCTTGAATTCCTTGAAACATTCTCCCTATGAATGGCATGGTGTAGCTTCTTTGTATATCCCACCATTCAATTCACATCTCCCACCTGCTACTGATAGACTACATTTAGATGTTGGTCATAATTGGCACAACCATTTCCGTCGGTCCTTTGCACCTGCAATGCATCAATCAAGAAATTCTTCTGTTAAAGGTGTTTGTAATCCAGTTATGACTCGACCAGTGTTAATGAGTCTAGATTGGCCCCCAGTCTTACGGAGTGCTTCTGGCCTGGCTTCAACAATGATGTCAAATCATGATATTGGGTTTCTTACTAGGAGACAATCTTCTTTTTGTCAGGGGTTCCCCACTAACAGCAATCAAATTAGCACGGAAGATGAGTACTCTGGTAATCTCACTGATTTTCCTGATTTGTCAAACAATCAAGATCTAGCAGAGGAGTGTGATGGAAACTGGATATCGGAGGAAGAATTGGAAATGCATGCAGTTTCTGGGATAGACTATAATCAGTATTTTGGTGGTGGTGTAATGTACTGGAACCCTTCTGATCATCATGGGACAGGGTTCTCTCGACCTCCTTCTCTGAGTTCTGATGATAGCTCATGGGCTTGGCGTGAAGCTGACATGAACAGGACTGTTGATGATATGGTTGCTTTCTCTTCTTCTTACAGTAATGGGTTGACTTCCCCAACTTCTACTTCATTTTGTTCTCCTTCTGATCCAGTGGGTTCTGGAAAGCAGGCTCTTGGTTATGTGGTTCAAGGGTCTGATCTACCTAACAACATGCTTCATTCCTCACCAACTATGAAAGACACGGTGACAGAGGAGGATGCTCCTAGATCTTTGCCAAATTTGCCCAGTGATGTTGAAGGGAAGACAGGCGACTCACATTCATTTCCAATCTTGCGCCCTATTGTTGTTCCAAGTATGTCAAGGGAAAGATCAAGATCTGAGTTCTGCCATGGTCGTGATCATAAAAGCCCATGTATCCCTCCCACTAGGAGAGAGCAATCTCGAGTAAAGCGTCCACCATCTCCAGTAGTTCTTTGTGTTCCACGGGCGCCAATACCACCTCCACCTTCTCCTGTAAGTGATTCCAGGAAGCAGAGAGGGTTTCCAACTGTTAGATCTGGTAGCTCAAGTCCAAGGCATTGGGGTGTGAAGGGTTGGTATCCTGATGGAACTAATTTGGAAGAAGCATGCTTGCGTATTGATGGTGCTGAAGTAATATGGCCTAATTGGAGAAATAAAAGTAAATCTAATTGCTCGACAGTTCAACCTTTATCATTAATAGCAATGTCCCAGATAGCTATCGATCAGGAACGTCTAGATGTTGCATTTCCTCTCTTTCCACCTACTAGTGGTCGCTCTGTAAAAAAGGAATCTCTTTCTTTGATCCATAGCCGCCTACATGATGAGATCGACTCTTTCTGCAAGCATGTTGCTGCAGAAAACATGGCTAAGAAGCCTTACATCACTTGGGCTGTTAAACGGGTCACACGGTCCCTTCAAGTCTTATGGCCCAGGTCTAGGACAAACATTTTTGGTTCAAATGCAACTGGTTTGTCCCTCCCCACGAGTGATGTGGATCTTGTGGTTTGTCTGCCTCCAGTGAGAAATTTGGAACCTATTAAAGAAGCTGGGATCTTAGAGGGACGTAATGGTATCAAAGAGACCTGCCTCCAGCATGCTGCCAGATATCTTTCCAATCAGGAATGGGTAAAAAGTGATTCTTTAAAGACGGTGGAAAATACTGCTATACCTATTATCATGCTTGTTGTTGAAGTTCCCCATGATCTCATTATTTTGTCCACGTCAAATATGCAATCACCTAAGGAGGAATCCTCTGCTGTATCTGGAAAACAAGATGTCAACATTCTCAATGATATGGCTGGTCTAGAAGATTCTGCGTTGCCAAAATGTTTGGAGGTGAATTATGATACCTCTATTGGCACCAAGTCAGTTCGTATTGACATCAGTTTCAAGACTCCATCACATACAGGACTTCAAACTTCTGAGCTGGTTAAGGAGCTGACTGAACAATTTCCAGCTACTATACCTTTGGCTTTGGTACTGAAGAAATTTTTGGCAGATCGTAGTCTTGATCAGTCCTATTCTGGCGGTTTAAGTTCTTATTGTTTGGTGCGTTGTTCCAACCTCTTACTGTAG
Protein sequence
MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFVQVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFESSRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMASNWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFWRKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFRYNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDEYYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGASRRRKKGKSRKSQNPVLRACADDLSCNKFLKVHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLALVLKKFLADRSLDQSYSGGLSSYCLVRCSNLLL
Homology
BLAST of CmoCh14G022320 vs. ExPASy Swiss-Prot
Match:
Q7KVS9 (Non-canonical poly(A) RNA polymerase protein Trf4-1 OS=Drosophila melanogaster OX=7227 GN=Trf4-1 PE=1 SV=1)
HSP 1 Score: 71.6 bits (174), Expect = 7.7e-11
Identity = 63/246 (25.61%), Postives = 99/246 (40.24%), Query Frame = 0
Query: 1135 LHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDV 1194
LH+EI+ F ++V + VKR+ + +WP++ IFGS TGL LPTSD+
Sbjct: 271 LHEEIEHFYQYV-LPTPCEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPTSDI 330
Query: 1195 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPI 1254
DLVV + P++ GI E C +++ ++ ++PI
Sbjct: 331 DLVVL--GLWEKLPLRTLEFELVSRGIAEAC-----------------TVRVLDKASVPI 390
Query: 1255 IMLVVEVPHDLIILSTSNMQSPKEESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTS 1314
I L
Sbjct: 391 IKLTDR------------------------------------------------------ 438
Query: 1315 IGTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLALVLKKFLADRSLDQSYSGGL 1374
V++DISF S G+Q++EL+K+ +P L LVLK+FL R L++ ++GG+
Sbjct: 451 --ETQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLVLKQFLLLRDLNEVFTGGI 438
Query: 1375 SSYCLV 1381
SSY L+
Sbjct: 511 SSYSLI 438
BLAST of CmoCh14G022320 vs. ExPASy Swiss-Prot
Match:
Q8NDF8 (Terminal nucleotidyltransferase 4B OS=Homo sapiens OX=9606 GN=TENT4B PE=1 SV=2)
HSP 1 Score: 70.9 bits (172), Expect = 1.3e-10
Identity = 66/250 (26.40%), Postives = 102/250 (40.80%), Query Frame = 0
Query: 1135 LHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDV 1194
LH+EI F ++++ +K + V R+ ++ LWP + IFGS TGL LPTSD+
Sbjct: 120 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 179
Query: 1195 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSNQEWVKSDSLKTVEN 1254
DLVV LP L ++EA L + DS+K ++
Sbjct: 180 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 239
Query: 1255 TAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVSGKQDVNILNDMAGLEDSALPKCLEV 1314
+PII L DS
Sbjct: 240 ATVPIIKLT-----------------------------------------DS-------- 286
Query: 1315 NYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLALVLKKFLADRSLDQS 1374
V++DISF G++ ++L+K+ T+++P L LVLK+FL R L++
Sbjct: 300 -------FTEVKVDISFNV--QNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEV 286
Query: 1375 YSGGLSSYCL 1380
++GG+ SY L
Sbjct: 360 FTGGIGSYSL 286
BLAST of CmoCh14G022320 vs. ExPASy Swiss-Prot
Match:
Q68ED3 (Terminal nucleotidyltransferase 4B OS=Mus musculus OX=10090 GN=Tent4b PE=1 SV=2)
HSP 1 Score: 70.9 bits (172), Expect = 1.3e-10
Identity = 66/250 (26.40%), Postives = 102/250 (40.80%), Query Frame = 0
Query: 1135 LHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDV 1194
LH+EI F ++++ +K + V R+ ++ LWP + IFGS TGL LPTSD+
Sbjct: 134 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 193
Query: 1195 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSNQEWVKSDSLKTVEN 1254
DLVV LP L ++EA L + DS+K ++
Sbjct: 194 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 253
Query: 1255 TAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVSGKQDVNILNDMAGLEDSALPKCLEV 1314
+PII L DS
Sbjct: 254 ATVPIIKLT-----------------------------------------DS-------- 300
Query: 1315 NYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLALVLKKFLADRSLDQS 1374
V++DISF G++ ++L+K+ T+++P L LVLK+FL R L++
Sbjct: 314 -------FTEVKVDISFNV--QNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEV 300
Query: 1375 YSGGLSSYCL 1380
++GG+ SY L
Sbjct: 374 FTGGIGSYSL 300
BLAST of CmoCh14G022320 vs. ExPASy Swiss-Prot
Match:
Q5XG87 (Terminal nucleotidyltransferase 4A OS=Homo sapiens OX=9606 GN=TENT4A PE=1 SV=3)
HSP 1 Score: 70.1 bits (170), Expect = 2.2e-10
Identity = 69/270 (25.56%), Postives = 107/270 (39.63%), Query Frame = 0
Query: 1117 PTSGRSVKKESLSLIHSRLHDEIDSFCKHVA--AENMAKKPYITWAVKRVTRSLQVLWPR 1176
P G K + S LH+EI F ++ E A + + VKR+ ++ LWP
Sbjct: 202 PRPGTPWKSRAYSPGIQGLHEEIIDFYNFMSPCPEEAAMRREV---VKRIETVVKDLWPT 261
Query: 1177 SRTNIFGSNATGLSLPTSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAA 1236
+ IFGS +TGL LPTSD+DLVV PP++ LE ++ + E C
Sbjct: 262 ADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC----- 321
Query: 1237 RYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVSGKQDVN 1296
S+K ++ +PII L +
Sbjct: 322 ------------SIKVLDKATVPIIKLTDQ------------------------------ 381
Query: 1297 ILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELTEQFPAT 1356
V++DISF TG++ +E +K +++
Sbjct: 382 --------------------------ETEVKVDISFN--METGVRAAEFIKNYMKKYSLL 387
Query: 1357 IPLALVLKKFLADRSLDQSYSGGLSSYCLV 1381
L LVLK+FL R L++ ++GG+SSY L+
Sbjct: 442 PYLILVLKQFLLQRDLNEVFTGGISSYSLI 387
BLAST of CmoCh14G022320 vs. ExPASy Swiss-Prot
Match:
Q6PB75 (Terminal nucleotidyltransferase 4A OS=Mus musculus OX=10090 GN=Tent4a PE=2 SV=2)
HSP 1 Score: 66.6 bits (161), Expect = 2.5e-09
Identity = 58/224 (25.89%), Postives = 90/224 (40.18%), Query Frame = 0
Query: 1161 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVC----LPPVRNLEPIKEAGILE 1220
VKR+ ++ LWP + IFGS +TGL LPTSD+DLVV PP++ LE
Sbjct: 15 VKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR----- 74
Query: 1221 GRNGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSP 1280
++ + E C S+K ++ +PII L +
Sbjct: 75 -KHNVAEPC-----------------SIKVLDKATVPIIKLTDQ---------------- 134
Query: 1281 KEESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQT 1340
V++DISF TG++
Sbjct: 135 ----------------------------------------ETEVKVDISFN--METGVRA 157
Query: 1341 SELVKELTEQFPATIPLALVLKKFLADRSLDQSYSGGLSSYCLV 1381
+E +K +++ L LVLK+FL R L++ ++GG+SSY L+
Sbjct: 195 AEFIKNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLI 157
BLAST of CmoCh14G022320 vs. ExPASy TrEMBL
Match:
A0A6J1ECL0 (uncharacterized protein LOC111431966 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111431966 PE=4 SV=1)
HSP 1 Score: 2740.3 bits (7102), Expect = 0.0e+00
Identity = 1380/1416 (97.46%), Postives = 1380/1416 (97.46%), Query Frame = 0
Query: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV
Sbjct: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
Query: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES
Sbjct: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
Query: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS
Sbjct: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
Query: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW
Sbjct: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
Query: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR
Sbjct: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
Query: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE
Sbjct: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
Query: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA
Sbjct: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
Query: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLK----------------------------- 480
SRRRKKGKSRKSQNPVLRACADDLSCNKFLK
Sbjct: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLKEFDKECAHKGREDIAESTTMSIMSKRNET 480
Query: 481 -------VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSSQD 540
VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSSQD
Sbjct: 481 CREISSDVHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSSQD 540
Query: 541 QVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSGLA 600
QVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSGLA
Sbjct: 541 QVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSGLA 600
Query: 601 KSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNVKG 660
KSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNVKG
Sbjct: 601 KSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNVKG 660
Query: 661 SVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVASL 720
SVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVASL
Sbjct: 661 SVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVASL 720
Query: 721 YIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLMSL 780
YIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLMSL
Sbjct: 721 YIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLMSL 780
Query: 781 DWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPDLS 840
DWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPDLS
Sbjct: 781 DWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPDLS 840
Query: 841 NNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSSDD 900
NNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSSDD
Sbjct: 841 NNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSSDD 900
Query: 901 SSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSDLP 960
SSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSDLP
Sbjct: 901 SSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSDLP 960
Query: 961 NNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRSEF 1020
NNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRSEF
Sbjct: 961 NNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRSEF 1020
Query: 1021 CHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRSGS 1080
CHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRSGS
Sbjct: 1021 CHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRSGS 1080
Query: 1081 SSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIAID 1140
SSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIAID
Sbjct: 1081 SSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIAID 1140
Query: 1141 QERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRV 1200
QERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRV
Sbjct: 1141 QERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRV 1200
Query: 1201 TRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1260
TRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET
Sbjct: 1201 TRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1260
Query: 1261 CLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVS 1320
CLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVS
Sbjct: 1261 CLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSAVS 1320
Query: 1321 GKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELT 1380
GKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELT
Sbjct: 1321 GKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKELT 1380
BLAST of CmoCh14G022320 vs. ExPASy TrEMBL
Match:
A0A6J1E927 (uncharacterized protein LOC111431966 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111431966 PE=4 SV=1)
HSP 1 Score: 2739.5 bits (7100), Expect = 0.0e+00
Identity = 1380/1418 (97.32%), Postives = 1380/1418 (97.32%), Query Frame = 0
Query: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV
Sbjct: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
Query: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES
Sbjct: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
Query: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS
Sbjct: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
Query: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW
Sbjct: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
Query: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR
Sbjct: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
Query: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE
Sbjct: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
Query: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA
Sbjct: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
Query: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLK----------------------------- 480
SRRRKKGKSRKSQNPVLRACADDLSCNKFLK
Sbjct: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLKPQEFDKECAHKGREDIAESTTMSIMSKRN 480
Query: 481 ---------VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSS 540
VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSS
Sbjct: 481 ETCREISSDVHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPFSS 540
Query: 541 QDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSG 600
QDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSG
Sbjct: 541 QDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEVSG 600
Query: 601 LAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNV 660
LAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNV
Sbjct: 601 LAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNV 660
Query: 661 KGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVA 720
KGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVA
Sbjct: 661 KGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVA 720
Query: 721 SLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLM 780
SLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLM
Sbjct: 721 SLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLM 780
Query: 781 SLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPD 840
SLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPD
Sbjct: 781 SLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPD 840
Query: 841 LSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSS 900
LSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSS
Sbjct: 841 LSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSS 900
Query: 901 DDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSD 960
DDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSD
Sbjct: 901 DDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSD 960
Query: 961 LPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRS 1020
LPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRS
Sbjct: 961 LPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRS 1020
Query: 1021 EFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRS 1080
EFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRS
Sbjct: 1021 EFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRS 1080
Query: 1081 GSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIA 1140
GSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIA
Sbjct: 1081 GSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQIA 1140
Query: 1141 IDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVK 1200
IDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVK
Sbjct: 1141 IDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWAVK 1200
Query: 1201 RVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK 1260
RVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK
Sbjct: 1201 RVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK 1260
Query: 1261 ETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSA 1320
ETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSA
Sbjct: 1261 ETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEESSA 1320
Query: 1321 VSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKE 1380
VSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKE
Sbjct: 1321 VSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELVKE 1380
BLAST of CmoCh14G022320 vs. ExPASy TrEMBL
Match:
A0A6J1E9K0 (uncharacterized protein LOC111431966 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431966 PE=4 SV=1)
HSP 1 Score: 2738.8 bits (7098), Expect = 0.0e+00
Identity = 1380/1420 (97.18%), Postives = 1380/1420 (97.18%), Query Frame = 0
Query: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV
Sbjct: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
Query: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES
Sbjct: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
Query: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS
Sbjct: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
Query: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW
Sbjct: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
Query: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR
Sbjct: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
Query: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE
Sbjct: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
Query: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA
Sbjct: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
Query: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLK----------------------------- 480
SRRRKKGKSRKSQNPVLRACADDLSCNKFLK
Sbjct: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLKEFDKECAHKGREDIAESTTMSIMSKRNET 480
Query: 481 -----------VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPF 540
VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPF
Sbjct: 481 CREISSDVSKTVHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSSPF 540
Query: 541 SSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEV 600
SSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEV
Sbjct: 541 SSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVYEV 600
Query: 601 SGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDV 660
SGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDV
Sbjct: 601 SGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDV 660
Query: 661 NVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHG 720
NVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHG
Sbjct: 661 NVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHG 720
Query: 721 VASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPV 780
VASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPV
Sbjct: 721 VASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPV 780
Query: 781 LMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDF 840
LMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDF
Sbjct: 781 LMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDF 840
Query: 841 PDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSL 900
PDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSL
Sbjct: 841 PDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSL 900
Query: 901 SSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQG 960
SSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQG
Sbjct: 901 SSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVVQG 960
Query: 961 SDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERS 1020
SDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERS
Sbjct: 961 SDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERS 1020
Query: 1021 RSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTV 1080
RSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTV
Sbjct: 1021 RSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTV 1080
Query: 1081 RSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQ 1140
RSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQ
Sbjct: 1081 RSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAMSQ 1140
Query: 1141 IAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWA 1200
IAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWA
Sbjct: 1141 IAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYITWA 1200
Query: 1201 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNG 1260
VKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNG
Sbjct: 1201 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNG 1260
Query: 1261 IKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEES 1320
IKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEES
Sbjct: 1261 IKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKEES 1320
Query: 1321 SAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELV 1380
SAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELV
Sbjct: 1321 SAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSELV 1380
BLAST of CmoCh14G022320 vs. ExPASy TrEMBL
Match:
A0A6J1EF53 (uncharacterized protein LOC111431966 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431966 PE=4 SV=1)
HSP 1 Score: 2738.0 bits (7096), Expect = 0.0e+00
Identity = 1380/1422 (97.05%), Postives = 1380/1422 (97.05%), Query Frame = 0
Query: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV
Sbjct: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
Query: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES
Sbjct: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
Query: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS
Sbjct: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
Query: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW
Sbjct: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
Query: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR
Sbjct: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
Query: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE
Sbjct: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
Query: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA
Sbjct: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
Query: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLK----------------------------- 480
SRRRKKGKSRKSQNPVLRACADDLSCNKFLK
Sbjct: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLKPQEFDKECAHKGREDIAESTTMSIMSKRN 480
Query: 481 -------------VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSS 540
VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSS
Sbjct: 481 ETCREISSDVSKTVHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSS 540
Query: 541 PFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVY 600
PFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVY
Sbjct: 541 PFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVY 600
Query: 601 EVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKS 660
EVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKS
Sbjct: 601 EVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKS 660
Query: 661 DVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEW 720
DVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEW
Sbjct: 661 DVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEW 720
Query: 721 HGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTR 780
HGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTR
Sbjct: 721 HGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTR 780
Query: 781 PVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLT 840
PVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLT
Sbjct: 781 PVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLT 840
Query: 841 DFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPP 900
DFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPP
Sbjct: 841 DFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPP 900
Query: 901 SLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVV 960
SLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVV
Sbjct: 901 SLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVV 960
Query: 961 QGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRE 1020
QGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRE
Sbjct: 961 QGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRE 1020
Query: 1021 RSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFP 1080
RSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFP
Sbjct: 1021 RSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFP 1080
Query: 1081 TVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAM 1140
TVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAM
Sbjct: 1081 TVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAM 1140
Query: 1141 SQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYIT 1200
SQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYIT
Sbjct: 1141 SQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYIT 1200
Query: 1201 WAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGR 1260
WAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGR
Sbjct: 1201 WAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGR 1260
Query: 1261 NGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKE 1320
NGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKE
Sbjct: 1261 NGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKE 1320
Query: 1321 ESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSE 1380
ESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSE
Sbjct: 1321 ESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSE 1380
BLAST of CmoCh14G022320 vs. ExPASy TrEMBL
Match:
A0A6J1E9B4 (uncharacterized protein LOC111431966 isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111431966 PE=4 SV=1)
HSP 1 Score: 2738.0 bits (7096), Expect = 0.0e+00
Identity = 1380/1422 (97.05%), Postives = 1380/1422 (97.05%), Query Frame = 0
Query: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV
Sbjct: 1 MTQNQLIDSLTSHISLYHSTSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKFV 60
Query: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES
Sbjct: 61 QVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFES 120
Query: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS
Sbjct: 121 SRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMAS 180
Query: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW
Sbjct: 181 NWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVFW 240
Query: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR
Sbjct: 241 RKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPFR 300
Query: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE
Sbjct: 301 YNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHDE 360
Query: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA
Sbjct: 361 YYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLPSKLREDLGA 420
Query: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLK----------------------------- 480
SRRRKKGKSRKSQNPVLRACADDLSCNKFLK
Sbjct: 421 SRRRKKGKSRKSQNPVLRACADDLSCNKFLKPQEFDKECAHKGREDIAESTTMSIMSKRN 480
Query: 481 -------------VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSS 540
VHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSS
Sbjct: 481 ETCREISSDVSKTVHDDNTSVGKDQGTARRKKKHKSKNSCGNSRLVEIKPSVGPAVKFSS 540
Query: 541 PFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVY 600
PFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVY
Sbjct: 541 PFSSQDQVAELDNIIRKPSISSIKNDSSNNYESSTLNSSPLVPSIEPNSEYDSSQNIEVY 600
Query: 601 EVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKS 660
EVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKS
Sbjct: 601 EVSGLAKSVCQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKS 660
Query: 661 DVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEW 720
DVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEW
Sbjct: 661 DVNVKGSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEW 720
Query: 721 HGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTR 780
HGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTR
Sbjct: 721 HGVASLYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTR 780
Query: 781 PVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLT 840
PVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLT
Sbjct: 781 PVLMSLDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLT 840
Query: 841 DFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPP 900
DFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPP
Sbjct: 841 DFPDLSNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPP 900
Query: 901 SLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVV 960
SLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVV
Sbjct: 901 SLSSDDSSWAWREADMNRTVDDMVAFSSSYSNGLTSPTSTSFCSPSDPVGSGKQALGYVV 960
Query: 961 QGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRE 1020
QGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRE
Sbjct: 961 QGSDLPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRE 1020
Query: 1021 RSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFP 1080
RSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFP
Sbjct: 1021 RSRSEFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFP 1080
Query: 1081 TVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAM 1140
TVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAM
Sbjct: 1081 TVRSGSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPLSLIAM 1140
Query: 1141 SQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYIT 1200
SQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYIT
Sbjct: 1141 SQIAIDQERLDVAFPLFPPTSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKKPYIT 1200
Query: 1201 WAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGR 1260
WAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGR
Sbjct: 1201 WAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGR 1260
Query: 1261 NGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKE 1320
NGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKE
Sbjct: 1261 NGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQSPKE 1320
Query: 1321 ESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSE 1380
ESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSE
Sbjct: 1321 ESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGLQTSE 1380
BLAST of CmoCh14G022320 vs. TAIR 10
Match:
AT4G00060.1 (Nucleotidyltransferase family protein )
HSP 1 Score: 1266.1 bits (3275), Expect = 0.0e+00
Identity = 728/1426 (51.05%), Postives = 928/1426 (65.08%), Query Frame = 0
Query: 1 MTQNQLIDSLTSHISLYHS-TSGNFNRDPNPNPRSSILKWFSSLSVHQRQAHLTVVDFKF 60
M QNQLIDSLTSHISLYHS +S + + PNPRS+IL+WFSSLSVHQR +HLTVVD KF
Sbjct: 17 MAQNQLIDSLTSHISLYHSHSSSSSMANTIPNPRSAILRWFSSLSVHQRLSHLTVVDPKF 76
Query: 61 VQVLIQMVAEVRKRGHGFFILLPDIPSCDPLHLPSLCFKKSRGLLSRVSESSVSERMIFE 120
VQ+L+QM+ +R +G FI+LPD+PS LPSLCFKKSRGL+SRVSES+ SER +F+
Sbjct: 77 VQILLQMLGYIRTKGPCSFIILPDLPSSS--DLPSLCFKKSRGLISRVSESNESERFVFD 136
Query: 121 SSRLFGSREGDKLEECSCSLKNIDSLTVSEDFVSNVDKFVEAMDGVSNGAFLRGEGGDMA 180
S+RLFGS EG++ ++CSCS+ ++DS+ ++E+F++NVD+FVE MD +S+GAFLRGE D+
Sbjct: 137 STRLFGSGEGERAQDCSCSVNSLDSVVMAEEFLTNVDRFVETMDVLSDGAFLRGEESDLG 196
Query: 181 SNWAELNWLKAKGYYSIEAFVANKLEVALRLSWMSLNNGKKRSVKFKEKASAIGMATNVF 240
SNW EL WLKAKGYYS+EAFVAN+LEV++RL+W++ N+GK+R +K KEK +A A N +
Sbjct: 197 SNWVELEWLKAKGYYSMEAFVANRLEVSMRLAWLNTNSGKRRGIKLKEKLNAAAAAANSY 256
Query: 241 WRKKGCVDWWDKLDASSKEKILTAILGKSAKSLIHEILRWTSGLAEHEMGLFSAEWNRPF 300
WRKK CVDWW LDA++ +KI T + GKSAKS+I+EILR + + EM LF+ R
Sbjct: 257 WRKKACVDWWQNLDAATHKKIWTCLFGKSAKSVIYEILREANQAQQGEMWLFNFASARKG 316
Query: 301 RYNCTISQPRSMLTSQADLHIDFNIIPAAHSGKPYLLTNIFRNLLVLQDIVTMVTSCLHD 360
R + + S D+ ++ N +P KP + + L VLQ+ +++ C +
Sbjct: 317 RTD-------TSAVSFCDMILEPNSVPR----KPITVASNLSGLYVLQEFASLLILCQNG 376
Query: 361 EYYKTNLFYSTLGSICAIPDCILRKLRELLMFTSLDCTKLELLGDGTSKSLP-SKLREDL 420
++F+S++G+I + DCILRKLR LM S+D K ELL D T K P S + L
Sbjct: 377 LVPVHSVFFSSMGTITTLVDCILRKLRGFLMVISIDSVKSELLDDNTHKCSPSSSSNQKL 436
Query: 421 GASRRRKKGKSRKSQNPVLRACADDLSCNKFLKVHDDNTSVGKDQGTARRKKKHKSKNSC 480
G++ R++KGK+R + P A + D N ++ G KK ++K
Sbjct: 437 GSTNRKQKGKTRNMKKPTPEAKS------------DKNVNLSTKNG-----KKDQAKLEF 496
Query: 481 GNSR-LVEIKPSVGPAVKFSSPFSSQDQVAELDNIIRKPSISSIKNDSSN---------- 540
SR +E K + + P +S + + ++ + + K N
Sbjct: 497 NKSREAIECKKVPTASTMINDPEASAATMEVVPGLVARKGRTKKKRKEKNKSKKCTSLEN 556
Query: 541 --NYESSTLNSSPLVPSIEPNS----------EYDSSQNIEVY-------EVSGLAKSV- 600
S +NSS +V + + +S EY ++Q IE + SG SV
Sbjct: 557 NGEVNKSVVNSSAIVKASKCDSSCTSANQHPQEYINAQIIEEHGSFSCERNRSGTCASVN 616
Query: 601 ----CQIGPGESQFPKGIIENQRLSSTLETSTSFMDCSVVPSHLPSLKLKNIVKSDVNVK 660
C+ E K E +SS L SV P+ PS + +VN +
Sbjct: 617 GAANCEYSGEEESHSKA--ETHVISSDLS--------SVDPAGGPSCE-------NVNPQ 676
Query: 661 GSVQTYELRDKSSLLDKLPRTIDVKEKVCLSRHQLSGDACNTKALNSLKHSPYEWHGVAS 720
S + ++K ++ ++ RT+D E + H +A A +S + YEW VA
Sbjct: 677 KSCCRGDRKEKLTMPNERSRTLDEGESHRI--HHQRREAGYGFASSSSEFVSYEWPAVAP 736
Query: 721 LYIPPFNSHLPPATDRLHLDVGHNWHNHFRRSFAPAMHQSRNSSVKGVCNPVMTRPVLMS 780
+Y +SHLP ATDRLHLDVGHN H + R+ F + +RN S++G V++RP+ MS
Sbjct: 737 MYFSHVSSHLPTATDRLHLDVGHNLHPYVRQPFVSTVQHARNPSIEGSHKQVLSRPMPMS 796
Query: 781 LDWPPVLRSASGLASTMMSNHDIGFLTRRQSSFCQGFPTNSNQISTEDEYSGNLTDFPDL 840
LDWPP++ S GL + N+D SG L D P+
Sbjct: 797 LDWPPMVHSNCGLTTAFTCNYD----------------------------SGILVDIPEQ 856
Query: 841 SNNQDLAEECDGNWISEEELEMHAVSGIDYNQYFGGGVMYWNPSDHHGTGFSRPPSLSSD 900
N +L EC+ NW+ EE+ E+H VSG+DYNQYFGGGVMYWNPSDH GTGFSRPPSLSSD
Sbjct: 857 KNKHELGNECENNWMLEEDFEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSRPPSLSSD 916
Query: 901 DSSWAWREADMNRTVDDMVAFSSSYS-NGLTSPTSTSFCSPSDPVGSGKQALGYVVQGSD 960
DSSWAW EA+M R+VDDMVAFSSSYS NGL SPT+ SFCSP P+G Q LGYVV G++
Sbjct: 917 DSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPTAASFCSPFHPLGPPNQPLGYVVPGNE 976
Query: 961 LPNNMLHSSPTMKDTVTEEDAPRSLPNLPSDVEGKTGDSHSFPILRPIVVPSMSRERSRS 1020
+ +L + PT + EE+ +L +L DVEG +GDS +PILRPI++P+M S+S
Sbjct: 977 ISTKILQAPPTTIEGAGEEEVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNM----SKS 1036
Query: 1021 EFCHGRDHKSPCIPPTRREQSRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKQRGFPTVRS 1080
E+ D KSP +PPTRRE R+KRPPSPVVLCVPRAP PPPPSPVS+SR +RGFPTVRS
Sbjct: 1037 EYKRSYDTKSPNVPPTRREHPRIKRPPSPVVLCVPRAPRPPPPSPVSNSRARRGFPTVRS 1096
Query: 1081 GSSSPRHWGVKGWYPDGTNLEEACLRIDGAEVIWPNWRNKSKSNCSTVQPL-------SL 1140
GSSSPRHWG++GW+ DG N EE GAE++ P WRNKS + +QPL L
Sbjct: 1097 GSSSPRHWGMRGWFHDGVNWEEP----RGAEIVLP-WRNKSLAVRPIIQPLPGALLQDHL 1156
Query: 1141 IAMSQIAIDQERLDVAFPLFPP-TSGRSVKKESLSLIHSRLHDEIDSFCKHVAAENMAKK 1200
IAMSQ+ DQE DVAFPL PP ++ ESLSLIH L+DEIDSFCK VAAENMA+K
Sbjct: 1157 IAMSQLGRDQEHPDVAFPLQPPELLNCPMQGESLSLIHGILNDEIDSFCKQVAAENMARK 1216
Query: 1201 PYITWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGI 1260
PYI WA+KRVTRSLQVLWPRSRTNIFGS+ATGLSLP+SDVDLVVCLPPVRNLEPIKEAGI
Sbjct: 1217 PYINWAIKRVTRSLQVLWPRSRTNIFGSSATGLSLPSSDVDLVVCLPPVRNLEPIKEAGI 1276
Query: 1261 LEGRNGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIILSTSNMQ 1320
LEGRNGIKETCLQHAARYL+NQEWVK+DSLKTVENTAIPIIMLVVEVP DLI ++Q
Sbjct: 1277 LEGRNGIKETCLQHAARYLANQEWVKTDSLKTVENTAIPIIMLVVEVPCDLI----CSIQ 1336
Query: 1321 SPKEESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDTSIGTKSVRIDISFKTPSHTGL 1380
SPK+ ++ QD N +M G EDSA L N KSVR+DISFKTPSHTGL
Sbjct: 1337 SPKDGPDCITVDQDSNGNTEMVGFEDSAAANSLPTNTGNLAIAKSVRLDISFKTPSHTGL 1352
BLAST of CmoCh14G022320 vs. TAIR 10
Match:
AT5G53770.1 (Nucleotidyltransferase family protein )
HSP 1 Score: 68.6 bits (166), Expect = 4.6e-11
Identity = 64/247 (25.91%), Postives = 100/247 (40.49%), Query Frame = 0
Query: 1134 RLHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPTSD 1193
+LH EI FC + A+K AV+ V+ ++ +WP + +FGS TGL LPTSD
Sbjct: 120 QLHKEIVDFCDFL-LPTQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSD 179
Query: 1194 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSNQEWVKSDSLKTVENTAIP 1253
+D+V I E+G+ + G L+ +R LS + K +L + +P
Sbjct: 180 IDVV-----------ILESGLTNPQLG-----LRALSRALSQRGIAK--NLLVIAKARVP 239
Query: 1254 IIMLVVEVPHDLIILSTSNMQSPKEESSAVSGKQDVNILNDMAGLEDSALPKCLEVNYDT 1313
II V E+ S
Sbjct: 240 IIKFV-------------------EKKS-------------------------------- 289
Query: 1314 SIGTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLALVLKKFLADRSLDQSYSGG 1373
++ D+SF G + +E +++ + P PL L+LK FL R L++ YSGG
Sbjct: 300 -----NIAFDLSF--DMENGPKAAEFIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGG 289
Query: 1374 LSSYCLV 1381
+ SY L+
Sbjct: 360 IGSYALL 289
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q7KVS9 | 7.7e-11 | 25.61 | Non-canonical poly(A) RNA polymerase protein Trf4-1 OS=Drosophila melanogaster O... | [more] |
Q8NDF8 | 1.3e-10 | 26.40 | Terminal nucleotidyltransferase 4B OS=Homo sapiens OX=9606 GN=TENT4B PE=1 SV=2 | [more] |
Q68ED3 | 1.3e-10 | 26.40 | Terminal nucleotidyltransferase 4B OS=Mus musculus OX=10090 GN=Tent4b PE=1 SV=2 | [more] |
Q5XG87 | 2.2e-10 | 25.56 | Terminal nucleotidyltransferase 4A OS=Homo sapiens OX=9606 GN=TENT4A PE=1 SV=3 | [more] |
Q6PB75 | 2.5e-09 | 25.89 | Terminal nucleotidyltransferase 4A OS=Mus musculus OX=10090 GN=Tent4a PE=2 SV=2 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1ECL0 | 0.0e+00 | 97.46 | uncharacterized protein LOC111431966 isoform X4 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E927 | 0.0e+00 | 97.32 | uncharacterized protein LOC111431966 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E9K0 | 0.0e+00 | 97.18 | uncharacterized protein LOC111431966 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1EF53 | 0.0e+00 | 97.05 | uncharacterized protein LOC111431966 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E9B4 | 0.0e+00 | 97.05 | uncharacterized protein LOC111431966 isoform X5 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT4G00060.1 | 0.0e+00 | 51.05 | Nucleotidyltransferase family protein | [more] |
AT5G53770.1 | 4.6e-11 | 25.91 | Nucleotidyltransferase family protein | [more] |