Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAAATTTTGGGAGGGAAGAATTGGAGACGGCTTCACTTCAAGCATATCTAATAGGAATGGAAAGAGATAGAAAAGTTATCGGTAAGTTATATCTCCTTTTTCTTTTATCTTTTGACAGAGATAGAAAAGTTATCTTGATATTTTTTTCAATTATTTAGATTTTTAGATATCTGATTGAAAATCAAAATTGAAAAGTTATCTTAGATCTCTCTTTTAATTATTTATTGGATATCGAATAGAAAATCAAAATTAATTGAAAATAGCTTTGTGAATTGCATCTAAATGTAATCATCATTTTTAGGAAAGAGAAGAAAACAGACTAGATTGAAAAAGGAAAGAGAAGAAAACAGAATAGATTTAAAATGTCTTCCGTTAATGATAATAGTTTTTAGGCAAGGGAAGAAAAAAAAAATTGAAAATGGCTTCTGTTAATTGATAGTCAATAATGTGGAAGAAGAAAACATAATTGATTGAATATTGTTTTATAATTGCATAGTCATTTATATGAAAAAGAAGACAGAGAGAATAGAAATAAAGTGGTACAATAGAGAGAAGATGTAACGACTCGGGAAAGAAAAAAAAAATATATATATATATATAAAATAAATAAATAAATGAAAACGAAAAAAAAAAAAAAAAAAAAACCAGTCGCCGGAAAACCCGCGAGTTTTCCGGCGACCGCCGAGTTTCGCGGAACCCACGCGAAACGACGCCCACAGCACGCCCACCCTCAGCCCAGCATCTTCAGAACACCTGCAAAGGGAGAAAAGAGAGAAAATTTCGAGAGAGAGAGAGAGATTTTGGACAAACGTCGGCGAGTGTCCAGTTTCTCCGACGAGCCCTCAAACCACCCTTAAACCGACCCCAAACGACCTGTTACTACCATAAGAGAGATCCCTAACGAAAGCACAATAACTTAGGGTGCGTTTTGTGTTATTTCGTAAGCGTCGTCGTCGGTAACCGAGGTCGGAAAATTAGGTTTCTTAATCGATTCTTAGATCTATGGCTTTTAGGAGCTTTTAGGCCGCAAAACTCCCAAAAAGGAAAGAAGGACGACGAACAAAGAGGAAAAGGGGAGGAAGAGGACCGAAATCGGCAACGGAAACGCCGCGGGAAGGAGAGAGAAAGCTCGCCGGAAAATCGGCCAAAGTCAGGTAAGGAAGACAAGATCCACGGATCCGGGTCAACCCGAACCCAGAACCCGGGAGAGCCAGCCCGTTCCTCCCTTGGCCTCTGCCTTCGGCCCAGCCCAACTACAGCGGCCCAAACCTCAAACAGGCTCAGAAGCTACGGTTCAGTTGAACCAACCCAGACTAGACCCACGGTCCAATTGAGCCGGCCTGCAAACCAAACACTTGCGGCCCAGCCCAGTAAGCCGAGGCCCAACGTTTAGCCTTGGCCCAGTAGCCTTCCACAGCCCGCGACCCAAACCACGACCCGAACCGGACTGCGACCCACGCTTCGACCCGACACTACTGCTCGACGTGGCTTTCTCTGCTGCTGCCACGTGTCACACAGCGGCGCGGCAGTCCCTCGGCTCGGCTCGGCGCAAACGGCTCGGGCTCCCTCCGGTGACGCTCCGGTGGCTCTAGCGGCTGTTCGGCTCGGCTCACCTATTTTCAGCCCGGTTTCCACTGTTTCGACCCTCTCGGATCTGTTTTCGGCCCCGATTAAGGTATCCGAGGTCGTTTTAGAGCCGTATTTCACATTTATTAAATTATAGGTAAATTAATTATGAAATACTAACGGAAAGTGGTGAACGGTGTGATAGGAAACAAACCCCGGCGACGTAGGAAGCAGCTCTCACGGAACAGGAGCTTACGTTTAGCTATTTAGGGAATACTTCGGCCAAGGCCAACCAAGTAAGTAGCCTTACTATAGGATAGGCTAAATAAATTATATGATACTTGATGATATGTTTGACGCACGTTGTGGCATTATGATGAGCATGATGATTGCATGATTTTTACCTTAATATGATGATGTGTTCCATATTGAGCATGCTAGATGATGATGAGTGCCATATTGCATCATGACGTTAGATCGACGTAGGACACAACCCTAAGAGCATGAAAATGATATTAATATAAATATCCACGGCGTTACCTAGAGTAACAAAGAGATGAGACTAGAGGGTTGTGTCAGAAGAGACATTACGATGGATTTAGAGGGACCTCATGCATTTTGTATGTTCATAAGCATAGGGCTACTTCCCCTAGAGATGATGAGTGCGGACGCGCACCTTATGATGAGCGCGAATGCGCACAGGTGCGAAAGCACACAGATGAGGGTATTCATGAGGCGCATGACACTATGGGGTTCCGCTGACCTCTGGACGTCGCTACAGATTAGCTTGACCAGAGGGTCCAGGGGGTGTGCGAGCACCCTGGAGACTCACACTCGCACGTGTGAGTCGTGTGTAGGGAAGTACTACACATCCAATTTGTCCGAGACTGGAGGCCACCCCTAAGATGATTAGAGATAGGTCCCTATTCATGATTGCATGTGTTTGCATTAGCATTGCCCCTATAGTGGGGTCACTTACTGAGTATTTCTGCCATGTGCCATATTGTTTTTCAGGTAAAGGCAAGGCTCCCATGTACGGTTGACGGTGGCATCGTGATCAGAGACTGTGGCGCGTGTATAGGATAGTTGCATATTTAATTCCTAGTCTTGGTTAGGATAGAGCGTTTGCATTTCATTCATTTATATTAATTAATTTGTAATCGTTTTATTTTATCTTTTTGAACTCCAGCACGATGTTTGAAACACATAAGTCCGGCAGTGTTTTCCAATGTTTTAATGTATTCTGAAGTTTTAAATTTTTCCGCTAATAAAGATGAGCATGCTTAGGTTTTATTCTGCATTAGAGATGAGCATGAGACGTTAGTGGCGACTCTAGATATGTGAAAATTTAGGGTCGTTACAGTTGGTATCAGAGCTCTAGGTTATAGATTCTGTAGACTAGCCTATGATGTAGGCTTGACACGCTCCACGGTCCCAACGTCGATGCCCGTCACCGTCAGGTACGCCTTGCATGAGATAAGAAAAGAGAAACGTTTTAAAACTCGCATGCATTTTTTTTTATGTTGACATTACTGAAATGGTTGTGATATTTCACGTATTTATTTAAGAGTATGACGTCATGTGTTGCATGCTGAGTAATTTTGTTAATAGCATGATATGATGCTTAGCATGATGAATGTTTACTTCTTTGGCGTTACAATGTATTGTATAGTGTGCGTATTGTTATATATATATATATATATGTATGCATAGTATTAAGGCCTTAGACTTAAGCAATCTTATATTGGTTTATAGAACCATGCCCGGAGGACGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGACGAGGAAGGGGGCGACCCCGTCGTAGAGAAGTAGGATGAGGTGTGGCAGTAGAGGGCCCACCTCAAGAAAGATTAGAGACTGGGCCGGTAACAGTTACGCCGCCCCCAGTCTTACCACCAGTTGAGCTACCTCCTCTAGCTCCACTACAAGCTGAGCCTCCACCACAAGTAGAACCTCTACCCTAGCCTACCCTACCACTCCAAGTCACACCTGAAGCTGCACCCCCAGTGGGTGGAGTAGATGCTAATACTAGGTTAATGTGTGACTCCATCCAAGCCCTAATTCAAACCTTGGTAACCAATCAGCAAGCCAATCTACAATAGGCCCAGAAACAAGCCCAAGCACAGGCCCAGGCACTGGCACAGGTACAGGCACAGGTACAGGCACAGGCACAGGCCACTGCTCAGGCCAATGCCCCAATAAACCCTCCACAACCTGTTAGATGTTCAAACTGGATAAGGGATTTCAAGAGATACGACCCTCGACCATTCACTGGTTCCCTGGTAGACCCCATAGAAGCCCAAATGTGGATTGTAGTGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAAGGTCGCCTGTGCAACCTTCGTCCTACAAAAGGACGCAGAGATATGGTGGAGAGATAACAAAACCCTCCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTTCCTTAAAGAATATTATCTTAAGTCAGAGCGACTTAAAAGGCAGCAGGAGTTCACCCACCTAGTACAGGGAGGACTCACAGTGGAGAAGTACAACAGAGAGTTTAATAAACTCAAGAGATTTGCACCGTCCATGGTGGACACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGAACCAAGAATCCGCCGCATGTTGGAGGCGTTCAACCCAAAGACCTATGAGGAAGCCTTGAGAACTGCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGATGTGAAGAGCCAGTTATAATTGGGCAGAAGCGTCATGAATTAGGAGGCTCTGACCGTCCACCACTAGCACGTAGGCACCGTTCCAATAACAGACCCGCTCCTAGATGGGATGATCGACGCCCTCCCCGACATACTGACAGGAACCCCAGGAATCAAGATGGGGCCAGAGGGAGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACATGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGATGTGGTCAAGAGGGGCACATCGCCGTGAACTGCACGGTCGGAAATGCTGCAGCACAAGCAAACCCACCCAGAGTAGTAGAGCAAATGGATCAACCAGCACCACTGCGAGCTCAAGCTAGGGCATACGCGTCAACCAGCAAGGACACTGGGAGGTCCGACGCCGTGGTGACAGGTACACTATCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTTATTTCCATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAGGATGGACAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACATCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGCTAGTATAGACTGTCGCAAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGATTACCCCCAAGGTAGTCTCGATGATGAAAGCAAAGAGGTTAGTCCAACAAGGTGGATGGGCTATATTAGCATGTGCTGTAGACGTGAGAGGAAAGAAAGAGACCCTAGTAAACGTGCCAATAGTAAACGAGTTCCCGGATGTATTTCCGAATGACTTACCTGGAATACCCCCTTCCCGAGCGGTCGACTTCGTCATCGAACTCGAGCCGGAAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAGGAACTCAAGGCGCAACTGCAAGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGCATCGATGCGTCTGTGCATCGATTACAGAGAGCTAAACAAGAGAACCATAAAAAACAAATATCCTCTGCCTAGAATAGAAGATTTGTTTGATCAACTCAGAGAAGCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTGGTGATGTCATTTGGCCTCACCAATGCCCCAGCTGTATTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGATGACATCCTCATATACTCGAGAACCGACCTAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGTGAATTTTGGTTACGACAAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATTACAAAGTGGGAACGCCCAACTACGGTAACGGAAGTAAGGAGTTTCCTAGGGTTGGCGGGATATTATCGGAGGTTCGTATAGGACTTCGCTAAAATATCCTCGCCTTTAACAAAGTTAACAAAAAAAGGGGTGCCATTTAGATGGGATGATGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCGCCCCAGTGCTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCCTCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATATGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCATACCCATGACCTAGAGTTGGCCGCTGTAGTGTTCGCACTGAAAATCTGGCGACATTACCTGTATGGCGAGAAAACCCAAATTTTTACCAACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGTTAAACATGAGGCAGAGAAGGTGGTTAGAATTGGTGAAGGATTATGACGTAGATATCTAGTACCACCCTAAGAAAGCAAATGTGGTTGCAGATGCCTTGAGTAGGAAGACGGTCCACTCGTCGGCCCTCATTACGAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCATAGCACAGCTGGCCCGACTCACGGTACAACCTACGCTTAGGCAGAGAATTATTGCCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAACTAGACGAAAGTCCAGTAGATGGATTCTCAAAGTCATCAGATGAAGAACTATTGTATCAGGGACGCTTATGTGTTCCGGCAATAGAAGATTTAAGGAAGGAAGTACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCAAGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAAGCTCCAAGACAAAAGACGGCAGGGTTGTTGCAGCCCCTAAGCATACCGAAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAGGGCTATACAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTAGGAAGGTTACATATACAGTTGACGATTGGGCAAAACTGTACGTGAAAGAAATAGTAAGACTACATGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTCAGCGTTTTGGCGCGGACTTCAAAAAGCACTGGGTACCCGCTTCGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTACATCTAATGGAATTCTCGTATAACAACAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTAG
mRNA sequence
ATGACAAATTTTGGGAGGGAAGAATTGGAGACGGCTTCACTTCAAGCATATCTAATAGGAATGGAAAGAGATAGAAAAGTTATCGGCCGCAAAACTCCCAAAAAGGAAAGAAGGACGACGAACAAAGAGGAAAAGGGGAGGAAGAGGACCGAAATCGGCAACGGAAACGCCGCGGGAAGGAGAGAGAAAGCTCGCCGGAAAATCGGCCAAAGTCAGCCCGCGACCCAAACCACGACCCGAACCGGACTGCGACCCACGCTTCGACCCGACACTACTGCTCGACGTGGCTTTCTCTGCTGCTGCCACGTGTCACACAGCGGCGCGGCAGTCCCTCGGCTCGGCTCGGCGCAAACGGCTCGGGCTCCCTCCGGTGACGCTCCGGTGGCTCTAGCGGCTGTTCGGCTCGGCTCACCTATTTTCAGCCCGGTTTCCACTGTTTCGACCCTCTCGGATCTGTTTTCGGCCCCGATTAAGGCCCAGAAACAAGCCCAAGCACAGGCCCAGGCACTGGCACAGGTACAGGCACAGGTACAGGCACAGGCACAGGCCACTGCTCAGGCCAATGCCCCAATAAACCCTCCACAACCTGTTAGATGTTCAAACTGGATAAGGGATTTCAAGAGATACGACCCTCGACCATTCACTGGTTCCCTGGTAGACCCCATAGAAGCCCAAATGTGGATTGTAGTGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAAGGTCGCCTGTGCAACCTTCGTCCTACAAAAGGACGCAGAGATATGGTGGAGAGATAACAAAACCCTCCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTTCCTTAAAGAATATTATCTTAAGTCAGAGCGACTTAAAAGGCAGCAGGAGTTCACCCACCTAGTACAGGGAGGACTCACAGTGGAGAAGTACAACAGAGAGTTTAATAAACTCAAGAGATTTGCACCGTCCATGGTGGACACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGAACCAAGAATCCGCCGCATGTTGGAGGCGTTCAACCCAAAGACCTATGAGGAAGCCTTGAGAACTGCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGATGTGAAGAGCCAGTTATAATTGGGCAGAAGCGTCATGAATTAGGAGGCTCTGACCGTCCACCACTAGCACGTAGGCACCGTTCCAATAACAGACCCGCTCCTAGATGGGATGATCGACGCCCTCCCCGACATACTGACAGGAACCCCAGGAATCAAGATGGGGCCAGAGGGAGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACATGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGATGTGGTCAAGAGGGGCACATCGCCGTGAACTGCACGGTCGGAAATGCTGCAGCACAAGCAAACCCACCCAGAGTAGTAGAGCAAATGGATCAACCAGCACCACTGCGAGCTCAAGCTAGGGCATACGCGTCAACCAGCAAGGACACTGGGAGGTCCGACGCCGTGGTGACAGGTACACTATCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTTATTTCCATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAGGATGGACAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACATCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGCTAGTATAGACTGTCGCAAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGATTACCCCCAAGGTAGTCTCGATGATGAAAGCAAAGAGGTTAGTCCAACAAGGTGGATGGGCTATATTAGCATGTGCTGTAGACGTGAGAGGAAAGAAAGAGACCCTAGTAAACGTGCCAATAGTAAACGAGTTCCCGGATGTATTTCCGAATGACTTACCTGGAATACCCCCTTCCCGAGCGGTCGACTTCGTCATCGAACTCGAGCCGGAAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAGGAACTCAAGGCGCAACTGCAAGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGCATCGATGCGTCTGTGCATCGATTACAGAGAGCTAAACAAGAGAACCATAAAAAACAAATATCCTCTGCCTAGAATAGAAGATTTGTTTGATCAACTCAGAGAAGCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTGGTGATGTCATTTGGCCTCACCAATGCCCCAGCTGTATTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGATGACATCCTCATATACTCGAGAACCGACCTAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGTGAATTTTGGTTACGACAAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATTACAAAGTGGGAACGCCCAACTACGTTAACAAAAAAAGGGGTGCCATTTAGATGGGATGATGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCGCCCCAGTGCTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCCTCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATATGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCATACCCATGACCTAGAGTTGGCCGCTGTAGTGTTCGCACTGAAAATCTGGCGACATTACCTGTATGGCGAGAAAACCCAAATTTTTACCAACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGTTAAACATGAGGCAGAGAAGGTGGAAGACGGTCCACTCGTCGGCCCTCATTACGAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCATAGCACAGCTGGCCCGACTCACGGTACAACCTACGCTTAGGCAGAGAATTATTGCCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAACTAGACGAAAGTCCAGTAGATGGATTCTCAAAGTCATCAGATGAAGAACTATTGTATCAGGGACGCTTATGTGTTCCGGCAATAGAAGATTTAAGGAAGGAAGTACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCAAGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAAGCTCCAAGACAAAAGACGGCAGGGTTGTTGCAGCCCCTAAGCATACCGAAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAGGGCTATACAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTAGGAAGGTTACATATACAGTTGACGATTGGGCAAAACTGTACGTGAAAGAAATAGTAAGACTACATGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTCAGCGTTTTGGCGCGGACTTCAAAAAGCACTGGGTACCCGCTTCGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTACATCTAATGGAATTCTCGTATAACAACAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTAG
Coding sequence (CDS)
ATGACAAATTTTGGGAGGGAAGAATTGGAGACGGCTTCACTTCAAGCATATCTAATAGGAATGGAAAGAGATAGAAAAGTTATCGGCCGCAAAACTCCCAAAAAGGAAAGAAGGACGACGAACAAAGAGGAAAAGGGGAGGAAGAGGACCGAAATCGGCAACGGAAACGCCGCGGGAAGGAGAGAGAAAGCTCGCCGGAAAATCGGCCAAAGTCAGCCCGCGACCCAAACCACGACCCGAACCGGACTGCGACCCACGCTTCGACCCGACACTACTGCTCGACGTGGCTTTCTCTGCTGCTGCCACGTGTCACACAGCGGCGCGGCAGTCCCTCGGCTCGGCTCGGCGCAAACGGCTCGGGCTCCCTCCGGTGACGCTCCGGTGGCTCTAGCGGCTGTTCGGCTCGGCTCACCTATTTTCAGCCCGGTTTCCACTGTTTCGACCCTCTCGGATCTGTTTTCGGCCCCGATTAAGGCCCAGAAACAAGCCCAAGCACAGGCCCAGGCACTGGCACAGGTACAGGCACAGGTACAGGCACAGGCACAGGCCACTGCTCAGGCCAATGCCCCAATAAACCCTCCACAACCTGTTAGATGTTCAAACTGGATAAGGGATTTCAAGAGATACGACCCTCGACCATTCACTGGTTCCCTGGTAGACCCCATAGAAGCCCAAATGTGGATTGTAGTGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAAGGTCGCCTGTGCAACCTTCGTCCTACAAAAGGACGCAGAGATATGGTGGAGAGATAACAAAACCCTCCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTTCCTTAAAGAATATTATCTTAAGTCAGAGCGACTTAAAAGGCAGCAGGAGTTCACCCACCTAGTACAGGGAGGACTCACAGTGGAGAAGTACAACAGAGAGTTTAATAAACTCAAGAGATTTGCACCGTCCATGGTGGACACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGAACCAAGAATCCGCCGCATGTTGGAGGCGTTCAACCCAAAGACCTATGAGGAAGCCTTGAGAACTGCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGATGTGAAGAGCCAGTTATAATTGGGCAGAAGCGTCATGAATTAGGAGGCTCTGACCGTCCACCACTAGCACGTAGGCACCGTTCCAATAACAGACCCGCTCCTAGATGGGATGATCGACGCCCTCCCCGACATACTGACAGGAACCCCAGGAATCAAGATGGGGCCAGAGGGAGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACATGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGATGTGGTCAAGAGGGGCACATCGCCGTGAACTGCACGGTCGGAAATGCTGCAGCACAAGCAAACCCACCCAGAGTAGTAGAGCAAATGGATCAACCAGCACCACTGCGAGCTCAAGCTAGGGCATACGCGTCAACCAGCAAGGACACTGGGAGGTCCGACGCCGTGGTGACAGGTACACTATCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTTATTTCCATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAGGATGGACAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACATCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGCTAGTATAGACTGTCGCAAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGATTACCCCCAAGGTAGTCTCGATGATGAAAGCAAAGAGGTTAGTCCAACAAGGTGGATGGGCTATATTAGCATGTGCTGTAGACGTGAGAGGAAAGAAAGAGACCCTAGTAAACGTGCCAATAGTAAACGAGTTCCCGGATGTATTTCCGAATGACTTACCTGGAATACCCCCTTCCCGAGCGGTCGACTTCGTCATCGAACTCGAGCCGGAAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAGGAACTCAAGGCGCAACTGCAAGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGCATCGATGCGTCTGTGCATCGATTACAGAGAGCTAAACAAGAGAACCATAAAAAACAAATATCCTCTGCCTAGAATAGAAGATTTGTTTGATCAACTCAGAGAAGCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTGGTGATGTCATTTGGCCTCACCAATGCCCCAGCTGTATTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGATGACATCCTCATATACTCGAGAACCGACCTAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGTGAATTTTGGTTACGACAAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATTACAAAGTGGGAACGCCCAACTACGTTAACAAAAAAAGGGGTGCCATTTAGATGGGATGATGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCGCCCCAGTGCTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCCTCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATATGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCATACCCATGACCTAGAGTTGGCCGCTGTAGTGTTCGCACTGAAAATCTGGCGACATTACCTGTATGGCGAGAAAACCCAAATTTTTACCAACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGTTAAACATGAGGCAGAGAAGGTGGAAGACGGTCCACTCGTCGGCCCTCATTACGAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCATAGCACAGCTGGCCCGACTCACGGTACAACCTACGCTTAGGCAGAGAATTATTGCCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAACTAGACGAAAGTCCAGTAGATGGATTCTCAAAGTCATCAGATGAAGAACTATTGTATCAGGGACGCTTATGTGTTCCGGCAATAGAAGATTTAAGGAAGGAAGTACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCAAGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAAGCTCCAAGACAAAAGACGGCAGGGTTGTTGCAGCCCCTAAGCATACCGAAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAGGGCTATACAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTAGGAAGGTTACATATACAGTTGACGATTGGGCAAAACTGTACGTGAAAGAAATAGTAAGACTACATGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTCAGCGTTTTGGCGCGGACTTCAAAAAGCACTGGGTACCCGCTTCGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTACATCTAATGGAATTCTCGTATAACAACAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTAG
Protein sequence
MTNFGREELETASLQAYLIGMERDRKVIGRKTPKKERRTTNKEEKGRKRTEIGNGNAAGRREKARRKIGQSQPATQTTTRTGLRPTLRPDTTARRGFLCCCHVSHSGAAVPRLGSAQTARAPSGDAPVALAAVRLGSPIFSPVSTVSTLSDLFSAPIKAQKQAQAQAQALAQVQAQVQAQAQATAQANAPINPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIVVVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYLKSERLKRQQEFTHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRCEEPVIIGQKRHELGGSDRPPLARRHRSNNRPAPRWDDRRPPRHTDRNPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTVGNAAAQANPPRVVEQMDQPAPLRAQARAYASTSKDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDIILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAILACAVDVRGKKETLVNVPIVNEFPDVFPNDLPGIPPSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKYFFTQKELNMRQRRWKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLRKEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAKLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEAL
Homology
BLAST of CmoCh10G011790 vs. ExPASy Swiss-Prot
Match:
Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)
HSP 1 Score: 376.7 bits (966), Expect = 1.1e-102
Identity = 254/775 (32.77%), Postives = 383/775 (49.42%), Query Frame = 0
Query: 673 KETLVNVPI--VNEFPDVFPNDLPGIPP---SRAVDFVIELEPETGPISKAPYRMAPAEL 732
K+T +P+ ++ ++ NDLP P + V IE++P PY +
Sbjct: 576 KDTFCTLPVWLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNE 635
Query: 733 KELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRI 792
+E+ +Q LLD FI PS SP +PV+ V KKD + RLC+DYR LNK TI + +PLPRI
Sbjct: 636 QEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRI 695
Query: 793 EDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVF 852
++L ++ A +F+ +DL SGYHQI + KD KTAF T G YE+ VM FGL NAP+ F
Sbjct: 696 DNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTF 755
Query: 853 MELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWL 912
M F++ FV V++DDILI+S + EH +HL VL L+ L K KC+F
Sbjct: 756 ARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFAS 815
Query: 913 RQVSFLGHVVSKDGISVDPTKVEAITKWERPTTLT------------KKGVP-------- 972
+ FLG+ + I+ K AI + P T+ ++ +P
Sbjct: 816 EETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQP 875
Query: 973 --------FRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHG 1032
+W + + + + LK L +PVL+ + Y + +DAS G+G VL +
Sbjct: 876 IQLFICDKSQWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVD 935
Query: 1033 K------VVAYASRQLKEYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKY 1092
VV Y S+ L+ +KNY +LEL ++ AL +R+ L+G+ + T+H SL
Sbjct: 936 NKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLS 995
Query: 1093 FFTQKELNMRQRRWKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQ 1152
+ E R +RW ++ T E + +A A I + T +P +
Sbjct: 996 LQNKNEPARRVQRWLDDLATYDFTLEYLAG---PKNVVADAISRAIYTITPETSRPIDTE 1055
Query: 1153 RIIASQREDPNLQKVLGQLDE------SPVD---------------GFSKS---SDEELL 1212
+ + DP VL + E +P D F K+ DE +
Sbjct: 1056 SWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIY 1115
Query: 1213 YQGRLCVPAIEDLRKEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCL 1272
YQ RL VP I+ + + ++ F H T + ++W ++ + ++ C+
Sbjct: 1116 YQDRLVVP-IKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCV 1175
Query: 1273 VCQQVKAPRQKTAGLLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHF 1332
CQ +K+ R + GLLQPL I + +W +I+MDF+ GLP T +I VVVDR +K AHF
Sbjct: 1176 QCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHF 1235
Query: 1333 LPRKVTYTVDDWAKLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAF 1385
+ + T L + I HG P +I SDRD R T+ ++ L K LG + S+A
Sbjct: 1236 IATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSAN 1295
BLAST of CmoCh10G011790 vs. ExPASy Swiss-Prot
Match:
Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)
HSP 1 Score: 375.2 bits (962), Expect = 3.3e-102
Identity = 252/775 (32.52%), Postives = 382/775 (49.29%), Query Frame = 0
Query: 673 KETLVNVPI--VNEFPDVFPNDLPGIPP---SRAVDFVIELEPETGPISKAPYRMAPAEL 732
K+T +P+ ++ ++ NDLP P + V IE++P PY +
Sbjct: 550 KDTFCTLPVWLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNE 609
Query: 733 KELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRI 792
+E+ +Q LLD FI PS SP +PV+ V KKD + RLC+DYR LNK TI + +PLPRI
Sbjct: 610 QEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRI 669
Query: 793 EDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVF 852
++L ++ A +F+ +DL SGYHQI + KD KTAF T G YE+ VM FGL NAP+ F
Sbjct: 670 DNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTF 729
Query: 853 MELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWL 912
M F++ FV V++DDILI+S + EH +HL VL L+ L K KC+F
Sbjct: 730 ARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFAS 789
Query: 913 RQVSFLGHVVSKDGISVDPTKVEAITKWERPTTLT------------KKGVP-------- 972
+ FLG+ + I+ K AI + P T+ ++ +P
Sbjct: 790 EETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQP 849
Query: 973 --------FRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHG 1032
+W + + + LK+ L +PVL+ + Y + +DAS G+G VL +
Sbjct: 850 IQLFICDKSQWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVD 909
Query: 1033 K------VVAYASRQLKEYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKY 1092
VV Y S+ L+ +KNY +LEL ++ AL +R+ L+G+ + T+H SL
Sbjct: 910 NKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLS 969
Query: 1093 FFTQKELNMRQRRWKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQ 1152
+ E R +RW ++ T E + +A A + + T +P +
Sbjct: 970 LQNKNEPARRVQRWLDDLATYDFTLEYLAG---PKNVVADAISRAVYTITPETSRPIDTE 1029
Query: 1153 RIIASQREDPNLQKVLGQLDE------SPVD---------------GFSKS---SDEELL 1212
+ + DP VL + E +P D F K+ DE +
Sbjct: 1030 SWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIY 1089
Query: 1213 YQGRLCVPAIEDLRKEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCL 1272
YQ RL VP I+ + + ++ F H T + ++W ++ + ++ C+
Sbjct: 1090 YQDRLVVP-IKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCV 1149
Query: 1273 VCQQVKAPRQKTAGLLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHF 1332
CQ +K+ R + GLLQPL I + +W +I+MDF+ GLP T +I VVVDR +K AHF
Sbjct: 1150 QCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHF 1209
Query: 1333 LPRKVTYTVDDWAKLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAF 1385
+ + T L + I HG P +I SDRD R T+ ++ L K LG + S+A
Sbjct: 1210 IATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSAN 1269
BLAST of CmoCh10G011790 vs. ExPASy Swiss-Prot
Match:
P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)
HSP 1 Score: 355.1 bits (910), Expect = 3.5e-96
Identity = 215/761 (28.25%), Postives = 375/761 (49.28%), Query Frame = 0
Query: 681 IVNEFPDV-FPNDLPGIP-PSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLL 740
I EF D+ + +P P + ++F +EL E + Y + P +++ + ++ L
Sbjct: 377 IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436
Query: 741 DKGFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREAT 800
G IR S + PV+FV KK+ ++R+ +DY+ LNK N YPLP IE L +++ +T
Sbjct: 437 KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496
Query: 801 VFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKEC 860
+F+K+DL+S YH IR+ + D K AFR G +E++VM +G++ APA F +N + E
Sbjct: 497 IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556
Query: 861 LDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVS 920
+ V+ ++DDILI+S+++ EH +H++ VL L+ L +KCEF QV F+G+ +S
Sbjct: 557 KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616
Query: 921 KDGISVDPTKVEAITKWERP------------------------------TTLTKKGVPF 980
+ G + ++ + +W++P L KK V +
Sbjct: 617 EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 981 RWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK-----VVA 1040
+W + +N+K+ LV+ PVL + S+ ++ +DAS +G VL Q V
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 1041 YASRQLKEYEKNYHTHDLELAAVVFALKIWRHYLYG--EKTQIFTNHKSLKYFFTQKE-- 1100
Y S ++ + + NY D E+ A++ +LK WRHYL E +I T+H++L T +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796
Query: 1101 LNMRQRRWKTV----------------HSSALITREVRVQREFERANIAVATEGVIAQLA 1160
N R RW+ H + ++R V + + + I +
Sbjct: 797 ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKD----SEDNSINFVN 856
Query: 1161 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1220
++++ + +++ D L +L D+ + + + ++ +P L
Sbjct: 857 QISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLT 916
Query: 1221 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1280
+ ++ + H +HP + + + F WK +++ + +V C CQ K+ K G
Sbjct: 917 RTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYG 976
Query: 1281 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1340
LQP+ + WE+++MDFI LP++ GY ++VVVDR +K A +P + T + A+
Sbjct: 977 PLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTAR 1036
Query: 1341 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1385
++ + ++ G P I++D D FTS W+ FS + PQTDGQTER NQ
Sbjct: 1037 MFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQT 1096
BLAST of CmoCh10G011790 vs. ExPASy Swiss-Prot
Match:
P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)
HSP 1 Score: 355.1 bits (910), Expect = 3.5e-96
Identity = 215/761 (28.25%), Postives = 375/761 (49.28%), Query Frame = 0
Query: 681 IVNEFPDV-FPNDLPGIP-PSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLL 740
I EF D+ + +P P + ++F +EL E + Y + P +++ + ++ L
Sbjct: 377 IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436
Query: 741 DKGFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREAT 800
G IR S + PV+FV KK+ ++R+ +DY+ LNK N YPLP IE L +++ +T
Sbjct: 437 KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496
Query: 801 VFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKEC 860
+F+K+DL+S YH IR+ + D K AFR G +E++VM +G++ APA F +N + E
Sbjct: 497 IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556
Query: 861 LDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVS 920
+ V+ ++DDILI+S+++ EH +H++ VL L+ L +KCEF QV F+G+ +S
Sbjct: 557 KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616
Query: 921 KDGISVDPTKVEAITKWERP------------------------------TTLTKKGVPF 980
+ G + ++ + +W++P L KK V +
Sbjct: 617 EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 981 RWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK-----VVA 1040
+W + +N+K+ LV+ PVL + S+ ++ +DAS +G VL Q V
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 1041 YASRQLKEYEKNYHTHDLELAAVVFALKIWRHYLYG--EKTQIFTNHKSLKYFFTQKE-- 1100
Y S ++ + + NY D E+ A++ +LK WRHYL E +I T+H++L T +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796
Query: 1101 LNMRQRRWKTV----------------HSSALITREVRVQREFERANIAVATEGVIAQLA 1160
N R RW+ H + ++R V + + + I +
Sbjct: 797 ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKD----SEDNSINFVN 856
Query: 1161 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1220
++++ + +++ D L +L D+ + + + ++ +P L
Sbjct: 857 QISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLT 916
Query: 1221 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1280
+ ++ + H +HP + + + F WK +++ + +V C CQ K+ K G
Sbjct: 917 RTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYG 976
Query: 1281 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1340
LQP+ + WE+++MDFI LP++ GY ++VVVDR +K A +P + T + A+
Sbjct: 977 PLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTAR 1036
Query: 1341 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1385
++ + ++ G P I++D D FTS W+ FS + PQTDGQTER NQ
Sbjct: 1037 MFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQT 1096
BLAST of CmoCh10G011790 vs. ExPASy Swiss-Prot
Match:
P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)
HSP 1 Score: 355.1 bits (910), Expect = 3.5e-96
Identity = 215/761 (28.25%), Postives = 375/761 (49.28%), Query Frame = 0
Query: 681 IVNEFPDV-FPNDLPGIP-PSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLL 740
I EF D+ + +P P + ++F +EL E + Y + P +++ + ++ L
Sbjct: 377 IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436
Query: 741 DKGFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREAT 800
G IR S + PV+FV KK+ ++R+ +DY+ LNK N YPLP IE L +++ +T
Sbjct: 437 KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496
Query: 801 VFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKEC 860
+F+K+DL+S YH IR+ + D K AFR G +E++VM +G++ APA F +N + E
Sbjct: 497 IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556
Query: 861 LDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVS 920
+ V+ ++DDILI+S+++ EH +H++ VL L+ L +KCEF QV F+G+ +S
Sbjct: 557 KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616
Query: 921 KDGISVDPTKVEAITKWERP------------------------------TTLTKKGVPF 980
+ G + ++ + +W++P L KK V +
Sbjct: 617 EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 981 RWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK-----VVA 1040
+W + +N+K+ LV+ PVL + S+ ++ +DAS +G VL Q V
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 1041 YASRQLKEYEKNYHTHDLELAAVVFALKIWRHYLYG--EKTQIFTNHKSLKYFFTQKE-- 1100
Y S ++ + + NY D E+ A++ +LK WRHYL E +I T+H++L T +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796
Query: 1101 LNMRQRRWKTV----------------HSSALITREVRVQREFERANIAVATEGVIAQLA 1160
N R RW+ H + ++R V + + + I +
Sbjct: 797 ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKD----SEDNSINFVN 856
Query: 1161 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1220
++++ + +++ D L +L D+ + + + ++ +P L
Sbjct: 857 QISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLT 916
Query: 1221 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1280
+ ++ + H +HP + + + F WK +++ + +V C CQ K+ K G
Sbjct: 917 RTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYG 976
Query: 1281 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1340
LQP+ + WE+++MDFI LP++ GY ++VVVDR +K A +P + T + A+
Sbjct: 977 PLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTAR 1036
Query: 1341 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1385
++ + ++ G P I++D D FTS W+ FS + PQTDGQTER NQ
Sbjct: 1037 MFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQT 1096
BLAST of CmoCh10G011790 vs. ExPASy TrEMBL
Match:
A0A6J1EYH9 (Reverse transcriptase OS=Cucurbita moschata OX=3662 GN=LOC111440131 PE=4 SV=1)
HSP 1 Score: 1624.4 bits (4205), Expect = 0.0e+00
Identity = 915/1665 (54.95%), Postives = 1014/1665 (60.90%), Query Frame = 0
Query: 159 AQKQAQAQAQALAQVQAQVQAQAQATAQANAPINPPQPVRCSNWIRDFKRYDPRPFTGSL 218
A A A +A A A A A P+N P P R +NW++DF+RY+PRPF GS
Sbjct: 6 ANAMATQPANYVATQPANAVAPQPVDAVAIPPVNNPHPDRGANWLKDFRRYNPRPFVGSK 65
Query: 219 VDPIEAQMWIVVVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWE 278
DP AQMWI +ETTFE+M CP+ KV CAT+VLQKDAE+WW DNK +NP GG WE
Sbjct: 66 EDPTAAQMWIANMETTFESMRCPDEHKVTCATYVLQKDAEVWWSDNKQSINPGGGITTWE 125
Query: 279 RFKEAFLKEYYLKSERLKRQQEFTHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTEK 338
FKEAFLK YY K R+K+QQEF L QG TV++Y+++F +L+RFAPS+ DTEEK TEK
Sbjct: 126 TFKEAFLKYYYPKEIRIKKQQEFNLLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEK 185
Query: 339 FVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRCEEPVIIGQKRH-ELGGSDRP 398
FVLGL P+ RRMLEAFNPKTYEEALRTAKALE+P +EK+ E V IG+KR E+ +
Sbjct: 186 FVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVAIGRKRPVEVDTIEFQ 245
Query: 399 PLARRHRSNNR-PAPRWDDRRPP--RHTDRNPRNQDGARGRREEGCTICGRLHGGRCMAG 458
P ++R R +R PAP PP R+ + C CG+ H GRC+AG
Sbjct: 246 PPSQRPRYQSRPPAP------PPIGRYL-----------AMEKPLCRNCGKQHVGRCLAG 305
Query: 459 SRACYRCGQEGHIAVNCTVGNAAAQANP---PRVVEQMDQPAPLRAQARAYASTSKDTGR 518
S CY CG GH+A C + P P + E Q +P Q +AY +TS + G
Sbjct: 306 SGMCYICGHAGHVARTCPTKSPGIPREPLRGPVIREPTLQTSP---QTKAYVTTSNEAGT 365
Query: 519 SDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVS 578
S VVTGTLSILGHFA TLFDSGSTHSF++ PF+ QAGF +EPL+H +SV TPAGVDLV+
Sbjct: 366 SGTVVTGTLSILGHFALTLFDSGSTHSFVASPFIKQAGFVIEPLMHALSVGTPAGVDLVT 425
Query: 579 RDRVKDGQVIIGNQTLSVDLMVVNMTDFDIILGMDWLAENRASIDCRKKEVKFSPPIGPT 638
+DRV+DGQV+I QT+ VDL VV+MTDFD+ILGMDWLAEN A+IDC KKEV F+PP G T
Sbjct: 426 KDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLT 485
Query: 639 FKFKGTNTGITPKVVSMMKAKRLVQQGGWAILACAVDVRGKKETLVNVPIVNEFPDVFPN 698
FKFKGT+TG TPK++SMMKA+RL+QQGGWA LA AV+ +GK++ + +P+VNEF DVFP
Sbjct: 486 FKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPE 545
Query: 699 DLPGIPPSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWG 758
DLPGIPPSR VDF I+LE TGPISKAPYRMAPAELKELK QLQDLLD
Sbjct: 546 DLPGIPPSREVDFGIDLELGTGPISKAPYRMAPAELKELKTQLQDLLD------------ 605
Query: 759 APVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQ 818
KD SMRLCI YRELNKRT+KNKYPLPRIEDLFDQLR ATVFSKIDLRSGYHQ
Sbjct: 606 --------KDDSMRLCIGYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQ 665
Query: 819 IRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDIL 878
I+I +D+PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD+FVIVFIDDIL
Sbjct: 666 IKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDIL 725
Query: 879 IYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEA 938
IYS+TDL+H+EHLRK LT LRE+KLYA F+KCEFW+ QVSFLGH+VSKDGI VDP K+EA
Sbjct: 726 IYSKTDLKHQEHLRKALTILRENKLYANFTKCEFWIXQVSFLGHIVSKDGIFVDPNKIEA 785
Query: 939 ITKWERPTT------------------------------LTKKGVPFRWDDACEASFQNL 998
+TK +RPTT LTKKGVPF WDD CE SFQ L
Sbjct: 786 VTKRKRPTTVTEIRSFLGLVGYYRRFVXDFARIATPLTQLTKKGVPFVWDDTCEVSFQEL 845
Query: 999 KERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYHTHDL 1058
K+RLV+APVL VPESS GY IYSDAS KGLGCVLMQHGKVVAYAS QLK+YEKNY THDL
Sbjct: 846 KQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYEKNYPTHDL 905
Query: 1059 ELAAVVFALKIWRHYLYGEKTQIFTNHKSLKYFFTQKELNMRQRRW-------------- 1118
ELAAVVFALKIWRHY YGEKTQI+T+HKSLKY FTQKELNMRQRRW
Sbjct: 906 ELAAVVFALKIWRHYPYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYH 965
Query: 1119 ------------------------------------------------------------ 1178
Sbjct: 966 PGKANVVADALSRKVVHSSALITREPRGRTDFEQADIVVVTKEVAAQLARMTVRPTLRQR 1025
Query: 1179 ------------------------------------------------------------ 1238
Sbjct: 1026 IIDSQREDPSLSKILDQLEVGPVDGFTKSTDDGLLCQGRLCVPPLSGIKNEILTEAHNSA 1085
Query: 1239 ------------------------------------------------------------ 1298
Sbjct: 1086 FSIHPGGTKMYQDLKKHFWWRSMKKDIAEYVSKCLVCQQVKAPRQKTAGLLQPLSIPEWK 1145
Query: 1299 ------------------------------------------------------------ 1358
Sbjct: 1146 WENIAMDFIVGLPKTLKGYTVIWVVVDRLTKSAHFLLGKATYTVDNWAQLYVKEIVRLHG 1205
Query: 1359 ------------------------------------------------------------ 1387
Sbjct: 1206 VPVSIVSDRDPRFMSAFWRCLQRAMGSWDTKLHLMEFSYNNSFQATIGMAPFEALYGKRC 1265
BLAST of CmoCh10G011790 vs. ExPASy TrEMBL
Match:
A0A5A7UBS1 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold13G00110 PE=4 SV=1)
HSP 1 Score: 1454.5 bits (3764), Expect = 0.0e+00
Identity = 771/1303 (59.17%), Postives = 923/1303 (70.84%), Query Frame = 0
Query: 155 APIKAQKQAQAQAQALAQVQAQVQAQAQATAQANAPINP----PQPVRCS---NWIRDFK 214
AP A +Q QA AQ QA A AQA AP P P PV+ S +RDF+
Sbjct: 59 APFLAAQQNQAAPVQAEAAPAQAQA-APVQAQAVAPPAPEEAQPVPVQLSTEAKHLRDFR 118
Query: 215 RYDPRPFTGSLVDPIEAQMWIVVVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTL 274
+Y+P+ F GS+ +P +AQMW+ +ET F M+CPE+QKV CA F L+ WW + +
Sbjct: 119 KYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCAVFFLEDRGTAWWETAERM 178
Query: 275 LNPEGGPMNWERFKEAFLKEYYLKSERLKRQQEFTHLVQGGLTVEKYNREFNKLKRFAPS 334
L + + WE+FKE F +++ + + + QEF +L QG +TVE+Y+ EF+ L RFAP
Sbjct: 179 LGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAEFDMLSRFAPD 238
Query: 335 MVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRCEEPVII 394
MV E TEKFV GL ++ ++ A P T+ +ALR A L P+ K +
Sbjct: 239 MVRDEAARTEKFVRGLRLDLQGIVRALRPATHADALRIALDLSLPERADASKAAGRGSAL 298
Query: 395 GQKRHELGGSDRPPLARRHRSNNRPAPRWDDRRPPRHTDRNPRNQDGARGR--RE-EGCT 454
GQKR D P R RS + R + A GR RE CT
Sbjct: 299 GQKRKVETQPDVAP-QRTLRSGG--------------VFQRHRRELAAAGRTLRELPACT 358
Query: 455 ICGRLHGGRCMAGSRACYRCGQEGHIAVNCTVGNAAAQANPPRVVEQMDQPAPLRA-QAR 514
CGR+HGGRC+AGS C+RC Q GH A C PR + P P A Q R
Sbjct: 359 TCGRVHGGRCLAGSGVCFRCRQPGHTADMC-----------PRKPFETTPPQPSAAQQGR 418
Query: 515 AYASTSKDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMS 574
+A+T ++ R+ VVTGTL ILGH+AF LFDSGS+HSFIS FV G E+EPL +S
Sbjct: 419 VFATTRQEAERAGTVVTGTLPILGHYAFVLFDSGSSHSFISSVFVQHVGLEVEPLGSVLS 478
Query: 575 VSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDIILGMDWLAENRASIDCRKK 634
VSTP+G L+S++++K +V I N+ L V L+V++M DFD+ILGMDWL+ N A+IDC K
Sbjct: 479 VSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVILGMDWLSANHANIDCYGK 538
Query: 635 EVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAILACAVDVRGKKETLVNVP 694
EV F+PP +FKF+G PKV+S MKA +L+ QG W ILA VDVR + +L + P
Sbjct: 539 EVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGILASVVDVREPEVSLSSEP 598
Query: 695 IVNEFPDVFPNDLPGIPPSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLLDK 754
+V E+PDVFP++LPG+PP R VDF IELEP T PIS+APYRMAPAELKELK QLQ+LLDK
Sbjct: 599 VVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDK 658
Query: 755 GFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREATVF 814
GFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNK T+KN+YPLPRI+DLFDQL+ ATVF
Sbjct: 659 GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVF 718
Query: 815 SKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD 874
SKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD
Sbjct: 719 SKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKEFLD 778
Query: 875 MFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKD 934
FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +
Sbjct: 779 SFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSE 838
Query: 935 GISVDPTKVEAITKWERPTT------------------------------LTKKGVPFRW 994
G+SVDP K+EA+T W RP+T LT+KG PF W
Sbjct: 839 GVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW 898
Query: 995 DDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLK 1054
ACE SFQ LK++LVTAPVL VP+ S +VIYSDAS KGLGCVLMQ GKVVAYASRQLK
Sbjct: 899 SPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLK 958
Query: 1055 EYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKYFFTQKELNMRQRRW--- 1114
+E+NY THDLELAAVVFALKIWRHYLYGEK QI+T+HKSLKYFFTQKELNMRQRRW
Sbjct: 959 IHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLEL 1018
Query: 1115 ------------------------KTVHSSALITREVRVQREFERANIAVATEGVIAQLA 1174
K HS+ALIT++ + R+FERA IAV+ V AQLA
Sbjct: 1019 VKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA 1078
Query: 1175 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1234
+LTVQPTLRQ+IIA+Q +DP L + ++ +GFS SSD+ L+++GRLCVP ++
Sbjct: 1079 QLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDDGLMFEGRLCVPEDSAVK 1138
Query: 1235 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1294
E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS+CLVCQQVKAPRQ AG
Sbjct: 1139 TELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVSRCLVCQQVKAPRQHPAG 1198
Query: 1295 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1354
LLQPLS+P WKWE+++MDFI GLPKT +GYTVIWVVVDRLTKSAHF+P K TYT W +
Sbjct: 1199 LLQPLSVPGWKWESVSMDFITGLPKTLRGYTVIWVVVDRLTKSAHFVPGKSTYTASKWGQ 1258
Query: 1355 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1387
LY+ EIVRLHGVPVSI+SDRD RFTS FW+GLQ ALGTR DFSTAFHPQTDGQTERLNQI
Sbjct: 1259 LYMTEIVRLHGVPVSIISDRDARFTSKFWKGLQLALGTRLDFSTAFHPQTDGQTERLNQI 1318
BLAST of CmoCh10G011790 vs. ExPASy TrEMBL
Match:
A0A5A7TSL0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold191G001340 PE=4 SV=1)
HSP 1 Score: 1454.5 bits (3764), Expect = 0.0e+00
Identity = 771/1303 (59.17%), Postives = 923/1303 (70.84%), Query Frame = 0
Query: 155 APIKAQKQAQAQAQALAQVQAQVQAQAQATAQANAPINP----PQPVRCS---NWIRDFK 214
AP A +Q QA AQ QA A AQA AP P P PV+ S +RDF+
Sbjct: 59 APFLAAQQNQAAPVQAEAAPAQAQA-APVQAQAVAPPAPEEAQPVPVQLSAEAKHLRDFR 118
Query: 215 RYDPRPFTGSLVDPIEAQMWIVVVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTL 274
+Y+P+ F GS+ +P +AQMW+ +ET F M+CPE+QKV CA F L+ WW + +
Sbjct: 119 KYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCAVFFLEDRGTAWWETAERM 178
Query: 275 LNPEGGPMNWERFKEAFLKEYYLKSERLKRQQEFTHLVQGGLTVEKYNREFNKLKRFAPS 334
L + + WE+FKE F +++ + + + QEF +L QG +TVE+Y+ EF+ L RFAP
Sbjct: 179 LGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAEFDMLSRFAPD 238
Query: 335 MVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRCEEPVII 394
MV E TEKFV GL ++ ++ A P T+ +ALR A L P+ K +
Sbjct: 239 MVRDEAARTEKFVRGLRLDLQGIVRALRPATHADALRIALDLSLPERADASKAAGRGSAL 298
Query: 395 GQKRHELGGSDRPPLARRHRSNNRPAPRWDDRRPPRHTDRNPRNQDGARGR--RE-EGCT 454
GQKR D P R RS + R + A GR RE CT
Sbjct: 299 GQKRKVETQPDVAP-QRTLRSGG--------------VFQRHRRELAAAGRTLRELPACT 358
Query: 455 ICGRLHGGRCMAGSRACYRCGQEGHIAVNCTVGNAAAQANPPRVVEQMDQPAPLRA-QAR 514
CGR+HGGRC+AGS C+RC Q GH A C PR + P P A Q R
Sbjct: 359 TCGRVHGGRCLAGSGVCFRCRQPGHTADMC-----------PRKPFETTPPQPSAAQQGR 418
Query: 515 AYASTSKDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMS 574
+A+T ++ R+ VVTGTL ILGH+AF LFDSGS+HSFIS FV G E+EPL +S
Sbjct: 419 VFATTRQEAERAGTVVTGTLPILGHYAFVLFDSGSSHSFISSVFVQHVGLEVEPLGSVLS 478
Query: 575 VSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDIILGMDWLAENRASIDCRKK 634
VSTP+G L+S++++K +V I N+ L V L+V++M DFD+ILGMDWL+ N A+IDC K
Sbjct: 479 VSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVILGMDWLSANHANIDCYGK 538
Query: 635 EVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAILACAVDVRGKKETLVNVP 694
EV F+PP +FKF+G PKV+S MKA +L+ QG W ILA VDVR + +L + P
Sbjct: 539 EVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGILASVVDVREPEVSLSSEP 598
Query: 695 IVNEFPDVFPNDLPGIPPSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLLDK 754
+V E+PDVFP++LPG+PP R VDF IELEP T PIS+APYRMAPAELKELK QLQ+LLDK
Sbjct: 599 VVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDK 658
Query: 755 GFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREATVF 814
GFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNK T+KN+YPLPRI+DLFDQL+ ATVF
Sbjct: 659 GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVF 718
Query: 815 SKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD 874
SKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD
Sbjct: 719 SKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKEFLD 778
Query: 875 MFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKD 934
FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +
Sbjct: 779 SFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSE 838
Query: 935 GISVDPTKVEAITKWERPTT------------------------------LTKKGVPFRW 994
G+SVDP K+EA+T W RP+T LT+KG PF W
Sbjct: 839 GVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW 898
Query: 995 DDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLK 1054
ACE SFQ LK++LVTAPVL VP+ S +VIYSDAS KGLGCVLMQ GKVVAYASRQLK
Sbjct: 899 SPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLK 958
Query: 1055 EYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKYFFTQKELNMRQRRW--- 1114
+E+NY THDLELAAVVFALKIWRHYLYGEK QI+T+HKSLKYFFTQKELNMRQRRW
Sbjct: 959 IHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLEL 1018
Query: 1115 ------------------------KTVHSSALITREVRVQREFERANIAVATEGVIAQLA 1174
K HS+ALIT++ + R+FERA IAV+ V AQLA
Sbjct: 1019 VKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA 1078
Query: 1175 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1234
+LTVQPTLRQ+IIA+Q +DP L + ++ +GFS SSD+ L+++GRLCVP ++
Sbjct: 1079 QLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDDGLMFEGRLCVPEDSAVK 1138
Query: 1235 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1294
E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS+CLVCQQVKAPRQ AG
Sbjct: 1139 TELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVSRCLVCQQVKAPRQHPAG 1198
Query: 1295 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1354
LLQPLS+P WKWE+++MDFI GLPKT +GYTVIWVVVDRLTKSAHF+P K TYT W +
Sbjct: 1199 LLQPLSVPGWKWESVSMDFITGLPKTLRGYTVIWVVVDRLTKSAHFVPGKSTYTASKWGQ 1258
Query: 1355 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1387
LY+ EIVRLHGVPVSI+SDRD RFTS FW+GLQ ALGTR DFSTAFHPQTDGQTERLNQI
Sbjct: 1259 LYMTEIVRLHGVPVSIISDRDARFTSKFWKGLQLALGTRLDFSTAFHPQTDGQTERLNQI 1318
BLAST of CmoCh10G011790 vs. ExPASy TrEMBL
Match:
A0A5D3BTN0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G001560 PE=4 SV=1)
HSP 1 Score: 1454.5 bits (3764), Expect = 0.0e+00
Identity = 772/1303 (59.25%), Postives = 923/1303 (70.84%), Query Frame = 0
Query: 155 APIKAQKQAQAQAQALAQVQAQVQAQAQATAQANAPINP----PQPVRCS---NWIRDFK 214
AP A +Q QA AQ QA A AQA AP P P PV+ S +RDF+
Sbjct: 382 APFLAAQQNQAAPVQAEAAPAQAQA-APVQAQAVAPPAPEEAQPVPVQLSTEAKHLRDFR 441
Query: 215 RYDPRPFTGSLVDPIEAQMWIVVVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTL 274
+Y+P+ F GS+ +P +AQMW+ +ET F M+CPE+QKV CA F L+ WW + +
Sbjct: 442 KYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCAVFFLEDRGTAWWETAERM 501
Query: 275 LNPEGGPMNWERFKEAFLKEYYLKSERLKRQQEFTHLVQGGLTVEKYNREFNKLKRFAPS 334
L + + WE+FKE F +++ + + + QEF +L QG +TVE+Y+ EF+ L RFAP
Sbjct: 502 LGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAEFDMLSRFAPD 561
Query: 335 MVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRCEEPVII 394
MV E TEKFV GL ++ ++ A P T+ +ALR A L P+ K +
Sbjct: 562 MVRDEAARTEKFVRGLRLDLQGIVRALRPATHADALRIALDLSLPERADASKAAGRGSAL 621
Query: 395 GQKRHELGGSDRPPLARRHRSNNRPAPRWDDRRPPRHTDRNPRNQDGARGR--RE-EGCT 454
GQKR D P R RS + R + A GR RE CT
Sbjct: 622 GQKRKVETQPDVAP-QRTLRSGG--------------VFQRHRRELAAAGRTLRELPACT 681
Query: 455 ICGRLHGGRCMAGSRACYRCGQEGHIAVNCTVGNAAAQANPPRVVEQMDQPAPLRA-QAR 514
CGR+HGGRC+AGS C+RC Q GH A C PR + P P A Q R
Sbjct: 682 TCGRVHGGRCLAGSGVCFRCRQPGHTADMC-----------PRKPFETTPPQPSAAQQGR 741
Query: 515 AYASTSKDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMS 574
+A+T ++ R+ VVTGTL ILGH+AF LFDSGS+HSFIS FV G E+EPL +S
Sbjct: 742 VFATTRQEAERAGTVVTGTLPILGHYAFVLFDSGSSHSFISSVFVQHVGLEVEPLGSVLS 801
Query: 575 VSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDIILGMDWLAENRASIDCRKK 634
VSTP+G L+S++++K +V I N+ L V L+V++M DFD+ILGMDWL+ N A+IDC K
Sbjct: 802 VSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVILGMDWLSANHANIDCYGK 861
Query: 635 EVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAILACAVDVRGKKETLVNVP 694
EV F+PP +FKF+G PKV+S MKA +L+ QG W ILA VDVR + +L + P
Sbjct: 862 EVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGILASVVDVREPEVSLSSEP 921
Query: 695 IVNEFPDVFPNDLPGIPPSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLLDK 754
+V E+PDVFP++LPG+PP R VDF IELEP T PIS+APYRMAPAELKELK QLQ+LLDK
Sbjct: 922 VVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDK 981
Query: 755 GFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREATVF 814
GFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNK T+KN+YPLPRI+DLFDQL+ ATVF
Sbjct: 982 GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVF 1041
Query: 815 SKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD 874
SKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFK+ LD
Sbjct: 1042 SKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLD 1101
Query: 875 MFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKD 934
FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +
Sbjct: 1102 SFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSE 1161
Query: 935 GISVDPTKVEAITKWERPTT------------------------------LTKKGVPFRW 994
G+SVDP K+EA+T W RP+T LT+KG PF W
Sbjct: 1162 GVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW 1221
Query: 995 DDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLK 1054
ACE SFQ LK++LVTAPVL VP+ S +VIYSDAS KGLGCVLMQ GKVVAYASRQLK
Sbjct: 1222 SPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLK 1281
Query: 1055 EYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKYFFTQKELNMRQRRW--- 1114
+E+NY THDLELAAVVFALKIWRHYLYGEK QI+T+HKSLKYFFTQKELNMRQRRW
Sbjct: 1282 IHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLEL 1341
Query: 1115 ------------------------KTVHSSALITREVRVQREFERANIAVATEGVIAQLA 1174
K HS+ALIT++ + R+FERA IAV+ V AQLA
Sbjct: 1342 VKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA 1401
Query: 1175 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1234
+LTVQPTLRQ+IIA+Q +DP L + ++ +GFS SSD+ L+++GRLCVP ++
Sbjct: 1402 QLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDDGLMFEGRLCVPEDSAVK 1461
Query: 1235 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1294
E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS+CLVCQQVKAPRQ AG
Sbjct: 1462 TELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVSRCLVCQQVKAPRQHPAG 1521
Query: 1295 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1354
LLQPLS+P WKWE+++MDFI GLPKT KGYTVIWVVVDRLTKSAHF+P K TYT W +
Sbjct: 1522 LLQPLSVPGWKWESVSMDFITGLPKTLKGYTVIWVVVDRLTKSAHFVPGKSTYTASKWGQ 1581
Query: 1355 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1387
LY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ ALGTR DFSTAFHPQTDGQTERLNQI
Sbjct: 1582 LYMTEIVRLHGVPVSIVSDRDARFTSKFWKGLQIALGTRLDFSTAFHPQTDGQTERLNQI 1641
BLAST of CmoCh10G011790 vs. ExPASy TrEMBL
Match:
A0A5A7UBH7 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold60G001510 PE=4 SV=1)
HSP 1 Score: 1454.5 bits (3764), Expect = 0.0e+00
Identity = 771/1303 (59.17%), Postives = 923/1303 (70.84%), Query Frame = 0
Query: 155 APIKAQKQAQAQAQALAQVQAQVQAQAQATAQANAPINP----PQPVRCS---NWIRDFK 214
AP A +Q QA AQ QA A AQA AP P P PV+ S +RDF+
Sbjct: 547 APFLAAQQNQAAPVQAEAAPAQAQA-APVQAQAVAPPAPEEAQPVPVQLSTEAKHLRDFR 606
Query: 215 RYDPRPFTGSLVDPIEAQMWIVVVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTL 274
+Y+P+ F GS+ +P +AQMW+ +ET F M+CPE+QKV CA F L+ WW + +
Sbjct: 607 KYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCAVFFLEDRGTAWWETAERM 666
Query: 275 LNPEGGPMNWERFKEAFLKEYYLKSERLKRQQEFTHLVQGGLTVEKYNREFNKLKRFAPS 334
L + + WE+FKE F +++ + + + QEF +L QG +TVE+Y+ EF+ L RFAP
Sbjct: 667 LGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAEFDMLSRFAPD 726
Query: 335 MVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRCEEPVII 394
MV E TEKFV GL ++ ++ A P T+ +ALR A L P+ K +
Sbjct: 727 MVRDEAARTEKFVRGLRLDLQGIVRALRPATHADALRIALDLSLPERADASKAAGRGSAL 786
Query: 395 GQKRHELGGSDRPPLARRHRSNNRPAPRWDDRRPPRHTDRNPRNQDGARGR--RE-EGCT 454
GQKR D P R RS + R + A GR RE CT
Sbjct: 787 GQKRKVETQPDVAP-QRTLRSGG--------------VFQRHRRELAAAGRTLRELPACT 846
Query: 455 ICGRLHGGRCMAGSRACYRCGQEGHIAVNCTVGNAAAQANPPRVVEQMDQPAPLRA-QAR 514
CGR+HGGRC+AGS C+RC Q GH A C PR + P P A Q R
Sbjct: 847 TCGRVHGGRCLAGSGVCFRCRQPGHTADMC-----------PRKPFETTPPQPSAAQQGR 906
Query: 515 AYASTSKDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMS 574
+A+T ++ R+ VVTGTL ILGH+AF LFDSGS+HSFIS FV G E+EPL +S
Sbjct: 907 VFATTRQEAERAGTVVTGTLPILGHYAFVLFDSGSSHSFISSVFVQHVGLEVEPLGSVLS 966
Query: 575 VSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDIILGMDWLAENRASIDCRKK 634
VSTP+G L+S++++K +V I N+ L V L+V++M DFD+ILGMDWL+ N A+IDC K
Sbjct: 967 VSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVILGMDWLSANHANIDCYGK 1026
Query: 635 EVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAILACAVDVRGKKETLVNVP 694
EV F+PP +FKF+G PKV+S MKA +L+ QG W ILA VDVR + +L + P
Sbjct: 1027 EVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGILASVVDVREPEVSLSSEP 1086
Query: 695 IVNEFPDVFPNDLPGIPPSRAVDFVIELEPETGPISKAPYRMAPAELKELKAQLQDLLDK 754
+V E+PDVFP++LPG+PP R VDF IELEP T PIS+APYRMAPAELKELK QLQ+LLDK
Sbjct: 1087 VVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDK 1146
Query: 755 GFIRPSVSPWGAPVLFVKKKDASMRLCIDYRELNKRTIKNKYPLPRIEDLFDQLREATVF 814
GFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNK T+KN+YPLPRI+DLFDQL+ ATVF
Sbjct: 1147 GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVF 1206
Query: 815 SKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD 874
SKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD
Sbjct: 1207 SKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKEFLD 1266
Query: 875 MFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKD 934
FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +
Sbjct: 1267 SFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSE 1326
Query: 935 GISVDPTKVEAITKWERPTT------------------------------LTKKGVPFRW 994
G+SVDP K+EA+T W RP+T LT+KG PF W
Sbjct: 1327 GVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW 1386
Query: 995 DDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLK 1054
ACE SFQ LK++LVTAPVL VP+ S +VIYSDAS KGLGCVLMQ GKVVAYASRQLK
Sbjct: 1387 SPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLK 1446
Query: 1055 EYEKNYHTHDLELAAVVFALKIWRHYLYGEKTQIFTNHKSLKYFFTQKELNMRQRRW--- 1114
+E+NY THDLELAAVVFALKIWRHYLYGEK QI+T+HKSLKYFFTQKELNMRQRRW
Sbjct: 1447 IHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLEL 1506
Query: 1115 ------------------------KTVHSSALITREVRVQREFERANIAVATEGVIAQLA 1174
K HS+ALIT++ + R+FERA IAV+ V AQLA
Sbjct: 1507 VKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA 1566
Query: 1175 RLTVQPTLRQRIIASQREDPNLQKVLGQLDESPVDGFSKSSDEELLYQGRLCVPAIEDLR 1234
+LTVQPTLRQ+IIA+Q +DP L + ++ +GFS SSD+ L+++GRLCVP ++
Sbjct: 1567 QLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDDGLMFEGRLCVPEDSAVK 1626
Query: 1235 KEVLMEAHNSPFAMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAG 1294
E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS+CLVCQQVKAPRQ AG
Sbjct: 1627 TELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVSRCLVCQQVKAPRQHPAG 1686
Query: 1295 LLQPLSIPKWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPRKVTYTVDDWAK 1354
LLQPLS+P WKWE+++MDFI GLPKT +GYTVIWVVVDRLTKSAHF+P K TYT W +
Sbjct: 1687 LLQPLSVPGWKWESVSMDFITGLPKTLRGYTVIWVVVDRLTKSAHFVPGKSTYTASKWGQ 1746
Query: 1355 LYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRFDFSTAFHPQTDGQTERLNQI 1387
LY+ EIVRLHGVPVSI+SDRD RFTS FW+GLQ ALGTR DFSTAFHPQTDGQTERLNQI
Sbjct: 1747 LYMTEIVRLHGVPVSIISDRDARFTSKFWKGLQLALGTRLDFSTAFHPQTDGQTERLNQI 1806
BLAST of CmoCh10G011790 vs. TAIR 10
Match:
ATMG00860.1 (DNA/RNA polymerases superfamily protein )
HSP 1 Score: 62.0 bits (149), Expect = 4.3e-09
Identity = 35/124 (28.23%), Postives = 54/124 (43.55%), Query Frame = 0
Query: 883 HLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG--HVVSKDGISVDPTKVEAITKWERPTT 942
HL VL +H+ YA KC F Q+++LG H++S +G+S DP K+EA+ W P
Sbjct: 3 HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62
Query: 943 LTK-----------------------------KGVPFRWDDACEASFQNLKERLVTAPVL 976
T+ K +W + +F+ LK + T PVL
Sbjct: 63 TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVL 122
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q7LHG5 | 1.1e-102 | 32.77 | Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... | [more] |
Q99315 | 3.3e-102 | 32.52 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... | [more] |
P0CT41 | 3.5e-96 | 28.25 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... | [more] |
P0CT34 | 3.5e-96 | 28.25 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... | [more] |
P0CT35 | 3.5e-96 | 28.25 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EYH9 | 0.0e+00 | 54.95 | Reverse transcriptase OS=Cucurbita moschata OX=3662 GN=LOC111440131 PE=4 SV=1 | [more] |
A0A5A7UBS1 | 0.0e+00 | 59.17 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold13... | [more] |
A0A5A7TSL0 | 0.0e+00 | 59.17 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold19... | [more] |
A0A5D3BTN0 | 0.0e+00 | 59.25 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45... | [more] |
A0A5A7UBH7 | 0.0e+00 | 59.17 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold60... | [more] |
Match Name | E-value | Identity | Description | |
ATMG00860.1 | 4.3e-09 | 28.23 | DNA/RNA polymerases superfamily protein | [more] |