Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAGGATGGCCAAGTAATCATAGGGAACCAAACTTTAAGCATTGACCTGATGGTGGTAAACATGACAGATTTTGACGCCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGCTAGTATAGACTGCCGCAAAAAGGAAGTAAAATTTTCACCATCGACAGGACCTACCTTTAAATTTAAAGGCACAAATATCGGGATTACCCCCAAGGTAGTCTCGATGATGAAAGCAAAAAGGTTAGTCCAACAAGGTGGATGGGCTATATTAGCATGTGTTGTAGACGTAAGAGGAAAGGAAAAGACCCTAGTAAATGTGCCAATAGTAAACGAGTTCCCGAATGTATTTCCGGATGACTTATCTGGAATATCCCCTTCCCGAGCGGTCGACTTTGTCATCGAACTCGAGCCGAGAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAAGAACTTAAGGCGCAATTGCAAGACTTACTAGATAAAGGATTCATTCAACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGGATCGATGCGTCTGTGCATCGATTATAGAGAGCTAAACAAGAGAACCGTAAAAAATAAATATCCTCTACCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAGGCAACAATATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTAGTGATGTCATTTGGCCTCACTAATGCCCCAACTGTGTTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTCGTGATTGTGTTCATTGACGACATCCTCATATACTCGAGAACTGACCTAGAGCACGAGGAACACCTCCGAAAAGTCCTTACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGGTTACGACAAGTCTCTTTCCTAGGACACATGGTGTCAAAGGACGAAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAGTGGGAACGCCCAACTACGGTAACGGAAGTAAGGAGTTTCCTAGGATTGGCGGGATATTATCGAAGGTTCATGCAGGACTTCGCTAAAATATCCTCGCCTTTAAAAAAGTTAACAAAAAAAAGGGGTGCCATTTAGATGGGATGATGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCACCCCGGTACTCATAGTACTCGAGAGCTCAGAAGGATATGAGATCTATAGTGATGCCTCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTTGGCCGCTGTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTGTATGGCGAAAAAACCCAAATTTTTACCGACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGTTAAACATGAGGCAGAGAAGGTGGTTAGAATTGGTGAAGGATTATGACGTAGATATCCAGTACCACCTTGGGAAAGCAAATGTGGTTGCAGATGCCTTGAGTAGGAAGACGGTCCACTCGTCGGCCCTCATTACGAGGGAAGTAAGGGTACAAAGGGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCGTAGCACAGCTGGCCCGACTCACGGTACAACCTACGCTTAGGCAGAGAATTATTACCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCCAGTAGATGGATTCTCGAAGTCATCAGATGAAGGACTATTGTATC
AGGGACGCTTATGTGTTCCGGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTTTCATGCATCCAAGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGAGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAAGCTCCAAGACAAAAGGCGGCGGGGTTGTTGCAGCCCCTAAGCATACCGGAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAAGGCTACACAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTCCTACCCGGGAAGGTCACATATACAGTTGACAATTGGGCACAACTGTATGTGAAAGAAATAGTAAGACTACACGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCCCGCTTTACGTCAGCGTTTTGGCGTGGACTTCAAAAAGCACTGGGTACCCGCCTCGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTCCACCTGATGGAATTCTCGTATAACAATAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGAAAACGGTGTAGGTCCCCACTACGTTGGGACAAGGTAGGAGAGAGAGAATTAGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGAAAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAAAAGCTACGCCGATGTAAGGCGTAAAAGCCTGGAGTTTGAGGTGGGGGACCCAGTATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATTTGGGCATAAGGGCAAGTTAAGTCCTAAATTCATTGGACCATTTGAGATCCTAGAGCGAGTTGGTCTAGTAGTGTATAAGTTAGCCTTACCTCCAGCTCTCTCAGGAGTACATGATGTATTTCACGTGTCGATGCTGAGGAAGTACATCACGGATCCTATCCACGTTATAGACTACAAACCACTCCAGCTCAATGAAGATCTGAGCTACGAGGAAAAACCAGTAAGAATCTTAGCTAGAGAAGTAAAAACCTTACGCAACAGGAGCATTGCGTTCGTTAAGGTACTGTGGCGGAATCACCACAGTGAGGAAGCCACGTGGGAGCGTGAGGACGAAATAAGAGAGAAATACCCCGAGTTGGTACAAGAGTTTGAGACTTTCGAGGACGAAAGTTCTTTTTAGGGGTAGATAATGTAACGACCCGGGAAAGAAAGAAAAAAAAAATATATATATATATATACATATAATAACAATAAATAAATAAAATAATAAAATAAAGTAAAAAAAACAAAAAAAACTCAGTCGCCGGAAAACCCGCGAGTTTTCCGGCGACCGCCAAGTTTCGTGGAACCCACACGAAACGACGGCCACAGCACGCCACCCTCAGCCCACGACACCCAGCATCTTCAGAACACTTGCAAAGGGAGAAAAGAGAGAAAATTTTGAGAGAGAGAGAGAGATTTCGGACAAACGTCGGCGAGTGTCCAGTTTCTCCGACGAACCCTCAAACCACCCTCAAACCGACACCAAACGATCTGTTACCACCATAAGAGGGATCCCTAACGAAAGCACAATATCTCAGGGTGCGTTTTGTGTCATTTTGTAAGCGTCGTCGTCGGTAACCGAGGTTAGAAAATTAGGTTTCTTAATCGATTCTTAGATCTGTGGCTTTTAGGAGCTTTTAGGCCGCAAAACTCCCAGAAAGAAAAGGACGAAGAACAAAGAGGAAAAGGGAAGGAAGAGGACCGAAATTGGCAACGAAAACGCCGCGGGAAGGAGAGAGAAAGCTCGCCGGAAAATCGGCCAAAGTCGGGTAAGGAAGACGAGATCCACGGATCCGGG
TCAACCCGAACCCAGAACCCGGGAGAGCCAGCCCGTTCCTCCCTTGGCCTCTGCCTTTGGCCCAGCCCAACTACAGCAGCCCAAACCTCAAACCGGCCCAGAAGCTAAGGTTCAGTTGAACCAACCCAGACTAGACCCACGGTCCAATTGAGCCGGCCTGCAAACCAAACTCCTGCGGTCCAGCCCAGTAAGCCGAGACCCAACGATTAGTCTTGGCCTAATAGCCTTCCACAGCCCACGACCCGAACCACGACCTGAATCGGACTGCGACCTGCGCTTCGACCCGACACTGCCGCCCGACATGGCTTTCTCTGCTACTGCCACGTGTCACACAGCGGCGCAGGCAGTCCCTCAGCTCGGCTCGGCGCAAATGGCTCGGGCTCCCTCCAACGATGCTCCGGCGGCGTTCGGCTCGGCTCACCTATTTTCAGCTCGATTTCCACTGTTCCGACCCTCCCAAATCTGTTTTCGGCCCCGATTAAGGTATCTGATGCCGTTTCAGAGCCGTATTTCACATTTATTAAATTATAGGTAAATTAATTGTGAAATACTAACGAAATATGGTGGACGGTGTGATAGGAAACAAACCCCGGGGACGTAGGGAGCAGCTCTCACGGAACGGGAGCTTACGTTTAGCGATTTAGGGAGTACTTCGGCCAAGGTCAACCAAGTAAGTGGCCTTACTATAGGATAGGCTAAATAAATTGTATGATGCTTGATGTTATGTGCCAGTTTATACATATTTTGTTGAGTGACGCTTGATATGTTTGACGCACGTTGTGGCTTTATGATGAGCATGATGATTGCATGATTTTTACCTTAATATGATGATGTTTTCCATATTGAGCATGCTAGATGATGATGAGTGTCATATTGCATCATGTCGTTAGATCGACATAGGACACAACCCTAAGAGCATGAAAATGATAGTAATATAAATATCCAGGAGTATGCGTTACCTAGAGTAACAAAGAGATGAGACTAGAGGGTTGTGTCAGAAGAGACATTACGATGGATTTAGAGGGACCTCATGCATTTTGTATGTTCATAAGCATAGGGCTACTTCCCCTAGAGATGATGAGTGCGGACGCGCACCTTACGATGAGCGCGAATGCGCACAAGTGCAAAAGCACACAGATGAGAGTGTTCATGAGGCGCATGACACTATGGGGTTCCGCTGACCTCCGGACGTCGCTACAGATTAGCTTGACCAGAGGGTCCAGGGGGTGTGCGAGCACCCTGGGGACTCACATTCACACGTGTGAGTCGTGTGTAGGGAAGTACTACACATCCAATTTGTCCGAGATTGGAGGCCACCCCTAAGATGATTAGAGATAGGTCCCTATTCATGATTGCATGTGTTTGCATTAGCATGGCCCCTATAGTGGGGTCACTTACTGAGTATTTCTTCGGCCGTGTGCCATATTATTTTTTTTTTTTCAGGTAAAGGCAAGGCGCCCATGTACGGTTGACGGTGGCATCGTGATCAGAGACTGTGGCGCGTGCATAGGATAGTTGCATATTTAATTCCTAGTCTTGGTTAGGATAGGGCGTTTGCATTTCATTCATTTATATTAATTAATTTGTAATCGTTTTATTTTATCTTTTTGAACTCCAGCACAATGTTTGAAACACGTAAGTCCGGTAGTGTTTTTCAATGTTTTAATGTATTCTGAAGTTTTAAATTTTTCCGCTAATAAAGATGAGCATGCTTAGATTTTTATTCTGCATTAGAGATGAGCATGAGACGTTAGTGGCGACTCTAGATATGTGGAAATTTAGGGTCGTTACAGTTGGTATCAGAGCTCTAGGNNNNNNNNNNCGGAAAATCGGCCAAAGTCGGGTAAGGAAGACGAGATCCACGGATCCGGGTCAACCCGAACCCAGAACCCGGGAGAGCCAGCCCGTTCCTCCCTTGGCCTCTGCCTTTGGCCCAGCCCAACTACAGCAGCCCAAACCTCAAACCGGCCCAGAAGCTAAGGTTCAGTTGAACCAACCCA
GACTAGACCCACGGTCCAATTGAGCCGGCCTGCAAACCAAACTCCTGCGGTCCAGCCCAGTAAGCCGAGACCCAACGATTAGTCTTGGCCTAATAGCCTTCCACAGCCCACGACCCGAACCACGACCTGAATCGGACTGCGACCTGCGCTTCGACCCGACACTGCCGCCCGACATGGCTTTCTCTGCTACTGCCACGTGTCACACAGCGGCGCAGGCAGTCCCTCAGCTCGGCTCGGCGCAAATGGCTCGGGCTCCCTCCAACGATGCTCCGGCGGCGTTCGGCTCGGCTCACCTATTTTCAGCTCGATTTCCACTGTTCCGACCCTCCCAAATCTGTTTTCGGCCCCGATTAAGGTATCTGATGCCGTTTCAGAGCCGTATTTCACATTTATTAAATTATAGGTAAATTAATTGTGAAATACTAACGAAATATGGTGGACGGTGTGATAGGAAACAAACCCCGGGGACGTAGGGAGCAGCTCTCACGGAACGGGAGCTTACGTTTAGCGATTTAGGGAGTACTTCGGCCAAGGTCAACCAAGTAAGTGGCCTTACTATAGGATAGGCTAAATAAATTGTATGATGCTTGATGTTATGTGCCAGTTTATACATATTTTGTTGAGTGACGCTTGATATGTTTGACGCACGTTGTGGCTTTATGATGAGCATGATGATTGCATGATTTTTACCTTAATATGATGATGTTTTCCATATTGAGCATGCTAGATGATGATGAGTGTCATATTGCATCATGTCGTTAGATCGACATAGGACACAACCCTAAGAGCATGAAAATGATAGTAATATAAATATCCAGGAGTATGCGTTACCTAGAGTAACAAAGAGATGAGACTAGAGGGTTGTGTCAGAAGAGACATTACGATGGATTTAGAGGGACCTCATGCATTTTGTATGTTCATAAGCATAGGGCTACTTCCCCTAGAGATGATGAGTGCGGACGCGCACCTTACGATGAGCGCGAATGCGCACAAGTGCAAAAGCACACAGATGAGAGTGTTCATGAGGCGCATGACACTATGGGGTTCCGCTGA
mRNA sequence
ATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAGGATGGCCAAGTAATCATAGGGAACCAAACTTTAAGCATTGACCTGATGGTGGTAAACATGACAGATTTTGACGCCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGCTAGTATAGACTGCCGCAAAAAGGAAGTAAAATTTTCACCATCGACAGGACCTACCTTTAAATTTAAAGGCACAAATATCGGGATTACCCCCAAGGTAGTCTCGATGATGAAAGCAAAAAGGTTAGTCCAACAAGGTGGATGGGCTATATTAGCATGTGTTGTAGACGTAAGAGGAAAGGAAAAGACCCTAGTAAATGTGCCAATAGTAAACGAGTTCCCGAATGTATTTCCGGATGACTTATCTGGAATATCCCCTTCCCGAGCGGTCGACTTTGTCATCGAACTCGAGCCGAGAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAAGAACTTAAGGCGCAATTGCAAGACTTACTAGATAAAGGATTCATTCAACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGGATCGATGCGTCTGTGCATCGATTATAGAGAGCTAAACAAGAGAACCGTAAAAAATAAATATCCTCTACCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAGGCAACAATATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTAGTGATGTCATTTGGCCTCACTAATGCCCCAACTGTGTTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTCGTGATTGTGTTCATTGACGACATCCTCATATACTCGAGAACTGACCTAGAGCACGAGGAACACCTCCGAAAAGTCCTTACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGGTTACGACAAGTCTCTTTCCTAGGACACATGGTGTCAAAGGACGAAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAGTGGGAACGCCCAACTACGAACCTAAAAGAGAGATTGGTAACCACCCCGGTACTCATAGTACTCGAGAGCTCAGAAGGATATGAGATCTATAGTGATGCCTCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTTGGCCGCTGTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTGTATGGCGAAAAAACCCAAATTTTTACCGACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGTTAAACATGAGGCAGAGAAGGTGGTTAGAATTGGTGAAGGATTATGACGTAGATATCCAGTACCACCTTGGGAAAGCAAATGTGGTTGCAGATGCCTTGAGTAGGAAGACGGTCCACTCGTCGGCCCTCATTACGAGGGAAGTAAGGGTACAAAGGGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCGTAGCACAGCTGGCCCGACTCACGGTACAACCTACGCTTAGGCAGAGAATTATTACCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCCAGTAGATGGATTCTCGAAGTCATCAGATGAAGGACTATTGTATCAGGGACGCTTATGTGTTCCGGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTTTCATGCATCCAAGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGAGA
TGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAAGCTCCAAGACAAAAGGCGGCGGGGTTGTTGCAGCCCCTAAGCATACCGGAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAAGGCTACACAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTCCTACCCGGGAAGGTCACATATACAGTTGACAATTGGGCACAACTGTATGTGAAAGAAATAGTAAGACTACACGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCCCGCTTTACGTCAGCGTTTTGGCGTGGACTTCAAAAAGCACTGGGTACCCGCCTCGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTCCACCTGATGGAATTCTCGTATAACAATAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGAAAACGGTGTAGGTCCCCACTACGTTGGGACAAGGTAGGAGAGAGAGAATTAGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGAAAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAAAAGCTACGCCGATGTAAGGCGTAAAAGCCTGGAGTTTGAGGTGGGGGACCCAGTATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATTTGGGCATAAGGGCAAGTTAAGTCCTAAATTCATTGGACCATTTGAGATCCTAGAGCGAGTTGGTCTAGTAGTGTATAAGTTAGCCTTACCTCCAGCTCTCTCAGGAGTACATGATGTATTTCACGTGTCGATGCTGAGGAAGTACATCACGGATCCTATCCACGTTATAGACTACAAACCACTCCAGCTCAATGAAGATCTGAGCTACGAGGAAAAACCAGTAAGAATCTTAGCTAGAGAAGTAAAAACCTTACGCAACAGGAGCATTGCGTTCGTTAAGGAGCTTTTAGGCCGCAAAACTCCCAGAAAGAAAAGGACGAAGAACAAAGAGGAAAAGGGAAGGAAGAGGACCGAAATTGGCAACGAAAACGCCGCGGGAAGGAGAGAGAAAGCTCGCCGGAAAATCGGCCAAAGTCGGCCCAACTACAGCAGCCCAAACCTCAAACCGGCCCAGAAGCTAAGCCCACGACCCGAACCACGACCTGAATCGGACTGCGACCTGCGCTTCGACCCGACACTGCCGCCCGACATGGCTTTCTCTGCTACTGCCACGTGTCACACAGCGGCGCAGGCAGTCCCTCAGCTCGGCTCGGCGCAAATGGCTCGGGCTCCCTCCAACGATGCTCCGGCGGCGTTCGGCTCGGCTCACCTATTTTCAGCTCGATTTCCACTGTTCCGACCCTCCCAAATCTGTTTTCGGCCCCGATTAAGCCCACGACCCGAACCACGACCTGAATCGGACTGCGACCTGCGCTTCGACCCGACACTGCCGCCCGACATGGCTTTCTCTGCTACTGCCACGTGTCACACAGCGGCGCAGGCAGTCCCTCAGCTCGGCTCGGCGCAAATGGCTCGGGCTCCCTCCAACGATGCTCCGGCGGCGTTCGGCTCGGCTCACCTATTTTCAGCTCGATTTCCACTGTTCCGACCCTCCCAAATCTGTTTTCGGCCCCGATTAAGGGCTACTTCCCCTAGAGATGATGAGTGCGGACGCGCACCTTACGATGAGCGCGAATGCGCACAAGTGCAAAAGCACACAGATGAGAGTGTTCATGAGGCGCATGACACTATGGGGTTCCGCTGA
Coding sequence (CDS)
ATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAGGATGGCCAAGTAATCATAGGGAACCAAACTTTAAGCATTGACCTGATGGTGGTAAACATGACAGATTTTGACGCCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGCTAGTATAGACTGCCGCAAAAAGGAAGTAAAATTTTCACCATCGACAGGACCTACCTTTAAATTTAAAGGCACAAATATCGGGATTACCCCCAAGGTAGTCTCGATGATGAAAGCAAAAAGGTTAGTCCAACAAGGTGGATGGGCTATATTAGCATGTGTTGTAGACGTAAGAGGAAAGGAAAAGACCCTAGTAAATGTGCCAATAGTAAACGAGTTCCCGAATGTATTTCCGGATGACTTATCTGGAATATCCCCTTCCCGAGCGGTCGACTTTGTCATCGAACTCGAGCCGAGAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAAGAACTTAAGGCGCAATTGCAAGACTTACTAGATAAAGGATTCATTCAACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGGATCGATGCGTCTGTGCATCGATTATAGAGAGCTAAACAAGAGAACCGTAAAAAATAAATATCCTCTACCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAGGCAACAATATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTAGTGATGTCATTTGGCCTCACTAATGCCCCAACTGTGTTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTCGTGATTGTGTTCATTGACGACATCCTCATATACTCGAGAACTGACCTAGAGCACGAGGAACACCTCCGAAAAGTCCTTACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGGTTACGACAAGTCTCTTTCCTAGGACACATGGTGTCAAAGGACGAAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAGTGGGAACGCCCAACTACGAACCTAAAAGAGAGATTGGTAACCACCCCGGTACTCATAGTACTCGAGAGCTCAGAAGGATATGAGATCTATAGTGATGCCTCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTTGGCCGCTGTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTGTATGGCGAAAAAACCCAAATTTTTACCGACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGTTAAACATGAGGCAGAGAAGGTGGTTAGAATTGGTGAAGGATTATGACGTAGATATCCAGTACCACCTTGGGAAAGCAAATGTGGTTGCAGATGCCTTGAGTAGGAAGACGGTCCACTCGTCGGCCCTCATTACGAGGGAAGTAAGGGTACAAAGGGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCGTAGCACAGCTGGCCCGACTCACGGTACAACCTACGCTTAGGCAGAGAATTATTACCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCCAGTAGATGGATTCTCGAAGTCATCAGATGAAGGACTATTGTATCAGGGACGCTTATGTGTTCCGGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTTTCATGCATCCAAGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGAGA
TGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAAGCTCCAAGACAAAAGGCGGCGGGGTTGTTGCAGCCCCTAAGCATACCGGAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAAGGCTACACAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTCCTACCCGGGAAGGTCACATATACAGTTGACAATTGGGCACAACTGTATGTGAAAGAAATAGTAAGACTACACGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCCCGCTTTACGTCAGCGTTTTGGCGTGGACTTCAAAAAGCACTGGGTACCCGCCTCGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTCCACCTGATGGAATTCTCGTATAACAATAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGAAAACGGTGTAGGTCCCCACTACGTTGGGACAAGGTAGGAGAGAGAGAATTAGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGAAAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAAAAGCTACGCCGATGTAAGGCGTAAAAGCCTGGAGTTTGAGGTGGGGGACCCAGTATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATTTGGGCATAAGGGCAAGTTAAGTCCTAAATTCATTGGACCATTTGAGATCCTAGAGCGAGTTGGTCTAGTAGTGTATAAGTTAGCCTTACCTCCAGCTCTCTCAGGAGTACATGATGTATTTCACGTGTCGATGCTGAGGAAGTACATCACGGATCCTATCCACGTTATAGACTACAAACCACTCCAGCTCAATGAAGATCTGAGCTACGAGGAAAAACCAGTAAGAATCTTAGCTAGAGAAGTAAAAACCTTACGCAACAGGAGCATTGCGTTCGTTAAGGAGCTTTTAGGCCGCAAAACTCCCAGAAAGAAAAGGACGAAGAACAAAGAGGAAAAGGGAAGGAAGAGGACCGAAATTGGCAACGAAAACGCCGCGGGAAGGAGAGAGAAAGCTCGCCGGAAAATCGGCCAAAGTCGGCCCAACTACAGCAGCCCAAACCTCAAACCGGCCCAGAAGCTAAGCCCACGACCCGAACCACGACCTGAATCGGACTGCGACCTGCGCTTCGACCCGACACTGCCGCCCGACATGGCTTTCTCTGCTACTGCCACGTGTCACACAGCGGCGCAGGCAGTCCCTCAGCTCGGCTCGGCGCAAATGGCTCGGGCTCCCTCCAACGATGCTCCGGCGGCGTTCGGCTCGGCTCACCTATTTTCAGCTCGATTTCCACTGTTCCGACCCTCCCAAATCTGTTTTCGGCCCCGATTAAGCCCACGACCCGAACCACGACCTGAATCGGACTGCGACCTGCGCTTCGACCCGACACTGCCGCCCGACATGGCTTTCTCTGCTACTGCCACGTGTCACACAGCGGCGCAGGCAGTCCCTCAGCTCGGCTCGGCGCAAATGGCTCGGGCTCCCTCCAACGATGCTCCGGCGGCGTTCGGCTCGGCTCACCTATTTTCAGCTCGATTTCCACTGTTCCGACCCTCCCAAATCTGTTTTCGGCCCCGATTAAGGGCTACTTCCCCTAGAGATGATGAGTGCGGACGCGCACCTTACGATGAGCGCGAATGCGCACAAGTGCAAAAGCACACAGATGAGAGTGTTCATGAGGCGCATGACACTATGGGGTTCCGCTGA
Protein sequence
MPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSIDLMVVNMTDFDAILGMDWLAENRASIDCRKKEVKFSPSTGPTFKFKGTNIGITPKVVSMMKAKRLVQQGGWAILACVVDVRGKEKTLVNVPIVNEFPNVFPDDLSGISPSRAVDFVIELEPRTGPISKAPYRMAPAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLREATIFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPTVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHMVSKDEISVDPTKVEAITKWERPTTNLKERLVTTPVLIVLESSEGYEIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHLGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVVAQLARLTVQPTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKAAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLRWDKVGERELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRFGHKGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYKPLQLNEDLSYEEKPVRILAREVKTLRNRSIAFVKELLGRKTPRKKRTKNKEEKGRKRTEIGNENAAGRREKARRKIGQSRPNYSSPNLKPAQKLSPRPEPRPESDCDLRFDPTLPPDMAFSATATCHTAAQAVPQLGSAQMARAPSNDAPAAFGSAHLFSARFPLFRPSQICFRPRLSPRPEPRPESDCDLRFDPTLPPDMAFSATATCHTAAQAVPQLGSAQMARAPSNDAPAAFGSAHLFSARFPLFRPSQICFRPRLRATSPRDDECGRAPYDERECAQVQKHTDESVHEAHDTMGFR
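The protein sequence above can be related back to the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of any tool used to produce this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons that contain ambiguous bases (for example N) become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCCTTTTGTTGTACAAGCAGGG"))  # MPFVVQAG
```

Applied to the first 24 bases of the CDS, the sketch reproduces the first eight residues of the listed protein (MPFVVQAG). Passing the full CDS, with its line breaks removed, should allow the whole polypeptide to be checked in the same way.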
Homology
BLAST of CmoCh11G012340 vs. ExPASy Swiss-Prot
Match:
Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)
HSP 1 Score: 429.5 bits (1103), Expect = 1.4e-118
Identity = 292/903 (32.34%), Positives = 441/903 (48.84%), Query Frame = 0
Query: 149 PDDLSGISPSRAVDFVIELEPRTGPISKAPYRMAPAELKELKAQLQDLLDKGFIQPSVSP 208
P D++ I V IE++P PY + +E+ +Q LLD FI PS SP
Sbjct: 576 PADINNI----PVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSP 635
Query: 209 WGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLREATIFSKIDLRSGY 268
+PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRI++L ++ A IF+ +DL SGY
Sbjct: 636 CSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGY 695
Query: 269 HQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPTVFMELMNRVFKECLDMFVIVFIDD 328
HQI + KD KTAF T G YE+ VM FGL NAP+ F M F++ FV V++DD
Sbjct: 696 HQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLDD 755
Query: 329 ILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHMVSKDEISVDPTKV 388
ILI+S + EH +HL VL L+ L K KC+F + FLG+ + +I+ K
Sbjct: 756 ILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKC 815
Query: 389 EAITKWERPTT-----------------------------------------------NL 448
AI + P T L
Sbjct: 816 AAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQWTEKQDKAIDKL 875
Query: 449 KERLVTTPVLIVLESSEGYEIYSDASMKGLGCVLMQHGK------VVAYASRQLKEYEKN 508
K+ L +PVL+ + Y + +DAS G+G VL + VV Y S+ L+ +KN
Sbjct: 876 KDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKN 935
Query: 509 YPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYD 568
YP +LEL ++ AL +R+ L+G+ + TDH SL + E R +RWL+ + YD
Sbjct: 936 YPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYD 995
Query: 569 VDIQYHLGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVVAQLARLTVQ 628
++Y G NVVADA+SR + +R + + V+ + LT Q
Sbjct: 996 FTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELT-Q 1055
Query: 629 PTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILM 688
+ +++ R + QK L +L E+ +S DE + YQ RL VP I+ + +
Sbjct: 1056 HNVTPEDMSAFR---SYQKKL-ELSETFRKNYS-LEDEMIYYQDRLVVP-IKQQNAVMRL 1115
Query: 689 EAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKAAGLLQPL 748
++ F H T + ++W ++ + ++ C+ CQ +K+ R + GLLQPL
Sbjct: 1116 YHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL 1175
Query: 749 SIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKE 808
I E +W +I+MDF+ GLP T +I VVVDR +K AHF+ + T L +
Sbjct: 1176 PIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRY 1235
Query: 809 IVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDML 868
I HG P +I SDRD R T+ ++ L K LG + S+A HPQTDGQ+ER Q L +L
Sbjct: 1236 IFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLL 1295
Query: 869 RACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPL--RWDKVGERELV 928
RA ++W L +EF YN++ T+G +PFE G +P D+V R
Sbjct: 1296 RAYASTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFT 1355
Query: 929 GPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRFGH 988
EL + + + ++ AQ ++ + RRK L +GD V + + G
Sbjct: 1356 AVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH---RDAYFKKGA 1415
Query: 989 KGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYKPLQ 997
K+ ++GPF +++++ Y+L L + H V +V L+K++ P KP+
Sbjct: 1416 YMKVQQIYVGPFRVVKKINDNAYELDL-NSHKKKHRVINVQFLKKFVYRPDAYPKNKPIS 1461
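The identity and positives figures reported for each HSP are simple ratios over the alignment length, so they can be checked mechanically. The sketch below is an assumption-laden illustration, not part of BLAST itself: `parse_hsp_stats` and `recomputed_percent` are hypothetical helper names, and the regular expression is written only against the summary-line layout shown in this report.

```python
import re

# Matches "Identity = 292/903 (32.34%)" and the corresponding positives
# field. "Posi?tives" also tolerates the misspelling "Postives" that
# appears in some report dumps.
HSP_STATS = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\((\d+(?:\.\d+)?)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return {'identity': (n, length, pct), 'positives': (n, length, pct)}."""
    stats = {}
    for label, matches, length, pct in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length), float(pct))
    return stats


def recomputed_percent(matches: int, length: int) -> float:
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)


# Summary line of the first Swiss-Prot hit (Q99315).
line = "Identity = 292/903 (32.34%), Positives = 441/903 (48.84%), Query Frame = 0"
stats = parse_hsp_stats(line)
for name, (matches, length, reported) in stats.items():
    print(name, reported, recomputed_percent(matches, length))
```

For the Q99315 hit, both recomputed values agree with the reported ones: 292 identical columns out of 903 is 32.34%, and 441 conservative or identical columns out of 903 is 48.84%.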
BLAST of CmoCh11G012340 vs. ExPASy Swiss-Prot
Match:
Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)
HSP 1 Score: 422.9 bits (1086), Expect = 1.3e-116
Identity = 294/924 (31.82%), Positives = 448/924 (48.48%), Query Frame = 0
Query: 149 PDDLSGISPSRAVDFVIELEPRTGPISKAPYRMAPAELKELKAQLQDLLDKGFIQPSVSP 208
P D++ I V IE++P PY + +E+ +Q LLD FI PS SP
Sbjct: 602 PADINNI----PVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSP 661
Query: 209 WGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLREATIFSKIDLRSGY 268
+PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRI++L ++ A IF+ +DL SGY
Sbjct: 662 CSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGY 721
Query: 269 HQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPTVFMELMNRVFKECLDMFVIVFIDD 328
HQI + KD KTAF T G YE+ VM FGL NAP+ F M F++ FV V++DD
Sbjct: 722 HQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLDD 781
Query: 329 ILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHMVSKDEISVDPTKV 388
ILI+S + EH +HL VL L+ L K KC+F + FLG+ + +I+ K
Sbjct: 782 ILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKC 841
Query: 389 EAITKWERPTT-----------------------------------------------NL 448
AI + P T L
Sbjct: 842 AAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQWTEKQDKAIEKL 901
Query: 449 KERLVTTPVLIVLESSEGYEIYSDASMKGLGCVLMQHGK------VVAYASRQLKEYEKN 508
K L +PVL+ + Y + +DAS G+G VL + VV Y S+ L+ +KN
Sbjct: 902 KAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKN 961
Query: 509 YPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYD 568
YP +LEL ++ AL +R+ L+G+ + TDH SL + E R +RWL+ + YD
Sbjct: 962 YPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYD 1021
Query: 569 VDIQYHLGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVVAQLARLTVQ 628
++Y G NVVADA+SR + +R + + V+ + LT Q
Sbjct: 1022 FTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELT-Q 1081
Query: 629 PTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILM 688
+ +++ R + QK L +L E+ +S DE + YQ RL VP I+ + +
Sbjct: 1082 HNVTPEDMSAFR---SYQKKL-ELSETFRKNYS-LEDEMIYYQDRLVVP-IKQQNAVMRL 1141
Query: 689 EAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKAAGLLQPL 748
++ F H T + ++W ++ + ++ C+ CQ +K+ R + GLLQPL
Sbjct: 1142 YHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL 1201
Query: 749 SIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKE 808
I E +W +I+MDF+ GLP T +I VVVDR +K AHF+ + T L +
Sbjct: 1202 PIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRY 1261
Query: 809 IVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDML 868
I HG P +I SDRD R T+ ++ L K LG + S+A HPQTDGQ+ER Q L +L
Sbjct: 1262 IFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLL 1321
Query: 869 RACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPL--RWDKVGERELV 928
RA V ++W L +EF YN++ T+G +PFE G +P D+V R
Sbjct: 1322 RAYVSTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFT 1381
Query: 929 GPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRFGH 988
EL + + + ++ AQ ++ + RRK L +GD V + + G
Sbjct: 1382 AVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH---RDAYFKKGA 1441
Query: 989 KGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYKPLQ 1018
K+ ++GPF +++++ Y+L L + H V +V L+ ++ + + +
Sbjct: 1442 YMKVQQIYVGPFRVVKKINDNAYELDL-NSHKKKHRVINVQFLKS-----LYTVQTRTQR 1498
BLAST of CmoCh11G012340 vs. ExPASy Swiss-Prot
Match:
P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)
HSP 1 Score: 408.7 bits (1049), Expect = 2.5e-112
Identity = 259/887 (29.20%), Positives = 437/887 (49.27%), Query Frame = 0
Query: 157 PSRAVDFVIELEPRTGPISKAPYRMAPAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFV 216
P + ++F +EL + Y + P +++ + ++ L G I+ S + PV+FV
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 217 KKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLREATIFSKIDLRSGYHQIRINEK 276
KK+G++R+ +DY+ LNK N YPLP IE L +++ +TIF+K+DL+S YH IR+ +
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 277 DVPKTAFRTRYGHYEFVVMSFGLTNAPTVFMELMNRVFKECLDMFVIVFIDDILIYSRTD 336
D K AFR G +E++VM +G++ AP F +N + E + V+ ++DDILI+S+++
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSE 575
Query: 337 LEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHMVSKDEISVDPTKVEAITKWER 396
EH +H++ VL L+ L +KCEF QV F+G+ +S+ + ++ + +W++
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 397 PTT-------------------------------------------------NLKERLVT 456
P N+K+ LV+
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVS 695
Query: 457 TPVLIVLESSEGYEIYSDASMKGLGCVLMQHGK-----VVAYASRQLKEYEKNYPTHDLE 516
PVL + S+ + +DAS +G VL Q V Y S ++ + + NY D E
Sbjct: 696 PPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKE 755
Query: 517 LAAVVFALKIWRHYLYG--EKTQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDVDI 576
+ A++ +LK WRHYL E +I TDH++L T + N R RW ++D++ +I
Sbjct: 756 MLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 577 QYHLGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVVAQLARLTVQPTL 636
Y G AN +ADALSR ++ + ++ E +I + +++
Sbjct: 816 NYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINFVNQ--------ISITDDF 875
Query: 637 RQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILME 696
+ +++T D L +L D+ + +GLL + + +P L + I+ +
Sbjct: 876 KNQVVTEYTNDTKLLNLLNNEDKRVEENIQLK--DGLLINSKDQILLPNDTQLTRTIIKK 935
Query: 697 AHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKAAGLLQPLS 756
H +HP + + + F WK +++ + +V C CQ K+ K G LQP+
Sbjct: 936 YHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIP 995
Query: 757 IPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEI 816
E WE+++MDFI LP++ GY ++VVVDR +K A +P + T + A+++ + +
Sbjct: 996 PSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRV 1055
Query: 817 VRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLR 876
+ G P I++D D FTS W+ + FS + PQTDGQTER NQ +E +LR
Sbjct: 1056 IAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLR 1115
Query: 877 ACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYG-KRCRSPLRWDKVGERELVGP 936
+W + L++ SYNN+ + M PFE ++ SPL ++
Sbjct: 1116 CVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT---D 1175
Query: 937 ELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRFGHK 980
E + T + Q ++ + T + K Y D++ + + EF+ GD V +K + F HK
Sbjct: 1176 ENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTGFLHK 1235
BLAST of CmoCh11G012340 vs. ExPASy Swiss-Prot
Match:
P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)
HSP 1 Score: 408.7 bits (1049), Expect = 2.5e-112
Identity = 259/887 (29.20%), Positives = 437/887 (49.27%), Query Frame = 0
Query: 157 PSRAVDFVIELEPRTGPISKAPYRMAPAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFV 216
P + ++F +EL + Y + P +++ + ++ L G I+ S + PV+FV
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 217 KKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLREATIFSKIDLRSGYHQIRINEK 276
KK+G++R+ +DY+ LNK N YPLP IE L +++ +TIF+K+DL+S YH IR+ +
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 277 DVPKTAFRTRYGHYEFVVMSFGLTNAPTVFMELMNRVFKECLDMFVIVFIDDILIYSRTD 336
D K AFR G +E++VM +G++ AP F +N + E + V+ ++DDILI+S+++
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSE 575
Query: 337 LEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHMVSKDEISVDPTKVEAITKWER 396
EH +H++ VL L+ L +KCEF QV F+G+ +S+ + ++ + +W++
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 397 PTT-------------------------------------------------NLKERLVT 456
P N+K+ LV+
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVS 695
Query: 457 TPVLIVLESSEGYEIYSDASMKGLGCVLMQHGK-----VVAYASRQLKEYEKNYPTHDLE 516
PVL + S+ + +DAS +G VL Q V Y S ++ + + NY D E
Sbjct: 696 PPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKE 755
Query: 517 LAAVVFALKIWRHYLYG--EKTQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDVDI 576
+ A++ +LK WRHYL E +I TDH++L T + N R RW ++D++ +I
Sbjct: 756 MLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 577 QYHLGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVVAQLARLTVQPTL 636
Y G AN +ADALSR ++ + ++ E +I + +++
Sbjct: 816 NYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINFVNQ--------ISITDDF 875
Query: 637 RQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILME 696
+ +++T D L +L D+ + +GLL + + +P L + I+ +
Sbjct: 876 KNQVVTEYTNDTKLLNLLNNEDKRVEENIQLK--DGLLINSKDQILLPNDTQLTRTIIKK 935
Query: 697 AHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKAAGLLQPLS 756
H +HP + + + F WK +++ + +V C CQ K+ K G LQP+
Sbjct: 936 YHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIP 995
Query: 757 IPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEI 816
E WE+++MDFI LP++ GY ++VVVDR +K A +P + T + A+++ + +
Sbjct: 996 PSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRV 1055
Query: 817 VRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLR 876
+ G P I++D D FTS W+ + FS + PQTDGQTER NQ +E +LR
Sbjct: 1056 IAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLR 1115
Query: 877 ACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYG-KRCRSPLRWDKVGERELVGP 936
+W + L++ SYNN+ + M PFE ++ SPL ++
Sbjct: 1116 CVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT---D 1175
Query: 937 ELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRFGHK 980
E + T + Q ++ + T + K Y D++ + + EF+ GD V +K + F HK
Sbjct: 1176 ENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTGFLHK 1235
BLAST of CmoCh11G012340 vs. ExPASy Swiss-Prot
Match:
P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)
HSP 1 Score: 408.7 bits (1049), Expect = 2.5e-112
Identity = 259/887 (29.20%), Positives = 437/887 (49.27%), Query Frame = 0
Query: 157 PSRAVDFVIELEPRTGPISKAPYRMAPAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFV 216
P + ++F +EL + Y + P +++ + ++ L G I+ S + PV+FV
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 217 KKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLREATIFSKIDLRSGYHQIRINEK 276
KK+G++R+ +DY+ LNK N YPLP IE L +++ +TIF+K+DL+S YH IR+ +
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 277 DVPKTAFRTRYGHYEFVVMSFGLTNAPTVFMELMNRVFKECLDMFVIVFIDDILIYSRTD 336
D K AFR G +E++VM +G++ AP F +N + E + V+ ++DDILI+S+++
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSE 575
Query: 337 LEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHMVSKDEISVDPTKVEAITKWER 396
EH +H++ VL L+ L +KCEF QV F+G+ +S+ + ++ + +W++
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 397 PTT-------------------------------------------------NLKERLVT 456
P N+K+ LV+
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVS 695
Query: 457 TPVLIVLESSEGYEIYSDASMKGLGCVLMQHGK-----VVAYASRQLKEYEKNYPTHDLE 516
PVL + S+ + +DAS +G VL Q V Y S ++ + + NY D E
Sbjct: 696 PPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKE 755
Query: 517 LAAVVFALKIWRHYLYG--EKTQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDVDI 576
+ A++ +LK WRHYL E +I TDH++L T + N R RW ++D++ +I
Sbjct: 756 MLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEI 815
Query: 577 QYHLGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVVAQLARLTVQPTL 636
Y G AN +ADALSR ++ + ++ E +I + +++
Sbjct: 816 NYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINFVNQ--------ISITDDF 875
Query: 637 RQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILME 696
+ +++T D L +L D+ + +GLL + + +P L + I+ +
Sbjct: 876 KNQVVTEYTNDTKLLNLLNNEDKRVEENIQLK--DGLLINSKDQILLPNDTQLTRTIIKK 935
Query: 697 AHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKAAGLLQPLS 756
H +HP + + + F WK +++ + +V C CQ K+ K G LQP+
Sbjct: 936 YHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIP 995
Query: 757 IPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEI 816
E WE+++MDFI LP++ GY ++VVVDR +K A +P + T + A+++ + +
Sbjct: 996 PSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRV 1055
Query: 817 VRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLR 876
+ G P I++D D FTS W+ + FS + PQTDGQTER NQ +E +LR
Sbjct: 1056 IAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLR 1115
Query: 877 ACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYG-KRCRSPLRWDKVGERELVGP 936
+W + L++ SYNN+ + M PFE ++ SPL ++
Sbjct: 1116 CVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT---D 1175
Query: 937 ELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRFGHK 980
E + T + Q ++ + T + K Y D++ + + EF+ GD V +K + F HK
Sbjct: 1176 ENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTGFLHK 1235
BLAST of CmoCh11G012340 vs. ExPASy TrEMBL
Match:
A0A6J1EYH9 (Reverse transcriptase OS=Cucurbita moschata OX=3662 GN=LOC111440131 PE=4 SV=1)
HSP 1 Score: 1505.7 bits (3897), Expect = 0.0e+00
Identity = 847/1474 (57.46%), Positives = 905/1474 (61.40%), Query Frame = 0
Query: 2 PFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSIDLMVVNMTDFDAI 61
PF+ QAGF +EPL+H +SV TPAGVDLV++DRV+DGQV+I QT+ +DL VV+MTDFD I
Sbjct: 377 PFIKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVI 436
Query: 62 LGMDWLAENRASIDCRKKEVKFSPSTGPTFKFKGTNIGITPKVVSMMKAKRLVQQGGWAI 121
LGMDWLAEN A+IDC KKEV F+P G TFKFKGT+ G TPK++SMMKA+RL+QQGGWA
Sbjct: 437 LGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAF 496
Query: 122 LACVVDVRGKEKTLVNVPIVNEFPNVFPDDLSGISPSRAVDFVIELEPRTGPISKAPYRM 181
LA V+ +GKEK + +P+VNEF +VFP+DL GI PSR VDF I+LE TGPISKAPYRM
Sbjct: 497 LAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLELGTGPISKAPYRM 556
Query: 182 APAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKY 241
APAELKELK QLQDLLD KD SMRLCI YRELNKRTVKNKY
Sbjct: 557 APAELKELKTQLQDLLD--------------------KDDSMRLCIGYRELNKRTVKNKY 616
Query: 242 PLPRIEDLFDQLREATIFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTN 301
PLPRIEDLFDQLR AT+FSKIDLRSGYHQI+I +D+PKTAFRTRYGHYEFVVMSFGLTN
Sbjct: 617 PLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN 676
Query: 302 APTVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSK 361
AP VFMELMNRVFKECLD+FVIVFIDDILIYS+TDL+H+EHLRK LT LRE+KLYA F+K
Sbjct: 677 APAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLKHQEHLRKALTILRENKLYANFTK 736
Query: 362 CEFWLRQVSFLGHMVSKDEISVDPTKVEAITKWERPTT---------------------- 421
CEFW+ QVSFLGH+VSKD I VDP K+EA+TK +RPTT
Sbjct: 737 CEFWIXQVSFLGHIVSKDGIFVDPNKIEAVTKRKRPTTVTEIRSFLGLVGYYRRFVXDFA 796
Query: 422 ---------------------------NLKERLVTTPVLIVLESSEGYEIYSDASMKGLG 481
LK+RLV+ PVL V ESS GY IYSDAS KGLG
Sbjct: 797 RIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLG 856
Query: 482 CVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLK 541
CVLMQHGKVVAYAS QLK+YEKNYPTHDLELAAVVFALKIWRHY YGEKTQI+TDHKSLK
Sbjct: 857 CVLMQHGKVVAYASHQLKDYEKNYPTHDLELAAVVFALKIWRHYPYGEKTQIYTDHKSLK 916
Query: 542 YFFTQKELNMRQRRWLELVKDYDVDIQYHLGKANVVA----------------------- 601
Y FTQKELNMRQRRWLELVKDYD+DIQYH GKANVVA
Sbjct: 917 YLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSALITREPRGRTD 976
Query: 602 ------------------------------------------------------------ 661
Sbjct: 977 FEQADIVVVTKEVAAQLARMTVRPTLRQRIIDSQREDPSLSKILDQLEVGPVDGFTKSTD 1036
Query: 662 ------------------------------------------------------------ 721
Sbjct: 1037 DGLLCQGRLCVPPLSGIKNEILTEAHNSAFSIHPGGTKMYQDLKKHFWWRSMKKDIAEYV 1096
Query: 722 ------------------------------------------------------------ 781
Sbjct: 1097 SKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTLKGYTVIWVVVDRLTK 1156
Query: 782 ------------------------------------------------------------ 841
Sbjct: 1157 SAHFLLGKATYTVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFMSAFWRCLQRAMGSWDTK 1216
Query: 842 ------------------------------------------------------------ 901
Sbjct: 1217 LHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIR 1276
Query: 902 --------------------------------------------------DALSRKTVHS 961
DALSRKTVHS
Sbjct: 1277 VRMRTTQSRQKSYADVRRRNLEFEEGDPVFLKLAPMKDIQYHPGKANVVVDALSRKTVHS 1336
Query: 962 SALITREVRVQREFERANIAVATEGVVAQLARLTVQPTLRQRIITSQREDPNLQKVLGQL 1021
SALITREVRVQREFERANIAVAT+GV+AQLARLTVQPTLRQRII SQREDPNLQKVLGQL
Sbjct: 1337 SALITREVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQL 1396
Query: 1022 DESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFFMHPRGTKMYQDLKQHF 1054
D+SPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPF MHP GTKMYQDLKQHF
Sbjct: 1397 DKSPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMYQDLKQHF 1456
BLAST of CmoCh11G012340 vs. ExPASy TrEMBL
Match:
A0A5A7SIJ5 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold34G003210 PE=4 SV=1)
HSP 1 Score: 1501.9 bits (3887), Expect = 0.0e+00
Identity = 748/1092 (68.50%), Positives = 876/1092 (80.22%), Query Frame = 0
Query: 3 FVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSIDLMVVNMTDFDAIL 62
FV G E+EPL +SVSTP+G L+S++++K +V I N+ L + L+V++M DFD IL
Sbjct: 421 FVRHVGLEVEPLGSVLSVSTPSGEVLLSKEQIKACRVEITNRMLDVTLLVLDMQDFDVIL 480
Query: 63 GMDWLAENRASIDCRKKEVKFSPSTGPTFKFKGTNIGITPKVVSMMKAKRLVQQGGWAIL 122
GMDWL+ N A+IDC KEV F+P +G +FKF+G + PKV+S MKA +L+ QG W IL
Sbjct: 481 GMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSQGTWGIL 540
Query: 123 ACVVDVRGKEKTLVNVPIVNEFPNVFPDDLSGISPSRAVDFVIELEPRTGPISKAPYRMA 182
A VVDVR E +L + P+V E+P+VFPD+L G+ P R VDF IELEP T PIS+APYRMA
Sbjct: 541 ASVVDVREPEVSLSSEPVVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMA 600
Query: 183 PAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYP 242
PAELKELK QLQ+LLDKGFI+PSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKN+YP
Sbjct: 601 PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP 660
Query: 243 LPRIEDLFDQLREATIFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNA 302
LPRI+DLFDQL+ AT+FSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNA
Sbjct: 661 LPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNA 720
Query: 303 PTVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKC 362
P VFM+LMNRVFK+ LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKC
Sbjct: 721 PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKC 780
Query: 363 EFWLRQVSFLGHMVSKDEISVDPTKVEAITKWERPTT----------------------- 422
EFWLR+V+FLGH+VS + +SVDP K+EA+T W RP+T
Sbjct: 781 EFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSR 840
Query: 423 --------------------------NLKERLVTTPVLIVLESSEGYEIYSDASMKGLGC 482
LK++LVT PVL V + S + IYSDAS KGLGC
Sbjct: 841 IASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGC 900
Query: 483 VLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKY 542
VLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKY
Sbjct: 901 VLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKY 960
Query: 543 FFTQKELNMRQRRWLELVKDYDVDIQYHLGKANVVADALSRKTVHSSALITREVRVQREF 602
FFTQKELNMRQRRWLELVKDYD +I YH GKANVVADALSRK HS+ALIT++ + R+F
Sbjct: 961 FFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDF 1020
Query: 603 ERANIAVATEGVVAQLARLTVQPTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDE 662
ERA IAV+ V AQLA+LTVQPTLRQ+II +Q +DP L + ++ +GFS SSD+
Sbjct: 1021 ERAEIAVSVGEVTAQLAQLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDD 1080
Query: 663 GLLYQGRLCVPAIEDLRKEILMEAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVS 722
GL+++GRLCVP ++ E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS
Sbjct: 1081 GLMFEGRLCVPEDSAVKTELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVS 1140
Query: 723 KCLVCQQVKAPRQKAAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKS 782
+CLVCQQVKAPRQ AGLLQPLS+P WKWE+++MDFI GLPKT KGYTVIWVVVDRLTKS
Sbjct: 1141 RCLVCQQVKAPRQHPAGLLQPLSVPGWKWESVSMDFITGLPKTLKGYTVIWVVVDRLTKS 1200
Query: 783 AHFLPGKVTYTVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFS 842
AHF+PGK TYT W QLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ ALGTRLDFS
Sbjct: 1201 AHFVPGKSTYTASKWGQLYMTEIVRLHGVPVSIVSDRDARFTSKFWKGLQIALGTRLDFS 1260
Query: 843 TAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEAL 902
TAFHPQTDGQTERLNQILEDMLRACVL+F SWDS LHLMEF+YNNS+QATIGMAPFEAL
Sbjct: 1261 TAFHPQTDGQTERLNQILEDMLRACVLEFSGSWDSHLHLMEFAYNNSYQATIGMAPFEAL 1320
Query: 903 YGKRCRSPLRWDKVGERELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFE 962
YGK CRSP+ W +VGE+ ++GPELV+ TN A+QKIRARM TAQSRQKSYADVRRK LEFE
Sbjct: 1321 YGKCCRSPVCWGEVGEQRMLGPELVQTTNAAIQKIRARMLTAQSRQKSYADVRRKDLEFE 1380
Query: 963 VGDPVFLKVAPMKGVLRFGHKGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHV 1022
VGD VFLKVAPMKGVLRF KGKLSP+F+GPFEILER+G V Y+LALPP+ + VHDVFH+
Sbjct: 1381 VGDMVFLKVAPMKGVLRFAKKGKLSPRFVGPFEILERIGPVAYRLALPPSFAAVHDVFHI 1440
Query: 1023 SMLRKYITDPIHVIDYKPLQLNEDLSYEEKPVRILAREVKTLRNRSIAFVKELLGRKTPR 1046
SMLRKY+ DP HV+D++PLQ++E+LSYEE+PV +LAREVK LR+R I VK +L +
Sbjct: 1441 SMLRKYVADPTHVVDFEPLQISENLSYEEQPVEVLAREVKKLRSREIPLVK-ILWQNHGV 1500
BLAST of CmoCh11G012340 vs. ExPASy TrEMBL
Match:
A0A5D3BTN0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G001560 PE=4 SV=1)
HSP 1 Score: 1500.3 bits (3883), Expect = 0.0e+00
Identity = 747/1092 (68.41%), Positives = 875/1092 (80.13%), Query Frame = 0
Query: 3 FVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSIDLMVVNMTDFDAIL 62
FV G E+EPL +SVSTP+G L+S++++K +V I N+ L + L+V++M DFD IL
Sbjct: 758 FVQHVGLEVEPLGSVLSVSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVIL 817
Query: 63 GMDWLAENRASIDCRKKEVKFSPSTGPTFKFKGTNIGITPKVVSMMKAKRLVQQGGWAIL 122
GMDWL+ N A+IDC KEV F+P + +FKF+G + PKV+S MKA +L+ QG W IL
Sbjct: 818 GMDWLSANHANIDCYGKEVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGIL 877
Query: 123 ACVVDVRGKEKTLVNVPIVNEFPNVFPDDLSGISPSRAVDFVIELEPRTGPISKAPYRMA 182
A VVDVR E +L + P+V E+P+VFPD+L G+ P R VDF IELEP T PIS+APYRMA
Sbjct: 878 ASVVDVREPEVSLSSEPVVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMA 937
Query: 183 PAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYP 242
PAELKELK QLQ+LLDKGFI+PSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKN+YP
Sbjct: 938 PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP 997
Query: 243 LPRIEDLFDQLREATIFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNA 302
LPRI+DLFDQL+ AT+FSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNA
Sbjct: 998 LPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNA 1057
Query: 303 PTVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKC 362
P VFM+LMNRVFK+ LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKC
Sbjct: 1058 PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKC 1117
Query: 363 EFWLRQVSFLGHMVSKDEISVDPTKVEAITKWERPTT----------------------- 422
EFWLR+V+FLGH+VS + +SVDP K+EA+T W RP+T
Sbjct: 1118 EFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSR 1177
Query: 423 --------------------------NLKERLVTTPVLIVLESSEGYEIYSDASMKGLGC 482
LK++LVT PVL V + S + IYSDAS KGLGC
Sbjct: 1178 IASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGC 1237
Query: 483 VLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKY 542
VLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKY
Sbjct: 1238 VLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKY 1297
Query: 543 FFTQKELNMRQRRWLELVKDYDVDIQYHLGKANVVADALSRKTVHSSALITREVRVQREF 602
FFTQKELNMRQRRWLELVKDYD +I YH GKANVVADALSRK HS+ALIT++ + R+F
Sbjct: 1298 FFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDF 1357
Query: 603 ERANIAVATEGVVAQLARLTVQPTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDE 662
ERA IAV+ V AQLA+LTVQPTLRQ+II +Q +DP L + ++ +GFS SSD+
Sbjct: 1358 ERAEIAVSVGEVTAQLAQLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDD 1417
Query: 663 GLLYQGRLCVPAIEDLRKEILMEAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVS 722
GL+++GRLCVP ++ E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS
Sbjct: 1418 GLMFEGRLCVPEDSAVKTELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVS 1477
Query: 723 KCLVCQQVKAPRQKAAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKS 782
+CLVCQQVKAPRQ AGLLQPLS+P WKWE+++MDFI GLPKT KGYTVIWVVVDRLTKS
Sbjct: 1478 RCLVCQQVKAPRQHPAGLLQPLSVPGWKWESVSMDFITGLPKTLKGYTVIWVVVDRLTKS 1537
Query: 783 AHFLPGKVTYTVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFS 842
AHF+PGK TYT W QLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ ALGTRLDFS
Sbjct: 1538 AHFVPGKSTYTASKWGQLYMTEIVRLHGVPVSIVSDRDARFTSKFWKGLQIALGTRLDFS 1597
Query: 843 TAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEAL 902
TAFHPQTDGQTERLNQILEDMLRACVL+F SWDS LHLMEF+YNNS+QATIGMAPFEAL
Sbjct: 1598 TAFHPQTDGQTERLNQILEDMLRACVLEFSGSWDSHLHLMEFAYNNSYQATIGMAPFEAL 1657
Query: 903 YGKRCRSPLRWDKVGERELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFE 962
YGK CRSP+ W +VGE+ ++GPELV+ TN A+QKIRARM TAQSRQKSYADVRRK LEFE
Sbjct: 1658 YGKCCRSPVCWGEVGEQRMLGPELVQTTNAAIQKIRARMLTAQSRQKSYADVRRKDLEFE 1717
Query: 963 VGDPVFLKVAPMKGVLRFGHKGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHV 1022
VGD VFLKVAPMKGVLRF KGKLSP+F+GPFEILER+G V Y+LALPP+ + VHDVFH+
Sbjct: 1718 VGDMVFLKVAPMKGVLRFAKKGKLSPRFVGPFEILERIGPVAYRLALPPSFAAVHDVFHI 1777
Query: 1023 SMLRKYITDPIHVIDYKPLQLNEDLSYEEKPVRILAREVKTLRNRSIAFVKELLGRKTPR 1046
SMLRKY+ DP HV+D++PLQ++E+LSYEE+PV +LAREVK LR+R I VK +L +
Sbjct: 1778 SMLRKYVADPTHVVDFEPLQISENLSYEEQPVEVLAREVKKLRSREIPLVK-ILWQNHGV 1837
BLAST of CmoCh11G012340 vs. ExPASy TrEMBL
Match:
A0A5D3C6W3 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G001930 PE=4 SV=1)
HSP 1 Score: 1500.3 bits (3883), Expect = 0.0e+00
Identity = 747/1092 (68.41%), Positives = 875/1092 (80.13%), Query Frame = 0
Query: 3 FVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSIDLMVVNMTDFDAIL 62
FV G E+EPL +SVSTP+G L+S++++K +V I N+ L + L+V++M DFD IL
Sbjct: 376 FVQHVGLEVEPLGSVLSVSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVIL 435
Query: 63 GMDWLAENRASIDCRKKEVKFSPSTGPTFKFKGTNIGITPKVVSMMKAKRLVQQGGWAIL 122
GMDWL+ N A+IDC KEV F+P + +FKF+G + PKV+S MKA +L+ QG W IL
Sbjct: 436 GMDWLSANHANIDCYGKEVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGIL 495
Query: 123 ACVVDVRGKEKTLVNVPIVNEFPNVFPDDLSGISPSRAVDFVIELEPRTGPISKAPYRMA 182
A VVDVR E +L + P+V E+P+VFPD+L G+ P R VDF IELEP T PIS+APYRMA
Sbjct: 496 ASVVDVREPEVSLSSEPVVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMA 555
Query: 183 PAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYP 242
PAELKELK QLQ+LLDKGFI+PSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKN+YP
Sbjct: 556 PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP 615
Query: 243 LPRIEDLFDQLREATIFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNA 302
LPRI+DLFDQL+ AT+FSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNA
Sbjct: 616 LPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNA 675
Query: 303 PTVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKC 362
P VFM+LMNRVFK+ LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKC
Sbjct: 676 PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKC 735
Query: 363 EFWLRQVSFLGHMVSKDEISVDPTKVEAITKWERPTT----------------------- 422
EFWLR+V+FLGH+VS + +SVDP K+EA+T W RP+T
Sbjct: 736 EFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSR 795
Query: 423 --------------------------NLKERLVTTPVLIVLESSEGYEIYSDASMKGLGC 482
LK++LVT PVL V + S + IYSDAS KGLGC
Sbjct: 796 IASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGC 855
Query: 483 VLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKY 542
VLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKY
Sbjct: 856 VLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKY 915
Query: 543 FFTQKELNMRQRRWLELVKDYDVDIQYHLGKANVVADALSRKTVHSSALITREVRVQREF 602
FFTQKELNMRQRRWLELVKDYD +I YH GKANVVADALSRK HS+ALIT++ + R+F
Sbjct: 916 FFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDF 975
Query: 603 ERANIAVATEGVVAQLARLTVQPTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDE 662
ERA IAV+ V AQLA+LTVQPTLRQ+II +Q +DP L + ++ +GFS SSD+
Sbjct: 976 ERAEIAVSVGEVTAQLAQLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDD 1035
Query: 663 GLLYQGRLCVPAIEDLRKEILMEAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVS 722
GL+++GRLCVP ++ E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS
Sbjct: 1036 GLMFEGRLCVPEDSAVKTELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVS 1095
Query: 723 KCLVCQQVKAPRQKAAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKS 782
+CLVCQQVKAPRQ AGLLQPLS+P WKWE+++MDFI GLPKT KGYTVIWVVVDRLTKS
Sbjct: 1096 RCLVCQQVKAPRQHPAGLLQPLSVPGWKWESVSMDFITGLPKTLKGYTVIWVVVDRLTKS 1155
Query: 783 AHFLPGKVTYTVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFS 842
AHF+PGK TYT W QLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ ALGTRLDFS
Sbjct: 1156 AHFVPGKSTYTASKWGQLYMTEIVRLHGVPVSIVSDRDARFTSKFWKGLQIALGTRLDFS 1215
Query: 843 TAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEAL 902
TAFHPQTDGQTERLNQILEDMLRACVL+F SWDS LHLMEF+YNNS+QATIGMAPFEAL
Sbjct: 1216 TAFHPQTDGQTERLNQILEDMLRACVLEFSGSWDSHLHLMEFAYNNSYQATIGMAPFEAL 1275
Query: 903 YGKRCRSPLRWDKVGERELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFE 962
YGK CRSP+ W +VGE+ ++GPELV+ TN A+QKIRARM TAQSRQKSYADVRRK LEFE
Sbjct: 1276 YGKCCRSPVCWGEVGEQRMLGPELVQTTNAAIQKIRARMLTAQSRQKSYADVRRKDLEFE 1335
Query: 963 VGDPVFLKVAPMKGVLRFGHKGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHV 1022
VGD VFLKVAPMKGVLRF KGKLSP+F+GPFEILER+G V Y+LALPP+ + VHDVFH+
Sbjct: 1336 VGDMVFLKVAPMKGVLRFAKKGKLSPRFVGPFEILERIGPVAYRLALPPSFAAVHDVFHI 1395
Query: 1023 SMLRKYITDPIHVIDYKPLQLNEDLSYEEKPVRILAREVKTLRNRSIAFVKELLGRKTPR 1046
SMLRKY+ DP HV+D++PLQ++E+LSYEE+PV +LAREVK LR+R I VK +L +
Sbjct: 1396 SMLRKYVADPTHVVDFEPLQISENLSYEEQPVEVLAREVKKLRSREIPLVK-ILWQNHGV 1455
BLAST of CmoCh11G012340 vs. ExPASy TrEMBL
Match:
A0A5A7V2A0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold154G001000 PE=4 SV=1)
HSP 1 Score: 1500.3 bits (3883), Expect = 0.0e+00
Identity = 747/1092 (68.41%), Positives = 875/1092 (80.13%), Query Frame = 0
Query: 3 FVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSIDLMVVNMTDFDAIL 62
FV G E+EPL +SVSTP+G L+S++++K +V I N+ L + L+V++M DFD IL
Sbjct: 804 FVQHVGLEVEPLGSVLSVSTPSGEVLLSKEQIKACRVEIANRMLDVTLLVLDMQDFDVIL 863
Query: 63 GMDWLAENRASIDCRKKEVKFSPSTGPTFKFKGTNIGITPKVVSMMKAKRLVQQGGWAIL 122
GMDWL+ N A+IDC KEV F+P + +FKF+G + PKV+S MKA +L+ QG W IL
Sbjct: 864 GMDWLSANHANIDCYGKEVVFNPPSEASFKFRGAGMVCIPKVISAMKASKLLSQGTWGIL 923
Query: 123 ACVVDVRGKEKTLVNVPIVNEFPNVFPDDLSGISPSRAVDFVIELEPRTGPISKAPYRMA 182
A VVDVR E +L + P+V E+P+VFPD+L G+ P R VDF IELEP T PIS+APYRMA
Sbjct: 924 ASVVDVREPEVSLSSEPVVREYPDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMA 983
Query: 183 PAELKELKAQLQDLLDKGFIQPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYP 242
PAELKELK QLQ+LLDKGFI+PSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKN+YP
Sbjct: 984 PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP 1043
Query: 243 LPRIEDLFDQLREATIFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNA 302
LPRI+DLFDQL+ AT+FSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNA
Sbjct: 1044 LPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNA 1103
Query: 303 PTVFMELMNRVFKECLDMFVIVFIDDILIYSRTDLEHEEHLRKVLTTLREHKLYAKFSKC 362
P VFM+LMNRVFK+ LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKC
Sbjct: 1104 PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKC 1163
Query: 363 EFWLRQVSFLGHMVSKDEISVDPTKVEAITKWERPTT----------------------- 422
EFWLR+V+FLGH+VS + +SVDP K+EA+T W RP+T
Sbjct: 1164 EFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNWTRPSTVSEIRSFLGLAGYYRRFVEDFSR 1223
Query: 423 --------------------------NLKERLVTTPVLIVLESSEGYEIYSDASMKGLGC 482
LK++LVT PVL V + S + IYSDAS KGLGC
Sbjct: 1224 IASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGC 1283
Query: 483 VLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKY 542
VLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKY
Sbjct: 1284 VLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDHKSLKY 1343
Query: 543 FFTQKELNMRQRRWLELVKDYDVDIQYHLGKANVVADALSRKTVHSSALITREVRVQREF 602
FFTQKELNMRQRRWLELVKDYD +I YH GKANVVADALSRK HS+ALIT++ + R+F
Sbjct: 1344 FFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITKQTPLLRDF 1403
Query: 603 ERANIAVATEGVVAQLARLTVQPTLRQRIITSQREDPNLQKVLGQLDESPVDGFSKSSDE 662
ERA IAV+ V AQLA+LTVQPTLRQ+II +Q +DP L + ++ +GFS SSD+
Sbjct: 1404 ERAEIAVSVGEVTAQLAQLTVQPTLRQKIIAAQLDDPYLAEKRRVVETEQGEGFSISSDD 1463
Query: 663 GLLYQGRLCVPAIEDLRKEILMEAHNSPFFMHPRGTKMYQDLKQHFWWKSMKRDVAGFVS 722
GL+++GRLCVP ++ E+L EAH+SPF MHP TKMYQDL+ +WW+ MKRDVA FVS
Sbjct: 1464 GLMFEGRLCVPEDSAVKTELLTEAHSSPFTMHPGSTKMYQDLRSVYWWRGMKRDVADFVS 1523
Query: 723 KCLVCQQVKAPRQKAAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYTVIWVVVDRLTKS 782
+CLVCQQVKAPRQ AGLLQPLS+P WKWE+++MDFI GLPKT KGYTVIWVVVDRLTKS
Sbjct: 1524 RCLVCQQVKAPRQHPAGLLQPLSVPGWKWESVSMDFITGLPKTLKGYTVIWVVVDRLTKS 1583
Query: 783 AHFLPGKVTYTVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFS 842
AHF+PGK TYT W QLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ ALGTRLDFS
Sbjct: 1584 AHFVPGKSTYTASKWGQLYMTEIVRLHGVPVSIVSDRDARFTSKFWKGLQIALGTRLDFS 1643
Query: 843 TAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEAL 902
TAFHPQTDGQTERLNQILEDMLRACVL+F SWDS LHLMEF+YNNS+QATIGMAPFEAL
Sbjct: 1644 TAFHPQTDGQTERLNQILEDMLRACVLEFSGSWDSHLHLMEFAYNNSYQATIGMAPFEAL 1703
Query: 903 YGKRCRSPLRWDKVGERELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFE 962
YGK CRSP+ W +VGE+ ++GPELV+ TN A+QKIRARM TAQSRQKSYADVRRK LEFE
Sbjct: 1704 YGKCCRSPVCWGEVGEQRMLGPELVQTTNAAIQKIRARMLTAQSRQKSYADVRRKDLEFE 1763
Query: 963 VGDPVFLKVAPMKGVLRFGHKGKLSPKFIGPFEILERVGLVVYKLALPPALSGVHDVFHV 1022
VGD VFLKVAPMKGVLRF KGKLSP+F+GPFEILER+G V Y+LALPP+ + VHDVFH+
Sbjct: 1764 VGDMVFLKVAPMKGVLRFAKKGKLSPRFVGPFEILERIGPVAYRLALPPSFAAVHDVFHI 1823
Query: 1023 SMLRKYITDPIHVIDYKPLQLNEDLSYEEKPVRILAREVKTLRNRSIAFVKELLGRKTPR 1046
SMLRKY+ DP HV+D++PLQ++E+LSYEE+PV +LAREVK LR+R I VK +L +
Sbjct: 1824 SMLRKYVADPTHVVDFEPLQISENLSYEEQPVEVLAREVKKLRSREIPLVK-ILWQNHGV 1883
BLAST of CmoCh11G012340 vs. TAIR 10
Match:
ATMG00860.1 (DNA/RNA polymerases superfamily protein )
HSP 1 Score: 51.2 bits (121), Expect = 7.1e-06
Identity = 26/65 (40.00%), Positives = 38/65 (58.46%), Query Frame = 0
Query: 342 HLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG--HMVSKDEISVDPTKVEAITKWERP-- 401
HL VL +H+ YA KC F Q+++LG H++S + +S DP K+EA+ W P
Sbjct: 3 HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62
Query: 402 TTNLK 403
TT L+
Sbjct: 63 TTELR 67
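The percentages in each HSP header follow directly from the residue counts: identity is the number of identical positions divided by the alignment length, and positives is the number of conservatively matched positions over the same length. The following is a minimal Python sketch of reading those two statistics lines and recomputing the percentages. The function name `parse_hsp` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the regular expressions assume the exact line layout shown in this report.

```python
import re

# Patterns for the two statistics lines printed above each alignment.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
SCORE_RE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP header (hypothetical helper) and recompute percentages."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("not an HSP statistics block")
    ident, ident_len = int(i.group(1)), int(i.group(2))
    pos, pos_len = int(i.group(4)), int(i.group(5))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "align_len": ident_len,
        # Recomputed from the counts rather than copied from the report.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


# Header of the ATMG00860.1 hit, copied from the report above.
hsp = parse_hsp(
    "HSP 1 Score: 51.2 bits (121), Expect = 7.1e-06",
    "Identity = 26/65 (40.00%), Positives = 38/65 (58.46%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the values from the counts gives a quick consistency check against the percentages printed in the report.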
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q99315 | 1.4e-118 | 32.34 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5 | 1.3e-116 | 31.82 | Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P0CT41 | 2.5e-112 | 29.20 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34 | 2.5e-112 | 29.20 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35 | 2.5e-112 | 29.20 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
Match Name | E-value | Identity (%) | Description
A0A6J1EYH9 | 0.0e+00 | 57.46 | Reverse transcriptase OS=Cucurbita moschata OX=3662 GN=LOC111440131 PE=4 SV=1
A0A5A7SIJ5 | 0.0e+00 | 68.50 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold34...
A0A5D3BTN0 | 0.0e+00 | 68.41 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45...
A0A5D3C6W3 | 0.0e+00 | 68.41 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13...
A0A5A7V2A0 | 0.0e+00 | 68.41 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold15...
Match Name | E-value | Identity (%) | Description
ATMG00860.1 | 7.1e-06 | 40.00 | DNA/RNA polymerases superfamily protein
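The summary tables above are pipe-delimited, so they can be loaded and filtered with a few lines of code. The following is a minimal Python sketch, assuming each row carries match name, E-value, percent identity and description in that order. The helper `parse_row`, the abbreviated descriptions in the sample rows, and the `1e-10` E-value cutoff are illustrative assumptions rather than anything prescribed by the database.

```python
def parse_row(row):
    """Split one pipe-delimited summary row (hypothetical helper)."""
    # Keep the first four fields; any trailing link column is ignored.
    fields = [f.strip() for f in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),
        "identity": float(identity),
        "description": description,
    }


# One row from each table above; descriptions are shortened for the example.
rows = [
    "Q99315 | 1.4e-118 | 32.34 | Transposon Ty3-G Gag-Pol polyprotein",
    "A0A5A7SIJ5 | 0.0e+00 | 68.50 | Reverse transcriptase",
    "ATMG00860.1 | 7.1e-06 | 40.00 | DNA/RNA polymerases superfamily protein",
]

hits = [parse_row(r) for r in rows]

# Illustrative cutoff: keep hits with an E-value below 1e-10.
strong = [h["name"] for h in hits if h["evalue"] < 1e-10]
best = max(hits, key=lambda h: h["identity"])

print(strong)
print(best["name"], best["identity"])
```

Filtering on E-value and ranking on identity are separate questions: the short mitochondrial hit has a higher identity than the yeast transposon hit but a far weaker E-value, because it covers only 65 aligned residues.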