CmoCh11G012300.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh11G012300.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
LocationCmo_Chr11: 7497039 .. 7504462 (+)
Sequence length6429
RNA-Seq ExpressionCmoCh11G012300.1
SyntenyCmoCh11G012300.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAATCCAGACGACAATACTAACATTACTGATGCACGATTGAGAGAAGCACAACAACGAACTATGGAAAGACTAATTCGAGGAATAGAAGAGTTGACTGATCGAATAGGAAGATTGGAGATTCAAAATCAAGCACGACAGAGGATTCCACAACCTACGCCCTCAACCGATACATATGAAGGCGACAATTCTGATCACCACGAGGATAATCCACATGCGGTTGGTCCTGGCTTGATGCGAGGGAGAGACCATGGAAGAAAGTATCATAATTTACAACAACGAGTTCCTTATGATGATAGAATTGATCGTAACGTGGGGAGCATCAAATTAAAACTTCCCAAGTTTTATGGCAAAACCGATCCAGAGGAGTACCTTCAATGGGAGAAAACGGTGGAGTCAGTGTTCAACTGCCATAATTTTAGTGATGAAAAGAAGGTACTGTTATGCATTGCTCAATTCAAACAATATGCTCAAATTTGGTGGGATAAATTGATGTCAAGTAGGAGAAGAAATCTTGAAGCACCAATTGATTCATGGGTCGAGTTCAAAGAGTCCATGAGGAAGCGTTTTGTTCCACAATATTTTCAACGGGACATGGCGCAAAAGCTTCAAGCGTTGAAACAAGGACGCAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCGACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTTTCTTAATGGGTTAAACACAGAGATTGCAGACAAGACTGATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGGCAAATCCAACGAAGGTCTCAACGGTATTCTTCTAAAACTTTTCCCAATTCTACTTCTACATGGAAAAAGGATAGTAAGAACATTGATTATAAGCATAGAAATCAAGAGATTAATGAGAAGCCTCAAGCTAAATTTGAGAAAGGGGAGAGTTCTAGAACAGGGAAAGAAAAAGTAGAAAAGTCTAATGTTCGAAATAGGGATTTAAAGTGTTGGAGATGTCAAGGGGTAGGACACTATAGTAGAGATTGCCCAAATGCAAGAATTATGACCATCAAGGAGGGAGAAATTGTTACGGATGACGAGGCACATGACGACATAAATGAGGAAACTGATGAGAGTGAGGAGTTTAGTGAAGAGGACCCTACACATATATCTTTGGTTACTCGACGAGCTCTAAACACCCACATTAAGGAGGACGGCCTAGACCAAAGGGAGAACTTGTTTCAAACTCGGTGTCTTGTTCAATCTGTACCTTGTAGTGTTGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTTCCATTCTGGTCAAAAGACTTAATTTAAAGACACAACCACATCCAAGACCCTACAAGCTTCAATGGTTGAATGATTGTGGGGAAGTACGGGTAACTCAACAAACTCTTGTTTCTTTTACAATTGGAAAATATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAGATGTATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGCTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGTGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAACATTTATTTTCCGAGGAGATGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGTACGTGAAAGTTTGAGTCCTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTCTTGGCGTATGTGTGTTGATTGTAGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAATCATGTCTTACGAGAATACTTAGGTAAGTTTGTGGTTGTTTATTTTGATGACATCCTTGTTTACTCTAAATCTTTAGATGATCATATTACCCATGTACGCAATGTTTTGACTACTTTAAGAAACGAATGTTTGTACGTAAATTTAAAGAAATGTAGCTTTTGCATGGAAAAAGTTAACTTTCTTGGGTTTGTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACTCCGAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTTTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTATTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTGGGAACTAAGCTAGTATATTCAANTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAGATATATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGCTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGTGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGATTTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTTCGAGGAGATGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAANATATGATGTTAAATGTCTTTCTTTTTTCCTCTTGATTACTATATGTAACTCTCATATGTTACTACGTTAAATTAGTCCTTAAACATTTTCTTTCTTTAATTCGTTTCTTTGTCGTTATTCGGATTGCTCCTTCAATACCCTTGTTTATGTGTTGAGTGTGCTGTCTATTCACTTTGAATTGCACTTAGCTCTATAACATAGCTNTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTCTTGGCGTATGTGTGTTGATTGTAGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCANCGTGAGGGAGCGCTTGTGAGGTCCTTTTTTTTTTCTTTTACCTTTCTTTTGTAGCATGGAAAATCCAAACGACAATACTAACATTACTGATGCACGATTGAGAGAAGCACAACAACGAACCATGGAAAGACTAATTCGAGGAATAGAAGAGTTGACTGATCGAATAGGAAGATTGGAGATTCAAAATCAAGCACGACAGAGGATTCCACAACCTACGCCCTCAACCGATACATATGAAGGNCATCCTTGTTTACTCTAAATCTTTAGATGATCATATTACCCATGTACGCAATGTTTTGACTACTTTAAAAAACGAATGTTTGTACGTAAATTTAAAGAAATGTAGCTTTTGCATGGAAAAATTTAACTTTCTTGGGTTTGTAGTTTCATCTAATGGTGTTGAGGTCGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCGAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTAAACATTTTTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTGGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATACCGTCAAAAGAATTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTTCTCATAAACTGCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCCACCCGAATTAATAAAGGACGTAAGATTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGCATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTGCAACTTTTAATGTTGTTGATTTGAGCCCTTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGAAGACGCTGTAGACCTCCCTTATATGTTGTGCAAAGTTGAGGTTCAAGAAAGAGATGAATTAAATGCACTTTAA

mRNA sequence

ATGGAAAATCCAGACGACAATACTAACATTACTGATGCACGATTGAGAGAAGCACAACAACGAACTATGGAAAGACTAATTCGAGGAATAGAAGAGTTGACTGATCGAATAGGAAGATTGGAGATTCAAAATCAAGCACGACAGAGGATTCCACAACCTACGCCCTCAACCGATACATATGAAGGCGACAATTCTGATCACCACGAGGATAATCCACATGCGGTTGGTCCTGGCTTGATGCGAGGGAGAGACCATGGAAGAAAGTATCATAATTTACAACAACGAGTTCCTTATGATGATAGAATTGATCGTAACGTGGGGAGCATCAAATTAAAACTTCCCAAGTTTTATGGCAAAACCGATCCAGAGGAGTACCTTCAATGGGAGAAAACGGTGGAGTCAGTGTTCAACTGCCATAATTTTAGTGATGAAAAGAAGGTACTGTTATGCATTGCTCAATTCAAACAATATGCTCAAATTTGGTGGGATAAATTGATGTCAAGTAGGAGAAGAAATCTTGAAGCACCAATTGATTCATGGGTCGAGTTCAAAGAGTCCATGAGGAAGCGTTTTGTTCCACAATATTTTCAACGGGACATGGCGCAAAAGCTTCAAGCGTTGAAACAAGGACGCAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCGACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTTTCTTAATGGGTTAAACACAGAGATTGCAGACAAGACTGATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGGCAAATCCAACGAAGGTCTCAACGGTATTCTTCTAAAACTTTTCCCAATTCTACTTCTACATGGAAAAAGGATAGTAAGAACATTGATTATAAGCATAGAAATCAAGAGATTAATGAGAAGCCTCAAGCTAAATTTGAGAAAGGGGAGAGTTCTAGAACAGGGAAAGAAAAAGTAGAAAAGTCTAATGTTCGAAATAGGGATTTAAAGTGTTGGAGATGTCAAGGGGTAGGACACTATAGTAGAGATTGCCCAAATGCAAGAATTATGACCATCAAGGAGGGAGAAATTGTTACGGATGACGAGGCACATGACGACATAAATGAGGAAACTGATGAGAGTGAGGAGTTTAGTGAAGAGGACCCTACACATATATCTTTGGTTACTCGACGAGCTCTAAACACCCACATTAAGGAGGACGGCCTAGACCAAAGGGAGAACTTGTTTCAAACTCGGTGTCTTGTTCAATCTGTACCTTGTAGTGTTGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTTCCATTCTGGTCAAAAGACTTAATTTAAAGACACAACCACATCCAAGACCCTACAAGCTTCAATGGTTGAATGATTGTGGGGAAGTACGGGTAACTCAACAAACTCTTGTTTCTTTTACAATTGGAAAATATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAGATGTATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGCTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGTGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAACATTTATTTTCCGAGGAGATGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAATCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACTCCGAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTTTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTATTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTGGGAACTAAGCTAGTATATTCAANTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAGATATATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGCTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGTGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGATTTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTTCGAGGAGATGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAANATATGATGTTAAATGTCTTTCTTTTTTCCTCTTGATTACTATATTTTCATCTAATGGTGTTGAGGTCGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCGAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTAAACATTTTTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTGGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATACCGTCAAAAGAATTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTTCTCATAAACTGCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCCACCCGAATTAATAAAGGACGTAAGATTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGCATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTGCAACTTTTAATGTTGTTGATTTGAGCCCTTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGAAGACGCTGTAGACCTCCCTTATATGTTGTGCAAAGTTGAGGTTCAAGAAAGAGATGAATTAAATGCACTTTAA

Coding sequence (CDS)

ATGGAAAATCCAGACGACAATACTAACATTACTGATGCACGATTGAGAGAAGCACAACAACGAACTATGGAAAGACTAATTCGAGGAATAGAAGAGTTGACTGATCGAATAGGAAGATTGGAGATTCAAAATCAAGCACGACAGAGGATTCCACAACCTACGCCCTCAACCGATACATATGAAGGCGACAATTCTGATCACCACGAGGATAATCCACATGCGGTTGGTCCTGGCTTGATGCGAGGGAGAGACCATGGAAGAAAGTATCATAATTTACAACAACGAGTTCCTTATGATGATAGAATTGATCGTAACGTGGGGAGCATCAAATTAAAACTTCCCAAGTTTTATGGCAAAACCGATCCAGAGGAGTACCTTCAATGGGAGAAAACGGTGGAGTCAGTGTTCAACTGCCATAATTTTAGTGATGAAAAGAAGGTACTGTTATGCATTGCTCAATTCAAACAATATGCTCAAATTTGGTGGGATAAATTGATGTCAAGTAGGAGAAGAAATCTTGAAGCACCAATTGATTCATGGGTCGAGTTCAAAGAGTCCATGAGGAAGCGTTTTGTTCCACAATATTTTCAACGGGACATGGCGCAAAAGCTTCAAGCGTTGAAACAAGGACGCAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCGACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTTTCTTAATGGGTTAAACACAGAGATTGCAGACAAGACTGATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGGCAAATCCAACGAAGGTCTCAACGGTATTCTTCTAAAACTTTTCCCAATTCTACTTCTACATGGAAAAAGGATAGTAAGAACATTGATTATAAGCATAGAAATCAAGAGATTAATGAGAAGCCTCAAGCTAAATTTGAGAAAGGGGAGAGTTCTAGAACAGGGAAAGAAAAAGTAGAAAAGTCTAATGTTCGAAATAGGGATTTAAAGTGTTGGAGATGTCAAGGGGTAGGACACTATAGTAGAGATTGCCCAAATGCAAGAATTATGACCATCAAGGAGGGAGAAATTGTTACGGATGACGAGGCACATGACGACATAAATGAGGAAACTGATGAGAGTGAGGAGTTTAGTGAAGAGGACCCTACACATATATCTTTGGTTACTCGACGAGCTCTAAACACCCACATTAAGGAGGACGGCCTAGACCAAAGGGAGAACTTGTTTCAAACTCGGTGTCTTGTTCAATCTGTACCTTGTAGTGTTGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTTCCATTCTGGTCAAAAGACTTAATTTAAAGACACAACCACATCCAAGACCCTACAAGCTTCAATGGTTGAATGATTGTGGGGAAGTACGGGTAACTCAACAAACTCTTGTTTCTTTTACAATTGGAAAATATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAGATGTATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGCTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGTGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAACATTTATTTTCCGAGGAGATGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAATCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACTCCGAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTTTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTATTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTGGGAACTAAGCTAGTATATTCAANTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAGATATATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGCTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGTGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGATTTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTTCGAGGAGATGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAANATATGATGTTAAATGTCTTTCTTTTTTCCTCTTGATTACTATATTTTCATCTAATGGTGTTGAGGTCGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCGAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTAAACATTTTTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTGGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATACCGTCAAAAGAATTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTTCTCATAAACTGCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCCACCCGAATTAATAAAGGACGTAAGATTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGCATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTGCAACTTTTAATGTTGTTGATTTGAGCCCTTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGAAGACGCTGTAGACCTCCCTTATATGTTGTGCAAAGTTGAGGTTCAAGAAAGAGATGAATTAAATGCACTTTAA

Protein sequence

MENPDDNTNITDARLREAQQRTMERLIRGIEELTDRIGRLEIQNQARQRIPQPTPSTDTYEGDNSDHHEDNPHAVGPGLMRGRDHGRKYHNLQQRVPYDDRIDRNVGSIKLKLPKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLCIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFKESMRKRFVPQYFQRDMAQKLQALKQGRKSVEDYYKEMDTLMDRLELDEDMEALMARFLNGLNTEIADKTDLQPYSNIEELLHIAIKIERQIQRRSQRYSSKTFPNSTSTWKKDSKNIDYKHRNQEINEKPQAKFEKGESSRTGKEKVEKSNVRNRDLKCWRCQGVGHYSRDCPNARIMTIKEGEIVTDDEAHDDINEETDESEEFSEEDPTHISLVTRRALNTHIKEDGLDQRENLFQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYKLQWLNDCGEVRVTQQTLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIPLSPKDVFIDHCKLEKKRQEADAKAEIEKESSEKKSLSEKQESNTQPREKKERKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEFEHLFSEEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGYAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSXGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIPLSPKDIFIDHCKLEKKRQEADAKAEIEKESSEKKSLSEKQESNTQPREKKERKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVIFVVLLQEFEDLFFEEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQXYDVKCLSFFLLITIFSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFSHKLHKQVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPGKYGVSATFNVVDLSPFDVGDGLDSRTNPSQEGENDMNHDQGISIPQGPITRTRAKKLQQTLYSYIQAMVSSSKEILEDAVDLPYMLCKVEVQERDELNAL
Homology
BLAST of CmoCh11G012300.1 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 375.2 bits (962), Expect = 5.0e-102
Identity = 231/647 (35.70%), Postives = 343/647 (53.01%), Query Frame = 0

Query: 1447 EEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQE 1506
            + K  AI+D+PTPK V + + F G+ ++YRRFI N S IA P+   +       W + Q+
Sbjct: 806  QHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQ--WTEKQD 865

Query: 1507 LAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKL 1566
             A + LK+ L ++P+L   N ++ + +  DAS  GIGAVL  + N+  L+    +FS+ L
Sbjct: 866  KAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSL 925

Query: 1567 TGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLE 1626
              A   YP  + EL  +++AL  +++ L  K F + TDH SL  L+ +N+  RR  +WL+
Sbjct: 926  ESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLD 985

Query: 1627 FIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFE-----------------HI 1686
             + T+ + ++Y  G +N+VADA+SR    +    +R +  E                 H+
Sbjct: 986  DLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHM 1045

Query: 1687 KDLYQHDM------FFAPFVESCE-KGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 1746
            K+L QH++       F  + +  E       NY L D  ++ + +L +P    +  ++R 
Sbjct: 1046 KELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IKQQNAVMRL 1105

Query: 1747 AHGGGLM-AHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKS-RLQPHGLYSPL 1806
             H   L   H GV+ T   +S  ++WPK++H + +    C+ C+  KS R + HGL  PL
Sbjct: 1106 YHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL 1165

Query: 1807 PVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFRE 1866
            P+  G W+DISMDFV GLP T    + I VVVDRFSK AHFI   KT DA  + DL FR 
Sbjct: 1166 PIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRY 1225

Query: 1867 VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAML 1926
            +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E   +T+  +L
Sbjct: 1226 IFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLL 1285

Query: 1927 RAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNF 1986
            RA    N++ W   LP IEF YN     T   +PFEI  G+ P TP     I S + VN 
Sbjct: 1286 RAYASTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTP----AIKSDDEVNA 1345

Query: 1987 DANAKVEFSHKLHK---QVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPT 2046
             +   VE +  L     Q KEQ+E    ++ T  N+ RK ++   GD V VH R   F  
Sbjct: 1346 RSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH-RDAYFKK 1405

Query: 2047 QRKSKLLPRGDGPFQVLERINDNAYKIDLPGKYGVSATFNVVDLSPF 2059
                K+     GPF+V+++INDNAY++DL          NV  L  F
Sbjct: 1406 GAYMKVQQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKKF 1444


HSP 2 Score: 108.2 bits (269), Expect = 1.1e-21
Identity = 62/182 (34.07%), Postives = 86/182 (47.25%), Query Frame = 0

Query: 640 VVLLQEFEHLFSEEMPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVS 699
           V L Q++  +   ++P     +  I  +H I+  PGA +P    Y    K  +EI + V 
Sbjct: 558 VWLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQ 617

Query: 700 ELLAKGY-----------------------------AINKITIKYRHPIPRLDDMLDELH 759
           +LL   +                              +NK TI    P+PR+D++L  + 
Sbjct: 618 KLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIG 677

Query: 760 GCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVL 791
              +FT +DL SGYHQI M   D +KTAF T  G YE+ VMPFGL NAPSTF R M    
Sbjct: 678 NAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTF 737

BLAST of CmoCh11G012300.1 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 372.9 bits (956), Expect = 2.5e-101
Identity = 227/629 (36.09%), Postives = 338/629 (53.74%), Query Frame = 0

Query: 1447 EEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQE 1506
            + K  AI+D+PTPK V + + F G+ ++YRRFI N S IA P+   +       W + Q+
Sbjct: 832  QHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQ--WTEKQD 891

Query: 1507 LAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKL 1566
             A   LK  L ++P+L   N ++ + +  DAS  GIGAVL  + N+  L+    +FS+ L
Sbjct: 892  KAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSL 951

Query: 1567 TGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLE 1626
              A   YP  + EL  +++AL  +++ L  K F + TDH SL  L+ +N+  RR  +WL+
Sbjct: 952  ESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLD 1011

Query: 1627 FIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFE-----------------HI 1686
             + T+ + ++Y  G +N+VADA+SR    +    +R +  E                 H+
Sbjct: 1012 DLATYDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHM 1071

Query: 1687 KDLYQHDM------FFAPFVESCE-KGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 1746
            K+L QH++       F  + +  E       NY L D  ++ + +L +P    +  ++R 
Sbjct: 1072 KELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IKQQNAVMRL 1131

Query: 1747 AHGGGLM-AHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKS-RLQPHGLYSPL 1806
             H   L   H GV+ T   +S  ++WPK++H + +    C+ C+  KS R + HGL  PL
Sbjct: 1132 YHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL 1191

Query: 1807 PVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFRE 1866
            P+  G W+DISMDFV GLP T    + I VVVDRFSK AHFI   KT DA  + DL FR 
Sbjct: 1192 PIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRY 1251

Query: 1867 VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAML 1926
            +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E   +T+  +L
Sbjct: 1252 IFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLL 1311

Query: 1927 RAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNF 1986
            RA +  N++ W   LP IEF YN     T   +PFEI  G+ P TP     I S + VN 
Sbjct: 1312 RAYVSTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTP----AIKSDDEVNA 1371

Query: 1987 DANAKVEFSHKLHK---QVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPT 2041
             +   VE +  L     Q KEQ+E    ++ T  N+ RK ++   GD V VH R   F  
Sbjct: 1372 RSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH-RDAYFKK 1431


HSP 2 Score: 108.2 bits (269), Expect = 1.1e-21
Identity = 62/182 (34.07%), Postives = 86/182 (47.25%), Query Frame = 0

Query: 640 VVLLQEFEHLFSEEMPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVS 699
           V L Q++  +   ++P     +  I  +H I+  PGA +P    Y    K  +EI + V 
Sbjct: 584 VWLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQ 643

Query: 700 ELLAKGY-----------------------------AINKITIKYRHPIPRLDDMLDELH 759
           +LL   +                              +NK TI    P+PR+D++L  + 
Sbjct: 644 KLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIG 703

Query: 760 GCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVL 791
              +FT +DL SGYHQI M   D +KTAF T  G YE+ VMPFGL NAPSTF R M    
Sbjct: 704 NAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTF 763

BLAST of CmoCh11G012300.1 vs. ExPASy Swiss-Prot
Match: Q9UR07 (Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-11 PE=3 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 9.5e-93
Identity = 213/652 (32.67%), Postives = 337/652 (51.69%), Query Frame = 0

Query: 1439 SSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVS 1498
            S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +  PLN L+KK+V 
Sbjct: 616  SEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVR 675

Query: 1499 FIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ-----RPL 1558
            + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q        P+
Sbjct: 676  WKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPV 735

Query: 1559 MFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKHLRVQNKL 1618
             ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L   R+ N+ 
Sbjct: 736  GYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG-RITNES 795

Query: 1619 ---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------RYVLLNTL 1678
               N+R A+W  F++ F + I Y+ G  N +ADALSR                   +N +
Sbjct: 796  EPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQI 855

Query: 1679 NARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIPS-CSIRE 1738
            +        +   Y +D      + + +K  + +N  L DG L   K ++ +P+   +  
Sbjct: 856  SITDDFKNQVVTEYTNDTKLLNLLNNEDK-RVEENIQLKDGLLINSKDQILLPNDTQLTR 915

Query: 1739 LLVREAHGGGLMAHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKSR-LQPHGL 1798
             ++++ H  G + H G+    +++ + F W  +R  + +    C  C+  KSR  +P+G 
Sbjct: 916  TIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGP 975

Query: 1799 YSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADL 1858
              P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  A+  A +
Sbjct: 976  LQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARM 1035

Query: 1859 FFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTM 1918
            F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N+T+
Sbjct: 1036 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1095

Query: 1919 TAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPIDLLPIPSK 1978
              +LR +   +  TW D +  ++ +YN  +HS T+ TPFEIV+ ++P L+P++L     K
Sbjct: 1096 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDK 1155

Query: 1979 EFVNFDANAKVEFSHKLHKQVKEQIEKQNSKVATRIN-KGRKIVIFKPGDWVWVHFRKER 2038
               N     +V       + VKE +   N K+    + K ++I  F+PGD V V   K  
Sbjct: 1156 TDENSQETIQV------FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTG 1215

Query: 2039 FPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPG--KYGVSATFNVVDLSPF 2059
            F   + +KL P   GPF VL++   N Y++DLP   K+  S+TF+V  L  +
Sbjct: 1216 F-LHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257


HSP 2 Score: 88.6 bits (218), Expect = 9.4e-16
Identity = 63/220 (28.64%), Postives = 108/220 (49.09%), Query Frame = 0

Query: 633 PSLPSDFVVLLQEFEHLFSEEMPSSLP-PLRGIEHKIDFIP---GAPIPNRPAYRTNPKE 692
           P LP     + +EF+ + +E     LP P++G+E +++        PI N   Y   P +
Sbjct: 372 PELPD----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN---YPLPPGK 431

Query: 693 AEEIQRQVSELLAKGY-----AIN--------------KITIKYR----------HPIPR 752
            + +  ++++ L  G      AIN              ++ + Y+          +P+P 
Sbjct: 432 MQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 491

Query: 753 LDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPST 811
           ++ +L ++ G ++FTK+DLKS YH IR+  GDE K AF+   G++E+LVMP+G++ AP+ 
Sbjct: 492 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAH 551

BLAST of CmoCh11G012300.1 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 9.5e-93
Identity = 213/652 (32.67%), Postives = 337/652 (51.69%), Query Frame = 0

Query: 1439 SSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVS 1498
            S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +  PLN L+KK+V 
Sbjct: 616  SEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVR 675

Query: 1499 FIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ-----RPL 1558
            + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q        P+
Sbjct: 676  WKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPV 735

Query: 1559 MFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKHLRVQNKL 1618
             ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L   R+ N+ 
Sbjct: 736  GYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG-RITNES 795

Query: 1619 ---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------RYVLLNTL 1678
               N+R A+W  F++ F + I Y+ G  N +ADALSR                   +N +
Sbjct: 796  EPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQI 855

Query: 1679 NARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIPS-CSIRE 1738
            +        +   Y +D      + + +K  + +N  L DG L   K ++ +P+   +  
Sbjct: 856  SITDDFKNQVVTEYTNDTKLLNLLNNEDK-RVEENIQLKDGLLINSKDQILLPNDTQLTR 915

Query: 1739 LLVREAHGGGLMAHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKSR-LQPHGL 1798
             ++++ H  G + H G+    +++ + F W  +R  + +    C  C+  KSR  +P+G 
Sbjct: 916  TIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGP 975

Query: 1799 YSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADL 1858
              P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  A+  A +
Sbjct: 976  LQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARM 1035

Query: 1859 FFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTM 1918
            F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N+T+
Sbjct: 1036 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1095

Query: 1919 TAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPIDLLPIPSK 1978
              +LR +   +  TW D +  ++ +YN  +HS T+ TPFEIV+ ++P L+P++L     K
Sbjct: 1096 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDK 1155

Query: 1979 EFVNFDANAKVEFSHKLHKQVKEQIEKQNSKVATRIN-KGRKIVIFKPGDWVWVHFRKER 2038
               N     +V       + VKE +   N K+    + K ++I  F+PGD V V   K  
Sbjct: 1156 TDENSQETIQV------FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTG 1215

Query: 2039 FPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPG--KYGVSATFNVVDLSPF 2059
            F   + +KL P   GPF VL++   N Y++DLP   K+  S+TF+V  L  +
Sbjct: 1216 F-LHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257


HSP 2 Score: 90.1 bits (222), Expect = 3.2e-16
Identity = 63/220 (28.64%), Postives = 109/220 (49.55%), Query Frame = 0

Query: 633 PSLPSDFVVLLQEFEHLFSEEMPSSLP-PLRGIEHKIDFIP---GAPIPNRPAYRTNPKE 692
           P LP     + +EF+ + +E     LP P++G+E +++        PI N   Y   P +
Sbjct: 372 PELPD----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN---YPLPPGK 431

Query: 693 AEEIQRQVSELLAKGY-----AIN--------------KITIKYR----------HPIPR 752
            + +  ++++ L  G      AIN              ++ + Y+          +P+P 
Sbjct: 432 MQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 491

Query: 753 LDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPST 811
           ++ +L ++ G ++FTK+DLKS YH IR+  GDE K AF+   G++E+LVMP+G++ AP+ 
Sbjct: 492 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAH 551

BLAST of CmoCh11G012300.1 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 9.5e-93
Identity = 213/652 (32.67%), Postives = 337/652 (51.69%), Query Frame = 0

Query: 1439 SSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVS 1498
            S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +  PLN L+KK+V 
Sbjct: 616  SEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVR 675

Query: 1499 FIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ-----RPL 1558
            + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q        P+
Sbjct: 676  WKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPV 735

Query: 1559 MFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKHLRVQNKL 1618
             ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L   R+ N+ 
Sbjct: 736  GYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG-RITNES 795

Query: 1619 ---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------RYVLLNTL 1678
               N+R A+W  F++ F + I Y+ G  N +ADALSR                   +N +
Sbjct: 796  EPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQI 855

Query: 1679 NARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIPS-CSIRE 1738
            +        +   Y +D      + + +K  + +N  L DG L   K ++ +P+   +  
Sbjct: 856  SITDDFKNQVVTEYTNDTKLLNLLNNEDK-RVEENIQLKDGLLINSKDQILLPNDTQLTR 915

Query: 1739 LLVREAHGGGLMAHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKSR-LQPHGL 1798
             ++++ H  G + H G+    +++ + F W  +R  + +    C  C+  KSR  +P+G 
Sbjct: 916  TIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGP 975

Query: 1799 YSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADL 1858
              P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  A+  A +
Sbjct: 976  LQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARM 1035

Query: 1859 FFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTM 1918
            F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N+T+
Sbjct: 1036 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1095

Query: 1919 TAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPIDLLPIPSK 1978
              +LR +   +  TW D +  ++ +YN  +HS T+ TPFEIV+ ++P L+P++L     K
Sbjct: 1096 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDK 1155

Query: 1979 EFVNFDANAKVEFSHKLHKQVKEQIEKQNSKVATRIN-KGRKIVIFKPGDWVWVHFRKER 2038
               N     +V       + VKE +   N K+    + K ++I  F+PGD V V   K  
Sbjct: 1156 TDENSQETIQV------FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTG 1215

Query: 2039 FPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPG--KYGVSATFNVVDLSPF 2059
            F   + +KL P   GPF VL++   N Y++DLP   K+  S+TF+V  L  +
Sbjct: 1216 F-LHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257


HSP 2 Score: 90.1 bits (222), Expect = 3.2e-16
Identity = 63/220 (28.64%), Postives = 109/220 (49.55%), Query Frame = 0

Query: 633 PSLPSDFVVLLQEFEHLFSEEMPSSLP-PLRGIEHKIDFIP---GAPIPNRPAYRTNPKE 692
           P LP     + +EF+ + +E     LP P++G+E +++        PI N   Y   P +
Sbjct: 372 PELPD----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN---YPLPPGK 431

Query: 693 AEEIQRQVSELLAKGY-----AIN--------------KITIKYR----------HPIPR 752
            + +  ++++ L  G      AIN              ++ + Y+          +P+P 
Sbjct: 432 MQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 491

Query: 753 LDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPST 811
           ++ +L ++ G ++FTK+DLKS YH IR+  GDE K AF+   G++E+LVMP+G++ AP+ 
Sbjct: 492 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAH 551

BLAST of CmoCh11G012300.1 vs. ExPASy TrEMBL
Match: A0A1U8MQK5 (Reverse transcriptase OS=Gossypium hirsutum OX=3635 GN=LOC107939043 PE=4 SV=1)

HSP 1 Score: 2145.2 bits (5557), Expect = 0.0e+00
Identity = 1162/2412 (48.18%), Postives = 1514/2412 (62.77%), Query Frame = 0

Query: 9    NITDARLREAQQRTMERLI-RGIEELTDRIGRLEIQNQARQRIPQPTPSTDTYEGDNSDH 68
            N+ D ++ +A  R MERL+ R +  L DR+ ++E Q   RQR   P  +    EG N   
Sbjct: 20   NVPDLQM-QALLREMERLLDRRLNPLEDRLYQVEAQ---RQRDEYPEVARQRREGPNQRR 79

Query: 69   HED--NPHAVGPGLMRGRDHGRKYHNLQQRVPYD------DRIDRNVGSIKLKLPKFYGK 128
             +       V        DHG    +L++R P +       R D ++ +IKL +P F GK
Sbjct: 80   RQPRVQDDDVNENSEDESDHGSNI-SLERRRPRNRGQVDRRREDDDLKNIKLSIPPFQGK 139

Query: 129  TDPEEYLQWEKTVESVFNCHNFSDEKKVLLCIAQFKQYAQIWWDKLMSSRRRNLEAPIDS 188
            +DPE YL+WEK +E VF CHN+S+ KKV L   +F  YA IWWD+L +SRRRN E PI +
Sbjct: 140  SDPEAYLEWEKKIELVFECHNYSENKKVKLAAIEFSDYAMIWWDQLTTSRRRNGEHPIST 199

Query: 189  WVEFKESMRKRFVPQYFQRDMAQKLQALKQGRKSVEDYYKEMDTLMDRLELDEDMEALMA 248
            W E K  MR+RF+P Y+ R++ QKLQ L QG KSVEDYYKEM+  M R ++ ED EA MA
Sbjct: 200  WTEMKAVMRRRFIPSYYHRELHQKLQNLIQGTKSVEDYYKEMEVAMIRADIQEDREATMA 259

Query: 249  RFLNGLNTEIADKTDLQPYSNIEELLHIAIKIERQIQRRSQRYSSKTFPNST-STWKKDS 308
            RFL GLN EIA+  +LQ Y  + +++H+AIK+E+Q++R+    +++ +PN+  S W + +
Sbjct: 260  RFLAGLNREIANVVELQHYIEVVDMVHMAIKVEKQLKRKG---TTRAYPNTNPSKWGQST 319

Query: 309  KNIDYKHRNQEINEKPQAKFEKGESSRTGKEKVEKSNVRNRDLKCWRCQGVGHYSRDCPN 368
                  +R ++ +   +A     ESS+ GK   E S  R+RD+KC++C G GH +  CPN
Sbjct: 320  SKGFPTNRTKDSSTISKANKPIAESSK-GKAP-ESSTARSRDIKCFKCLGRGHIASQCPN 379

Query: 369  ARIMTIK-EGEIVTDDEAHDDINEETDESEEFSEEDPTHISLVTRRALNTHIKEDGLDQR 428
             R M I+ +GEI T+DE  +D  E   E EE  E+      LV +R+L+    ED   QR
Sbjct: 380  RRTMVIRADGEIETEDEEEND-PESNSEVEEDLEQPVEGELLVVKRSLSLQSVEDE-QQR 439

Query: 429  ENLFQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYKLQWLNDCGEVRV 488
            EN+F +RC VQ   CS++ID GSCTNV S+++V++L L T  HP PYKLQWLND GE++V
Sbjct: 440  ENIFHSRCQVQGKVCSIIIDGGSCTNVASTLMVEKLGLSTTKHPNPYKLQWLNDGGELKV 499

Query: 489  TQQTLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTT 548
            T+Q LVSF+IGKY D+VLCDVV MH G LLLGRPWQFDRRVM+DGY NRYSF H G+  T
Sbjct: 500  TKQVLVSFSIGKYSDEVLCDVVPMHAGHLLLGRPWQFDRRVMHDGYTNRYSFKHLGKNVT 559

Query: 549  LIPLSPKDVFIDHCKL-----EKKRQEADAKAEI-EKESSEKKSLSEKQESNTQPREKKE 608
            L PL+PK V+ D  KL     + + +E  AK +I EKE  + K+  E ++   Q +  +E
Sbjct: 560  LAPLTPKQVYEDQQKLKLSVEQAREKEKSAKEKIKEKEREKTKNARENEKKQKQVKSGEE 619

Query: 609  RKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEFEHLFS 668
            +++  +S++ R S+ R  +   Q + V M K +   TN L  +LP+  + L+QEFE +F 
Sbjct: 620  KESGKMSVFARVSDIRRAMFLRQPMYVFMYKEALLNTNELPENLPASVLSLIQEFEDIFP 679

Query: 669  EEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGY------- 728
            +E+PS LPP+RGIEH+ID +PGA +PNRPAYR+NP+E +E+Q+Q++ELL KGY       
Sbjct: 680  DEIPSGLPPIRGIEHQIDLVPGAALPNRPAYRSNPEETKELQKQITELLDKGYIRESLSP 739

Query: 729  ----------------------AINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGY 788
                                  AINKITIKYRHPIPRLDDMLDEL G  +FTKIDLKSGY
Sbjct: 740  CAVPVLLVPKKDGSWRMCVDCRAINKITIKYRHPIPRLDDMLDELSGAQIFTKIDLKSGY 799

Query: 789  HQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLRE------------ 848
            HQIRM  GDEWKTAFKTK GLYEWLVMPFGLTNAPSTFMRLMNHVLR             
Sbjct: 800  HQIRMREGDEWKTAFKTKLGLYEWLVMPFGLTNAPSTFMRLMNHVLRAFIGKFCVVYFDD 859

Query: 849  ---------------------------------------------YLVSSNGVEVDEEKV 908
                                                         Y+VS+NG+EVD+EKV
Sbjct: 860  ILVYSKTLEEHVPHLRAVLETLRKEVLYANLKKCSFCTDKVVFLGYVVSANGLEVDQEKV 919

Query: 909  KAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFN 968
            +AI++WP P N+S+VRSFHGLASFYRRF+ NFSTIA+PL  ++KKN +F W +DQE +FN
Sbjct: 920  QAIQEWPRPTNISQVRSFHGLASFYRRFVPNFSTIAAPLTSVIKKNSAFHWSEDQERSFN 979

Query: 969  TLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTY 1028
             +KE L  APLL+LP+F  TFEIECDASGVGIGAVL Q+ RP+ +FSEKL+GASL YP Y
Sbjct: 980  LIKECLVKAPLLSLPDFTKTFEIECDASGVGIGAVLTQDGRPVAYFSEKLSGASLNYPVY 1039

Query: 1029 DKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIK 1088
            DKE+YAL+R+L+TWQHYL PKEF+IH+DHE+LK ++ Q+KLN+RHAKW+E++E+FPYVIK
Sbjct: 1040 DKEMYALIRSLETWQHYLRPKEFVIHSDHEALKFIKGQHKLNKRHAKWVEYLESFPYVIK 1099

Query: 1089 YKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYL 1148
            YK+GKEN+VADALSRRY LL TL+++LLGF H+K+LY +D  F     +CE G   + + 
Sbjct: 1100 YKKGKENVVADALSRRYTLLTTLDSKLLGFYHMKELYANDEDFGNIYAACENGSF-EKFF 1159

Query: 1149 LLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHK 1208
              +GFLF++ +LCIP  SIRELLV EAH GGLM H G+SKT   L EHF+WP+MR DV +
Sbjct: 1160 RYEGFLFKENRLCIPQGSIRELLVLEAHSGGLMGHFGISKTLSTLHEHFYWPRMRRDVER 1219

Query: 1209 VCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFS 1268
            +C RC+ CK+AKS++QPHGLY+PLP+P+ PW D+SMDF+LGLP+T+ G DSIFVVVDRFS
Sbjct: 1220 ICERCLICKRAKSKVQPHGLYTPLPIPDTPWTDVSMDFILGLPKTKTGKDSIFVVVDRFS 1279

Query: 1269 KMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLV 1328
            KM+HFI C KTDDA HIA+LFFRE+VRLHG+P++IVSDRD KFLSHFWR LWGKLGTKL+
Sbjct: 1280 KMSHFISCAKTDDAVHIANLFFREIVRLHGLPRTIVSDRDTKFLSHFWRTLWGKLGTKLL 1339

Query: 1329 YS-----------------XGDLLLG------RPWQ---------FDRRV----MYDGYA 1388
            +S                    LL        + W+         ++R V     +  + 
Sbjct: 1340 FSTTCHPQTDGQTEVVNRVLSTLLRALVRKNLKTWEECLPHIEFAYNRSVHSATKFSPFE 1399

Query: 1389 NRYSFTHNGRKTTLIPLSPKDIF---IDHCKLEKKRQEADAKAEIEKESSEKKSLSEKQE 1448
              Y F          PL+P D+    +D    +    +A  KAE  KE   K  + E  E
Sbjct: 1400 TVYGFN---------PLTPLDLLPFPVD----QFVHTDARKKAEFVKELHRK--VRENIE 1459

Query: 1449 SNTQPREKKERKAKS---------VSLYVRS----SEARNVLLS------------NQTI 1508
            S T+   ++  K +          V +++R     ++ R+ LL             N+  
Sbjct: 1460 SRTEQYVQRANKGRKRVVFEPGDWVWVHMRKERFPAQRRSKLLPRGDGPFQVIERINENS 1519

Query: 1509 LVLMCKGSCYFTNMLNPS------LPSDFVVIFVVLLQEFEDLFFEEMPSSLPPLRGIEH 1568
              L   G    +   N S      + +D         +EFED+F +E+PS LPP+RGIEH
Sbjct: 1520 YKLDLPGEYNVSASFNVSDLTPFDVDTDLRTSHFQEGEEFEDIFLDEIPSGLPPIRGIEH 1579

Query: 1569 KIDFIPGAPIPNRPAYRTNPKEAEEIQ--------------------------------- 1628
            +ID +PGA +PNRPAYR+NP+E +E+Q                                 
Sbjct: 1580 QIDLVPGAALPNRPAYRSNPEETKELQKQITELLDKGYIQESLSPCAVSVLLVPKKDGSW 1639

Query: 1629 -------------------------------------XYDVK------------------ 1688
                                                   D+K                  
Sbjct: 1640 RMCADCRAINKITIKYRHPIPRLDDMLDELSGTQIFTKIDLKSGYHQILMREGDEWKTAF 1699

Query: 1689 --------------------------------------CLSFFLLITIF----------- 1748
                                                  C+ +F  I ++           
Sbjct: 1700 KMKLGLYEWLVMPFSLTNAPSTFMRLMNHVLRAFIGKFCVVYFDDILVYSKTLEEHVPHL 1759

Query: 1749 ---------------------------------SSNGVEVDEEKVKAIKDWPTPKNVSEV 1808
                                             S+NG+E D+EKV+AI++WP P N+S+V
Sbjct: 1760 RAVLETLRKEVLYANLKKCSFCTDKVVFLGYVVSANGLEADQEKVQAIQEWPRPTNISQV 1819

Query: 1809 RSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALP 1868
            RSFHGLASFYRRF+ NFSTIA+PL  ++KKN +F W +DQE +FN +KE L  APLL+LP
Sbjct: 1820 RSFHGLASFYRRFVPNFSTIAAPLTSVIKKNSAFHWSEDQERSFNLIKECLVKAPLLSLP 1879

Query: 1869 NFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQ 1928
            +F  TFEIECDASGVGI AVL Q+ R + +FS KL+GASL Y  Y+KE+YAL+R+L+TWQ
Sbjct: 1880 DFTKTFEIECDASGVGIEAVLTQDGRSVAYFSGKLSGASLNYLVYEKEMYALIRSLETWQ 1939

Query: 1929 HYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSR 1988
            HYL PKEF+IH+DHE+LK ++ Q+KLN+RHAKW+E++E+FPYVIKYK+GKEN+VADALSR
Sbjct: 1940 HYLRPKEFVIHSDHEALKFIKGQHKLNKRHAKWVEYLESFPYVIKYKKGKENVVADALSR 1999

Query: 1989 RYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIP 2048
            RY LL TL+++LLGF H+K+LY +D  F     +CE G   + +   +GFLF++ +LCIP
Sbjct: 2000 RYTLLTTLDSKLLGFYHMKELYANDEDFGNIYAACENGSF-EKFFRYEGFLFKENRLCIP 2059

Query: 2049 SCSIRELLVREAHGGGLMAHHGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKSRL 2078
              SIRELLV EAH GGLM H G+SKT   L +HF+WP+MR DV ++C RC+ CK+AKS++
Sbjct: 2060 QGSIRELLVLEAHSGGLMGHFGISKTLSTLHEHFYWPRMRRDVERICERCLICKRAKSKV 2119

BLAST of CmoCh11G012300.1 vs. ExPASy TrEMBL
Match: A0A2N9I2F8 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS46081 PE=4 SV=1)

HSP 1 Score: 1910.2 bits (4947), Expect = 0.0e+00
Identity = 1054/2226 (47.35%), Postives = 1385/2226 (62.22%), Query Frame = 0

Query: 8    TNITDARL-REAQQRTMERLIR-GIEELTDRIGRLEIQNQ----------ARQRIPQPTP 67
            T + D ++  EA    M R+++  +E++ +RI ++E + +           R+R+P    
Sbjct: 198  TKLADPKVFMEAMMSEMRRVMKIEMEQVHERIDQMENRREEQPQNGRNLRRRERVPAREE 257

Query: 68   STDTYEGDNSDHHEDNPHAVGPGLMRGRDHGRKYHNLQQRVPYDDRIDRNVGSIKLKLPK 127
              + Y G   D  ED    V      GR  G +           +R D N+G IK+K+P 
Sbjct: 258  EEERY-GSGFDEEEDRDSIVNNRRPGGRFGGAR-----------NREDNNLGGIKMKIPS 317

Query: 128  FYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLCIAQFKQYAQIWWDKLMSSRRRNLEA 187
            F G++DPE YL+WEK +E VF+CHN+S+ KKV L + +F +YA  WWD+L+ +RRRN E 
Sbjct: 318  FQGRSDPEAYLEWEKKMEFVFDCHNYSETKKVKLAVIEFSEYAVTWWDQLVINRRRNRER 377

Query: 188  PIDSWVEFKESMRKRFVPQYFQRDMAQKLQALKQGRKSVEDYYKEMDTLMDRLELDEDME 247
            PID+W E K  MRKRF+P Y+ R++ +KLQ L+QG +SVEDYYKEM+  M R  ++ED E
Sbjct: 378  PIDTWEEMKVVMRKRFIPSYYYRELYKKLQGLRQGSRSVEDYYKEMEIAMIRANVEEDRE 437

Query: 248  ALMARFLNGLNTEIADKTDLQPYSNIEELLHIAIKIERQIQRRSQRYSSKTFPNSTST-W 307
            A MARFL GLN EI DK ++Q Y  +E+++H+AIK+E+Q++R S    ++   NS+ST W
Sbjct: 438  ATMARFLLGLNREIHDKVEMQHYVELEDMVHMAIKVEQQLKRGS---GTRAGHNSSSTSW 497

Query: 308  KKD-SKNIDYKHRNQEINEKPQAKFEKGESSRTGKEKVEKSNVRNRDLKCWRCQGVGHYS 367
            K   +K +D        ++ P+ + +   +S   + K E S  RNRD+KC+RCQG GH +
Sbjct: 498  KSSHAKPLDK-------SQTPKPEPKSVTTSHVPQGKTEASTSRNRDIKCFRCQGRGHIA 557

Query: 368  RDCPNARIMTIK-EGEIVTDDEAHDDINEETDESEEFSEED---PTHISLVTRRALNTHI 427
              CPN ++M ++  GEIVTD E  D   ++    E+  EE+   P  ++LV RRAL+   
Sbjct: 558  SQCPNKQVMVLQANGEIVTDCE--DSDTDDMPPLEDVFEEEYLAPDALTLVARRALSLQA 617

Query: 428  KEDGLDQRENLFQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYKLQWL 487
            K     QREN+F TRC V+   CSV+ID GSCTNV S+I+V++L L    HPRPYKLQWL
Sbjct: 618  KGVDEIQRENIFHTRCYVKDKVCSVIIDGGSCTNVASTIMVEKLGLPMAKHPRPYKLQWL 677

Query: 488  NDCGEVRVTQQTLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYANRYSF 547
            ND GE+RV +Q LV+F IGKY D+V+CDVV M  G LLLGRPWQFDR+V +DG+ N+YSF
Sbjct: 678  NDSGEIRVNKQVLVAFRIGKYEDEVMCDVVPMQAGHLLLGRPWQFDRQVKHDGFTNKYSF 737

Query: 548  THNGRKTTLIPLSPKDVFIDHCKLEK---KRQEADAKAEIEKESSEKKSLSEKQESNTQP 607
              N R  TL+PL+P+ V+ D  +L+K   +++E++ K + E +   +K+  EK+  ++  
Sbjct: 738  VLNQRLITLVPLTPQQVYEDQVRLQKESDQKKESEQKKKSENQRGAEKNEREKENQSSAL 797

Query: 608  REKKERKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEF 667
              K ER  K  + Y + SE +  + SNQ ++VL+ K +   TN L+ +LPS  V LLQE+
Sbjct: 798  ERKSERTQK--NFYAKVSEIKRAMFSNQPMIVLLYKEALLNTNELDLALPSSIVSLLQEY 857

Query: 668  EHLFSEEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGY-- 727
            E +F EE P  LPP+RGIEH+IDF+PGA IPNRPAYR+NP+E +E+QRQVSELL KG+  
Sbjct: 858  EDVFPEETPHGLPPIRGIEHQIDFVPGATIPNRPAYRSNPEETKELQRQVSELLEKGHVR 917

Query: 728  ---------------------------AINKITIKYRHPIPRLDDMLDELHGCSLFTKID 787
                                       AIN IT+KYRHPIPRLDDMLDELHG  +F+KID
Sbjct: 918  ESMSPCAVPVLLVPKKDGTWRMCVDCRAINNITVKYRHPIPRLDDMLDELHGSCIFSKID 977

Query: 788  LKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL----- 847
            LKSGYHQIR+  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLR ++     
Sbjct: 978  LKSGYHQIRIKEGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLRTFIGRFVV 1037

Query: 848  ----------------------------------------------------VSSNGVEV 907
                                                                VS+ G++V
Sbjct: 1038 VYFDDILIYSKNLDDHVVHVKSVLDVLRKERLFANLKKCTFYTDKLVFLGFVVSAQGIQV 1097

Query: 908  DEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQ 967
            DEEKV+AI+DWP+P +V  VRSFHGLASFYRRF+K+FS++A+PL E++KKNV F W ++Q
Sbjct: 1098 DEEKVRAIQDWPSPTSVGNVRSFHGLASFYRRFVKDFSSLAAPLTEVIKKNVGFRWGEEQ 1157

Query: 968  ELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASL 1027
            E AF  +KEKL++APLL+LPNF  TFEIECDASGVGIGAVLMQ  RP+ +FSEKL+GA+L
Sbjct: 1158 EKAFQLIKEKLTNAPLLSLPNFSKTFEIECDASGVGIGAVLMQEGRPIAYFSEKLSGAAL 1217

Query: 1028 RYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETF 1087
             YPTYDKELYALVRAL+TWQHYLWPKEF+IHTDHESLKHL+ Q+KLN+RHA+W+EFIETF
Sbjct: 1218 NYPTYDKELYALVRALETWQHYLWPKEFVIHTDHESLKHLKGQHKLNKRHARWVEFIETF 1277

Query: 1088 PYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLI 1147
            PYVI+YKQGKEN+VADALSRRY LL+TL+A+LLGFEHIK+LY  D  F     +CEK + 
Sbjct: 1278 PYVIRYKQGKENVVADALSRRYALLSTLDAKLLGFEHIKELYAEDHEFCEEYRACEK-IA 1337

Query: 1148 VDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMR 1207
               +  LDGFLFR+ KLC+P+CS+RELLV+E+HGGGLM H GV+KT  +L +HF+WP+M+
Sbjct: 1338 SGKFFRLDGFLFRENKLCVPNCSMRELLVQESHGGGLMGHFGVAKTLAILQDHFYWPRMK 1397

Query: 1208 HDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVV 1267
             DV ++CGRC+ C+QAKS++QP+GLY+PLP+P+ PWIDISMDFVLGLPRT++G DSIFVV
Sbjct: 1398 RDVERICGRCVTCRQAKSKVQPNGLYTPLPIPSEPWIDISMDFVLGLPRTKRGRDSIFVV 1457

Query: 1268 VDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKL 1327
            VDRFSKMAHFIPCHKTDDA H+ADLFFRE+VRLHG+P++IVSDRD KFLS+FW+ LW KL
Sbjct: 1458 VDRFSKMAHFIPCHKTDDASHVADLFFREIVRLHGMPRTIVSDRDAKFLSYFWKTLWCKL 1517

Query: 1328 GTKLVYSXGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIPLSPKDIFIDHCKLEK 1387
            GTKL++S                                 TT  P               
Sbjct: 1518 GTKLLFS---------------------------------TTCHP--------------- 1577

Query: 1388 KRQEADAKAEIEKESSEKKSLSEKQESNTQPREKKERKAKSVSLYVRSSEARNVLLSNQT 1447
               + D + E+                            +++S  +R+   +N+    + 
Sbjct: 1578 ---QTDGETEVVN--------------------------RTLSTLLRAIIRKNIKTWEE- 1637

Query: 1448 ILVLMCKGSCYFTNMLNPSLPSDFVVIFVVLLQEFEDLFFEEMPSSLPPLRGIEH----- 1507
               L      Y   + + +  S F +++          F    P  L PL   EH     
Sbjct: 1638 --CLPHVEFAYNRAVHSATKFSPFEIVYG---------FNPLTPLDLSPLPSTEHVNLDG 1697

Query: 1508 --KIDFIPGAPIPNRPAYRTNPKEAEEIQXYDVKCLSFFLLITIFSSNGVEVDEEKVKAI 1567
              K DF+               K+  E   ++++                       +  
Sbjct: 1698 KKKADFV---------------KQIHEKARFNIE-----------------------RRT 1757

Query: 1568 KDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLK 1627
            + + T  N    +       +    ++     A   ++L+ +          +  F  L+
Sbjct: 1758 EQYATQANKGRRQLVFEPGDWVWLHMRKERFPAKRHSKLLPRG---------DGPFQILE 1817

Query: 1628 EKLSSAPLLALP---NFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTY 1687
                +A  L LP   N  +TF +  D S   +G  L  N      F E+  G      T 
Sbjct: 1818 RINDNAYKLDLPGEYNVSATFNV-TDLSPFDVGDDLRANP-----FQEE--GNDGDQGTI 1877

Query: 1688 DKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIK 1747
             K+L          Q  + P                    + R  AK            K
Sbjct: 1878 SKDLI---------QVPIGP--------------------VTRARAK------------K 1937

Query: 1748 YKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKG-----LI 1807
            +K                +LN L         I++L+     + P  E   +G      +
Sbjct: 1938 FKD---------------VLNGL---------IQELWAQANSWRPCSEHDPRGQQRIVTL 1997

Query: 1808 VDNYLLLDGF---LFRKGKLCIPSCSIRELL--VREAHGGGLMAHHGVSKTYDMLSKHFF 1867
            + NYL L GF        +   PS  +   L  V+E+HGGGLM H GVSKT  +L  HF+
Sbjct: 1998 IHNYLNLSGFPVAFISTYQTETPSPFVASSLYRVQESHGGGLMGHFGVSKTLAILQGHFY 2057

Query: 1868 WPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYD 1927
            WP M+ DV ++CGRC+ C+QAKS++QP+GLY+PLP+P+ PWIDIS+DFVLGLPRT++G D
Sbjct: 2058 WPHMKRDVERICGRCVTCRQAKSKVQPNGLYTPLPIPSEPWIDISIDFVLGLPRTKRGRD 2117

Query: 1928 SIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRV 1987
            SIFVVVDRFSKMAHFIPCHKTDDA H+ADLFFRE+VRLHG+P++IVSDRD KFLS+FW+ 
Sbjct: 2118 SIFVVVDRFSKMAHFIPCHKTDDASHVADLFFREIVRLHGMPRTIVSDRDAKFLSYFWKT 2177

Query: 1988 LWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVV 2047
            LW KLGTKL++STTCHPQTDG+TEVVNRT++ +LRAII KN+KTWE+CLP +EFAYNR V
Sbjct: 2178 LWCKLGTKLLFSTTCHPQTDGETEVVNRTLSTLLRAIIRKNIKTWEECLPHVEFAYNRAV 2183

Query: 2048 HSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFSHKLHKQVKEQIEKQNSK 2101
            HS TK +PFEIVYGFNPLTP+DL P+P  E VN D   K +F  ++H++ +  IE++  +
Sbjct: 2238 HSATKFSPFEIVYGFNPLTPLDLSPLPLTEHVNLDGKKKADFVKQIHEKARLNIERRTEQ 2183

BLAST of CmoCh11G012300.1 vs. ExPASy TrEMBL
Match: A0A2N9I3L4 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS46293 PE=4 SV=1)

HSP 1 Score: 1713.7 bits (4437), Expect = 0.0e+00
Identity = 1038/2769 (37.49%), Postives = 1269/2769 (45.83%), Query Frame = 0

Query: 399  LVTRRALNTHIKEDGLDQRENLFQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQ 458
            LV RR LN  +KE+   QRENLF TRC V +  CSV+ID GSCTNV S+ LV++L L T 
Sbjct: 19   LVARRVLNVQVKEEESRQRENLFHTRCFVNNKVCSVIIDGGSCTNVASTYLVEKLALTTL 78

Query: 459  PHPRPYKLQWLNDCGEVRVTQQTLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRV 518
             HP PY+LQWLN+CGE++VT+Q LV+ +IGKY D+VLCDVV MH   LLLGRPWQ+D+R 
Sbjct: 79   KHPHPYRLQWLNECGEIKVTRQVLVALSIGKYEDEVLCDVVPMHACHLLLGRPWQYDKRA 138

Query: 519  MYDGYANRYSFTHNGRKTTLIPLSPKDVFIDHCKLEKKRQEADAKAEIEKESSEKKSLSE 578
             +DG+ NRYSFTH G+   L+PL+PK                                  
Sbjct: 139  KHDGFTNRYSFTHKGQPIILVPLTPK---------------------------------- 198

Query: 579  KQESNTQPREKKERKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSD 638
                                                                        
Sbjct: 199  ------------------------------------------------------------ 258

Query: 639  FVVLLQEFEHLFSEEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSE 698
                 QEFE +  EE+P  LPP+RGIEH+IDFIPGA IPNRP YR+NP+E +E+QRQV E
Sbjct: 259  -----QEFEDVLPEEVPYGLPPIRGIEHQIDFIPGASIPNRPPYRSNPEETKELQRQVGE 318

Query: 699  LLAKGY-----------------------------AINKITIKYRHPIPRLDDMLDELHG 758
            LL KGY                             AIN IT+KYRHPIPRLDDMLDELHG
Sbjct: 319  LLEKGYVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINNITVKYRHPIPRLDDMLDELHG 378

Query: 759  CSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLR 818
              +FTKIDLKSGYHQIRM  GDEWKTAFKTKYGLYEW VMPFGLTNAPSTFMRLMNHVLR
Sbjct: 379  SCVFTKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWFVMPFGLTNAPSTFMRLMNHVLR 438

Query: 819  EYL--------------------------------------------------------- 878
             ++                                                         
Sbjct: 439  AFIGRFVVVYFDDILIYSKNLEEHVMHLKSVLEILRKEKLFANLKKCTFCTDKLVFLGFV 498

Query: 879  VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNV 938
            VS  G+EVDEEKVKAI++WPTP  +S+V SFHGLASFYR F+++FS++ASPL E++KKNV
Sbjct: 499  VSKRGIEVDEEKVKAIQEWPTPTTISQVWSFHGLASFYRWFVRDFSSLASPLTEVIKKNV 558

Query: 939  SFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFS 998
             F W K+QE AFN +KEKL++APLL LPNF  TFEIECDASG+GIGAVLMQ  RP+ +FS
Sbjct: 559  PFKWGKEQEKAFNLIKEKLTNAPLLVLPNFAKTFEIECDASGIGIGAVLMQEGRPVAYFS 618

Query: 999  EKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAK 1058
            EKL+GA+L YPTYDKE+YALVRAL+ WQHYLWPKEF+IHTDHESLKHL+ Q +LNRRHAK
Sbjct: 619  EKLSGAALNYPTYDKEMYALVRALENWQHYLWPKEFVIHTDHESLKHLKGQQRLNRRHAK 678

Query: 1059 WLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFV 1118
            W+EFIETFPYVI+YKQGKEN+VADALSRRY LL+ L+ +LLGFE+IKDLY  D  F    
Sbjct: 679  WVEFIETFPYVIRYKQGKENVVADALSRRYALLSILDTKLLGFEYIKDLYAQDSDFGEVF 738

Query: 1119 ESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSE 1178
             +CEK +    +   DGFLFR+ KLC+P CS+REL VREAHGGGLM H GV+KT  +L +
Sbjct: 739  NACEK-VAFGKFYRHDGFLFRENKLCVPMCSLRELFVREAHGGGLMGHFGVAKTLGILHD 798

Query: 1179 HFFWPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRK 1238
            HFFWP M+ DV ++C +CI CKQAKS+L+PHGLY+PLP+P+ PW DISMDFVLGLPRT++
Sbjct: 799  HFFWPHMKRDVERICEKCITCKQAKSKLKPHGLYTPLPIPSEPWTDISMDFVLGLPRTKR 858

Query: 1239 GYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFL--- 1298
            G DS+FVVV RFSKMAHFI CHKTDDA HIADLFF+E+VRLHG+P++IVSDRD KF    
Sbjct: 859  GRDSVFVVVGRFSKMAHFIACHKTDDASHIADLFFKEIVRLHGMPRTIVSDRDAKFFELL 918

Query: 1299 --------------------SHFWRVLWGKLGTKLVYSXGD------------------- 1358
                                ++ W V W  +G +  Y  GD                   
Sbjct: 919  LEDFVGKVGNQVIVFYYLSSTNGWTVFWRNVGVQNSYLEGDGPFQIVERINDNAYKLDLP 978

Query: 1359 ------------------------------------------------------------ 1418
                                                                        
Sbjct: 979  GDDSRSNPFEERGNDENQQALLKDPLHVPVGPITRARSKKIKEALNGLIQDIWADSTTGH 1038

Query: 1419 ------------------------------------------------------------ 1478
                                                                        
Sbjct: 1039 SKLGPKEDEGLSWASNPILFAGSQSVSIEDRIIPYLWSFIPLETWQHYPFVIHTDHESLK 1098

Query: 1479 ------------------------------------------------------------ 1538
                                                                        
Sbjct: 1099 KGRIMLDGWNTLRPFHMSSVTNKVRKNIVADALSRRYVLLTSMSAKLLGFEYVKDMYADD 1158

Query: 1539 ------------------------------------------------------------ 1598
                                                                        
Sbjct: 1159 VDFSDVYKACDKTAFGKFYKHDGYLFKESKLCVPSCSMRELLVREAHGGGLMGHFGVKKT 1218

Query: 1599 ------------------------------------------------------------ 1658
                                                                        
Sbjct: 1219 LDILHEHFFWPKMKKDVNRICGRSSLDGQKKAELVKSLHERHITMSHKGDSSPKGKADNS 1278

Query: 1659 ------------------------------------------------------------ 1718
                                                                        
Sbjct: 1279 SFVLQAMQQQFERLNFVLGEVRDRMDLQDAAIRNLQGGRDRRRRERRVENEYENEGDGED 1338

Query: 1719 ------------------------------------------------------------ 1778
                                                                        
Sbjct: 1339 EEDLASEVGSGRHRRVRRERGHEWNPGGRDGVDRSLGNIKMKIPSFQGRTDPEVYLEWEK 1398

Query: 1779 ------------------------------------------------------------ 1838
                                                                        
Sbjct: 1399 KIDLVFDCHNYSEEKKVKLAVIEFTDYAIIWWDQLVTNRRRNTERPVETWGELKALMRRR 1458

Query: 1839 ------------------------------------------------------------ 1898
                                                                        
Sbjct: 1459 FVPSHFYRDLYQRLQNLTQGSRSVEDYHKEMEVAMIRANVEEDREATMARFLSGLNRDIA 1518

Query: 1899 ------------------------------------------------------------ 1958
                                                                        
Sbjct: 1519 NVIELQHYVEIEDMVHMAMKVERQLKRKGTARYTSVSNTTWKSKWDRNDSAEAKRKTEPP 1578

Query: 1959 ------------------------------------------------------------ 2018
                                                                        
Sbjct: 1579 KGKDEGTSNKPKVESQPSRNRDIKCFKCLGSGHIASQCPNRRVMIMRDNGEVMTESESLV 1638

Query: 2019 ------------------------------------------------------------ 2078
                                                                        
Sbjct: 1639 ARRALNTHIKVDDAEQQRENIFHTRCHVNNKWLNDCGEVRVDRQVLVTFSIGKYLDEVLC 1698

Query: 2079 ---------LLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIPLSPKDIFIDHCKLE-- 2110
                     +LLGRPWQ+DRRV +DG+ N YSF   G+   L PL+P  ++ D  KL+  
Sbjct: 1699 DVVPMHAGHILLGRPWQYDRRVTHDGFKNMYSFVKGGKTIKLAPLTPSQVYEDQLKLQSE 1758

BLAST of CmoCh11G012300.1 vs. ExPASy TrEMBL
Match: A0A2N9GUW0 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS31344 PE=4 SV=1)

HSP 1 Score: 1668.3 bits (4319), Expect = 0.0e+00
Identity = 993/2232 (44.49%), Postives = 1320/2232 (59.14%), Query Frame = 0

Query: 18   AQQRTMERLIRGIEELTDRIGRLE-IQNQARQRIPQPTP---------STDTYEGDNSDH 77
            A Q+  ER+     E+ DR+ R + +    R+  PQ  P           D  +GD+ D 
Sbjct: 4    AMQQQFERMDVMFNEIRDRMDRQDAVIATWREGRPQGGPYVRRQARRAPVDDSDGDHEDE 63

Query: 78   ---HEDNPHAVGPGLMRGRDHGRKYHNLQQRVPYDDRIDRNVGSIKLKLPKFYGKTDPEE 137
                ED     G  + RG   GR +   +  + + D  D N+G+IK+K+P F GK DPE 
Sbjct: 64   FEGEEDQASLNGRFVPRGERRGRGF---RTGLRWRDGTDGNLGNIKMKIPSFQGKNDPEA 123

Query: 138  YLQWEKTVESVFNCHNFSDEKKVLLCIAQFKQYAQIWWDKLMSSRRRNLEAPIDSWVEFK 197
            YL+WEK VE +F CHN+S+EKKV L + +F  YA IWWD+L+ +RRRN E  I++W E +
Sbjct: 124  YLEWEKKVELIFECHNYSEEKKVKLAVIEFTDYAIIWWDQLVMNRRRNHERAIETWEEMR 183

Query: 198  ESMRKRFVPQYFQRDMAQKLQALKQGRKSVEDYYKEMDTLMDRLELDEDMEALMARFLNG 257
              MR+RFVP ++ RD+ QKLQ+L QG +SV+DYYKEM+  + R  ++ED EA MARFLNG
Sbjct: 184  AIMRRRFVPSHYYRDLYQKLQSLTQGYRSVDDYYKEMEIALIRANVEEDREATMARFLNG 243

Query: 258  LNTEIADKTDLQPYSNIEELLHIAIKIERQIQRRSQRYSSKTFPNSTSTWKKDSKNIDYK 317
            LN +IA+  +LQ Y  +E+++H+AIK+ERQ++R+  R  S   P S+++WK + +    K
Sbjct: 244  LNRDIANVVELQHYVELEDMVHMAIKVERQLKRKGTR--SFQNPGSSTSWKSNWR----K 303

Query: 318  HRNQEINEKPQAKFEKGESSRTGKEKVEKSNVRNRDLKCWRCQGVGHYSRDCPNARIMTI 377
                 +  K +    + E     K K E S  RNRD+KC+RC GVGH +  CPN R M  
Sbjct: 304  DEGAVLKSKTEPPKRREEVPSVNKGKTE-SQTRNRDIKCFRCLGVGHIASQCPNKRTMIA 363

Query: 378  K-EGEIVTDDEAHDD----INEETDESEEFSEEDPTHISLVTRRALNTHIKEDGLD-QRE 437
            + +GE+ T+ E+  D    + +  D+  E+  E     SLV RRAL+  +KED ++ QRE
Sbjct: 364  RVDGEVETESESDADQMPMLEDTCDDDVEYPVEGE---SLVARRALSAQVKEDDMEQQRE 423

Query: 438  NLFQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYKLQWLNDCGEVRVT 497
            N+F TRC + +  CS++ID GSCTNV S+ LV++LN  T  HP PYKL+WLNDCGE++V 
Sbjct: 424  NIFHTRCHINNKVCSMIIDGGSCTNVASTTLVEKLNFPTLKHPMPYKLKWLNDCGEIKVN 483

Query: 498  QQTLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTL 557
            +Q L+SF+IGKY D+VLCDVV MH G +LLGRPWQFDR+ ++DG+ NRYSF  + R  TL
Sbjct: 484  KQVLISFSIGKYKDEVLCDVVPMHAGHILLGRPWQFDRKAIHDGFKNRYSFVKDSRTVTL 543

Query: 558  IPLSPKDVFIDHCKLEKKR------QEADAKAEIEKESSEKKSLSEKQESNTQPREKKER 617
            +PL+P+ V+ D  KL+++       +   +K E EKES  KK   +K+E  T   E+++ 
Sbjct: 544  VPLTPRQVYEDQVKLKRENELKKNCETESSKKEDEKESERKKESEKKKERVTNTSEQQK- 603

Query: 618  KAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEFEHLFSE 677
              K VS Y ++S+ ++   +NQ I VL+ K +C+ TN L+ SLPS  V LLQE+E +F  
Sbjct: 604  --KQVSFYAKASDVKSAFYANQPIFVLLYKEACFNTNELDESLPSVVVSLLQEYEDVFPN 663

Query: 678  EMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGY-------- 737
            ++PS LPP+RGIEH+IDF+PGA IPNRPAYR+NP+E +E+QRQV ELLAKG+        
Sbjct: 664  DVPSGLPPIRGIEHQIDFVPGATIPNRPAYRSNPEETKELQRQVEELLAKGHVRESMSPC 723

Query: 738  ---------------------AINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYH 797
                                 AIN IT+KYRHPIPRLDDMLDELHG  +FTKIDLKSGYH
Sbjct: 724  AVPVLLVPKKDGTWRMCVDCRAINNITVKYRHPIPRLDDMLDELHGSCIFTKIDLKSGYH 783

Query: 798  QIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL----------- 857
            QIRM  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH LR +L           
Sbjct: 784  QIRMKEGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHALRAFLGRFVVVYFDDI 843

Query: 858  ----------------------------------------------VSSNGVEVDEEKVK 917
                                                          V + G+ VDEEKVK
Sbjct: 844  LVYSKSLDEHIDHLHCVLTVLRKEKLYANLKKCSFCLDKVVFLGFVVGAKGIAVDEEKVK 903

Query: 918  AIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNT 977
            AIK+WPTPK+++EVRSFHGLASFYRRF+K+FST+A+PL E+VKK+V F W  +Q+ AF  
Sbjct: 904  AIKEWPTPKSITEVRSFHGLASFYRRFVKDFSTLAAPLTEIVKKSVGFKWGSEQDRAFIE 963

Query: 978  LKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYD 1037
            +KE+L  APLLALP+F  TFEIECDASG+GIGAVLMQ +RP+ +FSEKL GA+L YPTYD
Sbjct: 964  IKERLCGAPLLALPDFSKTFEIECDASGIGIGAVLMQEKRPIAYFSEKLNGAALNYPTYD 1023

Query: 1038 KELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKY 1097
            KELYALVRAL+TWQHYLWPKEF+IHTDHESLKHL+ Q KLNRRHA+W+EFIETFPYVIKY
Sbjct: 1024 KELYALVRALETWQHYLWPKEFVIHTDHESLKHLKGQGKLNRRHAQWMEFIETFPYVIKY 1083

Query: 1098 KQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLL 1157
            KQGKENIVADALSRRY L+ TLNA+LLGFE++K+LY +D  FA    +CEK      +  
Sbjct: 1084 KQGKENIVADALSRRYALIATLNAKLLGFEYVKELYVNDDDFASVFAACEKAAF-GKFYR 1143

Query: 1158 LDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKV 1217
            LDG+LFR+ +LC+P+ S+RELLVREAHGG                    W K + D+   
Sbjct: 1144 LDGYLFRENRLCVPNSSMRELLVREAHGGE-------------------WEK-KIDLVFD 1203

Query: 1218 CGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSK 1277
            C      K+ K  +     Y+ +      W  +  +      R  + +  +  ++ R   
Sbjct: 1204 CHNYSEEKKVKLAVIEFTDYAII-----WWDQLVTNRRRNTERPVETWGELKALMRR--- 1263

Query: 1278 MAHFIPCH-----------KTDDAKHIADLFFREVVRLHGIPKSIVSDRD---VKFLSHF 1337
               F+P H            T  ++ + D  + + + +  I  ++  DR+    +FLS  
Sbjct: 1264 --RFVPSHFYRDLYQRLQNLTQGSRSVED--YHKEMEVAMIRANVEEDREATMARFLSGL 1323

Query: 1338 WRVLWGKLGTKLVYSXGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIPLSPKDIF 1397
             R +   +  +      D ++    + ++++   G A RY+   N   TT      K   
Sbjct: 1324 NRDIANVIELQHYVEIED-MVHMAMKVEQQLKRKGTA-RYTSVSN---TTWKSKGDK--- 1383

Query: 1398 IDHCKLEKKRQEADAKAEIEKESSEKKSLSEKQESNTQPREKKERKAKSVSLYVRSSE-- 1457
              +   E KR+      + E  S++ K ++E ++ +    E  +       +Y  + E  
Sbjct: 1384 --NDPAEAKRKTEPPMGKDEGTSNKPKVMTESEDDSDGMPELVDASDDDEVVYPVTGESL 1443

Query: 1458 -ARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVIFVVLLQEFEDLFFEEMPSSLPP 1517
             AR  L ++  +            N+ +     +     +   +E+ED+F  ++PS LPP
Sbjct: 1444 VARRALNTHFKV----DDAEQQRENIFHTRCHVNNKCCCLFDCKEYEDVFPNDVPSGLPP 1503

Query: 1518 LRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQXYDVKCLSFFLLITIFSSNGVEVDEEK 1577
            +RGIEH+IDF+PGA IPNRPAYR+NP+E +E+Q +  + L+   +    S   V V    
Sbjct: 1504 IRGIEHQIDFVPGATIPNRPAYRSNPEETKELQRHVEELLAQGHVRESMSPCAVSV---- 1563

Query: 1578 VKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAF 1637
            +   K   T +   + ++ + + S     I++   + + L                    
Sbjct: 1564 LLVPKKDGTWRMCVDCKAINNITSL-DEHIEHLHCVLAVLR------------------- 1623

Query: 1638 NTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPT 1697
               KEKL +       N ++ +   C    V +G ++  + + L    EK  G       
Sbjct: 1624 ---KEKLYA-------NLKTCY--FCLDKVVFLGYLV--SGKGLAVDEEKEIGFGFTCAK 1683

Query: 1698 YDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVI 1757
             D +  A                               + KL+ R     + +E      
Sbjct: 1684 KDSQPIA-------------------------------KTKLHPRGDGPFQILEKI---- 1743

Query: 1758 KYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNY 1817
                 K+N     L   Y +  T     L    + +    D    PF E    G      
Sbjct: 1744 -----KDNAHKVDLPGEYNVSATFTVSDLSPFDVGE----DSRSNPFEERGNDGNQSGPS 1803

Query: 1818 LLLDGFLFRKGKLCIPSCSI---RELLVREAHGGGLMAHHGVSKTYDMLSKHFFWPKMRH 1877
            L        K  L +P   I   R   ++EA  G       V  T+D  SK    P ++ 
Sbjct: 1804 L--------KDPLQVPYGPITRSRAKKIKEAMQG------LVQSTWDEASKS---PTIK- 1863

Query: 1878 DVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVV 1937
                          AKSR+ PHGLY+PLPVP+ PW+DISMDFVLGLPR+RKG DSIFVVV
Sbjct: 1864 --------------AKSRVLPHGLYTPLPVPSAPWVDISMDFVLGLPRSRKGRDSIFVVV 1923

Query: 1938 DRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLG 1997
            DRFSKMAHFI CHKTDDA HIADLFFRE+VRLHG+P+SIVSDRDVKFLS+FW+VLWGKLG
Sbjct: 1924 DRFSKMAHFISCHKTDDATHIADLFFREIVRLHGVPRSIVSDRDVKFLSYFWKVLWGKLG 1983

Query: 1998 TKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKC 2057
            TKL++STTCHPQTDGQTEVVNRT++ +LR II KNLK WEDCLPFIEFAYNR VHSTT  
Sbjct: 1984 TKLLFSTTCHPQTDGQTEVVNRTLSTLLRTIIQKNLKNWEDCLPFIEFAYNRSVHSTTDF 2043

Query: 2058 TPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFSHKLHKQVKEQIEKQNSKVATRIN 2115
            +PFEIVYGFNPLTP+DLLP+P  E  + D   K E   KLH+ V++ IEK+N + A + N
Sbjct: 2044 SPFEIVYGFNPLTPLDLLPLPVNERTSLDGQKKAEMVKKLHESVRQHIEKKNEQYANKAN 2056

BLAST of CmoCh11G012300.1 vs. ExPASy TrEMBL
Match: A0A6J1DWE9 (LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 OS=Momordica charantia OX=3673 GN=LOC111024115 PE=4 SV=1)

HSP 1 Score: 1614.0 bits (4178), Expect = 0.0e+00
Identity = 954/2062 (46.27%), Postives = 1062/2062 (51.50%), Query Frame = 0

Query: 1    MENPDDNTNITDARLREAQQRTMERLIRGIEELTDRIGRLEIQNQARQRIPQP----TPS 60
            M N     NI+DAR REA + T+ER+IR +E LT+RIGRLE QNQA  R+P P       
Sbjct: 1    MANLGXEGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQAEGRVPPPPXXRVMD 60

Query: 61   TDTYEGDNSDHHEDNPHA-----------VGPGLMRGRDHGRKYHNLQQRVPYDDRIDRN 120
             D YEGD SDH ED+              VG GL RGR  GR YHN Q+    + ++DRN
Sbjct: 61   EDEYEGDGSDHWEDDQATVLAXPRGGDXHVGXGLDRGRGRGRGYHNFQRAPRANAQVDRN 120

Query: 121  VGSIKLKLPKFYGKTDPEEYLQWEKTVESVFNCHNFSDEKKVLLCIAQFKQYAQIWWDKL 180
            +G+IKLKL KFYGKTDPE Y+QW K VE VF+CHNFS+EKKV LC+AQFK YA  WWDKL
Sbjct: 121  LGNIKLKLXKFYGKTDPEAYIQWXKEVEFVFDCHNFSEEKKVRLCLAQFKDYAVTWWDKL 180

Query: 181  MSSRRRNLEAPIDSWVEFKESMRKRFVPQYFQRDMAQKLQALKQGRKSVEDYYKEMDTLM 240
            MS RRRNLEAPIDSW EFKE +R+RFVP++F RD AQKLQ L+QG KSVEDYYKEMDTLM
Sbjct: 181  MSIRRRNLEAPIDSWYEFKELLRRRFVPRHFHRDTAQKLQVLRQGSKSVEDYYKEMDTLM 240

Query: 241  DRLELDEDMEALMARFLNGLNTEIADKTDLQPYSNIEELLHIAIKIERQIQRRSQRYSSK 300
            +RL+L+EDME LMARFLNGLN EIADK DLQPY NIEE+LH+AIKIE+Q+QRRSQ YS+K
Sbjct: 241  NRLDLEEDMENLMARFLNGLNKEIADKVDLQPYFNIEEMLHLAIKIEKQLQRRSQGYSTK 300

Query: 301  TFPNSTSTWKKDSKNIDYKHRNQEINEKPQAKFEKGESSRTGKEKVEKSNVRNRDLKCWR 360
             F +S+STWK DSK+ D K RN+E  EKP+ KFEKGESS+  KEKV +SNVRN ++KCW+
Sbjct: 301  PFSHSSSTWKTDSKSADSKLRNREAYEKPKEKFEKGESSKKRKEKVVESNVRNWNIKCWK 360

Query: 361  CQGVGHYSRDCPNARIMTIKEGEIVTDDEAHDDINEETDESEEFSEEDPTHISLVTRRAL 420
            CQGVGHYSRDCPN R M I+EGE VT+DE  D         EE+SEEDPTH+SLVTRRAL
Sbjct: 361  CQGVGHYSRDCPNRRTMIIREGEFVTEDEKED---------EEYSEEDPTHMSLVTRRAL 420

Query: 421  NTHIKEDGLDQRENLFQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYK 480
            +T IKEDGLDQRENLF TRCLVQ+VPCSV+IDSGSCTNVVS ILVK+LNL+T+PHPRP K
Sbjct: 421  STQIKEDGLDQRENLFHTRCLVQNVPCSVIIDSGSCTNVVSFILVKKLNLETKPHPRPSK 480

Query: 481  LQWLNDCGEVRVTQQTLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYAN 540
            LQWL+DCGEV+V++Q LVSF IGKYVDDVLCDVV MH GD+LLGRP QFDRR        
Sbjct: 481  LQWLSDCGEVKVSRQALVSFAIGKYVDDVLCDVVPMHAGDILLGRPRQFDRR-------- 540

Query: 541  RYSFTHNGRKTTLIPLSPKDVFIDHCKLEKKRQEADAKAEIEKESSEKKSLSEKQESNTQ 600
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 601  PREKKERKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQE 660
                                                                       E
Sbjct: 601  -----------------------------------------------------------E 660

Query: 661  FEHLFSEEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGYA 720
            FE LF EEMP  LPPLRGIE+KIDFIPGA IPNRPAYRTNP EA+EIQRQV ELLA GY 
Sbjct: 661  FEDLFPEEMPKELPPLRGIEYKIDFIPGATIPNRPAYRTNPTEAKEIQRQVDELLAHGY- 720

Query: 721  INKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYE 780
                                                                        
Sbjct: 721  ------------------------------------------------------------ 780

Query: 781  WLVMPFGLTNAPSTFMRLMNHVLREYLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHG 840
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 841  LASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFEST 900
                                                                        
Sbjct: 841  ------------------------------------------------------------ 900

Query: 901  FEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP 960
                                                                        
Sbjct: 901  ------------------------------------------------------------ 960

Query: 961  KEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLL 1020
                                                                        
Sbjct: 961  ------------------------------------------------------------ 1020

Query: 1021 NTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIR 1080
                                                                      +R
Sbjct: 1021 ----------------------------------------------------------VR 1080

Query: 1081 ELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCGRCIACKQAKSRLQPHGL 1140
            E L                                                         
Sbjct: 1081 ESL--------------------------------------------------------- 1140

Query: 1141 YSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADL 1200
             SP  VP                          V++                        
Sbjct: 1141 -SPXSVP--------------------------VIL------------------------ 1200

Query: 1201 FFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSXGDLLLGRPWQFDRRVMY 1260
                      +PK                    K GT              W+F      
Sbjct: 1201 ----------VPK--------------------KDGT--------------WRF------ 1218

Query: 1261 DGYANRYSFTHNGRKTTLIPLSPKDIFIDHCKLEKKRQEADAKAEIEKESSEKKSLSEKQ 1320
                                                                        
Sbjct: 1261 ------------------------------------------------------------ 1218

Query: 1321 ESNTQPREKKERKAKSVSLYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFV 1380
                                                                        
Sbjct: 1321 ------------------------------------------------------------ 1218

Query: 1381 VIFVVLLQEFEDLFFEEMPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQXYD 1440
                                                                        
Sbjct: 1381 ------------------------------------------------------------ 1218

Query: 1441 VKCLSFFLLITIFSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTI 1500
                       I SS GVEVD EKVKAI +WPTP NV EVRSFHGLASFYRRFIK+FST+
Sbjct: 1441 -----------IVSSRGVEVDSEKVKAISEWPTPTNVGEVRSFHGLASFYRRFIKDFSTL 1218

Query: 1501 ASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAV 1560
            ASPLNELV KNV F+W   QE AFN LK+KL +APLLALPNF+ TFEIECDASG+GIGAV
Sbjct: 1501 ASPLNELVXKNVPFVWGTSQEHAFNVLKDKLCNAPLLALPNFDLTFEIECDASGIGIGAV 1218

Query: 1561 LMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHL 1620
            LMQ  +PLMFFSEKL+GASLRYPTYDKELYALVRALQTWQHYLWPKEF+IHTDHESLKHL
Sbjct: 1561 LMQTHKPLMFFSEKLSGASLRYPTYDKELYALVRALQTWQHYLWPKEFVIHTDHESLKHL 1218

Query: 1621 RVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKD 1680
            R Q+KLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRY L NTLNA++LGFEH+K 
Sbjct: 1621 RGQSKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYALFNTLNAKILGFEHLKS 1218

Query: 1681 LYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAH 1740
            LY  D+FFA FVESCEKGL+VD+Y+L +GFLFRKGKLCIPSCSIRELLV+EAHGGGLMAH
Sbjct: 1681 LYLDDVFFASFVESCEKGLVVDDYMLFEGFLFRKGKLCIPSCSIRELLVKEAHGGGLMAH 1218

Query: 1741 HGVSKTYDMLSKHFFWPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDIS 1800
             GVSKTYDML++HF+WPKM+HDVHKVC  CIACKQAKSRLQPHGLYSPLPVP  PW+DIS
Sbjct: 1741 FGVSKTYDMLAEHFYWPKMKHDVHKVCKHCIACKQAKSRLQPHGLYSPLPVPTTPWVDIS 1218

Query: 1801 MDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSI 1860
            MDFVLGLPRTRKG DSIFVVVDRFSKMAHFIPCHKTDDAK IADLFFREVVRLHGIP++I
Sbjct: 1801 MDFVLGLPRTRKGRDSIFVVVDRFSKMAHFIPCHKTDDAKLIADLFFREVVRLHGIPRTI 1218

Query: 1861 VSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTW 1920
            VSDRDVK LSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTM  MLRAIIDKNLKTW
Sbjct: 1861 VSDRDVKXLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMVTMLRAIIDKNLKTW 1218

Query: 1921 EDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFSHK 1980
            EDCLP IEFAYNR VH TTKCTPFEIVYGFNP++P+DLLP+P  EFV++DA+ KV+  H+
Sbjct: 1921 EDCLPLIEFAYNRAVHXTTKCTPFEIVYGFNPISPLDLLPMPPNEFVSYDASTKVDLVHR 1218

Query: 1981 LHKQVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPF 2040
            LHK+VKE+IEKQN+KVA+RINKGRK+++F PGDWVWVHFRKERFPTQR+SKLLPRGDGPF
Sbjct: 1981 LHKEVKERIEKQNTKVASRINKGRKLLVFXPGDWVWVHFRKERFPTQRRSKLLPRGDGPF 1218

Query: 2041 QVLERINDNAYKIDLPGKYGVS 2048
            QVLERINDNAYKIDLPGKYGVS
Sbjct: 2041 QVLERINDNAYKIDLPGKYGVS 1218

BLAST of CmoCh11G012300.1 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 91.7 bits (226), Expect = 7.9e-18
Identity = 43/96 (44.79%), Postives = 62/96 (64.58%), Query Frame = 0

Query: 791 YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKK 850
           +++S  GV  D  K++A+  WP PKN +E+R F GL  +YRRF+KN+  I  PL EL+KK
Sbjct: 37  HIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK 96

Query: 851 NVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTF 887
           N S  W +   LAF  LK  +++ P+LALP+ +  F
Sbjct: 97  N-SLKWTEMAALAFKALKGAVTTLPVLALPDLKLPF 131

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q993155.0e-10235.70Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q7LHG52.5e-10136.09Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q9UR079.5e-9332.67Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... [more]
P0CT419.5e-9332.67Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... [more]
P0CT349.5e-9332.67Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
Match NameE-valueIdentityDescription
A0A1U8MQK50.0e+0048.18Reverse transcriptase OS=Gossypium hirsutum OX=3635 GN=LOC107939043 PE=4 SV=1[more]
A0A2N9I2F80.0e+0047.35Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS46081 PE=4 SV=1[more]
A0A2N9I3L40.0e+0037.49Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS46293 PE=4 SV=1[more]
A0A2N9GUW00.0e+0044.49Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS31344 PE=4 SV=1[more]
A0A6J1DWE90.0e+0046.27LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 OS=Momordica charantia... [more]
Match NameE-valueIdentityDescription
ATMG00860.17.9e-1844.79DNA/RNA polymerases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 16..43
NoneNo IPR availableCOILSCoilCoilcoord: 559..579
NoneNo IPR availableCOILSCoilCoilcoord: 1283..1303
NoneNo IPR availableGENE3D1.10.340.70coord: 1679..1760
e-value: 5.1E-17
score: 63.9
coord: 1036..1117
e-value: 3.7E-17
score: 64.4
NoneNo IPR availableGENE3D3.10.10.10HIV Type 1 Reverse Transcriptase, subunit A, domain 1coord: 705..775
e-value: 3.8E-34
score: 119.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 42..85
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1286..1318
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 2059..2089
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 280..296
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 559..597
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 45..60
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 2063..2089
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 61..85
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 279..333
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 297..333
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 1426..1879
coord: 771..1234
NoneNo IPR availablePANTHERPTHR24559:SF360SUBFAMILY NOT NAMEDcoord: 771..1234
coord: 1426..1879
NoneNo IPR availablePANTHERPTHR24559:SF360SUBFAMILY NOT NAMEDcoord: 358..704
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 358..704
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 1530..1645
e-value: 8.55952E-54
score: 182.307
NoneNo IPR availableCDDcd01647RT_LTRcoord: 706..800
e-value: 5.86418E-40
score: 144.66
NoneNo IPR availableCDDcd00303retropepsin_likecoord: 423..512
e-value: 1.83786E-17
score: 77.3768
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 887..1002
e-value: 8.55952E-54
score: 182.307
IPR001878Zinc finger, CCHC-typeSMARTSM00343c2hcfinal6coord: 342..358
e-value: 9.9E-5
score: 31.7
IPR001878Zinc finger, CCHC-typePFAMPF00098zf-CCHCcoord: 342..358
e-value: 2.8E-5
score: 24.0
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 342..358
score: 9.834029
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 153..247
e-value: 9.7E-20
score: 70.7
IPR041588Integrase zinc-binding domainPFAMPF17921Integrase_H2C2coord: 1063..1119
e-value: 1.6E-15
score: 56.9
coord: 1706..1762
e-value: 2.1E-15
score: 56.5
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 715..791
e-value: 3.8E-34
score: 119.8
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 802..888
e-value: 1.2E-30
score: 107.4
coord: 1445..1531
e-value: 1.2E-30
score: 107.4
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 406..539
e-value: 2.4E-11
score: 45.4
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 419..512
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 1771..1996
e-value: 4.3E-50
score: 171.8
coord: 1128..1237
e-value: 3.1E-20
score: 74.3
IPR041373Reverse transcriptase, RNase H-like domainPFAMPF17917RT_RNaseHcoord: 882..980
e-value: 7.9E-31
score: 106.5
coord: 1525..1623
e-value: 7.9E-31
score: 106.5
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 694..792
e-value: 4.2E-13
score: 49.3
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 1131..1227
score: 9.210668
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 1774..1934
score: 18.870842
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 1435..1630
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 642..987
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 1771..1932
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 1128..1227
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 332..359

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh11G012300CmoCh11G012300gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh11G012300.1:exon:10359CmoCh11G012300.1:exon:10359exon
CmoCh11G012300.1:exon:10360CmoCh11G012300.1:exon:10360exon
CmoCh11G012300.1:exon:10361CmoCh11G012300.1:exon:10361exon
CmoCh11G012300.1:exon:10362CmoCh11G012300.1:exon:10362exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh11G012300.1:cdsCmoCh11G012300.1:cdsCDS
CmoCh11G012300.1:cdsCmoCh11G012300.1:cds_2CDS
CmoCh11G012300.1:cdsCmoCh11G012300.1:cds_3CDS
CmoCh11G012300.1:cdsCmoCh11G012300.1:cds_4CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh11G012300.1CmoCh11G012300.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding