CmaCh03G007770 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh03G007770
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: Cma_Chr03: 6040188 .. 6048925 (+)
RNA-Seq Expression: CmaCh03G007770
Synteny: CmaCh03G007770
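The location field above can be turned into a chromosome, a coordinate pair, a strand, and a span length. The following is a minimal sketch; the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is common for genome browser records but is not stated on this page.

```python
import re

# Hypothetical helper: split a location string of the form
# "Cma_Chr03: 6040188 .. 6048925 (+)" into its parts.
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")

def parse_location(text):
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both end points count.
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

chrom, start, end, strand = parse_location("Cma_Chr03: 6040188 .. 6048925 (+)")
print(chrom, strand, span_length(start, end))  # Cma_Chr03 + 8738
```

Under that assumption the gene spans 8738 bp on the forward strand of Cma_Chr03.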
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCGAATCTTGTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCCTCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACTATGGTTCCACCACCTCGCTTTGAACCAGAAACCTCCTCAACATTCAACCCCAAATATTTGGCATGGAGAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCTCTACTGCACGTGATGTTTGGCTTGCGTTGGAAACTACGTACAGCCATCAGTCAAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACAAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAAAATTTGTGACCAACTTCATGCCATTGGCAGACCCGTCGAGGACATTGATAAAGTGCATTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGATGGCTCTCACCCCTATCCCCTGTTTTGCAGATCTAGTCTCTAAAACTGAAAGTTTTGAGTTGTTCCAGCGCTCCCTTGAGTCCTCTGACTCCACTCCTACAGCATTCATAGCCACTAATCGTGGCCGCACCCATGAAAGTCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAACAACTCTTCTAATCGAGGACGAACCCACTCAAGTCAGGGTCGTCGACCACCTCATTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTGCTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATGCTGCTGATTGGTTTTTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCAATTCTGGATCAGTCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACATCGGTACTCTTTCTCCTGTTCCAAATATTCACTTATTAGATGTCTTGGTTGTCCCTCACCTCACTAAAAATCTTCTTTCCATAAGTAAATTAACGTCTGATTTTCCTCTCTCCGTTACATTTACTAATAATCTTATTACTATCCAGAATCATCAAACAGGAAGGGTGGTGGCAACCGGTAAAAGAGATGGAGGGCTATATGTGCTGGAGCGCGGCAACTCTGCTTTTATTTCAGCCCTTAAAAACAAATCTTTACGTGCTTCATATGATTTATGGCATGCTCGTCTGGGTCATGTGAATCATTCTGTTATTTCTTTTTTAAATAAAAAAGGTCATCTTTCTCTTACGTCTTTATTGCCTTCTCCATCATTATGTAATACTTGTCAGCTTGCGAAAAGCCATCGATTGCCTTATTCCCGCAATGAACGTAGGTCGTCTCATGTGTTAGATCTTATTCATTGTGATCTTTGGGGTCCTTCTCCCGTCAAATCAAATTCGGGTTTTCTTTATTATGCTATTTTTATTGATGATTATTCTCGATTCACTTGGTTTTACCCTTTAAAATTTAAATCTGATTTTTTTGATATTTTTCTTCAATTTCAAAAATTTGTGGAAAATCAATATTCTTCTCGTATCAAGGTATTTCAAAGCGATGGTGGTACCGAATTTACTAGTACTTGTTTCAAAACTCATTTACGTAATTCTGGCATCCACCATCAACTCTCTTGTCCATATACACCTGCTCAAAATGGTCGTGTTGAGAGAAAACATCGTCATGTGACTGAGACTGGCTTGGCCCTTCTCTTTCACTCTCATCTTTCTCCTCGTTTTTGGGTTGACGCCTTCAGCACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAA
TTTTCATCCCTTTGGTTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGTTCATAAAGGGTTCCGCTGTCTTGATCCCGCCACCACTAAGCTATATATCACCTGCCATGCTCAATTTGATGAAACTCACTTTCCTGCTATCCCTAGCTCCCAAGCCCAACCTCTTTCCACTATTCCTATTTCAAATTTCTTGGAACCACATCTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCACATTCCTCAATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGTCACCCTCGACTTCTAATTCAACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCTATGATCACACGCGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATGTTGGGCTCATCTGGACTTCTCTCTGCTCTTCTTGCATCCACTGAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCCGATGGATCCGTCGAGCGTTTCAAGGCTCGTCTCGTTGCCAAAGGTTATACTCAGGTTCCTGGTCTTGACTACACTGACACTTTCAGTCCAGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAATGCTTTTCTCAATGGAACGCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATGTTGATCCTCGATTTCCAAAGCATGTTTGTCTATTAAAGAAAGCTCTCTATGGCTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGTAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAATCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTAACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATCAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATCACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTATCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCATTTCTTGGAGTGCCAAAAAGCAACCTACTGTATCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTTGCCACAACTGCTGCTGAACTTCTTTGGGTTACGCATATTTTGCATGACCTCAAGGTCCCTATTTCACAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTTGAGCTCTAATCCTGTTTCTCACAAGCGGGCCAAGCATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGTACACAATATGTACCCT
CTCATCTCCAAGTTGCTGACATCTTCACAAAGAGTATTTCTCGACCTCTCTTTGAATTTTTCAGATCCAAGCTTCATGTTCGTTCAAATCCGACGCTCAGCTTGCGGGGGGGTGTTAAGGATAGTTGACCGTTCATTAAGGCAAATATCTAGATATTTGCCTTACATTACGGGTTACCATATTTTCTCTTTTACTAAATGCAAATATATTGTATTCATTTATTATTCTTTCCTTTTATCGCTATTTCCTTATTATTTGTAATTTACCTTATTTCTGTATTCTGTAAATGATTATAAATAAGAGAACTCTCACCACCATTTGGGTGTGGTGGATTAATCAAACATTTACAAACATGTTTCAGGTTTCCGCGTTATGATTCCCAGCAAATGAATCTACTTATAATCCATCATCAAATTGTAGATAGAAGGCACTAGTCTGTAGATTGAGGTATAAATCATTAAAGATTACATTAGATGATTATGGGTTGAAAAAGTGAGCACTTCTTCCGAGCTTGCTGAAATCTTATCACACGAATTAATTTTCACCATATGGTTGGTTAACCAGCTCTGTTTTCTGAGTATTTTGAGCTCGCCTCGTATGCACTCCCCAGGTGTGTCGTCAAGTCTCACAGCAGTTATATTTTTAAACAATCGAGTGTCGAAATTGGAACGATGTTTGTACCGTTCTTATTAAACATTGGAAAATGTCAATATATTATGCAGAATGCTGCTGCATAGAGTTATTTGCCTCCCCTCGTTAATGTTCATATGTAGATGCAAGAGGTAATTCAATTCTGAGTCCCATTTCCGGGGCCCCCCTTTTTTTGTTGCTTATTTCATTATCTCGTCTTTCTTCGCACTACAATTGGGCCGCAAAAGGTACCATGAAGATCGAAACTTAAAATTTTTCGTTGTAAACTACAGTTACTTATAAAAGGTATATTCTTGACTTATTAGAAGAGCATCGTTTGCACTGTAATTTTTCCCACCGAAGTATATAGTATACATAGTGTATGAGTTATGCCACTTCGCACCACCGAAGAGAGTTGCTGACTGTAGACAGCCTTCTTCATGTCTGAAACATTTTATTTTATCCCATCCTTAGCACTTAACGTCTACTATGTTAACACCAAGACCTTCTCTTTATTCTTTCAAATACGAGGGCTCTATTTATTAGGCAGCCAGAAACGCCATCTTCGGCGGAGTTGCAGTTTCCACAGAAGGAAAGAAGTTTGAAGCTTGAAATTAACGAAAATGACTTAAGAGAGAGAGAGGCTTACCAAATTGAAGGGTTTAACGGAACCCACCACTAGTCCATTGGATTTCGCACGTTCGTGCCCAACCTGAGCGAAGTAGAAGAAGAAAATCAGAAAGATAGCAAATCTTAGGGATCTTATTGAGAGGAGAAAGTTTAGCTGTCAACTAGGATGGAGTAAGTAGCGTTTTATAAGATGAAGCAACTTCTTCAGATCATGGGATTCAGAACTAAACAGGCACCGACAAGACAAGATCTTGAAAGCTTCTTCAGACAAAAATATTATACTGGTACGACTATTCTCTATTCTCATTGATACATGTGCTTTTATGAAATTGAGCTTGCTTTTGACCTGTTGCCTGATTCCTTGGCAGCGATGCTCGACCCTCAAGGGCATGGGTTGGCTTTCTTTGGTGCCTTTGTCGCATGCATCCATAAACAAAATGTGGGGAGCTGATATGCGTGGAAAGTTGTCATTAATTTGGTTTTATCCTCAAGATTCTCCAAGCTGTCACTTCTCTCTTGGTCTGGTTTCTGATCCACGCTCACGTTGCTACCCACTCGCTCACCCCCCACCGCCAGCCTACAGGTTCGAGCTTATTTTTTTTCATCTTTTTGCAATTTTATTTACTGATTAGGGAACTCAAATTAGCCATATAAGAGCATAATTGTGAGTAGGCAGTGGCCATTTTCTGTCATATGAATCCCACAGTGCAGATCTTATGTGGAAAATCACGAACAATATA
ACAAAATAATTTGGTCAACATGCTAGGAAAGTCCCATGAGATCCCATTTTCTGTCATTAAAATCACGTAGTGCTGATTTTATGTAGAAATTCATGAACATTTCAACAGAATGATTTTGTCAATATATTAGGAGGGCTCAATGTGATCGTTTCTATGCTTTAAAAGGAATCATTTTCTTTATGTGTGAGTGCGAGTTAGGTAATGTGCTATCATAGTTTTATTTATCTGTTTATTTTAAGGGAAGAAATCTCAAATCCATAGAATATTGCAAAGTGTGTTATAAAATGTATTTAAATATAGGTTGGGTTTGTTAACTAAATCATGAAATTGTATTTCATTTGGCTCATTAGCTATTCAGAAGCTTGTTTATATTATTCTGACATGACATTGAAATTCCCACTGCTTTAAATTATTGATGTTTTTTGCTTTAAGGCATTTAGTCACGTTCTCGGAATTCTATTAGACCTTGGATAGAGCTTTTCCAAAACTCGGTCTACCTCTTCATTATTAATCACATTATGTTAGGGTGCAGAAACCTTAGTCTTTGTTTGAACAGCCAAAAAGGTTCCCAATATCAAATGTCTAAATAGAATTATGATTTCCTTTTGCTTTGAATTTGAGATAGATATGCTCTTAGCCTGTAACGGAGGAATCAAACTTGTAATATTGATTATATTTTGCTTGAGAAACCAGACCTTCCAACTATGGCTTGGGCCATGCTTACTGTGGAAGAAAAAAAAAAAAAAAAAAAAACTCTTTCGAAATTTTGAATCATTGTTTAGAGGCTTATTTATTGTGAGATCCCATACCGGTTGGAGAAGTGAAGGAAACATTCTTTATAAAAGTATGGAAATCTCTCTCTAGCAGACACGTTTGAAGTTAGACAATATTTACTAGCAGTGGGCTTGAGCTGTTATAAATAGTATAAGAGGCAGTTACCAAGCAGTATGCCAGCGAGGATGCTGGGCTTCCAATACACGTTTTAAAACTGTGAGGTTGATGACGATACGTAACGGGTCAAAACAAACAATATCTACTAACAGTGAACTTAGACTATTACATTTTTTCCTCCTTTATCCAATCTATTTTCAGCTTCAACCCGGTCTATATCAAGAGTTTATGATATAAATTTGTGTATTGGAAAAAAGAGGTGGGAAGCGGGGGTGGGGGCACCTTTTTCCCCCTCAAAAAGGCACAGAGTTGCCACTTGCTGCGCAATGGTGGATGGTTATGTGTGTGTCCAAGTGAGAATATGGCCAGCGATAAGGCCTAAACTGTCAGGTACTCTAGCTGTTACATTTTCACAAAGGCCAAACTGGCAATGCAAATGATATAAATTTTTAGTGTTTTTATGTGCATATTTCTTCAAAAAATTATTGGTGTTTTTAGGAAATATGTTTAAATGTTTGAAAAGTATTAGGAAGTTCTTTTTTGAGAAAAACATATATTTTATATTTATTGTTAAATGCTGGGAATTTTGTATTGTGATGCTCAAGATTTTAAATAGAGTGTGTCTCTACTGTGAGTAACTTGCAAATTAAACTTATACATAGTTATTTACATATTCTTAAAATGAAAGTGGAAAACAATATCTGTTTCCTTAAAAACATATTTTGTTTATGGATAATGATAGAAAATATTGCTAATTGGCGTATAAAATAATTTTGTGGATTGGTCAATGATGCATAATAATAATAATAATAATAATAATAATAAAATTGTCATCAGAACAAACAATTCCTTCATTTTTCTCCAATATTTAAAAAATAATAATAGTTATAAAAAAAAAAAAGACAGCTTCTTGCTTAAAAGTTGTCATTTCTAGCTATCTTATTATTTAATTTATTTCAATTCCACGTCTCCTTTGGCATTTTTTTGGTAAAATAAAAATATTTATTTGAGATAGTCAACAACAAACGTGAAAATTTAAAAAATTAGGAATGAAAAGTCATAGTACTATATTGTTCTAAAATGAAAAATTAAATCTTTCACCTTATTAA
TATTAACATAAATCTATATAGTTTTTTATAATAAAATTTATACTGATCTTCATATTTTAAATTCATTTTATTTTAGTCTTTATACGATCAAATTTAGTCATTATATTTTTAATAAATTTTAAAATCAAATTTTAAGATTATTTTGAATTTATACTTTTATTAAACTTGGTAAGGTACCTAAGTAAAAACACATTGAGGATATGGTAAAAAATTTATTTATATATATACTTTTAAAAATTTAATAATAATAATAATAATAATCAAATAAATTTATTGAAATTCTGAAAATTAAATTTAAAATAAGACAAGTAATAATTTGATGTCCATAATTTACAAACCAACAAATAAAGTTATAATTCACTTATTTTGGTAAAATTTGAACTTCAACGTTCAGGAGTAGAAACAAGGATACAGAAAGCAGAGGAATTTGTTGGCTGTTATTTGAACCAAAATAGAAAAAAAATAAATTAAATAAATAAAGAGAATATATATTGTTAAAAAAAATTGCTTAGCTGTTATCCACTCCATTACGTGAAAACCTCCATTATTATCTGCGTAACTGAAGGGAAGGAAAAAAAAAAAGAGCACAGTCACACAGCAGCTCAACGCCAGCTGATTCCTCCTCCTTCTGCAGCCGTCATCTTCTCCATCCATCAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAGAAGCCGTCTTCGAGTTTCACTGTACTTTTTGA

mRNA sequence

ATGGCTTCCGAATCTTGTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCCTCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACTATGGTTCCACCACCTCGCTTTGAACCAGAAACCTCCTCAACATTCAACCCCAAATATTTGGCATGGAGAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCTCTACTGCACGTGATGTTTGGCTTGCGTTGGAAACTACGTACAGCCATCAGTCAAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACAAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAAAATTTGTGACCAACTTCATGCCATTGGCAGACCCGTCGAGGACATTGATAAAGTGCATTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGATGGCTCTCACCCCTATCCCCTGTTTTGCAGATCTAGTCTCTAAAACTGAAAGTTTTGAGTTGTTCCAGCGCTCCCTTGAGTCCTCTGACTCCACTCCTACAGCATTCATAGCCACTAATCGTGGCCGCACCCATGAAAGTCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAACAACTCTTCTAATCGAGGACGAACCCACTCAAGTCAGGGTCGTCGACCACCTCATTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTGCTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATGCTGCTGATTGGTTTTTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCAATTCTGGATCAGTCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACATCGAATCATCAAACAGGAAGGGTGGTGGCAACCGCACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTTGGTTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGTTCATAAAGGGTTCCGCTGTCTTGATCCCGCCACCACTAAGCTATATATCACCTGCCATGCTCAATTTGATGAAACTCACTTTCCTGCTATCCCTAGCTCCCAAGCCCAACCTCTTTCCACTATTCCTATTTCAAATTTCTTGGAACCACATCTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCACATTCCTCAATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGTCACCCTCGACTTCTAATTCAACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCTATGATCACACGCGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATGTTGGGCTCATCTGGACTTCTCTCTGCTCTTCTTGCATCCACTGAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCCGATGGATCCGTCGAGCGTTTCAAGGCTCGTCTCGTTGCCAAAGGTTATACTCAGGTTCCTGGTCTTGACTACACTGACACTTT
CAGTCCAGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAATGCTTTTCTCAATGGAACGCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATGTTGATCCTCGATTTCCAAAGCATGTTTGTCTATTAAAGAAAGCTCTCTATGGCTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGTAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAATCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTAACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATCAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATCACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTATCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCATTTCTTGGAGTGCCAAAAAGCAACCTACTGTATCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTTGCCACAACTGCTGCTGAACTTCTTTGGGTTACGCATATTTTGCATGACCTCAAGGTCCCTATTTCACAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTTGAGCTCTAATCCTGTTTCTCACAAGCGGGCCAAGCATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGTACACAATATGTACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGACTCTGTTTTCTGAGTATTTTGAGCTCGCCTCGTATGCACTCCCCAGATCATGGGATTCAGAACTAAACAGGCACCGACAAGACAAGATCTTGAAAGCTTCTTCAGACAAAAATATTATACTGCGATGCTCGACCCTCAAGGGCATGGGTTGGCTTTCTTTGGTGCCTTTGTCGCATGCATCCATAAACAAAATGTGGGGAGCTGATATGCGTGGAAAGTTGTCATTAATTTGGTTTTATCCTCAAGATTCTCCAAGCTGTCACTTCTCTCTTGGTCTGGTTTCTGATCCACGCTCACGTTGCTACCCACTCGCTCACCCCCCACCGCCAGCCTACAGTCACACAGCAGCTCAACGCCAGCTGATTCCTCCTCCTTCTGCAGCCGTCATCTTCTCCATCCATCAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAGAAGCCGTCTTCGAGTTTCACTGTACTTTTTGA

Coding sequence (CDS)

ATGGCTTCCGAATCTTGTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCCTCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACTATGGTTCCACCACCTCGCTTTGAACCAGAAACCTCCTCAACATTCAACCCCAAATATTTGGCATGGAGAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCTCTACTGCACGTGATGTTTGGCTTGCGTTGGAAACTACGTACAGCCATCAGTCAAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACAAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAAAATTTGTGACCAACTTCATGCCATTGGCAGACCCGTCGAGGACATTGATAAAGTGCATTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGATGGCTCTCACCCCTATCCCCTGTTTTGCAGATCTAGTCTCTAAAACTGAAAGTTTTGAGTTGTTCCAGCGCTCCCTTGAGTCCTCTGACTCCACTCCTACAGCATTCATAGCCACTAATCGTGGCCGCACCCATGAAAGTCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAACAACTCTTCTAATCGAGGACGAACCCACTCAAGTCAGGGTCGTCGACCACCTCATTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTGCTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATGCTGCTGATTGGTTTTTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCAATTCTGGATCAGTCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACATCGAATCATCAAACAGGAAGGGTGGTGGCAACCGCACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTTGGTTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGTTCATAAAGGGTTCCGCTGTCTTGATCCCGCCACCACTAAGCTATATATCACCTGCCATGCTCAATTTGATGAAACTCACTTTCCTGCTATCCCTAGCTCCCAAGCCCAACCTCTTTCCACTATTCCTATTTCAAATTTCTTGGAACCACATCTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCACATTCCTCAATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGTCACCCTCGACTTCTAATTCAACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCTATGATCACACGCGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATGTTGGGCTCATCTGGACTTCTCTCTGCTCTTCTTGCATCCACTGAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCCGATGGATCCGTCGAGCGTTTCAAGGCTCGTCTCGTTGCCAAAGGTTATACTCAGGTTCCTGGTCTTGACTACACTGACACTTT
CAGTCCAGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAATGCTTTTCTCAATGGAACGCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATGTTGATCCTCGATTTCCAAAGCATGTTTGTCTATTAAAGAAAGCTCTCTATGGCTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGTAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAATCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTAACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATCAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATCACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTATCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCATTTCTTGGAGTGCCAAAAAGCAACCTACTGTATCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTTGCCACAACTGCTGCTGAACTTCTTTGGGTTACGCATATTTTGCATGACCTCAAGGTCCCTATTTCACAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTTGAGCTCTAATCCTGTTTCTCACAAGCGGGCCAAGCATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGTACACAATATGTACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGACTCTGTTTTCTGAGTATTTTGAGCTCGCCTCGTATGCACTCCCCAGATCATGGGATTCAGAACTAAACAGGCACCGACAAGACAAGATCTTGAAAGCTTCTTCAGACAAAAATATTATACTGCGATGCTCGACCCTCAAGGGCATGGGTTGGCTTTCTTTGGTGCCTTTGTCGCATGCATCCATAAACAAAATGTGGGGAGCTGATATGCGTGGAAAGTTGTCATTAATTTGGTTTTATCCTCAAGATTCTCCAAGCTGTCACTTCTCTCTTGGTCTGGTTTCTGATCCACGCTCACGTTGCTACCCACTCGCTCACCCCCCACCGCCAGCCTACAGTCACACAGCAGCTCAACGCCAGCTGATTCCTCCTCCTTCTGCAGCCGTCATCTTCTCCATCCATCAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAGAAGCCGTCTTCGAGTTTCACTGTACTTTTTGA

Protein sequence

MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIESSNRKGGGNRTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFELASYALPRSWDSELNRHRQDKILKASSDKNIILRCSTLKGMGWLSLVPLSHASINKMWGADMRGKLSLIWFYPQDSPSCHFSLGLVSDPRSRCYPLAHPPPPAYSHTAAQRQLIPPPSAAVIFSIHQEEEEEEEEEEEEEEEEEAVFEFHCTF
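The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first and last few codons of the CDS listed in this record, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the `stop_at_stop` parameter are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_stop=True):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTTCCGAATCT"))   # MASES (start of the CDS above)
print(translate("CACTGTACTTTTTGA"))   # HCTF  (end of the CDS, stop codon TGA)
```

Both fragments agree with the first and last residues of the protein sequence shown above.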
Homology
BLAST of CmaCh03G007770 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 2.7e-177
Identity = 438/1438 (30.46%), Positives = 642/1438 (44.65%), Query Frame = 0
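The percentages in an HSP summary line are the ratio of matching columns to the alignment length. As a small sketch, using the counts reported for HSP 1 of the Q94HW2 match (the function name `percent` is illustrative):

```python
def percent(count, alignment_length):
    """Share of alignment columns, as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the HSP summary line above (alignment length 1438).
identity = percent(438, 1438)   # identical residues
positives = percent(642, 1438)  # identical or conservatively substituted residues

print(f"{identity:.2f} {positives:.2f}")  # 30.46 44.65
```

The alignment length includes gap columns, which is why long gapped regions in the alignments below lower both percentages.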

Query: 22   KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TMVPPPRFEPETSSTFNPKYLAWRAADQRL 81
            KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     + +   NP Y  W+  D+ +
Sbjct: 25   KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLI 84

Query: 82   LCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAE 141
               +L +++      V   +TA  +W  L   Y++ S     +L+  L+   +GTK + +
Sbjct: 85   YSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDD 144

Query: 142  YARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE 201
            Y +      DQL  +G+P++  ++V   L  L  E+        A    P   ++  +  
Sbjct: 145  YMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLL 204

Query: 202  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSN-------RG 261
            + E    ++ S+   P    A +   T  ++  +  N+  R Y ++NN++N         
Sbjct: 205  NHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNR-YDNRNNNNNSKPWQQSST 264

Query: 262  RTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSIA 321
              H +  +  P+   CQIC  +GH A RC+Q      S ++    +  T      + ++ 
Sbjct: 265  NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRANLALG 324

Query: 322  GP-DAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----------- 381
             P  + +W LD+GA+ H+T+D + L   + YTG D V+V +G+++PI+H           
Sbjct: 325  SPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSR 384

Query: 382  ------------------------------------------------------------ 441
                                                                        
Sbjct: 385  PLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYE 444

Query: 442  ------------------------------------------------------------ 501
                                                                        
Sbjct: 445  WPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDC 504

Query: 502  -IESSNR----------------------------------------------------- 561
             I  SN+                                                     
Sbjct: 505  LINKSNKVPFSQSTINSTRPLEYIYSDVWSSPILSHDNYRYYVIFVDHFTRYTWLYPLKQ 564

Query: 562  ----------------------------KGGGN--------------------------- 621
                                          GG                            
Sbjct: 565  KSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNG 624

Query: 622  -------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTP 681
                                             A Y+INRLPTPLL  +SPF+ L+G +P
Sbjct: 625  LSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSP 684

Query: 682  HYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQ 741
            +YD    FGC  YP+LR Y  +KL  +S  C+FLGYS     + CL   T++LYI+ H +
Sbjct: 685  NYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVR 744

Query: 742  FDETHFP-----------------------------------------------AIPSSQ 801
            FDE  FP                                                 PSS 
Sbjct: 745  FDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSP 804

Query: 802  AQPL--STIPISNFLEPHLHHIDSSP-PTTSSPHIPQ------------------SSSSP 861
            + P   S +  SN          SSP PT    + PQ                  S ++P
Sbjct: 805  SAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNP 864

Query: 862  CDICSDLVDESVQVDTSLAGSTLSPSTSNSTS----------IEPPVDFSS--------- 921
             +     + +S+      + S+ SP+TS S+S          I PP   +          
Sbjct: 865  TNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAP 924

Query: 922  LGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEE 981
            L TH M TRAKAGI K     +L +        +L A +EP+    A K+  W  AM  E
Sbjct: 925  LNTHSMGTRAKAGIIKPNPKYSLAV--------SLAAESEPRTAIQALKDERWRNAMGSE 984

Query: 982  IRALQQNDTWTLV-PRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYT 1041
            I A   N TW LV P P++  IVG +W+F  KY  DGS+ R+KARLVAKGY Q PGLDY 
Sbjct: 985  INAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYA 1044

Query: 1042 DTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKH 1082
            +TFSPV+K+T++R+VL +AV   WP+RQLDV NAFL GTL + V+M QPPG++D   P +
Sbjct: 1045 ETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNY 1104

BLAST of CmaCh03G007770 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 9.5e-170
Identity = 443/1443 (30.70%), Positives = 633/1443 (43.87%), Query Frame = 0

Query: 22   KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TMVPPPRFEPETSSTFNPKYLAWRAADQRL 81
            KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     +     NP Y  WR  D+ +
Sbjct: 25   KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWRRQDKLI 84

Query: 82   LCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAE 141
               +L +++      V   +TA  +W  L   Y++ S     +L+               
Sbjct: 85   YSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR--------------- 144

Query: 142  YARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE 201
                F    DQL  +G+P++  ++V   L  L  ++        A    P   ++  +  
Sbjct: 145  ----FITRFDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLI 204

Query: 202  SFELFQRSLESSDSTP-TAFIATNRGRTHESHPASFTNQRG--RSYSHKNNSSNRGRTHS 261
            + E    +L S++  P TA + T+R      +     N RG  R+Y++ NN SN  +  S
Sbjct: 205  NRESKLLALNSAEVVPITANVVTHRNTNTNRN----QNNRGDNRNYNNNNNRSNSWQPSS 264

Query: 262  SQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI 321
            S  R   R P      CQIC  +GH A RC Q +    +++   + +  T      + ++
Sbjct: 265  SGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAV 324

Query: 322  AGP-DAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------- 381
              P +A +W LD+GA+ H+T+D + L   + YTG D V++ +G+++PITH          
Sbjct: 325  NSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSS 384

Query: 382  ------------------------------------------------------------ 441
                                                                        
Sbjct: 385  RSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELY 444

Query: 442  ------------------------------------------------------------ 501
                                                                        
Sbjct: 445  EWPIASSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSD 504

Query: 502  --IESSNRKGGGNRT--------------------------------------------- 561
              I  S++    N T                                             
Sbjct: 505  CFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPILSIDNYRYYVIFVDHFTRYTWLYPLK 564

Query: 562  ------------------------------------------------------------ 621
                                                                        
Sbjct: 565  QKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHN 624

Query: 622  ----------------------------------AAYIINRLPTPLLGGKSPFELLYGYT 681
                                              A Y+INRLPTPLL  +SPF+ L+G  
Sbjct: 625  GLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQP 684

Query: 682  PHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHA 741
            P+Y+    FGC  YP+LR Y  +KL  +S  C F+GYS     + CL   T +LY + H 
Sbjct: 685  PNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHV 744

Query: 742  QFDETHFP------AIPSSQAQ------------PLSTIPISNFLEPHL-HHIDSSP--- 801
            QFDE  FP       + +SQ Q             L T P+     P L  H+D+SP   
Sbjct: 745  QFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPP 804

Query: 802  ----------------------------PT---------TSSPHIPQSSSSPCDICSDLV 861
                                        PT         T+ PH  Q+S+S   I ++  
Sbjct: 805  SSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPN 864

Query: 862  DESVQVD----------------------TSLAGSTLSPSTSNSTSIEPPV--------- 921
              S   +                      TS++      S+S ST   PPV         
Sbjct: 865  PNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQV 924

Query: 922  -DFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVA 981
               + + TH M TRAK GI K     +          ++L A++EP+    A K+  W  
Sbjct: 925  NAQAPVNTHSMATRAKDGIRKPNQKYSYA--------TSLAANSEPRTAIQAMKDDRWRQ 984

Query: 982  AMDEEIRALQQNDTWTLV-PRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVP 1041
            AM  EI A   N TW LV P P +  IVG +W+F  K+  DGS+ R+KARLVAKGY Q P
Sbjct: 985  AMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRP 1044

Query: 1042 GLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDP 1082
            GLDY +TFSPV+K+T++R+VL +AV   WP+RQLDV NAFL GTL + V+M QPPG+VD 
Sbjct: 1045 GLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDK 1104

BLAST of CmaCh03G007770 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 369.8 bits (948), Expect = 1.2e-100
Identity = 247/741 (33.33%), Positives = 380/741 (51.28%), Query Frame = 0

Query: 360  GGGNRTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPR 419
            G   +TA Y+INR P+  L  + P  +       Y +   FGCR + ++      KL  +
Sbjct: 611  GEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDK 670

Query: 420  SIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLE 479
            SIPCIF+GY     G+R  DP   K+  +    F E+      +  ++ +    I NF+ 
Sbjct: 671  SIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEV-RTAADMSEKVKNGIIPNFV- 730

Query: 480  PHLHHIDSSPPTTSSPHIPQSS----SSPCDICSDLVDESVQVDTSLAGSTLSPSTSNST 539
                   + P T+++P   +S+    S   +   +++++  Q+D  +             
Sbjct: 731  -------TIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGV------------E 790

Query: 540  SIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKN 599
             +E P         P+    +  +   R+P+   +L S           EP+  K    +
Sbjct: 791  EVEHPTQ-GEEQHQPLRRSERPRVESRRYPSTEYVLISD--------DREPESLKEVLSH 850

Query: 600  P---AWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVA 659
            P     + AM EE+ +LQ+N T+ LV  P     +  KWVF++K   D  + R+KARLV 
Sbjct: 851  PEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVV 910

Query: 660  KGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQ 719
            KG+ Q  G+D+ + FSPVVK T++R +LS+A +    + QLDVK AFL+G L E ++MEQ
Sbjct: 911  KGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQ 970

Query: 720  PPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSL-FVFHQQS 779
            P G+        VC L K+LYGLKQAPR W+ +F SF+ +  +  + +D  + F    ++
Sbjct: 971  PEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSEN 1030

Query: 780  NLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLE--ASPTPDGLFI 839
            N I LLLYVDD+++ G +  LI      L   F  KDLG     LG++     T   L++
Sbjct: 1031 NFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWL 1090

Query: 840  SQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFS-------DPTLYRSLVGALQY 899
            SQ KY   +L R  + ++KPV TP+     L+    P +           Y S VG+L Y
Sbjct: 1091 SQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMY 1150

Query: 900  -LTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAY 959
             +  TRPDIA+AV  VS+FL  P  +H+ AVK ILRY++GT    L F  S     L  Y
Sbjct: 1151 AMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGS--DPILKGY 1210

Query: 960  SDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVT 1019
            +DAD AG  D R+S++GY        ISW +K Q  V+ S+ E+EY A   T  E++W+ 
Sbjct: 1211 TDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLK 1270

Query: 1020 HILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYV 1079
              L +L +   ++ ++ CD++SAI LS N + H R KH+++ YH++RE+V    L+   +
Sbjct: 1271 RFLQELGLH-QKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKI 1318

Query: 1080 PSHLQVADIFTKTLFSEYFEL 1083
             ++   AD+ TK +    FEL
Sbjct: 1331 STNENPADMLTKVVPRNKFEL 1318

BLAST of CmaCh03G007770 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 356.7 bits (914), Expect = 1.1e-96
Identity = 239/781 (30.60%), Positives = 388/781 (49.68%), Query Frame = 0

Query: 365  TAAYIINRLPTPLL--GGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIP 424
            TA Y+INR+P+  L    K+P+E+ +   P+  +   FG  VY ++++    K   +S  
Sbjct: 616  TATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKN-KQGKFDDKSFK 675

Query: 425  CIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPI-------- 484
             IF+GY P   GF+  D    K  +      DET+   + +S+A    T+ +        
Sbjct: 676  SIFVGYEP--NGFKLWDAVNEKFIVARDVVVDETN---MVNSRAVKFETVFLKDSKESEN 735

Query: 485  SNFLEPHLHHIDSSPPTTS----------------SPHIPQSS-----------SSPCDI 544
             NF       I +  P  S                + + P  S           S  CD 
Sbjct: 736  KNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDN 795

Query: 545  CSDLVD------------ESVQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLG-------- 604
               L D            +  + D  L  S  S + + S   E       +G        
Sbjct: 796  IQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKND 855

Query: 605  --------THPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWV 664
                    +  + T+ +    +  +  N  +L +  + + +  S +   ++      +W 
Sbjct: 856  GIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRD--DKSSWE 915

Query: 665  AAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVP 724
             A++ E+ A + N+TWT+  RP N NIV S+WVF +KY   G+  R+KARLVA+G+TQ  
Sbjct: 916  EAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKY 975

Query: 725  GLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDP 784
             +DY +TF+PV + ++ R +LS+ +     + Q+DVK AFLNGTL E ++M  P G    
Sbjct: 976  QIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--S 1035

Query: 785  RFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ--SNLIYLL 844
                +VC L KA+YGLKQA R WF+ F   L    F  S  D  +++  +   +  IY+L
Sbjct: 1036 CNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVL 1095

Query: 845  LYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARD 904
            LYVDD+++   + + +N+F R L  +F   DL  + +F+G+      D +++SQ  Y + 
Sbjct: 1096 LYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKK 1155

Query: 905  ILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNS 964
            IL++  + +   V TP+    +     S     T  RSL+G L Y+ + TRPD+  AVN 
Sbjct: 1156 ILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNI 1215

Query: 965  VSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPS-TVPSTLVAYSDADWAGCPDTRRS 1024
            +S++     ++ +  +KR+LRY+KGT+   LIF+ +    + ++ Y D+DWAG    R+S
Sbjct: 1216 LSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKS 1275

Query: 1025 TSGYSIYLGN-NLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQ 1076
            T+GY   + + NLI W+ K+Q +V+ SS E+EY AL     E LW+  +L  + + +   
Sbjct: 1276 TTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENP 1335

BLAST of CmaCh03G007770 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 2.4e-64
Identity = 123/226 (54.42%), Positives = 165/226 (73.01%), Query Frame = 0

Query: 774  IYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLK 833
            +YLLLYVDDI++TG++++L+N    +L S F+ KDLG + YFLG++    P GLF+SQ K
Sbjct: 1    MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60

Query: 834  YARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYA 893
            YA  IL  A +LD KP+ TP+ +  + +   + + DP+ +RS+VGALQYLT+TRPDI+YA
Sbjct: 61   YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 120

Query: 894  VNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTR 953
            VN V Q +H PT   F  +KR+LRYVKGT+  GL    ++    + A+ D+DWAGC  TR
Sbjct: 121  VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNS-KLNVQAFCDSDWAGCTSTR 180

Query: 954  RSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLW 1000
            RST+G+  +LG N+ISWSAK+QPTVSRSS E+EYRALA TAAEL W
Sbjct: 181  RSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmaCh03G007770 vs. ExPASy TrEMBL
Match: A0A438EBA0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2065 PE=4 SV=1)

HSP 1 Score: 1749.9 bits (4531), Expect = 0.0e+00
Identity = 937/1358 (69.00%), Positives = 992/1358 (73.05%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT+VPPPRFEP
Sbjct: 17   MASESS-HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDLLGYVDGTLVPPPRFEP 76

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AADQRLLCLLLSSLTEEA+ VVVGLSTAR+VWLALE T+SH SKAR
Sbjct: 77   ETSTTLSTKYLAWKAADQRLLCLLLSSLTEEAIVVVVGLSTAREVWLALENTFSHHSKAR 136

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FS
Sbjct: 137  ELRLKDDLQLMKRGTKPVAEYARTFKTLCDQLHAIGRPVEDTDKVHWFLRGLGTDFSSFS 196

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
            TAQM+LTP+P FADLVSK ESFELFQRSLESS+ T  AF ATNR  T  SH   F    N
Sbjct: 197  TAQMSLTPLPYFADLVSKAESFELFQRSLESSEPTTAAFTATNRSHT-TSHGTPFAFRNN 256

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 257  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICRIEGHYADRCNQRYARTDSS-AHLAEAF 316

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITH     
Sbjct: 317  NTSCSLSGPEAADWFLDTGASAHMTTDPSILDQSKNYMGKDFVIVGNGASLPITHTGTLS 376

Query: 361  ---------------------------------------------------IESSNRKGG 420
                                                               + +  R GG
Sbjct: 377  PVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLFTVQNRQTGRVVATGKRDGG 436

Query: 421  ------GN---------------------------------------------------- 480
                  GN                                                    
Sbjct: 437  LYVLERGNSAFISVLKNKSLRASYDLWHARLGHVNYSVISFINKKGHLSLTSLLPSPSLC 496

Query: 481  ------------------------------------------------------------ 540
                                                                        
Sbjct: 497  STCQLAKNHRLPYSRNEHRSSHVLDLIHCDLWGPSPIKSNSGFLYYVIFIDDHSRFTWLY 556

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 557  PLKFKSDFFDIFLQFQKFVENQHSARIKVFQSDGGAEFTNTCFKAHLRTSGIHHQLSCPY 616

Query: 601  -------------------------------------RTAAYIINRLPTPLLGGKSPFEL 660
                                                  T  YIINRLPTPLLGGKSPFEL
Sbjct: 617  TPAQNGRAERKHRHVTETGLALLFHSHLSPRFWVDAFSTTTYIINRLPTPLLGGKSPFEL 676

Query: 661  LYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLY 720
            LYGY+PHY+NFHPFGC VYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LY
Sbjct: 677  LYGYSPHYENFHPFGCHVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKGFRCLDPTTSRLY 736

Query: 721  ITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSP 780
            IT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  SP  HIP+S+SSP
Sbjct: 737  ITRHAQFDETHFPTVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPPSPSSHIPRSNSSP 796

Query: 781  CDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTR 840
            C+ICSDLVDESVQVDTSLAGS+L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTR
Sbjct: 797  CNICSDLVDESVQVDTSLAGSSLPPLASSPHSIEHAADSSSSLGSHPMITRAKAGIFKTR 856

Query: 841  HPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPAN 900
            HPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LV RP N
Sbjct: 857  HPANLGVLGSSGLLSALLASTEPKGFKSAAKNPAWLAAMDEEVQALQQNGTWILVHRPVN 916

Query: 901  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA 960
            TNIVGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+A
Sbjct: 917  TNIVGSKWVFRTKYFPDGSVERLKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSLA 976

Query: 961  VTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWF 1020
            VTNKWPLRQLDV NAFLNGTL E V+MEQPPGY+DPRFP HVCLLKKALYGLKQAPRAWF
Sbjct: 977  VTNKWPLRQLDVNNAFLNGTLTEHVYMEQPPGYIDPRFPTHVCLLKKALYGLKQAPRAWF 1036

Query: 1021 QRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSE 1080
            QRFSSF LTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN SL++SFTRKLHS+
Sbjct: 1037 QRFSSFFLTLGFSCSRADTSLFVFHQQSSLIYLLLYVDDIIVTGNNPSLLDSFTRKLHSK 1096

Query: 1081 FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTAD 1082
            FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  
Sbjct: 1097 FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAHLLDSKPVHTPMVVSQHLTVA 1156

BLAST of CmaCh03G007770 vs. ExPASy TrEMBL
Match: A0A438E275 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_3495 PE=3 SV=1)

HSP 1 Score: 1746.9 bits (4523), Expect = 0.0e+00
Identity = 916/1198 (76.46%), Positives = 968/1198 (80.80%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT+VPPPRFEP
Sbjct: 118  MASESS-HLLPFNTLIHMITIKLSSSNYLLWKSQLLPFLESQDLLAYVDGTLVPPPRFEP 177

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGLSTAR+VWLALE T+SH SKAR
Sbjct: 178  ETSTTLSTKYLAWKAANQRLLCLLLSSLTEEAIAVVVGLSTAREVWLALENTFSHHSKAR 237

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL           
Sbjct: 238  ELRLKDDLQLMKRGTKPVAEYARTFKTLCDQLHAIGRPVEDTDKVHWFLH---------- 297

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
                          LVSK ESFELFQRSLESS+ T  AF ATNR RT  SH   F    N
Sbjct: 298  --------------LVSKAESFELFQRSLESSEPTTAAFTATNRSRT-TSHGTPFAFRNN 357

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 358  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICCIEGHYADRCNQRYARTDSS-AHLAEAF 417

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH     
Sbjct: 418  NTSCSLSGPEAADWFLDTRASAHMTTDPSILDQSKNYMGKDSVIVGNGASLPITHTGTLS 477

Query: 361  ----------------IESSNRKGG----------------------------------- 420
                            + +  R GG                                   
Sbjct: 478  PVPNIHLLDNRQTGRVVATGKRDGGLYVLERSNSAFIYVLKNKSLRASYDLWHARLAHLR 537

Query: 421  -----------------------------------------------GNRTAAYIINRLP 480
                                                              TA YIIN LP
Sbjct: 538  TSGIHHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSPRFWVDAFSTATYIINWLP 597

Query: 481  TPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKG 540
            TPLLGGKSPFELLY Y+PHY+NFHPFGCRVYP LRDYM NKLSPRSIPCIFLGYSP HKG
Sbjct: 598  TPLLGGKSPFELLYDYSPHYENFHPFGCRVYPCLRDYMSNKLSPRSIPCIFLGYSPSHKG 657

Query: 541  FRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS 600
            FRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Sbjct: 658  FRCLDPTTSRLYITRHAQFDETHFPTVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPPS 717

Query: 601  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPM 660
            P  HIP+S+SSPC+ICSDLVDESVQVDTSLAG +L P  S+  SIE   D  SSLG+HPM
Sbjct: 718  PSSHIPRSNSSPCNICSDLVDESVQVDTSLAGCSLPPLASSPHSIEHAADSSSSLGSHPM 777

Query: 661  ITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQ 720
            ITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQ
Sbjct: 778  ITRAKAGIFKTRHPANLGVLGSSGLLFALLASTEPKGFKSAAKNPAWLAAMDEEVQALQQ 837

Query: 721  NDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVV 780
            N TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVV
Sbjct: 838  NGTWILVPRPVNTNIVGSKWVFRTKYLPDGSVERLKARLVAKGYTHVPGLDYTDIFSPVV 897

Query: 781  KATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKA 840
            KATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGY+D RFP HVCLLKKA
Sbjct: 898  KATTVRVVLSLAVTNKWPLRQLDVKNAFLNGTLTEHVYMEQPPGYIDHRFPTHVCLLKKA 957

Query: 841  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS 900
            LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Sbjct: 958  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSSLIYLLLYVDDIIVTGNNPS 1017

Query: 901  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 960
            L++SFTRKLHSEFATKDLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
Sbjct: 1018 LLDSFTRKLHSEFATKDLGSLNYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 1077

Query: 961  TPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLA 1020
            TPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLA
Sbjct: 1078 TPMVVSQHLTVAGSPFSNPTLYRSLVGALQYLTITRPDIAHAVNSVSQFLHAPTIDHFLA 1137

Query: 1021 VKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWS 1080
            VKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWS
Sbjct: 1138 VKRILRYVKGTLHFGLTFRPSTIPSALVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWS 1197

Query: 1081 AKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNP 1088
            AKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVPI QQPLLLCDNKSAIF SSNP
Sbjct: 1198 AKKQPTVSRSSCESEYRALAMTAAELLWLTHLLHDLKVPIPQQPLLLCDNKSAIFFSSNP 1257

BLAST of CmaCh03G007770 vs. ExPASy TrEMBL
Match: A0A438E763 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_452 PE=4 SV=1)

HSP 1 Score: 1703.7 bits (4411), Expect = 0.0e+00
Identity = 880/1124 (78.29%), Positives = 934/1124 (83.10%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMI IKLSSSNYLLWKSQLLPLLESQD+L YVDGT+VPPPRFEP
Sbjct: 1    MASESS-HLLPFNTLIHMINIKLSSSNYLLWKSQLLPLLESQDLLAYVDGTLVPPPRFEP 60

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AADQRLLCLLLSSLTEEA+AVVVGLSTAR+VWLALE T+SH SKAR
Sbjct: 61   ETSTTLSTKYLAWKAADQRLLCLLLSSLTEEAIAVVVGLSTAREVWLALENTFSHHSKAR 120

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL GLG +FS+FS
Sbjct: 121  ELRLKDDLQLMKRGTKPVAEYARTFKTLCDQLHAIGRPVEDTDKVHWFLHGLGPDFSSFS 180

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
            T QM+LTP+P FADLVSK ESFELFQRSLESS+ T  AF  TNR RT  SH   F    N
Sbjct: 181  TPQMSLTPLPYFADLVSKAESFELFQRSLESSEPTTAAFTTTNRSRT-TSHGTPFAFRNN 240

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 241  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICRIEGHYADRCNQRYARTDSS-AHLAEAF 300

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITH     
Sbjct: 301  NTSCSLSGPEAADWFLDTGASAHMTTDPSNLDQSKNYMGKDSVIVGNGASLPITHTGTLS 360

Query: 361  ----------------IESSNRKGG------GN--------------------------- 420
                            + +  R GG      GN                           
Sbjct: 361  PVPNIHLLDNRQTGRMVATGKRDGGLYVLERGNSAFISVLKNKSLRASYDLWHARLAHLR 420

Query: 421  -------------------------------------------------RTAAYIINRLP 480
                                                              TA YIINRLP
Sbjct: 421  TSGIHHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSPRFWVDAFSTATYIINRLP 480

Query: 481  TPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKG 540
            TPLLGGKSPFELLYG++PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKG
Sbjct: 481  TPLLGGKSPFELLYGHSPHYENFHPFGCRVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKG 540

Query: 541  FRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS 600
            FRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Sbjct: 541  FRCLDPTTSRLYITRHAQFDETHFPTVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPPS 600

Query: 601  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPM 660
            P  HIP+S+SSPC+ICSDLVDESV+VDTSLAGS+L P  S+  SIE   D  SSLG+HPM
Sbjct: 601  PSSHIPRSNSSPCNICSDLVDESVKVDTSLAGSSLPPLASSPHSIEHAADSSSSLGSHPM 660

Query: 661  ITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQ 720
            ITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQ
Sbjct: 661  ITRAKAGIFKTRHPANLGVLGSSGLLSALLASTEPKGFKSAAKNPAWLAAMDEEVQALQQ 720

Query: 721  NDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVV 780
            N TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYTQVPGLDYTDTFSPVV
Sbjct: 721  NGTWILVPRPVNTNIVGSKWVFRTKYLPDGSVERLKARLVAKGYTQVPGLDYTDTFSPVV 780

Query: 781  KATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKA 840
            KATTVRVVLS+A+TNKWPLRQLDVKNAFLNGTL E V+MEQPPGY+DPRFP HVCLLKKA
Sbjct: 781  KATTVRVVLSLAITNKWPLRQLDVKNAFLNGTLTEHVYMEQPPGYIDPRFPTHVCLLKKA 840

Query: 841  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS 900
            LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Sbjct: 841  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSSLIYLLLYVDDIIVTGNNPS 900

Query: 901  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 960
            L++SFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
Sbjct: 901  LLDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 960

Query: 961  TPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLA 1016
            TPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLA
Sbjct: 961  TPMVVSQHLTVAGSPFSNPTLYRSLVGALQYLTITRPDIAHAVNSVSQFLHAPTIDHFLA 1020

BLAST of CmaCh03G007770 vs. ExPASy TrEMBL
Match: A0A2N9I601 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49318 PE=4 SV=1)

HSP 1 Score: 1688.7 bits (4372), Expect = 0.0e+00
Identity = 862/1132 (76.15%), Positives = 950/1132 (83.92%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MAS+S   LLPFNT+IHM+TIKLSSSNYLLWKSQLLPLLESQ++LG+VDGT+VPPP F+P
Sbjct: 1    MASDSSPTLLPFNTMIHMVTIKLSSSNYLLWKSQLLPLLESQNLLGHVDGTLVPPPPFDP 60

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
             TS T +PK+LAW+A DQRLL LLLSSLTEEAMA  VGLST+R+VW ALE T+SH+SKAR
Sbjct: 61   PTSQTPDPKHLAWKATDQRLLSLLLSSLTEEAMAEAVGLSTSREVWTALENTFSHRSKAR 120

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            E+RLKDDLQLMKRGT+PV  YARAFK +CDQLHAIGRPV+D DK HWFLRGLG++FS+FS
Sbjct: 121  EIRLKDDLQLMKRGTRPVTAYARAFKALCDQLHAIGRPVDDTDKTHWFLRGLGSDFSSFS 180

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGR--THESHPASFTNQ 240
            TAQ+ALTP+PCFADLVSK ESFELFQRSLE S +T  AF AT+RGR   H    ++ +NQ
Sbjct: 181  TAQLALTPLPCFADLVSKAESFELFQRSLEPSATTAAAFTATSRGRASNHGHFSSNRSNQ 240

Query: 241  RGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN 300
            +GRS    N+SSNRGR++S QGRRPP CQICR EGHYADRC+QRY R DSS AHLAEAFN
Sbjct: 241  QGRS---NNHSSNRGRSNSGQGRRPPRCQICRTEGHYADRCHQRYARTDSS-AHLAEAFN 300

Query: 301  TSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIE-SSN 360
             SCS++  + +DW+LDTGASAHMT   + LDQS  YTGKD VIVGNGASLPITH E + N
Sbjct: 301  ASCSLSETNPSDWYLDTGASAHMTPAQATLDQSTTYTGKDCVIVGNGASLPITHTEFTCN 360

Query: 361  R----------------------KGGGNR----------------------------TAA 420
            R                       G   R                            TAA
Sbjct: 361  RFQDHLSTSGIHHQLSCPHTPAQNGRAERKHRHVTETGLALLFHSHTSPRFWVDAFSTAA 420

Query: 421  YIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG 480
            YIINRLPT LLGGKSPFELLYG +P+Y+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLG
Sbjct: 421  YIINRLPTSLLGGKSPFELLYGSSPNYENFHPFGCRVYPCLRDYMPNKLSPRSIPCIFLG 480

Query: 481  YSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDS 540
            YSP HKGFRCLDP T+++YIT HAQFDETHFP + +SQAQP+S++  SNFLEP L   D 
Sbjct: 481  YSPSHKGFRCLDPTTSRIYITRHAQFDETHFPFLNTSQAQPISSLQFSNFLEPSLPPTDM 540

Query: 541  SP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIE------- 600
             P  P   SPHIPQS S+PCDIC+D VDES+QV+ SL G +L PS  +  S+E       
Sbjct: 541  PPSSPAPHSPHIPQSGSNPCDICTDPVDESLQVNDSLTGPSLPPSDPSPASLELPTELPT 600

Query: 601  -PPVDFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPA 660
              PV  + + +HPM+TRAKAGIFKTRHPANL +LG SGLLSALLASTEPKGFKSAAKNPA
Sbjct: 601  PAPVAATPMPSHPMLTRAKAGIFKTRHPANLAILGPSGLLSALLASTEPKGFKSAAKNPA 660

Query: 661  WVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQ 720
            W+AAMDEEI+ALQ N TW LVPRPANTNIVGSKWVFR KYLPDGS+ER KARLVAKGYTQ
Sbjct: 661  WLAAMDEEIQALQTNRTWILVPRPANTNIVGSKWVFRTKYLPDGSIERLKARLVAKGYTQ 720

Query: 721  VPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYV 780
            VPGLDYTDTFSPV+KATTVRVVLS+AVTNKWPLRQLDVKNAFLNG+L E V+MEQPPGY+
Sbjct: 721  VPGLDYTDTFSPVIKATTVRVVLSLAVTNKWPLRQLDVKNAFLNGSLTEHVYMEQPPGYI 780

Query: 781  DPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLL 840
            DPRFP HVC LKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS +IYLL
Sbjct: 781  DPRFPHHVCHLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSGIIYLL 840

Query: 841  LYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARD 900
            LYVDDII+TGNNSSL++SFT KLHSEFATKDLGSLSYFLGLEA PTPDGLF+SQLKYARD
Sbjct: 841  LYVDDIIITGNNSSLLDSFTHKLHSEFATKDLGSLSYFLGLEALPTPDGLFLSQLKYARD 900

Query: 901  ILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSV 960
            ILTRAQLLDSKPVHTPMVVSQHL+ADG  F DPTLYRSLVGALQYLTITRPDIA+AVNSV
Sbjct: 901  ILTRAQLLDSKPVHTPMVVSQHLSADGPLFPDPTLYRSLVGALQYLTITRPDIAHAVNSV 960

Query: 961  SQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTS 1020
            SQF+HAPTADHFLAVKRILRYVKGTLHFGL FRPS  P TLVAYSDADWAGCPDTRRSTS
Sbjct: 961  SQFMHAPTADHFLAVKRILRYVKGTLHFGLTFRPSAAPGTLVAYSDADWAGCPDTRRSTS 1020

Query: 1021 GYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLL 1070
            GYSIYLG+NL+SWSAKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVP+ QQPLL
Sbjct: 1021 GYSIYLGDNLVSWSAKKQPTVSRSSCESEYRALAMTAAELLWLTHLLHDLKVPLPQQPLL 1080

BLAST of CmaCh03G007770 vs. ExPASy TrEMBL
Match: A0A2N9EEM3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS1094 PE=4 SV=1)

HSP 1 Score: 1629.0 bits (4217), Expect = 0.0e+00
Identity = 839/1134 (73.99%), Positives = 932/1134 (82.19%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MAS+S   LLPFNT+IHM+TIKLSSSNYLLWKSQLLPLLESQ++LG+VDGT+VPPP F+P
Sbjct: 1    MASDSSPTLLPFNTMIHMVTIKLSSSNYLLWKSQLLPLLESQNLLGHVDGTLVPPPPFDP 60

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
             TS T +PK+LAW+A DQRLL LLLSSLTEEAMA  VGLST+R+VW ALE T+SH+SKAR
Sbjct: 61   PTSQTPDPKHLAWKATDQRLLSLLLSSLTEEAMAEAVGLSTSREVWTALENTFSHRSKAR 120

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            E+RLKDDLQLMKRGT+PV  YARAFK +CDQLHAIGRPV+D DK HWFLRGLG++FS+FS
Sbjct: 121  EIRLKDDLQLMKRGTRPVTAYARAFKALCDQLHAIGRPVDDTDKTHWFLRGLGSDFSSFS 180

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGR--THESHPASFTNQ 240
            TAQ+ALTP+PCFADLVSK ESFELFQRSLE S +T  AF AT+RGR   H    ++ +NQ
Sbjct: 181  TAQLALTPLPCFADLVSKAESFELFQRSLEPSATTAAAFTATSRGRASNHGHFSSNRSNQ 240

Query: 241  RGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN 300
            +GRS    N+SSNRGR++S QGRRPP CQICR EGHYADRC+QRY R DSS AHLAEAFN
Sbjct: 241  QGRS---NNHSSNRGRSNSGQGRRPPRCQICRTEGHYADRCHQRYARTDSS-AHLAEAFN 300

Query: 301  TSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASL-------PIT 360
             SCS++  + +DW+LDTGASAHMT   + LDQS  YT    ++V N  ++       P+ 
Sbjct: 301  ASCSLSETNPSDWYLDTGASAHMTPAQATLDQSTTYT---VMVVQNLLAIAFKIILVPLA 360

Query: 361  HIESS------NRKGGGNR----------------------------TAAYIINRLPTPL 420
             I +S       + G   R                            TAAYIINRLPT L
Sbjct: 361  FIINSLAHILPLQNGRAERKHRHVTETGLALLFHSHTSPRFWVDAFSTAAYIINRLPTSL 420

Query: 421  LGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRC 480
            LGGKSPFELLYG +P+Y+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRC
Sbjct: 421  LGGKSPFELLYGSSPNYENFHPFGCRVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKGFRC 480

Query: 481  LDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSP 540
            LDP T+++YIT HAQFDETHFP + +SQAQP+S++  SNFLEP L   D  P  P   SP
Sbjct: 481  LDPTTSRIYITRHAQFDETHFPFLNTSQAQPISSLQFSNFLEPSLPPTDMPPSSPAPHSP 540

Query: 541  HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIE--------PPVDFSSLG 600
            HIPQS S+PCDIC+D VDES+QV+ SL G +L PS  +  S+E         PV  + + 
Sbjct: 541  HIPQSGSNPCDICTDPVDESLQVNDSLTGPSLPPSDPSPASLELPTELPTPAPVAATPMP 600

Query: 601  THPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIR 660
            +HPM+TRAKAGIFKTRHPANL +LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEEI+
Sbjct: 601  SHPMLTRAKAGIFKTRHPANLAILGSSGLLSALLASTEPKGFKSAAKNPAWLAAMDEEIQ 660

Query: 661  ALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTF 720
            ALQ N TW LVPRPANTNIVGSKWVFR KYLPDGS+ER KARLVAKGYTQVPGLDYTDTF
Sbjct: 661  ALQTNRTWILVPRPANTNIVGSKWVFRTKYLPDGSIERLKARLVAKGYTQVPGLDYTDTF 720

Query: 721  SPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCL 780
            SPV+KATTVRVVLS+AVTNKWPLRQLDVKNAFLNG+L E V+MEQPPGY+DPRFP HVC 
Sbjct: 721  SPVIKATTVRVVLSLAVTNKWPLRQLDVKNAFLNGSLTEHVYMEQPPGYIDPRFPHHVCH 780

Query: 781  LKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTG 840
            LKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS +IYLLLYVDDII+TG
Sbjct: 781  LKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSGIIYLLLYVDDIIITG 840

Query: 841  NNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDS 900
            NNSSL++SFT KLHSEFATKDLGSLSYFLGLEA PTPDGLF+SQLKYARDILTRAQLLDS
Sbjct: 841  NNSSLLDSFTHKLHSEFATKDLGSLSYFLGLEALPTPDGLFLSQLKYARDILTRAQLLDS 900

Query: 901  KPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTAD 960
            KPVHTPM                           YLTITRPDIA+AVNSVSQF+HAPTAD
Sbjct: 901  KPVHTPM---------------------------YLTITRPDIAHAVNSVSQFMHAPTAD 960

Query: 961  HFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNL 1020
            HFLAVKRILRYVKGTLHFGL FRPS  P TLVAYSDADWAGCPDTRRSTSGYSIYLG+NL
Sbjct: 961  HFLAVKRILRYVKGTLHFGLTFRPSAAPGTLVAYSDADWAGCPDTRRSTSGYSIYLGDNL 1020

Query: 1021 ISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFL 1080
            +SWSAKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVP+ QQPLLLCDNKSAIFL
Sbjct: 1021 VSWSAKKQPTVSRSSCESEYRALAMTAAELLWLTHLLHDLKVPLPQQPLLLCDNKSAIFL 1080

Query: 1081 SSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE 1082
            SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTK++    FE
Sbjct: 1081 SSNPVSHKRAKHVELDYHFLRELVVAGKLRTQYVPSHLQVADIFTKSVSRSLFE 1100

BLAST of CmaCh03G007770 vs. NCBI nr
Match: RVW45095.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1749.9 bits (4531), Expect = 0.0e+00
Identity = 937/1358 (69.00%), Positives = 992/1358 (73.05%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT+VPPPRFEP
Sbjct: 17   MASESS-HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDLLGYVDGTLVPPPRFEP 76

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AADQRLLCLLLSSLTEEA+ VVVGLSTAR+VWLALE T+SH SKAR
Sbjct: 77   ETSTTLSTKYLAWKAADQRLLCLLLSSLTEEAIVVVVGLSTAREVWLALENTFSHHSKAR 136

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FS
Sbjct: 137  ELRLKDDLQLMKRGTKPVAEYARTFKTLCDQLHAIGRPVEDTDKVHWFLRGLGTDFSSFS 196

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
            TAQM+LTP+P FADLVSK ESFELFQRSLESS+ T  AF ATNR  T  SH   F    N
Sbjct: 197  TAQMSLTPLPYFADLVSKAESFELFQRSLESSEPTTAAFTATNRSHT-TSHGTPFAFRNN 256

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 257  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICRIEGHYADRCNQRYARTDSS-AHLAEAF 316

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITH     
Sbjct: 317  NTSCSLSGPEAADWFLDTGASAHMTTDPSILDQSKNYMGKDFVIVGNGASLPITHTGTLS 376

Query: 361  ---------------------------------------------------IESSNRKGG 420
                                                               + +  R GG
Sbjct: 377  PVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLFTVQNRQTGRVVATGKRDGG 436

Query: 421  ------GN---------------------------------------------------- 480
                  GN                                                    
Sbjct: 437  LYVLERGNSAFISVLKNKSLRASYDLWHARLGHVNYSVISFINKKGHLSLTSLLPSPSLC 496

Query: 481  ------------------------------------------------------------ 540
                                                                        
Sbjct: 497  STCQLAKNHRLPYSRNEHRSSHVLDLIHCDLWGPSPIKSNSGFLYYVIFIDDHSRFTWLY 556

Query: 541  ------------------------------------------------------------ 600
                                                                        
Sbjct: 557  PLKFKSDFFDIFLQFQKFVENQHSARIKVFQSDGGAEFTNTCFKAHLRTSGIHHQLSCPY 616

Query: 601  -------------------------------------RTAAYIINRLPTPLLGGKSPFEL 660
                                                  T  YIINRLPTPLLGGKSPFEL
Sbjct: 617  TPAQNGRAERKHRHVTETGLALLFHSHLSPRFWVDAFSTTTYIINRLPTPLLGGKSPFEL 676

Query: 661  LYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLY 720
            LYGY+PHY+NFHPFGC VYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LY
Sbjct: 677  LYGYSPHYENFHPFGCHVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKGFRCLDPTTSRLY 736

Query: 721  ITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSP 780
            IT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  SP  HIP+S+SSP
Sbjct: 737  ITRHAQFDETHFPTVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPPSPSSHIPRSNSSP 796

Query: 781  CDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTR 840
            C+ICSDLVDESVQVDTSLAGS+L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTR
Sbjct: 797  CNICSDLVDESVQVDTSLAGSSLPPLASSPHSIEHAADSSSSLGSHPMITRAKAGIFKTR 856

Query: 841  HPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPAN 900
            HPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LV RP N
Sbjct: 857  HPANLGVLGSSGLLSALLASTEPKGFKSAAKNPAWLAAMDEEVQALQQNGTWILVHRPVN 916

Query: 901  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA 960
            TNIVGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+A
Sbjct: 917  TNIVGSKWVFRTKYFPDGSVERLKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSLA 976

Query: 961  VTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWF 1020
            VTNKWPLRQLDV NAFLNGTL E V+MEQPPGY+DPRFP HVCLLKKALYGLKQAPRAWF
Sbjct: 977  VTNKWPLRQLDVNNAFLNGTLTEHVYMEQPPGYIDPRFPTHVCLLKKALYGLKQAPRAWF 1036

Query: 1021 QRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSE 1080
            QRFSSF LTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN SL++SFTRKLHS+
Sbjct: 1037 QRFSSFFLTLGFSCSRADTSLFVFHQQSSLIYLLLYVDDIIVTGNNPSLLDSFTRKLHSK 1096

Query: 1081 FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTAD 1082
            FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  
Sbjct: 1097 FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAHLLDSKPVHTPMVVSQHLTVA 1156

BLAST of CmaCh03G007770 vs. NCBI nr
Match: RVW41798.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 1746.9 bits (4523), Expect = 0.0e+00
Identity = 916/1198 (76.46%), Positives = 968/1198 (80.80%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT+VPPPRFEP
Sbjct: 118  MASESS-HLLPFNTLIHMITIKLSSSNYLLWKSQLLPFLESQDLLAYVDGTLVPPPRFEP 177

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGLSTAR+VWLALE T+SH SKAR
Sbjct: 178  ETSTTLSTKYLAWKAANQRLLCLLLSSLTEEAIAVVVGLSTAREVWLALENTFSHHSKAR 237

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL           
Sbjct: 238  ELRLKDDLQLMKRGTKPVAEYARTFKTLCDQLHAIGRPVEDTDKVHWFLH---------- 297

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
                          LVSK ESFELFQRSLESS+ T  AF ATNR RT  SH   F    N
Sbjct: 298  --------------LVSKAESFELFQRSLESSEPTTAAFTATNRSRT-TSHGTPFAFRNN 357

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 358  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICCIEGHYADRCNQRYARTDSS-AHLAEAF 417

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH     
Sbjct: 418  NTSCSLSGPEAADWFLDTRASAHMTTDPSILDQSKNYMGKDSVIVGNGASLPITHTGTLS 477

Query: 361  ----------------IESSNRKGG----------------------------------- 420
                            + +  R GG                                   
Sbjct: 478  PVPNIHLLDNRQTGRVVATGKRDGGLYVLERSNSAFIYVLKNKSLRASYDLWHARLAHLR 537

Query: 421  -----------------------------------------------GNRTAAYIINRLP 480
                                                              TA YIIN LP
Sbjct: 538  TSGIHHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSPRFWVDAFSTATYIINWLP 597

Query: 481  TPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKG 540
            TPLLGGKSPFELLY Y+PHY+NFHPFGCRVYP LRDYM NKLSPRSIPCIFLGYSP HKG
Sbjct: 598  TPLLGGKSPFELLYDYSPHYENFHPFGCRVYPCLRDYMSNKLSPRSIPCIFLGYSPSHKG 657

Query: 541  FRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS 600
            FRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Sbjct: 658  FRCLDPTTSRLYITRHAQFDETHFPTVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPPS 717

Query: 601  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPM 660
            P  HIP+S+SSPC+ICSDLVDESVQVDTSLAG +L P  S+  SIE   D  SSLG+HPM
Sbjct: 718  PSSHIPRSNSSPCNICSDLVDESVQVDTSLAGCSLPPLASSPHSIEHAADSSSSLGSHPM 777

Query: 661  ITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQ 720
            ITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQ
Sbjct: 778  ITRAKAGIFKTRHPANLGVLGSSGLLFALLASTEPKGFKSAAKNPAWLAAMDEEVQALQQ 837

Query: 721  NDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVV 780
            N TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVV
Sbjct: 838  NGTWILVPRPVNTNIVGSKWVFRTKYLPDGSVERLKARLVAKGYTHVPGLDYTDIFSPVV 897

Query: 781  KATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKA 840
            KATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGY+D RFP HVCLLKKA
Sbjct: 898  KATTVRVVLSLAVTNKWPLRQLDVKNAFLNGTLTEHVYMEQPPGYIDHRFPTHVCLLKKA 957

Query: 841  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS 900
            LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Sbjct: 958  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSSLIYLLLYVDDIIVTGNNPS 1017

Query: 901  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 960
            L++SFTRKLHSEFATKDLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
Sbjct: 1018 LLDSFTRKLHSEFATKDLGSLNYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 1077

Query: 961  TPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLA 1020
            TPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLA
Sbjct: 1078 TPMVVSQHLTVAGSPFSNPTLYRSLVGALQYLTITRPDIAHAVNSVSQFLHAPTIDHFLA 1137

Query: 1021 VKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWS 1080
            VKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWS
Sbjct: 1138 VKRILRYVKGTLHFGLTFRPSTIPSALVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWS 1197

Query: 1081 AKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNP 1088
            AKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVPI QQPLLLCDNKSAIF SSNP
Sbjct: 1198 AKKQPTVSRSSCESEYRALAMTAAELLWLTHLLHDLKVPIPQQPLLLCDNKSAIFFSSNP 1257

BLAST of CmaCh03G007770 vs. NCBI nr
Match: RVW43615.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 1703.7 bits (4411), Expect = 0.0e+00
Identity = 880/1124 (78.29%), Positives = 934/1124 (83.10%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMI IKLSSSNYLLWKSQLLPLLESQD+L YVDGT+VPPPRFEP
Sbjct: 1    MASESS-HLLPFNTLIHMINIKLSSSNYLLWKSQLLPLLESQDLLAYVDGTLVPPPRFEP 60

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AADQRLLCLLLSSLTEEA+AVVVGLSTAR+VWLALE T+SH SKAR
Sbjct: 61   ETSTTLSTKYLAWKAADQRLLCLLLSSLTEEAIAVVVGLSTAREVWLALENTFSHHSKAR 120

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL GLG +FS+FS
Sbjct: 121  ELRLKDDLQLMKRGTKPVAEYARTFKTLCDQLHAIGRPVEDTDKVHWFLHGLGPDFSSFS 180

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
            T QM+LTP+P FADLVSK ESFELFQRSLESS+ T  AF  TNR RT  SH   F    N
Sbjct: 181  TPQMSLTPLPYFADLVSKAESFELFQRSLESSEPTTAAFTTTNRSRT-TSHGTPFAFRNN 240

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 241  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICRIEGHYADRCNQRYARTDSS-AHLAEAF 300

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITH     
Sbjct: 301  NTSCSLSGPEAADWFLDTGASAHMTTDPSNLDQSKNYMGKDSVIVGNGASLPITHTGTLS 360

Query: 361  ----------------IESSNRKGG------GN--------------------------- 420
                            + +  R GG      GN                           
Sbjct: 361  PVPNIHLLDNRQTGRMVATGKRDGGLYVLERGNSAFISVLKNKSLRASYDLWHARLAHLR 420

Query: 421  -------------------------------------------------RTAAYIINRLP 480
                                                              TA YIINRLP
Sbjct: 421  TSGIHHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSPRFWVDAFSTATYIINRLP 480

Query: 481  TPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKG 540
            TPLLGGKSPFELLYG++PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKG
Sbjct: 481  TPLLGGKSPFELLYGHSPHYENFHPFGCRVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKG 540

Query: 541  FRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS 600
            FRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Sbjct: 541  FRCLDPTTSRLYITRHAQFDETHFPTVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPPS 600

Query: 601  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPM 660
            P  HIP+S+SSPC+ICSDLVDESV+VDTSLAGS+L P  S+  SIE   D  SSLG+HPM
Sbjct: 601  PSSHIPRSNSSPCNICSDLVDESVKVDTSLAGSSLPPLASSPHSIEHAADSSSSLGSHPM 660

Query: 661  ITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQ 720
            ITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQ
Sbjct: 661  ITRAKAGIFKTRHPANLGVLGSSGLLSALLASTEPKGFKSAAKNPAWLAAMDEEVQALQQ 720

Query: 721  NDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVV 780
            N TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYTQVPGLDYTDTFSPVV
Sbjct: 721  NGTWILVPRPVNTNIVGSKWVFRTKYLPDGSVERLKARLVAKGYTQVPGLDYTDTFSPVV 780

Query: 781  KATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKA 840
            KATTVRVVLS+A+TNKWPLRQLDVKNAFLNGTL E V+MEQPPGY+DPRFP HVCLLKKA
Sbjct: 781  KATTVRVVLSLAITNKWPLRQLDVKNAFLNGTLTEHVYMEQPPGYIDPRFPTHVCLLKKA 840

Query: 841  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS 900
            LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Sbjct: 841  LYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSSLIYLLLYVDDIIVTGNNPS 900

Query: 901  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 960
            L++SFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
Sbjct: 901  LLDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH 960

Query: 961  TPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLA 1016
            TPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLA
Sbjct: 961  TPMVVSQHLTVAGSPFSNPTLYRSLVGALQYLTITRPDIAHAVNSVSQFLHAPTIDHFLA 1020

BLAST of CmaCh03G007770 vs. NCBI nr
Match: RVW33283.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 1590.1 bits (4116), Expect = 0.0e+00
Identity = 851/1209 (70.39%), Positives = 910/1209 (75.27%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT+VPPPRFEP
Sbjct: 1    MASESS-HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDLLGYVDGTLVPPPRFEP 60

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+AADQRLLCLLLS LTEEA+AVVVGLSTAR+VWLALE T++H SKAR
Sbjct: 61   ETSTTLSTKYLAWKAADQRLLCLLLSFLTEEAIAVVVGLSTAREVWLALENTFNHHSKAR 120

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKRGTKPVAEYAR FK +C+QLHAIGRPVED DKVHWFLRGLGT+FS+FS
Sbjct: 121  ELRLKDDLQLMKRGTKPVAEYARTFKTLCNQLHAIGRPVEDTDKVHWFLRGLGTDFSSFS 180

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TN 240
            TAQM+LTP+P FADLVSK ESFELFQRSLESS+ T  AF ATN  RT  SH   F    N
Sbjct: 181  TAQMSLTPLPYFADLVSKAESFELFQRSLESSEPTTAAFTATNCSRT-TSHGTPFAFRNN 240

Query: 241  QRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF 300
            QRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Sbjct: 241  QRGRSHSHNNNSSNRGRTYSGHGRRPPRCQICRIEGHYADRCNQRYARTDSS-AHLAEAF 300

Query: 301  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH----- 360
            NTSCS++GP+AADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH     
Sbjct: 301  NTSCSLSGPEAADWFLDTGASAHMTTDPSILDQSKNYMGKDSVIVGNGASLPITHTGTLS 360

Query: 361  ---------------------------------------------------IESSNRKGG 420
                                                               + +  R GG
Sbjct: 361  PVPNIHLLDVLAVPHLTKNLLSISKLTSDFPLSVTFTNNLFTVQNRQTGRVVATGKRDGG 420

Query: 421  ------GN---------------------------------------------------- 480
                  GN                                                    
Sbjct: 421  LYVLERGNSAFISVLKNKSLRASYDLWHARLAHLRTSGIHHQLSCPYTPAQNGRVERKHR 480

Query: 481  ------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHP 540
                                     TA YIINRLPTPLLGGKSPFELLYGY+PHY+NFHP
Sbjct: 481  HVTETGLALLFHSHLSPRFWVDAFSTATYIINRLPTPLLGGKSPFELLYGYSPHYENFHP 540

Query: 541  FGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFP 600
            FGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETHFP
Sbjct: 541  FGCRVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKGFRCLDPTTSRLYITRHAQFDETHFP 600

Query: 601  AIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQ 660
             +PSSQAQPLS++ ISNFLEP LHHID SPP+++SP  HIP+S+SSPC+ICSDLVDESVQ
Sbjct: 601  TVPSSQAQPLSSLHISNFLEPRLHHIDPSPPSSTSPSSHIPRSNSSPCNICSDLVDESVQ 660

Query: 661  VDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGL 720
            VDTSLAGS+L P  S+  SIE   D  SSLG+H MITRAKAGIFKTRHPANLG+LGSSGL
Sbjct: 661  VDTSLAGSSLPPLASSPHSIEHATDSSSSLGSHLMITRAKAGIFKTRHPANLGVLGSSGL 720

Query: 721  LSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIK 780
            LS+LLASTEPKGFKSAAKNPAW+ AMDEE++ALQQN TW LVPRP NTNIVGSKWVFR K
Sbjct: 721  LSSLLASTEPKGFKSAAKNPAWLVAMDEEVQALQQNGTWILVPRPVNTNIVGSKWVFRTK 780

Query: 781  YLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVK 840
            Y PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+ VTNKWPLRQLDVK
Sbjct: 781  YFPDGSVERLKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSLVVTNKWPLRQLDVK 840

Query: 841  NAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFS 900
            NAFLNGTL E V+MEQPPGY+DPRFP H                                
Sbjct: 841  NAFLNGTLTEHVYMEQPPGYIDPRFPTH-------------------------------- 900

Query: 901  CSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFL 960
                         QS+LIYLLLYVDDIIVTGNN SL++SFTRKLHSEFATKDLGSLSYFL
Sbjct: 901  -------------QSSLIYLLLYVDDIIVTGNNPSLLDSFTRKLHSEFATKDLGSLSYFL 960

Query: 961  GLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSL 1020
            GLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLY+SL
Sbjct: 961  GLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTVAGSPFSNPTLYQSL 1020

Query: 1021 VGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPS 1066
            VGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFGL FRPST+  
Sbjct: 1021 VGALQYLTITRPDIAHAVNSVSQFLHAPTIDHFLAVKRILRYVKGTLHFGLTFRPSTI-- 1080

BLAST of CmaCh03G007770 vs. NCBI nr
Match: RVW96109.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 1575.5 bits (4078), Expect = 0.0e+00
Identity = 833/1127 (73.91%), Positives = 890/1127 (78.97%), Query Frame = 0

Query: 1    MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEP 60
            MASES +HLLPFNTLIHMITIKLSSSNYLLWKSQLL LLESQD+LGYVDGT+VPPPRFEP
Sbjct: 26   MASES-FHLLPFNTLIHMITIKLSSSNYLLWKSQLLSLLESQDLLGYVDGTLVPPPRFEP 85

Query: 61   ETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR 120
            ETS+T + KYLAW+A DQRLLCLLLSSLTEEA+AVVVGLSTAR+VWLALE T+SH  KAR
Sbjct: 86   ETSTTLSTKYLAWKAIDQRLLCLLLSSLTEEAIAVVVGLSTAREVWLALENTFSHHLKAR 145

Query: 121  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFS 180
            ELRLKDDLQLMKR TKPVAEYAR FK +CDQLHAIGRPVEDIDKVH              
Sbjct: 146  ELRLKDDLQLMKRDTKPVAEYARTFKTLCDQLHAIGRPVEDIDKVH-------------- 205

Query: 181  TAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFIATNRGRTHESHPASFTNQRG 240
                                          +S  TP AF                +NQRG
Sbjct: 206  ---------------------------CRTTSHGTPFAF---------------RSNQRG 265

Query: 241  RSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTS 300
            RS+SH NNSSNRGRT+S  GRRPP CQICR EG+YA+RCNQRY R DSS AHLA+A NTS
Sbjct: 266  RSHSHNNNSSNRGRTYSGHGRRPPRCQICRIEGYYANRCNQRYARTDSS-AHLAKALNTS 325

Query: 301  CSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIESSNRKG 360
            CS++G +AADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH       G
Sbjct: 326  CSLSGLEAADWFLDTGASAHMTTDPSILDQSKNYMGKDSVIVGNGASLPITHTAHLRTSG 385

Query: 361  GGNR-------------------------------------------TAAYIINRLPTPL 420
              ++                                           TA YIINRLPTPL
Sbjct: 386  IHHQLFCPYTPAQNGRAERKHRHVTETGLALLFYSHLSPRFWVDAFSTATYIINRLPTPL 445

Query: 421  LGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRC 480
            LGGK+ FELLYGY+PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRC
Sbjct: 446  LGGKASFELLYGYSPHYENFHPFGCRVYPCLRDYMPNKLSPRSIPCIFLGYSPSHKGFRC 505

Query: 481  LDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSP 540
            LDP T++LYIT HAQFDETHFP IPSSQAQPLS++ ISNFLEP LHHID SP  PT+ S 
Sbjct: 506  LDPTTSRLYITRHAQFDETHFPTIPSSQAQPLSSLHISNFLEPRLHHIDPSPPSPTSHSS 565

Query: 541  HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITR 600
            HIP+S+SSPC+ICSDLVDESVQVDTSLAGS+  P  S+  SIE   D  SSLG+HPMITR
Sbjct: 566  HIPRSNSSPCNICSDLVDESVQVDTSLAGSSFPPLASSPHSIELAADSSSSLGSHPMITR 625

Query: 601  AKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDT 660
            AKAGIFKTRHPANLG+LGS GLLS LL STEPKGFKSAAKNP W+A MDEE++ALQQN  
Sbjct: 626  AKAGIFKTRHPANLGVLGSFGLLSTLLTSTEPKGFKSAAKNPVWLATMDEEVQALQQN-- 685

Query: 661  WTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKAT 720
                                                   GYTQVPGLDYTDTFS VVKAT
Sbjct: 686  ---------------------------------------GYTQVPGLDYTDTFSLVVKAT 745

Query: 721  TVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYG 780
            TVRVVLS+AVTNKWPLRQ DVKNAFLNGTL E V+MEQP GY+D RFP HVCLLKKALYG
Sbjct: 746  TVRVVLSLAVTNKWPLRQFDVKNAFLNGTLTEHVYMEQPLGYIDSRFPTHVCLLKKALYG 805

Query: 781  LKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLIN 840
            LKQAPRAWFQRFSSFLLTLGFS SRA  SLFVFHQQS+LIYLLLYV DIIVTGNN SL++
Sbjct: 806  LKQAPRAWFQRFSSFLLTLGFSSSRAYISLFVFHQQSSLIYLLLYVYDIIVTGNNPSLLD 865

Query: 841  SFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPM 900
            +FTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPM
Sbjct: 866  NFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPM 925

Query: 901  VVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKR 960
            VVSQHLT   SPFS+PT YRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT D+FLAVKR
Sbjct: 926  VVSQHLTIASSPFSNPTFYRSLVGALQYLTITRPDIAHAVNSVSQFLHAPTIDNFLAVKR 985

Query: 961  ILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKK 1020
            ILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTS YSIYLGNNL+SWSAKK
Sbjct: 986  ILRYVKGTLHFGLTFRPSTIPSALVAYSDADWAGCPDTRRSTSSYSIYLGNNLVSWSAKK 1045

Query: 1021 QPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSH 1080
            QPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVPI QQ LLLCDNKSAIFLSSNPVSH
Sbjct: 1046 QPTVSRSSCESEYRALAMTAAELLWLTHLLHDLKVPIPQQSLLLCDNKSAIFLSSNPVSH 1053

Query: 1081 KRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE 1082
            KRAKHVELDYHFLRELV+AGKL TQYVPSHLQVADIFTK++    FE
Sbjct: 1106 KRAKHVELDYHFLRELVVAGKLCTQYVPSHLQVADIFTKSVSRPLFE 1053

BLAST of CmaCh03G007770 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 421.4 bits (1082), Expect = 2.5e-117
Identity = 219/478 (45.82%), Positives = 301/478 (62.97%), Query Frame = 0

Query: 577  LSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIK 636
            L  +  + EP  +  A +   W  AMD+EI A++   TW +   P N   +G KWV++IK
Sbjct: 77   LVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIK 136

Query: 637  YLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVK 696
            Y  DG++ER+KARLVAKGYTQ  G+D+ +TFSPV K T+V+++L+I+    + L QLD+ 
Sbjct: 137  YNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDIS 196

Query: 697  NAFLNGTLIERVHMEQPPGYV----DPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLT 756
            NAFLNG L E ++M+ PPGY     D   P  VC LKK++YGLKQA R WF +FS  L+ 
Sbjct: 197  NAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIG 256

Query: 757  LGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSL 816
             GF  S +D + F+    +  + +L+YVDDII+  NN + ++    +L S F  +DLG L
Sbjct: 257  FGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPL 316

Query: 817  SYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPT 876
             YFLGLE + +  G+ I Q KYA D+L    LL  KP   PM  S   +A  G  F D  
Sbjct: 317  KYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAK 376

Query: 877  LYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRP 936
             YR L+G L YL ITR DI++AVN +SQF  AP   H  AV +IL Y+KGT+  GL F  
Sbjct: 377  AYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGL-FYS 436

Query: 937  STVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA 996
            S     L  +SDA +  C DTRRST+GY ++LG +LISW +KKQ  VS+SS E+EYRAL+
Sbjct: 437  SQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALS 496

Query: 997  TTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRE 1050
                E++W+     +L++P+S+  LL CDN +AI +++N V H+R KH+E D H +RE
Sbjct: 497  FATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of CmaCh03G007770 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 249.2 bits (635), Expect = 1.7e-65
Identity = 123/226 (54.42%), Positives = 165/226 (73.01%), Query Frame = 0

Query: 774  IYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLK 833
            +YLLLYVDDI++TG++++L+N    +L S F+ KDLG + YFLG++    P GLF+SQ K
Sbjct: 1    MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60

Query: 834  YARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYA 893
            YA  IL  A +LD KP+ TP+ +  + +   + + DP+ +RS+VGALQYLT+TRPDI+YA
Sbjct: 61   YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 120

Query: 894  VNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTR 953
            VN V Q +H PT   F  +KR+LRYVKGT+  GL    ++    + A+ D+DWAGC  TR
Sbjct: 121  VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNS-KLNVQAFCDSDWAGCTSTR 180

Query: 954  RSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLW 1000
            RST+G+  +LG N+ISWSAK+QPTVSRSS E+EYRALA TAAEL W
Sbjct: 181  RSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmaCh03G007770 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 118.6 bits (296), Expect = 3.5e-26
Identity = 60/133 (45.11%), Positives = 82/133 (61.65%), Query Frame = 0

Query: 551 MITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQ 610
           M+TR+KAGI K     +L +              EPK    A K+P W  AM EE+ AL 
Sbjct: 1   MLTRSKAGINKLNPKYSLTI--------TTTIKKEPKSVIFALKDPGWCQAMQEELDALS 60

Query: 611 QNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPV 670
           +N TW LVP P N NI+G KWVF+ K   DG+++R KARLVAKG+ Q  G+ + +T+SPV
Sbjct: 61  RNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPV 120

Query: 671 VKATTVRVVLSIA 684
           V+  T+R +L++A
Sbjct: 121 VRTATIRTILNVA 125

BLAST of CmaCh03G007770 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 82.0 bits (201), Expect = 3.6e-15
Identity = 41/78 (52.56%), Positives = 53/78 (67.95%), Query Frame = 0

Query: 882 YLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAY 941
           YLTITRPD+ +AVN +SQF  A       AV ++L YVKGT+  GL F  +T    L A+
Sbjct: 2   YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGL-FYSATSDLQLKAF 61

Query: 942 SDADWAGCPDTRRSTSGY 960
           +D+DWA CPDTRRS +G+
Sbjct: 62  ADSDWASCPDTRRSVTGF 78

BLAST of CmaCh03G007770 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 77.4 bits (189), Expect = 8.9e-14
Identity = 64/243 (26.34%), Positives = 108/243 (44.44%), Query Frame = 0

Query: 19  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQ 78
           + + +  SNY  W+   L    S D++G++DGT++P            N   + W+  D 
Sbjct: 22  VMLDIEESNYDAWRELFLTHCLSFDVMGHIDGTLLPT-----------NANDVNWQKRDG 81

Query: 79  RLLCLLLSSLT-EEAMAVVVGLSTARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKP 138
            +   L  +LT ++     V  ST+RD+WL ++  + +   AR LRL  +L+    G   
Sbjct: 82  IVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDSELRTKDIGDMR 141

Query: 139 VAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVS 198
           VA+Y R  KK+ D L  +  PV D + V + L GL  +F           P P F D  +
Sbjct: 142 VADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFDDAAT 201

Query: 199 K-TESFELFQRSLESS-----DSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSN 255
              E  +  +R+++ +      S+ +  +A +      +   S  NQ G     + N+  
Sbjct: 202 MLQEEEDRLKRAIKPNPTHVDHSSSSTVLACSEAPPVTNFQRSGGNQMGYRGRGRGNNIF 253
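
The percentages in each HSP summary line above are the matched (or similar) positions divided by the alignment length. The following is a minimal Python sketch for recomputing them, assuming the summary-line layout used on this page; `parse_hsp_summary` is a hypothetical helper written for illustration and is not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP
# summary line. "Posi?tives" also accepts the misspelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the counts from one HSP summary line plus recomputed percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, _, positives, pos_length, _ = m.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100.0 * identities / length, 2),
        # Percent positives = similar positions / alignment length.
        "positives_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 880/1124 (78.29%), Positives = 934/1124 (83.10%), Query Frame = 0"
    print(parse_hsp_summary(example))
```

The recomputed values can be compared against the percentages printed in the summary line as a quick consistency check.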

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
Q94HW2       2.7e-177  30.46     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94       9.5e-170  30.70     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978       1.2e-100  33.33     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146       1.1e-96   30.60     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519       2.4e-64   54.42     Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...

Match Name   E-value   Identity  Description
A0A438EBA0   0.0e+00   69.00     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
A0A438E275   0.0e+00   76.46     Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976...
A0A438E763   0.0e+00   78.29     Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976...
A0A2N9I601   0.0e+00   76.15     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49318 PE=4 SV=1
A0A2N9EEM3   0.0e+00   73.99     Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...

Match Name   E-value   Identity  Description
RVW45095.1   0.0e+00   69.00     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVW41798.1   0.0e+00   76.46     Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]
RVW43615.1   0.0e+00   78.29     Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]
RVW33283.1   0.0e+00   70.39     Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]
RVW96109.1   0.0e+00   73.91     Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]

Match Name   E-value   Identity  Description
AT4G23160.1  2.5e-117  45.82     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1  1.7e-65   54.42     DNA/RNA polymerases superfamily protein
ATMG00820.1  3.5e-26   45.11     Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1  3.6e-15   52.56     Gag-Pol-related retrotransposon family protein
AT1G34070.1  8.9e-14   26.34     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                      Source       Source Term    Source Description                         Alignment
None       No IPR available                                     COILS        Coil           Coil                                       coord: 1203..1229
None       No IPR available                                     PFAM         PF14223        Retrotran_gag_2                            coord: 73..185; e-value: 6.5E-17; score: 61.6
None       No IPR available                                     MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 222..262
None       No IPR available                                     MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 1207..1234
None       No IPR available                                     MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 523..545
None       No IPR available                                     MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 523..542
None       No IPR available                                     MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 1208..1227
None       No IPR available                                     MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 222..266
None       No IPR available                                     PANTHER      PTHR47481:SF3  GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED  coord: 3..356
None       No IPR available                                     PANTHER      PTHR47481      FAMILY NOT NAMED                           coord: 3..356
None       No IPR available                                     CDD          cd09272        RNase_HI_RT_Ty1                            coord: 939..1075; e-value: 3.24335E-73; score: 237.366
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727        RVT_2                                      coord: 612..854; e-value: 1.1E-67; score: 228.2
IPR043502  DNA/RNA polymerase superfamily                       SUPERFAMILY  56672          DNA/RNA polymerases                        coord: 612..1045

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh03G007770.1  CmaCh03G007770.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006468 protein phosphorylation
biological_process GO:0006412 translation
cellular_component GO:0005840 ribosome
molecular_function GO:0008097 5S rRNA binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0004672 protein kinase activity
molecular_function GO:0003735 structural constituent of ribosome