ClCG03G010470 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG03G010470
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Reverse transcriptase domain-containing protein
Location: CG_Chr03: 18839400 .. 18845501 (+)
RNA-Seq Expression: ClCG03G010470
Synteny: ClCG03G010470
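
The Location field packs the chromosome, start, end, and strand into one string. The sketch below shows one way to split it and compute the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'CG_Chr03: 18839400 .. 18845501 (+)'.

    Returns (chromosome, start, end, strand). The layout of the string is
    taken from the Overview block; nothing else about the format is assumed.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("CG_Chr03: 18839400 .. 18845501 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 6102 bp on the plus strand of CG_Chr03.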
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTTTGACAAAGTGCACCAGGAAGAGGCATTTAGCCTTGCATGCTCGAACCAATCCTCCTAATTGGAAGGTAAGTGCTCTCTCCTCTCTCCTCCCTACTTTCCTCTCTTTCCTCTCTCTTCCCCTCTCCACCTCCCCTCGGCGATCTCTGCTCCCGCTCGGTCAGGCCTCTTCTGGCTGATAGTTCTGGATTAATTACATTATTGGCGCAAGAATAATAGTCATGGAAGTAGTGAGCTACAAAATCAATGGAGTGTTCTGTTCATGGTTTGAAAAAGGAAAATTCTTAGTAGAAGATATTGAAGTTCGAAAATCAATCCAACTCTCAAAATCATAGATGCTTTGGACGATAGAACAAATCGAAGACCTCCTCAACGGTTCAAGCATGCGTTTTTTCCTAAGAGAATGGAGAGATGACTTAGGAACAATAAGAATTTCGAAGTTAAAGATCAAGGCGGGATGGAATTTAAGATGTGTAGTTTGGCCGATTTCCGGAGGGAGATTCTATATTCATGTACCAGAAGGAGTAGCTCAACATGGATGATCAGAGTTCTTGAAGATGCTGAAAGGAAGTCTCCAACGATTTGGAAACCACAGAGAAAAGTCAATTTTTGAGGAGGATCATCAGAAGTACTCTCACTTGGCACCATTAATACAACCAAGCTTTGCTGAAATAGTGTCAACAAGGAAAAGCTAGTGGTTTTCCTTTGTCCGGGCATCCACAAATCTGAGTAAAAGGGAGGGTACTCAGGTGCAAAAAACGCTCCATGCTGCTGCAACAATAGAACAGAGTATACAACCTCGTCGAGCTGAAAAGAAGGAGAAAGAGGGCATGGAGAATTTTGTAACGAGTTCAAAGGAAACATCAGCAAAGGACCGGTACTGTGCTCCACTCAAACATCAAAATCAACCGCAATTCTGGGTTAGGAAAAACAGAGAAGTCTTCAAGGAAGACTTTAACAACCTATGGGTTCTCACCAGGTTATTTGCCTCTGATGATTGGAAGGAAATTGCCAAATTCTTAGAGGTCTTCTACAATGTGAAAGTTAAAATCAACCCCCTTTTCGATGACAAAGCCCTATTCCGAGTAGTCCAAGGGAAGATAGAAGATATAATTGAAGCCCCCGACAAATGGTCTAATTATAGAAAATTTCATTTATTATTTGAAAATGGAATTCATTACTTCACAGTAGACCCTCTGTTTTAAGAAGCTTTGGTGGATGGATAGTAATCAAGAACTTATCGCTGGAGTATTGGAGCAGAGAAACATTTGAAGCAATTGGACAACACTTTGGAGGATTGGAGGAGATAAGTATTGAAACACTGAATTTGTTAGACGTTTCAGAGACTAAAATTAAAGTTAGAAAGAATGTTTATGGTTTTATTCCAGCTACAATCGAAATTAACAATGAATTTAGAGGTTCTATCCATTTACTATTTGGAGACATCATACCAATGAAAGATTCAAGTTCCATTATCACGGAATTTATTGGGAGTGATTTCGAAAACCCCATTGACCAAGTACGGTTGAGCAAAGTCGAAGAAGATGAGGTTAGAGTCTTTTCTCCCTCAAAATTGGCGCTGCTTAGTTCGAAACCTGACCAGACCCAACCAGAGCAGGAGGAGAAGTCACTGGAAATATCGGTCGGGAGTCCATCAGTCGGAAAATGGTCGAAGTCGACAGCAAATGGCCTCGAAGGAAAAAGCATTAATGAGAAGGCGGCAAAATTAAATTGGTGTGAGGAAACCGAAGAGGCAGCCGTAGGAAGTTCAATAGGTGTGTTGAAATCGATTCGGCCTATAGAAGTTATGAAGAAAGAGAAGAGGAAAGAGAGGGTGTTTGGCCGAATTAATGAGAATATTAAAAGCATCCAGACCCCTGACATTCCATCTGGGGATTTCCTTCGGTCAGGTGGGCCCACCGGCTTCGAAGTCATCCAAAAAATTCGAAAAGTCCCAAACGTTCTTTTGACAGTAGCAGAAGAATCCGAAAAGTCA
CCCCCAAATAGCATGAAGTCCAATTGGGAAGAGGTTCCCTTTCAGACATTGCTGCCCTATCATCAGCTCCCTCGAGCTCGCGTGAATGCCTTTCCTCAGCAGTCTCCAGCACACACGATTTCCTCGGTCAAGAGCCCTTTCTTCCCAGCCGTTAGGAGGAGTTCAAATCTCTTCAAACCCTCCCCAAAACACTTTAGCAGAGGAAAAACATCGTTCCTCAACCTTTGGTCAGCATTGACAAATCCAGATATACTCGAAGTTAGTTGTGCAAACTCTCAAGTCCCCCAACAAATTCGTGACAGGTCAGTACTCCCCCTCTCTTCATCACAAATATTCCATCAATCAGAAATTCTAATTCCGGGATCAAATATTCTCTTCTTGCGAGGCTCCCCAAGCTCTTACCGTCCCAAGCCTAATAAGATTCAAGACTCAGAAGAGGAATCACTCATCAGCATAAGTAGTGATGAATTAAATGAGCCGGAAGAAGATGAGAACCATTTAAGTTTGGAATTGGATGAATCCATCGGTGATGATCTAAATAAACTTTTCTAGAGCAGCCAAACCAGAAAAGGCAGCAAGTGTACAACCGACCACCACTCTTCAATGCTTAAAGAAAAAATTCCTCGGCACTTGAGATCATTTGTAGAAGATTGTGGAATTTCACTGGTTTGAGTGAAGGTGTGGCGGATTAGAACATAGAAGTGTAAAGGTATTCTCAATGAAAATTGTTTCTTGGAACATTAGAGGCCTTGGGGATCAATCAAAGCAGCTGGCTGTTAAGCACCTAATTATGAAGACAAATCTAGAATTGGTTTTAATCCAAGAAACAAAAAAAGAGGCATTTGAAGCTGAAGCAATCAAGAAGTTATGGAGTTCAAGAGACATTGGCTGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTGCTAATCATGTGGGATGAAAGTAAAATATCAGTCATAGAAACAATCAAAGGAGGCTACACTCTATCCGTTAAATGTAAGACCTTATGTAGCAAAGTTTGTTGGGTAACAAATGCATACGGACCAACCGATTATAAAGAAAGGAGGCGCATCTGGCGGGAGTTACAAGCTTTGGCAGCATATTGCACTAATGCTTGGTGCTTGGGTGGGGACTTCAACATCACTAGAGCAATCCACGAAAGAGTTCCAATTGGAAGATTAACTAGAGGAATGAAGAAGTTTAGCAAATTCATAGAAGAGGCACACTTAATGGAAATTCCTATGAGCAATGGGCGTTTCACTTGGTCGAGAGAAGGAAACAGAGTATCAAGATCCTTGCTAGACAGATTTCTGGTGACAAACGAAGGGGATGAAGCATTTGAAGGCACTCGAATCTTTAGATAGGTTCGCATTGTATCAGATCACTTTCCTCTCTTATTAGAAGCTGGGGCGTTAGAATGGGGACCTTCCCCTTTTCGTTTTTGCAACAGTTGGCTGCTTAATTCCCAGTGCAACAGCATTATAATTAGAGCTCTTGCAGCTGGAAATCATCAAGGTTGGGCTGGATTTGTCATTTCTGCTAAGTTCAGATCAGTTAAAACCTCCCTACAACAGTGGCACGCTGACTTTTTAACAAAACAAAGAAAATAGGAGGCGGAAATTGTAGAAAAGCTGGATAAAGAGGAGCAGGGGGCAGAATTAGAGGATCCATCCTCCATTTTACAGGACCCAAGGGCATCCTTAAAATCTGATTTGATGAACATTTACAAGAAAAAGGAAAGAGATTTAATCCAGAAGAGTAAGCTGAATTGGTTACATTTAGGGGATGAAAACACGAGATTCTTCCACCGTTTTCTCGCTGCAAAAAAGAGAAAAAATCTTATAGCAGAGCTAGTCAATGATCAAGGCTTTCCAACTAATTCGTATTGTGAGATAGAAGACCAAATCTTAAATTTCTATAAGAATCTTTACACCAAAACTCCAAGTGCTGGCTGTTTTCCTGCAAATCTGGAATGGCAAAGGGTGTCAGTTGAAC
AAAACAGTAGGCTGTCTTCAAAATTCAGCAGGGAAGAAATCAGATTTGCTTTAAGAGGAATGGGGAAAAATAAAGCGCCAGGTCCGGATGGTTTCACGGTGGAATTCCTTAACAAATTTTGGGATAGAATCAAAGATGATTTTGTTGCCCTATTCAATGAATTTCATGAGAATGGAAGGCTAAATTCCTGTGTGAAGGAGAACTTCATTTGTCTGATCAAGAAAAAAGAAGATGCCATCATGGTTAAAGACTTCCGGCCAATTAGCCTCACTACATTAACATATAAAGTAATTGCCAAAGTGTTAGCGGAAAGGTTGAAATTGGTTATGCCAAGCATCATAGCACCCACCCAAAGTGCATTTATCGAAGGTTGACAGATCTTAGATCCAATACTCATTGCCAATGAATTGGTGGAGGACTATAGAATAAAGAAGAAAAAAGGGTGGATTCTAAAGCTTGACCTAGAAAAAGCCTTTGGTAGAGTAGATTGGGGATTTCTAGAAAAGGCTCTACACGGCAAGAATTTCGATTCGAAATGGATCTCCTGGATTCTAGGTTGCATCAAGAACCCTAAATTCTCCATCTTCATTAATGGAAGACCAAGAGGAAGAGTTCAAGCCTCAAGAGGTGTGAGACAAGGAGACCCTCTTTCGCCCTTCTTATTCCTCCTAGTAAGTGAGGTATTGACTAGTCTTATTTCAAGACTTCATAAAAGTAAGAAATTTGAGGGATTTATAGTTGGAAAGAAGAAGGTACATGTTCCAATTCTTCAATTCGCAGATGATACACTCCTATTTTGTAAGTACGATCTTGATATGTTGGAAGCCTTGAGGAAAACTATTGAGTTTTTTGAATGGTGCTTCGGACAAAAGGTGAATTGGGATAAATCAGCCTTGTGTGGACTGAACATTGATGATTTAGAGGTTAAATCGACTGCTGCAAGATTAAATTGCAAGGCTGAAAAATTGCCTCTAATGTATCTTGGTCTGCCCCTAGGAGGACACCCAAAAAAGATGGTCTTTTGGCAGCCTATCATTGACAAAATTCAAGGAAAATTGAGCAGGTGGAAAAGAAATAACCTCTCAAGAGGGGGTAGACTTACATTATGCAAGACCGTGCTTTCAAACCTCCCCTCTTACTACATGTCTATCTTTTTAATGCCGGAAAAGGTAGTCCTTTTAATAGAAAGAGCCATGAGAAATTTCTTTTGGGAAGGCCATGGAGGTAGCAAATTGAATCACCTTGCCCGATGGGTAACCGTCACAAAAAACCATAAGGATGGAGGTCTTGGGCTGGAAAATTTGAAAATCAAAAACTTGGCATTGCTGTCCAAGTGGGGTTGGCGTTTTATGCAAGAATCTGAAGCCCTTTGGTGCAAAGTTATCACAAGTATCCAAGGAGAGGACCGCTTCTAATGGCATACTAACAGAAAGGAAGTTGCAAGCTTAAGAAGCCCTTGGATAAGCATTTCAAGACAATGGCAGAAAATTGAAGCCCTAGCAATCTTTAAAGTAGGAGATGGAAGAAGAATAACATTCTGGTTTGATCCTTGGCTTGAGGATCAGCCCTTCAAGGTCAGATTTCCAAGACTTTTCGAGCTAGCTCTCAAGCCAAACGGTACAGATGCAGATCATTGGGACCCATGTTCCTCCTCGTGGGATTTATTGTTCAAAAGACGGCTAAAAGAGGAAGAGATAGGTGAGTTTCTGTCCCTCTCAAGCAGTGTGGCAAATAAGAGAGTTATGTTGCAGCCGGATAAAAGGATTTGGGCATTAGAAGGGCAGTAGTTTCTCCATAAAATCCCTAACCACTCATCTTTCTGTAGCTTCCCCAATTGACGATGAGTTGGTAAAATGTCTATGGAAGTCCAAATGCCCCAGGAGAGTCAACATTCAGATTTGGACCATGCTGTTTGGTTCTTTAAAATGTGCTGCCACGCTTCAAAGGAAACTCCCATATCATTGCTTGTCCCCCCACATGTGTGTCCTTTGCCGCCAAGA
TCAAGAAGACATCCAGCACCTTTTCTTCGGTTGCAGCTATGCTTCAAGTTGCTGGTCGAGACTGTTTGGCTTTTTCGGTTTGAGTTGGGTAATGGAGAGTGA

mRNA sequence

ATGCCTTTGACAAAGTGCACCAGGAAGAGGCATTTAGCCTTGCATGCTCGAACCAATCCTCCTAATTGGAAGTTCTGGATTAATTACATTATTGGCGCAAGAATAATAGTCATGGAAGTAGTGAGCTACAAAATCAATGGAGTGTTCTGTTCATGGTTTGAAAAAGGAAAATTCTTAGTAGAAGATATTGAAAATTTCGAAGTTAAAGATCAAGGCGGGATGGAATTTAAGATGTGTAGTTTGGCCGATTTCCGGAGGGAGATTCTATATTCATGTACCAGAAGGAGTAGCTCAACATGGATGATCAGAGTTCTTGAAGATGCTGAAAGGAAGTCTCCAACGATTTGGAAACCACAGAGAAAAGTGCAAAAAACGCTCCATGCTGCTGCAACAATAGAACAGAGTATACAACCTCGTCGAGCTGAAAAGAAGGAGAAAGAGGGCATGGAGAATTTTGTAACGAGTTCAAAGGAAACATCAGCAAAGGACCGGTACTGTGCTCCACTCAAACATCAAAATCAACCGCAATTCTGGGTTAGGAAAAACAGAGAAGTCTTCAAGGAAGACTTTAACAACCTATGGGTTCTCACCAGGTTATTTGCCTCTGATGATTGGAAGGAAATTGCCAAATTCTTAGAGGTCTTCTACAATGTGAAAGTTAAAATCAACCCCCTTTTCGATGACAAAGCCCTATTCCGAGTAGTCCAAGGGAAGATAGAAGATATAATTGAAGCCCCCGACAAATGTAGACCCTCTGTTTTAAGAAGCTTTGGTGGATGGATAGTAATCAAGAACTTATCGCTGGAGTATTGGAGCAGAGAAACATTTGAAGCAATTGGACAACACTTTGGAGGATTGGAGGAGATAAGTATTGAAACACTGAATTTGTTAGACGTTTCAGAGACTAAAATTAAAGTTAGAAAGAATGTTTATGGTTTTATTCCAGCTACAATCGAAATTAACAATGAATTTAGAGGTTCTATCCATTTACTATTTGGAGACATCATACCAATGAAAGATTCAAGTTCCATTATCACGGAATTTATTGGGAGTGATTTCGAAAACCCCATTGACCAAGTACGGTTGAGCAAAGTCGAAGAAGATGAGGTTAGAGTCTTTTCTCCCTCAAAATTGGCGCTGCTTAGTTCGAAACCTGACCAGACCCAACCAGAGCAGGAGGAGAAGTCACTGGAAATATCGGTCGGGAGTCCATCAGTCGGAAAATGGTCGAAGTCGACAGCAAATGGCCTCGAAGGAAAAAGCATTAATGAGAAGGCGGCAAAATTAAATTGGTGTGAGGAAACCGAAGAGGCAGCCGTAGGAAGTTCAATAGGTGTGTTGAAATCGATTCGGCCTATAGAAGTTATGAAGAAAGAGAAGAGGAAAGAGAGGGTGTTTGGCCGAATTAATGAGAATATTAAAAGCATCCAGACCCCTGACATTCCATCTGGGGATTTCCTTCGGTCAGGTGGGCCCACCGGCTTCGAAGTCATCCAAAAAATTCGAAAAGTCCCAAACGTTCTTTTGACAGTAGCAGAAGAATCCGAAAAGTCACCCCCAAATAGCATGAAGTCCAATTGGGAAGAGGTTCCCTTTCAGACATTGCTGCCCTATCATCAGCTCCCTCGAGCTCGCGTGAATGCCTTTCCTCAGCAGTCTCCAGCACACACGATTTCCTCGGTCAAGAGCCCTTTCTTCCCAGCCGTTAGGAGGAGTTCAAATCTCTTCAAACCCTCCCCAAAACACTTTAGCAGAGGAAAAACATCGTTCCTCAACCTTTGGTCAGCATTGACAAATCCAGATATACTCGAAGTTAGTTGTGCAAACTCTCAAGTCCCCCAACAAATTCGTGACAGGTCAGTACTCCCCCTCTCTTCATCACAAATATTCCATCAATCAGAAATTCTAATTCCGGGATCAAATATTCTCTTCTTGCGAGGCTCCCCAAGCTCTTACCGTCCCAAGCCTAATAAGATTCAAGACTCAGAAGAGGAATCACT
CATCAGCATAAGTAGTGATGAATTAAATGAGCCGGAAGAAGATGAGAACCATTTAAGTTTGGAATTGGATGAATCCATCGAACATAGAAGTGTAAAGGTATTCTCAATGAAAATTGTTTCTTGGAACATTAGAGGCCTTGGGGATCAATCAAAGCAGCTGGCTGTTAAGCACCTAATTATGAAGACAAATCTAGAATTGGTTTTAATCCAAGAAACAAAAAAAGAGGCATTTGAAGCTGAAGCAATCAAGAAGTTATGGAGTTCAAGAGACATTGGCTGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTGCTAATCATGTGGGATGAAAGTAAAATATCAGTCATAGAAACAATCAAAGGAGGCTACACTCTATCCGTTAAATGTAAGACCTTATGTAGCAAAGTTTGTTGGGTAACAAATGCATACGGACCAACCGATTATAAAGAAAGGAGGCGCATCTGGCGGGAGTTACAAGCTTTGGCAGCATATTGCACTAATGCTTGGTGCTTGGGTGGGGACTTCAACATCACTAGAGCAATCCACGAAAGAGTTCCAATTGGAAGATTAACTAGAGGAATGAAGAAGTTTAGCAAATTCATAGAAGAGGCACACTTAATGGAAATTCCTATGAGCAATGGGCGTTTCACTTGGTCGAGAGAAGGAAACAGAGTATCAAGATCCTTGCTAGACAGATTTCTGGTTCGCATTGTATCAGATCACTTTCCTCTCTTATTAGAAGCTGGGGCGTTAGAATGGGGACCTTCCCCTTTTCGTTTTTGCAACAGTTGGCTGCTTAATTCCCAGTGCAACAGCATTATAATTAGAGCTCTTGCAGCTGGAAATCATCAAGGTTGGGCTGGATTTGTCATTTCTGCTAAGTTCAGATCAGAGGCGGAAATTGTAGAAAAGCTGGATAAAGAGGAGCAGGGGGCAGAATTAGAGGATCCATCCTCCATTTTACAGGACCCAAGGGCATCCTTAAAATCTGATTTGATGAACATTTACAAGAAAAAGGAAAGAGATTTAATCCAGAAGAGTAAGCTGAATTGGTTACATTTAGGGGATGAAAACACGAGATTCTTCCACCGTTTTCTCGCTGCAAAAAAGAGAAAAAATCTTATAGCAGAGCTAGTCAATGATCAAGGCTTTCCAACTAATTCGTATTGTGAGATAGAAGACCAAATCTTAAATTTCTATAAGAATCTTTACACCAAAACTCCAAGTGCTGGCTGTTTTCCTGCAAATCTGGAATGGCAAAGGGTGTCAGTTGAACAAAACAGTAGGCTGTCTTCAAAATTCAGCAGGGAAGAAATCAGATTTGCTTTAAGAGGAATGGGGAAAAATAAAGCGCCAGGTCCGGATGGTTTCACGGTGGAATTCCTTAACAAATTTTGGGATAGAATCAAAGATGATTTTGTTGCCCTATTCAATGAATTTCATGAGAATGGAAGGCTAAATTCCTGTGTGAAGGAGAACTTCATTTGTCTGATCAAGAAAAAAGAAGATGCCATCATGGTTAAAGACTTCCGGCCAATTAGCCTCACTACATTAACATATAAAGTAATTGCCAAAGTGTTAGCGGAAAGGTTGAAATTGATCTTAGATCCAATACTCATTGCCAATGAATTGGTGGAGGACTATAGAATAAAGAAGAAAAAAGGGTGGATTCTAAAGCTTGACCTAGAAAAAGCCTTTGGTAGAGTAGATTGGGGATTTCTAGAAAAGGCTCTACACGGCAAGAATTTCGATTCGAAATGGATCTCCTGGATTCTAGGTTGCATCAAGAACCCTAAATTCTCCATCTTCATTAATGGAAGACCAAGAGGAAGAGTTCAAGCCTCAAGAGGTGTGAGACAAGGAGACCCTCTTTCGCCCTTCTTATTCCTCCTAGTAAGTGAGGTATTGACTAGTCTTATTTCAAGACTTCATAAAAGTAAGAAATTTGAGGGATTTATAGTTGGAAAGAAGAAGGTACATGTTCCAATTCTTCAAT
TCGCAGATGATACACTCCTATTTTGTAAGTACGATCTTGATATGTTGGAAGCCTTGAGGAAAACTATTGAGTTTTTTGAATGGTGCTTCGGACAAAAGGTGAATTGGGATAAATCAGCCTTGTGTGGACTGAACATTGATGATTTAGAGGTTAAATCGACTGCTGCAAGATTAAATTGCAAGGCTGAAAAATTGCCTCTAATGTATCTTGGTCTGCCCCTAGGAGGACACCCAAAAAAGATGGTCTTTTGGCAGCCTATCATTGACAAAATTCAAGGAAAATTGAGCAGGTGGAAAAGAAATAACCTCTCAAGAGGGGGTAGACTTACATTATGCAAGACCGTGCTTTCAAACCTCCCCTCTTACTACATGTCTATCTTTTTAATGCCGGAAAAGGTAGTCCTTTTAATAGAAAGAGCCATGAGAAATTTCTTTTGGGAAGGCCATGGAGGTAGCAAATTGAATCACCTTGCCCGATGGGTAACCGTCACAAAAAACCATAAGGATGGAGGTCTTGGGCTGGAAAATTTGAAAATCAAAAACTTGGCATTGCTGTCCAAGTGGGGTTGGCGTTTTATGCAAGAATCTGAAGCCCTTTGGTGCAAAGAAGTTGCAAGCTTAAGAAGCCCTTGGATAAGCATTTCAAGACAATGGCAGAAAATTGAAGCCCTAGCAATCTTTAAAGTAGGAGATGGAAGAAGAATAACATTCTGGTTTGATCCTTGGCTTGAGGATCAGCCCTTCAAGGTCAGATTTCCAAGACTTTTCGAGCTAGCTCTCAAGCCAAACGGTACAGATGCAGATCATTGGGACCCATGTTCCTCCTCGTGGGATTTATTGTTCAAAAGACGGCTAAAAGAGGAAGAGATAGGTGAGTTTCTGTCCCTCTCAAGCAGTGTGGCAAATAAGAGAGTTATGTTGCAGCCGGATAAAAGGATTTGGGCATTAGAAGGGCAGAGAGTCAACATTCAGATTTGGACCATGCTGTTTGGTTCTTTAAAATGTGCTGCCACGCTTCAAAGGAAACTCCCATATCATTGCTTATCAAGAAGACATCCAGCACCTTTTCTTCGGTTGCAGCTATGCTTCAAGTTGCTGGTCGAGACTGTTTGGCTTTTTCGGTTTGAGTTGGGTAATGGAGAGTGA

Coding sequence (CDS)

ATGCCTTTGACAAAGTGCACCAGGAAGAGGCATTTAGCCTTGCATGCTCGAACCAATCCTCCTAATTGGAAGTTCTGGATTAATTACATTATTGGCGCAAGAATAATAGTCATGGAAGTAGTGAGCTACAAAATCAATGGAGTGTTCTGTTCATGGTTTGAAAAAGGAAAATTCTTAGTAGAAGATATTGAAAATTTCGAAGTTAAAGATCAAGGCGGGATGGAATTTAAGATGTGTAGTTTGGCCGATTTCCGGAGGGAGATTCTATATTCATGTACCAGAAGGAGTAGCTCAACATGGATGATCAGAGTTCTTGAAGATGCTGAAAGGAAGTCTCCAACGATTTGGAAACCACAGAGAAAAGTGCAAAAAACGCTCCATGCTGCTGCAACAATAGAACAGAGTATACAACCTCGTCGAGCTGAAAAGAAGGAGAAAGAGGGCATGGAGAATTTTGTAACGAGTTCAAAGGAAACATCAGCAAAGGACCGGTACTGTGCTCCACTCAAACATCAAAATCAACCGCAATTCTGGGTTAGGAAAAACAGAGAAGTCTTCAAGGAAGACTTTAACAACCTATGGGTTCTCACCAGGTTATTTGCCTCTGATGATTGGAAGGAAATTGCCAAATTCTTAGAGGTCTTCTACAATGTGAAAGTTAAAATCAACCCCCTTTTCGATGACAAAGCCCTATTCCGAGTAGTCCAAGGGAAGATAGAAGATATAATTGAAGCCCCCGACAAATGTAGACCCTCTGTTTTAAGAAGCTTTGGTGGATGGATAGTAATCAAGAACTTATCGCTGGAGTATTGGAGCAGAGAAACATTTGAAGCAATTGGACAACACTTTGGAGGATTGGAGGAGATAAGTATTGAAACACTGAATTTGTTAGACGTTTCAGAGACTAAAATTAAAGTTAGAAAGAATGTTTATGGTTTTATTCCAGCTACAATCGAAATTAACAATGAATTTAGAGGTTCTATCCATTTACTATTTGGAGACATCATACCAATGAAAGATTCAAGTTCCATTATCACGGAATTTATTGGGAGTGATTTCGAAAACCCCATTGACCAAGTACGGTTGAGCAAAGTCGAAGAAGATGAGGTTAGAGTCTTTTCTCCCTCAAAATTGGCGCTGCTTAGTTCGAAACCTGACCAGACCCAACCAGAGCAGGAGGAGAAGTCACTGGAAATATCGGTCGGGAGTCCATCAGTCGGAAAATGGTCGAAGTCGACAGCAAATGGCCTCGAAGGAAAAAGCATTAATGAGAAGGCGGCAAAATTAAATTGGTGTGAGGAAACCGAAGAGGCAGCCGTAGGAAGTTCAATAGGTGTGTTGAAATCGATTCGGCCTATAGAAGTTATGAAGAAAGAGAAGAGGAAAGAGAGGGTGTTTGGCCGAATTAATGAGAATATTAAAAGCATCCAGACCCCTGACATTCCATCTGGGGATTTCCTTCGGTCAGGTGGGCCCACCGGCTTCGAAGTCATCCAAAAAATTCGAAAAGTCCCAAACGTTCTTTTGACAGTAGCAGAAGAATCCGAAAAGTCACCCCCAAATAGCATGAAGTCCAATTGGGAAGAGGTTCCCTTTCAGACATTGCTGCCCTATCATCAGCTCCCTCGAGCTCGCGTGAATGCCTTTCCTCAGCAGTCTCCAGCACACACGATTTCCTCGGTCAAGAGCCCTTTCTTCCCAGCCGTTAGGAGGAGTTCAAATCTCTTCAAACCCTCCCCAAAACACTTTAGCAGAGGAAAAACATCGTTCCTCAACCTTTGGTCAGCATTGACAAATCCAGATATACTCGAAGTTAGTTGTGCAAACTCTCAAGTCCCCCAACAAATTCGTGACAGGTCAGTACTCCCCCTCTCTTCATCACAAATATTCCATCAATCAGAAATTCTAATTCCGGGATCAAATATTCTCTTCTTGCGAGGCTCCCCAAGCTCTTACCGTCCCAAGCCTAATAAGATTCAAGACTCAGAAGAGGAATCACT
CATCAGCATAAGTAGTGATGAATTAAATGAGCCGGAAGAAGATGAGAACCATTTAAGTTTGGAATTGGATGAATCCATCGAACATAGAAGTGTAAAGGTATTCTCAATGAAAATTGTTTCTTGGAACATTAGAGGCCTTGGGGATCAATCAAAGCAGCTGGCTGTTAAGCACCTAATTATGAAGACAAATCTAGAATTGGTTTTAATCCAAGAAACAAAAAAAGAGGCATTTGAAGCTGAAGCAATCAAGAAGTTATGGAGTTCAAGAGACATTGGCTGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTGCTAATCATGTGGGATGAAAGTAAAATATCAGTCATAGAAACAATCAAAGGAGGCTACACTCTATCCGTTAAATGTAAGACCTTATGTAGCAAAGTTTGTTGGGTAACAAATGCATACGGACCAACCGATTATAAAGAAAGGAGGCGCATCTGGCGGGAGTTACAAGCTTTGGCAGCATATTGCACTAATGCTTGGTGCTTGGGTGGGGACTTCAACATCACTAGAGCAATCCACGAAAGAGTTCCAATTGGAAGATTAACTAGAGGAATGAAGAAGTTTAGCAAATTCATAGAAGAGGCACACTTAATGGAAATTCCTATGAGCAATGGGCGTTTCACTTGGTCGAGAGAAGGAAACAGAGTATCAAGATCCTTGCTAGACAGATTTCTGGTTCGCATTGTATCAGATCACTTTCCTCTCTTATTAGAAGCTGGGGCGTTAGAATGGGGACCTTCCCCTTTTCGTTTTTGCAACAGTTGGCTGCTTAATTCCCAGTGCAACAGCATTATAATTAGAGCTCTTGCAGCTGGAAATCATCAAGGTTGGGCTGGATTTGTCATTTCTGCTAAGTTCAGATCAGAGGCGGAAATTGTAGAAAAGCTGGATAAAGAGGAGCAGGGGGCAGAATTAGAGGATCCATCCTCCATTTTACAGGACCCAAGGGCATCCTTAAAATCTGATTTGATGAACATTTACAAGAAAAAGGAAAGAGATTTAATCCAGAAGAGTAAGCTGAATTGGTTACATTTAGGGGATGAAAACACGAGATTCTTCCACCGTTTTCTCGCTGCAAAAAAGAGAAAAAATCTTATAGCAGAGCTAGTCAATGATCAAGGCTTTCCAACTAATTCGTATTGTGAGATAGAAGACCAAATCTTAAATTTCTATAAGAATCTTTACACCAAAACTCCAAGTGCTGGCTGTTTTCCTGCAAATCTGGAATGGCAAAGGGTGTCAGTTGAACAAAACAGTAGGCTGTCTTCAAAATTCAGCAGGGAAGAAATCAGATTTGCTTTAAGAGGAATGGGGAAAAATAAAGCGCCAGGTCCGGATGGTTTCACGGTGGAATTCCTTAACAAATTTTGGGATAGAATCAAAGATGATTTTGTTGCCCTATTCAATGAATTTCATGAGAATGGAAGGCTAAATTCCTGTGTGAAGGAGAACTTCATTTGTCTGATCAAGAAAAAAGAAGATGCCATCATGGTTAAAGACTTCCGGCCAATTAGCCTCACTACATTAACATATAAAGTAATTGCCAAAGTGTTAGCGGAAAGGTTGAAATTGATCTTAGATCCAATACTCATTGCCAATGAATTGGTGGAGGACTATAGAATAAAGAAGAAAAAAGGGTGGATTCTAAAGCTTGACCTAGAAAAAGCCTTTGGTAGAGTAGATTGGGGATTTCTAGAAAAGGCTCTACACGGCAAGAATTTCGATTCGAAATGGATCTCCTGGATTCTAGGTTGCATCAAGAACCCTAAATTCTCCATCTTCATTAATGGAAGACCAAGAGGAAGAGTTCAAGCCTCAAGAGGTGTGAGACAAGGAGACCCTCTTTCGCCCTTCTTATTCCTCCTAGTAAGTGAGGTATTGACTAGTCTTATTTCAAGACTTCATAAAAGTAAGAAATTTGAGGGATTTATAGTTGGAAAGAAGAAGGTACATGTTCCAATTCTTCAAT
TCGCAGATGATACACTCCTATTTTGTAAGTACGATCTTGATATGTTGGAAGCCTTGAGGAAAACTATTGAGTTTTTTGAATGGTGCTTCGGACAAAAGGTGAATTGGGATAAATCAGCCTTGTGTGGACTGAACATTGATGATTTAGAGGTTAAATCGACTGCTGCAAGATTAAATTGCAAGGCTGAAAAATTGCCTCTAATGTATCTTGGTCTGCCCCTAGGAGGACACCCAAAAAAGATGGTCTTTTGGCAGCCTATCATTGACAAAATTCAAGGAAAATTGAGCAGGTGGAAAAGAAATAACCTCTCAAGAGGGGGTAGACTTACATTATGCAAGACCGTGCTTTCAAACCTCCCCTCTTACTACATGTCTATCTTTTTAATGCCGGAAAAGGTAGTCCTTTTAATAGAAAGAGCCATGAGAAATTTCTTTTGGGAAGGCCATGGAGGTAGCAAATTGAATCACCTTGCCCGATGGGTAACCGTCACAAAAAACCATAAGGATGGAGGTCTTGGGCTGGAAAATTTGAAAATCAAAAACTTGGCATTGCTGTCCAAGTGGGGTTGGCGTTTTATGCAAGAATCTGAAGCCCTTTGGTGCAAAGAAGTTGCAAGCTTAAGAAGCCCTTGGATAAGCATTTCAAGACAATGGCAGAAAATTGAAGCCCTAGCAATCTTTAAAGTAGGAGATGGAAGAAGAATAACATTCTGGTTTGATCCTTGGCTTGAGGATCAGCCCTTCAAGGTCAGATTTCCAAGACTTTTCGAGCTAGCTCTCAAGCCAAACGGTACAGATGCAGATCATTGGGACCCATGTTCCTCCTCGTGGGATTTATTGTTCAAAAGACGGCTAAAAGAGGAAGAGATAGGTGAGTTTCTGTCCCTCTCAAGCAGTGTGGCAAATAAGAGAGTTATGTTGCAGCCGGATAAAAGGATTTGGGCATTAGAAGGGCAGAGAGTCAACATTCAGATTTGGACCATGCTGTTTGGTTCTTTAAAATGTGCTGCCACGCTTCAAAGGAAACTCCCATATCATTGCTTATCAAGAAGACATCCAGCACCTTTTCTTCGGTTGCAGCTATGCTTCAAGTTGCTGGTCGAGACTGTTTGGCTTTTTCGGTTTGAGTTGGGTAATGGAGAGTGA
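
The relationship between the coding sequence and the protein sequence of this feature can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper and the `CODON_TABLE` name are illustrative and not part of the database's tooling.

```python
from itertools import product

# Standard genetic code, laid out with the bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Unknown codons (for example ones containing N) become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed for this feature.
print(translate("ATGCCTTTGACAAAGTGCACCAGGAAGAGG"))  # MPLTKCTRKR
print(translate("GGTAATGGAGAGTGA"))                 # GNGE
```

Both fragments agree with the start (MPLTKCTRKR) and end (GNGE) of the protein sequence given for ClCG03G010470.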

Protein sequence

MPLTKCTRKRHLALHARTNPPNWKFWINYIIGARIIVMEVVSYKINGVFCSWFEKGKFLVEDIENFEVKDQGGMEFKMCSLADFRREILYSCTRRSSSTWMIRVLEDAERKSPTIWKPQRKVQKTLHAAATIEQSIQPRRAEKKEKEGMENFVTSSKETSAKDRYCAPLKHQNQPQFWVRKNREVFKEDFNNLWVLTRLFASDDWKEIAKFLEVFYNVKVKINPLFDDKALFRVVQGKIEDIIEAPDKCRPSVLRSFGGWIVIKNLSLEYWSRETFEAIGQHFGGLEEISIETLNLLDVSETKIKVRKNVYGFIPATIEINNEFRGSIHLLFGDIIPMKDSSSIITEFIGSDFENPIDQVRLSKVEEDEVRVFSPSKLALLSSKPDQTQPEQEEKSLEISVGSPSVGKWSKSTANGLEGKSINEKAAKLNWCEETEEAAVGSSIGVLKSIRPIEVMKKEKRKERVFGRINENIKSIQTPDIPSGDFLRSGGPTGFEVIQKIRKVPNVLLTVAEESEKSPPNSMKSNWEEVPFQTLLPYHQLPRARVNAFPQQSPAHTISSVKSPFFPAVRRSSNLFKPSPKHFSRGKTSFLNLWSALTNPDILEVSCANSQVPQQIRDRSVLPLSSSQIFHQSEILIPGSNILFLRGSPSSYRPKPNKIQDSEEESLISISSDELNEPEEDENHLSLELDESIEHRSVKVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDRFLVRIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASLRSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCSSSWDLLFKRRLKEEEIGEFLSLSSSVANKRVMLQPDKRIWALEGQRVNIQIWTMLFGSLKCAATLQRKLPYHCLSRRHPAPFLRLQLCFKLLVETVWLFRFELGNGE
Homology
BLAST of ClCG03G010470 vs. NCBI nr
Match: RVW64408.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])

HSP 1 Score: 713.4 bits (1840), Expect = 4.8e-201
Identity = 473/1543 (30.65%), Positives = 744/1543 (48.22%), Query Frame = 0

Query: 259  GWIVIKNLSLEYWSRETFEAIGQHFGGLEEISIETLNLLDVSETKIKVRKNVYGFIPATI 318
            GW+ ++ L    W       I Q +G + +++ ETL L+D+S+ K+ V  +    +PA +
Sbjct: 262  GWLELRGLPFHLWDEFQLRYILQKWGRVTKVAKETLKLVDLSKVKMWVEMHPKVVLPALL 321

Query: 319  EINN---EFRGSIHLL---FGDIIPMKDSS---SIITEFIGSDFENPIDQVRL-SKVEED 378
            E+ +    F  ++ ++     D +   +S+     +T   G   + P +   L +   ++
Sbjct: 322  EVEDGAWSFTVAVSVIGEAEEDFLLRPESNRSKDEVTSAEGCVHQRPKNAEGLRATARDN 381

Query: 379  EVRVFSPSKLALLSSKPDQTQPEQEEKSLEISVGSPSVGKWSKSTANGLEGKSINEKAAK 438
            E   + P   + +      +  E E+   +  +G          TA  ++G +  E   K
Sbjct: 382  EYHRWRPRHRSRVRYSVSNSDTEVEKGRGKSCLG---------PTAGSVDGLTKPEAFFK 441

Query: 439  LNWCE-ETEEAAVGSSIGVLKSIRPIEVMKKEKRKERVFGRINENIKSIQTPDIPSGDFL 498
             ++     EE  +G S+G      P+         E   G  N        P  PS   L
Sbjct: 442  GHFARAHFEEKNIGPSVG------PVH--------ETEAGSSNGG------PATPSSSKL 501

Query: 499  RSGGPTGFEV--IQKIRKVPNVLLTVAEESEKSPPNSMKSN------------------- 558
            +  G +  EV  I +  K+    +T   ++   PP+   ++                   
Sbjct: 502  QRSGTSAKEVMPIAQSAKLKGNSVTARRKARSWPPSLKVTSIVPKRNLDGDGAPEANRGF 561

Query: 559  -WEEVPFQ--TLLPYHQLPRARVNAFPQQSPAHTISSVKSPFFPAVRRSSNLFKPSPKHF 618
             W    F+   L P  +  R             T + VK    P+V  S    K  P   
Sbjct: 562  IWGRSVFKKGALSPVSEKSRRFTGGTEGDEGISTCNWVKKVRPPSVALSE---KDLP--- 621

Query: 619  SRGKTSFLNLWSALTNPDILEVSCANSQVPQQIRDRSVLPLSSSQIFHQSEI----LIPG 678
                           NP+IL  S  +S +P + +  S +PL SS +  +S      ++  
Sbjct: 622  ----------LDGAFNPNILSDSSVSSVIPSRCQAFSPIPLESSLVSQRSPFPKIAVVEP 681

Query: 679  SNILFLRGSPSSYRPKPNKIQDSEEESLISISSDELNEPEE----------DENHLSLEL 738
            S ++ +        P     ++SEE  L     D L+ PE+          +  H SL L
Sbjct: 682  SRLVGVSRPEGVAFPLETSNRNSEERPLF---KDSLSSPEKLCVSGKAQSPNPEHPSLPL 741

Query: 739  D----------ESIEHRSVKVF-------------------SMKIVSWNIRGLGDQSKQL 798
            +          + ++    K F                    MKI+SWN RGLG + K+ 
Sbjct: 742  EGFQVEGLTPGKMVKKEEEKCFWKPRRGLGLGAGCAGLFLVYMKILSWNTRGLGSKKKRR 801

Query: 799  AVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESK 858
             V+  +   N ++V++QETK+E ++   +  +W  + + W+ + A G SGG++I+WD SK
Sbjct: 802  IVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIVILWDSSK 861

Query: 859  ISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLG 918
            +   E + G ++++VK  +      W+T+ YGP +   R+  W ELQ L       WC+G
Sbjct: 862  LECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLTFPRWCVG 921

Query: 919  GDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDR 978
            GDFN+ R I E++   RLT  M+ F +FI E+ L++ P+ N  FTWS          LDR
Sbjct: 922  GDFNVIRRISEKLGETRLTLNMRCFDEFIRESGLIDPPLRNAAFTWSNMQADPICKRLDR 981

Query: 979  FLV-----------------RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIII 1038
            FL                  R  SDH P+ LE   L+WGP+PFRF N WLL+ +      
Sbjct: 982  FLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLLHPEFKEKFR 1041

Query: 1039 RALAAGNHQGWAGFVISAKFR------SEAEIVEKLD-KEEQGAELEDPSSI-LQDPRAS 1098
                    +GW G     K +       E  I+   D KE +   L D S I L +   +
Sbjct: 1042 VWWLECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGN 1101

Query: 1099 LKSDLM-----------NIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIA 1158
            L SDL+           ++  K+E    QKS++ W+  GD N++FFHR    ++ +  I 
Sbjct: 1102 LNSDLVLERTLKRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIK 1161

Query: 1159 ELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSR 1218
             L++++G   N+  +I ++I+NF+ NLY+K          ++W  +S E    L   F+ 
Sbjct: 1162 SLISERGETLNNIEDISEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGWLDRPFTE 1221

Query: 1219 EEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFIC 1278
            EE+R A+  + K KAPGPDGFT+    + WD IK+D + +F EFH NG +N      FI 
Sbjct: 1222 EEVRRAVFQLNKEKAPGPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIA 1281

Query: 1279 LIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKL------------------ILDPI 1338
            L+ KK  ++ + D+RPISL T  YK+IAKVL+ RL+                   ILD +
Sbjct: 1282 LVPKKSQSVKISDYRPISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAV 1341

Query: 1339 LIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNP 1398
            LIANE+V++ R   ++G + K+D EKA+  VDWGFL+  L  K F  KW  WI GC+ + 
Sbjct: 1342 LIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSS 1401

Query: 1399 KFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKK 1458
             F+I +NG  +G V+ASRG+RQGDPLSPFLF LV++VL+ ++ R  ++   EGF VG+ +
Sbjct: 1402 SFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDR 1461

Query: 1459 VHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKS 1518
              V +LQFADDT+ F K  ++ L+ L+  +  F    G K+N +KS + G+N     + S
Sbjct: 1462 TRVSLLQFADDTIFFSKASMEHLQNLKIILLVFGQVSGLKINLEKSTISGINTRQELLSS 1521

Query: 1519 TAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCK 1578
             A+  +C+  + PL YLGLPLGG+PK + FW P++++I  +L  WK+  LS GGR+TL +
Sbjct: 1522 LASVFDCRVSEWPLSYLGLPLGGNPKTIGFWDPVVERISRRLDGWKKAYLSLGGRITLIQ 1581

Query: 1579 TVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLG 1638
            + LS++PSY++S+F +P  +   IE+  RNF W G G  K +HL RW  V++  + GGLG
Sbjct: 1582 SCLSHIPSYFLSLFKIPASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRPKELGGLG 1641

Query: 1639 LENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------RSPWISIS 1650
               + ++N+ALL KW WRF +E   LW K + S+                  R PW +I+
Sbjct: 1642 FGKISLRNIALLGKWLWRFPRERSGLWYKVIGSIYGTHPNGWDANMVVRWSHRCPWKAIA 1701
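
The percentages in an HSP summary line are the ratio of matching columns to the alignment length. The sketch below recomputes them from the raw fractions; `hsp_fractions` is a hypothetical helper, and it reads only the `a/b` fractions, so it makes no assumption about the labels around them.

```python
import re

def hsp_fractions(line):
    """Return (matches, length, percent) for each 'a/b' fraction in the line."""
    results = []
    for a, b in re.findall(r"(\d+)/(\d+)", line):
        matches, length = int(a), int(b)
        results.append((matches, length, round(100.0 * matches / length, 2)))
    return results

summary = "Identity = 473/1543 (30.65%), Positives = 744/1543 (48.22%), Query Frame = 0"
for matches, length, pct in hsp_fractions(summary):
    print(f"{matches}/{length} = {pct:.2f}%")
```

For the first hit this reproduces the reported 30.65% identity and 48.22% positives over 1543 aligned columns.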

BLAST of ClCG03G010470 vs. NCBI nr
Match: RVW16209.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])

HSP 1 Score: 695.7 bits (1794), Expect = 1.0e-195
Identity = 366/1004 (36.45%), Positives = 547/1004 (54.48%), Query Frame = 0

Query: 701  FSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGW 760
            F MKI+SWN+RGLG ++K+  +K  +   N ++V+IQETKKE  +   +  +W+ R+  W
Sbjct: 49   FPMKIISWNVRGLGSRNKRRMIKDFLRSENPDVVMIQETKKENCDRRFVGSVWTVRNKDW 108

Query: 761  SFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERR 820
              + A G SGG+LI+WD   +S  E + G +++SVK         W++  YGP     R+
Sbjct: 109  VALPASGASGGILIIWDSKILSREEVVIGSFSVSVKFSLDGCGPLWISAVYGPNSPSLRK 168

Query: 821  RIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMS 880
              W EL  +       WC+GGDFN+ R   E++    LT  M+ F  FI E  L++ P+ 
Sbjct: 169  DFWVELFDIYGLTYPLWCVGGDFNVIRGSSEKMGGSSLTPSMRDFDSFISECELLDPPLR 228

Query: 881  NGRFTWSREGNRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGP 940
            N  FTWS          LDRF                 L+R  SDH+P++++     WGP
Sbjct: 229  NASFTWSNIQESPVCKRLDRFLYSNEWGLLFPQGLQEALIRRTSDHWPIVMDTNPFMWGP 288

Query: 941  SPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGAELE 1000
            +PFRF N WL ++          +     GW G     KF    + V+   KE       
Sbjct: 289  TPFRFENMWLQHTNFKENFRDWWSGFQGIGWEGH----KFMRRLQYVKAKLKEWN----- 348

Query: 1001 DPSSILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLI 1060
               S   + +   K +L  +  ++E    QK+K+ W+  GD N++F+H+    ++ +  I
Sbjct: 349  --KSSFGELKEKKKRELEELILREEIHWRQKAKVKWVKEGDCNSKFYHKVANGRRNRKYI 408

Query: 1061 AELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFS 1120
             EL N++G    +   I ++IL++++ LYT           L+W  +S E   RL S F+
Sbjct: 409  KELENERGLVLKNAESITEEILHYFEKLYTNPTGESWGVEGLDWSPISEESALRLESPFT 468

Query: 1121 REEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFI 1180
             EEI  A+  + ++KAPGPDGFT+    + WD IK+D V +F EFH +G +N     +FI
Sbjct: 469  EEEISKAIFQLDRDKAPGPDGFTIAVFQECWDVIKEDLVRVFAEFHRSGIINQSTNASFI 528

Query: 1181 CLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL------------------KLILDP 1240
             LI KK  +  + DFRPISL T  YK+IAKVL+ RL                  + ILD 
Sbjct: 529  VLIPKKSLSKRISDFRPISLITSLYKIIAKVLSGRLRGVLHETIHYTQGAFVQGRQILDA 588

Query: 1241 ILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKN 1300
            +LIANE+V++ R   ++G + K+D EKA+  V W FL+  L  K F  +W  W+ GC+ +
Sbjct: 589  VLIANEIVDERRRSGEEGVVFKIDFEKAYDHVKWDFLDHMLEKKGFSPRWRKWMSGCLSS 648

Query: 1301 PKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKK 1360
              F+I +NG  +G V+ASRG+RQGDPLSPFLF LV++VL+ ++ R  +    EGF VG+ 
Sbjct: 649  VSFAILVNGSAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLMRAEERNMLEGFRVGRN 708

Query: 1361 KVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVK 1420
            +  V  LQFADD + F     + L+ L+  +  F   FG KVN +KS++ G+N+D   + 
Sbjct: 709  RTRVSHLQFADDAIFFSNSREEELQTLKSLLLVFGHIFGLKVNLNKSSIYGINLDQAHLS 768

Query: 1421 STAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLC 1480
              A  L+CKA   P++YLGLPLGG+PK   FW P++++I  +L  W++  LS GGR+TL 
Sbjct: 769  RLAEMLDCKASGWPILYLGLPLGGNPKSCGFWDPVVERISSRLDGWQKAYLSFGGRITLI 828

Query: 1481 KTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGL 1540
            ++ L++LPSY++S+F MP  V   IER  R+F W G G  K +HL RW  V +    GGL
Sbjct: 829  QSCLTHLPSYFLSLFKMPATVAAKIERLQRDFLWSGIGEGKRDHLVRWDIVCRPKTIGGL 888

Query: 1541 GLENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------RSPWISI 1600
            GL N+  +NLALL KW WR+ +E  ALW + + S+                  R PW +I
Sbjct: 889  GLGNISRRNLALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDANTVVRWSHRCPWKAI 948

Query: 1601 SRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCS 1650
            ++ +Q+   +  +  G+G RI FW D W  DQP   ++PRLF + +  N + +    P  
Sbjct: 949  AQVFQEFSLITRYVAGNGDRIRFWEDLWRGDQPLGTQYPRLFRVVVDKNISISSVLGPSR 1008

BLAST of ClCG03G010470 vs. NCBI nr
Match: CAN68838.1 (hypothetical protein VITISV_030956 [Vitis vinifera])

HSP 1 Score: 689.5 bits (1778), Expect = 7.4e-194
Identity = 373/1052 (35.46%), Positives = 554/1052 (52.66%), Query Frame = 0

Query: 679  EEDENHLSLELDESIEHRSVKVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQE 738
            EE   H  + L        V  F MKI+SWN RGLG + K+  VK  +     ++V+ QE
Sbjct: 806  EEQMLHRIVRLSGFGSEIRVTKFHMKIISWNTRGLGSKKKRRVVKDFLRSEKPDVVMFQE 865

Query: 739  TKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCK 798
            TKKE  +   +  +W++R+  W+ + A G SGG+LI+WD  K+S  E + G +++S+K  
Sbjct: 866  TKKEECDRRFVGSVWTARNKDWAALPACGASGGILIIWDTKKLSREEVMLGSFSVSIKFT 925

Query: 799  TLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRL 858
                +  W++  YGP +   R+ +W EL  +A   +  WC+GGDFN+ R   E++   RL
Sbjct: 926  LNGCESLWLSAVYGPNNSALRKDLWVELSDIAGLASPRWCVGGDFNVIRRSSEKLGGSRL 985

Query: 859  TRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDRFLV--------------- 918
            T  MK F  FI +  L+++P+ +  FTWS          LDRFL                
Sbjct: 986  TPSMKDFDDFISDCELIDLPLRSASFTWSNMQVNPVCKRLDRFLYSNEWEQTFPQSIQGV 1045

Query: 919  --RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISA 978
              R  SDH+P++LE    +WGP+PFRF N WL +        R        GW G     
Sbjct: 1046 LPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHPSFKENFGRWWREFQGNGWEGHKFMR 1105

Query: 979  KF---RSEAEIVEKLDKEEQGAELEDPSSILQD----------------PRASLKSDLMN 1038
            K    +++ ++  K    E     ED  S L +                 RA  K +L  
Sbjct: 1106 KLQFVKAKLKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLAQRAIKKGELEE 1165

Query: 1039 IYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIED 1098
            +  ++E    QK+++ W+  GD N++FFH+    ++ +  I EL N+ G   N+   I++
Sbjct: 1166 LILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNNSESIKE 1225

Query: 1099 QILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGP 1158
            +IL +++ LYT           L+W  +S E   RL S F+ EEI  A+  M ++KAPGP
Sbjct: 1226 EILRYFEKLYTSPSGESWRVEGLDWSPISGESAVRLESPFTEEEICKAIFQMDRDKAPGP 1285

Query: 1159 DGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPIS 1218
            DGFT+      W+ IK+D V +F EFH +G +N     +FI L+ KK  +  + DFRPIS
Sbjct: 1286 DGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRISDFRPIS 1345

Query: 1219 LTTLTYKVIAKVLAERL------------------KLILDPILIANELVEDYRIKKKKGW 1278
            L T  YK+IAKVLA R+                  + ILD +LIANE+V++ R   ++G 
Sbjct: 1346 LITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRRSGEEGV 1405

Query: 1279 ILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASR 1338
            + K+D EKA+  V W FL+  +  K F  +W  W+ GC+ +  F++ +NG  +G V+ASR
Sbjct: 1406 VFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKWMRGCLSSVSFAVLVNGNAKGWVKASR 1465

Query: 1339 GVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKY 1398
            G+RQGDPLSPFLF +V++VL+ ++ +  +    EGF VG+ +  V  LQFADDT+ F   
Sbjct: 1466 GLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFKVGRNRTRVSHLQFADDTIFFSSS 1525

Query: 1399 DLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLG 1458
              + +  L+  +  F    G KVN DKS + G+N++   +   A  L+CKA   P++YLG
Sbjct: 1526 REEDMMTLKNVLLVFGHISGLKVNLDKSNIYGINLEQNHLSRLAEMLDCKASGWPILYLG 1585

Query: 1459 LPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPE 1518
            LPLGG+PK   FW P+I++I  +L  W++  LS GGR+TL ++ L+++P Y++S+F +P 
Sbjct: 1586 LPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSFGGRITLIQSCLTHMPCYFLSLFKIPA 1645

Query: 1519 KVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWR 1578
             V   IER  R+F W G G  K +HL  W  V K    GGLG   + I+N+ALL KW WR
Sbjct: 1646 SVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCKPKSRGGLGFGKISIRNVALLGKWLWR 1705

Query: 1579 FMQESEALWCKEVASL------------------RSPWISISRQWQKIEALAIFKVGDGR 1638
            + +E  ALW + + S+                  R PW +I+  +Q+      F VG+G 
Sbjct: 1706 YPREGSALWHQVILSIYGSHSNGWDVNNIVRWSHRCPWKAIALVYQEFSKFTRFVVGNGD 1765

Query: 1639 RITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCSS--------SWDLLFKRRL 1650
            RI FW D W  +QP  V++PRL  +    N        P SS        SW+  F+R L
Sbjct: 1766 RIRFWDDLWWGEQPLGVQYPRLLRVVTDKNA-------PISSILGSTRPFSWNFTFRRNL 1825

BLAST of ClCG03G010470 vs. NCBI nr
Match: RVW70235.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])

HSP 1 Score: 688.3 bits (1775), Expect = 1.6e-193
Identity = 396/1178 (33.62%), Positives = 594/1178 (50.42%), Query Frame = 0

Query: 564  PFFPAVRRSSNLFKPSPKHFSRGKTSFLNLWSALTNPDILEVSCANSQVPQQIR------ 623
            PF P+    SN     P H     T          +PD    S + S  P + R      
Sbjct: 677  PFIPSSSGFSNSLLNPPVHIQCPSTPM--------SPD----SSSQSLAPMENRVKSKFF 736

Query: 624  -----DRSVLPLSSSQIFHQSEILIPGSNILFLRGSPSSYRPKPNKIQDSEEESLISISS 683
                 D    P+    +  ++E++ P       + S S+     N     +E S  ++  
Sbjct: 737  SKKGNDEGHFPVDIPSLEMETEVIQPADP---YQMSESANSLSANLRLPCKESSKATVHL 796

Query: 684  DELNEPEEDENHLSLELDESIEHRSVKVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLE 743
              + + EE   H  + L        V  F MKI+SWN RGLG + K+  VK  +     +
Sbjct: 797  GGIFKEEEQMLHRIVRLSGFGSEIRVTKFHMKIISWNTRGLGSKKKRRVVKDFLRSEKPD 856

Query: 744  LVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESKISVIETIKGGYT 803
            +V+ QETKKE  +   +  +W++R+  W+ + A G SGG+LI+WD  K+S  E + G ++
Sbjct: 857  VVMFQETKKEECDRRFVGSVWTARNKDWAALPACGASGGILIIWDTKKLSREEVMLGSFS 916

Query: 804  LSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLGGDFNITRAIHER 863
            +S+K      +  W++  YGP +   R+ +W EL  +A   +  WC+GGDFN+ R   E+
Sbjct: 917  VSIKFTLNGCESLWLSAVYGPNNSALRKDLWVELSDIAGLASPRWCVGGDFNVIRRSSEK 976

Query: 864  VPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDRFLV--------- 923
            +   RLT  MK F  FI +  L+++P+ +  FTWS          LDRFL          
Sbjct: 977  LGGSRLTPSMKDFDDFISDCELIDLPLRSASFTWSNMQVNPVCKRLDRFLYSNEWEQTFP 1036

Query: 924  --------RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWA 983
                    R  SDH+P++LE    +WGP+PFRF N WL +        R        GW 
Sbjct: 1037 QSIQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHPSFKENFGRWWREFQGNGWE 1096

Query: 984  GFVISAKF---RSEAEIVEKLDKEEQGAELEDPSSILQD----------------PRASL 1043
            G     K    +++ ++  K    E     ED  S L +                 RA  
Sbjct: 1097 GHKFMRKLQFVKAKLKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLAQRAIK 1156

Query: 1044 KSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNS 1103
            K +L  +  ++E    QK+++ W+  GD N++FFH+    ++ +  I EL N+ G   N+
Sbjct: 1157 KGELEELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNN 1216

Query: 1104 YCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGK 1163
               I+++IL +++ LYT           L+W  +S E   RL S F+ EEI  A+  M +
Sbjct: 1217 SESIKEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAVRLESPFTEEEICKAIFQMDR 1276

Query: 1164 NKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVK 1223
            +KAPGPDGFT+      W+ IK+D V +F EFH +G +N     +FI L+ KK  +  + 
Sbjct: 1277 DKAPGPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRIS 1336

Query: 1224 DFRPISLTTLTYKVIAKVLAERL------------------KLILDPILIANELVEDYRI 1283
            DFRPISL T  YK+IAKVLA R+                  + ILD +LIANE+V++ R 
Sbjct: 1337 DFRPISLITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRR 1396

Query: 1284 KKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRG 1343
              ++G + K+D EKA+  V W FL+  +  K F  +W  W+ GC+ +  F++ +NG  +G
Sbjct: 1397 SGEEGVVFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKWMRGCLSSVSFAVLVNGNAKG 1456

Query: 1344 RVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDT 1403
             V+ASRG+RQGDPLSPFLF +V++VL+ ++ +  +    EGF VG+ +  V  LQFADDT
Sbjct: 1457 WVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFKVGRNRTRVSHLQFADDT 1516

Query: 1404 LLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKL 1463
            + F     + +  L+  +  F    G KVN DKS + G+N++   +   A  L+CKA   
Sbjct: 1517 IFFSSSREEDMMTLKNVLLVFGHISGLKVNLDKSNIYGINLEQNHLSRLAEMLDCKASGW 1576

Query: 1464 PLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMS 1523
            P++YLGLPLGG+PK   FW P+I++I  +L  W++  LS GGR+TL ++ L+++P Y++S
Sbjct: 1577 PILYLGLPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSFGGRITLIQSCLTHMPCYFLS 1636

Query: 1524 IFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALL 1583
            +F +P  V   IER  R+F W G G  K +HL  W  V K    GGLG   + I+N+ALL
Sbjct: 1637 LFKIPASVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCKPKSRGGLGFGKISIRNVALL 1696

Query: 1584 SKWGWRFMQESEALWCKEVASL------------------RSPWISISRQWQKIEALAIF 1643
             KW WR+ +E  ALW + + S+                  R PW +I+  +Q+      F
Sbjct: 1697 GKWLWRYPREGSALWHQVILSIYGSHSNGWDVNNIVRWSHRCPWKAIALVYQEFSKFTRF 1756

Query: 1644 KVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGTDADHWDPCSS--------SWDL 1650
             VG+G RI FW D W  +QP  V++PRL  +    N        P SS        SW+ 
Sbjct: 1757 VVGNGDRIRFWDDLWWGEQPLGVQYPRLLRVVTDKNA-------PISSILGSTRPFSWNF 1816

BLAST of ClCG03G010470 vs. NCBI nr
Match: RVW65579.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])

HSP 1 Score: 688.0 bits (1774), Expect = 2.1e-193
Identity = 367/1021 (35.95%), Positives = 544/1021 (53.28%), Query Frame = 0

Query: 703  MKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSF 762
            MKI+SWN RGLG + K+  VK+ +     ++V+IQETKKE  +   +  +WS R+  W+ 
Sbjct: 1    MKIISWNTRGLGSKKKRRVVKNFLSSEKPDVVMIQETKKEECDRRLVGSVWSVRNKDWAA 60

Query: 763  VEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRI 822
            + A G SGG+LI+WD  K+   E + G +++S+K      +  W++  YGP +   R+  
Sbjct: 61   LPASGASGGILIIWDSIKMRREEVVLGSFSVSIKFAMDGCESLWLSAVYGPNNSALRKDF 120

Query: 823  WRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNG 882
            W EL  +A      WC+GGDFN+ R   E++   RLT  MK F +FI +  L++ P+ + 
Sbjct: 121  WVELSDIAGLSHPRWCVGGDFNVIRRSSEKLGGSRLTPCMKDFDEFIRDCELIDSPLRSV 180

Query: 883  RFTWSREGNRVSRSLLDRFLV-----------------RIVSDHFPLLLEAGALEWGPSP 942
             +TWS          LDRFL                  R  SDH+P++LE    +WGP+P
Sbjct: 181  SYTWSNMQENPVCKRLDRFLYSNEWEQVFPQSLQGVLPRWTSDHWPIVLETNPFKWGPTP 240

Query: 943  FRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA----- 1002
            FRF N WL +S       R  +     GW G     K +     +++ +K   G      
Sbjct: 241  FRFENMWLQHSSFKENFGRWWSEFQGNGWEGHKFMRKLQFVKAKLKEWNKTSFGELSKKK 300

Query: 1003 -----------ELEDPSSILQD---PRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDEN 1062
                        LE    + Q+    RA  K +L  +  ++E    QK+++ W+  GD N
Sbjct: 301  KDILAVLANFDSLEQEGGLSQELLVQRAFSKGELEELILREEIHWRQKARVKWVKKGDCN 360

Query: 1063 TRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLE 1122
            ++FFH+    ++ +  I EL N+ G   N+   I+++IL +++ LY            L+
Sbjct: 361  SKFFHKVANGRRNRKFIKELENESGLMLNNPESIKEEILKYFEKLYACPSRESWRVEGLD 420

Query: 1123 WQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFN 1182
            W  +  E  SRL S F+ EEI  A+  M ++KAPGPDGFT+      WD IK+D V +F 
Sbjct: 421  WSPIDGESASRLESPFTEEEIYKAIFQMDRDKAPGPDGFTIAVFQDCWDVIKEDLVRVFA 480

Query: 1183 EFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL------- 1242
            EFH +G +N     +FI L+ KK  +  + DFRPISL T  YK+IAKVLA RL       
Sbjct: 481  EFHRSGIINQSTNASFIVLLPKKSISRRISDFRPISLITSLYKIIAKVLAGRLRGVLHET 540

Query: 1243 -----------KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHG 1302
                       + ILD +LIANE+V++ R   ++G + K+D EKA+  V W FL+  L  
Sbjct: 541  IHSTQGAFVQGRQILDAVLIANEIVDEKRRTGEEGVVFKIDFEKAYDHVSWDFLDHVLEM 600

Query: 1303 KNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLI 1362
            K F  +W  W+ GC+ +  +++ +NG  +G V+ASRG+RQGDPLSPFLF +V++VL+ ++
Sbjct: 601  KGFSLRWRKWMRGCLSSVSYAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRML 660

Query: 1363 SRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVN 1422
             +  +    EGF VG+ +  V  LQFADDT+ F     + L  L+  +  F    G KVN
Sbjct: 661  LKAEERNVLEGFRVGRNRTRVSHLQFADDTIFFSSTREEDLMTLKSVLLVFGHISGLKVN 720

Query: 1423 WDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKL 1482
             DKS + G+NI+   +   A  L+CKA   P++YLGLPLGG+PK   FW P+I++I  +L
Sbjct: 721  LDKSNIYGINIEQNHLSRLAVMLDCKASGWPILYLGLPLGGNPKASGFWDPVIERISRRL 780

Query: 1483 SRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLN 1542
              W++  LS GGR+TL ++ L+++P Y++S+F +P  V   IER  R F W G G  K +
Sbjct: 781  DGWQKAYLSFGGRITLIQSCLTHMPCYFLSLFRIPASVAAKIERMQREFLWSGVGEGKRD 840

Query: 1543 HLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL-------- 1602
            HL  W  V K    GGLG   + ++N+ALL KW WR+ +E  ALW + + S+        
Sbjct: 841  HLVNWDVVCKPKSRGGLGFGKISMRNVALLGKWLWRYPREGSALWHQVILSIYGSHSNGW 900

Query: 1603 ----------RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFE 1650
                      R PW +I+  +Q+      F VGDG RI FW D W  DQP   ++PRL  
Sbjct: 901  DVNNNVRWSHRCPWKAIALVFQEFSKFTRFVVGDGDRIRFWDDLWWGDQPLGTQYPRLLS 960

BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 3.4e-48
Identity = 210/906 (23.18%), Positives = 375/906 (41.39%), Query Frame = 0

Query: 705  IVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGW-SFV 764
            I++ N+ GL    K+  +   I   +  +  IQET     +   +K        GW    
Sbjct: 10   ILTLNVNGLNSPIKRHRLASWIKSQDPSVCCIQETHLTCRDTHRLKIK------GWRKIY 69

Query: 765  EAYG---RSGGLLIMWDES--KISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKE 824
            +A G   ++G  +++ D++  K + I+  K G+ + VK  ++  +   + N Y P +   
Sbjct: 70   QANGKQKKAGVAILVSDKTDFKPTKIKRDKEGHYIMVK-GSIQQEELTILNIYAP-NTGA 129

Query: 825  RRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIP 884
             R I + L  L     +   + GDFN   +I +R    ++ +  ++ +  + +  L++I 
Sbjct: 130  PRFIKQVLSDLQRDLDSHTLIMGDFNTPLSILDRSTRQKVNKDTQELNSALHQTDLIDIY 189

Query: 885  ------------MSNGRFTWSREGNRV-SRSLLDR-----FLVRIVSDHFPLLLE----- 944
                         S    T+S+  + V S++LL +      +   +SDH  + LE     
Sbjct: 190  RTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKN 249

Query: 945  ---AGALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQG-----WAGF--VISAKF-- 1004
               + +  W  +     + W+ N     I +      N        W  F  V   KF  
Sbjct: 250  LTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIA 309

Query: 1005 ---------RSEAEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQK 1064
                     RS+ + +    KE +  E     +  +     ++++L  I  +K    I +
Sbjct: 310  LNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINE 369

Query: 1065 SKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYT- 1124
            S+  +    ++  R   R +  K+ KN I  + ND+G  T    EI+  I  +YK+LY  
Sbjct: 370  SRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYAN 429

Query: 1125 ---KTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFL 1184
                      F       R++ E+   L+   +  EI   +  +   K+PGPDGFT EF 
Sbjct: 430  KLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFY 489

Query: 1185 NKFWDRIKDDFVALFNEFHENGRL-NSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYK 1244
             ++ + +    + LF    + G L NS  + + I + K   D    ++FRPISL  +  K
Sbjct: 490  QRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAK 549

Query: 1245 VIAKVLAERLKLILDPIL-------------------IANELVEDYRIKKKKGWILKLDL 1304
            ++ K+LA R++  +  ++                     N +    R K K   I+ +D 
Sbjct: 550  ILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHVIISIDA 609

Query: 1305 EKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGD 1364
            EKAF ++   F+ K L+    D  ++  I      P  +I +NG+         G RQG 
Sbjct: 610  EKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGC 669

Query: 1365 PLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLE 1424
            PLSP LF +V EVL   I    + K+ +G  +GK++V + +  FADD +++ +  +   +
Sbjct: 670  PLSPLLFNIVLEVLARAI---RQEKEIKGIQLGKEEVKLSL--FADDMIVYLENPIVSAQ 729

Query: 1425 ALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGH 1484
             L K I  F    G K+N  KS     N +          L        + YLG+ L   
Sbjct: 730  NLLKLISNFSKVSGYKINVQKSQAFLYNNNRQTESQIMGELPFTIASKRIKYLGIQLTRD 789

Query: 1485 PKKMV--FWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVL 1531
             K +    ++P++ +I+   ++WK    S  GR+ + K  +  LP        +P K+ +
Sbjct: 790  VKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAI--LPKVIYRFNAIPIKLPM 849

BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 3.7e-47
Identity = 199/784 (25.38%), Positives = 325/784 (41.45%), Query Frame = 0

Query: 809  NAYGPTDYKERRRIWRELQALAAYCTN--AWCLGGDFNITRAIHERVPIGRLTRGMKKFS 868
            N Y PT   ER R +  L A      +  A  +GGDFN T    +R    +         
Sbjct: 108  NVYAPTTGPERARFFESLSAYMETIDSDEALIIGGDFNYTLDARDRNVPKKRDSSESVLR 167

Query: 869  KFIEEAHLMEIPMSNG----RFTWSR-EGNRVSRSLLDRFLVRIVSDHFPLLLEAGALEW 928
            + I    L+++          FT+ R     VS+S +DR     +S H     ++  +  
Sbjct: 168  ELIAHFSLVDVWREQNPETVAFTYVRVRDGHVSQSRIDRI---YISSHLMSRAQSSTIRL 227

Query: 929  GPSPFRFCNS--------------WLLNSQCNSII----IRALAAGNHQGWAGF------ 988
             P     C S              W  N   NS++             +GW  F      
Sbjct: 228  APFSDHNCVSLRMSIAPSLPKAAYWHFN---NSLLEDEGFAKSVRDTWRGWRAFQDEFAT 287

Query: 989  ---------------------VISAKFRSEAEIV--EKLDKEEQGAELEDPSSILQDPRA 1048
                                  +S +  +E E +  E LD E++ +  ED +  LQ    
Sbjct: 288  LNQWWDVGKVHLKLLCQEYTKSVSGQRNAEIEALNGEVLDLEQRLSGSEDQA--LQCEYL 347

Query: 1049 SLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPT 1108
              K  L N+ +++ R    +S++  L   D  +RFF+     K  +  I  L  + G P 
Sbjct: 348  ERKEALRNMEQRQARGAFVRSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLFAEDGTPL 407

Query: 1109 NSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQR---VSVEQNSRLSSKFSREEIRFAL 1168
                 I D+  +FY+NL++  P +      L W     VS  +  RL +  + +E+  AL
Sbjct: 408  EDPEAIRDRARSFYQNLFSPDPISPDACEEL-WDGLPVVSERRKERLETPITLDELSQAL 467

Query: 1169 RGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKED 1228
            R M  NK+PG DG T+EF   FWD +  DF  +  E  + G L    +   + L+ KK D
Sbjct: 468  RLMPHNKSPGLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGD 527

Query: 1229 AIMVKDFRPISLTTLTYKVIAKVLAERLKLIL------------------DPILIANELV 1288
              ++K++RP+SL +  YK++AK ++ RLK +L                  D + +  +L+
Sbjct: 528  LRLIKNWRPVSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLL 587

Query: 1289 EDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFIN 1348
               R        L LD EKAF RVD  +L   L   +F  +++ ++     + +  + IN
Sbjct: 588  HFARRTGLSLAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKIN 647

Query: 1349 GRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQ 1408
                  +   RGVRQG PLS  L+ L  E    L+      K+  G ++ +  + V +  
Sbjct: 648  WSLTAPLAFGRGVRQGCPLSGQLYSLAIEPFLCLL-----RKRLTGLVLKEPDMRVVLSA 707

Query: 1409 FADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVK-STAARLN 1468
            +ADD +L  + DL  LE  ++  E +      ++NW KS+  GL    L+V     A  +
Sbjct: 708  YADDVILVAQ-DLVDLERAQECQEVYAAASSARINWSKSS--GLLEGSLKVDFLPPAFRD 767

Query: 1469 CKAEKLPLMYLGLPLGG--HPKKMVFWQPIIDKIQGKLSRWK---RNNLSRGGRLTLCKT 1508
               E   + YLG+ L    +P    F + + + +  +L +WK   +    RG  L + + 
Sbjct: 768  ISWESKIIKYLGVYLSAEEYPVSQNFIE-LEECVLTRLGKWKGFAKVLSMRGRALVINQL 827

BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 1.2e-45
Identity = 208/909 (22.88%), Positives = 386/909 (42.46%), Query Frame = 0

Query: 703  MKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSF 762
            + I S N+ GL    K+  +   I K   ++  IQE+         +K  +  +  GWS 
Sbjct: 7    LSIFSINVNGLNCPLKRHRLADWIQKLKPDICCIQESHL------TLKDKYRLKVKGWSS 66

Query: 763  V-EAYG--RSGGLLIMWDES---KISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDY 822
            + +A G  +  G+ I++ ++   K + I   K G+ + VK  T   ++  + N Y P ++
Sbjct: 67   IFQANGKQKKAGIAILFADAIGFKPTKIRKDKDGHFIFVKGNTQYDEIS-IINIYAP-NH 126

Query: 823  KERRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLME 882
               + I   L  ++   ++   + GDFN   A+ +R    +L++ +   +  I+   L +
Sbjct: 127  NAPQFIRETLTDMSNLISSTSIVVGDFNTPLAVLDRSSKKKLSKEILDLNSTIQHLDLTD 186

Query: 883  IP------------MSNGRFTWSREGNRVS-RSLLDRF-----LVRIVSDHFPLLLEAG- 942
            I              S+   T+S+  + +  +S L +F     +  I SDH  + +E   
Sbjct: 187  IYRTFHPNKTEYTFFSSAHGTYSKIDHILGHKSNLSKFKKIEIIPCIFSDHHGIKVELNN 246

Query: 943  -------ALEWGPSPFRFCNSWLLNSQCNSIIIRALAAGNHQG------W--AGFVISAK 1002
                      W  +     ++W+++ +    I + L   N+Q       W  A  V+  K
Sbjct: 247  NRNLHTHTKTWKLNNLMLKDTWVID-EIKKEITKFLEQNNNQDTNYQNLWDTAKAVLRGK 306

Query: 1003 FRSEAEIVEKLDKEE-----------QGAELEDPSSILQDPRASLKSDLMNIYKKKERDL 1062
            F +    ++K ++EE           +  E  +P    +     ++++L  I  K+    
Sbjct: 307  FIALQAFLKKTEREEVNNLMGHLKQLEKEEHSNPKPSRRKEITKIRAELNEIENKRIIQQ 366

Query: 1063 IQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNL 1122
            I KSK  +    ++  +        K+ K+LI+ + N     T    EI+  +  +YK L
Sbjct: 367  INKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQKILNEYYKKL 426

Query: 1123 YT-KTPSAGCFPANLE---WQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTV 1182
            Y+ K  +       LE     R+S ++   L+   S  EI   ++ + K K+PGPDGFT 
Sbjct: 427  YSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKKSPGPDGFTS 486

Query: 1183 EFLNKFWDRIKDDFVALFNEFHENGRL-NSCVKENFICLIKKKEDAIMVKDFRPISLTTL 1242
            EF   F + +    + LF    + G L N+  + N   + K  +D    +++RPISL  +
Sbjct: 487  EFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPTRKENYRPISLMNI 546

Query: 1243 TYKVIAKVLAERLKLILDPIL-------------------IANELVEDYRIKKKKGWILK 1302
              K++ K+L  R++  +  I+                     N +    ++K K   IL 
Sbjct: 547  DAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQHINKLKNKDHMILS 606

Query: 1303 LDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVR 1362
            +D EKAF  +   F+ + L     +  ++  I      P  +I +NG          G R
Sbjct: 607  IDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILNGVKLKSFPLRSGTR 666

Query: 1363 QGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLD 1422
            QG PLSP LF +V EVL   I    + K  +G  +G +++ + +  FADD +++ +   D
Sbjct: 667  QGCPLSPLLFNIVMEVLAIAI---REEKAIKGIHIGSEEIKLSL--FADDMIVYLENTRD 726

Query: 1423 MLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPL 1482
                L + I+ +    G K+N  KS       ++   K+    +        + YLG+ L
Sbjct: 727  STTKLLEVIKEYSNVSGYKINTHKSVAFIYTNNNQAEKTVKDSIPFTVVPKKMKYLGVYL 786

Query: 1483 GGHPKKMV--FWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEK 1529
                K +    ++ +  +I   +++WK    S  GR+ + K  +S LP    +   +P K
Sbjct: 787  TKDVKDLYKENYETLRKEIAEDVNKWKNIPCSWLGRINIVK--MSILPKAIYNFNAIPIK 846

BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 177.2 bits (448), Expect = 1.6e-42
Identity = 212/913 (23.22%), Positives = 381/913 (41.73%), Query Frame = 0

Query: 705  IVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFV- 764
            ++S NI GL    K+  +   + K +     +QET     +   +      R  GW  + 
Sbjct: 17   LISLNINGLNSPIKRHRLTDWLHKQDPTFCCLQETHLREKDRHYL------RVKGWKTIF 76

Query: 765  EAYG--RSGGLLIMWDES---KISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKE 824
            +A G  +  G+ I+  +    +  VI+  K G+ + +K K L  ++  + N Y P + + 
Sbjct: 77   QANGLKKQAGVAILISDKIDFQPKVIKKDKEGHFILIKGKILQEELS-ILNIYAP-NARA 136

Query: 825  RRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEI- 884
               I   L  L AY      + GDFN   +  +R    +L R   K ++ +++  L +I 
Sbjct: 137  ATFIRDTLVKLKAYIAPHTIIVGDFNTPLSSKDRSWKQKLNRDTVKLTEVMKQMDLTDIY 196

Query: 885  ----PMSNGRFTWSREGNRVS--------RSLLDRF-----LVRIVSDHFPL-LLEAGAL 944
                P + G   +S      S        ++ L+R+     +  I+SDH  L L+    +
Sbjct: 197  RTFYPKTKGYTFFSAPHGTFSKIDHIIGHKTGLNRYKNIEIVPCILSDHHGLRLIFNNNI 256

Query: 945  EWGPSPFRFCNSWLLNSQ-CNSIIIRALAAGNHQGWAGF-------------VISAKFRS 1004
              G   F    +W LN+   N  +++       + +  F              + A  R 
Sbjct: 257  NNGKPTF----TWKLNNTLLNDTLVKEGIKKEIKDFLEFNENEATTYPNLWDTMKAFLRG 316

Query: 1005 EAEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQ-KSKLNWL---- 1064
            +   +    K+ + A     SS+    +A  K +  +  + + +++I+ + ++N +    
Sbjct: 317  KLIALSASKKKRETAH---TSSLTTHLKALEKKEANSPKRSRRQEIIKLRGEINQVETRR 376

Query: 1065 ---HLGDENTRFFH----------RFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNF 1124
                +    + FF           R     + K LI ++ N++G  T    EI++ I +F
Sbjct: 377  TIQRINQTRSWFFEKINKIDKPLARLTKGHRDKILINKIRNEKGDITTDPEEIQNTIRSF 436

Query: 1125 YKNLYT----KTPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPD 1184
            YK LY+           F    +  +++ +Q   L+S  S +EI   +  +   K+PGPD
Sbjct: 437  YKRLYSTKLENLDEMDKFLDRYQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGPD 496

Query: 1185 GFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLI-KKKEDAIMVKDFRPIS 1244
            GF+ EF   F + +      LF++    G L +   E  I LI K ++D   +++FRPIS
Sbjct: 497  GFSAEFYQTFKEDLIPILHKLFHKIEVEGTLPNSFYEATITLIPKPQKDPTKIENFRPIS 556

Query: 1245 LTTLTYKVIAKVLA----ERLKLILDPILIA---------------NELVEDYRIKKKKG 1304
            L  +  K++ K+LA    E +K I+ P  +                N +    ++K K  
Sbjct: 557  LMNIDAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSINVIHYINKLKDKNH 616

Query: 1305 WILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQAS 1364
             I+ LD EKAF ++   F+ K L        +++ I      P  +I +NG     +   
Sbjct: 617  MIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVNGEKLEAIPLK 676

Query: 1365 RGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCK 1424
             G RQG PLSP+LF +V EVL   I    + K+ +G  +GK++V + +L  ADD +++  
Sbjct: 677  SGTRQGCPLSPYLFNIVLEVLARAI---RQQKEIKGIQIGKEEVKISLL--ADDMIVYIS 736

Query: 1425 YDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYL 1484
               +    L   I  F    G K+N +KS       +    K              + YL
Sbjct: 737  DPKNSTRELLNLINSFGEVVGYKINSNKSMAFLYTKNKQAEKEIRETTPFSIVTNNIKYL 796

Query: 1485 GLPLGGHPKKMV--FWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFL 1531
            G+ L    K +    ++ +  +I+  L RWK    S  GR+ + K  +  LP        
Sbjct: 797  GVTLTKEVKDLYDKNFKSLKKEIKEDLRRWKDLPCSWIGRINIVKMAI--LPKAIYRFNA 856

BLAST of ClCG03G010470 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 1.8e-25
Identity = 70/202 (34.65%), Positives = 102/202 (50.50%), Query Frame = 0

Query: 1420 IIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFW 1479
            I++++  ++S W+   LS  GRLTL K VLS++P + MS  L+P+ ++  +++  R F W
Sbjct: 16   ILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQSILNRLDQLSRTFLW 75

Query: 1480 EGHGGSKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWC----- 1539
                  K  HL +W  V    K+GGLG+   K  N AL+SK GWR +QE  +LW      
Sbjct: 76   GSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQK 135

Query: 1540 -KEVASLR-SPWI----SISRQWQKIEALAIFKV---------GDGRRITFWFDPWLEDQ 1599
               V  +R S W+    S S  W+ I A+ +  V         GDG++I FW D W+  +
Sbjct: 136  KYHVGEIRDSRWLIPKGSWSSTWRSI-AIGLRDVVSHGVGWIPGDGQQIRFWTDRWVSGK 195

Query: 1600 PFKVRFPRLFELALKPNGTDAD 1602
            P       L EL      TD D
Sbjct: 196  P-------LLELDNGERPTDCD 209

BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match: A0A803P8A0 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 734.9 bits (1896), Expect = 7.4e-208
Identity = 389/1024 (37.99%), Positives = 564/1024 (55.08%), Query Frame = 0

Query: 703  MKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSF 762
            MKI++WNIRG GD+ K+ A+K  I K N ++V++QE K+   +   I  +W SR   W  
Sbjct: 1    MKILTWNIRGSGDKGKRAAIKATICKANPDMVILQEVKRATVDRRFIGSIWRSRFKAWIL 60

Query: 763  VEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRI 822
            + A GRSGG L++WD   ISV++++ G +++SV       +  W +  YGP  YK R   
Sbjct: 61   LPAIGRSGGTLLIWDTRIISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKIRHVF 120

Query: 823  WRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNG 882
            W EL  L++ C  +WC+GGDFN+TR + E++     TR MK F   I E  L++  + NG
Sbjct: 121  WDELAGLSSICGESWCVGGDFNVTRRVGEKLNSSSSTRSMKLFDGLIRELQLIDPKLENG 180

Query: 883  RFTWSREGNRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGPSP 942
             FTWS        S LDRF                 LVR+VSDH P+++++   +WGP P
Sbjct: 181  SFTWSNFRAIPICSRLDRFLFLNNWNVVFPFVRQEMLVRLVSDHSPVVIDSKPPKWGPGP 240

Query: 943  FRFCNSWLLN---SQC--------------NSIIIRALAA--GNHQGWAGFVISAKFRSE 1002
            FRF N WL +   S+C               +  ++ L    G  + W+ F       ++
Sbjct: 241  FRFDNHWLEHKSFSKCFESWWQEEIIDGWPGTKFMKKLKTLQGKAKEWSRFTYGQNKATK 300

Query: 1003 AEIVEKLDKEEQGAELEDPSSILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDEN 1062
              +  +L   ++       +  L D R  LK +   +  ++ER +  KSK  W   GD N
Sbjct: 301  NALEGRLGVLDRQEGTPSWNQSLYDERRKLKEEWQRLTFEEERSIWLKSKCKWAKEGDAN 360

Query: 1063 TRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLE 1122
            +RFFH  L A+K +N I+ +  D G   +S  EI ++++ F+  LYT     G     +E
Sbjct: 361  SRFFHNLLNARKARNTISRIERDNGDIIDSEKEIVEELIAFFSKLYTSETRMGTGVEGIE 420

Query: 1123 WQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFN 1182
            WQ ++     +L   F  +E+R  +     +KAPGPDGF++      W+ IK++ + +F 
Sbjct: 421  WQHIAEPSARQLECPFEEDEVRNIVFSCEGSKAPGPDGFSLAVFQNNWEVIKNELMEVFR 480

Query: 1183 EFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL------- 1242
             FH  GR+   + + FICLI K+ ++  VKDFRPISL T  YK+IAK LA RL       
Sbjct: 481  AFHSEGRIEGSINDTFICLIPKRLNSCKVKDFRPISLITSVYKIIAKTLATRLRGVLGET 540

Query: 1243 -----------KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHG 1302
                       + ILD +L+ANE VEDYR + KKG++LK+D EKA+ RVDWGFL+  L  
Sbjct: 541  ISETQSAFVEGRQILDSVLLANEAVEDYRSRGKKGFVLKIDFEKAYDRVDWGFLDLVLRK 600

Query: 1303 KNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLI 1362
            K F  +W  WI GC+ +  FSIF+NGR RG+   SRG+RQGDPLSPFLF LV++VL  ++
Sbjct: 601  KGFGERWRKWIRGCVSSTSFSIFVNGRVRGKFHGSRGLRQGDPLSPFLFTLVADVLGRMV 660

Query: 1363 SRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVN 1422
             +  +++ F GF +GK  + +  LQFADDTL F K D D L+ L K +E F    G KVN
Sbjct: 661  DKAVETEAFSGFQIGKDNIRLSHLQFADDTLFFVK-DEDSLQKLVKIVEAFCGISGLKVN 720

Query: 1423 WDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKL 1482
             +KS L G+ + D  V   A  + C+  K P+ YLG+PLGG P+K  FW+P++DK   ++
Sbjct: 721  LNKSQLLGICLSDEAVAQGANLIGCEVGKWPMTYLGMPLGGSPRKKTFWEPVLDKCAKRM 780

Query: 1483 SRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLN 1542
              WK + LSRGGRLTL ++VLS+LP YY+S+F +P+ V+  +E+ MR+FFWEG   +  +
Sbjct: 781  DGWKCSFLSRGGRLTLIQSVLSSLPIYYLSLFKVPKMVLKELEKMMRDFFWEGGDLAGGD 840

Query: 1543 HLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL-------- 1602
            HL  W  V K   +GGL +  L+++N  LL KW WRF  ES +LW K + S         
Sbjct: 841  HLVAWDEVCKPRAEGGLAIGRLEMRNKGLLMKWLWRFPLESNSLWHKVIKSRYGKADNFW 900

Query: 1603 ----------RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFE 1648
                      R PW+ I+  + +   +  FKVG+G  I FW D W+     + +FP L  
Sbjct: 901  DTKQGVRMSPRGPWMDIADLYHEYGKMVKFKVGNGASIRFWEDEWIGGPSLRDQFPTLAV 960

BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match: A0A803QI00 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 721.8 bits (1862), Expect = 6.5e-204
Identity = 386/1016 (37.99%), Positives = 549/1016 (54.04%), Query Frame = 0

Query: 711  RGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSG 770
            +G GD+ K+ A+K  I K N +LV++QE K+ + +   I  +W SR   W  + A GRSG
Sbjct: 908  KGSGDKGKRHAIKATICKANPDLVILQEVKRTSVDRRFIGSIWRSRFKAWIIIPAIGRSG 967

Query: 771  GLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALA 830
            G L++WD   I+V++++ G +++SV  K       W +  YGP  YK R   W EL  L+
Sbjct: 968  GTLLIWDTRTITVLDSLVGEFSISVLIKAEGKDPWWFSGVYGPCSYKLRPAFWDELAGLS 1027

Query: 831  AYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREG 890
            A C ++WC+GGDFN+TR   E++     TR MK F   I E  L++  + NGRFTWS   
Sbjct: 1028 AICGDSWCVGGDFNVTRRPGEKLNSSSCTRSMKLFDGLIRELRLIDPKLENGRFTWSNFR 1087

Query: 891  NRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGPSPFRFCNSWL 950
                 S LDRF                 LVR+VSDH P+++++    WGP PFRF N WL
Sbjct: 1088 TSPVCSRLDRFLFTNNWNVIYPFVRQEMLVRLVSDHSPVVIDSNPPRWGPGPFRFDNQWL 1147

Query: 951  LNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA------------- 1010
             ++       R     +  GW G    +K +   E V++      G              
Sbjct: 1148 EHNSFPKSFGRWWKEASSNGWPGTKFMSKLKKTQEKVKEWSSSTFGQNKATKRALEGRLV 1207

Query: 1011 ---ELEDPSSILQ---DPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFL 1070
                LE  +S +Q   + R  LK +   +  ++ER +  KSK  W   GD N+RFFH  L
Sbjct: 1208 ALDRLEGTNSWVQSLVEERRKLKEEWQQLNFEEERSIWLKSKCKWAKEGDANSRFFHNLL 1267

Query: 1071 AAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQ 1130
             A+K +N I+ +  + G   +   EI ++++ F+  LYT     G    ++EWQR++   
Sbjct: 1268 NARKARNTISRIEREDGSIIDKEEEIVEELIGFFSKLYTSEARRGSGIESIEWQRIAYSS 1327

Query: 1131 NSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRL 1190
              +L S F  EE++ ++     +KAPGPDGF++      W+ IKDD + +F  F + GR+
Sbjct: 1328 ACQLESSFEEEEVKRSVFSCEGSKAPGPDGFSLAVFQNNWETIKDDLMEVFRTFEKEGRI 1387

Query: 1191 NSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL--------------- 1250
               + E FICLI K+ ++  VKDFRPISL T  YK++AK LA RL               
Sbjct: 1388 EGSINETFICLIPKRLNSCKVKDFRPISLITSVYKIVAKTLATRLRGVLGETISETQSAF 1447

Query: 1251 ---KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWI 1310
               + ILD +LIANE VED+R + KKG++ K+DLEKA+ RVDW FL+  L  K F   W 
Sbjct: 1448 VEGRQILDSVLIANETVEDFRSRGKKGFVFKIDLEKAYDRVDWDFLDLVLKEKGFGEVWR 1507

Query: 1311 SWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKK 1370
             WI GC+ +  FS+ INGR RG+ + SRG+RQGDPLSPFLF LV +VL  L+ +  +S  
Sbjct: 1508 KWIRGCVSSTSFSLLINGRVRGKFRGSRGLRQGDPLSPFLFTLVVDVLGRLVDKAAQSDT 1567

Query: 1371 FEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCG 1430
            F GF VGK  + +  LQFADDTL F K D   L  L + +E F    G KVN +KS L G
Sbjct: 1568 FSGFQVGKDNIQISHLQFADDTLFFVK-DEASLRKLVEIVEAFCGISGLKVNLNKSQLLG 1627

Query: 1431 LNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNL 1490
            +++++  V   A  + C+    P+ YLG+PLGG P+K  FW+P++DK   +L  WK + L
Sbjct: 1628 ISLEEEVVAQNAEIIGCEVGTWPMTYLGMPLGGSPRKGTFWEPVLDKCAKRLDGWKCSFL 1687

Query: 1491 SRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTV 1550
            SRGGRL L ++VLS+LP YY+S+F  P+ V+  IE+ MR+FFWEG   +  +HL  W  V
Sbjct: 1688 SRGGRLILIQSVLSSLPIYYLSLFKAPKMVLQAIEKMMRDFFWEGGDLAGGDHLVAWDEV 1747

Query: 1551 TKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEV------------------A 1610
             K   +GGL +  L+++N  LL KW WR+  E  +LW K +                  A
Sbjct: 1748 CKPRSEGGLAIGRLEMRNKGLLMKWLWRYPLEPNSLWHKVIKSRYGKADNFWDTKWGARA 1807

Query: 1611 SLRSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGT 1648
            S R PW  IS  + +   L  FKVG+G  I FW D W+     K +FP +  ++   N +
Sbjct: 1808 SPRGPWKDISDYYDEYGQLVKFKVGNGANIRFWEDVWIGGSSLKEQFPDVAVISKAKNAS 1867

BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match: A0A803QEA6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 1.1e-203
Identity = 399/1088 (36.67%), Positives = 577/1088 (53.03%), Query Frame = 0

Query: 660  QDSEEESLISISSDELNEPEEDEN---------------------HLSLELDESIEHRSV 719
            Q  EE  L+  S D+L+E E++E                       + +E+ +  E    
Sbjct: 289  QGLEEMRLVDGSDDKLDELEKEEGREADEIMIEATSWSNIVESMAEMGMEITQENEDSDQ 348

Query: 720  KVFSMKIVSWNIRGLGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDI 779
            K  + +I++WNIRG GD+ K+ A+K  I K N +LV++QE K+   +   I  +W SR  
Sbjct: 349  K--TEEILTWNIRGSGDKGKRTAIKATICKANPDLVILQEVKRATVDRRFIGSIWRSRFK 408

Query: 780  GWSFVEAYGRSGGLLIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKE 839
             W  + A GRSGG L++WD   ISV++++ G +++SV       +  W +  YGP  YK 
Sbjct: 409  AWILIPAIGRSGGTLLIWDTRTISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKL 468

Query: 840  RRRIWRELQALAAYCTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIP 899
            R   W EL  L++ C  +WC+ GDFN+TR + E++     TR MK F   I E  L++  
Sbjct: 469  RPEFWDELAGLSSICGKSWCVAGDFNVTRRVGEKLNSSSFTRSMKLFDGLIRELQLIDPK 528

Query: 900  MSNGRFTWSREGNRVSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEW 959
            + NG FTWS        S LDRF                 LVRIVSDH P+++++   +W
Sbjct: 529  LENGSFTWSNFRASPVCSRLDRFLFTNNWNIIFPFVRQELLVRIVSDHSPVVIDSNPPKW 588

Query: 960  GPSPFRFCNSWLLNSQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA- 1019
            GP PFRF N WL +   +    R      + GW G     K +     V++  K   G  
Sbjct: 589  GPGPFRFDNHWLDHKSFSKCFERWWKEEINDGWPGTKFMKKLKILQGKVKEWSKSTFGQN 648

Query: 1020 ---------------ELEDPS---SILQDPRASLKSDLMNIYKKKERDLIQKSKLNWLHL 1079
                           +LE  S     L D R  LK +   +  ++ER    KSK  W   
Sbjct: 649  RAKKIALEGRLGVLDKLEGTSFWNQSLLDERRKLKEEWKWLNFEEERGTWLKSKCKWARE 708

Query: 1080 GDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFP 1139
            GD N+RFFH  L A+K +N I+ +  + G   ++  EI ++++ F+  LYT     G   
Sbjct: 709  GDANSRFFHNLLNARKARNTISRIERENGDIIDNEKEIAEELIAFFSKLYTSEARMGSGI 768

Query: 1140 ANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFV 1199
              +EWQ+++     +L   F  EE+R  +     NKAPGPDGF++  L   W+ IK D +
Sbjct: 769  EGIEWQQIAESSAGQLECPFEEEEVRNIVFSCEGNKAPGPDGFSLAVLQHNWETIKHDLM 828

Query: 1200 ALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL--- 1259
             +F  FH  GR+   + + FICLI K+ ++  VKDFRPISL T  YK+IAK LA RL   
Sbjct: 829  EVFTAFHREGRIEGSINDTFICLIPKRLNSCKVKDFRPISLITSVYKIIAKTLATRLRGV 888

Query: 1260 ---------------KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEK 1319
                           + ILD +L+ANE VEDYR + +KG++LK+D EKA+ RVDWGFL+ 
Sbjct: 889  LGETISETQSAFVEGRQILDSVLMANEAVEDYRSRGRKGFVLKIDFEKAYDRVDWGFLDM 948

Query: 1320 ALHGKNFDSKWISWILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVL 1379
             L  K F  +W  WI GC+ +  FSIFINGR RG+   SRG+RQGDPLSPFLF ++++VL
Sbjct: 949  VLRKKGFGERWRKWIRGCVSSTSFSIFINGRVRGKFNGSRGLRQGDPLSPFLFTMIADVL 1008

Query: 1380 TSLISRLHKSKKFEGFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFG 1439
              ++ +  +++   GF +GK  + +  LQFADDTL F K ++  L+ L K ++ F    G
Sbjct: 1009 GRMVDKAIETESLTGFQIGKDDIRLSHLQFADDTLFFVKDEVS-LQKLVKVVKAFCGISG 1068

Query: 1440 QKVNWDKSALCGLNIDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKI 1499
             KVN +KS L G+ +++  V  +A  + C+  + P+ YLG+ LGG P+K  FW+P++DK 
Sbjct: 1069 LKVNLNKSQLLGICMNEEAVAQSAILIGCEVGRWPMTYLGMSLGGSPRKRSFWEPVLDKC 1128

Query: 1500 QGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGG 1559
              ++  WK + LSRGGRLTL ++VLS+LP YY+S+F  P+ V+  +E+ MR FFWEG   
Sbjct: 1129 AKRMDGWKCSFLSRGGRLTLIQSVLSSLPIYYLSLFKAPKVVLKELEKMMREFFWEGGDL 1188

Query: 1560 SKLNHLARWVTVTKNHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL---- 1619
            +  +HL  W  V K   +GGL +  L ++N  LL KW WRF  E  +LW K + S     
Sbjct: 1189 AGGDHLVAWDEVCKPRAEGGLAIGKLDMRNKGLLMKWLWRFPLEPNSLWHKVIKSRYGKA 1248

Query: 1620 --------------RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFP 1648
                          R PW  IS  + +   L  FKVG+G RI FW D W+     + +FP
Sbjct: 1249 DNFWDTKQGVRISPRGPWKDISDLYDEYGKLVKFKVGNGERIRFWEDEWVGGSSLRDQFP 1308

BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match: A0A438FWU5 (LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF2_70 PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 2.3e-201
Identity = 473/1543 (30.65%), Positives = 744/1543 (48.22%), Query Frame = 0

Query: 259  GWIVIKNLSLEYWSRETFEAIGQHFGGLEEISIETLNLLDVSETKIKVRKNVYGFIPATI 318
            GW+ ++ L    W       I Q +G + +++ ETL L+D+S+ K+ V  +    +PA +
Sbjct: 262  GWLELRGLPFHLWDEFQLRYILQKWGRVTKVAKETLKLVDLSKVKMWVEMHPKVVLPALL 321

Query: 319  EINN---EFRGSIHLL---FGDIIPMKDSS---SIITEFIGSDFENPIDQVRL-SKVEED 378
            E+ +    F  ++ ++     D +   +S+     +T   G   + P +   L +   ++
Sbjct: 322  EVEDGAWSFTVAVSVIGEAEEDFLLRPESNRSKDEVTSAEGCVHQRPKNAEGLRATARDN 381

Query: 379  EVRVFSPSKLALLSSKPDQTQPEQEEKSLEISVGSPSVGKWSKSTANGLEGKSINEKAAK 438
            E   + P   + +      +  E E+   +  +G          TA  ++G +  E   K
Sbjct: 382  EYHRWRPRHRSRVRYSVSNSDTEVEKGRGKSCLG---------PTAGSVDGLTKPEAFFK 441

Query: 439  LNWCE-ETEEAAVGSSIGVLKSIRPIEVMKKEKRKERVFGRINENIKSIQTPDIPSGDFL 498
             ++     EE  +G S+G      P+         E   G  N        P  PS   L
Sbjct: 442  GHFARAHFEEKNIGPSVG------PVH--------ETEAGSSNGG------PATPSSSKL 501

Query: 499  RSGGPTGFEV--IQKIRKVPNVLLTVAEESEKSPPNSMKSN------------------- 558
            +  G +  EV  I +  K+    +T   ++   PP+   ++                   
Sbjct: 502  QRSGTSAKEVMPIAQSAKLKGNSVTARRKARSWPPSLKVTSIVPKRNLDGDGAPEANRGF 561

Query: 559  -WEEVPFQ--TLLPYHQLPRARVNAFPQQSPAHTISSVKSPFFPAVRRSSNLFKPSPKHF 618
             W    F+   L P  +  R             T + VK    P+V  S    K  P   
Sbjct: 562  IWGRSVFKKGALSPVSEKSRRFTGGTEGDEGISTCNWVKKVRPPSVALSE---KDLP--- 621

Query: 619  SRGKTSFLNLWSALTNPDILEVSCANSQVPQQIRDRSVLPLSSSQIFHQSEI----LIPG 678
                           NP+IL  S  +S +P + +  S +PL SS +  +S      ++  
Sbjct: 622  ----------LDGAFNPNILSDSSVSSVIPSRCQAFSPIPLESSLVSQRSPFPKIAVVEP 681

Query: 679  SNILFLRGSPSSYRPKPNKIQDSEEESLISISSDELNEPEE----------DENHLSLEL 738
            S ++ +        P     ++SEE  L     D L+ PE+          +  H SL L
Sbjct: 682  SRLVGVSRPEGVAFPLETSNRNSEERPLF---KDSLSSPEKLCVSGKAQSPNPEHPSLPL 741

Query: 739  D----------ESIEHRSVKVF-------------------SMKIVSWNIRGLGDQSKQL 798
            +          + ++    K F                    MKI+SWN RGLG + K+ 
Sbjct: 742  EGFQVEGLTPGKMVKKEEEKCFWKPRRGLGLGAGCAGLFLVYMKILSWNTRGLGSKKKRR 801

Query: 799  AVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGLLIMWDESK 858
             V+  +   N ++V++QETK+E ++   +  +W  + + W+ + A G SGG++I+WD SK
Sbjct: 802  IVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIVILWDSSK 861

Query: 859  ISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAYCTNAWCLG 918
            +   E + G ++++VK  +      W+T+ YGP +   R+  W ELQ L       WC+G
Sbjct: 862  LECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLTFPRWCVG 921

Query: 919  GDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNRVSRSLLDR 978
            GDFN+ R I E++   RLT  M+ F +FI E+ L++ P+ N  FTWS          LDR
Sbjct: 922  GDFNVIRRISEKLGETRLTLNMRCFDEFIRESGLIDPPLRNAAFTWSNMQADPICKRLDR 981

Query: 979  FLV-----------------RIVSDHFPLLLEAGALEWGPSPFRFCNSWLLNSQCNSIII 1038
            FL                  R  SDH P+ LE   L+WGP+PFRF N WLL+ +      
Sbjct: 982  FLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLLHPEFKEKFR 1041

Query: 1039 RALAAGNHQGWAGFVISAKFR------SEAEIVEKLD-KEEQGAELEDPSSI-LQDPRAS 1098
                    +GW G     K +       E  I+   D KE +   L D S I L +   +
Sbjct: 1042 VWWLECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGN 1101

Query: 1099 LKSDLM-----------NIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAAKKRKNLIA 1158
            L SDL+           ++  K+E    QKS++ W+  GD N++FFHR    ++ +  I 
Sbjct: 1102 LNSDLVLERTLKRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIK 1161

Query: 1159 ELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNSRLSSKFSR 1218
             L++++G   N+  +I ++I+NF+ NLY+K          ++W  +S E    L   F+ 
Sbjct: 1162 SLISERGETLNNIEDISEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGWLDRPFTE 1221

Query: 1219 EEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFIC 1278
            EE+R A+  + K KAPGPDGFT+    + WD IK+D + +F EFH NG +N      FI 
Sbjct: 1222 EEVRRAVFQLNKEKAPGPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIA 1281

Query: 1279 LIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKL------------------ILDPI 1338
            L+ KK  ++ + D+RPISL T  YK+IAKVL+ RL+                   ILD +
Sbjct: 1282 LVPKKSQSVKISDYRPISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAV 1341

Query: 1339 LIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISWILGCIKNP 1398
            LIANE+V++ R   ++G + K+D EKA+  VDWGFL+  L  K F  KW  WI GC+ + 
Sbjct: 1342 LIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSS 1401

Query: 1399 KFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKK 1458
             F+I +NG  +G V+ASRG+RQGDPLSPFLF LV++VL+ ++ R  ++   EGF VG+ +
Sbjct: 1402 SFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDR 1461

Query: 1459 VHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLNIDDLEVKS 1518
              V +LQFADDT+ F K  ++ L+ L+  +  F    G K+N +KS + G+N     + S
Sbjct: 1462 TRVSLLQFADDTIFFSKASMEHLQNLKIILLVFGQVSGLKINLEKSTISGINTRQELLSS 1521

Query: 1519 TAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCK 1578
             A+  +C+  + PL YLGLPLGG+PK + FW P++++I  +L  WK+  LS GGR+TL +
Sbjct: 1522 LASVFDCRVSEWPLSYLGLPLGGNPKTIGFWDPVVERISRRLDGWKKAYLSLGGRITLIQ 1581

Query: 1579 TVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLG 1638
            + LS++PSY++S+F +P  +   IE+  RNF W G G  K +HL RW  V++  + GGLG
Sbjct: 1582 SCLSHIPSYFLSLFKIPASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRPKELGGLG 1641

Query: 1639 LENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------RSPWISIS 1650
               + ++N+ALL KW WRF +E   LW K + S+                  R PW +I+
Sbjct: 1642 FGKISLRNIALLGKWLWRFPRERSGLWYKVIGSIYGTHPNGWDANMVVRWSHRCPWKAIA 1701

BLAST of ClCG03G010470 vs. ExPASy TrEMBL
Match: A0A803QQM3 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 708.8 bits (1828), Expect = 5.7e-200
Identity = 382/1014 (37.67%), Positives = 545/1014 (53.75%), Query Frame = 0

Query: 713  LGDQSKQLAVKHLIMKTNLELVLIQETKKEAFEAEAIKKLWSSRDIGWSFVEAYGRSGGL 772
            LGD+ K+ A+K  I K N +LV++QE K+   +   I  +W SR   W  + A GRSGG 
Sbjct: 934  LGDKGKRAAIKATICKANPDLVILQEVKRATVDRRFIGSIWRSRFKAWILLPALGRSGGT 993

Query: 773  LIMWDESKISVIETIKGGYTLSVKCKTLCSKVCWVTNAYGPTDYKERRRIWRELQALAAY 832
            L++WD   ISV++++ G +++SV       +  W +  YGP  YK R   W EL  L++ 
Sbjct: 994  LLIWDTRTISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKLRPEFWDELAGLSSI 1053

Query: 833  CTNAWCLGGDFNITRAIHERVPIGRLTRGMKKFSKFIEEAHLMEIPMSNGRFTWSREGNR 892
            C  +WC+GGDFN+TR + E++     TR MK F   I E  L++  + NG FTWS     
Sbjct: 1054 CGESWCVGGDFNVTRRVGEKLNSSSCTRSMKLFDGLIRELQLIDPKLENGSFTWSNFRAS 1113

Query: 893  VSRSLLDRF-----------------LVRIVSDHFPLLLEAGALEWGPSPFRFCNSWLLN 952
               S LDRF                 LVR+VSDH P+++++   +WGP PFRF N WL +
Sbjct: 1114 PVCSRLDRFLFSNNWNVIYPFVRQEMLVRLVSDHSPVVIDSNPPKWGPGPFRFDNHWLEH 1173

Query: 953  SQCNSIIIRALAAGNHQGWAGFVISAKFRSEAEIVEKLDKEEQGA--------------- 1012
               +           + GW G     K +     V++  K   G                
Sbjct: 1174 KSFSKCFESWWKEEINDGWPGTKFMKKLKLLQGKVKEWSKSTFGQNKATKIALEGRLGVL 1233

Query: 1013 -ELEDPSSILQ---DPRASLKSDLMNIYKKKERDLIQKSKLNWLHLGDENTRFFHRFLAA 1072
              LE  SS  Q   D R  LK +   ++ ++ER +  KSK  W   GD N+R FH  L A
Sbjct: 1234 DRLEGTSSWNQSVLDERRKLKEEWQQLHFEEERGIWLKSKCKWAREGDANSRLFHNLLNA 1293

Query: 1073 KKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLYTKTPSAGCFPANLEWQRVSVEQNS 1132
            +K KN I+ +  D G   ++  EI ++++ F+  LYT    +G     +EW ++      
Sbjct: 1294 RKAKNTISRIERDNGDIIDNEKEIVEELIAFFSKLYTSEARSGTGIEGIEWHKIEESSAR 1353

Query: 1133 RLSSKFSREEIRFALRGMGKNKAPGPDGFTVEFLNKFWDRIKDDFVALFNEFHENGRLNS 1192
            +L   F  EE+R  +     NKAPGPDGF++  L   W+ IK D + +F  FH  GR+  
Sbjct: 1354 QLECPFEEEEVRNIVFSCEGNKAPGPDGFSLAALQNNWETIKYDLMEVFRAFHREGRIEG 1413

Query: 1193 CVKENFICLIKKKEDAIMVKDFRPISLTTLTYKVIAKVLAERL----------------- 1252
             + + FICLI K+ ++  VKD+RPISL T  YK+IAK LA RL                 
Sbjct: 1414 SINDTFICLIPKRLNSCKVKDYRPISLITSVYKIIAKTLATRLRGVLGETISETQSAFVE 1473

Query: 1253 -KLILDPILIANELVEDYRIKKKKGWILKLDLEKAFGRVDWGFLEKALHGKNFDSKWISW 1312
             + ILD +L+ANE VEDYR + KKG +LK+D EKA+ RVDWGFL+  +  K F  +W  W
Sbjct: 1474 GRQILDSVLMANEAVEDYRSRGKKGIVLKIDFEKAYDRVDWGFLDLVMRKKGFGERWTKW 1533

Query: 1313 ILGCIKNPKFSIFINGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFE 1372
            I GC+    FSIFINGR RG+   SRG+RQ DPLSPFLF L+++VL  ++ +   ++   
Sbjct: 1534 IRGCVSTTSFSIFINGRVRGKFNGSRGLRQVDPLSPFLFTLIADVLGRMVDKAIDTESLS 1593

Query: 1373 GFIVGKKKVHVPILQFADDTLLFCKYDLDMLEALRKTIEFFEWCFGQKVNWDKSALCGLN 1432
            GF +GK  + +  LQFADDTL F K D   L+ L K +E F    G KVN +KS L G+ 
Sbjct: 1594 GFQIGKDDIQLSHLQFADDTLFFVK-DEASLQKLVKIVEAFCGISGLKVNLNKSQLLGVC 1653

Query: 1433 IDDLEVKSTAARLNCKAEKLPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSR 1492
            +D+  V  +A ++ C+  + P+ YLG+PLGG P+K  FW+P++DK   ++  WK + LSR
Sbjct: 1654 MDEDAVAQSAIQIGCEVGRWPMTYLGMPLGGSPRKRSFWEPVLDKCATRMDGWKCSFLSR 1713

Query: 1493 GGRLTLCKTVLSNLPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTK 1552
            GGRLTL ++VLS+LP Y++S+F  P+ V+  +E+ MR+FFWEG   +  +HL  W  V K
Sbjct: 1714 GGRLTLIQSVLSSLPIYFLSLFKAPKVVLKELEKMMRDFFWEGGDLAGGDHLVAWDEVCK 1773

Query: 1553 NHKDGGLGLENLKIKNLALLSKWGWRFMQESEALWCKEVASL------------------ 1612
               +GGL +  L+++N  LL KW WRF  ES +LW K + S                   
Sbjct: 1774 PRAEGGLAIGRLEMRNKGLLMKWLWRFPLESNSLWHKVIKSRYGRADNFWDTKHGVRLSP 1833

Query: 1613 RSPWISISRQWQKIEALAIFKVGDGRRITFWFDPWLEDQPFKVRFPRLFELALKPNGT-- 1648
            R PW  IS  + +   L  FKVG+G  I FW D W+     + +F  L  ++   N +  
Sbjct: 1834 RGPWKDISDLYDEYGKLVKFKVGNGACIRFWEDEWIGGSSLRDQFLNLAVISRAKNASIQ 1893

BLAST of ClCG03G010470 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 109.8 bits (273), Expect = 2.2e-23
Identity = 66/184 (35.87%), Positives = 94/184 (51.09%), Query Frame = 0

Query: 1013 QKSKLNWLHLGDENTRFFHRFLAAKKRKNLIAELVNDQGFPTNSYCEIEDQILNFYKNLY 1072
            QKS++ WL  GD NTRFFH+ + A + KNLI  L  D      +  ++++ I+ +Y +L 
Sbjct: 437  QKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLL 496

Query: 1073 TK-----TPSAGCFPANLEWQRVSVEQNSRLSSKFSREEIRFALRGMGKNKAPGPDGFTV 1132
                   TP +     ++   R +    SRLS+  S +EI  A+  M +NKAPGPD FT 
Sbjct: 497  GSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFTA 556

Query: 1133 EFLNKFWDRIKDDFVALFNEFHENGRLNSCVKENFICLIKKKEDAIMVKDFRPISLTTLT 1192
            EF  + W  +KD  +A   EF   G L        I LI K      +  FRP+S  T+ 
Sbjct: 557  EFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVTGVDQLSMFRPVSCCTVV 616

BLAST of ClCG03G010470 vs. TAIR 10
Match: AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 80.1 bits (196), Expect = 1.9e-14
Identity = 48/187 (25.67%), Positives = 78/187 (41.71%), Query Frame = 0

Query: 1398 LPLMYLGLPLGGHPKKMVFWQPIIDKIQGKLSRWKRNNLSRGGRLTLCKTVLSNLPSYYM 1457
            LP+ YLGLPL         + P+++KI+ ++ +W   +LS  GRL L  +V+ +L +++M
Sbjct: 23   LPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWM 82

Query: 1458 SIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLK------ 1517
            S F +P   +  I+    +F W G   +       W  V     +GGLG+ +LK      
Sbjct: 83   SAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKGS 142

Query: 1518 ---IKNLALLSKWGWRFMQESEALWCKEVASLRSPWISISRQWQKIEALAIFKVGDGRRI 1576
               I     L  W W+ + +  AL                             + +G   
Sbjct: 143  FWSISGNTTLGSWMWKKILKHRAL---------------------ASGFVKHDIHNGSNT 188

BLAST of ClCG03G010470 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 70.5 bits (171), Expect = 1.5e-11
Identity = 43/145 (29.66%), Positives = 67/145 (46.21%), Query Frame = 0

Query: 1452 LPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHKDGGLGLENLK 1511
            LP+Y M+ FL+P+ V   I   + +F+W     +K  H   W  ++    +GG+G ++++
Sbjct: 3    LPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDIE 62

Query: 1512 IKNLALLSKWGWRFMQESEALWCKEVAS--------LRSP--------WISISRQWQKIE 1571
              NLALL K  WR +   E+L  K   S        L +P        W SI    + + 
Sbjct: 63   AFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEILR 122

Query: 1572 ALAIFKVGDGRRITFWFDPWLEDQP 1581
              A   VG+G  I  W   WL+ +P
Sbjct: 123  QGARAVVGNGEDIIIWRHKWLDSKP 147

BLAST of ClCG03G010470 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 68.6 bits (166), Expect = 5.7e-11
Identity = 33/67 (49.25%), Positives = 42/67 (62.69%), Query Frame = 0

Query: 1272 INGRPRGRVQASRGVRQGDPLSPFLFLLVSEVLTSLISRLHKSKKFEGFIVGKKKVHVPI 1331
            ING P+G V  SRG+RQGDPLSP+LF+L +EVL+ L  R  +  +  G  V      +  
Sbjct: 14   INGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRINH 73

Query: 1332 LQFADDT 1339
            L FADDT
Sbjct: 74   LLFADDT 80

BLAST of ClCG03G010470 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 53.9 bits (128), Expect = 1.5e-06
Identity = 37/145 (25.52%), Positives = 59/145 (40.69%), Query Frame = 0

Query: 1452 LPSYYMSIFLMPEKVVLLIERAMRNFFWEGHGGSKLNHLARWVTVTKNHK-DGGLGLENL 1511
            LP Y MS F + + +   +  AM  F+W      +      W  + K+ + DGGLG  +L
Sbjct: 3    LPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDL 62

Query: 1512 KIKNLALLSKWGWRFMQESEALWCKEVASLRSP----------------WISISRQWQKI 1571
               N ALL+K  +R + +   L  + + S   P                W SI    + +
Sbjct: 63   GWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELL 122

Query: 1572 EALAIFKVGDGRRITFWFDPWLEDQ 1580
                +  +GDG     W D W+ D+
Sbjct: 123  SRGLLRTIGDGIHTKVWLDRWIMDE 147
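
Each HSP header above reports identities and positives as a fraction of the alignment length together with a rounded percentage. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes the percentages from the fraction. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the exact line layout shown in this page (it also accepts the "Postives" spelling).

```python
import re

# Pattern for the statistics line of an HSP as printed on this page.
# "Posi?tives" tolerates the misspelling "Postives" (assumption: no other variants).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(ident_len),
        # Percentages recomputed from the fractions, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the line, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


# Example: the A0A803QEA6 HSP reported in this page.
stats = parse_hsp_stats(
    "Identity = 399/1088 (36.67%), Positives = 577/1088 (53.03%), Query Frame = 0"
)
```

For this HSP the recomputed values agree with the printed ones: 399/1088 and 577/1088 round to 36.67% and 53.03%.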

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
RVW64408.1 | 4.8e-201 | 30.65 | LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]
RVW16209.1 | 1.0e-195 | 36.45 | Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]
CAN68838.1 | 7.4e-194 | 35.46 | hypothetical protein VITISV_030956 [Vitis vinifera]
RVW70235.1 | 1.6e-193 | 33.62 | LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]
RVW65579.1 | 2.1e-193 | 35.95 | Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]

Match Name | E-value | Identity | Description
O00370 | 3.4e-48 | 23.18 | LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1
P14381 | 3.7e-47 | 25.38 | Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV...
P08548 | 1.2e-45 | 22.88 | LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1
P11369 | 1.6e-42 | 23.22 | LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE...
P0C2F6 | 1.8e-25 | 34.65 | Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...

Match Name | E-value | Identity | Description
A0A803P8A0 | 7.4e-208 | 37.99 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803QI00 | 6.5e-204 | 37.99 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803QEA6 | 1.1e-203 | 36.67 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A438FWU5 | 2.3e-201 | 30.65 | LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF...
A0A803QQM3 | 5.7e-200 | 37.67 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1

Match Name | E-value | Identity | Description
AT1G43760.1 | 2.2e-23 | 35.87 | DNAse I-like superfamily protein
AT3G24255.1 | 1.9e-14 | 25.67 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein
AT4G29090.1 | 1.5e-11 | 29.66 | Ribonuclease H-like superfamily protein
ATMG01250.1 | 5.7e-11 | 49.25 | RNA-directed DNA polymerase (reverse transcriptase)
ATMG00310.1 | 1.5e-06 | 25.52 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 1166..1406; e-value: 7.5E-32; score: 110.6
IPR000477 | Reverse transcriptase domain | PROSITE | PS50878 | RT_POL | coord: 1147..1407; score: 17.21871
IPR036691 | Endonuclease/exonuclease/phosphatase superfamily | GENE3D | 3.60.10.10 | Endonuclease/exonuclease/phosphatase | coord: 700..915; e-value: 4.0E-29; score: 104.1
IPR036691 | Endonuclease/exonuclease/phosphatase superfamily | SUPERFAMILY | 56219 | DNase I-like | coord: 698..915
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 652..681
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 667..681
None | No IPR available | PANTHER | PTHR33116:SF33 | OS01G0885550 PROTEIN | coord: 1090..1610
None | No IPR available | PANTHER | PTHR33116 | REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATED | coord: 1090..1610
None | No IPR available | CDD | cd01650 | RT_nLTR_like | coord: 1163..1407; e-value: 2.49375E-42; score: 152.831
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 1110..1374
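
The `coord:` values in the InterPro annotations give the start and end of each predicted domain on the protein. As an illustration only, the Python sketch below extracts such a range and computes the span length. The helper name `span_length` is hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Matches a range such as "coord: 1166..1406" as printed in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text):
    """Return (start, end, length) for the first coordinate range in text.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinate range found in: %r" % text)
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end precedes start in: %r" % text)
    return start, end, end - start + 1


# Example: the Pfam RVT_1 hit and the Gene3D endonuclease hit listed above.
rvt = span_length("coord: 1166..1406")
endo = span_length("coord: 700..915")
```

Under that assumption the Pfam RVT_1 match covers 241 residues and the Gene3D endonuclease/exonuclease/phosphatase match covers 216 residues.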

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG03G010470.1 | ClCG03G010470.1 | mRNA