CSPI04G15350 (gene) Wild cucumber (PI 183967)

Name: CSPI04G15350
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr4 : 12751870 .. 12761914 (+)
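The location field can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated in this record); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Location string as given in the record above.
LOCATION = "Chr4 : 12751870 .. 12761914 (+)"


def parse_location(text):
    """Split 'Chr : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location(LOCATION)
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 10045 positions on the forward strand of Chr4. If the coordinates were 0-based half-open instead, the length would be `end - start`.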
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTGAGAGATTGTGGGATGCGCTGTATGTGTTTGATGAAATGTCTGAAATGAGTAATTTGATTGAAACCTTTGTGTTGGTGTGGAACTTAAAGGGGATGGTGGAGTTTTATAGAGGAAGAAATGGATTGGTGATGAGGAGTTTCGGGGATGTTCTACTGGTTTCTGTAATGGGTAATTGCTGGCATTGTTGATCTTCATTCTTCCACAAACTTCTTCTACCTTCTTGTTCTACATCTGCGTGGACAACGGCGGTGTGTTCTTTTGTATTCAATGATTCGGTTGTTAGTAACGGAACGGACGGAGCATCATCGAGGAGCTTCTGGTGAGTTCTACTTCCTGCCTTTCTATTGAATATCCCCAATTTTGGAATATTTGCCTATATAACTCTTATATTTAATTCAAAATAAGCATATTTTTTAAAATAAATTAAAATATTTATAAATTATATTAAAATTTTGAATTCAATTATTATTTATTTTTTTTGGTCGATATCGATATCTATTATAAATAAAATTTAAAATTTTGATCAAAAACTCTATTCTTGATAAATTTTATTATATTTATAGTTTTTATAATAGTTGATTACTATTTCTAAAATTGTTAAACTGGAGTTTTGACCTTTACATTTGTTATTGGTTTGTTTCATTTACTGAACTAAATTCACAAAGGCTCCTTGGATATGTAGCGAAATAAGACTGAAGTATTAAGAATATAGCAAAACTTAAAATTAATTATAAATATAGTAAAATTTATGTTCTATTAATGAAAAATGTCTAAAAATTTTATCAAAGACATCAAAATATGTCTGAGACCTGAAATTCCTCATCTCTAAAACTGCTTTTATAAAAATTTGTCATTAATAAACTCTTTTTTTAAACTAATTACAAATATAATAAAATTTATCTCCAGATTTGTAGTCTATATAAAATATACAAAAGAAAATTTATCGAATGGATAAAAATTTCAATTTTTGAAGTTTGTCGTTGATAGATCTGATAACAGTATCATTGATAGACTCATTTTATAGAAATCTATCATTGATACCATGTTATCATCATCCATAATTTTGAACAAATAGGATTCTATCAAAAAAATATATCAATTTATCATAGATAGCAAACTATCGTTGATATTACCTCTATCACTATATTTTTGTATCACGGAACGATTAGTGATAGCACATTATCAATTATATACTTTTATAAGTGGTAAAATTTTATCAACGATAAAAGTACGTCATTGAAAATATATTAGTGTGGTTCATGGTTTTTTTTTTAAAAAAATTTTATAAAATATATAAAAAAAAGTGATTAATAATGGTTCCAACATATAAGTGATACACTTTTATCATCGATAAACTTCTATTGATGATAAAAATATGCATATCCTTGAAAATTTGTTTAGTTTTGAATAATTCGTAGAAATTGATCGTTGATGCTTCTATGTATAACAAAATTTTATAATCGATAAATTTCCATCAATGATAGAATATCATAAAAAAAATATTTAAAGTGATCGTTTATTTTAAATATTTTTTTTGTACTGTTTATTAGAGTCGATATAGATGATGTGTTTCTATCACTAATAGACTCGAAGAAAGTTCAATTTATTTTCAGAAAAAGTAAAAAACAAACAATTAAGAGAAATTGGTAAAAGTAGCATAAAAATTGTTTTGATCGGTCCATTCCGAGTGTGATCAACCATAGAAATTTTTCTAAAACATTAAAATTTTAAAATTTTCAATTTATCGATTGTTTTGGTTTTACCATACATCTCGATAGACTTTAGAGTGAGATTTCTACCCTCGGTAAACTTTAGAGCTAGATTTTTACCCTTTCTCAAATCTTCCCAAATTTTTTTTAAAAAATAAATTATTCAAATCTTAACTGATTGTTTTATATTTTAAAAAAGTATGGATCTCGGTTCATCCGGTAGAAATTCTCCCAAAAACACCGAATAATTTGTGGGCAATCACTAAGAAGATCCAAAAACTTGC
TTCTCGTTCTTGAGCGAGGCAGAGCACATAAGTCCAGATGATTTAACGGCCATTAATGGCGGAAATCGATCACTGGAACAAGGCGTGGCAAAATTGATGGTTCCAAAATTTCCATGTCCAATCCATGGCTGGGTAGAAATTAGAGAAAAATAATAGTGGACGGGTTTTAGGTTGAGGAAGAGAAGAATGGGGAAAAAGAAAAATGGTAAGTTACAAAAAATAATTGGAGAAATTGGGAAAAATAGCAAACAAAGTTTGAAAAATGATAGAATCTCGTTTTAACATTTATCTAAATCGAGATGCACTAAGAAAATCCAAAGCAACAGATAAGTTGAAAATTTTTAATATTTTAGGAAATCTTGATGGTTGATCTCACGATCGAGAAATAGTTATATTGAGTTTTTAAAAAATGTGCTGCTTTTAAAAACGAAAACCAAAAGTATGCTATTTTTACCAATTTTTCAAATAGTTGATAAGTGTCTACCCGTGGTAGAAGAAAATTGAATATATTTTCATGAATAAAACATGTATAGTTCCATTGATTTTTTGTCTTCTTTAATTAGAGTCTTAACAAATAAATTTGTATCAACTTGAATAAAGTTGAATTTATGTTCATAAAAAAAGTAAGAACTTGCATAGTTTTTTATGCTATTTTGCTTATGTCATTGATTAGTTTATTAGTAATACACTTGAAGAACACTAAATTTATTTTCATGAAGAATTAGAAAATGTATAGTTTCATTGTTTTGTTTTTTTTTTTAGTCCATAGATGATATGTTTCTAGGTAGACTCGAAGAAAATTGAATTTATTTTCATGAAAAAGTAAGAAAATACATCATTTCAATGATGTTTTTGAGGTTGGTGATTAGTTATTAGTGATAGAGTAGAAGAATGTTAAATACTTTTTCATCAAAAGTGAGAAAATGTGTAGTTTATTTTTTTTCCTTTTCTTTTTTTACTCTTGTTTATAGCCTATCGATGATGAGTTTCGATTAGTGATAGACTTGAAGAAAGTTGATTTTTTTTTCATGAAAAATAAGAAAATATATAGTTTCAATTATGTTCTTGTTGTGTATTTAATAGTCAATGTTTCAGTTTCTTTCACTAGTAGCGGGAAGAAAGTTGGATTTATATTTTCATAAAAAGCACGTACTTCATTGTTTCTTTATTTGATATATTTCTATCATTGATTGACATAAAAAAAAAAGTTAAATTTTTATATAAAAAGTAAGAATAATATGTATATCGTCGTTTTTATTTTAGTATATTTGTTAATAAGAATAATGTACATACATTTCTATGTTTCTACCGTCAACATACTTACAAAAATGAATTGAATTTATTTTCATGAAAAATAAGTAAGAAAGTATAGTTTCGTTGTATTATTGTTTTTCTAGTGTTATTGTTGATTTAAGTATGTTGATGATATGTTTCATAGACTTGCAGAAATATAGTTTCGTTGTTGATTATTTTGTCTGGATGAGAGTCTAACAATGTTAAGTTTTTATCACCAAATACACTTAAAAGAAATTTTTAATTTATTTTTCAAGAAAATATATAAATTTCTATTGGTAGTGTTTTTATTTTTTATTTTCATAATTAAGTGTAAAATTATTTATATGAACAAATGATGGAATTTGACATGGATGAATTCGTTCTAACCATAAATAAAAGAAAATGTAAAACTAGAATGAAAATCAATGGAATTTCGCTAATTAGGAGGTTATGGTTACATGAAGTCTAAAATCATAGGTTTGGTTCCACCTCGTAGATTGTTGTTGCTTTAGTTAAAATTCCATTAGCTTGATAACCTTCCAATATTCTCTAGGATAGAAACCCAAACTCTTAATTTCGTTATTTCGTCATTCCTTACAAAAATCTATATGTTTCTATCAGGGATAGAATCCAAAATTTTACTATAGTTTGTAAATATTTTAATTTATTTTGTTATTTTTAAACATAATTGAGAAAAAAGAATGCATTATAGTATTATCAAACGT
TTATGATGATACCAAGTTTATATATTATTTGTCTATACATATATAAAATAATATATCTTTTTTTAACTTTAAATTGAAAGTTGGATATTAATAATGTGTTAAAATTCTGCAAATAAATAAAAATTAAAAGTGTTGGAATTCTTTCCTTTTTTTCTAGGCTTTTTCCTTGTTAGAATCCTAGAATAATTATGGAAAGATTATGGGAATGTATTCCTTATTTTCCTAAATCTTTTCCTTTTTTATTCCATTGTGTACTCTATTTATTCTCCCTTGTACCTATTGCTTTATTCATTAGAAAATAATAACAACACAAACAATCGTGGTTTTTCTCCCGGTACTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATAGAAACTTTATGAAAATGGGCTTTATGAAGGTTTCGTGGAGGGAGAAGAGAAGGTTCACATCCCTCTCGTCCAATATGCTGACGACACTTTCCTCTTTTGCAAATATGATCAAAAACCCAATGGATCAGTAGCTGCACATTGGGACCATTCCACCCTTTCCTGGCCTATAATTTTTAGAAGACTTCTTAAAGAAGAAATTTCAGCAATTGTTTCACCTCCTTTCGCAAAGGAAGGTGGTAGAATCTATGGACAGAAGGAGTTGGTCTCCAAAACCATCCAGTAAGTTTTCGGTTTTTTTCTTCGTCCGCGGCCTATATTGCTCCCCTCGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACTGCTTTTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGAATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATGTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTCAGCTTTCTGACAGGGGAAATACCTCGCCCCCAACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTAGGGAACCTCAAATGGGTAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCATAGAACTAGTCTGGCGTGATCCCACTGATGATGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCGGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAAGGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCATCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCACTCCTTCGGTCTTATTAGTAT
TGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTATGGGGAGATGTTATTCTTACAGCAACTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTAGCCCTAATCAAACAAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGATTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTC
TCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGTAGAAATCTATATATGGTCTGAAATAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGACAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTC
TGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA

mRNA sequence

ATGTGTGAGAGATTGTGGGATGCGCTGTATGTGTTTGATGAAATGTCTGAAATGAGTAATTTGATTGAAACCTTTGTGTTGGTGTGGAACTTAAAGGGGATGGTGGAGTTTTATAGAGGAAGAAATGGATTGGTGATGAGGAGTTTCGGGGATGTTCTACTGGTTTCTGTAATGGTAACGGAACGGACGGAGCATCATCGAGGAGCTTCTGAAGATCCAAAAACTTGCTTCTCGTTCTTGAGCGAGGCAGAGCACATAAGTCCAGATGATTTAACGGCCATTAATGGCGGAAATCGATCACTGGAACAAGGCGTGGCAAAATTGATGGTTCCAAAATTTCCATGTCCAATCCATGGCTGGAAACTTTATGAAAATGGGCTTTATGAAGGTTTCGTGGAGGGAGAAGAGAAGGTTCACATCCCTCTCGTCCAATATGCTGACGACACTTTCCTCTTTTGCAAATATGATCAAAAACCCAATGGATCAGTAGCTGCACATTGGGACCATTCCACCCTTTCCTGGCCTATAATTTTTAGAAGACTTCTTAAAGAAGAAATTTCAGCAATTGTTTCACCTCCTTTCGCAAAGGAAGGTGTAAGTTTTCGGTTTTTTTCTTCGTCCGCGGCCTATATTGCTCCCCTCGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACTGCTTTTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGAATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATGTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTCAGCTTTCTGACAGGGGAAATACCTCGCCCCCAACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTAGGGAACCTCAAATGGGTAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCATAGAACTAGTCTGGCGTGATCCCACTGATGATGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCGGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAAGGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCATCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCACTCCTTCGGTCTTATTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGT
ATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTATGGGGAGATGTTATTCTTACAGCAACTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTAGCCCTAATCAAACAAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGATTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCT
ATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGACAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA

Coding sequence (CDS)

ATGTGTGAGAGATTGTGGGATGCGCTGTATGTGTTTGATGAAATGTCTGAAATGAGTAATTTGATTGAAACCTTTGTGTTGGTGTGGAACTTAAAGGGGATGGTGGAGTTTTATAGAGGAAGAAATGGATTGGTGATGAGGAGTTTCGGGGATGTTCTACTGGTTTCTGTAATGGTAACGGAACGGACGGAGCATCATCGAGGAGCTTCTGAAGATCCAAAAACTTGCTTCTCGTTCTTGAGCGAGGCAGAGCACATAAGTCCAGATGATTTAACGGCCATTAATGGCGGAAATCGATCACTGGAACAAGGCGTGGCAAAATTGATGGTTCCAAAATTTCCATGTCCAATCCATGGCTGGAAACTTTATGAAAATGGGCTTTATGAAGGTTTCGTGGAGGGAGAAGAGAAGGTTCACATCCCTCTCGTCCAATATGCTGACGACACTTTCCTCTTTTGCAAATATGATCAAAAACCCAATGGATCAGTAGCTGCACATTGGGACCATTCCACCCTTTCCTGGCCTATAATTTTTAGAAGACTTCTTAAAGAAGAAATTTCAGCAATTGTTTCACCTCCTTTCGCAAAGGAAGGTGTAAGTTTTCGGTTTTTTTCTTCGTCCGCGGCCTATATTGCTCCCCTCGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACTGCTTTTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGAATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATGTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTCAGCTTTCTGACAGGGGAAATACCTCGCCCCCAACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTAGGGAACCTCAAATGGGTAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCATAGAACTAGTCTGGCGTGATCCCACTGATGATGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCGGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAAGGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCATCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCACTCCTTCGGTCTTATTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGT
ATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTATGGGGAGATGTTATTCTTACAGCAACTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTAGCCCTAATCAAACAAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGATTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCT
ATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGACAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA
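The BLAST alignments on this page compare the translated query (frame 1) against protein entries. As a rough illustration of how the coding sequence maps to the query residues shown in those alignments, the sketch below translates short excerpts of the CDS with the standard genetic code. It assumes the standard code (NCBI table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for this example, not a function from any specific library.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First 24 nucleotides of the CDS listed above.
CDS_START = "ATGTGTGAGAGATTGTGGGATGCG"
# Excerpt of the CDS that encodes the residues aligned at Query 646.
CDS_EXCERPT = "AAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCAT"

print(translate(CDS_START))
print(translate(CDS_EXCERPT))
```

The second excerpt translates to `KNPWILDSGATDH`, which matches the start of the query line in the first POLX_TOBAC alignment block.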
BLAST of CSPI04G15350 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 4.3e-129
Identity = 327/1002 (32.63%), Positives = 500/1002 (49.90%), Query Frame = 1

Query: 646  KNPWILDSGATDHLTGSSEHFVSYIPCAGN-ETIRIADGSLAPIAGKG----KISPCAGL 705
            ++ W++D+ A+ H T   + F  Y+  AG+  T+++ + S + IAG G    K +    L
Sbjct: 291  ESEWVVDTAASHHATPVRDLFCRYV--AGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTL 350

Query: 706  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRG-LYLL 765
             L +V HVP L  NL+S   +  +   ++ F         L+ G ++     +RG LY  
Sbjct: 351  VLKDVRHVPDLRMNLISGIALDRD-GYESYFANQKWR---LTKGSLVIAKGVARGTLYRT 410

Query: 766  DDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH-LFSKVEMTTLS- 825
            + +     +             E    LWH R+GH + + ++ L    L S  + TT+  
Sbjct: 411  NAEICQGELNAAQ--------DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKP 470

Query: 826  CDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWV 885
            CD C+  KQHRVSF +   +      LV+SDV GP +I +  G ++FVTFIDD +R  WV
Sbjct: 471  CDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWV 530

Query: 886  YLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCA 945
            Y++  K +V  +FQ F+  +E +  +K+  LRSDNG E+ +    E+ +S GI H+ +  
Sbjct: 531  YILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVP 590

Query: 946  YTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDVILTATHLINRMPSRILHLQTPLD 1005
             TPQ NGVAER NR ++E  RS++    LP   WG+ + TA +LINR PS  L  + P  
Sbjct: 591  GTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIP-- 650

Query: 1006 CLKESYPSTRHVSEVPLRVFGCTAYVHNFSPNQTKFTPRAQACVFVGYPPHQRGYKCFHP 1065
               E   + + VS   L+VFGC A+ H     +TK   ++  C+F+GY   + GY+ + P
Sbjct: 651  ---ERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDP 710

Query: 1066 PSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPT 1125
              +K   + DV F E                 E     +  E    V + IIP+ + +P 
Sbjct: 711  VKKKVIRSRDVVFRE----------------SEVRTAADMSE---KVKNGIIPNFVTIP- 770

Query: 1126 NQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVA 1185
                              S ++ P +    ++   +QG E P E     +I + ++ +  
Sbjct: 771  ------------------STSNNPTSAESTTDEVSEQG-EQPGE-----VIEQGEQLDEG 830

Query: 1186 VLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLD--IPIALRKGTRSCTKHPI 1245
            V E VE    G+E    +        +     S EY    D   P +L++       HP 
Sbjct: 831  V-EEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKE----VLSHPE 890

Query: 1246 CNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTL 1305
             N +         +A    ++S +     Y  ++ P+ K  +                  
Sbjct: 891  KNQL--------MKAMQEEMES-LQKNGTYKLVELPKGKRPLK----------------- 950

Query: 1306 PKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLL 1365
                    CKWVF LK   D  L R+KARLV KGF Q  GID+ E FSPV K+ +IR +L
Sbjct: 951  --------CKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTIL 1010

Query: 1366 SVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPR 1425
            S+A + D  + QLDVK AFL+GDL EE+YM  P GFE    +H VCKL KS+YGLKQ+PR
Sbjct: 1011 SLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPR 1070

Query: 1426 AWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQR 1485
             W+ +F +F+KSQ Y + +SD  ++ K        +L++YVDD+++ G D+  I++LK  
Sbjct: 1071 QWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGD 1130

Query: 1486 MGDEFEIKDLGNLKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFN 1545
            +   F++KDLG  +  LGM++ R +    + +SQ KYI  +L    M   +P  TP+  +
Sbjct: 1131 LSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGH 1189

Query: 1546 CKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVSQFMQTPNEEHM 1605
             KL         +++  + K  Y   VG L+Y +  TRPDI+ AV VVS+F++ P +EH 
Sbjct: 1191 LKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHW 1189

Query: 1606 KAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWQGHSDH 1628
            +AV  ILRYL+ T G  L F  +D   ++ YTD+D  G  D+
Sbjct: 1251 EAVKWILRYLRGTTGDCLCFGGSD-PILKGYTDADMAGDIDN 1189
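The percentages in the POLX_TOBAC HSP header are simple ratios of matching columns to alignment length. The sketch below recomputes them from the counts reported above (327 identical and 500 positive columns out of 1002); `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match the reported figures.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage string with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100.0 * count / alignment_length:.2f}"


# Counts taken from the POLX_TOBAC HSP header above.
ALIGNMENT_LENGTH = 1002
IDENTICAL = 327
POSITIVE = 500

print("identity  %:", percent(IDENTICAL, ALIGNMENT_LENGTH))
print("positives %:", percent(POSITIVE, ALIGNMENT_LENGTH))
```

Note that the denominator is the alignment length including gap columns, not the length of either sequence, so the percentages describe the aligned region only.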

BLAST of CSPI04G15350 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 412.9 bits (1060), Expect = 1.9e-113
Identity = 305/1013 (30.11%), Positives = 487/1013 (48.08%), Query Frame = 1

Query: 649  WILDSGATDHLTGSSEHFVSYIPCAGNETIRIA-DGSLAPIAGKG--KISPCAGLSLHNV 708
            ++LDSGA+DHL      +   +       I +A  G       +G  ++     ++L +V
Sbjct: 289  FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDV 348

Query: 709  LHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSS 768
            L   + + NL+S+ ++              +S +   SG  I       GL ++ +    
Sbjct: 349  LFCKEAAGNLMSVKRLQEA----------GMSIEFDKSGVTIS----KNGLMVVKNSGML 408

Query: 769  SSIPRTSLLS-SYFTTSEQDCMLWHFRLGHPNFQYM-----KHLFPH--LFSKVEMTTLS 828
            +++P  +  + S     + +  LWH R GH +   +     K++F    L + +E++   
Sbjct: 409  NNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEI 468

Query: 829  CDVCIQAKQHRVSFPSQPYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLT 888
            C+ C+  KQ R+ F     K    +P  +VHSDV GP    T   K +FV F+D  T   
Sbjct: 469  CEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYC 528

Query: 889  WVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNS 948
              YLI  KS+V SMFQ+F    E  F+ K+  L  DNGRE+ ++ + +F   KGI +  +
Sbjct: 529  VTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLT 588

Query: 949  CAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDVILTATHLINRMPSRIL--HLQ 1008
              +TPQ NGV+ER  R + E AR+++    L    WG+ +LTAT+LINR+PSR L    +
Sbjct: 589  VPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSK 648

Query: 1009 TPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFSPNQTKFTPRAQACVFVGYPPHQRGYK 1068
            TP +      P  +H     LRVFG T YVH     Q KF  ++   +FVGY P+  G+K
Sbjct: 649  TPYEMWHNKKPYLKH-----LRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEPN--GFK 708

Query: 1069 CFHPPSRKYFVTMDVTFCEDRPY------FPVSHLQGESVSEESNNTFEFIEPTPSVVSN 1128
             +   + K+ V  DV   E          F    L+    SE  N    F   +  ++  
Sbjct: 709  LWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKN----FPNDSRKIIQT 768

Query: 1129 IIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTK-N 1188
              P+      N       + ++ K+            +  +E P      N ++ C    
Sbjct: 769  EFPNESKECDN-----IQFLKDSKESENKNFPNDSRKIIQTEFP------NESKECDNIQ 828

Query: 1189 MISENDRSNVAVLENVEEKDSGDEI-EVRIETRNNEAEQGHTGKS------DEYDSSLDI 1248
             + ++  SN   L   +++   D + E +     NE+ +  T +       D    +  I
Sbjct: 829  FLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGI 888

Query: 1249 PIALRKGTRSCTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVM 1308
             I  R+  R  TK  I      +SL+     A T   D      +I        W+ A+ 
Sbjct: 889  EIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAIN 948

Query: 1309 EEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDY 1368
             E+ A + N+TW I   P+    V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY
Sbjct: 949  TELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDY 1008

Query: 1369 SETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH 1428
             ETF+PVA++++ R +LS+ +  +  ++Q+DVK AFLNG L EE+YM  P G       +
Sbjct: 1009 EETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCN-SDN 1068

Query: 1429 VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYV 1488
            VCKL K+IYGLKQ+ R WF+ F   +K   +     D  ++  +   G I     +++YV
Sbjct: 1069 VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIY--ILDKGNINENIYVLLYV 1128

Query: 1489 DDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLT 1548
            DD+V+   D   ++  K+ + ++F + DL  +K+F+G+ +   ++ I +SQ  Y+  +L+
Sbjct: 1129 DDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILS 1188

Query: 1549 ETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSV 1608
            +  M  C    TP+  + N +L NSD+         + L+G L+Y+   TRPD++ AV++
Sbjct: 1189 KFNMENCNAVSTPLPSKINYELLNSDEDC---NTPCRSLIGCLMYIMLCTRPDLTTAVNI 1248

Query: 1609 VSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWQG 1624
            +S++    N E  + + R+LRYLK T    L+F+K       I  Y DSDW G
Sbjct: 1249 LSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAG 1258

BLAST of CSPI04G15350 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 4.5e-46
Identity = 92/224 (41.07%), Positives = 136/224 (60.71%), Query Frame = 1

Query: 1642 LIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1701
            L++YVDDI+LTG   T ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1702 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1761
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1762 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1821
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1822 TSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW 1866
            T+G+CTF+  N+++W +K+Q  V+RSS E EYRA++L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI04G15350 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 142.1 bits (357), Expect = 6.4e-32
Identity = 80/255 (31.37%), Positives = 141/255 (55.29%), Query Frame = 1

Query: 1368 LDVKNAFLNGDLVEEVYMSPPPGFEAQFG-QHVCKLQKSIYGLKQSPRAWFDRFTTFVKS 1427
            +DV  AFLN  + E +Y+  PPGF  +    +V +L   +YGLKQ+P  W +     +K 
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1428 QGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGN 1487
             G+ +   +H L+ + +  G I +  VYVDD+++         ++KQ +   + +KDLG 
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIA-VYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 120

Query: 1488 LKYFLGMEVARSKEG-ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVD 1547
            +  FLG+ + +S  G I++S + YI    +E+ +   + T TP+  +  L  +      D
Sbjct: 121  VDKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 180

Query: 1548 KEQYQRLVGKLIYLSHT-RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLM 1607
               YQ +VG+L++ ++T RPDIS+ VS++S+F++ P   H+++  R+LRYL +T    L 
Sbjct: 181  ITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLK 240

Query: 1608 FRKTDRKTIEAYTDS 1620
            +R   +  +  Y D+
Sbjct: 241  YRSGSQLALTVYCDA 254

BLAST of CSPI04G15350 vs. Swiss-Prot
Match: YD22B_YEAST (Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY2B-DR2 PE=3 SV=2)

HSP 1 Score: 119.8 bits (299), Expect = 3.4e-25
Identity = 119/456 (26.10%), Positives = 207/456 (45.39%), Query Frame = 1

Query: 650  ILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLH------ 709
            ++DSGA+  L  S+ +     P   N  I I D     I     I+    L  +      
Sbjct: 455  LIDSGASQTLVRSAHYLHHATP---NSEINIVDAQKQDIP----INAIGNLHFNFQNGTK 514

Query: 710  ---NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGT-ARHSRGLYLL 769
                 LH P ++Y+LLS+S++ ++ N  A F  +++   + S G ++    +H    +L 
Sbjct: 515  TSIKALHTPNIAYDLLSLSELANQ-NITACFTRNTL---ERSDGTVLAPIVKHGDFYWLS 574

Query: 770  DDDTSSSSIPRTSL--LSSYFTTSEQDCMLWHFRLGHPNFQYM-----KHLFPHLF-SKV 829
                  S I + ++  ++   + ++    L H  LGH NF+ +     K+   +L  S +
Sbjct: 575  KKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDI 634

Query: 830  EMT---TLSCDVCIQAK--QHR-VSFPSQPYKPT-QPFTLVHSDVWGPSKITTSSGKRWF 889
            E +   T  C  C+  K  +HR V      Y+ + +PF  +H+D++GP      S   +F
Sbjct: 635  EWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYF 694

Query: 890  VTFIDDHTRLTWVYLITDKSEVS--SMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLS 949
            ++F D+ TR  WVY + D+ E S  ++F +    I+ QF+ ++ +++ D G E+ N  L 
Sbjct: 695  ISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLH 754

Query: 950  EFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDVILTATHLI 1009
            +F  ++GI    +     + +GVAER NR LL   R+L+  + LP++LW   +  +T + 
Sbjct: 755  KFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIR 814

Query: 1010 NRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRV-----FGCTAYVHNFSPNQTKFTPRA 1069
            N + S            K    + +H     L +     FG    V+N +P+ +K  PR 
Sbjct: 815  NSLVSP-----------KNDKSARQHAGLAGLDITTILPFGQPVIVNNHNPD-SKIHPRG 874

BLAST of CSPI04G15350 vs. TrEMBL
Match: A5AYJ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1)

HSP 1 Score: 1333.9 bits (3451), Expect = 0.0e+00
Identity = 691/1340 (51.57%), Positives = 897/1340 (66.94%), Query Frame = 1

Query: 329  SMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSILR 388
            S + L+  KLNG NY  W+QSVK+ ++GR K   L GE+ +P   DP+ + W+  + +  
Sbjct: 35   SSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFRELVA- 94

Query: 389  SILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTS 448
                      +GKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ + +QG  +VT+
Sbjct: 95   ----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDREVTT 154

Query: 449  FFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR 508
            ++N++  +WQE+DLC E  W  P D V++ + EENDR+Y FLA LN   D VRGRILG++
Sbjct: 155  YYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRILGRK 214

Query: 509  PIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIPVCE 568
            P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P C+
Sbjct: 215  PLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---PWCD 274

Query: 569  HCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSLA 628
            HCKK WHTK   WK+HG+P   KK+  +D +      +S  ++ PQ +    N T   L+
Sbjct: 275  HCKKPWHTKGTCWKIHGKPQNFKKKNGSDGR--AFQTMSADSQGPQINSEKPNFTKEQLS 334

Query: 629  TLGAIVQSG---------------IPHSFGLISIDGKNPWILDSGATDHLTGSSEHFVSY 688
             L  + QS                +  +   I  +   PWI+DSGATDH+TGSS+ F SY
Sbjct: 335  HLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQIFSSY 394

Query: 689  IPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKA 748
             PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +  C+A
Sbjct: 395  KPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQDHQCQA 454

Query: 749  IFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWH 808
             F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D +LWH
Sbjct: 455  NFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDDIILWH 514

Query: 809  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 868
            +RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L+HSDV
Sbjct: 515  YRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSLIHSDV 574

Query: 869  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 928
            WGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  KI + R
Sbjct: 575  WGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTKIQVFR 634

Query: 929  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 988
            SDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T +P Y
Sbjct: 635  SDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTTKVPKY 694

Query: 989  LWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFSPN 1048
            LWG+ ILTAT+LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH    N
Sbjct: 695  LWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVHIHDHN 754

Query: 1049 QTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVSE 1108
            + K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQGES SE
Sbjct: 755  RGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQGESTSE 814

Query: 1109 ESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNHKKEVG 1168
            +S+          N    +EP+ S   V  NI    +    + + + KT      K  V 
Sbjct: 815  DSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGKKNGVL 874

Query: 1169 SPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGDEIEVR 1228
            +  S   +    S         N     TKN  +    R      E+  +   G E E+R
Sbjct: 875  NIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGHESELR 934

Query: 1229 IETRNNEAEQGH-----------TGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDS 1288
             E  ++E    +           +  + E    L+IPIA RKG RSCTKHP+ NY+SY +
Sbjct: 935  EEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNYMSYKN 994

Query: 1289 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1348
            LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG  TVG
Sbjct: 995  LSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKGKTTVG 1054

Query: 1349 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDW 1408
            CKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A N DW
Sbjct: 1055 CKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIAANLDW 1114

Query: 1409 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTF 1468
            PL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+RFT F
Sbjct: 1115 PLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFERFTQF 1174

Query: 1469 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1528
            VK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  EFEIKD
Sbjct: 1175 VKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALEFEIKD 1234

Query: 1529 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1588
            LG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG+++D   
Sbjct: 1235 LGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDTNDGNL 1294

Query: 1589 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1624
            V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKSTPGKGL
Sbjct: 1295 VNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKSTPGKGL 1354

BLAST of CSPI04G15350 vs. TrEMBL
Match: A5B7Z8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1)

HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 658/1325 (49.66%), Positives = 863/1325 (65.13%), Query Frame = 1

Query: 326  AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDS 385
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 386  ILRSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 445
            ++ S LINS    +G+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 446  VTSFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 505
            VT ++N L+  WQ++DL     W+   D   Y  I E  R++ F  GLN + D VRGRI+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 506  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 565
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 566  CEHCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 625
            C++CKK  H KE  WKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 626  DLSLATLGAIVQSGIPHSFGLISI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 685
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 686  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 745
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 746  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 805
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 806  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 865
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 866  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 925
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 926  ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 985
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 986  PSYLWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 1045
            P+Y WG+ ILTAT+LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 1046 FSPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 1105
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 1106 SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 1165
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 1166 APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA 1225
             P+Q   P          +   +N+     R     LE+  +   G  I+          
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 1226 EQGHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTII 1285
             +   G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 1286 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1345
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1346 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1405
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1406 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1465
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1466 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1525
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1526 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1585
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1586 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1625
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1334

BLAST of CSPI04G15350 vs. TrEMBL
Match: A5AJR0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1)

HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 656/1323 (49.58%), Positives = 861/1323 (65.08%), Query Frame = 1

Query: 328  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 387
            SS   ++G KLNG+NY  WSQSV + + G+ K  + TGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYXTGEAXMPETTEPXFRKWKIENSMI 87

Query: 388  RSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 447
             S LINS    +G+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 448  SFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 507
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 508  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 567
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 568  HCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 627
            + KK  H KE  WKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 628  SLATLGAIVQSGIPHSFGLI-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 687
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 688  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 747
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 748  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 807
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 808  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 867
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 868  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 927
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 928  RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 987
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 988  YLWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFS 1047
            Y WG+ ILTAT+LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 1048 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1107
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 1108 SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 1167
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 1168 VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQ 1227
            +Q   P          +   +N+     R     LE+  +   G  I+           +
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 1228 GHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPK 1287
               G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 1288 DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1347
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1348 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1407
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1408 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1467
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1468 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1527
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1528 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1587
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1588 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1625
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1334

BLAST of CSPI04G15350 vs. TrEMBL
Match: W9SCZ3_9ROSA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Morus notabilis GN=L484_026684 PE=3 SV=1)

HSP 1 Score: 1239.9 bits (3207), Expect = 0.0e+00
Identity = 658/1337 (49.21%), Positives = 870/1337 (65.07%), Query Frame = 1

Query: 326  AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDS 385
            ++SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  D   + WK E+S
Sbjct: 31   SESSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTDSGFKKWKIENS 90

Query: 386  ILRSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 445
            ++ S LINS    +G+  L   TAK+IWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 91   MIMSWLINSMNNDIGENFLLFGTAKEIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 150

Query: 446  VTSFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 505
            VT +++ L   WQ++DL     W+ P D   Y ++ E  R++ F  GLN + D VRGRI+
Sbjct: 151  VTQYYSTLIRYWQQLDLFETHSWKCPDDAATYRQVVEQKRLFKFFLGLNRELDDVRGRIM 210

Query: 506  GQRPIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIP 565
            G +P+PSL E  SE+R EE R   M  S    A+P +D++A + RSSNS+   H  +  P
Sbjct: 211  GTKPLPSLREAFSEVRREESRKKVMMGSKEQHASP-LDASALAVRSSNSNGGDHQKRERP 270

Query: 566  VCEHCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGR-AYVSESAEPPQQSDPHKNQTD 625
             C++CKK  H KE  WKLHG+P   K +P  D+ +    A  S+SA  P+ S  +K Q +
Sbjct: 271  WCDYCKKLGHYKEACWKLHGKPADWKPKPRFDRDSKAHVASNSDSAPVPEPSPFNKEQMN 330

Query: 626  L-----SLATLGAIVQSGI-----PHSFGLISIDGKNPWILDSGATDHLTGSSEHFVSYI 685
            +     S    G I  +G+     PH+    +  G  PWI+D+GA+DH+TG +    +Y 
Sbjct: 331  VLQKLFSQVGSGNITGAGLVAQTDPHTAFTANHGGMRPWIVDTGASDHMTGDAALLQNYK 390

Query: 686  PCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAI 745
            P +G+ ++ IADGS + IAG G I     L L +VLHVP L  NLLSISK+  +L C   
Sbjct: 391  PSSGHSSVHIADGSNSKIAGTGSIKLTKELYLDSVLHVPNLDCNLLSISKLACDLQCVTK 450

Query: 746  FLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTS------LLSSYFTTS--- 805
            F P+   FQDL SG+MIG+A    GLYLL  D SS+ + + S      LL S+ + S   
Sbjct: 451  FYPNLCIFQDLKSGKMIGSAELCSGLYLLSCDRSSNQVSQASCVQSQSLLGSFNSVSNSN 510

Query: 806  ---EQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKP 865
               + + +L H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP
Sbjct: 511  VNKDSEIILLHYRLGHPSFVYLAKLFPKLFINKNPASFHCEICQIAKHTRTVYPQIPYKP 570

Query: 866  TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIE 925
            +  F+L+HSDVWGPS+I   SG RWFVTF+DDHTR+TWVYL+ +KSEV  +F  F   ++
Sbjct: 571  STVFSLIHSDVWGPSRIKNVSGTRWFVTFVDDHTRVTWVYLMKEKSEVGQIFHTFNLMVQ 630

Query: 926  TQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVAR 985
             QF+ +I +L+SDN RE+   +L+ +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR
Sbjct: 631  NQFNSRIQVLKSDNAREYFTSSLNTYLQNHGIIHLSSCVDTPQQNGVAERKNRHLLEVAR 690

Query: 986  SLMLSTSLPSYLWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVF 1045
             LM S+++P+Y WG+ ILTAT+LINRMPSR+L  Q+P   L E++P TR VS ++P +VF
Sbjct: 691  CLMFSSNVPNYFWGEAILTATYLINRMPSRVLTFQSPRQLLLENFPHTRAVSSDLPPKVF 750

Query: 1046 GCTAYVHNFSPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYF 1105
            GCTA+VH +  +++KF PRA  C+F+GY P Q+GYKC+ P S++++ TMDV+F E   ++
Sbjct: 751  GCTAFVHVYPQHRSKFDPRANKCIFLGYSPTQKGYKCYSPISKRFYTTMDVSFFEHVFFY 810

Query: 1106 PVSHLQGESVSEESNNTFEFI-EPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGS 1165
            P S +QGES++E  +  +E I E  PS  S     S  +P +                  
Sbjct: 811  PKSRVQGESMNE--HQIWESILESVPSSHSESPRPSQTVPID------------------ 870

Query: 1166 PTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISEN-----DRSNVAVLENVEEKDSG--- 1225
              S  P P+  S  P +     P +     + +EN      R     LE+  +   G   
Sbjct: 871  --SSTPVPL--SVQPTNVSSPVPVQSVAPQLANENLQVYIRRKKRQELEHGSQPTCGQYI 930

Query: 1226 DEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQF 1285
            D I    E       +G        DS+L  PIALRKG R CT HPI NYV+Y+ LSP +
Sbjct: 931  DSISSPPEENMGTDREGDVSTPSIDDSTL--PIALRKGVRRCTDHPIGNYVTYEGLSPSY 990

Query: 1286 RAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVF 1345
            +AF  SLD T IP  I+ AL+  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F
Sbjct: 991  KAFATSLDGTQIPSTIHEALQNSEWKKAVQDEIDALEKNGTWTITDLPGGKRPVGCKWIF 1050

Query: 1346 SLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQL 1405
            ++KYKADG+++R KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QL
Sbjct: 1051 TIKYKADGSVERFKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQL 1110

Query: 1406 DVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQ 1465
            D+KNAFLNGDL EEVYM  PPGFE    ++ VCKL+KS+YGLKQSPRAWFDRFT  V   
Sbjct: 1111 DIKNAFLNGDLEEEVYMEIPPGFEGSMTKNQVCKLRKSLYGLKQSPRAWFDRFTKAVLKL 1170

Query: 1466 GYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL 1525
            GY QG SDHTLF K S   KIA+LIVYVDDI+L+G+D  E+ +LK+ + +EFE+KDLGNL
Sbjct: 1171 GYVQGQSDHTLFVKKSHAEKIAILIVYVDDIILSGNDVKELQELKKYLSEEFEVKDLGNL 1230

Query: 1526 KYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKE 1585
            KYFLGMEVARS +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ 
Sbjct: 1231 KYFLGMEVARSSKGIVVSQRKYILDLLKETGMLGCKPVDTPMDSQKKLGTEKESAPVDRG 1290

Query: 1586 QYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRK 1625
            +YQRLVG+LIYLSHTRPDI FAVSVVSQFM +P EEHM+AV R+LRYLK TPGKGL F K
Sbjct: 1291 RYQRLVGRLIYLSHTRPDIGFAVSVVSQFMHSPTEEHMEAVYRVLRYLKMTPGKGLFFIK 1340

BLAST of CSPI04G15350 vs. TrEMBL
Match: A5BV07_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039357 PE=4 SV=1)

HSP 1 Score: 1239.9 bits (3207), Expect = 0.0e+00
Identity = 649/1326 (48.94%), Positives = 862/1326 (65.01%), Query Frame = 1

Query: 328  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 387
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S++
Sbjct: 33   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 92

Query: 388  RSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 447
             S LINS    +G+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 93   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 152

Query: 448  SFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 507
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 153  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 212

Query: 508  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 567
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 213  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 272

Query: 568  HCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 627
            +CKK  H KE  WKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 273  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 332

Query: 628  SLATLGAIVQSGIPHSFGLISIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 687
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 333  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 392

Query: 688  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 747
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 393  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 452

Query: 748  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 807
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 453  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 512

Query: 808  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 867
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 513  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 572

Query: 868  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 927
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 573  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 632

Query: 928  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 987
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 633  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 692

Query: 988  LWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFSP 1047
             WG+ ILTAT+LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 693  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 752

Query: 1048 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1107
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 753  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 812

Query: 1108 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQD 1167
            E  +  +E F+E  PS  S     S   PT               E+ +P      P Q 
Sbjct: 813  E--HQVWESFLEGVPSFHSESPNPSQFAPT---------------ELSTPMPPSVQPAQH 872

Query: 1168 SEPPRDQGMENPT--EPCTKNMISENDRSNVAVLENVE-EKDSGDEIEVRIETRNNEAEQ 1227
            +  P    +++P   +P    + +EN +  +   +  E E  S    +  I++ ++  E+
Sbjct: 873  TNVPSPVTIQSPMPIQPIAPQLANENLQVYIRRRKRQELEHGSQSTYDQYIDSNSSLPEE 932

Query: 1228 --GHTGKSDEYDSSLD---IPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTI 1287
              G     +    S+D   +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T 
Sbjct: 933  NIGEDRAGEVLIPSIDDSTLPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQ 992

Query: 1288 IPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLD 1347
            +P  I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++
Sbjct: 993  VPNTIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVE 1052

Query: 1348 RHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDL 1407
            R KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL
Sbjct: 1053 RFKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDL 1112

Query: 1408 VEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTL 1467
             EEVYM  PPGFE    ++ V KLQKS+Y LKQSPRAWFDRFT  V   GY+QG +DHTL
Sbjct: 1113 EEEVYMEIPPGFEESMAKNQVXKLQKSLYXLKQSPRAWFDRFTKAVLKLGYKQGQADHTL 1172

Query: 1468 FTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARS 1527
            F K S  GK  +LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS
Sbjct: 1173 FVKKSHAGKXXILIVYVDDIILSGNDMXELQNLKKYLSEEFEVKDLGNLKYFLGMEVARS 1232

Query: 1528 KEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIY 1587
            ++GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQ LVG+LIY
Sbjct: 1233 RKGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQWLVGRLIY 1292

Query: 1588 LSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTD 1625
            LSH RPDI FAVS VSQFM +P E HM+AV RILRYLK TPGKGL FRKT+ +  E Y+D
Sbjct: 1293 LSHARPDIGFAVSAVSQFMHSPTEXHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSD 1339

BLAST of CSPI04G15350 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 378.3 bits (970), Expect = 3.0e-104
Identity = 181/403 (44.91%), Positives = 268/403 (66.50%), Query Frame = 1

Query: 1229 SCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS 1288
            S T H I  ++SY+ +SP + +F   +     P     A ++  W  A+ +E+ A+E   
Sbjct: 54   SLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTH 113

Query: 1289 TWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL 1348
            TW+ICTLP   K +GCKWV+ +KY +DGT++R+KARLVAKG+TQ  GID+ ETFSPV KL
Sbjct: 114  TWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKL 173

Query: 1349 NTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-----VCKLQ 1408
             +++++L+++   ++ L+QLD+ NAFLNGDL EE+YM  PPG+ A+ G       VC L+
Sbjct: 174  TSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLK 233

Query: 1409 KSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGD 1468
            KSIYGLKQ+ R WF +F+  +   G+ Q HSDHT F K++ T  + VL VYVDDI++  +
Sbjct: 234  KSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL-VYVDDIIICSN 293

Query: 1469 DQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCR 1528
            + A + +LK ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+
Sbjct: 294  NDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 353

Query: 1529 PTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEE 1588
            P+  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + P   
Sbjct: 354  PSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLA 413

Query: 1589 HMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWQGHSD 1627
            H +AV +IL Y+K T G+GL +       ++ ++D+ +Q   D
Sbjct: 414  HQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKD 455

BLAST of CSPI04G15350 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 189.1 bits (479), Expect = 2.6e-47
Identity = 92/224 (41.07%), Positives = 136/224 (60.71%), Query Frame = 1

Query: 1642 LIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1701
            L++YVDDI+LTG   T ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1702 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1761
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1762 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1821
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1822 TSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW 1866
            T+G+CTF+  N+++W +K+Q  V+RSS E EYRA++L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI04G15350 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 107.8 bits (268), Expect = 7.5e-23
Identity = 52/98 (53.06%), Positives = 64/98 (65.31%), Query Frame = 1

Query: 1261 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1320
            PK +  ALK P W  A+ EE+ AL +N TW +   P     +GCKWVF  K  +DGTLDR
Sbjct: 28   PKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDR 87

Query: 1321 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVA 1359
             KARLVAKGF Q  GI + ET+SPV +  TIR +L+VA
Sbjct: 88   LKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI04G15350 vs. TAIR10
Match: AT1G21280.1 (AT1G21280.1 Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 83.2 bits (204), Expect = 2.0e-15
Identity = 52/211 (24.64%), Positives = 106/211 (50.24%), Query Frame = 1

Query: 320 YVTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERY 379
           Y+   +   S + +     + +NY +W    +  L   +KF F+ G +P+P P  P  + 
Sbjct: 19  YLPPDIHHPSDFSIQKLSKDEDNYVAWKIRFRSFLRVTKKFGFIDGTLPKPDPFSPLYQP 78

Query: 380 WKAEDSILRSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHEC 439
           W+  ++++   L+NS   ++ + +++A TA  +W+  + ++    +  ++Y LR+++   
Sbjct: 79  WEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRRVFVPCVDL-KIYQLRRRLATL 138

Query: 440 KQGTMDVTSFFNKLSLIWQEMDLCIEL-------VWRDPTDDVQYSRIEENDRIYDFLAG 499
           +QG   V  +F KLS +W E+     +          + T   + +R  E ++ Y+FL G
Sbjct: 139 RQGGDSVEEYFGKLSKVWMELSEYAPIPECKCGGCNCECTKRAEEAR--EKEQRYEFLMG 198

Query: 500 --LNPKFDVVRGRILGQRPIPSLMEVCSEIR 522
             LN  F+ V  +I+ Q+P PSL E  + ++
Sbjct: 199 LKLNQGFEAVTTKIMFQKPPPSLHEAFAMVK 226

BLAST of CSPI04G15350 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 75.5 bits (184), Expect = 4.1e-13
Identity = 34/82 (41.46%), Positives = 53/82 (64.63%), Query Frame = 1

Query: 1748 IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAY 1807
            +YL+ TRPD++FAV+ +SQF        M+AV ++L Y+K T G+GL +  T    ++A+
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1808 TDSDWAGSVVDRKSTSGYCTFV 1830
             DSDWA     R+S +G+C+ V
Sbjct: 61   ADSDWASCPDTRRSVTGFCSLV 82

BLAST of CSPI04G15350 vs. NCBI nr
Match: gi|147819777|emb|CAN76196.1| (hypothetical protein VITISV_041073 [Vitis vinifera])

HSP 1 Score: 1333.9 bits (3451), Expect = 0.0e+00
Identity = 691/1340 (51.57%), Positives = 897/1340 (66.94%), Query Frame = 1

Query: 329  SMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSILR 388
            S + L+  KLNG NY  W+QSVK+ ++GR K   L GE+ +P   DP+ + W+  + +  
Sbjct: 35   SSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFRELVA- 94

Query: 389  SILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTS 448
                      +GKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ + +QG  +VT+
Sbjct: 95   ----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDREVTT 154

Query: 449  FFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR 508
            ++N++  +WQE+DLC E  W  P D V++ + EENDR+Y FLA LN   D VRGRILG++
Sbjct: 155  YYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRILGRK 214

Query: 509  PIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIPVCE 568
            P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P C+
Sbjct: 215  PLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---PWCD 274

Query: 569  HCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSLA 628
            HCKK WHTK   WK+HG+P   KK+  +D +      +S  ++ PQ +    N T   L+
Sbjct: 275  HCKKPWHTKGTCWKIHGKPQNFKKKNGSDGR--AFQTMSADSQGPQINSEKPNFTKEQLS 334

Query: 629  TLGAIVQSG---------------IPHSFGLISIDGKNPWILDSGATDHLTGSSEHFVSY 688
             L  + QS                +  +   I  +   PWI+DSGATDH+TGSS+ F SY
Sbjct: 335  HLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQIFSSY 394

Query: 689  IPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKA 748
             PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +  C+A
Sbjct: 395  KPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQDHQCQA 454

Query: 749  IFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWH 808
             F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D +LWH
Sbjct: 455  NFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDDIILWH 514

Query: 809  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 868
            +RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L+HSDV
Sbjct: 515  YRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSLIHSDV 574

Query: 869  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 928
            WGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  KI + R
Sbjct: 575  WGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTKIQVFR 634

Query: 929  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 988
            SDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T +P Y
Sbjct: 635  SDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTTKVPKY 694

Query: 989  LWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFSPN 1048
            LWG+ ILTAT+LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH    N
Sbjct: 695  LWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVHIHDHN 754

Query: 1049 QTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVSE 1108
            + K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQGES SE
Sbjct: 755  RGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQGESTSE 814

Query: 1109 ESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNHKKEVG 1168
            +S+          N    +EP+ S   V  NI    +    + + + KT      K  V 
Sbjct: 815  DSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGKKNGVL 874

Query: 1169 SPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGDEIEVR 1228
            +  S   +    S         N     TKN  +    R      E+  +   G E E+R
Sbjct: 875  NIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGHESELR 934

Query: 1229 IETRNNEAEQGH-----------TGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDS 1288
             E  ++E    +           +  + E    L+IPIA RKG RSCTKHP+ NY+SY +
Sbjct: 935  EEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNYMSYKN 994

Query: 1289 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1348
            LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG  TVG
Sbjct: 995  LSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKGKTTVG 1054

Query: 1349 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDW 1408
            CKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A N DW
Sbjct: 1055 CKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIAANLDW 1114

Query: 1409 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTF 1468
            PL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+RFT F
Sbjct: 1115 PLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFERFTQF 1174

Query: 1469 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1528
            VK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  EFEIKD
Sbjct: 1175 VKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALEFEIKD 1234

Query: 1529 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1588
            LG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG+++D   
Sbjct: 1235 LGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDTNDGNL 1294

Query: 1589 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1624
            V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKSTPGKGL
Sbjct: 1295 VNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKSTPGKGL 1354

BLAST of CSPI04G15350 vs. NCBI nr
Match: gi|147810393|emb|CAN59964.1| (hypothetical protein VITISV_022757 [Vitis vinifera])

HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 658/1325 (49.66%), Positives = 863/1325 (65.13%), Query Frame = 1

Query: 326  AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDS 385
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 386  ILRSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 445
            ++ S LINS    +G+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 446  VTSFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 505
            VT ++N L+  WQ++DL     W+   D   Y  I E  R++ F  GLN + D VRGRI+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 506  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 565
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 566  CEHCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 625
            C++CKK  H KE  WKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 626  DLSLATLGAIVQSGIPHSFGLISI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 685
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 686  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 745
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 746  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 805
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 806  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 865
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 866  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 925
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 926  ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 985
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 986  PSYLWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 1045
            P+Y WG+ ILTAT+LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 1046 FSPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 1105
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 1106 SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 1165
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 1166 APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA 1225
             P+Q   P          +   +N+     R     LE+  +   G  I+          
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 1226 EQGHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTII 1285
             +   G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 1286 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1345
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1346 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1405
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1406 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1465
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1466 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1525
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1526 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1585
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1586 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1625
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1334

BLAST of CSPI04G15350 vs. NCBI nr
Match: gi|147778986|emb|CAN62538.1| (hypothetical protein VITISV_031159 [Vitis vinifera])

HSP 1 Score: 1242.3 bits (3213), Expect = 0.0e+00
Identity = 656/1323 (49.58%), Positives = 862/1323 (65.15%), Query Frame = 1

Query: 328  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 387
            SS   ++G KLNG+NY  WSQSV + + G+ K  ++TGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYJTGEAXMPETTEPXFRKWKIENSMI 87

Query: 388  RSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 447
             S LINS    +G+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 448  SFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 507
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 508  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 567
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 568  HCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 627
            + KK  H KE  WKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 628  SLATLGAIVQSGIPHSFGLI-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 687
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 688  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 747
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 748  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 807
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 808  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 867
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 868  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 927
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 928  RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 987
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 988  YLWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFS 1047
            Y WG+ ILTAT+LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 1048 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1107
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 1108 SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 1167
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 1168 VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQ 1227
            +Q   P          +   +N+     R     LE+  +   G  I+           +
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 1228 GHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPK 1287
               G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 1288 DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1347
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1348 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1407
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1408 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1467
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1468 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1527
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1528 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1587
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1588 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1625
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1334

BLAST of CSPI04G15350 vs. NCBI nr
Match: gi|703163467|ref|XP_010113352.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus notabilis])

HSP 1 Score: 1239.9 bits (3207), Expect = 0.0e+00
Identity = 658/1337 (49.21%), Positives = 870/1337 (65.07%), Query Frame = 1

Query: 326  AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDS 385
            ++SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  D   + WK E+S
Sbjct: 31   SESSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTDSGFKKWKIENS 90

Query: 386  ILRSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 445
            ++ S LINS    +G+  L   TAK+IWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 91   MIMSWLINSMNNDIGENFLLFGTAKEIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 150

Query: 446  VTSFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 505
            VT +++ L   WQ++DL     W+ P D   Y ++ E  R++ F  GLN + D VRGRI+
Sbjct: 151  VTQYYSTLIRYWQQLDLFETHSWKCPDDAATYRQVVEQKRLFKFFLGLNRELDDVRGRIM 210

Query: 506  GQRPIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIP 565
            G +P+PSL E  SE+R EE R   M  S    A+P +D++A + RSSNS+   H  +  P
Sbjct: 211  GTKPLPSLREAFSEVRREESRKKVMMGSKEQHASP-LDASALAVRSSNSNGGDHQKRERP 270

Query: 566  VCEHCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGR-AYVSESAEPPQQSDPHKNQTD 625
             C++CKK  H KE  WKLHG+P   K +P  D+ +    A  S+SA  P+ S  +K Q +
Sbjct: 271  WCDYCKKLGHYKEACWKLHGKPADWKPKPRFDRDSKAHVASNSDSAPVPEPSPFNKEQMN 330

Query: 626  L-----SLATLGAIVQSGI-----PHSFGLISIDGKNPWILDSGATDHLTGSSEHFVSYI 685
            +     S    G I  +G+     PH+    +  G  PWI+D+GA+DH+TG +    +Y 
Sbjct: 331  VLQKLFSQVGSGNITGAGLVAQTDPHTAFTANHGGMRPWIVDTGASDHMTGDAALLQNYK 390

Query: 686  PCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAI 745
            P +G+ ++ IADGS + IAG G I     L L +VLHVP L  NLLSISK+  +L C   
Sbjct: 391  PSSGHSSVHIADGSNSKIAGTGSIKLTKELYLDSVLHVPNLDCNLLSISKLACDLQCVTK 450

Query: 746  FLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTS------LLSSYFTTS--- 805
            F P+   FQDL SG+MIG+A    GLYLL  D SS+ + + S      LL S+ + S   
Sbjct: 451  FYPNLCIFQDLKSGKMIGSAELCSGLYLLSCDRSSNQVSQASCVQSQSLLGSFNSVSNSN 510

Query: 806  ---EQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKP 865
               + + +L H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP
Sbjct: 511  VNKDSEIILLHYRLGHPSFVYLAKLFPKLFINKNPASFHCEICQIAKHTRTVYPQIPYKP 570

Query: 866  TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIE 925
            +  F+L+HSDVWGPS+I   SG RWFVTF+DDHTR+TWVYL+ +KSEV  +F  F   ++
Sbjct: 571  STVFSLIHSDVWGPSRIKNVSGTRWFVTFVDDHTRVTWVYLMKEKSEVGQIFHTFNLMVQ 630

Query: 926  TQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVAR 985
             QF+ +I +L+SDN RE+   +L+ +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR
Sbjct: 631  NQFNSRIQVLKSDNAREYFTSSLNTYLQNHGIIHLSSCVDTPQQNGVAERKNRHLLEVAR 690

Query: 986  SLMLSTSLPSYLWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVF 1045
             LM S+++P+Y WG+ ILTAT+LINRMPSR+L  Q+P   L E++P TR VS ++P +VF
Sbjct: 691  CLMFSSNVPNYFWGEAILTATYLINRMPSRVLTFQSPRQLLLENFPHTRAVSSDLPPKVF 750

Query: 1046 GCTAYVHNFSPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYF 1105
            GCTA+VH +  +++KF PRA  C+F+GY P Q+GYKC+ P S++++ TMDV+F E   ++
Sbjct: 751  GCTAFVHVYPQHRSKFDPRANKCIFLGYSPTQKGYKCYSPISKRFYTTMDVSFFEHVFFY 810

Query: 1106 PVSHLQGESVSEESNNTFEFI-EPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGS 1165
            P S +QGES++E  +  +E I E  PS  S     S  +P +                  
Sbjct: 811  PKSRVQGESMNE--HQIWESILESVPSSHSESPRPSQTVPID------------------ 870

Query: 1166 PTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISEN-----DRSNVAVLENVEEKDSG--- 1225
              S  P P+  S  P +     P +     + +EN      R     LE+  +   G   
Sbjct: 871  --SSTPVPL--SVQPTNVSSPVPVQSVAPQLANENLQVYIRRKKRQELEHGSQPTCGQYI 930

Query: 1226 DEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQF 1285
            D I    E       +G        DS+L  PIALRKG R CT HPI NYV+Y+ LSP +
Sbjct: 931  DSISSPPEENMGTDREGDVSTPSIDDSTL--PIALRKGVRRCTDHPIGNYVTYEGLSPSY 990

Query: 1286 RAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVF 1345
            +AF  SLD T IP  I+ AL+  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F
Sbjct: 991  KAFATSLDGTQIPSTIHEALQNSEWKKAVQDEIDALEKNGTWTITDLPGGKRPVGCKWIF 1050

Query: 1346 SLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQL 1405
            ++KYKADG+++R KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QL
Sbjct: 1051 TIKYKADGSVERFKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQL 1110

Query: 1406 DVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQ 1465
            D+KNAFLNGDL EEVYM  PPGFE    ++ VCKL+KS+YGLKQSPRAWFDRFT  V   
Sbjct: 1111 DIKNAFLNGDLEEEVYMEIPPGFEGSMTKNQVCKLRKSLYGLKQSPRAWFDRFTKAVLKL 1170

Query: 1466 GYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL 1525
            GY QG SDHTLF K S   KIA+LIVYVDDI+L+G+D  E+ +LK+ + +EFE+KDLGNL
Sbjct: 1171 GYVQGQSDHTLFVKKSHAEKIAILIVYVDDIILSGNDVKELQELKKYLSEEFEVKDLGNL 1230

Query: 1526 KYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKE 1585
            KYFLGMEVARS +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ 
Sbjct: 1231 KYFLGMEVARSSKGIVVSQRKYILDLLKETGMLGCKPVDTPMDSQKKLGTEKESAPVDRG 1290

Query: 1586 QYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRK 1625
            +YQRLVG+LIYLSHTRPDI FAVSVVSQFM +P EEHM+AV R+LRYLK TPGKGL F K
Sbjct: 1291 RYQRLVGRLIYLSHTRPDIGFAVSVVSQFMHSPTEEHMEAVYRVLRYLKMTPGKGLFFIK 1340

BLAST of CSPI04G15350 vs. NCBI nr
Match: gi|147784447|emb|CAN63881.1| (hypothetical protein VITISV_039357 [Vitis vinifera])

HSP 1 Score: 1239.9 bits (3207), Expect = 0.0e+00
Identity = 649/1326 (48.94%), Positives = 862/1326 (65.01%), Query Frame = 1

Query: 328  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 387
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S++
Sbjct: 33   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 92

Query: 388  RSILINSREPQMGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 447
             S LINS    +G+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 93   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 152

Query: 448  SFFNKLSLIWQEMDLCIELVWRDPTDDVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 507
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 153  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 212

Query: 508  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 567
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 213  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 272

Query: 568  HCKKQWHTKEQGWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 627
            +CKK  H KE  WKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 273  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 332

Query: 628  SLATLGAIVQSGIPHSFGLISIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 687
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 333  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 392

Query: 688  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 747
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 393  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 452

Query: 748  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 807
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 453  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 512

Query: 808  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 867
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 513  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 572

Query: 868  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 927
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 573  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 632

Query: 928  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 987
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 633  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 692

Query: 988  LWGDVILTATHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFSP 1047
             WG+ ILTAT+LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 693  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 752

Query: 1048 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1107
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 753  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 812

Query: 1108 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQD 1167
            E  +  +E F+E  PS  S     S   PT               E+ +P      P Q 
Sbjct: 813  E--HQVWESFLEGVPSFHSESPNPSQFAPT---------------ELSTPMPPSVQPAQH 872

Query: 1168 SEPPRDQGMENPT--EPCTKNMISENDRSNVAVLENVE-EKDSGDEIEVRIETRNNEAEQ 1227
            +  P    +++P   +P    + +EN +  +   +  E E  S    +  I++ ++  E+
Sbjct: 873  TNVPSPVTIQSPMPIQPIAPQLANENLQVYIRRRKRQELEHGSQSTYDQYIDSNSSLPEE 932

Query: 1228 --GHTGKSDEYDSSLD---IPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTI 1287
              G     +    S+D   +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T 
Sbjct: 933  NIGEDRAGEVLIPSIDDSTLPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQ 992

Query: 1288 IPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLD 1347
            +P  I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++
Sbjct: 993  VPNTIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVE 1052

Query: 1348 RHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDL 1407
            R KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL
Sbjct: 1053 RFKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDL 1112

Query: 1408 VEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTL 1467
             EEVYM  PPGFE    ++ V KLQKS+Y LKQSPRAWFDRFT  V   GY+QG +DHTL
Sbjct: 1113 EEEVYMEIPPGFEESMAKNQVXKLQKSLYXLKQSPRAWFDRFTKAVLKLGYKQGQADHTL 1172

Query: 1468 FTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARS 1527
            F K S  GK  +LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS
Sbjct: 1173 FVKKSHAGKXXILIVYVDDIILSGNDMXELQNLKKYLSEEFEVKDLGNLKYFLGMEVARS 1232

Query: 1528 KEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIY 1587
            ++GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQ LVG+LIY
Sbjct: 1233 RKGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQWLVGRLIY 1292

Query: 1588 LSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTD 1625
            LSH RPDI FAVS VSQFM +P E HM+AV RILRYLK TPGKGL FRKT+ +  E Y+D
Sbjct: 1293 LSHARPDIGFAVSAVSQFMHSPTEXHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSD 1339

The following BLAST results are available for this feature:
Match Name   E-value    Identity  Description
POLX_TOBAC   4.3e-129   32.63     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME  1.9e-113   30.11     Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M810_ARATH   4.5e-46    41.07     Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
YCH4_YEAST   6.4e-32    31.37     Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
YD22B_YEAST  3.4e-25    26.10     Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
Match Name    E-value   Identity  Description
A5AYJ3_VITVI  0.0e+00   51.57     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1
A5B7Z8_VITVI  0.0e+00   49.66     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1
A5AJR0_VITVI  0.0e+00   49.58     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1
W9SCZ3_9ROSA  0.0e+00   49.21     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Morus notabilis G...
A5BV07_VITVI  0.0e+00   48.94     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039357 PE=4 SV=1
Match Name   E-value    Identity  Description
AT4G23160.1  3.0e-104   44.91     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1  2.6e-47    41.07     ATMG00810.1 DNA/RNA polymerases superfamily protein
ATMG00820.1  7.5e-23    53.06     ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
AT1G21280.1  2.0e-15    24.64     Retrotransposon gag protein (InterPro:IPR005162)
ATMG00240.1  4.1e-13    41.46     ATMG00240.1 Gag-Pol-related retrotransposon family protein
Match Name                        E-value   Identity  Description
gi|147819777|emb|CAN76196.1|      0.0e+00   51.57     hypothetical protein VITISV_041073 [Vitis vinifera]
gi|147810393|emb|CAN59964.1|      0.0e+00   49.66     hypothetical protein VITISV_022757 [Vitis vinifera]
gi|147778986|emb|CAN62538.1|      0.0e+00   49.58     hypothetical protein VITISV_031159 [Vitis vinifera]
gi|703163467|ref|XP_010113352.1|  0.0e+00   49.21     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus notabilis]
gi|147784447|emb|CAN63881.1|      0.0e+00   48.94     hypothetical protein VITISV_039357 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf
IPR013103   RVT_2
IPR025724   GAG-pre-integrase_dom
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0090304      nucleic acid metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G15350.1  CSPI04G15350.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term        Source Description               Alignment
IPR001584  Integrase, catalytic core                            PFAM     PF00665            rve                              coord: 839..954, score: 8.7
IPR001584  Integrase, catalytic core                            PROFILE  PS50994            INTEGRASE                        coord: 837..1003, score: 22
IPR012337  Ribonuclease H-like domain                           GENE3D   G3DSA:3.30.420.10                                   coord: 833..995, score: 4.6
IPR012337  Ribonuclease H-like domain                           unknown  SSF53098           Ribonuclease H-like              coord: 836..997, score: 8.22
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727            RVT_2                            coord: 1623..1721, score: 7.8E-27
                                                                                                                             coord: 1287..1530, score: 1.3
IPR025724  GAG-pre-integrase domain                             PFAM     PF13976            gag_pre-integrs                  coord: 755..826, score: 1.2
None       No IPR available                                     PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON  coord: 1222..1534, score: 0.0
                                                                                                                             coord: 337..612, score: 0.0
                                                                                                                             coord: 785..996, score: 0.0
                                                                                                                             coord: 1136..1162, score: 0.0
                                                                                                                             coord: 1726..1880, score: 0.0
                                                                                                                             coord: 643..759, score: 0.0
                                                                                                                             coord: 1013..1116, score:
None       No IPR available                                     PFAM     PF14223            Retrotran_gag_2                  coord: 380..527, score: 6.
None       No IPR available                                     unknown  SSF56672           DNA/RNA polymerases              coord: 1640..1700, score: 2.22E-25
                                                                                                                             coord: 1730..1908, score: 2.22E-25
                                                                                                                             coord: 1286..1509, score: 3.67E-34
                                                                                                                             coord: 1539..1624, score: 3.67

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
CSPI04G15350	Csa4G303710	Cucumber (Chinese Long) v2	cpicuB181
CSPI04G15350	CsaV3_4G025710	Cucumber (Chinese Long) v3	cpicucB212
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
CSPI04G15350	Wax gourd	cpiwgoB363
CSPI04G15350	Wax gourd	cpiwgoB395
CSPI04G15350	Wild cucumber (PI 183967)	cpicpiB106
CSPI04G15350	Wild cucumber (PI 183967)	cpicpiB154
CSPI04G15350	Wild cucumber (PI 183967)	cpicpiB159
CSPI04G15350	Cucumber (Gy14) v1	cgycpiB055
CSPI04G15350	Cucumber (Gy14) v1	cgycpiB195
CSPI04G15350	Cucumber (Gy14) v1	cgycpiB279
CSPI04G15350	Cucurbita maxima (Rimu)	cmacpiB267
CSPI04G15350	Cucurbita maxima (Rimu)	cmacpiB677
CSPI04G15350	Cucurbita maxima (Rimu)	cmacpiB851
CSPI04G15350	Cucurbita maxima (Rimu)	cmacpiB876
CSPI04G15350	Cucurbita moschata (Rifu)	cmocpiB253
CSPI04G15350	Cucurbita moschata (Rifu)	cmocpiB669
CSPI04G15350	Cucurbita moschata (Rifu)	cmocpiB835
CSPI04G15350	Cucurbita moschata (Rifu)	cmocpiB856
CSPI04G15350	Cucumber (Chinese Long) v2	cpicuB177
CSPI04G15350	Melon (DHL92) v3.5.1	cpimeB287
CSPI04G15350	Melon (DHL92) v3.5.1	cpimeB322
CSPI04G15350	Watermelon (Charleston Gray)	cpiwcgB283
CSPI04G15350	Watermelon (Charleston Gray)	cpiwcgB316
CSPI04G15350	Watermelon (97103) v1	cpiwmB293
CSPI04G15350	Watermelon (97103) v1	cpiwmB345
CSPI04G15350	Cucurbita pepo (Zucchini)	cpecpiB065
CSPI04G15350	Cucurbita pepo (Zucchini)	cpecpiB499
CSPI04G15350	Cucurbita pepo (Zucchini)	cpecpiB489
CSPI04G15350	Cucurbita pepo (Zucchini)	cpecpiB620
CSPI04G15350	Cucurbita pepo (Zucchini)	cpecpiB872
CSPI04G15350	Bottle gourd (USVL1VR-Ls)	cpilsiB247
CSPI04G15350	Bottle gourd (USVL1VR-Ls)	cpilsiB255
CSPI04G15350	Bottle gourd (USVL1VR-Ls)	cpilsiB277
CSPI04G15350	Melon (DHL92) v3.6.1	cpimedB278
CSPI04G15350	Melon (DHL92) v3.6.1	cpimedB282
CSPI04G15350	Melon (DHL92) v3.6.1	cpimedB313
CSPI04G15350	Cucumber (Gy14) v2	cgybcpiB124
CSPI04G15350	Cucumber (Gy14) v2	cgybcpiB170
CSPI04G15350	Cucumber (Gy14) v2	cgybcpiB277
CSPI04G15350	Silver-seed gourd	carcpiB0183
CSPI04G15350	Silver-seed gourd	carcpiB0591
CSPI04G15350	Silver-seed gourd	carcpiB0613
CSPI04G15350	Silver-seed gourd	carcpiB1036
CSPI04G15350	Cucumber (Chinese Long) v3	cpicucB233
CSPI04G15350	Cucumber (Chinese Long) v3	cpicucB209
CSPI04G15350	Watermelon (97103) v2	cpiwmbB281
CSPI04G15350	Watermelon (97103) v2	cpiwmbB307