CSPI02G16290 (gene) Wild cucumber (PI 183967)

NameCSPI02G16290
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr2 : 15723019 .. 15728545 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAGACTAATAATATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCAACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTACAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGTAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTTTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCATATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCATATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGCATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGAGGGGGAGTGTTGGAATTCTTTCCTTTTTTTCTAGGCTTTTTCCTTGTTAGAATCCTAGAATAATTATGGAAAGATTATGGGAATGTATTCCTTATTTTCCTAAATCTTTTCCTTTTTTATTCCATTGTGTACTCTATTTATTCTCCCTTGTACCTATTGCTTTATTCATTAGAAAATAATAACAACACAAACAATCGTGGTTTTTCTCCCGGTACTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATAATAACTTTATGAAAATGGGCTTTATGAAGGTTTCGTGGAGGGAGAAGAGAAGGTTCACATCCCTCTCGTCCAATATGCTGACGACACTTTCCTCTTTTGCAAATATGATCAAAAACCCAATGGATCAGTAGCTGCACATTGGGACCATTCCACCCTTTCCTGGCCTATAATTTTTAGAAGACTTCTTAAAGAAGAAATTTCAGCAATTGTTTCACCTCCTTTCGCAAAGGAAGGTGGTAGAATCTATGGACAGAAGGAGTTGGTCTCCAAAACCATCCAGTAAGTTTTCGGTTAAGTCTCTTGTGAATCACTTTTCAGCATCTTCTCCAGTAACTAGAGAGACCTACAAAACCTTGGGGGAATCCAGTAGCATTAGAAGAGTCAACATTTTAGTTTGGATTATGCTTTTGGGGGTTCTAAACTGCTCTTTAGTCATGCAAAGGAAGCTCCCAAATAGTTGTCTTCTGCACTCTGTTTGCCTTCCATGAAGAAGAAATTTTGCTGCATCTGTTTTTCTCCTGCCATTATTCAGCCAAGTGTTGGGGCAGTTTTCTCTCTTTATTTGAAGTAAGCTGGGTTTTTGGCAGAAGATTCAGCTCCAATGTAAGGCAGATCATTTTGAGCCCATTCTTGAAAAAAGGCCACGGTTACTTTGGACTAGTTTGTCCATATCACTGCTTGCAGAAATATGGATTGAACGAAATACACCTAGGAGGAGGTTGTTTGGTGAGTGTATCAATTCTTTGTGTTTTCACTGATGTATTTCTTTGCATTATGCATTTTATGAATTTTAATTTTCTATTAGTTGTCTGTTTGGACATGATGAGGACGCCAAGGAGGTGTCAACCTAGTTGAGATGTTTGGATGCGTCACTTAG

mRNA sequence

ATGGGAAGACTAATAATATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCAACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTACAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAAGAACTAGTTTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCATATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCATATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGCATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGGAGAAGAGAAGGTTCACATCCCTCTCGTCCAATATGCTGACGACACTTTCCTCTTTTGCAAATATGATCAAAAACCCAATGGATCAGTAGCTGCACATTGGGACCATTCCACCCTTTCCTGGCCTATAATTTTTAGAAGACTTCTTAAAGAAGAAATTTCAGCAATTGTTTCACCTCCTTTCGCAAAGGAAGGTGGTAGAATCTATGGACAGAAGGAGTTGGTCTCCAAAACCATCCATAAGCTGGGTTTTTGGCAGAAGATTCAGCTCCAATGTAAGGCAGATCATTTTGAGCCCATTCTTGAAAAAAGGCCACGGTTACTTTGGACTAGTTTGTCCATATCACTGCTTGCAGAAATATGGATTGAACGAAATACACCTAGGAGGAGGTTGTTTGGACGCCAAGGAGGTGTCAACCTAGTTGAGATGTTTGGATGCGTCACTTAG

Coding sequence (CDS)

ATGGGAAGACTAATAATATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCAACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTACAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAAGAACTAGTTTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCATATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCATATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGCATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGGAGAAGAGAAGGTTCACATCCCTCTCGTCCAATATGCTGACGACACTTTCCTCTTTTGCAAATATGATCAAAAACCCAATGGATCAGTAGCTGCACATTGGGACCATTCCACCCTTTCCTGGCCTATAATTTTTAGAAGACTTCTTAAAGAAGAAATTTCAGCAATTGTTTCACCTCCTTTCGCAAAGGAAGGTGGTAGAATCTATGGACAGAAGGAGTTGGTCTCCAAAACCATCCATAAGCTGGGTTTTTGGCAGAAGATTCAGCTCCAATGTAAGGCAGATCATTTTGAGCCCATTCTTGAAAAAAGGCCACGGTTACTTTGGACTAGTTTGTCCATATCACTGCTTGCAGAAATATGGATTGAACGAAATACACCTAGGAGGAGGTTGTTTGGACGCCAAGGAGGTGTCAACCTAGTTGAGATGTTTGGATGCGTCACTTAG
BLAST of CSPI02G16290 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 1.0e-157
Identity = 379/1137 (33.33%), Postives = 586/1137 (51.54%), Query Frame = 1

Query: 312  KNPWILDSGATDHLTGSSEHFVSYIPCAGN-ETIRIADGSLAPIAGKG----KISPCAGL 371
            ++ W++D+ A+ H T   + F  Y+  AG+  T+++ + S + IAG G    K +    L
Sbjct: 291  ESEWVVDTAASHHATPVRDLFCRYV--AGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTL 350

Query: 372  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRG-LYLL 431
             L +V HVP L  NL+S   +  +   ++ F         L+ G ++     +RG LY  
Sbjct: 351  VLKDVRHVPDLRMNLISGIALDRD-GYESYFANQKWR---LTKGSLVIAKGVARGTLYRT 410

Query: 432  DDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH-LFSKVEMTTLS- 491
            + +     +             E    LWH R+GH + + ++ L    L S  + TT+  
Sbjct: 411  NAEICQGELNAAQ--------DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKP 470

Query: 492  CDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWV 551
            CD C+  KQHRVSF +   +      LV+SDV GP +I +  G ++FVTFIDD +R  WV
Sbjct: 471  CDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWV 530

Query: 552  YLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCA 611
            Y++  K +V  +FQ F+  +E +  +K+  LRSDNG E+ +    E+ +S GI H+ +  
Sbjct: 531  YILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVP 590

Query: 612  YTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLD 671
             TPQ NGVAER NR ++E  RS++    LP   WG+A+ TA +LINR PS  L  + P  
Sbjct: 591  GTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIP-- 650

Query: 672  CLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHP 731
               E   + + VS   L+VFGC A+ H     +TK   ++  C+F+GY   + GY+ + P
Sbjct: 651  ---ERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDP 710

Query: 732  PSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPT 791
              +K   + DV F E                 E     +  E    V + IIP+ + +P 
Sbjct: 711  VKKKVIRSRDVVFRE----------------SEVRTAADMSE---KVKNGIIPNFVTIP- 770

Query: 792  NQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVA 851
                              S ++ P +    ++   +QG E P E     +I + ++ +  
Sbjct: 771  ------------------STSNNPTSAESTTDEVSEQG-EQPGE-----VIEQGEQLDEG 830

Query: 852  VLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLD--IPIALRKGTRSCTKHPI 911
            V E VE    G+E    +        +     S EY    D   P +L++       HP 
Sbjct: 831  V-EEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKE----VLSHPE 890

Query: 912  CNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTL 971
             N +         +A    ++S +     Y  ++ P+ K  +                  
Sbjct: 891  KNQL--------MKAMQEEMES-LQKNGTYKLVELPKGKRPLK----------------- 950

Query: 972  PKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLL 1031
                    CKWVF LK   D  L R+KARLV KGF Q  GID+ E FSPV K+ +IR +L
Sbjct: 951  --------CKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTIL 1010

Query: 1032 SVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPR 1091
            S+A + D  + QLDVK AFL+GDL EE+YM  P GFE    +H VCKL KS+YGLKQ+PR
Sbjct: 1011 SLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPR 1070

Query: 1092 AWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQR 1151
             W+ +F +F+KSQ Y + +SD  ++ K        +L++YVDD+++ G D+  I++LK  
Sbjct: 1071 QWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGD 1130

Query: 1152 MGDEFEIKDLGNLKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFN 1211
            +   F++KDLG  +  LGM++ R +    + +SQ KYI  +L    M   +P  TP+  +
Sbjct: 1131 LSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGH 1190

Query: 1212 CKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVSQFMQTPNEEHM 1271
             KL         +++  + K  Y   VG L+Y +  TRPDI+ AV VVS+F++ P +EH 
Sbjct: 1191 LKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHW 1250

Query: 1272 KAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTW 1331
            +AV  ILRYL+ T G  L F  +D   ++ YTD+D AG + +RKS++GY     G  ++W
Sbjct: 1251 EAVKWILRYLRGTTGDCLCFGGSD-PILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISW 1310

Query: 1332 RSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTD--LHQECETPLKLFCDNKAAISIA 1391
            +SK Q  VA S+ EAEY A +    E IWL++ L +  LHQ+      ++CD+++AI ++
Sbjct: 1311 QSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQK---EYVVYCDSQSAIDLS 1321

Query: 1392 NNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCGE 1427
             N + H RTKH+++  H+I+E +D  S+ +  I +++  AD+LTK + R  F+ C E
Sbjct: 1371 KNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFELCKE 1321

BLAST of CSPI02G16290 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 535.0 bits (1377), Expect = 2.7e-150
Identity = 365/1146 (31.85%), Postives = 577/1146 (50.35%), Query Frame = 1

Query: 315  WILDSGATDHLTGSSEHFVSYIPCAGNETIRIA-DGSLAPIAGKG--KISPCAGLSLHNV 374
            ++LDSGA+DHL      +   +       I +A  G       +G  ++     ++L +V
Sbjct: 289  FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDV 348

Query: 375  LHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSS 434
            L   + + NL+S+ ++              +S +   SG  I       GL ++ +    
Sbjct: 349  LFCKEAAGNLMSVKRLQEA----------GMSIEFDKSGVTIS----KNGLMVVKNSGML 408

Query: 435  SSIPRTSLLS-SYFTTSEQDCMLWHFRLGHPNFQYM-----KHLFPH--LFSKVEMTTLS 494
            +++P  +  + S     + +  LWH R GH +   +     K++F    L + +E++   
Sbjct: 409  NNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEI 468

Query: 495  CDVCIQAKQHRVSFPSQPYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLT 554
            C+ C+  KQ R+ F     K    +P  +VHSDV GP    T   K +FV F+D  T   
Sbjct: 469  CEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYC 528

Query: 555  WVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNS 614
              YLI  KS+V SMFQ+F    E  F+ K+  L  DNGRE+ ++ + +F   KGI +  +
Sbjct: 529  VTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLT 588

Query: 615  CAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRIL--HLQ 674
              +TPQ NGV+ER  R + E AR+++    L    WG+A+LTA +LINR+PSR L    +
Sbjct: 589  VPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSK 648

Query: 675  TPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYK 734
            TP +      P  +H     LRVFG T YVH     Q KF  ++   +FVGY P+  G+K
Sbjct: 649  TPYEMWHNKKPYLKH-----LRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEPN--GFK 708

Query: 735  CFHPPSRKYFVTMDVTFCEDRPY------FPVSHLQGESVSEESNNTFEFIEPTPSVVSN 794
             +   + K+ V  DV   E          F    L+    SE  N    F   +  ++  
Sbjct: 709  LWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKN----FPNDSRKIIQT 768

Query: 795  IIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTK-N 854
              P+      N       + ++ K+            +  +E P      N ++ C    
Sbjct: 769  EFPNESKECDN-----IQFLKDSKESENKNFPNDSRKIIQTEFP------NESKECDNIQ 828

Query: 855  MISENDRSNVAVLENVEEKDSGDEI-EVRIETRNNEAEQGHTGKS------DEYDSSLDI 914
             + ++  SN   L   +++   D + E +     NE+ +  T +       D    +  I
Sbjct: 829  FLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGI 888

Query: 915  PIALRKGTRSCTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVM 974
             I  R+  R  TK  I      +SL+     A T   D      +I        W+ A+ 
Sbjct: 889  EIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAIN 948

Query: 975  EEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDY 1034
             E+ A + N+TW I   P+    V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY
Sbjct: 949  TELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDY 1008

Query: 1035 SETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH 1094
             ETF+PVA++++ R +LS+ +  +  ++Q+DVK AFLNG L EE+YM  P G       +
Sbjct: 1009 EETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCN-SDN 1068

Query: 1095 VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYV 1154
            VCKL K+IYGLKQ+ R WF+ F   +K   +     D  ++  +   G I     +++YV
Sbjct: 1069 VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIY--ILDKGNINENIYVLLYV 1128

Query: 1155 DDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLT 1214
            DD+V+   D   ++  K+ + ++F + DL  +K+F+G+ +   ++ I +SQ  Y+  +L+
Sbjct: 1129 DDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILS 1188

Query: 1215 ETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSV 1274
            +  M  C    TP+  + N +L NSD+         + L+G L+Y+   TRPD++ AV++
Sbjct: 1189 KFNMENCNAVSTPLPSKINYELLNSDEDC---NTPCRSLIGCLMYIMLCTRPDLTTAVNI 1248

Query: 1275 VSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKS 1334
            +S++    N E  + + R+LRYLK T    L+F+K       I  Y DSDWAGS +DRKS
Sbjct: 1249 LSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKS 1308

Query: 1335 TSGYCTFVWG-NLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQECETP 1394
            T+GY   ++  NL+ W +K+Q+ VA SS EAEY A+   + E +WL+ +LT ++ + E P
Sbjct: 1309 TTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENP 1368

Query: 1395 LKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKG 1422
            +K++ DN+  ISIANNP  H R KH++I  HF +E++ +  IC+ YIP+  Q+AD+ TK 
Sbjct: 1369 IKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKP 1391

BLAST of CSPI02G16290 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 1.4e-45
Identity = 91/224 (40.62%), Postives = 135/224 (60.27%), Query Frame = 1

Query: 1117 LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1176
            L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1177 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1236
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1237 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1296
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1297 TSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW 1341
            T+G+CTF+  N+++W +K+Q  V+RSS E EYRA++L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI02G16290 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 165.2 bits (417), Expect = 5.6e-39
Identity = 102/309 (33.01%), Postives = 169/309 (54.69%), Query Frame = 1

Query: 1034 LDVKNAFLNGDLVEEVYMSPPPGFEAQFG-QHVCKLQKSIYGLKQSPRAWFDRFTTFVKS 1093
            +DV  AFLN  + E +Y+  PPGF  +    +V +L   +YGLKQ+P  W +     +K 
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1094 QGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGN 1153
             G+ +   +H L+ + +  G I +  VYVDD+++         ++KQ +   + +KDLG 
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIA-VYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 120

Query: 1154 LKYFLGMEVARSKEG-ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVD 1213
            +  FLG+ + +S  G I++S + YI    +E+ +   + T TP+  +  L  +      D
Sbjct: 121  VDKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 180

Query: 1214 KEQYQRLVGKLIYLSHT-RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLM 1273
               YQ +VG+L++ ++T RPDIS+ VS++S+F++ P   H+++  R+LRYL +T    L 
Sbjct: 181  ITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLK 240

Query: 1274 FRKTDRKTIEAYTDSDWAGSVVD-RKSTSGYCTFVWGNLVTWRSKK-QSVVARSSAEAEY 1333
            +R   +  +  Y D+   G++ D   ST GY T + G  VTW SKK + V+   S EAEY
Sbjct: 241  YRSGSQLALTVYCDAS-HGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEY 300

Query: 1334 RAMSLGICE 1338
               S  + E
Sbjct: 301  ITASETVME 307

BLAST of CSPI02G16290 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 8.1e-38
Identity = 130/459 (28.32%), Postives = 215/459 (46.84%), Query Frame = 1

Query: 987  HKARLVAKGFTQ---TYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNG 1046
            +KAR+V +G TQ   TY +  +E+ +     N I++ L +A N++  +  LD+ +AFL  
Sbjct: 1336 YKARIVCRGDTQSPDTYSVITTESLNH----NHIKIFLMIANNRNMFMKTLDINHAFLYA 1395

Query: 1047 DLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHT 1106
             L EE+Y+  P          V KL K++YGLKQSP+ W D    ++   G +       
Sbjct: 1396 KLEEEIYIPHPHDRRC-----VVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPG 1455

Query: 1107 LFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL------KYFL 1166
            L+    K   IA   VYVDD V+   ++  + +   ++   FE+K  G L         L
Sbjct: 1456 LYQTEDKNLMIA---VYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDIL 1515

Query: 1167 GMEVARSKE--GISVSQRKYI--LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKE 1226
            GM++  +K    I ++ + +I  +D      +   R +  P     K+    D + + +E
Sbjct: 1516 GMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEE 1575

Query: 1227 QY-------QRLVGKLIYLSH-TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTP 1286
            ++       Q+L+G+L Y+ H  R DI+FAV  V++ +  P+E     + +I++YL    
Sbjct: 1576 EFRQGVLKLQQLLGELNYVRHKCRYDINFAVKKVARLVNYPHERVFYMIYKIIQYLVRYK 1635

Query: 1287 GKGLMFRK---TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARS 1346
              G+ + +    D+K I A TD+   GS  D +S  G   +   N+    S K +    S
Sbjct: 1636 DIGIHYDRDCNKDKKVI-AITDAS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVS 1695

Query: 1347 SAEAEYRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVE 1406
            S EAE  A+  G  +   L+  L +L +     + +  D+K AI   N   Q  + K   
Sbjct: 1696 STEAELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTW 1755

Query: 1407 IDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNF 1422
            I    IKEK+   SI +  I     +AD+LTK +   +F
Sbjct: 1756 IKTEIIKEKIKEKSIKLLKITGKGNIADLLTKPVSASDF 1780

BLAST of CSPI02G16290 vs. TrEMBL
Match: A5AYJ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1)

HSP 1 Score: 1520.8 bits (3936), Expect = 0.0e+00
Identity = 783/1476 (53.05%), Postives = 1003/1476 (67.95%), Query Frame = 1

Query: 19   SMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSILR 78
            S + L+  KLNG NY  W+QSVK+ ++GR K   L GE+ +P   DP+ + W+  + +  
Sbjct: 35   SSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFRELVA- 94

Query: 79   SILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHE---------- 138
                      IGKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ +          
Sbjct: 95   ----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDREVTT 154

Query: 139  --------------CKELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQR 198
                          C E  W  P D V++ + EENDR+Y FLA LN   D VRG ILG++
Sbjct: 155  YYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRILGRK 214

Query: 199  PIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIPVCE 258
            P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P C+
Sbjct: 215  PLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---PWCD 274

Query: 259  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEP--PQQSDPHKNQTDLS 318
            HCKK WHTK  CWK+HG+P   KK+  +D    GRA+ + SA+   PQ +    N T   
Sbjct: 275  HCKKPWHTKGTCWKIHGKPQNFKKKNGSD----GRAFQTMSADSQGPQINSEKPNFTKEQ 334

Query: 319  LATLGAIVQSG---------------IPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFV 378
            L+ L  + QS                +  +   +  +   PWI+DSGATDH+TGSS+ F 
Sbjct: 335  LSHLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQIFS 394

Query: 379  SYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNC 438
            SY PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +  C
Sbjct: 395  SYKPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQDHQC 454

Query: 439  KAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCML 498
            +A F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D +L
Sbjct: 455  QANFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDDIIL 514

Query: 499  WHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHS 558
            WH+RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L+HS
Sbjct: 515  WHYRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSLIHS 574

Query: 559  DVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAI 618
            DVWGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  KI +
Sbjct: 575  DVWGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTKIQV 634

Query: 619  LRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLP 678
             RSDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T +P
Sbjct: 635  FRSDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTTKVP 694

Query: 679  SYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG 738
             YLWG+AILTA +LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH   
Sbjct: 695  KYLWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVHIHD 754

Query: 739  PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 798
             N+ K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQGES 
Sbjct: 755  HNRGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQGEST 814

Query: 799  SEESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNHKKE 858
            SE+S+          N    +EP+ S   V  NI    +    + + + KT      K  
Sbjct: 815  SEDSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGKKNG 874

Query: 859  VGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGDEIE 918
            V +  S   +    S         N     TKN  +    R      E+  +   G E E
Sbjct: 875  VLNIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGHESE 934

Query: 919  VRIETRNNEAEQGH-----------TGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSY 978
            +R E  ++E    +           +  + E    L+IPIA RKG RSCTKHP+ NY+SY
Sbjct: 935  LREEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNYMSY 994

Query: 979  DSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT 1038
             +LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG  T
Sbjct: 995  KNLSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKGKTT 1054

Query: 1039 VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNK 1098
            VGCKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A N 
Sbjct: 1055 VGCKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIAANL 1114

Query: 1099 DWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFT 1158
            DWPL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+RFT
Sbjct: 1115 DWPLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFERFT 1174

Query: 1159 TFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEI 1218
             FVK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  EFEI
Sbjct: 1175 QFVKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALEFEI 1234

Query: 1219 KDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQ 1278
            KDLG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG+++D 
Sbjct: 1235 KDLGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDTNDG 1294

Query: 1279 VPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGK 1338
              V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKSTPGK
Sbjct: 1295 NLVNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKSTPGK 1354

Query: 1339 GLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAE 1398
            GL F+K+++KTIEAYTD+DWAGSV DR+STSGYCT++WGNLVTWRSKKQSV ARSSAEAE
Sbjct: 1355 GLFFKKSEQKTIEAYTDADWAGSVTDRRSTSGYCTYIWGNLVTWRSKKQSVXARSSAEAE 1414

Query: 1399 YRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHF 1424
            YRAM+ G+CE +WL+K+L +L +  E P+KL+CDNKAAISIA+NPVQHDRTKHVEIDRHF
Sbjct: 1415 YRAMAHGVCEILWLKKILEELKRPLEMPMKLYCDNKAAISIAHNPVQHDRTKHVEIDRHF 1474

BLAST of CSPI02G16290 vs. TrEMBL
Match: A5B7Z8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1)

HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 738/1457 (50.65%), Postives = 961/1457 (65.96%), Query Frame = 1

Query: 16   AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDS 75
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 76   ILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE---- 135
            ++ S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++    
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 136  --------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHIL 195
                                  W+   D   Y  I E  R++ F  GLN + D VRG I+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 196  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 255
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 256  CEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 315
            C++CKK  H KE CWKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 316  DLSLATLGAIVQSGIPHSFGLVSI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 375
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 376  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 435
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 436  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 495
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 496  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 555
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 556  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 615
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 616  ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 675
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 676  PSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 735
            P+Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 736  FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 795
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 796  SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 855
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 856  APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA 915
             P+Q   P          +   +N+     R     LE+  +   G  I+          
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 916  EQGHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTII 975
             +   G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 976  PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1035
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1036 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1095
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1096 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1155
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1156 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1215
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1216 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1275
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1276 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1335
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1345

Query: 1336 DWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL 1395
            DWAG+++DR+STSGYC+FVWGNLVT RSKKQSVVARSSAEAEYRA++ GICE IW+++VL
Sbjct: 1346 DWAGNIIDRRSTSGYCSFVWGNLVTXRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVL 1405

Query: 1396 TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSS 1423
            ++L Q   +P+ + CDN+AAISIA NPV HD TKHVEIDRHFI EK+ S ++ + Y+P+ 
Sbjct: 1406 SELGQTSSSPILMMCDNQAAISIAKNPVHHDXTKHVEIDRHFITEKVTSETVKLNYVPTK 1465

BLAST of CSPI02G16290 vs. TrEMBL
Match: A5AJR0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1)

HSP 1 Score: 1404.4 bits (3634), Expect = 0.0e+00
Identity = 735/1455 (50.52%), Postives = 958/1455 (65.84%), Query Frame = 1

Query: 18   SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 77
            SS   ++G KLNG+NY  WSQSV + + G+ K  + TGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYXTGEAXMPETTEPXFRKWKIENSMI 87

Query: 78   RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE------ 137
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++      
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 138  ------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQ 197
                                W+   D   Y +I E  R++ F  GLN + D VRG I+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 198  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 257
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 258  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 317
            + KK  H KE CWKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 318  SLATLGAIVQSGIPHSFGLV-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 377
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 378  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 437
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 438  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 497
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 498  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 557
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 558  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 617
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 618  RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 677
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 678  YLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFG 737
            Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 738  PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 797
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 798  SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 857
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 858  VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQ 917
            +Q   P          +   +N+     R     LE+  +   G  I+           +
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 918  GHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPK 977
               G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 978  DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1037
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1038 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1097
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1098 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1157
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1158 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1217
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1218 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1277
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1278 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1337
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1347

Query: 1338 AGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTD 1397
            AG+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEY A++ GICE  W+++VL++
Sbjct: 1348 AGNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYXALAQGICEGXWIKRVLSE 1407

Query: 1398 LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQ 1423
            L Q   +P+ + CDN+A ISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q
Sbjct: 1408 LGQTSSSPILMMCDNQAXISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQ 1466

BLAST of CSPI02G16290 vs. TrEMBL
Match: A5BJ12_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024789 PE=4 SV=1)

HSP 1 Score: 1397.5 bits (3616), Expect = 0.0e+00
Identity = 728/1454 (50.07%), Postives = 949/1454 (65.27%), Query Frame = 1

Query: 18   SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 77
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 78   RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE------ 137
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++      
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 138  ------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQ 197
                                W+   D   Y +I E  R++ F  GLN + D VRG I+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIMEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 198  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 257
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 258  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 317
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 318  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 377
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTTNRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 378  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 437
            DGS         I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGS---------IKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 438  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 497
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELRSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 498  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 557
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R+ +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRIVYPQIPYKPSTVFSLVHSDV 567

Query: 558  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 617
            WGPS+I   SG RWFVTF+DDHT +TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTWVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 618  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 677
            SDN +E+   +LS +L +  I+H +SC  TPQQN VAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHDIIHISSCVDTPQQNRVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 678  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 737
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA++H +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFIHVYPQ 747

Query: 738  NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 797
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 798  EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 857
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 858  QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 917
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYLRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 918  HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 977
              G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 978  IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1037
            I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYK DG+++R KA
Sbjct: 988  IQEASKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKTDGSVERFKA 1047

Query: 1038 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1097
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1098 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1157
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVKK 1167

Query: 1158 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1217
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYF GMEVA+S++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLSEEFEVKDLGNLKYFXGMEVAKSRKGI 1227

Query: 1218 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1277
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1278 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1337
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1338 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1397
            G+++DR STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW++ VL++L
Sbjct: 1348 GNIIDRWSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKXVLSEL 1407

Query: 1398 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1423
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQXSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1457

BLAST of CSPI02G16290 vs. TrEMBL
Match: A5B7A7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025045 PE=4 SV=1)

HSP 1 Score: 1383.2 bits (3579), Expect = 0.0e+00
Identity = 729/1454 (50.14%), Postives = 945/1454 (64.99%), Query Frame = 1

Query: 18   SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 77
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 78   RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE------ 137
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++      
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 138  ------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQ 197
                                W+   D   Y +I E  R++ F  GLN + D VRG I+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 198  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 257
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 258  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 317
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 318  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 377
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 378  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 437
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 438  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 497
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 498  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 557
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 567

Query: 558  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 617
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 618  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 677
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 678  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 737
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 747

Query: 738  NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 797
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 798  EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 857
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 858  QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 917
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 918  HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 977
              G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 978  IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1037
            I  ALK  EWK AV +E+ ALEKN TW I  LP G + +            D + D  KA
Sbjct: 988  IQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPM------------DQSKD-FKA 1047

Query: 1038 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1097
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1098 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1157
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG  DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQXDHTLFVKK 1167

Query: 1158 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1217
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLLEEFEVKDLGNLKYFLGMEVARSRKGI 1227

Query: 1218 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1277
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSKKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1278 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1337
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1338 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1397
            G+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW+++VL++L
Sbjct: 1348 GNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVLSEL 1407

Query: 1398 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1423
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQTSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1453

BLAST of CSPI02G16290 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 478.8 bits (1231), Expect = 1.3e-134
Identity = 230/502 (45.82%), Postives = 335/502 (66.73%), Query Frame = 1

Query: 895  SCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS 954
            S T H I  ++SY+ +SP + +F   +     P     A ++  W  A+ +E+ A+E   
Sbjct: 54   SLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTH 113

Query: 955  TWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL 1014
            TW+ICTLP   K +GCKWV+ +KY +DGT++R+KARLVAKG+TQ  GID+ ETFSPV KL
Sbjct: 114  TWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKL 173

Query: 1015 NTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-----VCKLQ 1074
             +++++L+++   ++ L+QLD+ NAFLNGDL EE+YM  PPG+ A+ G       VC L+
Sbjct: 174  TSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLK 233

Query: 1075 KSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGD 1134
            KSIYGLKQ+ R WF +F+  +   G+ Q HSDHT F K++ T  + VL VYVDDI++  +
Sbjct: 234  KSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL-VYVDDIIICSN 293

Query: 1135 DQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCR 1194
            + A + +LK ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+
Sbjct: 294  NDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 353

Query: 1195 PTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEE 1254
            P+  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + P   
Sbjct: 354  PSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLA 413

Query: 1255 HMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLV 1314
            H +AV +IL Y+K T G+GL +       ++ ++D+ +      R+ST+GYC F+  +L+
Sbjct: 414  HQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLI 473

Query: 1315 TWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIA 1374
            +W+SKKQ VV++SSAEAEYRA+S    E +WL +   +L      P  LFCDN AAI IA
Sbjct: 474  SWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIA 533

Query: 1375 NNPVQHDRTKHVEIDRHFIKEK 1392
             N V H+RTKH+E D H ++E+
Sbjct: 534  TNAVFHERTKHIESDCHSVRER 554

BLAST of CSPI02G16290 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 187.2 bits (474), Expect = 7.8e-47
Identity = 91/224 (40.62%), Postives = 135/224 (60.27%), Query Frame = 1

Query: 1117 LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1176
            L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1177 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1236
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1237 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1296
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1297 TSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW 1341
            T+G+CTF+  N+++W +K+Q  V+RSS E EYRA++L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI02G16290 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 107.8 bits (268), Expect = 6.0e-23
Identity = 52/98 (53.06%), Postives = 64/98 (65.31%), Query Frame = 1

Query: 927  PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 986
            PK +  ALK P W  A+ EE+ AL +N TW +   P     +GCKWVF  K  +DGTLDR
Sbjct: 28   PKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDR 87

Query: 987  HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVA 1025
             KARLVAKGF Q  GI + ET+SPV +  TIR +L+VA
Sbjct: 88   LKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI02G16290 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 75.5 bits (184), Expect = 3.3e-13
Identity = 34/82 (41.46%), Postives = 53/82 (64.63%), Query Frame = 1

Query: 1223 IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAY 1282
            +YL+ TRPD++FAV+ +SQF        M+AV ++L Y+K T G+GL +  T    ++A+
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1283 TDSDWAGSVVDRKSTSGYCTFV 1305
             DSDWA     R+S +G+C+ V
Sbjct: 61   ADSDWASCPDTRRSVTGFCSLV 82

BLAST of CSPI02G16290 vs. TAIR10
Match: AT1G21280.1 (AT1G21280.1 Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 54.7 bits (130), Expect = 6.0e-07
Identity = 27/117 (23.08%), Postives = 62/117 (52.99%), Query Frame = 1

Query: 10  YLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERY 69
           YL   +   S + +     + +NY +W    +  L   +KF F+ G +P+P P  P  + 
Sbjct: 19  YLPPDIHHPSDFSIQKLSKDEDNYVAWKIRFRSFLRVTKKFGFIDGTLPKPDPFSPLYQP 78

Query: 70  WKAEDSILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQV 127
           W+  ++++   L+NSM  ++ + +++A TA  +W+  + ++    +  ++Y LR+++
Sbjct: 79  WEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRRVFVPCVDL-KIYQLRRRL 134

BLAST of CSPI02G16290 vs. NCBI nr
Match: gi|147819777|emb|CAN76196.1| (hypothetical protein VITISV_041073 [Vitis vinifera])

HSP 1 Score: 1520.8 bits (3936), Expect = 0.0e+00
Identity = 783/1476 (53.05%), Postives = 1003/1476 (67.95%), Query Frame = 1

Query: 19   SMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSILR 78
            S + L+  KLNG NY  W+QSVK+ ++GR K   L GE+ +P   DP+ + W+  + +  
Sbjct: 35   SSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFRELVA- 94

Query: 79   SILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHE---------- 138
                      IGKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ +          
Sbjct: 95   ----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDREVTT 154

Query: 139  --------------CKELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQR 198
                          C E  W  P D V++ + EENDR+Y FLA LN   D VRG ILG++
Sbjct: 155  YYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRILGRK 214

Query: 199  PIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIPVCE 258
            P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P C+
Sbjct: 215  PLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---PWCD 274

Query: 259  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEP--PQQSDPHKNQTDLS 318
            HCKK WHTK  CWK+HG+P   KK+  +D    GRA+ + SA+   PQ +    N T   
Sbjct: 275  HCKKPWHTKGTCWKIHGKPQNFKKKNGSD----GRAFQTMSADSQGPQINSEKPNFTKEQ 334

Query: 319  LATLGAIVQSG---------------IPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFV 378
            L+ L  + QS                +  +   +  +   PWI+DSGATDH+TGSS+ F 
Sbjct: 335  LSHLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQIFS 394

Query: 379  SYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNC 438
            SY PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +  C
Sbjct: 395  SYKPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQDHQC 454

Query: 439  KAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCML 498
            +A F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D +L
Sbjct: 455  QANFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDDIIL 514

Query: 499  WHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHS 558
            WH+RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L+HS
Sbjct: 515  WHYRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSLIHS 574

Query: 559  DVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAI 618
            DVWGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  KI +
Sbjct: 575  DVWGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTKIQV 634

Query: 619  LRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLP 678
             RSDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T +P
Sbjct: 635  FRSDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTTKVP 694

Query: 679  SYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG 738
             YLWG+AILTA +LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH   
Sbjct: 695  KYLWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVHIHD 754

Query: 739  PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 798
             N+ K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQGES 
Sbjct: 755  HNRGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQGEST 814

Query: 799  SEESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNHKKE 858
            SE+S+          N    +EP+ S   V  NI    +    + + + KT      K  
Sbjct: 815  SEDSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGKKNG 874

Query: 859  VGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGDEIE 918
            V +  S   +    S         N     TKN  +    R      E+  +   G E E
Sbjct: 875  VLNIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGHESE 934

Query: 919  VRIETRNNEAEQGH-----------TGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSY 978
            +R E  ++E    +           +  + E    L+IPIA RKG RSCTKHP+ NY+SY
Sbjct: 935  LREEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNYMSY 994

Query: 979  DSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT 1038
             +LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG  T
Sbjct: 995  KNLSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKGKTT 1054

Query: 1039 VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNK 1098
            VGCKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A N 
Sbjct: 1055 VGCKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIAANL 1114

Query: 1099 DWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFT 1158
            DWPL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+RFT
Sbjct: 1115 DWPLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFERFT 1174

Query: 1159 TFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEI 1218
             FVK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  EFEI
Sbjct: 1175 QFVKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALEFEI 1234

Query: 1219 KDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQ 1278
            KDLG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG+++D 
Sbjct: 1235 KDLGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDTNDG 1294

Query: 1279 VPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGK 1338
              V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKSTPGK
Sbjct: 1295 NLVNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKSTPGK 1354

Query: 1339 GLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAE 1398
            GL F+K+++KTIEAYTD+DWAGSV DR+STSGYCT++WGNLVTWRSKKQSV ARSSAEAE
Sbjct: 1355 GLFFKKSEQKTIEAYTDADWAGSVTDRRSTSGYCTYIWGNLVTWRSKKQSVXARSSAEAE 1414

Query: 1399 YRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHF 1424
            YRAM+ G+CE +WL+K+L +L +  E P+KL+CDNKAAISIA+NPVQHDRTKHVEIDRHF
Sbjct: 1415 YRAMAHGVCEILWLKKILEELKRPLEMPMKLYCDNKAAISIAHNPVQHDRTKHVEIDRHF 1474

BLAST of CSPI02G16290 vs. NCBI nr
Match: gi|147810393|emb|CAN59964.1| (hypothetical protein VITISV_022757 [Vitis vinifera])

HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 738/1457 (50.65%), Postives = 961/1457 (65.96%), Query Frame = 1

Query: 16   AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDS 75
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 76   ILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE---- 135
            ++ S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++    
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 136  --------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHIL 195
                                  W+   D   Y  I E  R++ F  GLN + D VRG I+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 196  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 255
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 256  CEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 315
            C++CKK  H KE CWKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 316  DLSLATLGAIVQSGIPHSFGLVSI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 375
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 376  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 435
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 436  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 495
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 496  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 555
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 556  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 615
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 616  ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 675
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 676  PSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 735
            P+Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 736  FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 795
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 796  SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 855
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 856  APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA 915
             P+Q   P          +   +N+     R     LE+  +   G  I+          
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 916  EQGHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTII 975
             +   G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 976  PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1035
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1036 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1095
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1096 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1155
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1156 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1215
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1216 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1275
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1276 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1335
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1345

Query: 1336 DWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL 1395
            DWAG+++DR+STSGYC+FVWGNLVT RSKKQSVVARSSAEAEYRA++ GICE IW+++VL
Sbjct: 1346 DWAGNIIDRRSTSGYCSFVWGNLVTXRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVL 1405

Query: 1396 TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSS 1423
            ++L Q   +P+ + CDN+AAISIA NPV HD TKHVEIDRHFI EK+ S ++ + Y+P+ 
Sbjct: 1406 SELGQTSSSPILMMCDNQAAISIAKNPVHHDXTKHVEIDRHFITEKVTSETVKLNYVPTK 1465

BLAST of CSPI02G16290 vs. NCBI nr
Match: gi|147778986|emb|CAN62538.1| (hypothetical protein VITISV_031159 [Vitis vinifera])

HSP 1 Score: 1406.0 bits (3638), Expect = 0.0e+00
Identity = 735/1455 (50.52%), Postives = 959/1455 (65.91%), Query Frame = 1

Query: 18   SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 77
            SS   ++G KLNG+NY  WSQSV + + G+ K  ++TGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYJTGEAXMPETTEPXFRKWKIENSMI 87

Query: 78   RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE------ 137
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++      
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 138  ------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQ 197
                                W+   D   Y +I E  R++ F  GLN + D VRG I+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 198  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 257
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 258  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 317
            + KK  H KE CWKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 318  SLATLGAIVQSGIPHSFGLV-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 377
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 378  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 437
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 438  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 497
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 498  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 557
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 558  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 617
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 618  RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 677
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 678  YLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFG 737
            Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 738  PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 797
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 798  SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 857
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 858  VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQ 917
            +Q   P          +   +N+     R     LE+  +   G  I+           +
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 918  GHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPK 977
               G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 978  DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1037
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1038 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1097
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1098 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1157
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1158 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1217
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1218 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1277
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1278 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1337
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1347

Query: 1338 AGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTD 1397
            AG+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEY A++ GICE  W+++VL++
Sbjct: 1348 AGNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYXALAQGICEGXWIKRVLSE 1407

Query: 1398 LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQ 1423
            L Q   +P+ + CDN+A ISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q
Sbjct: 1408 LGQTSSSPILMMCDNQAXISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQ 1466

BLAST of CSPI02G16290 vs. NCBI nr
Match: gi|147769406|emb|CAN70229.1| (hypothetical protein VITISV_024789 [Vitis vinifera])

HSP 1 Score: 1399.0 bits (3620), Expect = 0.0e+00
Identity = 728/1454 (50.07%), Postives = 950/1454 (65.34%), Query Frame = 1

Query: 18   SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 77
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 78   RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE------ 137
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++      
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 138  ------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQ 197
                                W+   D   Y +I E  R++ F  GLN + D VRG I+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIMEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 198  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 257
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 258  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 317
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 318  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 377
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTTNRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 378  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 437
            DGS         I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGS---------IKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 438  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 497
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELRSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 498  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 557
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R+ +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRIVYPQIPYKPSTVFSLVHSDV 567

Query: 558  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 617
            WGPS+I   SG RWFVTF+DDHT +TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTWVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 618  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 677
            SDN +E+   +LS +L +  I+H +SC  TPQQN VAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHDIIHISSCVDTPQQNRVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 678  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 737
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA++H +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFIHVYPQ 747

Query: 738  NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 797
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 798  EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 857
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 858  QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 917
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYLRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 918  HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 977
              G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 978  IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1037
            I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYK DG+++R KA
Sbjct: 988  IQEASKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKTDGSVERFKA 1047

Query: 1038 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1097
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1098 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1157
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVKK 1167

Query: 1158 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1217
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYF+GMEVA+S++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLSEEFEVKDLGNLKYFJGMEVAKSRKGI 1227

Query: 1218 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1277
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1278 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1337
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1338 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1397
            G+++DR STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW++ VL++L
Sbjct: 1348 GNIIDRWSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKXVLSEL 1407

Query: 1398 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1423
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQXSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1457

BLAST of CSPI02G16290 vs. NCBI nr
Match: gi|147860087|emb|CAN82928.1| (hypothetical protein VITISV_025045 [Vitis vinifera])

HSP 1 Score: 1383.2 bits (3579), Expect = 0.0e+00
Identity = 729/1454 (50.14%), Postives = 945/1454 (64.99%), Query Frame = 1

Query: 18   SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPQPGDPHERYWKAEDSIL 77
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P+  +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 78   RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKE------ 137
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ ++      
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 138  ------------------LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGHILGQ 197
                                W+   D   Y +I E  R++ F  GLN + D VRG I+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 198  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 257
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 258  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 317
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 318  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 377
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 378  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 437
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 438  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 497
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 498  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 557
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 567

Query: 558  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 617
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 618  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 677
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 678  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 737
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 747

Query: 738  NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 797
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 798  EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 857
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 858  QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 917
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 918  HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 977
              G+      D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 978  IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1037
            I  ALK  EWK AV +E+ ALEKN TW I  LP G + +            D + D  KA
Sbjct: 988  IQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPM------------DQSKD-FKA 1047

Query: 1038 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1097
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1098 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1157
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG  DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQXDHTLFVKK 1167

Query: 1158 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1217
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLLEEFEVKDLGNLKYFLGMEVARSRKGI 1227

Query: 1218 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1277
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSKKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1278 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1337
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1338 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1397
            G+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW+++VL++L
Sbjct: 1348 GNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVLSEL 1407

Query: 1398 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1423
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQTSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1453

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.0e-15733.33Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME2.7e-15031.85Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
M810_ARATH1.4e-4540.63Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YCH4_YEAST5.6e-3933.01Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
YH41B_YEAST8.1e-3828.32Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A5AYJ3_VITVI0.0e+0053.05Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1[more]
A5B7Z8_VITVI0.0e+0050.65Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1[more]
A5AJR0_VITVI0.0e+0050.52Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1[more]
A5BJ12_VITVI0.0e+0050.07Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024789 PE=4 SV=1[more]
A5B7A7_VITVI0.0e+0050.14Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025045 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.11.3e-13445.82 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.17.8e-4740.63ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
ATMG00820.16.0e-2353.06ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
ATMG00240.13.3e-1341.46ATMG00240.1 Gag-Pol-related retrotransposon family protein[more]
AT1G21280.16.0e-0723.08 Retrotransposon gag protein (InterPro:IPR005162)[more]
Match NameE-valueIdentityDescription
gi|147819777|emb|CAN76196.1|0.0e+0053.05hypothetical protein VITISV_041073 [Vitis vinifera][more]
gi|147810393|emb|CAN59964.1|0.0e+0050.65hypothetical protein VITISV_022757 [Vitis vinifera][more]
gi|147778986|emb|CAN62538.1|0.0e+0050.52hypothetical protein VITISV_031159 [Vitis vinifera][more]
gi|147769406|emb|CAN70229.1|0.0e+0050.07hypothetical protein VITISV_024789 [Vitis vinifera][more]
gi|147860087|emb|CAN82928.1|0.0e+0050.14hypothetical protein VITISV_025045 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR013103RVT_2
IPR025724GAG-pre-integrase_dom
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0090304 nucleic acid metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G16290.1CSPI02G16290.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 505..620
score: 6.6
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 503..669
score: 22
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 499..661
score: 9.6
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 502..663
score: 2.53
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 953..1196
score: 9.4
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 421..492
score: 9.4
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 451..662
score: 0.0coord: 802..828
score: 0.0coord: 309..425
score: 0.0coord: 679..782
score: 0.0coord: 888..1355
score: 0.0coord: 27..278
score:
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 952..1175
score: 4.67E-42coord: 1205..1383
score: 4.67

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None