CSPI01G15910 (gene) Wild cucumber (PI 183967)

NameCSPI01G15910
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr1 : 11422784 .. 11428033 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGGGACAGCCATCAATTTTAATGTTGCTGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGCCGTCCGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTGATAATGAAGCGGAACATGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACAATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCACTGAGGCTGAATACAAAGCTTTGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTTTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGAGGGGGAGTGTTG

mRNA sequence

ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGGGACAGCCATCAATTTTAATGTTGCTGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGCCGTCCGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTGATAATGAAGCGGAACATGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACAATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCACTGAGGCTGAATACAAAGCTTTGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTTTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA

Coding sequence (CDS)

ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGGGACAGCCATCAATTTTAATGTTGCTGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGCCGTCCGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACACACCGTCACTGGACCACGACGCGCCGTCACCGGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTGATAATGAAGCGGAACATGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACAATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCACTGAGGCTGAATACAAAGCTTTGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTTTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA
BLAST of CSPI01G15910 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 562.8 bits (1449), Expect = 1.3e-158
Identity = 380/1141 (33.30%), Postives = 589/1141 (51.62%), Query Frame = 1

Query: 620  KNPWILDSGATDHLTGSSEHFVSYIPCAGN-ETIRIADGSLAPIAGKG----KISPCAGL 679
            ++ W++D+ A+ H T   + F  Y+  AG+  T+++ + S + IAG G    K +    L
Sbjct: 291  ESEWVVDTAASHHATPVRDLFCRYV--AGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTL 350

Query: 680  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRG-LYLL 739
             L +V HVP L  NL+S   +  +   ++ F         L+ G ++     +RG LY  
Sbjct: 351  VLKDVRHVPDLRMNLISGIALDRD-GYESYFANQKWR---LTKGSLVIAKGVARGTLYRT 410

Query: 740  DDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH-LFSKVEMTTLS- 799
            + +     +             E    LWH R+GH + + ++ L    L S  + TT+  
Sbjct: 411  NAEICQGELNAAQ--------DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKP 470

Query: 800  CDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWV 859
            CD C+  KQHRVSF +   +      LV+SDV GP +I +  G ++FVTFIDD +R  WV
Sbjct: 471  CDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWV 530

Query: 860  YLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCA 919
            Y++  K +V  +FQ F+  +E +  +K+  LRSDNG E+ +    E+ +S GI H+ +  
Sbjct: 531  YILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVP 590

Query: 920  YTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLD 979
             TPQ NGVAER NR ++E  RS++    LP   WG+A+ TA +LINR PS  L  + P  
Sbjct: 591  GTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIP-- 650

Query: 980  CLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHP 1039
               E   + + VS   L+VFGC A+ H     +TK   ++  C+F+GY   + GY+ + P
Sbjct: 651  ---ERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDP 710

Query: 1040 PSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPT 1099
              +K   + DV F E                 E     +  E    V + IIP+ + +P 
Sbjct: 711  VKKKVIRSRDVVFRE----------------SEVRTAADMSE---KVKNGIIPNFVTIP- 770

Query: 1100 NQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVA 1159
                              S ++ P +    ++   +QG E P E     +I + ++ +  
Sbjct: 771  ------------------STSNNPTSAESTTDEVSEQG-EQPGE-----VIEQGEQLDEG 830

Query: 1160 VLENVEEKDSGDEIEVRIETRDNEAEHGHTGKSDEYDSSLD--IPIALRKGTRSCTKHPI 1219
            V E VE    G+E    +   +          S EY    D   P +L++       HP 
Sbjct: 831  V-EEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKE----VLSHPE 890

Query: 1220 CNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTL 1279
             N +         +A    ++S +     Y  ++ P+ K  +                  
Sbjct: 891  KNQL--------MKAMQEEMES-LQKNGTYKLVELPKGKRPLK----------------- 950

Query: 1280 PKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLL 1339
                    CKWVF LK   D  L R+KARLV KGF Q  GID+ E FSPV K+ +IR +L
Sbjct: 951  --------CKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTIL 1010

Query: 1340 SVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPR 1399
            S+A + D  + QLDVK AFL+GDL EE+YM  P GFE    +H VCKL KS+YGLKQ+PR
Sbjct: 1011 SLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPR 1070

Query: 1400 AWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQR 1459
             W+ +F +F+KSQ Y + +SD  ++ K        +L++YVDD+++ G D+  I++LK  
Sbjct: 1071 QWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGD 1130

Query: 1460 MGDEFEIKDLGNLKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFN 1519
            +   F++KDLG  +  LGM++ R +    + +SQ KYI  +L    M   +P  TP+  +
Sbjct: 1131 LSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGH 1190

Query: 1520 CKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVSQFMQTPNEEHM 1579
             KL         +++  + K  Y   VG L+Y +  TRPDI+ AV VVS+F++ P +EH 
Sbjct: 1191 LKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHW 1250

Query: 1580 KAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTW 1639
            +AV  ILRYL+ T G  L F  +D   ++ YTD+D AG + +RKS++GY     G  ++W
Sbjct: 1251 EAVKWILRYLRGTTGDCLCFGGSD-PILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISW 1310

Query: 1640 RSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTD--LHQECETPLKLFCDNKAAISIA 1699
            +SK Q  VA S+TEAEY A +    E IWL++ L +  LHQ+      ++CD+++AI ++
Sbjct: 1311 QSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQK---EYVVYCDSQSAIDLS 1325

Query: 1700 NNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLG 1739
             N + H RTKH+++  H+I+E +D  S+ +  I +++  AD+LTK + R  F+ C   +G
Sbjct: 1371 KNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFELCKELVG 1325

BLAST of CSPI01G15910 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 543.9 bits (1400), Expect = 6.5e-153
Identity = 374/1156 (32.35%), Postives = 584/1156 (50.52%), Query Frame = 1

Query: 623  WILDSGATDHLTGSSEHFVSYIPCAGNETIRIA-DGSLAPIAGKG--KISPCAGLSLHNV 682
            ++LDSGA+DHL      +   +       I +A  G       +G  ++     ++L +V
Sbjct: 289  FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDV 348

Query: 683  LHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSS 742
            L   + + NL+S+ ++              +S +   SG  I       GL ++ +    
Sbjct: 349  LFCKEAAGNLMSVKRLQEA----------GMSIEFDKSGVTIS----KNGLMVVKNSGML 408

Query: 743  SSIPRTSLLS-SYFTTSEQDCMLWHFRLGHPNFQYM-----KHLFPH--LFSKVEMTTLS 802
            +++P  +  + S     + +  LWH R GH +   +     K++F    L + +E++   
Sbjct: 409  NNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEI 468

Query: 803  CDVCIQAKQHRVSFPSQPYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLT 862
            C+ C+  KQ R+ F     K    +P  +VHSDV GP    T   K +FV F+D  T   
Sbjct: 469  CEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYC 528

Query: 863  WVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNS 922
              YLI  KS+V SMFQ+F    E  F+ K+  L  DNGRE+ ++ + +F   KGI +  +
Sbjct: 529  VTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLT 588

Query: 923  CAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRIL--HLQ 982
              +TPQ NGV+ER  R + E AR+++    L    WG+A+LTA +LINR+PSR L    +
Sbjct: 589  VPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSK 648

Query: 983  TPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYK 1042
            TP +      P  +H     LRVFG T YVH     Q KF  ++   +FVGY P+  G+K
Sbjct: 649  TPYEMWHNKKPYLKH-----LRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEPN--GFK 708

Query: 1043 CFHPPSRKYFVTMDVTFCEDRPY------FPVSHLQGESVSEESNNTFEFIEPTPSVVSN 1102
             +   + K+ V  DV   E          F    L+    SE  N    F   +  ++  
Sbjct: 709  LWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKN----FPNDSRKIIQT 768

Query: 1103 IIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTK-N 1162
              P+      N       + ++ K+            +  +E P      N ++ C    
Sbjct: 769  EFPNESKECDN-----IQFLKDSKESENKNFPNDSRKIIQTEFP------NESKECDNIQ 828

Query: 1163 MISENDRSNVAVLENVEEKDSGDEIEVR------IETRDNE-AEHGHTGKSDEYDSSLDI 1222
             + ++  SN   L   +++   D +          E+R++E AEH      D    +  I
Sbjct: 829  FLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGI 888

Query: 1223 PIALRKGTRSCTKHPICNYVSYNSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVM 1282
             I  R+  R  TK  I      NSL+     A T   D      +I        W+ A+ 
Sbjct: 889  EIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAIN 948

Query: 1283 EEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDY 1342
             E+ A + N+TW I   P+    V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY
Sbjct: 949  TELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDY 1008

Query: 1343 SETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH 1402
             ETF+PVA++++ R +LS+ +  +  ++Q+DVK AFLNG L EE+YM  P G       +
Sbjct: 1009 EETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCN-SDN 1068

Query: 1403 VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYV 1462
            VCKL K+IYGLKQ+ R WF+ F   +K   +     D  ++  +   G I     +++YV
Sbjct: 1069 VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIY--ILDKGNINENIYVLLYV 1128

Query: 1463 DDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLT 1522
            DD+V+   D   ++  K+ + ++F + DL  +K+F+G+ +   ++ I +SQ  Y+  +L+
Sbjct: 1129 DDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILS 1188

Query: 1523 ETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSV 1582
            +  M  C    TP+  + N +L NSD+         + L+G L+Y+   TRPD++ AV++
Sbjct: 1189 KFNMENCNAVSTPLPSKINYELLNSDEDC---NTPCRSLIGCLMYIMLCTRPDLTTAVNI 1248

Query: 1583 VSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKS 1642
            +S++    N E  + + R+LRYLK T    L+F+K       I  Y DSDWAGS +DRKS
Sbjct: 1249 LSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKS 1308

Query: 1643 TSGYCTFVWG-NLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETP 1702
            T+GY   ++  NL+ W +K+Q+ VA SSTEAEY AL   + E +WL+ +LT ++ + E P
Sbjct: 1309 TTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENP 1368

Query: 1703 LKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKG 1740
            +K++ DN+  ISIANNP  H R KH++I  HF +E++ +  IC+ YIP+  Q+AD+ TK 
Sbjct: 1369 IKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKP 1401

BLAST of CSPI01G15910 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 5.3e-46
Identity = 92/224 (41.07%), Postives = 136/224 (60.71%), Query Frame = 1

Query: 1425 LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1484
            L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1485 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1544
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1545 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1604
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1605 TSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIW 1649
            T+G+CTF+  N+++W +K+Q  V+RSSTE EY+AL+L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI01G15910 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 167.5 bits (423), Expect = 1.3e-39
Identity = 103/309 (33.33%), Postives = 170/309 (55.02%), Query Frame = 1

Query: 1342 LDVKNAFLNGDLVEEVYMSPPPGFEAQFG-QHVCKLQKSIYGLKQSPRAWFDRFTTFVKS 1401
            +DV  AFLN  + E +Y+  PPGF  +    +V +L   +YGLKQ+P  W +     +K 
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1402 QGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGN 1461
             G+ +   +H L+ + +  G I +  VYVDD+++         ++KQ +   + +KDLG 
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIA-VYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 120

Query: 1462 LKYFLGMEVARSKEG-ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVD 1521
            +  FLG+ + +S  G I++S + YI    +E+ +   + T TP+  +  L  +      D
Sbjct: 121  VDKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 180

Query: 1522 KEQYQRLVGKLIYLSHT-RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLM 1581
               YQ +VG+L++ ++T RPDIS+ VS++S+F++ P   H+++  R+LRYL +T    L 
Sbjct: 181  ITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLK 240

Query: 1582 FRKTDRKTIEAYTDSDWAGSVVD-RKSTSGYCTFVWGNLVTWRSKK-QSVVARSSTEAEY 1641
            +R   +  +  Y D+   G++ D   ST GY T + G  VTW SKK + V+   STEAEY
Sbjct: 241  YRSGSQLALTVYCDAS-HGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEY 300

Query: 1642 KALSLGICE 1646
               S  + E
Sbjct: 301  ITASETVME 307

BLAST of CSPI01G15910 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 2.4e-38
Identity = 131/459 (28.54%), Postives = 216/459 (47.06%), Query Frame = 1

Query: 1295 HKARLVAKGFTQ---TYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNG 1354
            +KAR+V +G TQ   TY +  +E+ +     N I++ L +A N++  +  LD+ +AFL  
Sbjct: 1336 YKARIVCRGDTQSPDTYSVITTESLNH----NHIKIFLMIANNRNMFMKTLDINHAFLYA 1395

Query: 1355 DLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHT 1414
             L EE+Y+  P          V KL K++YGLKQSP+ W D    ++   G +       
Sbjct: 1396 KLEEEIYIPHPHDRRC-----VVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPG 1455

Query: 1415 LFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL------KYFL 1474
            L+    K   IA   VYVDD V+   ++  + +   ++   FE+K  G L         L
Sbjct: 1456 LYQTEDKNLMIA---VYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDIL 1515

Query: 1475 GMEVARSKE--GISVSQRKYI--LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKE 1534
            GM++  +K    I ++ + +I  +D      +   R +  P     K+    D + + +E
Sbjct: 1516 GMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEE 1575

Query: 1535 QY-------QRLVGKLIYLSH-TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTP 1594
            ++       Q+L+G+L Y+ H  R DI+FAV  V++ +  P+E     + +I++YL    
Sbjct: 1576 EFRQGVLKLQQLLGELNYVRHKCRYDINFAVKKVARLVNYPHERVFYMIYKIIQYLVRYK 1635

Query: 1595 GKGLMFRK---TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARS 1654
              G+ + +    D+K I A TD+   GS  D +S  G   +   N+    S K +    S
Sbjct: 1636 DIGIHYDRDCNKDKKVI-AITDAS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVS 1695

Query: 1655 STEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVE 1714
            STEAE  A+  G  +   L+  L +L +     + +  D+K AI   N   Q  + K   
Sbjct: 1696 STEAELHAIYEGYADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTW 1755

Query: 1715 IDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNF 1730
            I    IKEK+   SI +  I     +AD+LTK +   +F
Sbjct: 1756 IKTEIIKEKIKEKSIKLLKITGKGNIADLLTKPVSASDF 1780

BLAST of CSPI01G15910 vs. TrEMBL
Match: A5AYJ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1)

HSP 1 Score: 1583.5 bits (4099), Expect = 0.0e+00
Identity = 801/1490 (53.76%), Postives = 1034/1490 (69.40%), Query Frame = 1

Query: 303  SMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILR 362
            S + L+  KLNG NY  W+QSVK+ ++GR K   L GE+ +P+  DP+ + W+  + +  
Sbjct: 35   SSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFRELVA- 94

Query: 363  SILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTS 422
                      IGKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ + +QG  +VT+
Sbjct: 95   ----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDREVTT 154

Query: 423  FFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR 482
            ++N++  +WQE+DLC E  W  P D V++ + EENDR+Y FLA LN   D VRGRILG++
Sbjct: 155  YYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRILGRK 214

Query: 483  PIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIPVCE 542
            P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P C+
Sbjct: 215  PLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---PWCD 274

Query: 543  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEP--PQQSDPHKNQTDLS 602
            HCKK WHTK  CWK+HG+P   KK+  +D    GRA+ + SA+   PQ +    N T   
Sbjct: 275  HCKKPWHTKGTCWKIHGKPQNFKKKNGSD----GRAFQTMSADSQGPQINSEKPNFTKEQ 334

Query: 603  LATLGAIVQSG---------------IPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFV 662
            L+ L  + QS                +  +   +  +   PWI+DSGATDH+TGSS+ F 
Sbjct: 335  LSHLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQIFS 394

Query: 663  SYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNC 722
            SY PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +  C
Sbjct: 395  SYKPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQDHQC 454

Query: 723  KAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCML 782
            +A F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D +L
Sbjct: 455  QANFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDDIIL 514

Query: 783  WHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHS 842
            WH+RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L+HS
Sbjct: 515  WHYRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSLIHS 574

Query: 843  DVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAI 902
            DVWGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  KI +
Sbjct: 575  DVWGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTKIQV 634

Query: 903  LRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLP 962
             RSDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T +P
Sbjct: 635  FRSDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTTKVP 694

Query: 963  SYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG 1022
             YLWG+AILTA +LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH   
Sbjct: 695  KYLWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVHIHD 754

Query: 1023 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1082
             N+ K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQGES 
Sbjct: 755  HNRGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQGEST 814

Query: 1083 SEESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNHKKE 1142
            SE+S+          N    +EP+ S   V  NI    +    + + + KT      K  
Sbjct: 815  SEDSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGKKNG 874

Query: 1143 VGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGDEIE 1202
            V +  S   +    S         N     TKN  +    R      E+  +   G E E
Sbjct: 875  VLNIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGHESE 934

Query: 1203 VRIETRDNEAEHGH-----------TGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSY 1262
            +R E   +E    +           +  + E    L+IPIA RKG RSCTKHP+ NY+SY
Sbjct: 935  LREEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNYMSY 994

Query: 1263 NSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT 1322
             +LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG  T
Sbjct: 995  KNLSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKGKTT 1054

Query: 1323 VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNK 1382
            VGCKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A N 
Sbjct: 1055 VGCKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIAANL 1114

Query: 1383 DWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFT 1442
            DWPL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+RFT
Sbjct: 1115 DWPLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFERFT 1174

Query: 1443 TFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEI 1502
             FVK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  EFEI
Sbjct: 1175 QFVKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALEFEI 1234

Query: 1503 KDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQ 1562
            KDLG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG+++D 
Sbjct: 1235 KDLGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDTNDG 1294

Query: 1563 VPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGK 1622
              V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKSTPGK
Sbjct: 1295 NLVNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKSTPGK 1354

Query: 1623 GLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAE 1682
            GL F+K+++KTIEAYTD+DWAGSV DR+STSGYCT++WGNLVTWRSKKQSV ARSS EAE
Sbjct: 1355 GLFFKKSEQKTIEAYTDADWAGSVTDRRSTSGYCTYIWGNLVTWRSKKQSVXARSSAEAE 1414

Query: 1683 YKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHF 1742
            Y+A++ G+CE +WL+K+L +L +  E P+KL+CDNKAAISIA+NPVQHDRTKHVEIDRHF
Sbjct: 1415 YRAMAHGVCEILWLKKILEELKRPLEMPMKLYCDNKAAISIAHNPVQHDRTKHVEIDRHF 1474

Query: 1743 IKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1746
            IKEKL++  IC+P++P++QQ+AD+LTKGL R +F+F +SKLG+IDIY PT
Sbjct: 1475 IKEKLEASIICMPFVPTTQQIADILTKGLFRSSFEFLISKLGMIDIYAPT 1505

BLAST of CSPI01G15910 vs. TrEMBL
Match: A5B7Z8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1)

HSP 1 Score: 1457.2 bits (3771), Expect = 0.0e+00
Identity = 755/1469 (51.40%), Postives = 980/1469 (66.71%), Query Frame = 1

Query: 300  AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDS 359
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 360  ILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 419
            ++ S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 420  VTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 479
            VT ++N L+  WQ++DL     W+   D   Y  I E  R++ F  GLN + D VRGRI+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 480  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 539
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 540  CEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 599
            C++CKK  H KE CWKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 600  DLSLATLGAIVQSGIPHSFGLVSI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 659
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 660  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 719
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 720  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 779
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 780  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 839
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 840  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 899
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 900  ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 959
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 960  PSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 1019
            P+Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 1020 FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 1079
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 1080 SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 1139
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 1140 APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEA 1199
             P+Q   P          +   +N+     R     LE+  +   G  I+      +   
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 1200 EHGHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTII 1259
                 G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 1260 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1319
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1320 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1379
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1380 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1439
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1440 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1499
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1500 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1559
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1560 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1619
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1345

Query: 1620 DWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVL 1679
            DWAG+++DR+STSGYC+FVWGNLVT RSKKQSVVARSS EAEY+AL+ GICE IW+++VL
Sbjct: 1346 DWAGNIIDRRSTSGYCSFVWGNLVTXRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVL 1405

Query: 1680 TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSS 1739
            ++L Q   +P+ + CDN+AAISIA NPV HD TKHVEIDRHFI EK+ S ++ + Y+P+ 
Sbjct: 1406 SELGQTSSSPILMMCDNQAAISIAKNPVHHDXTKHVEIDRHFITEKVTSETVKLNYVPTK 1465

Query: 1740 QQVADVLTKGLLRPNFDFCVSKLGLIDIY 1743
             Q AD+LTK L RPNF+    KLGL DIY
Sbjct: 1466 HQTADILTKALPRPNFEDLTCKLGLYDIY 1478

BLAST of CSPI01G15910 vs. TrEMBL
Match: A5AJR0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1)

HSP 1 Score: 1454.1 bits (3763), Expect = 0.0e+00
Identity = 754/1469 (51.33%), Postives = 978/1469 (66.58%), Query Frame = 1

Query: 302  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 361
            SS   ++G KLNG+NY  WSQSV + + G+ K  + TGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYXTGEAXMPETTEPXFRKWKIENSMI 87

Query: 362  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 421
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 422  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 481
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 482  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 541
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 542  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 601
            + KK  H KE CWKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 602  SLATLGAIVQSGIPHSFGLV-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 661
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 662  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 721
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 722  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 781
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 782  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 841
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 842  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 901
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 902  RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 961
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 962  YLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFG 1021
            Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 1022 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1081
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 1082 SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 1141
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 1142 VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEAEH 1201
            +Q   P          +   +N+     R     LE+  +   G  I+      +     
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 1202 GHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPK 1261
               G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 1262 DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1321
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1322 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1381
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1382 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1441
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1442 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1501
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1502 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1561
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1562 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1621
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1347

Query: 1622 AGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTD 1681
            AG+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSS EAEY AL+ GICE  W+++VL++
Sbjct: 1348 AGNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYXALAQGICEGXWIKRVLSE 1407

Query: 1682 LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQ 1741
            L Q   +P+ + CDN+A ISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q
Sbjct: 1408 LGQTSSSPILMMCDNQAXISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQ 1467

Query: 1742 VADVLTKGLLRPNFDFCVSKLGLIDIYVP 1745
             AD+LTK L RPNF+    KLGL DIY P
Sbjct: 1468 TADILTKALPRPNFEDLTCKLGLYDIYSP 1480

BLAST of CSPI01G15910 vs. TrEMBL
Match: A5BJ12_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024789 PE=4 SV=1)

HSP 1 Score: 1446.0 bits (3742), Expect = 0.0e+00
Identity = 746/1468 (50.82%), Postives = 969/1468 (66.01%), Query Frame = 1

Query: 302  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 361
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 362  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 421
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 422  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 481
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIMEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 482  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 541
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 542  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 601
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 602  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 661
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTTNRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 662  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 721
            DGS         I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGS---------IKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 722  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 781
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELRSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 782  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 841
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R+ +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRIVYPQIPYKPSTVFSLVHSDV 567

Query: 842  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 901
            WGPS+I   SG RWFVTF+DDHT +TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTWVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 902  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 961
            SDN +E+   +LS +L +  I+H +SC  TPQQN VAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHDIIHISSCVDTPQQNRVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 962  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1021
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA++H +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFIHVYPQ 747

Query: 1022 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1081
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1082 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1141
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1142 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEAEHG 1201
            Q   P          +   +N+     R     LE+  +   G  I+      +      
Sbjct: 868  QPIAP----------QLANENLQVYLRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1202 HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKD 1261
              G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1262 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1321
            I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYK DG+++R KA
Sbjct: 988  IQEASKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKTDGSVERFKA 1047

Query: 1322 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1381
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1382 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1441
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVKK 1167

Query: 1442 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1501
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYF GMEVA+S++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLSEEFEVKDLGNLKYFXGMEVAKSRKGI 1227

Query: 1502 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1561
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1562 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1621
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1622 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDL 1681
            G+++DR STSGYC+FVWGNLVTWRSKKQSVVARSS EAEY+AL+ GICE IW++ VL++L
Sbjct: 1348 GNIIDRWSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKXVLSEL 1407

Query: 1682 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1741
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQXSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1467

Query: 1742 ADVLTKGLLRPNFDFCVSKLGLIDIYVP 1745
            AD+LTK L RPNF+    KLGL DIY P
Sbjct: 1468 ADILTKALPRPNFEDLTCKLGLYDIYSP 1471

BLAST of CSPI01G15910 vs. TrEMBL
Match: A5B7A7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025045 PE=4 SV=1)

HSP 1 Score: 1431.4 bits (3704), Expect = 0.0e+00
Identity = 747/1468 (50.89%), Postives = 965/1468 (65.74%), Query Frame = 1

Query: 302  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 361
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 362  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 421
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 422  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 481
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 482  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 541
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 542  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 601
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 602  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 661
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 662  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 721
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 722  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 781
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 782  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 841
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 567

Query: 842  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 901
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 902  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 961
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 962  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1021
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 747

Query: 1022 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1081
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1082 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1141
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1142 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEAEHG 1201
            Q   P          +   +N+     R     LE+  +   G  I+      +      
Sbjct: 868  QPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1202 HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKD 1261
              G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1262 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1321
            I  ALK  EWK AV +E+ ALEKN TW I  LP G + +            D + D  KA
Sbjct: 988  IQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPM------------DQSKD-FKA 1047

Query: 1322 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1381
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1382 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1441
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG  DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQXDHTLFVKK 1167

Query: 1442 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1501
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLLEEFEVKDLGNLKYFLGMEVARSRKGI 1227

Query: 1502 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1561
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSKKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1562 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1621
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1622 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDL 1681
            G+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSS EAEY+AL+ GICE IW+++VL++L
Sbjct: 1348 GNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVLSEL 1407

Query: 1682 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1741
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQTSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1467

Query: 1742 ADVLTKGLLRPNFDFCVSKLGLIDIYVP 1745
            AD+LTK L RPNF+    KLGL DIY P
Sbjct: 1468 ADILTKALPRPNFEDLTCKLGLYDIYSP 1467

BLAST of CSPI01G15910 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 475.7 bits (1223), Expect = 1.2e-133
Identity = 229/502 (45.62%), Postives = 333/502 (66.33%), Query Frame = 1

Query: 1203 SCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS 1262
            S T H I  ++SY  +SP + +F   +     P     A ++  W  A+ +E+ A+E   
Sbjct: 54   SLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTH 113

Query: 1263 TWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL 1322
            TW+ICTLP   K +GCKWV+ +KY +DGT++R+KARLVAKG+TQ  GID+ ETFSPV KL
Sbjct: 114  TWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKL 173

Query: 1323 NTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-----VCKLQ 1382
             +++++L+++   ++ L+QLD+ NAFLNGDL EE+YM  PPG+ A+ G       VC L+
Sbjct: 174  TSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLK 233

Query: 1383 KSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGD 1442
            KSIYGLKQ+ R WF +F+  +   G+ Q HSDHT F K++ T  + VL VYVDDI++  +
Sbjct: 234  KSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL-VYVDDIIICSN 293

Query: 1443 DQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCR 1502
            + A + +LK ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+
Sbjct: 294  NDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 353

Query: 1503 PTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEE 1562
            P+  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + P   
Sbjct: 354  PSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLA 413

Query: 1563 HMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLV 1622
            H +AV +IL Y+K T G+GL +       ++ ++D+ +      R+ST+GYC F+  +L+
Sbjct: 414  HQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLI 473

Query: 1623 TWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIA 1682
            +W+SKKQ VV++SS EAEY+ALS    E +WL +   +L      P  LFCDN AAI IA
Sbjct: 474  SWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIA 533

Query: 1683 NNPVQHDRTKHVEIDRHFIKEK 1700
             N V H+RTKH+E D H ++E+
Sbjct: 534  TNAVFHERTKHIESDCHSVRER 554

BLAST of CSPI01G15910 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 188.7 bits (478), Expect = 3.0e-47
Identity = 92/224 (41.07%), Postives = 136/224 (60.71%), Query Frame = 1

Query: 1425 LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1484
            L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1485 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1544
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1545 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1604
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1605 TSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIW 1649
            T+G+CTF+  N+++W +K+Q  V+RSSTE EY+AL+L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI01G15910 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 110.2 bits (274), Expect = 1.3e-23
Identity = 56/117 (47.86%), Postives = 73/117 (62.39%), Query Frame = 1

Query: 1216 NSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT 1275
            N L+P++ + T +      PK +  ALK P W  A+ EE+ AL +N TW +   P     
Sbjct: 10   NKLNPKY-SLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNI 69

Query: 1276 VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVA 1333
            +GCKWVF  K  +DGTLDR KARLVAKGF Q  GI + ET+SPV +  TIR +L+VA
Sbjct: 70   LGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI01G15910 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 75.5 bits (184), Expect = 3.7e-13
Identity = 34/82 (41.46%), Postives = 53/82 (64.63%), Query Frame = 1

Query: 1531 IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAY 1590
            +YL+ TRPD++FAV+ +SQF        M+AV ++L Y+K T G+GL +  T    ++A+
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1591 TDSDWAGSVVDRKSTSGYCTFV 1613
             DSDWA     R+S +G+C+ V
Sbjct: 61   ADSDWASCPDTRRSVTGFCSLV 82

BLAST of CSPI01G15910 vs. NCBI nr
Match: gi|147819777|emb|CAN76196.1| (hypothetical protein VITISV_041073 [Vitis vinifera])

HSP 1 Score: 1583.5 bits (4099), Expect = 0.0e+00
Identity = 801/1490 (53.76%), Postives = 1034/1490 (69.40%), Query Frame = 1

Query: 303  SMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILR 362
            S + L+  KLNG NY  W+QSVK+ ++GR K   L GE+ +P+  DP+ + W+  + +  
Sbjct: 35   SSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFRELVA- 94

Query: 363  SILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTS 422
                      IGKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ + +QG  +VT+
Sbjct: 95   ----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDREVTT 154

Query: 423  FFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR 482
            ++N++  +WQE+DLC E  W  P D V++ + EENDR+Y FLA LN   D VRGRILG++
Sbjct: 155  YYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRILGRK 214

Query: 483  PIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIPVCE 542
            P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P C+
Sbjct: 215  PLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---PWCD 274

Query: 543  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEP--PQQSDPHKNQTDLS 602
            HCKK WHTK  CWK+HG+P   KK+  +D    GRA+ + SA+   PQ +    N T   
Sbjct: 275  HCKKPWHTKGTCWKIHGKPQNFKKKNGSD----GRAFQTMSADSQGPQINSEKPNFTKEQ 334

Query: 603  LATLGAIVQSG---------------IPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFV 662
            L+ L  + QS                +  +   +  +   PWI+DSGATDH+TGSS+ F 
Sbjct: 335  LSHLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQIFS 394

Query: 663  SYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNC 722
            SY PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +  C
Sbjct: 395  SYKPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQDHQC 454

Query: 723  KAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCML 782
            +A F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D +L
Sbjct: 455  QANFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDDIIL 514

Query: 783  WHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHS 842
            WH+RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L+HS
Sbjct: 515  WHYRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSLIHS 574

Query: 843  DVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAI 902
            DVWGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  KI +
Sbjct: 575  DVWGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTKIQV 634

Query: 903  LRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLP 962
             RSDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T +P
Sbjct: 635  FRSDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTTKVP 694

Query: 963  SYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG 1022
             YLWG+AILTA +LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH   
Sbjct: 695  KYLWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVHIHD 754

Query: 1023 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1082
             N+ K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQGES 
Sbjct: 755  HNRGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQGEST 814

Query: 1083 SEESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNHKKE 1142
            SE+S+          N    +EP+ S   V  NI    +    + + + KT      K  
Sbjct: 815  SEDSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGKKNG 874

Query: 1143 VGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGDEIE 1202
            V +  S   +    S         N     TKN  +    R      E+  +   G E E
Sbjct: 875  VLNIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGHESE 934

Query: 1203 VRIETRDNEAEHGH-----------TGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSY 1262
            +R E   +E    +           +  + E    L+IPIA RKG RSCTKHP+ NY+SY
Sbjct: 935  LREEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNYMSY 994

Query: 1263 NSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT 1322
             +LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG  T
Sbjct: 995  KNLSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKGKTT 1054

Query: 1323 VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNK 1382
            VGCKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A N 
Sbjct: 1055 VGCKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIAANL 1114

Query: 1383 DWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFT 1442
            DWPL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+RFT
Sbjct: 1115 DWPLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFERFT 1174

Query: 1443 TFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEI 1502
             FVK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  EFEI
Sbjct: 1175 QFVKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALEFEI 1234

Query: 1503 KDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQ 1562
            KDLG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG+++D 
Sbjct: 1235 KDLGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDTNDG 1294

Query: 1563 VPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGK 1622
              V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKSTPGK
Sbjct: 1295 NLVNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKSTPGK 1354

Query: 1623 GLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAE 1682
            GL F+K+++KTIEAYTD+DWAGSV DR+STSGYCT++WGNLVTWRSKKQSV ARSS EAE
Sbjct: 1355 GLFFKKSEQKTIEAYTDADWAGSVTDRRSTSGYCTYIWGNLVTWRSKKQSVXARSSAEAE 1414

Query: 1683 YKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHF 1742
            Y+A++ G+CE +WL+K+L +L +  E P+KL+CDNKAAISIA+NPVQHDRTKHVEIDRHF
Sbjct: 1415 YRAMAHGVCEILWLKKILEELKRPLEMPMKLYCDNKAAISIAHNPVQHDRTKHVEIDRHF 1474

Query: 1743 IKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1746
            IKEKL++  IC+P++P++QQ+AD+LTKGL R +F+F +SKLG+IDIY PT
Sbjct: 1475 IKEKLEASIICMPFVPTTQQIADILTKGLFRSSFEFLISKLGMIDIYAPT 1505

BLAST of CSPI01G15910 vs. NCBI nr
Match: gi|147810393|emb|CAN59964.1| (hypothetical protein VITISV_022757 [Vitis vinifera])

HSP 1 Score: 1457.2 bits (3771), Expect = 0.0e+00
Identity = 755/1469 (51.40%), Postives = 980/1469 (66.71%), Query Frame = 1

Query: 300  AQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDS 359
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 360  ILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 419
            ++ S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 420  VTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 479
            VT ++N L+  WQ++DL     W+   D   Y  I E  R++ F  GLN + D VRGRI+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 480  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 539
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 540  CEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 599
            C++CKK  H KE CWKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 600  DLSLATLGAIVQSGIPHSFGLVSI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 659
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 660  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 719
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 720  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 779
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 780  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 839
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 840  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 899
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 900  ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 959
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 960  PSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 1019
            P+Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 1020 FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 1079
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 1080 SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 1139
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 1140 APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEA 1199
             P+Q   P          +   +N+     R     LE+  +   G  I+      +   
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 1200 EHGHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTII 1259
                 G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 1260 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1319
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1320 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1379
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1380 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1439
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1440 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1499
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1500 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1559
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1560 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1619
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1345

Query: 1620 DWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVL 1679
            DWAG+++DR+STSGYC+FVWGNLVT RSKKQSVVARSS EAEY+AL+ GICE IW+++VL
Sbjct: 1346 DWAGNIIDRRSTSGYCSFVWGNLVTXRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVL 1405

Query: 1680 TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSS 1739
            ++L Q   +P+ + CDN+AAISIA NPV HD TKHVEIDRHFI EK+ S ++ + Y+P+ 
Sbjct: 1406 SELGQTSSSPILMMCDNQAAISIAKNPVHHDXTKHVEIDRHFITEKVTSETVKLNYVPTK 1465

Query: 1740 QQVADVLTKGLLRPNFDFCVSKLGLIDIY 1743
             Q AD+LTK L RPNF+    KLGL DIY
Sbjct: 1466 HQTADILTKALPRPNFEDLTCKLGLYDIY 1478

BLAST of CSPI01G15910 vs. NCBI nr
Match: gi|147778986|emb|CAN62538.1| (hypothetical protein VITISV_031159 [Vitis vinifera])

HSP 1 Score: 1455.7 bits (3767), Expect = 0.0e+00
Identity = 754/1469 (51.33%), Postives = 979/1469 (66.64%), Query Frame = 1

Query: 302  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 361
            SS   ++G KLNG+NY  WSQSV + + G+ K  ++TGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYJTGEAXMPETTEPXFRKWKIENSMI 87

Query: 362  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 421
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 422  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 481
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 482  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 541
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 542  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 601
            + KK  H KE CWKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 602  SLATLGAIVQSGIPHSFGLV-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 661
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 662  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 721
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 722  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 781
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 782  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 841
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 842  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 901
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 902  RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 961
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 962  YLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFG 1021
            Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 1022 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1081
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 1082 SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 1141
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 1142 VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEAEH 1201
            +Q   P          +   +N+     R     LE+  +   G  I+      +     
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 1202 GHTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPK 1261
               G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 1262 DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1321
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1322 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1381
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1382 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1441
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1442 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1501
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1502 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1561
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1562 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1621
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1347

Query: 1622 AGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTD 1681
            AG+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSS EAEY AL+ GICE  W+++VL++
Sbjct: 1348 AGNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYXALAQGICEGXWIKRVLSE 1407

Query: 1682 LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQ 1741
            L Q   +P+ + CDN+A ISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q
Sbjct: 1408 LGQTSSSPILMMCDNQAXISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQ 1467

Query: 1742 VADVLTKGLLRPNFDFCVSKLGLIDIYVP 1745
             AD+LTK L RPNF+    KLGL DIY P
Sbjct: 1468 TADILTKALPRPNFEDLTCKLGLYDIYSP 1480

BLAST of CSPI01G15910 vs. NCBI nr
Match: gi|147769406|emb|CAN70229.1| (hypothetical protein VITISV_024789 [Vitis vinifera])

HSP 1 Score: 1447.6 bits (3746), Expect = 0.0e+00
Identity = 746/1468 (50.82%), Postives = 970/1468 (66.08%), Query Frame = 1

Query: 302  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 361
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 362  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 421
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 422  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 481
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIMEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 482  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 541
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 542  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 601
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 602  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 661
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTTNRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 662  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 721
            DGS         I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGS---------IKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 722  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 781
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELRSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 782  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 841
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R+ +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRIVYPQIPYKPSTVFSLVHSDV 567

Query: 842  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 901
            WGPS+I   SG RWFVTF+DDHT +TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTWVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 902  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 961
            SDN +E+   +LS +L +  I+H +SC  TPQQN VAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHDIIHISSCVDTPQQNRVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 962  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1021
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA++H +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFIHVYPQ 747

Query: 1022 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1081
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1082 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1141
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1142 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEAEHG 1201
            Q   P          +   +N+     R     LE+  +   G  I+      +      
Sbjct: 868  QPIAP----------QLANENLQVYLRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1202 HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKD 1261
              G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1262 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1321
            I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYK DG+++R KA
Sbjct: 988  IQEASKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKTDGSVERFKA 1047

Query: 1322 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1381
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1382 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1441
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVKK 1167

Query: 1442 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1501
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYF+GMEVA+S++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLSEEFEVKDLGNLKYFJGMEVAKSRKGI 1227

Query: 1502 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1561
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1562 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1621
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1622 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDL 1681
            G+++DR STSGYC+FVWGNLVTWRSKKQSVVARSS EAEY+AL+ GICE IW++ VL++L
Sbjct: 1348 GNIIDRWSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKXVLSEL 1407

Query: 1682 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1741
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQXSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1467

Query: 1742 ADVLTKGLLRPNFDFCVSKLGLIDIYVP 1745
            AD+LTK L RPNF+    KLGL DIY P
Sbjct: 1468 ADILTKALPRPNFEDLTCKLGLYDIYSP 1471

BLAST of CSPI01G15910 vs. NCBI nr
Match: gi|147860087|emb|CAN82928.1| (hypothetical protein VITISV_025045 [Vitis vinifera])

HSP 1 Score: 1431.4 bits (3704), Expect = 0.0e+00
Identity = 747/1468 (50.89%), Postives = 965/1468 (65.74%), Query Frame = 1

Query: 302  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 361
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 362  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 421
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 422  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 481
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 482  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 541
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 542  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 601
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 602  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 661
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 662  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 721
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 722  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 781
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 782  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 841
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 567

Query: 842  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 901
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 902  SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 961
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 962  LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1021
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 747

Query: 1022 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1081
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1082 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1141
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1142 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRDNEAEHG 1201
            Q   P          +   +N+     R     LE+  +   G  I+      +      
Sbjct: 868  QPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1202 HTGKS--DEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKD 1261
              G+      D S  +PIALRKG R CT HPI NYV+Y  LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1262 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1321
            I  ALK  EWK AV +E+ ALEKN TW I  LP G + +            D + D  KA
Sbjct: 988  IQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPM------------DQSKD-FKA 1047

Query: 1322 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1381
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1382 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1441
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG  DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQXDHTLFVKK 1167

Query: 1442 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1501
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLLEEFEVKDLGNLKYFLGMEVARSRKGI 1227

Query: 1502 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1561
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSKKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1562 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1621
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1622 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDL 1681
            G+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSS EAEY+AL+ GICE IW+++VL++L
Sbjct: 1348 GNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVLSEL 1407

Query: 1682 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1741
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQTSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1467

Query: 1742 ADVLTKGLLRPNFDFCVSKLGLIDIYVP 1745
            AD+LTK L RPNF+    KLGL DIY P
Sbjct: 1468 ADILTKALPRPNFEDLTCKLGLYDIYSP 1467

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.3e-15833.30Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME6.5e-15332.35Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
M810_ARATH5.3e-4641.07Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YCH4_YEAST1.3e-3933.33Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
YH41B_YEAST2.4e-3828.54Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A5AYJ3_VITVI0.0e+0053.76Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1[more]
A5B7Z8_VITVI0.0e+0051.40Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1[more]
A5AJR0_VITVI0.0e+0051.33Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1[more]
A5BJ12_VITVI0.0e+0050.82Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024789 PE=4 SV=1[more]
A5B7A7_VITVI0.0e+0050.89Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025045 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.11.2e-13345.62 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.13.0e-4741.07ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
ATMG00820.11.3e-2347.86ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
ATMG00240.13.7e-1341.46ATMG00240.1 Gag-Pol-related retrotransposon family protein[more]
Match NameE-valueIdentityDescription
gi|147819777|emb|CAN76196.1|0.0e+0053.76hypothetical protein VITISV_041073 [Vitis vinifera][more]
gi|147810393|emb|CAN59964.1|0.0e+0051.40hypothetical protein VITISV_022757 [Vitis vinifera][more]
gi|147778986|emb|CAN62538.1|0.0e+0051.33hypothetical protein VITISV_031159 [Vitis vinifera][more]
gi|147769406|emb|CAN70229.1|0.0e+0050.82hypothetical protein VITISV_024789 [Vitis vinifera][more]
gi|147860087|emb|CAN82928.1|0.0e+0050.89hypothetical protein VITISV_025045 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR013103RVT_2
IPR025724GAG-pre-integrase_dom
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0090304 nucleic acid metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G15910.1CSPI01G15910.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 813..928
score: 7.6
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 811..977
score: 22
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 807..969
score: 1.1
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 810..971
score: 2.91
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 1261..1504
score: 1.1
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 729..800
score: 1.1
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 300..595
score: 0.0coord: 224..253
score: 0.0coord: 617..1161
score: 0.0coord: 1230..1664
score:
NoneNo IPR availablePANTHERPTHR11439:SF185SUBFAMILY NOT NAMEDcoord: 224..253
score: 0.0coord: 1230..1664
score: 0.0coord: 300..595
score: 0.0coord: 617..1161
score:
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 354..501
score: 1.
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 1260..1483
score: 5.78E-42coord: 1513..1691
score: 5.78