CSPI05G02420 (gene) Wild cucumber (PI 183967)

Name: CSPI05G02420
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr5 : 3087660 .. 3095037 (+)
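
The Location field packs the chromosome, start, end and strand into one string. A minimal sketch of splitting it apart and computing the genomic span follows. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Pattern for a location value such as "Chr5 : 3087660 .. 3095037 (+)".
_LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    m = _LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Chr5 : 3087660 .. 3095037 (+)")
print(seqid, strand, span_length(start, end))
```

Under that coordinate assumption the span can be compared with the length of the gene sequence listed below as a quick consistency check.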
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAAGTTTGCTGGTGGTCTTTACACCACAAGCGTTGAGGTATAAGTTGACTAAATTGCTATGACCCATCTTCAGCATGGCAAATTGATATCCATCACTTGTGTAAGTGGTATAAAACCATTGTTCCGTTTGTGATATGCAGGCATTTGTCCCAAACACTGGTCGTGGTATTCAAGGTGCGACTTCACATTGTTTGGGTCAAAATTTTGCAAAATTGTTTGAAATCAACTTTGAAAATGAAAAGGGGGAGAAAGCTATGGTCTGGCAAAATTCGTGGGTCTATAGTACCAGAACGGTAATATTTTGATACTTGCATTTGTCGTAGCCTTTGGTAACTTGTTAGAGTGAATTTCTGCGTCTTGTTAATCTAGCCTTCCAAACTGTTTGTAGATTGGTGTGATGGTTATGGTTCACGGTGATGACAAGGGATTGGTGATGCCTCCCCAAGTTGCATCAATTCAAGTCATTATAGTTCCTGTTCCCTACAAAGATGCAGACACTCAAGGAATTTTTTATGCTTGTTCTGCCACTTCGAATATGTTGTCCAAAGCAGGAATTCGTGCAGAGGTAGACATTGGGGAGAACTATTCTCCTGGATGGAAATACTCCCACTGGGAGATGAAAGGTGTTCCACTCAGGATTGAAATAGGGCCCAAGGACTTAGCAAACAATCAGGTTAGATATTTTGTCTATTAATGTTCTTTTCTGTCTACTGAAATACTGTTTTCTCCTTCCTAATAATTATAAATGTCTGATTTGGATTGTGTTGATGCATAGTGCTAATCTATAATTGTATGCTTTCTTTTTCTAGGTACGCGCAGTTCGTCGTGATAATTCTGCAAAGAAGGACATACCTAGGGCTTCGTTGGTTGAACAAGTGAAAGAATTGCTAGAAAGTATTCAACAAAGCCTTTTTGATGCGGCAAAAGAAAAACGAGATGCATGCATTCAGGTTATTAATACATGGGATGAGTTTACTGAAGCCCTCGGTCAGAAGAAAATGATATTAGCTCCATGGTGTGATGAAGAGGTACAACTTCAGTGTTTATTGTTGCTGTTTTGGAATTCCACTTGTGTTGTTTATTACATATATTAAGGACTGCCTACTTTTTGGTTCTGGCTTTTCACGAGGCCACCAAAATTTTATTTGTTAGTTGAAGGGATCCTGGTTCCTATTCTTTTCCAGTAGTTAATATGCTTTCTGTTGGAATTCTTTCCTTTTTTTCTAGGCTTTTTCCTTGTTAGAATCCTAGAATAATTATGGAAAGATTATGAGAATGTATTCCTTATTTTCCTAAATCTTTTCCTTTTTTATTCCATTGTGTACTCTATTTATTCTCCCATGTACCTATTGTTTTATTCATTAGAAAATAATAACAACAAACAATCGTGGTTTTTCTCCCGGTACTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATATGGTATCAGAGCGGGACAATGAAAACACCTTAGAAACCCAAAAAAACCAAACCACTAATGAAAATCAAACAGAAGGGACAGCCATCAATTTTAGTGTTGTCGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTACCATGGACGAATTATTAAGCCGGCTACAGAAAACATCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGCCCCCAGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGTTACCATCCCGATGTTAAAAACTCCCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAG
CACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGAATCCGGTAAACTCGTTCCCTAATGTATCATCTCCTTATGTGACTAATACGGTGACTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACAACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCTTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGTAAGCCGTTATTGTTTGCTACAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCATCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGCATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCC
CGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATTAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAATTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCTCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATC
TGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACGATGTATTCCTTATTTGCCTAAATCTTTTCCTTTTTTATTCCATTGTGTACTCTATTTATTCTCCCTTGTACCTATTGCTTTATTCATTAGAAAATAATAACAACACAAACAATCGTGGTTTTTCTCCCGGTACTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATACTTTCGTATGCTGAAATCTATTTGATGAATCAACTATAGACTCCTGGTTTGTAGAGCTTGTTTTTGTAGCTAGGAAGGGAGATTTTGTTGCAATATTAGCTGCTGCAGCGTAAATAATGTTTCATGAGTTTAGGATGAGTTTAGGATGATTTCTATAGAATTGATAGGGTAATTATCAATAAAATTTTGAACTGCATTGATCACTGGGTTGAGTCGATGCATTTTTAAGTATTTGGTTTCTACTGAAGTCTGTGTTAGCTATGTGGTAAATGGAAATAAGTTTTCTGAAATCTGAAATACTTAATATATTTGTTGTATGCCATCCTCTATCATTTACTAGTGTATTTCTAAAGGATGATGGTTTCATTTGTTATGTTACCCCAGGAGGTCGAGAAGGATGTGAAAACAAGGACTAAAGGTGAGATGGGAGCAGCGAAAACCCTATGCTCCCCGTTCGAGCAGCCTCCCCTTCCAGAAGGTTACTAAATTCATGTCTAAATCTGTCTTCATTTTACATGTTAATATGAGTTTATTTCAACTTGTGGGATTTTGGGAAACTTGATAGCACTCTTTTAGTTCCTTGAATTTACTATGATGTTTGTGCTTAGGTACCAAATGTTTTGCATCTGGGAAGCCTGCAAAGAAATGGAGCTATTGGGGCCGAAGCTACTAAAGCTAATCAGCTCCCTATTTTCTCCTACGCTTCGTCTCGTTAAACGATGTTGGAAATCAGCAAGAGTTGTGGAGTTTGGAGGCATTCGATTCAGTCGAGTCGAGTTCATCATAATTTCCCGTATTAGATGA

mRNA sequence

ATGGAGAAGTTTGCTGGTGGTCTTTACACCACAAGCGTTGAGGCATTTGTCCCAAACACTGGTCGTGGTATTCAAGGTGCGACTTCACATTGTTTGGGTCAAAATTTTGCAAAATTGTTTGAAATCAACTTTGAAAATGAAAAGGGGGAGAAAGCTATGGTCTGGCAAAATTCGTGGGTCTATAGTACCAGAACGATTGGTGTGATGGTTATGGTTCACGGTGATGACAAGGGATTGGTGATGCCTCCCCAAGTTGCATCAATTCAAGTCATTATAGTTCCTGTTCCCTACAAAGATGCAGACACTCAAGGAATTTTTTATGCTTGTTCTGCCACTTCGAATATGTTGTCCAAAGCAGGAATTCGTGCAGAGGTAGACATTGGGGAGAACTATTCTCCTGGATGGAAATACTCCCACTGGGAGATGAAAGGTGTTCCACTCAGGATTGAAATAGGGCCCAAGGACTTAGCAAACAATCAGGTACGCGCAGTTCGTCGTGATAATTCTGCAAAGAAGGACATACCTAGGGCTTCGTTGGTTGAACAAGTGAAAGAATTGCTAGAAAGTATTCAACAAAGCCTTTTTGATGCGGCAAAAGAAAAACGAGATGCATGCATTCAGGTTATTAATACATGGGATGAGTTTACTGAAGCCCTCGGTCAGAAGAAAATGATATTAGCTCCATGGTGTGATGAAGAGCGGGACAATGAAAACACCTTAGAAACCCAAAAAAACCAAACCACTAATGAAAATCAAACAGAAGGGACAGCCATCAATTTTAGTGTTGTCGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTACCATGGACGAATTATTAAGCCGGCTACAGAAAACATCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGCCCCCAGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGTTACCATCCCGATGTTAAAAACTCCCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGAATCCGGTAAACTCGTTCCCTAATGTATCATCTCCTTATGTGACTAATACGGTGACTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACAACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCTTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGTAAGCCGTTATTGTTTGCTACAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTC
TAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCATCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGCATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATTAGATGAGTATGATT
CCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAATTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCTCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACGATGAGGTCGAGAAGGATGTGAAAACAAGGACTAAAGGTGAGATGGGAGCAGCGAAAACCCTATGCTCCCCGTTCGAGCAGCCTCCCCTTCCAGAAGCCTGCAAAGAAATGGAGCTATTGGGGCCGAAGCTACTAAAGCTAATCAGCTCCCTATTTTCTCCTACGCTTCGTCTCGTTAAACGATGTTGGAAATCAGCAAGAGTTGTGGAGTTTGGAGGCATTCGATTCAGTCGAGTCGAGTTCATCATAATTTCCCGTATTAGATGA

Coding sequence (CDS)

ATGGAGAAGTTTGCTGGTGGTCTTTACACCACAAGCGTTGAGGCATTTGTCCCAAACACTGGTCGTGGTATTCAAGGTGCGACTTCACATTGTTTGGGTCAAAATTTTGCAAAATTGTTTGAAATCAACTTTGAAAATGAAAAGGGGGAGAAAGCTATGGTCTGGCAAAATTCGTGGGTCTATAGTACCAGAACGATTGGTGTGATGGTTATGGTTCACGGTGATGACAAGGGATTGGTGATGCCTCCCCAAGTTGCATCAATTCAAGTCATTATAGTTCCTGTTCCCTACAAAGATGCAGACACTCAAGGAATTTTTTATGCTTGTTCTGCCACTTCGAATATGTTGTCCAAAGCAGGAATTCGTGCAGAGGTAGACATTGGGGAGAACTATTCTCCTGGATGGAAATACTCCCACTGGGAGATGAAAGGTGTTCCACTCAGGATTGAAATAGGGCCCAAGGACTTAGCAAACAATCAGGTACGCGCAGTTCGTCGTGATAATTCTGCAAAGAAGGACATACCTAGGGCTTCGTTGGTTGAACAAGTGAAAGAATTGCTAGAAAGTATTCAACAAAGCCTTTTTGATGCGGCAAAAGAAAAACGAGATGCATGCATTCAGGTTATTAATACATGGGATGAGTTTACTGAAGCCCTCGGTCAGAAGAAAATGATATTAGCTCCATGGTGTGATGAAGAGCGGGACAATGAAAACACCTTAGAAACCCAAAAAAACCAAACCACTAATGAAAATCAAACAGAAGGGACAGCCATCAATTTTAGTGTTGTCGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTACCATGGACGAATTATTAAGCCGGCTACAGAAAACATCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGCCCCCAGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGTTACCATCCCGATGTTAAAAACTCCCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGAATCCGGTAAACTCGTTCCCTAATGTATCATCTCCTTATGTGACTAATACGGTGACTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACAACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCTTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGTAAGCCGTTATTGTTTGCTACAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTC
TAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCATCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCCGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCATTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGCCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGCATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATTAGATGAGTATGATT
CCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAATTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCTCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACGATGAGGTCGAGAAGGATGTGAAAACAAGGACTAAAGGTGAGATGGGAGCAGCGAAAACCCTATGCTCCCCGTTCGAGCAGCCTCCCCTTCCAGAAGCCTGCAAAGAAATGGAGCTATTGGGGCCGAAGCTACTAAAGCTAATCAGCTCCCTATTTTCTCCTACGCTTCGTCTCGTTAAACGATGTTGGAAATCAGCAAGAGTTGTGGAGTTTGGAGGCATTCGATTCAGTCGAGTCGAGTTCATCATAATTTCCCGTATTAGATGA
BLAST of CSPI05G02420 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 535.4 bits (1378), Expect = 2.6e-150
Identity = 364/1141 (31.90%), Positives = 577/1141 (50.57%), Query Frame = 1

Query: 768  WILDSGATDHLTGSSEHFVSYIPCAGNETIRIA-DGSLAPIAGKG--KISPCAGLSLHNV 827
            ++LDSGA+DHL      +   +       I +A  G       +G  ++     ++L +V
Sbjct: 289  FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDV 348

Query: 828  LHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSS 887
            L   + + NL+S+ ++              +S +   SG  I       GL ++ +    
Sbjct: 349  LFCKEAAGNLMSVKRLQEA----------GMSIEFDKSGVTIS----KNGLMVVKNSGML 408

Query: 888  SSIPRTSLLS-SYFTTSEQDCMLWHFRLGHPNFQYM-----KHLFPH--LFSKVEMTTLS 947
            +++P  +  + S     + +  LWH R GH +   +     K++F    L + +E++   
Sbjct: 409  NNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEI 468

Query: 948  CDVCIQAKQHRVSFPSQPYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLT 1007
            C+ C+  KQ R+ F     K    +P  +VHSDV GP    T   K +FV F+D  T   
Sbjct: 469  CEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYC 528

Query: 1008 WVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNS 1067
              YLI  KS+V SMFQ+F    E  F+ K+  L  DNGRE+ ++ + +F   KGI +  +
Sbjct: 529  VTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLT 588

Query: 1068 CAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRIL--HLQ 1127
              +TPQ NGV+ER  R + E AR+++    L    WG+A+LTA +LINR+PSR L    +
Sbjct: 589  VPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSK 648

Query: 1128 TPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYK 1187
            TP +      P  +H     LRVFG T YVH     Q KF  ++   +FVGY P+  G+K
Sbjct: 649  TPYEMWHNKKPYLKH-----LRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEPN--GFK 708

Query: 1188 CFHPPSRKYFVTMDVTFCEDRPY------FPVSHLQGESVSEESNNTFEFIEPTPSVVSN 1247
             +   + K+ V  DV   E          F    L+    SE  N    F   +  ++  
Sbjct: 709  LWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKN----FPNDSRKIIQT 768

Query: 1248 IIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTK-N 1307
              P+      N       + ++ K+            +  +E P      N ++ C    
Sbjct: 769  EFPNESKECDN-----IQFLKDSKESENKNFPNDSRKIIQTEFP------NESKECDNIQ 828

Query: 1308 MISENDRSNVAVLENVEEKDSGDEI-EVRIETRNNEAEQGHTGK------LDEYDSSLDI 1367
             + ++  SN   L   +++   D + E +     NE+ +  T +      +D    +  I
Sbjct: 829  FLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGI 888

Query: 1368 PIALRKGTRSCTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVM 1427
             I  R+  R  TK  I      +SL+     A T   D      +I        W+ A+ 
Sbjct: 889  EIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAIN 948

Query: 1428 EEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDY 1487
             E+ A + N+TW I   P+    V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY
Sbjct: 949  TELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDY 1008

Query: 1488 SETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH 1547
             ETF+PVA++++ R +LS+ +  +  ++Q+DVK AFLNG L EE+YM  P G       +
Sbjct: 1009 EETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCN-SDN 1068

Query: 1548 VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYV 1607
            VCKL K+IYGLKQ+ R WF+ F   +K   +     D  ++  +   G I     +++YV
Sbjct: 1069 VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIY--ILDKGNINENIYVLLYV 1128

Query: 1608 DDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLT 1667
            DD+V+   D   ++  K+ + ++F + DL  +K+F+G+ +   ++ I +SQ  Y+  +L+
Sbjct: 1129 DDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILS 1188

Query: 1668 ETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSV 1727
            +  M  C    TP+  + N +L NSD+         + L+G L+Y+   TRPD++ AV++
Sbjct: 1189 KFNMENCNAVSTPLPSKINYELLNSDEDC---NTPCRSLIGCLMYIMLCTRPDLTTAVNI 1248

Query: 1728 VSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKS 1787
            +S++    N E  + + R+LRYLK T    L+F+K       I  Y DSDWAGS +DRKS
Sbjct: 1249 LSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKS 1308

Query: 1788 TSGYCTFVWG-NLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQECETP 1847
            T+GY   ++  NL+ W +K+Q+ VA SS EAEY A+   + E +WL+ +LT ++ + E P
Sbjct: 1309 TTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENP 1368

Query: 1848 LKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKG 1870
            +K++ DN+  ISIANNP  H R KH++I  HF +E++ +  IC+ YIP+  Q+AD+ TK 
Sbjct: 1369 IKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKP 1386
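
Each hit on this page opens with two summary lines (bit score and E-value, then identity and positive counts). A minimal sketch of extracting those numbers is shown here. The function name `parse_hsp` and the dictionary keys are hypothetical, and the regular expressions assume the exact line layout used on this page; they accept both the "Postives" and "Positives" spellings of the label.

```python
import re

# Layout assumed: "HSP 1 Score: 535.4 bits (1378), Expect = 2.6e-150"
_SCORE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# Layout assumed: "Identity = 364/1141 (31.90%), Positives = 577/1141 (50.57%), ..."
_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse the two HSP summary lines into a dict of numbers."""
    s = _SCORE.search(score_line)
    i = _IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("HSP summary lines not in the expected layout")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positives = int(i.group(3))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "aligned": aligned,
        # Percentages are recomputed from the counts rather than copied.
        "identity_pct": 100.0 * identical / aligned,
        "positives_pct": 100.0 * positives / aligned,
    }


hsp = parse_hsp(
    "HSP 1 Score: 535.4 bits (1378), Expect = 2.6e-150",
    "Identity = 364/1141 (31.90%), Postives = 577/1141 (50.57%), Query Frame = 1",
)
print(f"{hsp['identity_pct']:.2f}% identity over {hsp['aligned']} positions")
```

Recomputing the percentages from the raw counts makes it easy to cross-check the values printed in the report.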

BLAST of CSPI05G02420 vs. Swiss-Prot
Match: SYPC_ARATH (Proline--tRNA ligase, cytoplasmic OS=Arabidopsis thaliana GN=At3g62120 PE=2 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 2.1e-107
Identity = 179/246 (72.76%), Positives = 213/246 (86.59%), Query Frame = 1

Query: 2   EKFAGGLYTTSVEAFVPNTGRGIQGATSHCLGQNFAKLFEINFENEKGEKAMVWQNSWVY 61
           EKFAGGLYTTSVEAF+PNTGRG+QGATSHCLGQNFAK+FEINFENEK E  MVWQNSW Y
Sbjct: 247 EKFAGGLYTTSVEAFIPNTGRGVQGATSHCLGQNFAKMFEINFENEKAETEMVWQNSWAY 306

Query: 62  STRTIGVMVMVHGDDKGLVMPPQVASIQVIIVPVPYKDADTQGIFYACSATSNMLSKAGI 121
           STRTIGVM+M HGDDKGLV+PP+VAS+QV+++PVPYKDA+TQGI+ AC+AT++ L +AGI
Sbjct: 307 STRTIGVMIMTHGDDKGLVLPPKVASVQVVVIPVPYKDANTQGIYDACTATASALCEAGI 366

Query: 122 RAEVDIGENYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVE 181
           RAE D+ +NYSPGWKYS WEMKGVPLRIEIGP+DL N+QVR VRRDN  K+DIPR SLVE
Sbjct: 367 RAEEDLRDNYSPGWKYSDWEMKGVPLRIEIGPRDLENDQVRTVRRDNGVKEDIPRGSLVE 426

Query: 182 QVKELLESIQQSLFDAAKEKRDACIQVINTWDEFTEALGQKKMILAPWCDEERDNENTLE 241
            VKELLE IQQ++++ AK+KR+AC+Q + TWDEF +AL +KK+ILAPWCDEE    +   
Sbjct: 427 HVKELLEKIQQNMYEVAKQKREACVQEVKTWDEFIKALNEKKLILAPWCDEEEVERDVKA 486

Query: 242 TQKNQT 248
             K +T
Sbjct: 487 RTKGET 492

BLAST of CSPI05G02420 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 3.0e-106
Identity = 214/517 (41.39%), Positives = 322/517 (62.28%), Query Frame = 1

Query: 1380 PKDIYTALKYPEWKNAVM----EEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADG 1439
            P+ +   L +PE KN +M    EEM++L+KN T+ +  LPKG + + CKWVF LK   D 
Sbjct: 811  PESLKEVLSHPE-KNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDC 870

Query: 1440 TLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLN 1499
             L R+KARLV KGF Q  GID+ E FSPV K+ +IR +LS+A + D  + QLDVK AFL+
Sbjct: 871  KLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLH 930

Query: 1500 GDLVEEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSD 1559
            GDL EE+YM  P GFE    +H VCKL KS+YGLKQ+PR W+ +F +F+KSQ Y + +SD
Sbjct: 931  GDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSD 990

Query: 1560 HTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEV 1619
              ++ K        +L++YVDD+++ G D+  I++LK  +   F++KDLG  +  LGM++
Sbjct: 991  PCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKI 1050

Query: 1620 ARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS------DDQVPVDKE 1679
             R +    + +SQ KYI  +L    M   +P  TP+  + KL         +++  + K 
Sbjct: 1051 VRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKV 1110

Query: 1680 QYQRLVGKLIY-LSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFR 1739
             Y   VG L+Y +  TRPDI+ AV VVS+F++ P +EH +AV  ILRYL+ T G  L F 
Sbjct: 1111 PYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFG 1170

Query: 1740 KTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMS 1799
             +D   ++ YTD+D AG + +RKS++GY     G  ++W+SK Q  VA S+ EAEY A +
Sbjct: 1171 GSD-PILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAAT 1230

Query: 1800 LGICEEIWLQKVLTD--LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE 1859
                E IWL++ L +  LHQ+      ++CD+++AI ++ N + H RTKH+++  H+I+E
Sbjct: 1231 ETGKEMIWLKRFLQELGLHQK---EYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIRE 1290

Query: 1860 KLDSGSICIPYIPSSQQVADVLTKGLLRPNDEVEKDV 1881
             +D  S+ +  I +++  AD+LTK + R   E+ K++
Sbjct: 1291 MVDDESLKVLKISTNENPADMLTKVVPRNKFELCKEL 1322

BLAST of CSPI05G02420 vs. Swiss-Prot
Match: PRS1_SCHPO (Putative proline--tRNA ligase C19C7.06 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=prs1 PE=3 SV=1)

HSP 1 Score: 243.8 bits (621), Expect = 1.6e-62
Identity = 127/265 (47.92%), Positives = 167/265 (63.02%), Query Frame = 1

Query: 2   EKFAGGLYTTSVEAFVPNTGRGIQGATSHCLGQNFAKLFEINFENEKGE--------KAM 61
           EKFAGG++TT+VE ++P TGRGIQGATSHCLGQNF+K+F I  E+   E        K  
Sbjct: 405 EKFAGGMFTTTVEGYIPTTGRGIQGATSHCLGQNFSKMFNIVVEDPNAEIGPTGERPKLF 464

Query: 62  VWQNSWVYSTRTIGVMVMVHGDDKGLVMPPQVASIQVIIVPVPYK----DADTQGIFYAC 121
           VWQNSW  STRTIGV VMVHGDDKGL +PP +A +Q ++VP        D +   I   C
Sbjct: 465 VWQNSWGLSTRTIGVAVMVHGDDKGLKLPPAIALVQSVVVPCGITNKTTDQERNEIEGFC 524

Query: 122 SATSNMLSKAGIRAEVDIGENYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNS 181
           S  ++ L+ A IR E D+   Y+PG+K+SHWEMKGVPLR+E GP D   NQV AVRRD  
Sbjct: 525 SKLADRLNAADIRTEADL-RAYTPGYKFSHWEMKGVPLRLEYGPNDAKKNQVTAVRRDTF 584

Query: 182 AKKDIPRASLVEQVKELLESIQQSLFDAAKEKRDACIQVINTWDEFTEALGQKKMILAPW 241
            K  +P  +L + V +LL  IQ ++++ AK +RDA +  +  W +F  AL +K +++ PW
Sbjct: 585 EKIPVPLNNLEKGVSDLLAKIQTNMYETAKAERDAHVVKVKEWADFVPALNKKNIVMIPW 644

Query: 242 CDEERDNENTLETQKNQTTNENQTE 255
           C+     E   E +KN     N  E
Sbjct: 645 CN---TTECEKEIKKNSARQVNGDE 665

BLAST of CSPI05G02420 vs. Swiss-Prot
Match: SYEP_DROME (Bifunctional glutamate/proline--tRNA ligase OS=Drosophila melanogaster GN=Aats-glupro PE=1 SV=2)

HSP 1 Score: 242.7 bits (618), Expect = 3.5e-62
Identity = 122/236 (51.69%), Positives = 163/236 (69.07%), Query Frame = 1

Query: 2    EKFAGGLYTTSVEAFVPNTGRGIQGATSHCLGQNFAKLFEINFEN-EKGEKAMVWQNSWV 61
            EKFAGG YTT+VEAF+  +GR IQGATSH LGQNF+K+FEI +E+ E  +K  V+QNSW 
Sbjct: 1415 EKFAGGDYTTTVEAFISASGRAIQGATSHHLGQNFSKMFEIVYEDPETQQKKYVYQNSWG 1474

Query: 62   YSTRTIGVMVMVHGDDKGLVMPPQVASIQVIIVP----VPYKDADTQGIFYACSATSNML 121
             +TRTIGVM+MVH D++GLV+PP VA IQ I+VP    V  KD +   +  AC A    L
Sbjct: 1475 ITTRTIGVMIMVHADNQGLVLPPHVACIQAIVVPCGITVNTKDDERAQLLDACKALEKRL 1534

Query: 122  SKAGIRAEVDIGENYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPR 181
               G+R E D  +NYSPGWK++HWE+KGVPLR+E+GPKDL   Q+ AVRRD   K  IP 
Sbjct: 1535 VGGGVRCEGDYRDNYSPGWKFNHWELKGVPLRLEVGPKDLKAQQLVAVRRDTVEKITIPL 1594

Query: 182  ASLVEQVKELLESIQQSLFDAAKEKRDACIQVINTWDEFTEALGQKKMILAPWCDE 233
            A + +++  LLE+I +S+ + A+E   +  + +  W +F   L QK ++LAP+C E
Sbjct: 1595 ADVEKKIPALLETIHESMLNKAQEDMTSHTKKVTNWTDFCGFLEQKNILLAPFCGE 1650

BLAST of CSPI05G02420 vs. TrEMBL
Match: A5AYJ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1)

HSP 1 Score: 1557.3 bits (4031), Expect = 0.0e+00
Identity = 793/1478 (53.65%), Positives = 1022/1478 (69.15%), Query Frame = 1

Query: 445  TQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDS 504
            T  S + L+  KLNG NY  W+QSVK+ ++G+ K   L GE+ +P+  DP+ + W+  + 
Sbjct: 32   TSLSSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFREL 91

Query: 505  ILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 564
            +            IGKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ + +QG  +
Sbjct: 92   VA-----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDRE 151

Query: 565  VTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 624
            VT+++N++  +WQE+DLC E  W  P D V++ + EENDR+Y FLA LN   D VRGRIL
Sbjct: 152  VTTYYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRIL 211

Query: 625  GQRPIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIP 684
            G++P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P
Sbjct: 212  GRKPLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---P 271

Query: 685  VCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEP--PQQSDPHKNQT 744
             C+HCKK WHTK  CWK+HG+P   KK+  +D    GRA+ + SA+   PQ +    N T
Sbjct: 272  WCDHCKKPWHTKGTCWKIHGKPQNFKKKNGSD----GRAFQTMSADSQGPQINSEKPNFT 331

Query: 745  DLSLATLGAIVQSG---------------IPHSFGLVSIDGKNPWILDSGATDHLTGSSE 804
               L+ L  + QS                +  +   +  +   PWI+DSGATDH+TGSS+
Sbjct: 332  KEQLSHLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQ 391

Query: 805  HFVSYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHE 864
             F SY PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +
Sbjct: 392  IFSSYKPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQD 451

Query: 865  LNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQD 924
              C+A F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D
Sbjct: 452  HQCQANFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDD 511

Query: 925  CMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTL 984
             +LWH+RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L
Sbjct: 512  IILWHYRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSL 571

Query: 985  VHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQK 1044
            +HSDVWGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  K
Sbjct: 572  IHSDVWGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTK 631

Query: 1045 IAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLST 1104
            I + RSDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T
Sbjct: 632  IQVFRSDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTT 691

Query: 1105 SLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVH 1164
             +P YLWG+AILTA +LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH
Sbjct: 692  KVPKYLWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVH 751

Query: 1165 NFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQG 1224
                N+ K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQG
Sbjct: 752  IHDHNRGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQG 811

Query: 1225 ESVSEESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNH 1284
            ES SE+S+          N    +EP+ S   V  NI    +    + + + KT      
Sbjct: 812  ESTSEDSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGK 871

Query: 1285 KKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGD 1344
            K  V +  S   +    S         N     TKN  +    R      E+  +   G 
Sbjct: 872  KNGVLNIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGH 931

Query: 1345 EIEVRIETRNNEAEQGH-----------TGKLDEYDSSLDIPIALRKGTRSCTKHPICNY 1404
            E E+R E  ++E    +           +    E    L+IPIA RKG RSCTKHP+ NY
Sbjct: 932  ESELREEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNY 991

Query: 1405 VSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKG 1464
            +SY +LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG
Sbjct: 992  MSYKNLSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKG 1051

Query: 1465 HKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVA 1524
              TVGCKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A
Sbjct: 1052 KTTVGCKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIA 1111

Query: 1525 VNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFD 1584
             N DWPL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+
Sbjct: 1112 ANLDWPLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFE 1171

Query: 1585 RFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDE 1644
            RFT FVK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  E
Sbjct: 1172 RFTQFVKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALE 1231

Query: 1645 FEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS 1704
            FEIKDLG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG++
Sbjct: 1232 FEIKDLGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDT 1291

Query: 1705 DDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKST 1764
            +D   V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKST
Sbjct: 1292 NDGNLVNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKST 1351

Query: 1765 PGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSA 1824
            PGKGL F+K+++KTIEAYTD+DWAGSV DR+STSGYCT++WGNLVTWRSKKQSV ARSSA
Sbjct: 1352 PGKGLFFKKSEQKTIEAYTDADWAGSVTDRRSTSGYCTYIWGNLVTWRSKKQSVXARSSA 1411

Query: 1825 EAEYRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEID 1876
            EAEYRAM+ G+CE +WL+K+L +L +  E P+KL+CDNKAAISIA+NPVQHDRTKHVEID
Sbjct: 1412 EAEYRAMAHGVCEILWLKKILEELKRPLEMPMKLYCDNKAAISIAHNPVQHDRTKHVEID 1471

BLAST of CSPI05G02420 vs. TrEMBL
Match: A5B7Z8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1)

HSP 1 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 749/1457 (51.41%), Positives = 975/1457 (66.92%), Query Frame = 1

Query: 445  TQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDS 504
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 505  ILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 564
            ++ S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 565  VTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 624
            VT ++N L+  WQ++DL     W+   D   Y  I E  R++ F  GLN + D VRGRI+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 625  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 684
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 685  CEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 744
            C++CKK  H KE CWKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 745  DLSLATLGAIVQSGIPHSFGLVSI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 804
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 805  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 864
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 865  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 924
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 925  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 984
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 985  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 1044
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 1045 ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 1104
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 1105 PSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 1164
            P+Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 1165 FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 1224
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 1225 SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 1284
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 1285 APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA 1344
             P+Q   P          +   +N+     R     LE+  +   G  I+          
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 1345 EQGHTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTII 1404
             +   G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 1405 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1464
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1465 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1524
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1525 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1584
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1585 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1644
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1645 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1704
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1705 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1764
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1345

Query: 1765 DWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL 1824
            DWAG+++DR+STSGYC+FVWGNLVT RSKKQSVVARSSAEAEYRA++ GICE IW+++VL
Sbjct: 1346 DWAGNIIDRRSTSGYCSFVWGNLVTXRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVL 1405

Query: 1825 TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSS 1876
            ++L Q   +P+ + CDN+AAISIA NPV HD TKHVEIDRHFI EK+ S ++ + Y+P+ 
Sbjct: 1406 SELGQTSSSPILMMCDNQAAISIAKNPVHHDXTKHVEIDRHFITEKVTSETVKLNYVPTK 1465

BLAST of CSPI05G02420 vs. TrEMBL
Match: A5AJR0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1)

HSP 1 Score: 1438.7 bits (3723), Expect = 0.0e+00
Identity = 746/1455 (51.27%), Positives = 972/1455 (66.80%), Query Frame = 1

Query: 447  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 506
            SS   ++G KLNG+NY  WSQSV + + G+ K  + TGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYXTGEAXMPETTEPXFRKWKIENSMI 87

Query: 507  RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 566
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 567  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 626
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 627  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 686
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 687  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 746
            + KK  H KE CWKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 747  SLATLGAIVQSGIPHSFGLV-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 806
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 807  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 866
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 867  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 926
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 927  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 986
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 987  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 1046
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 1047 RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 1106
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 1107 YLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFG 1166
            Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 1167 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1226
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 1227 SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 1286
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 1287 VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQ 1346
            +Q   P          +   +N+     R     LE+  +   G  I+           +
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 1347 GHTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPK 1406
               G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 1407 DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1466
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1467 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1526
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1527 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1586
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1587 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1646
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1647 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1706
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1707 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1766
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1347

Query: 1767 AGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTD 1826
            AG+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEY A++ GICE  W+++VL++
Sbjct: 1348 AGNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYXALAQGICEGXWIKRVLSE 1407

Query: 1827 LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQ 1876
            L Q   +P+ + CDN+A ISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q
Sbjct: 1408 LGQTSSSPILMMCDNQAXISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQ 1466

BLAST of CSPI05G02420 vs. TrEMBL
Match: A5BJ12_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024789 PE=4 SV=1)

HSP 1 Score: 1431.8 bits (3705), Expect = 0.0e+00
Identity = 739/1454 (50.83%), Positives = 963/1454 (66.23%), Query Frame = 1

Query: 447  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 506
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 507  RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 566
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 567  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 626
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIMEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 627  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 686
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 687  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 746
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 747  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 806
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTTNRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 807  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 866
            DGS         I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGS---------IKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 867  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 926
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELRSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 927  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 986
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R+ +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRIVYPQIPYKPSTVFSLVHSDV 567

Query: 987  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 1046
            WGPS+I   SG RWFVTF+DDHT +TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTWVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 1047 SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 1106
            SDN +E+   +LS +L +  I+H +SC  TPQQN VAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHDIIHISSCVDTPQQNRVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 1107 LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1166
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA++H +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFIHVYPQ 747

Query: 1167 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1226
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1227 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1286
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1287 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 1346
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYLRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1347 HTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 1406
              G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1407 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1466
            I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYK DG+++R KA
Sbjct: 988  IQEASKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKTDGSVERFKA 1047

Query: 1467 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1526
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1527 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1586
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVKK 1167

Query: 1587 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1646
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYF GMEVA+S++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLSEEFEVKDLGNLKYFXGMEVAKSRKGI 1227

Query: 1647 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1706
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1707 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1766
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1767 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1826
            G+++DR STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW++ VL++L
Sbjct: 1348 GNIIDRWSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKXVLSEL 1407

Query: 1827 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1876
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQXSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1457

BLAST of CSPI05G02420 vs. TrEMBL
Match: A5B7A7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025045 PE=4 SV=1)

HSP 1 Score: 1417.5 bits (3668), Expect = 0.0e+00
Identity = 740/1454 (50.89%), Positives = 959/1454 (65.96%), Query Frame = 1

Query: 447  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 506
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 507  RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 566
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 567  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 626
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 627  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 686
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 687  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 746
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 747  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 806
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 807  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 866
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 867  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 926
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 927  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 986
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 567

Query: 987  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 1046
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 1047 SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 1106
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 1107 LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1166
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 747

Query: 1167 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1226
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1227 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1286
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1287 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 1346
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1347 HTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 1406
              G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1407 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1466
            I  ALK  EWK AV +E+ ALEKN TW I  LP G + +            D + D  KA
Sbjct: 988  IQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPM------------DQSKD-FKA 1047

Query: 1467 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1526
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1527 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1586
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG  DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQXDHTLFVKK 1167

Query: 1587 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1646
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLLEEFEVKDLGNLKYFLGMEVARSRKGI 1227

Query: 1647 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1706
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSKKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1707 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1766
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1767 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1826
            G+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW+++VL++L
Sbjct: 1348 GNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVLSEL 1407

Query: 1827 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1876
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQTSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1453

BLAST of CSPI05G02420 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 478.8 bits (1231), Expect = 1.6e-134
Identity = 230/502 (45.82%), Positives = 335/502 (66.73%), Query Frame = 1

Query: 1348 SCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS 1407
            S T H I  ++SY+ +SP + +F   +     P     A ++  W  A+ +E+ A+E   
Sbjct: 54   SLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTH 113

Query: 1408 TWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL 1467
            TW+ICTLP   K +GCKWV+ +KY +DGT++R+KARLVAKG+TQ  GID+ ETFSPV KL
Sbjct: 114  TWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKL 173

Query: 1468 NTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQH-----VCKLQ 1527
             +++++L+++   ++ L+QLD+ NAFLNGDL EE+YM  PPG+ A+ G       VC L+
Sbjct: 174  TSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLK 233

Query: 1528 KSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGD 1587
            KSIYGLKQ+ R WF +F+  +   G+ Q HSDHT F K++ T  + VL VYVDDI++  +
Sbjct: 234  KSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL-VYVDDIIICSN 293

Query: 1588 DQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCR 1647
            + A + +LK ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+
Sbjct: 294  NDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 353

Query: 1648 PTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEE 1707
            P+  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + P   
Sbjct: 354  PSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLA 413

Query: 1708 HMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLV 1767
            H +AV +IL Y+K T G+GL +       ++ ++D+ +      R+ST+GYC F+  +L+
Sbjct: 414  HQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLI 473

Query: 1768 TWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIA 1827
            +W+SKKQ VV++SSAEAEYRA+S    E +WL +   +L      P  LFCDN AAI IA
Sbjct: 474  SWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIA 533

Query: 1828 NNPVQHDRTKHVEIDRHFIKEK 1845
             N V H+RTKH+E D H ++E+
Sbjct: 534  TNAVFHERTKHIESDCHSVRER 554

BLAST of CSPI05G02420 vs. TAIR10
Match: AT3G62120.1 (AT3G62120.1 Class II aaRS and biotin synthetases superfamily protein)

HSP 1 Score: 392.9 bits (1008), Expect = 1.2e-108
Identity = 179/246 (72.76%), Positives = 213/246 (86.59%), Query Frame = 1

Query: 2   EKFAGGLYTTSVEAFVPNTGRGIQGATSHCLGQNFAKLFEINFENEKGEKAMVWQNSWVY 61
           EKFAGGLYTTSVEAF+PNTGRG+QGATSHCLGQNFAK+FEINFENEK E  MVWQNSW Y
Sbjct: 247 EKFAGGLYTTSVEAFIPNTGRGVQGATSHCLGQNFAKMFEINFENEKAETEMVWQNSWAY 306

Query: 62  STRTIGVMVMVHGDDKGLVMPPQVASIQVIIVPVPYKDADTQGIFYACSATSNMLSKAGI 121
           STRTIGVM+M HGDDKGLV+PP+VAS+QV+++PVPYKDA+TQGI+ AC+AT++ L +AGI
Sbjct: 307 STRTIGVMIMTHGDDKGLVLPPKVASVQVVVIPVPYKDANTQGIYDACTATASALCEAGI 366

Query: 122 RAEVDIGENYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVE 181
           RAE D+ +NYSPGWKYS WEMKGVPLRIEIGP+DL N+QVR VRRDN  K+DIPR SLVE
Sbjct: 367 RAEEDLRDNYSPGWKYSDWEMKGVPLRIEIGPRDLENDQVRTVRRDNGVKEDIPRGSLVE 426

Query: 182 QVKELLESIQQSLFDAAKEKRDACIQVINTWDEFTEALGQKKMILAPWCDEERDNENTLE 241
            VKELLE IQQ++++ AK+KR+AC+Q + TWDEF +AL +KK+ILAPWCDEE    +   
Sbjct: 427 HVKELLEKIQQNMYEVAKQKREACVQEVKTWDEFIKALNEKKLILAPWCDEEEVERDVKA 486

Query: 242 TQKNQT 248
             K +T
Sbjct: 487 RTKGET 492

BLAST of CSPI05G02420 vs. TAIR10
Match: AT5G10880.1 (AT5G10880.1 tRNA synthetase-related / tRNA ligase-related)

HSP 1 Score: 207.6 bits (527), Expect = 6.9e-53
Identity = 100/163 (61.35%), Positives = 121/163 (74.23%), Query Frame = 1

Query: 71  MVHGDDKGLVMPPQVASIQVIIVPVPYKDA-DTQGIFYACSATSNMLSKAGIRAEVDIGE 130
           M HGDDKGLV PP+VA +QV+++ VP K A D Q +  AC A  + L  AGIRAE DI +
Sbjct: 89  MTHGDDKGLVFPPKVAPVQVVVIHVPIKGAADYQELCDACEAVESTLLGAGIRAEADIRD 148

Query: 131 NYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLES 190
           NYS GWKY+  E+ GVPLRIE GP+DLAN+QVR V RDN AK D+ R  L+EQVK+LLE 
Sbjct: 149 NYSCGWKYADQELTGVPLRIETGPRDLANDQVRIVTRDNGAKMDVKRGDLIEQVKDLLEK 208

Query: 191 IQQSLFDAAKEKRDACIQVINTWDEFTEALGQKKMILAPWCDE 233
           IQ +L+D AK K + C Q + TWDEF EAL QKK+ILAPWCD+
Sbjct: 209 IQSNLYDVAKRKVEECTQKVETWDEFVEALSQKKLILAPWCDK 251

BLAST of CSPI05G02420 vs. TAIR10
Match: AT5G52520.1 (AT5G52520.1 Class II aaRS and biotin synthetases superfamily protein)

HSP 1 Score: 201.8 bits (512), Expect = 3.8e-51
Identity = 110/266 (41.35%), Positives = 159/266 (59.77%), Query Frame = 1

Query: 1   MEKFAGGLYTTSVEAFVPNTGRGIQGATSHCLGQNFAKLFEINFENEKGEKAMVWQNSWV 60
           +E FAG   T ++EA + +  + +Q  TSH LGQNF++ F   F +E GE+  VWQ SW 
Sbjct: 261 LETFAGADITYTIEAMMGDR-KALQAGTSHNLGQNFSRAFGTQFADENGERQHVWQTSWA 320

Query: 61  YSTRTIGVMVMVHGDDKGLVMPPQVASIQVIIVPVPYKDADTQGIFYACSATSNMLSKAG 120
            STR +G ++M HGDD GL++PP++A IQV+IVP+  KD +  G+  A S+    L  AG
Sbjct: 321 VSTRFVGGIIMTHGDDTGLMLPPKIAPIQVVIVPIWKKDTEKTGVLSAASSVKEALQTAG 380

Query: 121 IRAEVDIGENYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKK------DI 180
           +R ++D  +  +PGWK++ WEMKG+PLRIEIGP+D+++N V   RRD   K        +
Sbjct: 381 VRVKLDDTDQRTPGWKFNFWEMKGIPLRIEIGPRDVSSNSVVVSRRDVPGKAGKVFGISM 440

Query: 181 PRASLVEQVKELLESIQQSLFDAAKEKRDACIQVINTWDEFTEALGQKKMILAPW----C 240
             ++LV  VKE L+ IQ SL + A   RD+ I  +N++ E  +A+   K    PW     
Sbjct: 441 EPSTLVAYVKEKLDEIQTSLLEKALSFRDSNIVDVNSYAELKDAISSGKWARGPWSASDA 500

Query: 241 DEERDNENTLETQKNQTTNENQTEGT 257
           DE+R  E T  T +       QT+GT
Sbjct: 501 DEQRVKEETGATIR--CFPFEQTQGT 523

BLAST of CSPI05G02420 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 187.2 bits (474), Expect = 9.7e-47
Identity = 91/224 (40.62%), Positives = 135/224 (60.27%), Query Frame = 1

Query: 1570 LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1629
            L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1630 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1689
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1690 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1749
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1750 TSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW 1794
            T+G+CTF+  N+++W +K+Q  V+RSS E EYRA++L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI05G02420 vs. NCBI nr
Match: gi|147819777|emb|CAN76196.1| (hypothetical protein VITISV_041073 [Vitis vinifera])

HSP 1 Score: 1557.3 bits (4031), Expect = 0.0e+00
Identity = 793/1478 (53.65%), Positives = 1022/1478 (69.15%), Query Frame = 1

Query: 445  TQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDS 504
            T  S + L+  KLNG NY  W+QSVK+ ++G+ K   L GE+ +P+  DP+ + W+  + 
Sbjct: 32   TSLSSFQLTIHKLNGKNYLEWAQSVKLAIDGRGKLGHLNGEVSKPVADDPNLKTWRFREL 91

Query: 505  ILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 564
            +            IGKP LF  TAKD+W+  + +YS  +N+S+++ L+ ++ + +QG  +
Sbjct: 92   VA-----------IGKPHLFLPTAKDVWEAVRDMYSDLENSSQIFDLKSKLWQSRQGDRE 151

Query: 565  VTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 624
            VT+++N++  +WQE+DLC E  W  P D V++ + EENDR+Y FLA LN   D VRGRIL
Sbjct: 152  VTTYYNQMVTLWQELDLCYEDEWDCPNDSVRHKKREENDRVYVFLAALNHNLDEVRGRIL 211

Query: 625  GQRPIPSLMEVCSEIRLEEDRTSAMNIS----ATPTIDSAAFSARSSNSSSDKHNGKPIP 684
            G++P+PS+ EV SE+R EE R   M       + P I+S+A  ++ S+   D+      P
Sbjct: 212  GRKPLPSIREVFSEVRREEARRKVMLTDPEPMSNPEIESSALVSKGSDLDGDRRKK---P 271

Query: 685  VCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEP--PQQSDPHKNQT 744
             C+HCKK WHTK  CWK+HG+P   KK+  +D    GRA+ + SA+   PQ +    N T
Sbjct: 272  WCDHCKKPWHTKGTCWKIHGKPQNFKKKNGSD----GRAFQTMSADSQGPQINSEKPNFT 331

Query: 745  DLSLATLGAIVQSG---------------IPHSFGLVSIDGKNPWILDSGATDHLTGSSE 804
               L+ L  + QS                +  +   +  +   PWI+DSGATDH+TGSS+
Sbjct: 332  KEQLSHLYKLFQSPQFSNPSCSLAQQGNYLIAALSSIKSNVHCPWIIDSGATDHMTGSSQ 391

Query: 805  HFVSYIPCAGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHE 864
             F SY PCAGN+ I+I DGSL+ IAGKG +     L+LHNVLHVP LS NLLSISKIT +
Sbjct: 392  IFSSYKPCAGNKKIKIXDGSLSAIAGKGSVFISPSLTLHNVLHVPNLSCNLLSISKITQD 451

Query: 865  LNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQD 924
              C+A F P    FQ+L+SGR IG AR   GLY  ++ + S    +++   S    S  D
Sbjct: 452  HQCQANFYPSYCEFQELTSGRTIGNAREIGGLYFFENGSESRKPIQSTCFESISVASSDD 511

Query: 925  CMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTL 984
             +LWH+RLGHP+FQY+KHLFP LF     ++  C+ C  AK HR SFP QPY+ ++PF+L
Sbjct: 512  IILWHYRLGHPSFQYLKHLFPSLFRNKNPSSFQCEFCELAKHHRTSFPLQPYRISKPFSL 571

Query: 985  VHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQK 1044
            +HSDVWGPS+I+T SGK+WFVTFIDDHTR++WVYL+ +KSEV  +F+ FY  + TQF  K
Sbjct: 572  IHSDVWGPSRISTLSGKKWFVTFIDDHTRVSWVYLLREKSEVEEVFKIFYTMVLTQFQTK 631

Query: 1045 IAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLST 1104
            I + RSDNG+E+ N  L +F   KGIVHQ+SC  TPQQNG+AERKN+HLLEVAR+L  +T
Sbjct: 632  IQVFRSDNGKEYINKALGKFFLEKGIVHQSSCNDTPQQNGIAERKNKHLLEVARALCFTT 691

Query: 1105 SLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVH 1164
             +P YLWG+AILTA +LINRMP+RIL+ +TPL       P  R  S +PL++FGCT +VH
Sbjct: 692  KVPKYLWGEAILTATYLINRMPTRILNFKTPLQVFTNCNPIFRLSSTLPLKIFGCTTFVH 751

Query: 1165 NFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQG 1224
                N+ K  PRA+ CVFVGY P Q+GYKCF P S+K FVTMDVTF E +P+F  +HLQG
Sbjct: 752  IHDHNRGKLDPRARKCVFVGYAPTQKGYKCFDPISKKLFVTMDVTFFESKPFF-ATHLQG 811

Query: 1225 ESVSEESN----------NTFEFIEPTPS---VVSNIIPHSIVLPTNQVPW-KTYYRRNH 1284
            ES SE+S+          N    +EP+ S   V  NI    +    + + + KT      
Sbjct: 812  ESTSEDSDLFKIEKTPTPNPNNLLEPSNSNQFVYPNIETSGLDTTKSDMSFEKTAEILGK 871

Query: 1285 KKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM-ISENDRSNVAVLENVEEKDSGD 1344
            K  V +  S   +    S         N     TKN  +    R      E+  +   G 
Sbjct: 872  KNGVLNIESLDGSSSLPSHNQNHSNTNNGNRTSTKNSELMTYSRRKHNSKESNPDPLPGH 931

Query: 1345 EIEVRIETRNNEAEQGH-----------TGKLDEYDSSLDIPIALRKGTRSCTKHPICNY 1404
            E E+R E  ++E    +           +    E    L+IPIA RKG RSCTKHP+ NY
Sbjct: 932  ESELREEPNSSECPGNNQTDSCQPVQFISNSNSESFDDLNIPIATRKGVRSCTKHPMSNY 991

Query: 1405 VSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKG 1464
            +SY +LSP F AFT+ L    IPK++  AL+ PEWK A+ EEM+ALEKN TW++  LPKG
Sbjct: 992  MSYKNLSPSFFAFTSHLSLVEIPKNVQEALQVPEWKKAIFEEMRALEKNHTWEVMGLPKG 1051

Query: 1465 HKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVA 1524
              TVGCKWVF++KY ++G+L+R+KARLVAKGFTQTYGIDY ETF+PVAKLNT+RVLLS+A
Sbjct: 1052 KTTVGCKWVFTVKYNSNGSLERYKARLVAKGFTQTYGIDYLETFAPVAKLNTVRVLLSIA 1111

Query: 1525 VNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFD 1584
             N DWPL QLDVKNAFLNG+L EEVYM PPPGF+  FG  VCKL+KS+YGLKQSPRAWF+
Sbjct: 1112 ANLDWPLQQLDVKNAFLNGNLEEEVYMDPPPGFDEHFGSKVCKLKKSLYGLKQSPRAWFE 1171

Query: 1585 RFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDE 1644
            RFT FVK+QGY Q  SDHT+F K S  GKIA+LIVYVDDI+LTGD   E+ +LK+ +  E
Sbjct: 1172 RFTQFVKNQGYVQAQSDHTMFIKHSNDGKIAILIVYVDDIILTGDHVTEMDRLKKSLALE 1231

Query: 1645 FEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS 1704
            FEIKDLG+L+YFLGMEVARSK GI VSQRKYILDLL ETGM GCRP DTPI+ N KLG++
Sbjct: 1232 FEIKDLGSLRYFLGMEVARSKRGIVVSQRKYILDLLKETGMSGCRPADTPIDPNQKLGDT 1291

Query: 1705 DDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKST 1764
            +D   V+  +YQ+LVGKLIYLSHTRPDI+FAVS+VSQFM +P E H++AV RILRYLKST
Sbjct: 1292 NDGNLVNTTRYQKLVGKLIYLSHTRPDIAFAVSIVSQFMHSPYEVHLEAVYRILRYLKST 1351

Query: 1765 PGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSA 1824
            PGKGL F+K+++KTIEAYTD+DWAGSV DR+STSGYCT++WGNLVTWRSKKQSV ARSSA
Sbjct: 1352 PGKGLFFKKSEQKTIEAYTDADWAGSVTDRRSTSGYCTYIWGNLVTWRSKKQSVXARSSA 1411

Query: 1825 EAEYRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEID 1876
            EAEYRAM+ G+CE +WL+K+L +L +  E P+KL+CDNKAAISIA+NPVQHDRTKHVEID
Sbjct: 1412 EAEYRAMAHGVCEILWLKKILEELKRPLEMPMKLYCDNKAAISIAHNPVQHDRTKHVEID 1471

BLAST of CSPI05G02420 vs. NCBI nr
Match: gi|147810393|emb|CAN59964.1| (hypothetical protein VITISV_022757 [Vitis vinifera])

HSP 1 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 749/1457 (51.41%), Positives = 975/1457 (66.92%), Query Frame = 1

Query: 445  TQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDS 504
            + SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S
Sbjct: 26   SDSSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAAMPETTEPGFRKWKIENS 85

Query: 505  ILRSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMD 564
            ++ S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   
Sbjct: 86   MIMSWLINSMNNDIGENFLLFRTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQS 145

Query: 565  VTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRIL 624
            VT ++N L+  WQ++DL     W+   D   Y  I E  R++ F  GLN + D VRGRI+
Sbjct: 146  VTQYYNTLTRYWQQLDLFETHSWKCSDDAATYRXIVEQXRLFKFFLGLNRELDDVRGRIM 205

Query: 625  GQRPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPV 684
            G +P+PSL E  SE+R EE R   M  S     PT+D++   ARS NSS      +  P 
Sbjct: 206  GIKPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASXLXARSFNSSGGDRQKRDRPW 265

Query: 685  CEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVS---ESAEPPQQSDPHKNQT 744
            C++CKK  H KE CWKLHG+    K +P  D+   GRA+V+   ES   P+ S  +K Q 
Sbjct: 266  CDYCKKXGHYKEACWKLHGKXADWKPKPRXDRD--GRAHVAANXESTSVPEPSPFNKEQM 325

Query: 745  DLSLATLGAIVQSGIPHSFGLVSI-DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETI 804
            ++ L  L + V SG      L +   G  PWI+D+GA+DH+TG +    +Y P  G+ ++
Sbjct: 326  EM-LQKLLSQVGSGSTTGIALTANRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSSV 385

Query: 805  RIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 864
             IADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  F
Sbjct: 386  HIADGSKSKIXGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVF 445

Query: 865  QDLSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCM 924
            QDL SG+MIG+A    GLYLL      +  + +S +   S+L S+ + S      + + +
Sbjct: 446  QDLKSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEII 505

Query: 925  LWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVH 984
            + H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVH
Sbjct: 506  MLHYRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVH 565

Query: 985  SDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIA 1044
            SDVWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI 
Sbjct: 566  SDVWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQ 625

Query: 1045 ILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSL 1104
            +L+SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++
Sbjct: 626  VLKSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNV 685

Query: 1105 PSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHN 1164
            P+Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++PL+VFGCTA+VH 
Sbjct: 686  PNYFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLPLKVFGCTAFVHV 745

Query: 1165 FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGE 1224
            +  N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGE
Sbjct: 746  YPQNRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGE 805

Query: 1225 SVSEESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPP 1284
            S++E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P
Sbjct: 806  SMNE--HQVWESLLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSP 865

Query: 1285 APVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA 1344
             P+Q   P          +   +N+     R     LE+  +   G  I+          
Sbjct: 866  MPIQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENI 925

Query: 1345 EQGHTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTII 1404
             +   G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +
Sbjct: 926  GEDRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQV 985

Query: 1405 PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDR 1464
            P  I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R
Sbjct: 986  PNTIQEAXKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVER 1045

Query: 1465 HKARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLV 1524
             KARLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL 
Sbjct: 1046 FKARLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLE 1105

Query: 1525 EEVYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLF 1584
            EEVYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF
Sbjct: 1106 EEVYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLF 1165

Query: 1585 TKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSK 1644
             K S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS+
Sbjct: 1166 VKKSHAGKMAILIVYVDDIILSGNDMEELQXLKKYLSEEFEVKDLGNLKYFLGMEVARSR 1225

Query: 1645 EGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYL 1704
            +GI VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYL
Sbjct: 1226 KGIVVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYL 1285

Query: 1705 SHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDS 1764
            SHTRPDI FAVS VSQFM +P EEHM+AV RI RYLK TPGKGL FRKT+ +  E Y+D+
Sbjct: 1286 SHTRPDIGFAVSXVSQFMHSPTEEHMEAVYRIXRYLKMTPGKGLFFRKTENRDXEVYSDA 1345

Query: 1765 DWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL 1824
            DWAG+++DR+STSGYC+FVWGNLVT RSKKQSVVARSSAEAEYRA++ GICE IW+++VL
Sbjct: 1346 DWAGNIIDRRSTSGYCSFVWGNLVTXRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVL 1405

Query: 1825 TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSS 1876
            ++L Q   +P+ + CDN+AAISIA NPV HD TKHVEIDRHFI EK+ S ++ + Y+P+ 
Sbjct: 1406 SELGQTSSSPILMMCDNQAAISIAKNPVHHDXTKHVEIDRHFITEKVTSETVKLNYVPTK 1465

BLAST of CSPI05G02420 vs. NCBI nr
Match: gi|147778986|emb|CAN62538.1| (hypothetical protein VITISV_031159 [Vitis vinifera])

HSP 1 Score: 1440.2 bits (3727), Expect = 0.0e+00
Identity = 746/1455 (51.27%), Positives = 973/1455 (66.87%), Query Frame = 1

Query: 447  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 506
            SS   ++G KLNG+NY  WSQSV + + G+ K  ++TGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYJTGEAXMPETTEPXFRKWKIENSMI 87

Query: 507  RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 566
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENISELFQVESALHDFRQGEQSVT 147

Query: 567  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 626
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 627  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 686
            +P+PSL EV SE+R EE R   M  S     PT+D +A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREVFSEVRREESRKKVMMGSKEQPAPTLDGSALAARSFNSSGGDRQKRDRPWCD 267

Query: 687  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 746
            + KK  H KE CWKLHG+P   K +P +D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YYKKPGHYKEACWKLHGKPADWKPKPRSDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 747  SLATLGAIVQSGIPHSFGLV-SIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRI 806
             L  L + V SG      L  S  G  PWI+D+GA+DH+TG +    +Y P  G+  + I
Sbjct: 328  -LQKLLSQVGSGSTTGIALTASRGGMKPWIVDTGASDHMTGDAAILQNYKPSNGHSFVHI 387

Query: 807  ADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQD 866
            ADGS + I G G I     L L +VLHVP L  NLLSISK+  +L C   F P+S  FQD
Sbjct: 388  ADGSKSKIVGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLARDLQCVTKFYPNSCVFQD 447

Query: 867  LSSGRMIGTARHSRGLYLL------DDDTSSSSIPRTSLLSSYFTTS------EQDCMLW 926
            L SG+MIG+A+    LYLL      +  + +S +   S+L S+ + S      + + ++ 
Sbjct: 448  LKSGKMIGSAKLCSELYLLSCGQFSNQVSQASCVQSQSMLESFNSVSNSKVNKDSEIIML 507

Query: 927  HFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSD 986
            H+RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSD
Sbjct: 508  HYRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSD 567

Query: 987  VWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAIL 1046
            VWGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L
Sbjct: 568  VWGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVL 627

Query: 1047 RSDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPS 1106
            +SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+
Sbjct: 628  KSDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPN 687

Query: 1107 YLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFG 1166
            Y WG+AILTA +LINRMPSR+L  Q+P     + +P TR  S ++ L+VFGCTA+VH + 
Sbjct: 688  YFWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTRAASSDLSLKVFGCTAFVHVYP 747

Query: 1167 PNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESV 1226
             N++KF PRA  C+F+GY P+Q+GYKC+ P +++++ TMDV+F E   ++P  H+QGES+
Sbjct: 748  QNRSKFAPRANKCIFLGYSPNQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKFHVQGESM 807

Query: 1227 SEESNNTFEF-IEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAP 1286
            +E  +  +E  +E  PS  S     S   PT    P  +  +      V SP T Q P P
Sbjct: 808  NE--HQVWESRLEGVPSFHSESPNPSQFAPTELSTPMPSSVQPAQHTNVPSPVTIQSPMP 867

Query: 1287 VQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQ 1346
            +Q   P          +   +N+     R     LE+  +   G  I+           +
Sbjct: 868  IQPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGE 927

Query: 1347 GHTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPK 1406
               G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P 
Sbjct: 928  DRAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPN 987

Query: 1407 DIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHK 1466
             I  ALK  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYKADG+++R K
Sbjct: 988  TIQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKADGSVERFK 1047

Query: 1467 ARLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEE 1526
            ARLVA+GFTQ YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EE
Sbjct: 1048 ARLVARGFTQXYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEE 1107

Query: 1527 VYMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTK 1586
            VYM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K
Sbjct: 1108 VYMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVK 1167

Query: 1587 VSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG 1646
             S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++G
Sbjct: 1168 KSHAGKMAILIVYVDDIILSGNDMEELQNLKKYLSEEFEVKDLGNLKYFLGMEVARSRKG 1227

Query: 1647 ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSH 1706
            I VSQ KYILDLL ETGMLGC+P DTP++   KLG   +  P D+ +YQRLVG+LIYLSH
Sbjct: 1228 IVVSQTKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPXDRGRYQRLVGRLIYLSH 1287

Query: 1707 TRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDW 1766
            TRPDI FAVS VSQFM +P EEHM+AV RILRYLK TP KG+ FRKT+ +  E Y+D+DW
Sbjct: 1288 TRPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPXKGIFFRKTENRDTEVYSDADW 1347

Query: 1767 AGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTD 1826
            AG+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEY A++ GICE  W+++VL++
Sbjct: 1348 AGNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYXALAQGICEGXWIKRVLSE 1407

Query: 1827 LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQ 1876
            L Q   +P+ + CDN+A ISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q
Sbjct: 1408 LGQTSSSPILMMCDNQAXISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQ 1466

BLAST of CSPI05G02420 vs. NCBI nr
Match: gi|147769406|emb|CAN70229.1| (hypothetical protein VITISV_024789 [Vitis vinifera])

HSP 1 Score: 1433.3 bits (3709), Expect = 0.0e+00
Identity = 739/1454 (50.83%), Positives = 964/1454 (66.30%), Query Frame = 1

Query: 447  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 506
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 507  RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 566
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 567  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 626
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIMEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 627  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 686
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 687  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 746
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 747  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 806
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTTNRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 807  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 866
            DGS         I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGS---------IKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 867  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 926
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELRSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 927  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 986
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R+ +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPRLFINKNPASYHCEICQFAKHTRIVYPQIPYKPSTVFSLVHSDV 567

Query: 987  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 1046
            WGPS+I   SG RWFVTF+DDHT +TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTWVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 1047 SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 1106
            SDN +E+   +LS +L +  I+H +SC  TPQQN VAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHDIIHISSCVDTPQQNRVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 1107 LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1166
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA++H +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFIHVYPQ 747

Query: 1167 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1226
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1227 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1286
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1287 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 1346
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYLRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1347 HTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 1406
              G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1407 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1466
            I  A K  EWK AV +E+ ALEKN TW I  LP G + VGCKW+F++KYK DG+++R KA
Sbjct: 988  IQEASKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPVGCKWIFTIKYKTDGSVERFKA 1047

Query: 1467 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1526
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1527 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1586
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG +DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQADHTLFVKK 1167

Query: 1587 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1646
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYF+GMEVA+S++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLSEEFEVKDLGNLKYFJGMEVAKSRKGI 1227

Query: 1647 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1706
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSQKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1707 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1766
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1767 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1826
            G+++DR STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW++ VL++L
Sbjct: 1348 GNIIDRWSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKXVLSEL 1407

Query: 1827 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1876
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQXSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1457

BLAST of CSPI05G02420 vs. NCBI nr
Match: gi|147860087|emb|CAN82928.1| (hypothetical protein VITISV_025045 [Vitis vinifera])

HSP 1 Score: 1417.5 bits (3668), Expect = 0.0e+00
Identity = 740/1454 (50.89%), Positives = 959/1454 (65.96%), Query Frame = 1

Query: 447  SSMYHLSGEKLNGNNYFSWSQSVKMVLEGQQKFSFLTGEIPRPLPGDPHERYWKAEDSIL 506
            SS   ++G KLNG+NY  WSQSV + + G+ K  +LTGE   P   +P  R WK E+S++
Sbjct: 28   SSPILITGHKLNGHNYLQWSQSVLLFICGKGKDEYLTGEAVMPETTEPGFRKWKIENSMI 87

Query: 507  RSILINSMEPQIGKPLLFATTAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 566
             S LINSM   IG+  L   TAKDIWD A+  YS  +N S L+ +   +H+ +QG   VT
Sbjct: 88   MSWLINSMNNDIGENFLLFGTAKDIWDAAKETYSSSENTSELFQVESALHDFRQGEQSVT 147

Query: 567  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 626
             ++N L+  WQ++DL     W+   D   Y +I E  R++ F  GLN + D VRGRI+G 
Sbjct: 148  QYYNTLTRYWQQLDLFETHSWKCSDDAATYRQIVEQKRLFKFFLGLNRELDDVRGRIMGI 207

Query: 627  RPIPSLMEVCSEIRLEEDRTSAMNISA---TPTIDSAAFSARSSNSSSDKHNGKPIPVCE 686
            +P+PSL E  SE+R EE R   M  S     PT+D++A +ARS NSS      +  P C+
Sbjct: 208  KPLPSLREAFSEVRREESRKKVMMGSKEQPAPTLDASALAARSFNSSGGDRQKRDRPWCD 267

Query: 687  HCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV---SESAEPPQQSDPHKNQTDL 746
            +CKK  H KE CWKLHG+P   K +P  D+   GRA+V   SES   P+ S  +K Q ++
Sbjct: 268  YCKKPGHYKETCWKLHGKPADWKPKPRFDRD--GRAHVAANSESTSVPEPSPFNKEQMEM 327

Query: 747  SLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIA 806
                L  +            +  G  PWI+D+GA+DH+TG +    +Y P  G+ ++ IA
Sbjct: 328  LQKLLSQVGSGSTTGVAFTANRGGMRPWIVDTGASDHMTGDAAILQNYKPSNGHSSVHIA 387

Query: 807  DGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDL 866
            DGS + IAG G I     L L +VLHVP L  NLLSISK+ H+L C   F P+   FQDL
Sbjct: 388  DGSKSKIAGTGSIKLTKDLYLDSVLHVPNLDCNLLSISKLAHDLQCVTKFYPNLCVFQDL 447

Query: 867  SSGRMIGTARHSRGLYLLD-----DDTSSSSIPRTSLLSSYFTT-------SEQDCMLWH 926
             SG+MIG+A    GLYLL      +  S +S  ++  +S  F +        + + ++ H
Sbjct: 448  KSGKMIGSAELCSGLYLLSCGQFSNQVSQASCVQSQSMSESFNSVSNSKVNKDSEIIMLH 507

Query: 927  FRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDV 986
            +RLGHP+F Y+  LFP LF      +  C++C  AK  R  +P  PYKP+  F+LVHSDV
Sbjct: 508  YRLGHPSFVYLAKLFPKLFINKNPASYHCEICQFAKHTRTVYPQIPYKPSTVFSLVHSDV 567

Query: 987  WGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILR 1046
            WGPS+I   SG RWFVTF+DDHTR+TWV+L+ +KSEV  +FQ F   ++ QF+ KI +L+
Sbjct: 568  WGPSRIKNISGTRWFVTFVDDHTRVTWVFLMKEKSEVGHIFQTFNLMVQNQFNSKIQVLK 627

Query: 1047 SDNGREFQNHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSY 1106
            SDN +E+   +LS +L + GI+H +SC  TPQQNGVAERKNRHLLEVAR LM S+++P+Y
Sbjct: 628  SDNAKEYFTSSLSTYLQNHGIIHISSCVDTPQQNGVAERKNRHLLEVARCLMFSSNVPNY 687

Query: 1107 LWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVS-EVPLRVFGCTAYVHNFGP 1166
             WG+AILTA +LINRMPSR+L  Q+P     + +P T   S ++PL+VFGCTA+VH +  
Sbjct: 688  FWGEAILTATYLINRMPSRVLTFQSPRQLFLKQFPHTHAASSDLPLKVFGCTAFVHVYPQ 747

Query: 1167 NQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVS 1226
            N++KF PRA  C+F+GY P Q+GYKC+ P +++++ TMDV+F E   ++P SH+QGES++
Sbjct: 748  NRSKFAPRANKCIFLGYSPTQKGYKCYSPTNKRFYTTMDVSFFEHVFFYPKSHVQGESMN 807

Query: 1227 EESNNTFE-FIEPTPSVVSNIIPHSIVLPTN-QVPWKTYYRRNHKKEVGSP-TSQPPAPV 1286
            E  +  +E F+E  PS  S     S   PT    P     +      V SP T Q P P+
Sbjct: 808  E--HQVWESFLEGVPSFHSESPNPSQFAPTELSTPMPPSVQPAQHTNVPSPVTIQSPMPI 867

Query: 1287 QDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQG 1346
            Q   P          +   +N+     R     LE+  +   G  I+           + 
Sbjct: 868  QPIAP----------QLANENLQVYIRRRKRQELEHGSQSTCGQYIDSNSSLPEENIGED 927

Query: 1347 HTGK--LDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKD 1406
              G+  +   D S  +PIALRKG R CT HPI NYV+Y+ LSP +RAF  SLD T +P  
Sbjct: 928  RAGEVLIPSIDDST-LPIALRKGVRRCTDHPIGNYVTYEGLSPSYRAFATSLDDTQVPNT 987

Query: 1407 IYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKA 1466
            I  ALK  EWK AV +E+ ALEKN TW I  LP G + +            D + D  KA
Sbjct: 988  IQEALKISEWKKAVQDEIDALEKNGTWTITDLPVGKRPM------------DQSKD-FKA 1047

Query: 1467 RLVAKGFTQTYGIDYSETFSPVAKLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEV 1526
            RLVA+GFTQ+YGIDY ETF+PVAKLNTIR+LLS+AVN+DW L QLD+KNAFLNGDL EEV
Sbjct: 1048 RLVARGFTQSYGIDYQETFAPVAKLNTIRILLSLAVNQDWCLQQLDIKNAFLNGDLEEEV 1107

Query: 1527 YMSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKV 1586
            YM  PPGFE    ++ VCKLQKS+YGLKQSPRAWFDRFT  V   GY+QG  DHTLF K 
Sbjct: 1108 YMEIPPGFEESMAKNQVCKLQKSLYGLKQSPRAWFDRFTKAVLKLGYKQGQXDHTLFVKK 1167

Query: 1587 SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI 1646
            S  GK+A+LIVYVDDI+L+G+D  E+  LK+ + +EFE+KDLGNLKYFLGMEVARS++GI
Sbjct: 1168 SHAGKLAILIVYVDDIILSGNDMGELQNLKKYLLEEFEVKDLGNLKYFLGMEVARSRKGI 1227

Query: 1647 SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHT 1706
             VSQRKYILDLL ETGMLGC+P DTP++   KLG   +  PVD+ +YQRLVG+LIYLSHT
Sbjct: 1228 VVSQRKYILDLLKETGMLGCKPIDTPMDSKKKLGIEKESTPVDRGRYQRLVGRLIYLSHT 1287

Query: 1707 RPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWA 1766
            RPDI FAVS VSQFM +P EEHM+AV RILRYLK TPGKGL FRKT+ +  E Y+D+DWA
Sbjct: 1288 RPDIGFAVSAVSQFMHSPTEEHMEAVYRILRYLKMTPGKGLFFRKTENRDTEVYSDADWA 1347

Query: 1767 GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL 1826
            G+++DR+STSGYC+FVWGNLVTWRSKKQSVVARSSAEAEYRA++ GICE IW+++VL++L
Sbjct: 1348 GNIIDRRSTSGYCSFVWGNLVTWRSKKQSVVARSSAEAEYRALAQGICEGIWIKRVLSEL 1407

Query: 1827 HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQV 1876
             Q   +P+ + CDN+AAISIA NPV HDRTKHVEIDRHFI EK+ S ++ + Y+P+  Q 
Sbjct: 1408 GQTSSSPILMMCDNQAAISIAKNPVHHDRTKHVEIDRHFITEKVTSETVKLNYVPTKHQT 1453

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
COPIA_DROME  2.6e-150  31.90     Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
SYPC_ARATH   2.1e-107  72.76     Proline--tRNA ligase, cytoplasmic OS=Arabidopsis thaliana GN=At3g62120 PE=2 SV=1
POLX_TOBAC   3.0e-106  41.39     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
PRS1_SCHPO   1.6e-62   47.92     Putative proline--tRNA ligase C19C7.06 OS=Schizosaccharomyces pombe (strain 972 ...
SYEP_DROME   3.5e-62   51.69     Bifunctional glutamate/proline--tRNA ligase OS=Drosophila melanogaster GN=Aats-g...
Match Name    E-value  Identity  Description
A5AYJ3_VITVI  0.0e+00  53.65     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_041073 PE=4 SV=1
A5B7Z8_VITVI  0.0e+00  51.41     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_022757 PE=4 SV=1
A5AJR0_VITVI  0.0e+00  51.27     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_031159 PE=4 SV=1
A5BJ12_VITVI  0.0e+00  50.83     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_024789 PE=4 SV=1
A5B7A7_VITVI  0.0e+00  50.89     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025045 PE=4 SV=1
Match Name   E-value   Identity  Description
AT4G23160.1  1.6e-134  45.82     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
AT3G62120.1  1.2e-108  72.76     Class II aaRS and biotin synthetases superfamily protein
AT5G10880.1  6.9e-53   61.35     tRNA synthetase-related / tRNA ligase-related
AT5G52520.1  3.8e-51   41.35     Class II aaRS and biotin synthetases superfamily protein
ATMG00810.1  9.7e-47   40.63     ATMG00810.1 DNA/RNA polymerases superfamily protein
Match Name                    E-value  Identity  Description
gi|147819777|emb|CAN76196.1|  0.0e+00  53.65     hypothetical protein VITISV_041073 [Vitis vinifera]
gi|147810393|emb|CAN59964.1|  0.0e+00  51.41     hypothetical protein VITISV_022757 [Vitis vinifera]
gi|147778986|emb|CAN62538.1|  0.0e+00  51.27     hypothetical protein VITISV_031159 [Vitis vinifera]
gi|147769406|emb|CAN70229.1|  0.0e+00  50.83     hypothetical protein VITISV_024789 [Vitis vinifera]
gi|147860087|emb|CAN82928.1|  0.0e+00  50.89     hypothetical protein VITISV_025045 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001584   Integrase_cat-core
IPR004154   Anticodon-bd
IPR012337   RNaseH-like_sf
IPR013103   RVT_2
IPR025724   GAG-pre-integrase_dom
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0090304 nucleic acid metabolic process
biological_process GO:0006525 arginine metabolic process
biological_process GO:0006560 proline metabolic process
biological_process GO:0006433 prolyl-tRNA aminoacylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0005737 cytoplasm
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0004827 proline-tRNA ligase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI05G02420.1  CSPI05G02420.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term        Source Description                    Alignment
IPR001584  Integrase, catalytic core                            PFAM     PF00665            rve                                   coord: 958..1073, score: 8.7
IPR001584  Integrase, catalytic core                            PROFILE  PS50994            INTEGRASE                             coord: 956..1122, score: 22
IPR004154  Anticodon-binding                                    GENE3D   G3DSA:3.40.50.800                                        coord: 83..203, score: 1.0
IPR004154  Anticodon-binding                                    PFAM     PF03129            HGTP_anticodon                        coord: 89..185, score: 2.1
IPR004154  Anticodon-binding                                    unknown  SSF52954           Class II aaRS ABD-related             coord: 77..203, score: 1.8
IPR012337  Ribonuclease H-like domain                           GENE3D   G3DSA:3.30.420.10                                        coord: 952..1114, score: 1.3
IPR012337  Ribonuclease H-like domain                           unknown  SSF53098           Ribonuclease H-like                   coord: 955..1116, score: 3.41
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727            RVT_2                                 coord: 1406..1649, score: 1.3
IPR025724  GAG-pre-integrase domain                             PFAM     PF13976            gag_pre-integrs                       coord: 874..945, score: 1.2
None       No IPR available                                     unknown  Coil               Coil                                  coord: 1960..1961, score: -; coord: 230..250, score:
None       No IPR available                                     GENE3D   G3DSA:3.30.930.10                                        coord: 2..82, score: 1.6
None       No IPR available                                     PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON       coord: 1132..1235, score: 0.0; coord: 1341..1808, score: 0.0; coord: 456..731, score: 0.0; coord: 904..1115, score: 0.0; coord: 1255..1281, score: 0.0; coord: 310..315, score: 0.0; coord: 762..878, score:
None       No IPR available                                     PFAM     PF14223            Retrotran_gag_2                       coord: 499..646, score: 6.0
None       No IPR available                                     unknown  SSF55681           Class II aaRS and biotin synthetases  coord: 2..95, score: 1.91
None       No IPR available                                     unknown  SSF56672           DNA/RNA polymerases                   coord: 1658..1836, score: 6.89E-42; coord: 1405..1628, score: 6.89
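
The Alignment column above lists each domain hit as a "coord: START..END" range. As a minimal sketch for reading those ranges programmatically, the Python below extracts them and computes each span's length. It assumes the coordinates are 1-based, inclusive protein positions, which this page does not state; the names parse_coords and span_length are hypothetical helpers, not part of any database tool, and the sample rows are copied from the table above.

```python
import re

# Matches alignment entries such as "coord: 958..1073".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return a list of (start, end) tuples for every 'coord: M..N' in text."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Sample rows copied from the InterPro table (source term + description).
rows = {
    "PF00665 rve": "coord: 958..1073",
    "PF07727 RVT_2": "coord: 1406..1649",
    "PF03129 HGTP_anticodon": "coord: 89..185",
}

for name, alignment in rows.items():
    for start, end in parse_coords(alignment):
        print(name, start, end, span_length(start, end))
```

Because the pattern only looks for the "coord:" label, it also picks up several ranges from one cell when a source reports multiple hits, such as the PANTHER row.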