CSPI02G21510 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI02G21510
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Beta-galactosidase
Location: Chr2: 18841431 .. 18849530 (+)
RNA-Seq Expression: CSPI02G21510
Synteny: CSPI02G21510
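As a rough illustration of how the Location field can be read programmatically, the sketch below splits the coordinate string into chromosome, start, end, and strand, and derives the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the usual GFF convention), which this record does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'ChrN: start .. end (strand)' string into its parts."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr2: 18841431 .. 18849530 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval given for this gene spans 8100 bp on the plus strand of Chr2.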
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTTGTGATGACGTTCTGAACCATTGTTTTCTTTCTCATTTCTTATATTTCAAAGAGATGCCTCAACCAGATATCTGATGTAAGTAATGACGGAGCGACTTTCGCTGTCCGCCGAACAACTCTCGCCCATGGGATTGCCATTTCCGTCGTCGCTGAATCTAATTTCTCTCCCCCTTCCGACGTATCGTTCTCTCGTTTTCCCCCTCTTCGATGCAACGAAATTCATTTCTGTTTTCTCAATGCCTTCGACGAATGAAAAATGGTCTCATATCCACCGACCAAGACCAACGGTATCATCTTCACTAAGAAATTTCAGAATCAGGAACAGAAATATCAAAGCTCGATTCAGCTCTCGGAATGATTCGAGTTCCACACCACATGCCTCCAATGGCGACTCGAATTCTCCCGCTAAAACGAAACTTATTGTAGTAAGCTCTCTAATTGCCGTGTCACTGGCAATCGCGAATCGTGTGTTGTACAAGCTTGCTCTCGTTCCATTGAAGGAATATCCATTCTTCTTAGCTCAATTGACTACGTTCGGGCAAGTTATTTATGCTCTGCCTTGACGATTTGTTGTTTACTTTTTTACTTTGTTGATTGTTTTGTTCTGATGTATTTACTGCTTTTAAGTTGTGTGTTTATGCATCGTACTGAAGAACATTTGGTACTTTTGTCTTATAATACGTGGGAGCAGATATGTTATGGCATATTTTAGTATATTATATCTGCGGCGTCGGGCAAACATTGTTACTGAAGAGATGCTATCTCTACCAAAATCGCGATTCATGGCCATAGGTTTTCTTGAAGCGCTTGGAATTGCGACTGGGATGGCTGCGGCAGGTAACAAAATTATCAAATTATACACTTTTTGATCTATTTTGATGTCGTTTTGCGTCCTTATGTGGTTACGTACGATAATTGTAACTGTTTGTAATTCAATTCCTGGGATGTCACTTGCTTTTTTAACACTTCAATCAGAGTATGATAGGAACAATTGTTTCTTGTGATATGATGGATGGGATGTAAAGAGTTGTAAGAATTAAGCTGGCTTGGAGGAGGAAAAGAATGGGACCAATTTACGATTTCAATTTGCGTTTAGAACTCGTACTGAATGCTATGTATCATTTACAGCTTCGCTTCCTGGTCCAGCAATTCCTATACTCAGTCAGGTATCTCGTCTTACCATTCTTTTTGCCAAAGTTCGTAACTGGAAAAAGTAAAAACCTATTTCCATTGTACTAGCTGGATTAAATATTTGGTCTCTCTGTGTCTTTGTTGTAGTTTACGATATGTGCTTACCAACTTGAAGTAATGAAGTTAACTATAATTATAACCCTGTTATCAAGCTTATGAATTTCACCTCCATTGGCTCTCTCATTTTCTCAATATTTCCAAAGATTTAAATCTGGGTATGTGCAACTACATTATTTATTTATTTAAATTATCATTATTTTCTAAAAGGGAACTTTTTATTGAAATATGAGACAATTTCGACCATCTGGATGGTTGATCACTCTTCAATCTGATAAAGACCTTAACAGCATCTGAAACTTGGAAATTTCTTCCTGTTTCAACAGTTTACAGAAAGTAACAGCCACGATAAAGTGTTTTCATCTTAATGATCTAAAACTGAACCACTGGGAAGAAGACCATACGGAAACAAATCTGGAATACTGGAATTCAGAGGCAAGTTACCAAGCCACGGATCAATCCAAAAAGAATTGTGCTAACAAATTTGATGATATCCCCTTAATGACCTTTTTTCTTTAAGTAGTGATAATCCATTTTATGATCTCTGTCTTTCCAATACACATGAAAGGACATTCATGCTGAACTCATATGTTTCCGGTTTCATTTTCAAATCTCAGATTGGCGCCACTTGTAATAATATAACTTTTTTGCAGACTTTTTTGGTTTGGCAGCTCGTATTTTCTGCCATTTTGCTAGGTAGAAAGTACTCATGGAATCAAATTGCGGGCTGTGTTATTGTAACTGCTGGTG
TAGTAGTTGCTGTTGGAAGGTATCATTTTGTTGGACCTTTTGTATCGGCTATATTCATTTTTCTACATTTGCAATTTTTTTTTCTATATCTACAACCGATTTTCTATATTTTCTACCAACTTTTTGTAAAAAGACGTATAAGCAAGACAGTTGAAGATGTAAACAAAAACAAAGCTTCCAAATTTTTGTTTTTGACTCCTATTCATCCATGCACACTCTAGGAGGTAAGGGGAACGCATAGGAGGCCACTGGTTTATGGGTCATGAAATTTTTGTTTATACTGGGGGCAGTGGTTTTTTTCCCCTTTCAAACAATAGGGAGAGATTTTATAGACCTCTTGGTCGTTAGTACATATTATATGCAAGTTAAGCTTTGATCACTTTGGTTAATAACTATTATTATTATTTTTAAATTTTTTTTAAATGCTTATGTTTTGAATACAGCATTTTCTTAAGCAAATTTTTAGGATTATCCAGTATCTATATATATGTATGATGATGGTGGATGATGAAGGCGAGTTCATGATTTCCTCCTAGGTGGACAATAACAAAAGAGATCCCTTTATTCGATTCAAGATTTAATGCTTCCTCCAACTCCAGATCAAGTAGATTTAAAACCTGCCTCCATCATTTTTTGAAAGGTAGAAGGGGCCTCAGACTCTCAATGTGAACTATAAGGAACAAGTTCAAGCTTGTTACTCGTGTTGGAATTCTTTCCTTTTTTTCTAGGCTTTTTCCTTGTTAGAATCCTAGAATAATTATGGAAAGATTATGAGAATGTATTCCTTATTTTCCTAAATCTTTTCCTTTTTTATTCCATTGTGTACTCTATTTATTCTCCCTTGTACCTATTGCTTTATTCATTAGAAAATAATAACAACACAAACAATCGTGGTTTTTCTCCCGGTACTCGGGTTTCCACGTAAATTGGTGAATGTTTTCCTTATTTTCCTAAATCTTTTCCTTTTTTATTCCATTGTGTACTCTATTTATTCTCCCTTGTACCTATTGCTTTATTCATTAGAAAATAATAACAACACAAACAATCGTGGTTTTTCTCCTGGTAGTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGTGACAGCCATCAGTTTTAGTGCTGCCGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGTCACCGGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTG
CTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCGTTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAAGCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCTTTCACACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAG
GATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACAATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCGGTTGCTACGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCTCAGGTTGGTTAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAACGTCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCACTGAGGCTGAATACAAAGCTTTGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAG
TCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA

mRNA sequence

GTTGTGATGACGTTCTGAACCATTGTTTTCTTTCTCATTTCTTATATTTCAAAGAGATGCCTCAACCAGATATCTGATGTAAGTAATGACGGAGCGACTTTCGCTGTCCGCCGAACAACTCTCGCCCATGGGATTGCCATTTCCGTCGTCGCTGAATCTAATTTCTCTCCCCCTTCCGACGTATCGTTCTCTCGTTTTCCCCCTCTTCGATGCAACGAAATTCATTTCTGTTTTCTCAATGCCTTCGACGAATGAAAAATGGTCTCATATCCACCGACCAAGACCAACGGTATCATCTTCACTAAGAAATTTCAGAATCAGGAACAGAAATATCAAAGCTCGATTCAGCTCTCGGAATGATTCGAGTTCCACACCACATGCCTCCAATGGCGACTCGAATTCTCCCGCTAAAACGAAACTTATTGTAGTAAGCTCTCTAATTGCCGTGTCACTGGCAATCGCGAATCGTGTGTTGTACAAGCTTGCTCTCGTTCCATTGAAGGAATATCCATTCTTCTTAGCTCAATTGACTACGTTCGGATATGTTATGGCATATTTTAGTATATTATATCTGCGGCGTCGGGCAAACATTGTTACTGAAGAGATGCTATCTCTACCAAAATCGCGATTCATGGCCATAGGTTTTCTTGAAGCGCTTGGAATTGCGACTGGGATGGCTGCGGCAGCTTCGCTTCCTGGTCCAGCAATTCCTATACTCAGTCAGACTTTTTTGGTTTGGCAGCTCGTATTTTCTGCCATTTTGCTAGGTAGAAAGTACTCATGGAATCAAATTGCGGGCTGTGTTATTGTAACTGCTGGTGTAGTAGTTGCTGTTGGAAGGTATCATTTTGTTGGACCTTTTTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGTGACAGCCATCAGTTTTAGTGCTGCCGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGTCACCGGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATG
GTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCGTTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAAGCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCTTTCACACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAG
CCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACAATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCGGTTGCTACGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCTCAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAACGTCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCACTGAGGCTGAATACAAAGCTTTGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA

Coding sequence (CDS)

ATGACGGAGCGACTTTCGCTGTCCGCCGAACAACTCTCGCCCATGGGATTGCCATTTCCGTCGTCGCTGAATCTAATTTCTCTCCCCCTTCCGACGTATCGTTCTCTCGTTTTCCCCCTCTTCGATGCAACGAAATTCATTTCTGTTTTCTCAATGCCTTCGACGAATGAAAAATGGTCTCATATCCACCGACCAAGACCAACGGTATCATCTTCACTAAGAAATTTCAGAATCAGGAACAGAAATATCAAAGCTCGATTCAGCTCTCGGAATGATTCGAGTTCCACACCACATGCCTCCAATGGCGACTCGAATTCTCCCGCTAAAACGAAACTTATTGTAGTAAGCTCTCTAATTGCCGTGTCACTGGCAATCGCGAATCGTGTGTTGTACAAGCTTGCTCTCGTTCCATTGAAGGAATATCCATTCTTCTTAGCTCAATTGACTACGTTCGGATATGTTATGGCATATTTTAGTATATTATATCTGCGGCGTCGGGCAAACATTGTTACTGAAGAGATGCTATCTCTACCAAAATCGCGATTCATGGCCATAGGTTTTCTTGAAGCGCTTGGAATTGCGACTGGGATGGCTGCGGCAGCTTCGCTTCCTGGTCCAGCAATTCCTATACTCAGTCAGACTTTTTTGGTTTGGCAGCTCGTATTTTCTGCCATTTTGCTAGGTAGAAAGTACTCATGGAATCAAATTGCGGGCTGTGTTATTGTAACTGCTGGTGTAGTAGTTGCTGTTGGAAGGTATCATTTTGTTGGACCTTTTTCGGGTTTCCACGTAAATTGGTGTGAACTCGTTGTCTCTCTTTTCAATATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGTGACAGCCATCAGTTTTAGTGCTGCCGTAGCTGCTGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGTCACCGGACCACCACGCGCCTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTGCCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAACATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGGCTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTATTGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAGCAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTACTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGG
GCGTATACTAGGTCAAAGACCGATTCCCTCCCTGATGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGTAGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAAGAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTGCCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCGTTGCTGGAAAGGGGAAGATTTCTCCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTCTTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCAAAACCATAAGCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGCGAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCTTTCACACTTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGAATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGCTTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGGGCTCAGGCATGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCACCATCCAGAAAATACTTTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAA
GTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTGGTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGAAAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACAATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACCAAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGGGACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTATGGTATTGACTATTCAGAAACTTTTTCTCCGGTTGCTACGTTGAATACTATTAGAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCCTTTATATCAGCTGGATGTTAAGAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCTCAGTCTCCCAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAACGTCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCACTGAGGCTGAATACAAAGCTTTGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAGTTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA

Protein sequence

MTERLSLSAEQLSPMGLPFPSSLNLISLPLPTYRSLVFPLFDATKFISVFSMPSTNEKWSHIHRPRPTVSSSLRNFRIRNRNIKARFSSRNDSSSTPHASNGDSNSPAKTKLIVVSSLIAVSLAIANRVLYKLALVPLKEYPFFLAQLTTFGYVMAYFSILYLRRRANIVTEEMLSLPKSRFMAIGFLEALGIATGMAAAASLPGPAIPILSQTFLVWQLVFSAILLGRKYSWNQIAGCVIVTAGVVVAVGRYHFVGPFSGFHVNWCELVVSLFNMVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRLPPLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT*
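The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and not part of the database. It is checked here only against the first and last few codons of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First eight codons and last six codons of the CDS listed above.
print(translate("ATGACGGAGCGACTTTCGCTGTCC"))  # start of the protein
print(translate("ATTTACGTCCCAACTTGA"))        # end of the protein, incl. stop
```

The first call yields `MTERLSLS` and the second `IYVPT*`, matching the beginning and end of the protein sequence listed for this gene.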
Homology
BLAST of CSPI02G21510 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 675.2 bits (1741), Expect = 2.1e-192
Identity = 456/1486 (30.69%), Positives = 723/1486 (48.65%), Query Frame = 0

Query: 502  KLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPG---------DPHERYWKAEDSIL 561
            KL   NY  WS+ V  + +G +   FL G    P            +P    WK +D ++
Sbjct: 25   KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLI 84

Query: 562  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 621
             S ++ ++   +   +  A TA  IW+T + +Y+   +   +  LR Q+ +  +GT  + 
Sbjct: 85   YSAVLGAISMSVQPAVSRATTAAQIWETLRKIYA-NPSYGHVTQLRTQLKQWTKGTKTID 144

Query: 622  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 681
             +   L   + ++ L  + +  D     Q  R+ EN         L  ++  V  +I  +
Sbjct: 145  DYMQGLVTRFDQLALLGKPMDHDE----QVERVLEN---------LPEEYKPVIDQIAAK 204

Query: 682  RPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCK 741
               P+L E+   +   E +  A++ +    I + A S R++ ++++ +NG      ++  
Sbjct: 205  DTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRN 264

Query: 742  KQWHTKE-QCWKLHGRPPGSKKRP-------------SNDKQNTGRAYVS--ESAEPPQQ 801
               ++K  Q    +  P  ++ +P             S  + +  + ++S   S +PP  
Sbjct: 265  NNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSP 324

Query: 802  SDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIP 861
              P + + +L+L         G P+S         N W+LDSGAT H+T    +   + P
Sbjct: 325  FTPWQPRANLAL---------GSPYS--------SNNWLLDSGATHHITSDFNNLSLHQP 384

Query: 862  CAGNETIRIADGSLAPVAGKGKIS---PCAGLSLHNVLHVPKLSYNLLSISKITHELNCK 921
              G + + +ADGS  P++  G  S       L+LHN+L+VP +  NL+S+ ++ +     
Sbjct: 385  YTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVS 444

Query: 922  AIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLW 981
              F P S   +DL++G  +   +    LY    +   +S    SL +S   +S+     W
Sbjct: 445  VEFFPASFQVKDLNTGVPLLQGKTKDELY----EWPIASSQPVSLFAS--PSSKATHSSW 504

Query: 982  HFRLGHPNFQYMKHLFPHLFSKV---EMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLV 1041
            H RLGHP    +  +  +    V       LSC  C+  K ++V F       T+P   +
Sbjct: 505  HARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYI 564

Query: 1042 HSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKI 1101
            +SDVW  S I +    R++V F+D  TR TW+Y +  KS+V   F  F + +E +F  +I
Sbjct: 565  YSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRI 624

Query: 1102 AILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTS 1161
                SDNG EF    L E+ +  GI H  S  +TP+ NG++ERK+RH++E   +L+   S
Sbjct: 625  GTFYSDNGGEFV--ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHAS 684

Query: 1162 LLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHN 1221
            +    W  A   A +LINR+P+ +L L++P   L  + P+        LRVFGC  Y   
Sbjct: 685  IPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYD-----KLRVFGCACYPWL 744

Query: 1222 FGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYF-------- 1281
               NQ K   +++ CVF+GY   Q  Y C H  + + +++  V F E+   F        
Sbjct: 745  RPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLS 804

Query: 1282 PVSHLQGESVSEES-NNTFEFIEPTPSVVSNIIPHSIVLPTN--QVPWKTYYRRNHKKEV 1341
            PV   + ES    S + T     P     S   PH    P +    P++     +   + 
Sbjct: 805  PVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDS 864

Query: 1342 GSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVR 1401
               +S P +P  +   PR  G +  T+P T+     +   N +      E  S     + 
Sbjct: 865  SFSSSFPSSP--EPTAPRQNGPQPTTQP-TQTQTQTHSSQNTSQNNPTNESPSQLAQSLS 924

Query: 1402 IETRNNEAEQGHTGKSDEYDSSLDIP------------IALRKGTRSCTKHPICNYVSYN 1461
               +++ +    T  +    +S   P            I           H +       
Sbjct: 925  TPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAG 984

Query: 1462 SLSPQFR-AFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT 1521
             + P  + +   SL +   P+    ALK   W+NA+  E+ A   N TWD+   P  H T
Sbjct: 985  IIKPNPKYSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVT 1044

Query: 1522 -VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVN 1581
             VGC+W+F+ KY +DG+L+R+KARLVAKG+ Q  G+DY+ETFSPV    +IR++L VAV+
Sbjct: 1045 IVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVD 1104

Query: 1582 KDWPLYQLDVKNAFLNGDLVEEVYMSPPPGF------------------EAQSPRAWFDR 1641
            + WP+ QLDV NAFL G L ++VYMS PPGF                    Q+PRAW+  
Sbjct: 1105 RSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVE 1164

Query: 1642 FTTFVKSQGYRQGHSDHTLFTKVSKTGK-IAVLIVYVDDIVLTGDDQAEISQLKQRMGDE 1701
               ++ + G+    SD +LF  V + GK I  ++VYVDDI++TG+D   +      +   
Sbjct: 1165 LRNYLLTIGFVNSVSDTSLF--VLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQR 1224

Query: 1702 FEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS 1761
            F +KD   L YFLG+E  R   G+ +SQR+YILDLL  T M+  +P  TP+  + KL   
Sbjct: 1225 FSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLY 1284

Query: 1762 DDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKST 1821
                  D  +Y+ +VG L YL+ TRPDIS+AV+ +SQFM  P EEH++A+ RILRYL  T
Sbjct: 1285 SGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGT 1344

Query: 1822 PGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSST 1881
            P  G+  +K +  ++ AY+D+DWAG   D  ST+GY  ++  + ++W SKKQ  V RSST
Sbjct: 1345 PNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSST 1404

Query: 1882 EAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEID 1913
            EAEY++++    E  W+  +LT+L      P  ++CDN  A  +  NPV H R KH+ ID
Sbjct: 1405 EAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGATYLCANPVFHSRMKHIAID 1460
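The percentages in an HSP summary line are ratios of matching columns to the alignment length. A small sketch, with the hypothetical helper `percent`, recomputes the figures reported for the RE1 hit above (456 and 723 columns out of 1486) so they can be compared with the values printed in the record.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# (matching columns, alignment length, percentage printed in the record)
reported = [
    (456, 1486, 30.69),  # identical residues
    (723, 1486, 48.65),  # identical or conservatively substituted residues
]

for matches, length, printed in reported:
    print(matches, length, percent(matches, length), printed)
```

Both recomputed values agree with the printed ones to two decimal places.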

BLAST of CSPI02G21510 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 644.4 bits (1661), Expect = 4.0e-183
Identity = 454/1516 (29.95%), Positives = 713/1516 (47.03%), Query Frame = 0

Query: 502  KLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRP--------LPG-DPHERYWKAEDSIL 561
            KL   NY  WS+ V  + +G +   FL G  P P        +P  +P    W+ +D ++
Sbjct: 25   KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWRRQDKLI 84

Query: 562  RSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVT 621
             S ++ ++   +   +  A TA  IW+T + +Y+   N S  +  +          +   
Sbjct: 85   YSAILGAISMSVQPAVSRATTAAQIWETLRKIYA---NPSYGHVTQ----------LRFI 144

Query: 622  SFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQ 681
            + F++L+L+ + MD                     ++++   L  L   +  V  +I  +
Sbjct: 145  TRFDQLALLGKPMD--------------------HDEQVERVLENLPDDYKPVIDQIAAK 204

Query: 682  RPIPSLMEVCSEIRLEEDRTSAMNISATPTID---------------------------- 741
               PSL E+   +   E +  A+N +    I                             
Sbjct: 205  DTPPSLTEIHERLINRESKLLALNSAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNN 264

Query: 742  --SAAFSARSSNSSSDKHNGKP-IPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQN 801
              S ++   SS S SD    KP +  C+ C  Q H+ ++C +LH              Q+
Sbjct: 265  NRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLH------------QFQS 324

Query: 802  TGRAYVSESAEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGAT 861
            T     S S   P Q  P  N                      + S    N W+LDSGAT
Sbjct: 325  TTNQQQSTSPFTPWQ--PRAN--------------------LAVNSPYNANNWLLDSGAT 384

Query: 862  DHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKIS---PCAGLSLHNVLHVPKLSY 921
             H+T    +   + P  G + + IADGS  P+   G  S       L L+ VL+VP +  
Sbjct: 385  HHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHK 444

Query: 922  NLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSL 981
            NL+S+ ++ +       F P S   +DL++G  +   +    LY    +   +S    S+
Sbjct: 445  NLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELY----EWPIASSQAVSM 504

Query: 982  LSSYFTTSEQDCMLWHFRLGHPNFQYM-----KHLFPHLFSKVEMTTLSCDVCIQAKQHR 1041
             +S    S+     WH RLGHP+   +      H  P L    ++  LSC  C   K H+
Sbjct: 505  FAS--PCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKL--LSCSDCFINKSHK 564

Query: 1042 VSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSS 1101
            V F +     ++P   ++SDVW  S I +    R++V F+D  TR TW+Y +  KS+V  
Sbjct: 565  VPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKD 624

Query: 1102 MFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQNGVAER 1161
             F  F   +E +F  +I  L SDNG EF    L ++L+  GI H  S  +TP+ NG++ER
Sbjct: 625  TFIIFKSLVENRFQTRIGTLYSDNGGEFV--VLRDYLSQHGISHFTSPPHTPEHNGLSER 684

Query: 1162 KNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRH 1221
            K+RH++E+  +L+   S+    W  A   A +LINR+P+ +L LQ+P   L    P+   
Sbjct: 685  KHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYE- 744

Query: 1222 VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDV 1281
                 L+VFGC  Y      N+ K   +++ C F+GY   Q  Y C H P+ + + +  V
Sbjct: 745  ----KLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHV 804

Query: 1282 TFCEDRPYFPVSHLQ-GESVSEESNN-------------TFEFIEPTPSVVSNIIPHSIV 1341
             F  D   FP S    G S S+E  +             T   + P P  +   +  S  
Sbjct: 805  QF--DERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPR 864

Query: 1342 LPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRS 1401
             P++  P  T            P+S   +P   SEP       N  +P  +   ++N  S
Sbjct: 865  PPSSPSPLCT----TQVSSSNLPSSSISSP-SSSEPTAPS--HNGPQPTAQPHQTQNSNS 924

Query: 1402 NVAVLENVEEKDSGDE-------------IEVRIETRNNEAEQGHTGKSDEYDSS----- 1461
            N  +L N                          I T +    + ++  S    +      
Sbjct: 925  NSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPV 984

Query: 1462 LDIPIALRKGTRS-CTKHPICNYVSYNSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWK 1521
            L  P  ++   ++    H +          P Q  ++  SL +   P+    A+K   W+
Sbjct: 985  LPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKDDRWR 1044

Query: 1522 NAVMEEMKALEKNSTWDICTLPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQT 1581
             A+  E+ A   N TWD+   P    T VGC+W+F+ K+ +DG+L+R+KARLVAKG+ Q 
Sbjct: 1045 QAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQR 1104

Query: 1582 YGIDYSETFSPVATLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFE- 1641
             G+DY+ETFSPV    +IR++L VAV++ WP+ QLDV NAFL G L +EVYMS PPGF  
Sbjct: 1105 PGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVD 1164

Query: 1642 -----------------AQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLI 1701
                              Q+PRAW+    T++ + G+    SD +LF  + +   I  ++
Sbjct: 1165 KDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFV-LQRGRSIIYML 1224

Query: 1702 VYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILD 1761
            VYVDDI++TG+D   +      +   F +K+  +L YFLG+E  R  +G+ +SQR+Y LD
Sbjct: 1225 VYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLD 1284

Query: 1762 LLTETGMLGCRPTDTPIEFNCKLG-NSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVS 1821
            LL  T ML  +P  TP+  + KL  +S  ++P D  +Y+ +VG L YL+ TRPD+S+AV+
Sbjct: 1285 LLARTNMLTAKPVATPMATSPKLTLHSGTKLP-DPTEYRGIVGSLQYLAFTRPDLSYAVN 1344

Query: 1822 VVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKST 1881
             +SQ+M  P ++H  A+ R+LRYL  TP  G+  +K +  ++ AY+D+DWAG   D  ST
Sbjct: 1345 RLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVST 1404

Query: 1882 SGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLK 1916
            +GY  ++  + ++W SKKQ  V RSSTEAEY++++    E  W+  +LT+L  +   P  
Sbjct: 1405 NGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPV 1446

BLAST of CSPI02G21510 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 7.1e-164
Identity = 445/1468 (30.31%), Positives = 702/1468 (47.82%), Query Frame = 0

Query: 502  KLNGNNYFS-WSQSVK--MVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILIN 561
            K NG+N FS W + ++  ++ +G  K   +  + P  +  +     W   D    S +  
Sbjct: 10   KFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAED----WADLDERAASAIRL 69

Query: 562  SMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKL 621
             +   +   ++   TA+ IW   ++LY  +   ++LY L+KQ++       + T+F + L
Sbjct: 70   HLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLY-LKKQLYALHMS--EGTNFLSHL 129

Query: 622  SLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSL 681
            ++          L+ +    GV   +IEE D+    L  L   +D +   IL  +    L
Sbjct: 130  NVF-------NGLITQLANLGV---KIEEEDKAILLLNSLPSSYDNLATTILHGKTTIEL 189

Query: 682  MEVCSEIRLEED-RTSAMNISATPTIDSAAFS-ARSSNS--------SSDKHNGKPIPVC 741
             +V S + L E  R    N       +    S  RSSN+         S   +   +  C
Sbjct: 190  KDVTSALLLNEKMRKKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNC 249

Query: 742  EHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSL 801
             +C +  H K  C       P  +K      + +G+     +A   Q +D          
Sbjct: 250  YNCNQPGHFKRDC-------PNPRK---GKGETSGQKNDDNTAAMVQNNDN--------- 309

Query: 802  ATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGN-ETIRIAD 861
              L    +    H  G      ++ W++D+ A+ H T   + F  Y+  AG+  T+++ +
Sbjct: 310  VVLFINEEEECMHLSG-----PESEWVVDTAASHHATPVRDLFCRYV--AGDFGTVKMGN 369

Query: 862  GSLAPVAGKG----KISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSF 921
             S + +AG G    K +    L L +V HVP L  NL+S   +  +   ++ F       
Sbjct: 370  TSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRD-GYESYFANQK--- 429

Query: 922  QDLSSGRMIGTARHSRG-LYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNF 981
              L+ G ++     +RG LY  + +     +             E    LWH R+GH + 
Sbjct: 430  WRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQ--------DEISVDLWHKRMGHMSE 489

Query: 982  QYMKHLF-PHLFSKVEMTTLS-CDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKI 1041
            + ++ L    L S  + TT+  CD C+  KQHRVSF +   +      LV+SDV GP +I
Sbjct: 490  KGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEI 549

Query: 1042 TTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGRE 1101
             +  G ++FVTFIDD +R  WVY++  K +V  +FQ F+  +E +  +K+  LRSDNG E
Sbjct: 550  ESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGE 609

Query: 1102 FQNHKLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLLSHLWGDAI 1161
            + + +  E+ +S GI H+ +   TPQ NGVAER NR ++E  RS++    L    WG+A+
Sbjct: 610  YTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAV 669

Query: 1162 LTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTP 1221
             TA +LINR PS  L  + P     E   + + VS   L+VFGC A+ H     +TK   
Sbjct: 670  QTACYLINRSPSVPLAFEIP-----ERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDD 729

Query: 1222 RAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTF 1281
            ++  C+F+GY   + GY+ + P  +K   + DV F E                 E     
Sbjct: 730  KSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRE----------------SEVRTAA 789

Query: 1282 EFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQG 1341
            +  E    V + IIP+ + +P+                    TS  P   +         
Sbjct: 790  DMSE---KVKNGIIPNFVTIPS--------------------TSNNPTSAES-------- 849

Query: 1342 MENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDS 1401
                    T + +SE       V+E  E+ D G E EV   T+  E  Q           
Sbjct: 850  --------TTDEVSEQGEQPGEVIEQGEQLDEGVE-EVEHPTQGEEQHQ----------- 909

Query: 1402 SLDIPIALRKGTR---SCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPE 1461
                   LR+  R      ++P   YV               +     P+ +   L +PE
Sbjct: 910  ------PLRRSERPRVESRRYPSTEYV--------------LISDDREPESLKEVLSHPE 969

Query: 1462 WKN----AVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAK 1521
             KN    A+ EEM++L+KN T+ +  LPKG + + CKWVF LK   D  L R+KARLV K
Sbjct: 970  -KNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVK 1029

Query: 1522 GFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPP 1581
            GF Q  GID+ E FSPV  + +IR +LS+A + D  + QLDVK AFL+GDL EE+YM  P
Sbjct: 1030 GFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQP 1089

Query: 1582 PGFEA------------------QSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGK 1641
             GFE                   Q+PR W+ +F +F+KSQ Y + +SD  ++ K      
Sbjct: 1090 EGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENN 1149

Query: 1642 IAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEG--ISVS 1701
              +L++YVDD+++ G D+  I++LK  +   F++KDLG  +  LGM++ R +    + +S
Sbjct: 1150 FIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLS 1209

Query: 1702 QRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS------DDQVPVDKEQYQRLVGKLIY- 1761
            Q KYI  +L    M   +P  TP+  + KL         +++  + K  Y   VG L+Y 
Sbjct: 1210 QEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYA 1269

Query: 1762 LSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTD 1821
            +  TRPDI+ AV VVS+F++ P +EH +AV  ILRYL+ T G  L F  +D   ++ YTD
Sbjct: 1270 MVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSD-PILKGYTD 1325

Query: 1822 SDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKV 1881
            +D AG + +RKS++GY     G  ++W+SK Q  VA S+TEAEY A +    E IWL++ 
Sbjct: 1330 ADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRF 1325

Query: 1882 LTD--LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYI 1913
            L +  LHQ+      ++CD+++AI ++ N + H RTKH+++  H+I+E +D  S+ +  I
Sbjct: 1390 LQELGLHQK---EYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKI 1325
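
Note: the percentages in each HSP summary line are the identity and positive counts divided by the alignment length. The sketch below is one possible way to parse such a line and recompute those figures. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration and are not part of any BLAST tool; the pattern accepts either spelling of the positives label.

```python
import re

# Hypothetical pattern for an HSP summary line as printed in this report.
# It tolerates both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and the alignment length from one line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, identity_pct, positives, _, positive_pct = match.groups()
    return {
        "identities": int(identities),
        "positives": int(positives),
        "length": int(length),
        "identity_pct": float(identity_pct),
        "positive_pct": float(positive_pct),
    }


def percent(count, total):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / total, 2)


if __name__ == "__main__":
    line = "Identity = 445/1468 (30.31%), Positives = 702/1468 (47.82%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Recompute the percentages from the raw counts and compare with the report.
    print(percent(stats["identities"], stats["length"]), stats["identity_pct"])
    print(percent(stats["positives"], stats["length"]), stats["positive_pct"])
```

Recomputing the percentages from the raw counts is a cheap consistency check when these summary lines are copied between reports.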

BLAST of CSPI02G21510 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 528.9 bits (1361), Expect = 2.4e-148
Identity = 426/1498 (28.44%), Positives = 684/1498 (45.66%), Query Frame = 0

Query: 504  NGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQ 563
            +G  Y  W   ++ +L  +     + G +P  +     +  WK  +   +S +I  +   
Sbjct: 12   DGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEV-----DDSWKKAERCAKSTIIEYLSDS 71

Query: 564  IGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECK-QGTMDVTSFFNKLSLIW 623
                     TA+ I +    +Y ++  AS+L  LRK++   K    M + S F+      
Sbjct: 72   FLNFATSDITARQILENLDAVYERKSLASQL-ALRKRLLSLKLSSEMSLLSHFHIFD--- 131

Query: 624  QEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFD-----------------VV 683
               +L  EL+          ++IEE D+I   L  L   +D                  V
Sbjct: 132  ---ELISELL-------AAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFV 191

Query: 684  RGRILGQRPIPSLMEVCSEIRLEEDRT-------SAMNISATPTIDSAAFSARSSNSSS- 743
            + R+L Q           EI+++ D         +A+  +   T  +  F  R +     
Sbjct: 192  KNRLLDQ-----------EIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKI 251

Query: 744  DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQS 803
             K N K    C HC ++ H K+ C+  H +   + K   N+KQ                 
Sbjct: 252  FKGNSKYKVKCHHCGREGHIKKDCF--HYKRILNNKNKENEKQ----------------- 311

Query: 804  DPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNP-------WILDSGATDHLTGSSEH 863
                             VQ+   H    +  +  N        ++LDSGA+DHL      
Sbjct: 312  -----------------VQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESL 371

Query: 864  FVSYIPCAGNETIRIA-DGSLAPVAGKG--KISPCAGLSLHNVLHVPKLSYNLLSISKIT 923
            +   +       I +A  G       +G  ++     ++L +VL   + + NL+S+ ++ 
Sbjct: 372  YTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQ 431

Query: 924  HELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLS-SYFTTS 983
                         +S +   SG  I       GL ++ +    +++P  +  + S     
Sbjct: 432  EA----------GMSIEFDKSGVTIS----KNGLMVVKNSGMLNNVPVINFQAYSINAKH 491

Query: 984  EQDCMLWHFRLGHPNFQYM-----KHLF--PHLFSKVEMTTLSCDVCIQAKQHRVSFPSQ 1043
            + +  LWH R GH +   +     K++F    L + +E++   C+ C+  KQ R+ F   
Sbjct: 492  KNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQL 551

Query: 1044 PYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQN 1103
              K    +P  +VHSDV GP    T   K +FV F+D  T     YLI  KS+V SMFQ+
Sbjct: 552  KDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQD 611

Query: 1104 FYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQNGVAERKNRH 1163
            F    E  F+ K+  L  DNGRE+ ++++ +F   KGI +  +  +TPQ NGV+ER  R 
Sbjct: 612  FVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRT 671

Query: 1164 LLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRIL--HLQTPLDCLKESYPSTRHVS 1223
            + E AR+++    L    WG+A+LTA +LINR+PSR L    +TP +      P  +H  
Sbjct: 672  ITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKH-- 731

Query: 1224 EVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYFVTMDVTF 1283
               LRVFG T YVH     Q KF  ++   +FVGY P+  G+K +   + K+ V  DV  
Sbjct: 732  ---LRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEPN--GFKLWDAVNEKFIVARDVVV 791

Query: 1284 CEDRPY------FPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKT 1343
             E          F    L+    SE  N    F   +  ++    P+      N      
Sbjct: 792  DETNMVNSRAVKFETVFLKDSKESENKN----FPNDSRKIIQTEFPNESKECDN-----I 851

Query: 1344 YYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTK-NMISENDRSNVAVLENVE 1403
             + ++ K+            +  +E P      N ++ C     + ++  SN   L   +
Sbjct: 852  QFLKDSKESENKNFPNDSRKIIQTEFP------NESKECDNIQFLKDSKESNKYFLNESK 911

Query: 1404 EKDSGDEI-EVRIETRNNEAEQGHTGKS------DEYDSSLDIPIALRKGTRSCTKHPIC 1463
            ++   D + E +     NE+ +  T +       D    +  I I  R+  R  TK  I 
Sbjct: 912  KRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQIS 971

Query: 1464 NYVSYNSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTL 1523
                 NSL+     A T   D      +I        W+ A+  E+ A + N+TW I   
Sbjct: 972  YNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKR 1031

Query: 1524 PKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLL 1583
            P+    V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY ETF+PVA +++ R +L
Sbjct: 1032 PENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFIL 1091

Query: 1584 SVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEA----------------QSPRAW 1643
            S+ +  +  ++Q+DVK AFLNG L EE+YM  P G                   Q+ R W
Sbjct: 1092 SLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDNVCKLNKAIYGLKQAARCW 1151

Query: 1644 FDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYVDDIVLTGDDQAEISQLKQ 1703
            F+ F   +K   +     D  ++  +   G I     +++YVDD+V+   D   ++  K+
Sbjct: 1152 FEVFEQALKECEFVNSSVDRCIY--ILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKR 1211

Query: 1704 RMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPI--EF 1763
             + ++F + DL  +K+F+G+ +   ++ I +SQ  Y+  +L++  M  C    TP+  + 
Sbjct: 1212 YLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKI 1271

Query: 1764 NCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSVVSQFMQTPNEEHMKAVNR 1823
            N +L NSD+         + L+G L+Y+   TRPD++ AV+++S++    N E  + + R
Sbjct: 1272 NYELLNSDEDC---NTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKR 1331

Query: 1824 ILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWG-NLVTWRS 1883
            +LRYLK T    L+F+K       I  Y DSDWAGS +DRKST+GY   ++  NL+ W +
Sbjct: 1332 VLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNT 1391

Query: 1884 KKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPV 1914
            K+Q+ VA SSTEAEY AL   + E +WL+ +LT ++ + E P+K++ DN+  ISIANNP 
Sbjct: 1392 KRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPS 1401

BLAST of CSPI02G21510 vs. ExPASy Swiss-Prot
Match: A1L4X0 (Protein CLT2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CLT2 PE=2 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 2.6e-49
Identity = 107/180 (59.44%), Positives = 141/180 (78.33%), Query Frame = 0

Query: 78  IRNRNIKARFSSRNDSSS-------TPHASNGDSNSPAKTKLIVVSSLIAVSLAIANRVL 137
           +R  ++++RF S   ++S       +  AS  +S+ P+   LIV +S++ V+LA+ANRVL
Sbjct: 52  LRRSDLRSRFLSTPKTTSPMRRPRFSVGASTEESSIPSNRNLIVANSVVIVALAVANRVL 111

Query: 138 YKLALVPLKEYPFFLAQLTTFGYVMAYFSILYLRRRANIVTEEMLSLPKSRFMAIGFLEA 197
           YKLALVP+K+YPFF+AQLTTFGYV+ YF+ILY RRR  IVT EM+ +PK RF  IGFLEA
Sbjct: 112 YKLALVPMKQYPFFMAQLTTFGYVLIYFTILYTRRRLGIVTNEMMDVPKWRFAIIGFLEA 171

Query: 198 LGIATGMAAAASLPGPAIPILSQTFLVWQLVFSAILLGRKYSWNQIAGCVIVTAGVVVAV 251
           LG+ATGMAAAA LPGP IPIL+QT+LVWQL+F+ ++LGR++  NQIAGC++V  GVVVAV
Sbjct: 172 LGVATGMAAAAMLPGPVIPILNQTYLVWQLLFALLILGRRFLLNQIAGCLLVAVGVVVAV 231

BLAST of CSPI02G21510 vs. ExPASy TrEMBL
Match: A0A5D3CIR0 (Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold227G00930 PE=4 SV=1)

HSP 1 Score: 2615.9 bits (6779), Expect = 0.0e+00
Identity = 1311/1684 (77.85%), Positives = 1435/1684 (85.21%), Query Frame = 0

Query: 260  SGFHVNWCELVVSLFNMVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMS 319
            +GF ++  +L V L    SE+ N  TLE    +T  E      A + +A ++AA+DA ++
Sbjct: 337  AGFKISRSKL-VDLIRWASEQSNNETLENNLGETQIETDPVTAAAAAAAGISAAVDAAVA 396

Query: 320  AAMDELLSRLQKTSENNFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYI 379
            AAM++LL  LQK        +PQ  AP  D    HAP      A   P   PF  +A  +
Sbjct: 397  AAMEKLLQNLQKPPIYPTGVVPQPYAPPFDQKLIHAPLVSGAWAHAPP---PFHVTAHPV 456

Query: 380  APHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRN 439
              +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   
Sbjct: 457  PFYAPSDVQPSNPSGHPHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNG 516

Query: 440  VQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSS 499
            +              R  I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS
Sbjct: 517  IDQPQN---------RSDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SS 576

Query: 500  MYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRS 559
              + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS
Sbjct: 577  TGNFSGEKLNGQNYFSWSQSIKMFLEGRYQFGFLTGEIVRPPPGDALERLWKGEDSLIRS 636

Query: 560  ILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSF 619
            +LINSMEPQIGKPLL+A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++
Sbjct: 637  MLINSMEPQIGKPLLYATTAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTY 696

Query: 620  FNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRP 679
            FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP
Sbjct: 697  FNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRP 756

Query: 680  IPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQ 739
            +PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQ
Sbjct: 757  LPSLMEVCFEVRLEEDRTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQ 816

Query: 740  WHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLG 799
            WHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLG
Sbjct: 817  WHTKDQCWKLHGRPPGGKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLG 876

Query: 800  AIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAP 859
            AI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP
Sbjct: 877  AIAQSGMPQSLGLISVDGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAP 936

Query: 860  VAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMI 919
            +AGKG+I P  G +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR I
Sbjct: 937  IAGKGQIVPFDGFALQNVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTI 996

Query: 920  GTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHL 979
            GTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHL
Sbjct: 997  GTARHSRGLYILDDDTSCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHL 1056

Query: 980  FSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTF 1039
            FSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTF
Sbjct: 1057 FSKVDVSSLSCDVCIRAKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTF 1116

Query: 1040 IDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLAS 1099
            IDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLAS
Sbjct: 1117 IDDHTRLTWVYLISDKSEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLAS 1176

Query: 1100 KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPS 1159
            KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPS
Sbjct: 1177 KGIVHQTSCAYTPQQNGVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPS 1236

Query: 1160 RILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPP 1219
            RILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP 
Sbjct: 1237 RILHLQTPLDCLKESYPSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPL 1296

Query: 1220 HQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSN 1279
            HQ GYKCFHPPSRKYFVTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+
Sbjct: 1297 HQHGYKCFHPPSRKYFVTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSD 1356

Query: 1280 IIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM 1339
            I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N 
Sbjct: 1357 IDPHPIILPTNQVPWKTYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNT 1416

Query: 1340 ISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGT 1399
            +SEND+S++A LEN+EEK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGT
Sbjct: 1417 MSENDKSDIAFLENMEEKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGT 1476

Query: 1400 RSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKN 1459
            RSCTKHPICNYVSY++LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN
Sbjct: 1477 RSCTKHPICNYVSYDNLSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKN 1536

Query: 1460 STWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAT 1519
             TW+IC LPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA 
Sbjct: 1537 RTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAK 1596

Query: 1520 LNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-------------- 1579
            LNT+RVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEA              
Sbjct: 1597 LNTVRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLY 1656

Query: 1580 ---QSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAE 1639
               QSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ E
Sbjct: 1657 GLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTE 1716

Query: 1640 ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDT 1699
            ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DT
Sbjct: 1717 ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADT 1776

Query: 1700 PIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKA 1759
            PIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+A
Sbjct: 1777 PIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEA 1836

Query: 1760 VNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRS 1819
            VNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRS
Sbjct: 1837 VNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRS 1896

Query: 1820 KKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPV 1879
            KKQSVVARSS EAEY+A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPV
Sbjct: 1897 KKQSVVARSSAEAEYRAMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPV 1956

Query: 1880 QHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDI 1920
            QHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDI
Sbjct: 1957 QHDRTKHVEIDRHFIKERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDI 2002

BLAST of CSPI02G21510 vs. ExPASy TrEMBL
Match: A0A5D3DJM7 (Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold605G00420 PE=4 SV=1)

HSP 1 Score: 2609.3 bits (6762), Expect = 0.0e+00
Identity = 1308/1668 (78.42%), Positives = 1427/1668 (85.55%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E +        +AA AAA+DA ++AA+++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIETE------PVAAAAAAAVDAAVAAAVEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  AP  D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYAPPFDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNGIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGEIVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  ATTAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAFLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1645

BLAST of CSPI02G21510 vs. ExPASy TrEMBL
Match: A0A5A7SL21 (Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1204G00470 PE=4 SV=1)

HSP 1 Score: 2598.5 bits (6734), Expect = 0.0e+00
Identity = 1306/1668 (78.30%), Positives = 1422/1668 (85.25%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E +           VAAA     +AA+++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIETE----------PVAAA----AAAAVEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  AP  D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYAPPSDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNGIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGETVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  AATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAVLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1637

BLAST of CSPI02G21510 vs. ExPASy TrEMBL
Match: A0A5A7UGB2 (Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43055G00290 PE=4 SV=1)

HSP 1 Score: 2597.0 bits (6730), Expect = 0.0e+00
Identity = 1306/1668 (78.30%), Positives = 1420/1668 (85.13%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E  TE  A               +AAM++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIE--TEPAA---------------AAAMEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  AP  D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYAPPFDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNGIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGEIVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  ATTAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAVLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1634

BLAST of CSPI02G21510 vs. ExPASy TrEMBL
Match: A0A5A7UNC5 (Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold323G00700 PE=4 SV=1)

HSP 1 Score: 2594.7 bits (6724), Expect = 0.0e+00
Identity = 1305/1668 (78.24%), Positives = 1419/1668 (85.07%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E +  VTA                AAM++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIETE-PVTA----------------AAMEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  A   D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYALPSDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNRIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGETVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  AATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAVLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1634

BLAST of CSPI02G21510 vs. NCBI nr
Match: TYK11240.1 (Beta-galactosidase [Cucumis melo var. makuwa])

HSP 1 Score: 2615.9 bits (6779), Expect = 0.0e+00
Identity = 1311/1684 (77.85%), Positives = 1435/1684 (85.21%), Query Frame = 0

Query: 260  SGFHVNWCELVVSLFNMVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMS 319
            +GF ++  +L V L    SE+ N  TLE    +T  E      A + +A ++AA+DA ++
Sbjct: 337  AGFKISRSKL-VDLIRWASEQSNNETLENNLGETQIETDPVTAAAAAAAGISAAVDAAVA 396

Query: 320  AAMDELLSRLQKTSENNFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYI 379
            AAM++LL  LQK        +PQ  AP  D    HAP      A   P   PF  +A  +
Sbjct: 397  AAMEKLLQNLQKPPIYPTGVVPQPYAPPFDQKLIHAPLVSGAWAHAPP---PFHVTAHPV 456

Query: 380  APHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRN 439
              +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   
Sbjct: 457  PFYAPSDVQPSNPSGHPHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNG 516

Query: 440  VQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSS 499
            +              R  I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS
Sbjct: 517  IDQPQN---------RSDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SS 576

Query: 500  MYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRS 559
              + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS
Sbjct: 577  TGNFSGEKLNGQNYFSWSQSIKMFLEGRYQFGFLTGEIVRPPPGDALERLWKGEDSLIRS 636

Query: 560  ILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSF 619
            +LINSMEPQIGKPLL+A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++
Sbjct: 637  MLINSMEPQIGKPLLYATTAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTY 696

Query: 620  FNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRP 679
            FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP
Sbjct: 697  FNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRP 756

Query: 680  IPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQ 739
            +PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQ
Sbjct: 757  LPSLMEVCFEVRLEEDRTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQ 816

Query: 740  WHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLG 799
            WHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLG
Sbjct: 817  WHTKDQCWKLHGRPPGGKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLG 876

Query: 800  AIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAP 859
            AI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP
Sbjct: 877  AIAQSGMPQSLGLISVDGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAP 936

Query: 860  VAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMI 919
            +AGKG+I P  G +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR I
Sbjct: 937  IAGKGQIVPFDGFALQNVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTI 996

Query: 920  GTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHL 979
            GTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHL
Sbjct: 997  GTARHSRGLYILDDDTSCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHL 1056

Query: 980  FSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTF 1039
            FSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTF
Sbjct: 1057 FSKVDVSSLSCDVCIRAKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTF 1116

Query: 1040 IDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLAS 1099
            IDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLAS
Sbjct: 1117 IDDHTRLTWVYLISDKSEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLAS 1176

Query: 1100 KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPS 1159
            KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPS
Sbjct: 1177 KGIVHQTSCAYTPQQNGVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPS 1236

Query: 1160 RILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPP 1219
            RILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP 
Sbjct: 1237 RILHLQTPLDCLKESYPSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPL 1296

Query: 1220 HQRGYKCFHPPSRKYFVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSN 1279
            HQ GYKCFHPPSRKYFVTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+
Sbjct: 1297 HQHGYKCFHPPSRKYFVTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSD 1356

Query: 1280 IIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNM 1339
            I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N 
Sbjct: 1357 IDPHPIILPTNQVPWKTYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNT 1416

Query: 1340 ISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGT 1399
            +SEND+S++A LEN+EEK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGT
Sbjct: 1417 MSENDKSDIAFLENMEEKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGT 1476

Query: 1400 RSCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKN 1459
            RSCTKHPICNYVSY++LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN
Sbjct: 1477 RSCTKHPICNYVSYDNLSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKN 1536

Query: 1460 STWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAT 1519
             TW+IC LPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA 
Sbjct: 1537 RTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAK 1596

Query: 1520 LNTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-------------- 1579
            LNT+RVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEA              
Sbjct: 1597 LNTVRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLY 1656

Query: 1580 ---QSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAE 1639
               QSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ E
Sbjct: 1657 GLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTE 1716

Query: 1640 ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDT 1699
            ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DT
Sbjct: 1717 ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADT 1776

Query: 1700 PIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKA 1759
            PIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+A
Sbjct: 1777 PIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEA 1836

Query: 1760 VNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRS 1819
            VNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRS
Sbjct: 1837 VNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRS 1896

Query: 1820 KKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPV 1879
            KKQSVVARSS EAEY+A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPV
Sbjct: 1897 KKQSVVARSSAEAEYRAMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPV 1956

Query: 1880 QHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDI 1920
            QHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDI
Sbjct: 1957 QHDRTKHVEIDRHFIKERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDI 2002

BLAST of CSPI02G21510 vs. NCBI nr
Match: TYK23439.1 (Beta-galactosidase [Cucumis melo var. makuwa])

HSP 1 Score: 2609.3 bits (6762), Expect = 0.0e+00
Identity = 1308/1668 (78.42%), Positives = 1427/1668 (85.55%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E +        +AA AAA+DA ++AA+++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIETE------PVAAAAAAAVDAAVAAAVEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  AP  D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYAPPFDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNGIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGEIVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  ATTAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAFLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1645

BLAST of CSPI02G21510 vs. NCBI nr
Match: KAA0025363.1 (Beta-galactosidase [Cucumis melo var. makuwa])

HSP 1 Score: 2598.5 bits (6734), Expect = 0.0e+00
Identity = 1306/1668 (78.30%), Positives = 1422/1668 (85.25%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E +           VAAA     +AA+++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIETE----------PVAAA----AAAAVEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  AP  D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYAPPSDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNGIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGETVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  AATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAVLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1637

BLAST of CSPI02G21510 vs. NCBI nr
Match: KAA0052775.1 (Beta-galactosidase [Cucumis melo var. makuwa])

HSP 1 Score: 2597.0 bits (6730), Expect = 0.0e+00
Identity = 1306/1668 (78.30%), Positives = 1420/1668 (85.13%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E  TE  A               +AAM++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIE--TEPAA---------------AAAMEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  AP  D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYAPPFDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNGIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGEIVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  ATTAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAVLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1634

BLAST of CSPI02G21510 vs. NCBI nr
Match: KAA0056107.1 (Beta-galactosidase [Cucumis melo var. makuwa])

HSP 1 Score: 2594.7 bits (6724), Expect = 0.0e+00
Identity = 1305/1668 (78.24%), Positives = 1419/1668 (85.07%), Query Frame = 0

Query: 276  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSEN 335
            MVSE+ N  TLE    +T  E +  VTA                AAM++LL  LQK    
Sbjct: 1    MVSEQSNNETLENNLGETQIETE-PVTA----------------AAMEKLLQNLQKPPIY 60

Query: 336  NFSSLPQSSAPSPDH---HAPGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVLPSNSNRL 395
                +PQ  A   D    HAP      A   P   PF  +A  +  +AP  V PSN +  
Sbjct: 61   PTGVVPQPYALPSDQKLIHAPLVSGAWAHAPP---PFHVTAHPVPFYAPSDVQPSNPSGH 120

Query: 396  P-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLR 455
            P P  PS   GQ P+  +        Q++   +  +   +S   +              R
Sbjct: 121  PHPHAPSTSSGQHPSTVNLSNQYSKQQLY--VDPLQQPLFSGNRIDQPQN---------R 180

Query: 456  QQIAALEATLGTTST-LPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFS 515
              I A E++  +  T LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFS
Sbjct: 181  SDIEAGESSTHSKPTELPMYSKNPVTSFPNSQSNYITGSLG-SSTGNFSGEKLNGQNYFS 240

Query: 516  WSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLF 575
            WSQS+KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+
Sbjct: 241  WSQSIKMFLEGRYQFGFLTGETVRPPPGDALERLWKGEDSLIRSMLINSMEPQIGKPLLY 300

Query: 576  AATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRE 635
            AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE
Sbjct: 301  AATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRE 360

Query: 636  LVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED 695
             VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEED
Sbjct: 361  TVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGRILGQRPLPSLMEVCFEVRLEED 420

Query: 696  RTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPG 755
            RT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG
Sbjct: 421  RTNAMGVLTTPTIDSAAFSARSSNHDSDKNNGKSIPVCEHCKKQWHTKDQCWKLHGRPPG 480

Query: 756  SKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSI 815
             KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+
Sbjct: 481  GKKRSSNEKQNSGRAYISETTPASTSQSTDPTVSQT--KTPTLGAIAQSGMPQSLGLISV 540

Query: 816  DGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLH 875
            DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L 
Sbjct: 541  DGKNPWILDSGATDHLTGSSEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQ 600

Query: 876  NVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDT 935
            NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDT
Sbjct: 601  NVLHVPKLSYNLLSISKITRELHCKAIFLPESVYFQDMSSGRTIGTARHSRGLYILDDDT 660

Query: 936  SSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQ 995
            S SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+
Sbjct: 661  SCSSLSRVSLLSSYFSTSEQDCMLWHFRLGHPNFTYMQHLFPHLFSKVDVSSLSCDVCIR 720

Query: 996  AKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDK 1055
            AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DK
Sbjct: 721  AKQHRVSFPSQPYKPTQPFNLIHSDVWGPSKVTTSSGKRWFVTFIDDHTRLTWVYLISDK 780

Query: 1056 SEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHKLSEFLASKGIVHQNSCAYTPQQN 1115
            SEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNH LSEFLASKGIVHQ SCAYTPQQN
Sbjct: 781  SEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQNHNLSEFLASKGIVHQTSCAYTPQQN 840

Query: 1116 GVAERKNRHLLEVARSLMLSTSLLSHLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 1175
            GVAERKNRHL+EVARSLMLSTSL S+LWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
Sbjct: 841  GVAERKNRHLVEVARSLMLSTSLPSYLWGDAILTAAHLINRMPSRILHLQTPLDCLKESY 900

Query: 1176 PSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPPHQRGYKCFHPPSRKYF 1235
            PSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYP HQ GYKCFHPPSRKYF
Sbjct: 901  PSTRLVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQACVFVGYPLHQHGYKCFHPPSRKYF 960

Query: 1236 VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWK 1295
            VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWK
Sbjct: 961  VTMDVTFCENRPYFPVSHLQGENVSEESNNTFEFVEPTLITVSDIDPHPIILPTNQVPWK 1020

Query: 1296 TYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVE 1355
            TYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+E
Sbjct: 1021 TYYRRNLRKEVGSPTSQPPAPVQNFEPPRDQGMENPTKPCTNNTMSENDKSDIAVLENME 1080

Query: 1356 EKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYNS 1415
            EK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSY++
Sbjct: 1081 EKNCDDETEVRIETSNDEAEQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDN 1140

Query: 1416 LSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVG 1475
            LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVG
Sbjct: 1141 LSPQFRAFTANLDSTIIPKNIYTALECPEWKNAVMEEMKALEKNRTWEICALPKGHKTVG 1200

Query: 1476 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATLNTIRVLLSVAVNKDW 1535
            CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVA LNT+RVLLSVAVNKDW
Sbjct: 1201 CKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDW 1260

Query: 1536 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA-----------------QSPRAWFDRFTTF 1595
            PLYQLDVKNAFLNGDLVEEVYMSPPPGFEA                 QSPRAWFDRFTTF
Sbjct: 1261 PLYQLDVKNAFLNGDLVEEVYMSPPPGFEAQFGQEVCKLQKSLYGLKQSPRAWFDRFTTF 1320

Query: 1596 VKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD 1655
            VKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKD
Sbjct: 1321 VKSQGYSQGHSDHTLFTKASKTGKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKD 1380

Query: 1656 LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVP 1715
            LGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVP
Sbjct: 1381 LGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVP 1440

Query: 1716 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGL 1775
            VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGL
Sbjct: 1441 VDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQAPYEKHMEAVNRILRYLKNTPGKGL 1500

Query: 1776 MFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYK 1835
            MFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSS EAEY+
Sbjct: 1501 MFRKTNRKTIEAYTDSDWAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYR 1560

Query: 1836 ALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1895
            A+SLGICEEIWLQKVL+DLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK
Sbjct: 1561 AMSLGICEEIWLQKVLSDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIK 1620

Query: 1896 EKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT 1920
            E+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Sbjct: 1621 ERLDSGSICIPYIPSSQQIADVLTKGLLRPHFDLCVSKLGLIDIYLPT 1634

BLAST of CSPI02G21510 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 444.9 bits (1143), Expect = 3.3e-124
Identity = 217/502 (43.23%), Positives = 319/502 (63.55%), Query Frame = 0

Query: 1394 SCTKHPICNYVSYNSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS 1453
            S T H I  ++SY  +SP + +F   +     P     A ++  W  A+ +E+ A+E   
Sbjct: 54   SLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTH 113

Query: 1454 TWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVATL 1513
            TW+ICTLP   K +GCKWV+ +KY +DGT++R+KARLVAKG+TQ  GID+ ETFSPV  L
Sbjct: 114  TWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKL 173

Query: 1514 NTIRVLLSVAVNKDWPLYQLDVKNAFLNGDLVEEVYMSPPPGFEA--------------- 1573
             +++++L+++   ++ L+QLD+ NAFLNGDL EE+YM  PPG+ A               
Sbjct: 174  TSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLK 233

Query: 1574 -------QSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGD 1633
                   Q+ R WF +F+  +   G+ Q HSDHT F K++ T  + VL VYVDDI++  +
Sbjct: 234  KSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL-VYVDDIIICSN 293

Query: 1634 DQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCR 1693
            + A + +LK ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+
Sbjct: 294  NDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 353

Query: 1694 PTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEE 1753
            P+  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + P   
Sbjct: 354  PSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLA 413

Query: 1754 HMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLV 1813
            H +AV +IL Y+K T G+GL +       ++ ++D+ +      R+ST+GYC F+  +L+
Sbjct: 414  HQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLI 473

Query: 1814 TWRSKKQSVVARSSTEAEYKALSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIA 1873
            +W+SKKQ VV++SS EAEY+ALS    E +WL +   +L      P  LFCDN AAI IA
Sbjct: 474  SWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIA 533

BLAST of CSPI02G21510 vs. TAIR 10
Match: AT4G24460.1 (CRT (chloroquine-resistance transporter)-like transporter 2 )

HSP 1 Score: 199.9 bits (507), Expect = 1.8e-50
Identity = 107/180 (59.44%), Positives = 141/180 (78.33%), Query Frame = 0

Query: 78  IRNRNIKARFSSRNDSSS-------TPHASNGDSNSPAKTKLIVVSSLIAVSLAIANRVL 137
           +R  ++++RF S   ++S       +  AS  +S+ P+   LIV +S++ V+LA+ANRVL
Sbjct: 52  LRRSDLRSRFLSTPKTTSPMRRPRFSVGASTEESSIPSNRNLIVANSVVIVALAVANRVL 111

Query: 138 YKLALVPLKEYPFFLAQLTTFGYVMAYFSILYLRRRANIVTEEMLSLPKSRFMAIGFLEA 197
           YKLALVP+K+YPFF+AQLTTFGYV+ YF+ILY RRR  IVT EM+ +PK RF  IGFLEA
Sbjct: 112 YKLALVPMKQYPFFMAQLTTFGYVLIYFTILYTRRRLGIVTNEMMDVPKWRFAIIGFLEA 171

Query: 198 LGIATGMAAAASLPGPAIPILSQTFLVWQLVFSAILLGRKYSWNQIAGCVIVTAGVVVAV 251
           LG+ATGMAAAA LPGP IPIL+QT+LVWQL+F+ ++LGR++  NQIAGC++V  GVVVAV
Sbjct: 172 LGVATGMAAAAMLPGPVIPILNQTYLVWQLLFALLILGRRFLLNQIAGCLLVAVGVVVAV 231

BLAST of CSPI02G21510 vs. TAIR 10
Match: AT4G24460.2 (CRT (chloroquine-resistance transporter)-like transporter 2 )

HSP 1 Score: 199.9 bits (507), Expect = 1.8e-50
Identity = 107/180 (59.44%), Positives = 141/180 (78.33%), Query Frame = 0

Query: 78  IRNRNIKARFSSRNDSSS-------TPHASNGDSNSPAKTKLIVVSSLIAVSLAIANRVL 137
           +R  ++++RF S   ++S       +  AS  +S+ P+   LIV +S++ V+LA+ANRVL
Sbjct: 52  LRRSDLRSRFLSTPKTTSPMRRPRFSVGASTEESSIPSNRNLIVANSVVIVALAVANRVL 111

Query: 138 YKLALVPLKEYPFFLAQLTTFGYVMAYFSILYLRRRANIVTEEMLSLPKSRFMAIGFLEA 197
           YKLALVP+K+YPFF+AQLTTFGYV+ YF+ILY RRR  IVT EM+ +PK RF  IGFLEA
Sbjct: 112 YKLALVPMKQYPFFMAQLTTFGYVLIYFTILYTRRRLGIVTNEMMDVPKWRFAIIGFLEA 171

Query: 198 LGIATGMAAAASLPGPAIPILSQTFLVWQLVFSAILLGRKYSWNQIAGCVIVTAGVVVAV 251
           LG+ATGMAAAA LPGP IPIL+QT+LVWQL+F+ ++LGR++  NQIAGC++V  GVVVAV
Sbjct: 172 LGVATGMAAAAMLPGPVIPILNQTYLVWQLLFALLILGRRFLLNQIAGCLLVAVGVVVAV 231

BLAST of CSPI02G21510 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 188.7 bits (478), Expect = 4.3e-47
Identity = 92/224 (41.07%), Positives = 136/224 (60.71%), Query Frame = 0

Query: 1599 LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYI 1658
            L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1659 LDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAV 1718
              +L   GML C+P  TP+        S  + P D   ++ +VG L YL+ TRPDIS+AV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1719 SVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKS 1778
            ++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+S
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1779 TSGYCTFVWGNLVTWRSKKQSVVARSSTEAEYKALSLGICEEIW 1823
            T+G+CTF+  N+++W +K+Q  V+RSSTE EY+AL+L   E  W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI02G21510 vs. TAIR 10
Match: AT5G12170.2 (CRT (chloroquine-resistance transporter)-like transporter 3 )

HSP 1 Score: 175.3 bits (443), Expect = 4.9e-43
Identity = 94/158 (59.49%), Positives = 117/158 (74.05%), Query Frame = 0

Query: 93  SSSTPHASNGDSNSPAKTKLIVVSSLIAVSLAIANRVLYKLALVPLKEYPFFLAQLTTFG 152
           SSST H  +G+     KT  IV+ + +  +  + NRV+YKLALVPLKEYPFFLAQL+TFG
Sbjct: 90  SSSTVHVIDGEH---VKTAEIVIWAAVTAAFGVGNRVMYKLALVPLKEYPFFLAQLSTFG 149

Query: 153 YVMAYFSILYLRRRANIVTEEMLSLPKSRFMAIGFLEALGIATGMAAAASLPGPAIPILS 212
           YV  Y++ILY R RA  VT+ MLS+PKS F+ +G LEAL  A GMAAAA+L GP+  +LS
Sbjct: 150 YVAVYYTILYFRYRAGTVTDAMLSVPKSPFLIVGILEALAAAAGMAAAANLSGPSTTVLS 209

Query: 213 QTFLVWQLVFSAILLGRKYSWNQIAGCVIVTAGVVVAV 251
           QTFLVWQ+ FS I LGR+YS NQI GC +V  GV+V+V
Sbjct: 210 QTFLVWQIFFSIIFLGRRYSVNQILGCTLVALGVIVSV 244
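
The percentages on each HSP statistics line above follow from the counts on the same line (for example, 217/502 gives 43.23%). The sketch below is a minimal illustration of that arithmetic, assuming the line format shown in these alignments; `parse_hsp_stats` and `HSP_STATS` are hypothetical names and not part of any BLAST tool or of this database.

```python
import re

# Hypothetical helper for illustration only. It accepts both the
# "Positives" spelling and the "Postives" spelling seen in some dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Parse one 'Identity = a/n (x%), Positives = b/n (y%)' line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Percentages as printed on the line, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 217/502 (43.23%), Positives = 319/502 (63.55%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick way to catch lines that were damaged during extraction.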

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94HW22.1e-19230.69Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT944.0e-18329.95Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109787.1e-16430.31Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041462.4e-14828.44Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
A1L4X02.6e-4959.44Protein CLT2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CLT2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A5D3CIR00.0e+0077.85Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold227G0... [more]
A0A5D3DJM70.0e+0078.42Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold605G0... [more]
A0A5A7SL210.0e+0078.30Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1204G... [more]
A0A5A7UGB20.0e+0078.30Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43055... [more]
A0A5A7UNC50.0e+0078.24Beta-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold323G0... [more]
Match NameE-valueIdentityDescription
TYK11240.10.0e+0077.85Beta-galactosidase [Cucumis melo var. makuwa][more]
TYK23439.10.0e+0078.42Beta-galactosidase [Cucumis melo var. makuwa][more]
KAA0025363.10.0e+0078.30Beta-galactosidase [Cucumis melo var. makuwa][more]
KAA0052775.10.0e+0078.30Beta-galactosidase [Cucumis melo var. makuwa][more]
KAA0056107.10.0e+0078.24Beta-galactosidase [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
AT4G23160.13.3e-12443.23cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
AT4G24460.11.8e-5059.44CRT (chloroquine-resistance transporter)-like transporter 2 [more]
AT4G24460.21.8e-5059.44CRT (chloroquine-resistance transporter)-like transporter 2 [more]
ATMG00810.14.3e-4741.07DNA/RNA polymerases superfamily protein [more]
AT5G12170.24.9e-4359.49CRT (chloroquine-resistance transporter)-like transporter 3 [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 920..991; e-value: 1.2E-15; score: 57.1
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 1004..1105; e-value: 1.3E-14; score: 54.4
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 1002..1168; score: 23.107197
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 1452..1677; e-value: 2.0E-58; score: 197.9
IPR029472 | Retrotransposon Copia-like, N-terminal | PFAM | PF14244 | Retrotran_gag_3 | coord: 502..535; e-value: 2.9E-7; score: 30.2
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 545..629; e-value: 6.2E-8; score: 32.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 87..106
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1296..1378
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 743..759
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 743..783
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1324..1341
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 760..783
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 336..363
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1342..1378
None | No IPR available | PANTHER | PTHR45895 | FAMILY NOT NAMED | coord: 814..1782
None | No IPR available | CDD | cd09272 | RNase_HI_RT_Ty1 | coord: 1762..1898; e-value: 4.59226E-72; score: 235.055
None | No IPR available | SUPERFAMILY | 103481 | Multidrug resistance efflux transporter EmrE | coord: 182..251
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 |  | coord: 998..1179; e-value: 8.1E-42; score: 144.7
IPR013936 | Chloroquine-resistance transporter-like | PFAM | PF08627 | CRT-like | coord: 113..266; e-value: 2.9E-25; score: 88.9
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 1001..1162
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 1451..1865
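
Each InterPro row above gives the domain position as `coord: start..end` on the predicted protein. The sketch below shows one way to pull that span out of a row and compute the domain length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `domain_span` and `COORD` are hypothetical names used only for illustration.

```python
import re

# Hypothetical pattern for the 'coord: start..end' field used in the table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def domain_span(text):
    """Return (start, end, length) for the first 'coord: a..b' in text."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no 'coord: start..end' field found")
    start, end = int(m.group(1)), int(m.group(2))
    # Assumption: both endpoints are included in the domain.
    return start, end, end - start + 1

start, end, length = domain_span("PF13976 gag_pre-integrs coord: 920..991")
print(start, end, length)
```

Under the inclusive assumption, the GAG-pre-integrase hit at 920..991 spans 72 residues.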

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G21510.1CSPI02G21510.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding