CSPI03G44700 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G44700
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Centromere-associated protein E isoform X1
Location: Chr3: 38257687 .. 38267462 (+)
RNA-Seq Expression: CSPI03G44700
Synteny: CSPI03G44700
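
As a quick check on the Location field, the genomic span of the gene can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated in this record; the variable names are illustrative only.

```python
import re

# Location string as given in the Overview.
location = "Chr3: 38257687 .. 38267462 (+)"

# Split into chromosome, start, end and strand.
match = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", location)
chrom = match.group(1)
start = int(match.group(2))
end = int(match.group(3))
strand = match.group(4)

# ASSUMPTION: 1-based, inclusive coordinates, so both endpoints are counted.
span_bp = end - start + 1
print(chrom, strand, span_bp)
```

Under that assumption the gene spans 9,776 bp on the plus strand of Chr3, introns included.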
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACAAGAACAAGAGCCGTTCCGATCTGCTCGCCGCAGGCAGGAAGAAGGTAAATTTGCGTTCCTTTTATTTTCATTTTAAAATAAATGTTAAGTGTCTCATTTAGTGAGTTGAATGGGAAAATGTAAAAGTATTACATTATTGTAGCCGATAGTTTCATAAAAGCGAAATTCGTTCTGCTCATTTCGTAGGGTGTAATGTAGACTGAGTGGGGTAACAAAATTTTGCTGTTTACTCTATTTTTTAATTTGCACCCTCTTATTTTCCTTTTTCGGGCTTCCAATGACTCCCAGTTGGTAGATTTCTTTATGCTCATTTCATAGGGTTTAATATTAAGTTTGGGTATTAGTTTCTCATTCAATATGAAGAACATATTCTGAAAAAGGGTTTTTTTTTCTTGAGTTTTATTTTTATTGACTTATCTGAGATTTATTTGAACTTGGAAAGGGCATTTGTCTCAAGAATGGTTTTCTATGCACCATTCAGTCCCATTTGGCTGTACAAGCTGAAATGTATGATTTCCCCCTTTATTTTCTCTATTTATCAGCTCCAGCAATTCCGTAAGAAGAAGGATAATAAGGGCAGTGGTAGCCAAGGAGGTTCATCAAGAAATACTAGTAAATTGGAACAGCATGATGCAGATGCAGACATTGGGATTGGTGCTGCTAAATCCACATCTGGTAGGTTTTCCAGTGATGAAGTACTTGCATCCAGTGTTGATCGCAATCCACATATTGTAGATTCTTCAGCATCATCTTCTACAGAACATTCCTTGGCAGCAGAGACTGATGATCATTCAACAGTTTCTGTTAAGCAAGAGATGGATTTAGCGGAAGCTTCAGCCATTGACCAGGGAGAGACTTCGATGCAGGAAGTGGGGTATAGGGAGGACTTCGAACACACAGTCCAAAATGTTGAGGCCTCTGGATTTGTATCATCTGGACCTTCCGTTCCTACTGATGTTGAAGGGAACGACAACCCTACTTCCAATTTGTCTTTCGCCGAATCATCTTCCCAAATTTCTTCTGCTTCTGTGGAGCAGCAAGGAAGAATAGTTGAAGTAGGGGGCGGATGTAGGGAAGAAGAGCTGTTGGTTTCACCGTCTACGTCTTTGTTGCAAGCAAGGGAAGATGTAGGTTGTCCTCTTATTCTTTGGTACTTATTTGCTAATCTATTTTATCGATGGATTGTCCAGATAAACTGTGTGATCAAGGAAACCTAGTGCAAGTCCTTTATGATTAAATATATTCTTACGTTTTTGATTGGTAGGCATGGGGGATGCAGTGATGCAGCCTGGTCAAGTCCATGAAACAGAGATTGCAGGAGACAAGCAGCTAGACACTGGTGGCACAAGTGAGTCTGCAGCAGAGACTACTTTTAAAGAAACACGCTGCAATGAAGAAGAGGATATTGCAGCAGGAGTGGCATCTATATCTGTTGCTGTAACCAAATCAAATAATTATTCAATTTCTAGTCCGGGAGAAAATTTAGGCATGGAGAATAGTTCAAGTAGTAGTAGAGATGACTGGAAAGAAGAAAGACAAGTTCATGCTGAAGATACAATACATTCAAGCAGGTCTCAAGTAGAATCTATACCAGAAGATAATTTTGCAGATCTGTCTGAGGGTCACGGAAAGGCTTCACAAACAAGCGTGAAAGTTTCTGATGTGAGAGATGCCAATACTATCTCTCTTAATGCACATATGACCGCAACTTCAGATGCACAGTCAGAAACTTTTTCTTCCTTTAGACAAGATTGTAATTTTTTTGATTTACTGGAAAGAATGAAAGAAGAGTTGATAGTATCAAGTTGCTCCAAAGAAATCTTTAACATGCAAATTACTGAGCAGAATGAACTACAAATGGAGCTTGATAACCATCGTTCTAAATCAACCAAAGATGTGGCTCTGCTCAATACCTCCCTCAATGAAGTTGTTGAGAGAAATCAGAGCCTCGTCGATGAACTTTCACATTGCAGATCTGAACTTGAAGATGTTT
CAACTGCAAAGGAGAAGCTCAGAGATCAGCTGCTAACGGCAGAGGCAGAGATAGAAAAGCTTTCTTCTAAAACAAGTGAGACAGAGAATAGCTTGGAAAAGTTACATGGAGATATGTTCAGATTGGCAAAAGAGTTGGATGACTGCAAGCATTTGGTGACAATGTTGGAAGGGGAGAAGGAAAGATTAAATGGTATTATCACCTTTGAAAATGAAAATAAAATAAAATTAGCTGAGGAAAAGGAGTTGTATAGTGATGAGAATCAAAAGATATTATCAGAGTTAAGTAGCTTAAAGAGTTTGAATGTGGCTCTGGAGGCTGAAAATTCTAAATTAATGGGGAGTTTGTCATCAGTAGCAGAGGCAAAAACAAAGCTTGAAGAAGAAAGAGAGCAGTTGTTTCAGGTGAATGGGACTCTGTCAGCTGAACTTGCCAATTGTAAAAACTTGGTTGCTACTCAACAAGAGGAAAATATGAACTTAACCAAGAACCTTGCACTGGTAACAGAAGATAGGACGAAGGTAGAAGAAGATAAGAACCATTTATTTCATAAGAATGAGACAATGGCGTCTGAGCTGCTTGTTCTTGATGAGATACTGTCAACTGAACATGAGAAACGTGTAAAGTTTGAGGGTGACCTTAAAGATGCTTTGGCACAACTTGACCAACTCACTGAAGAAAATGTATTTCTCAGCAACGGTCTTGATATATATAAATTTAAAATTGAAGAACTTTGTGGCGAAATAATTTCTCTGCAAACGAGAACTAGAGAAGACGAGGACCGGGCTGAAAATGCAGGCTCTGACCAGTATCATGGAAATAATTTCCAAGAAAATGTTTCTTCCCAGATCACTTTCAAGAAATGTTTACCTAATCCTTCTTCTGTTCTTACTGGTGGGAAACCCTTCGAAGTGACTGAACAGGAAATCTTTGGTGATTCTCTTGGGTTTGTAACTTTGGGTCAACACTTGGAGGAAGCAGAACTCATGTTACAGAGACTTGAGAAGGAAATCACAGGGTTGCAGTCCAATTCTGCCTCTAGCAGGTCAGGTAGTAAAACGGCTGCACCTGCTATTTCTAAACTAATTCAAGCCTTTGAGTCGCAGGTAAATGTTGAAGAAGACGAGGTAGAGGCTGAAATCCAGTCACCTAACGATCCATATAAGTTATCAATTGAACTTGTGGAAAATTTGAGAGTATTGCTTCGCCAAGTGGTTGTGGACAGTGAGAATGCCAGTGTGTTGCTCAAGGGAGAGCGTGATCATCAGAATGTTGCTATATCAACATTGAACGAATTCAAGGACAAATTTGAAGCTTTGGAGAACTACAGCAACAATTGGGTGATGGCCAACATTGAGCACGGGGTTTTATTTGATTGCTTCAAACATCATTTGAACGATGCTGGTGATAAGATCTATGAACTTGAGATTCTTAACAAGTCTTTAAAGCAACAAGCCACGCACCACAAGAATTTTAATAGGGAGCTTGCTGAAAGGTTATGTGGATATGAATCAACACTTACTGAGTTGGAGCGTCAATTGTGCGATCTTCCTCAAAGCTCAAATGAGATGGTTTCTTTGATATGTAATCAGTTAGACAATTTGCAGGGGGGAGCAATTGAAAGGGCAATGACACTTGAGAAGGACTGGCACTCTTTCTTATTGGAGCTTGCTGAAACAATTGTTAAGCTTGATGAATCATTAGGGAAATCTGATACTCCAGCCATCAAATTTTGCACTAGTGACCAATTGCTTAGCTGCATTTCTGCCTCTGTCATAGATGCTGTCAAAACGATTAATGATCTGAGAGAGAGACTTCAAGCAACTGCTTCCAATGGCGAAGCATGTAGGATGTCATATGAAGAAGTAACTGAAAAATATGATAGTTTGTTTAGAAGGAATGAATTTACTGTTGATATGCTTCATAAGTTATATGGTGAATTGCAAAAACTTCATATTGCTTCTTGTGGATCTGTCAGCGGAAGTGATATGAACATGCAA
ATCAAGATGGTGGGTGATCCCTTAGATTACAGCAACTTTGAGGCCTTAATCAAATCGCTAGAGGATTGTATTACTGAGAAACTGCAACTTCAGTCTGTAAACGATAGACTTTGCACAGACTTGGAACGTAGGACAGTAGAATTTGTTGAGTTCAGAGAGAGATGCCTTGATTCGATTGGCATTGAAGAATTGATTAAAGATGTTCAAAGTGTGTTATCACTAGAAGACACTGAGAAGTATCATGCTGAAATACCCGCTATTTATTTGGAATCTATGGTATCATTGCTTTTACAAAAATACAGGGAGTCTGAGTTGCAATTAGGCCTATCTAGAGAAGAGTCTGAATCCAAAATGATGAAATTGACGGGACTGCAGGAAAGTGTGAATGACTTGAGCACCTTGATTCTTGATCATGAATGTGAAATTGTTCTTCTAAAAGAAAGCTTGAGCCAGGCGCAGGAAGCCTTAATGGCTTCTCGATCTGAACTAAAGGATAAAGTTAACGAACTGGAACAAACAGAGCAGCGAGTGTCTGCAATCAGAGAGAAGCTAAGCATAGCTGTTGCCAAGGGAAAAAGTTTGATTGTACAACGGGATAATTTGAAGCAGTTACTGGCACAGAATTCCAGTGAACTGGAGAGGTGCTTGCAGGAGTTGCAGATGAAAGACACCAGGCTTAATGAGACTGAAATGAAACTTAAAACCTATTCAGAAGCAGGAGAGCGTGTTGAAGCACTGGAATCTGAGCTTTCGTACATTCGGAATTCTGCCACTGCACTAAGAGAATCATTTCTTCTTAAAGATTCAGTTCTTCAGAGGATAGAGGAGATTCTTGATGAACTAGATTTGCCAGAGAATTTTCATTCAAGAGACATAATTGATAAGATTGATTGGTTAGCGAAGTCAAGTATGGGTGAGAATTTACTCCATACGGACTGGGATCAAAGGAGTTCAGTTGCAGGAGGCTCAGGCTCTGATGCTAATTTTGTTATTACAGATGCCTGGAAAGATGAAGTGCAGCCGGATGCAAATGTTGGGGATGATTTGAGAAGAAAATATGAGGAGCTCCAAACAAAGTTTTATGGGCTTGCTGAACAAAATGAAATGCTTGAACAGTCATTAATGGAAAGGAATATTATAGTGCAAAGATGGGAAGAGCTTCTAGAAAAGATTGACATTCCTTCACACTTCCGGTCCATGGAGCCAGAAGATAAAATTGAATGGTTGCACAGATCCCTTTCGGAGGCTTGTCGTGATAGGGATTCTCTCCATCAGAGGGTCAATTACTTAGAGAACTATAGTGAGTCATTAACTGCAGATCTGGATGATTCACAGAAGAAAATTTCTCACATTGAGGCAGAGCTCCAGTCAGTCTTGCTTGAGAGAGAGAAGCTTTCTGAAAAGTTGGAAATAATCCATCATCATAATGACCATCTATCATTTGGAACTTTTGAGAAAGAAATTGAGAACATAGTATTACAAAATGAATTAAGCAATACACAGGATAAATTAATTTCTACTGAGCATAAAATAGGGAAATTGGAGGCTTTGGTAAGTAATGCATTGCGAGAGGAAGACATGAATGATTTGGTTCCTGGTAGCTGCAGCATTGAATTTCTTGAATTGATGGTGATGAAGCTAATTCAAAATTATTCAGCATCTTTGTCGGGGAATACTGTGCCTAGGAGCATTATGAATGGAGCTGATACTGAAGAAATGCTTGCCAGAAGCACAGAAGCGCAGGTTGCTTGGCAAAATGATATAAATGTTCTCAAGGAAGATCTAGAGGATGCAATGCATCAATTGATGGTTGTGACGAAGGAGAGAGATCAATATATGGAGATGCATGAATCTTTAATTGTCAAGGTTGAAAGTTTAGATAAAAAGAAGGACGAGTTGGAGGAACTGCTTAATCTAGAGGAGCAGAAGTCGACGTCTGTAAGAGAGAAACTAAATGTTGCTGTCCGGAAGGGAAAGTCTTTGGTTCAACAACGAGACAC
TCTGAAACAAACCATTGAAGAGATGACCACTGAGTTGAAACGACTGAGATCTGAGATGAAGTCTCAGGAAAATACTCTCGCTAGTTATGAGCAGAAGTTTAAGGATTTCTCTGTTTACCCAGGACGGGTGGAGGCCCTGGAATCTGAAAATCTGTCTTTGAAGAACCGGTTGACTGAAATGGAAAGCAATTTACAGGAAAAAGAATATAAATTGAGCTCAATTATCAGCACGTTAGATCAAATTGAAGTGAATATTGATGTTAACGAAACTGATCCTATTGAGAAACTGAAACATGTTGGAAAACTGTGCTTTGATCTGCGTGAAGCCATGTTTTTCTCTGAACAAGAGTCTGTGAAGTCCAGAAGAGCAGCAGAGTTGCTTCTGGCAGAATTGAATGAAGTTCAGGAAAGAAATGATGCTTTCCAAGAGGAGCTAGCAAAAGCTTCCGATGAGATTGCTGAAATGACCAGGGAAAGGGACTCAGCAGAGAGTTCCAAGCTTGAAGCTCTTTCAGAACTCGAAAAGTTATCTACTCTCCAATTGAAGGAAAGAAAGAACCAATTTTCTCAATTTATGGGATTAAAATCTGGCCTTGATCGACTAAAGGAGGCTTTGCATGAGATCAATAGCTTACTTGTGGATGCTTTCTCTAGGGATTTGGATGCTTTTTATAATCTGGAAGCTGCTATTGAGTCCTGTACTAAAGCTAACGAACCTACCGAGGTCAATCCTTCTCCTTCCACTGTGTCTGGTGCCTTTAAGAAGGACAAGGTACGTTGGTTCCTCTGTAAATTTACCTGTTTCCCTTGATAGCTGTTTTGTATTTCAGTACCTGACCCTGGTCTGAGTGCCCCTTGGGTCAAGCATTTATCAGAGTTGCTGACACGAAACTTGATTTTCATTTTGTGAACTCTTGGTTTCCGCTTTTAAGGTGGTGTGGGTTTGGATTAAGCATATGGCAAATCCATGTTTGGTTGCTACTTCTATAAATCCAAGTTATTGAGATTGTTAAATCTACTTTTTTTCAACTTAGCAACATAGGTTTACAACAACTCTGTAAATTTTTGCTTTATTACAACTAATCTCGTCTCTAGCCTCATTCTTCCGAATGCACTCACTTCTTTAGTAAACACATTAGTTAAATAAAAACAATTATCTAATGTTAGTTTTTAATTAATAACTAAATAAGGTAATCAGAACTGTAAATAGTACACAATAAGGTTTAAAATTTGTTTTGGTTTTGTACTGTTGAAAATTTATTTAGGTCTTAATCAACGGGCTTATTGACGAAGATTCTTTTTCTTGTAGGGGAGTTTTTTTGCTCTGGATTCCTGGTTGAACTCCTACACTAATTCTGCTATGGACGAAAAGGTTGCAACGGAAATACATAGTCAAATTGTGCATCAACTAGAAGAATCGATGAAGGAAATTGGGGATCTGAAAGAAATGATAGATGGCCATTCTGTGTCATTCCATAAACAATCTGATTCTCTATCTAAGGTACTGGGGGAGCTTTATCAAGAAGTTAATTCACAGAAAGAGTTGGTCCAAGCATTAGAGTCAAAGGTGCAACAGTGTGAATCAGTTGCAAAAGATAAAGAAAAGGAAGGCGATATCCTATGTAGAAGCGTTGACATGCTTCTTGAAGCATGCAGATCTACAATTAAGGAAGTTGACCAAAGAAAAGGGGAACTAATGGGAAATGATTTGACTAGTGAAAATTTGGGAGTGAATTTTATCTCCACAGCACCTGATCAACTTTCACGCACAGGAAGAACTCATTTATTGTCTGAGGAATATGTCCAGACAATTGCTGACAGGTTGCTGTTAACAGTAAGGGAATTTATAGGTCTGAAAGCTGAAATGTTTGATGGTAGTGTAACAGAAATGAAGATTGCAATAGCAAATTTGCAGAAGGAGCTTCAGGAAAAGGACATCCAGAAGGAAAGGATTTGCATGGATCTTGTTGGTCAAATCAAGGAAGCAGAAGGAACTGCAA
CTAGATATTCGCTTGATCTCCAAGCTTCAAAAGATAAGGTTCGTGAGTTGGAGAAAGTAATGGAACAAATGGACAATGAGAGGAAGGCCTTCGAGCAGAGATTAAGGCAGTTGCAAGATGGTTTGTTCATCTCAGATGAGTTACGGGAGAGGGTCAAATCACTCACAGATTTGCTCGCATCAAAAGACCAAGGTATGTTAATTTGTGCTTAAGTTCTTCATATATCCACTTTTCGTAAGACTATCAATGCTCAGGCTGGCTTTGCTTTGATGTCTACAGAAATTGAAGCTTTGATGCATGCACTTGATGAAGAAGAGGTACAGATGGAAGGTCTGACCAATAAGATCGAGGAGCTGGAAAAAGTCTTGAAGGAAAAGAATCACGAACTTGAGGGCATTGAAACTTCTCGGGGGAAGCTCACAAAAAAGCTCTCAATCACTGTGACAAAATTTGATGAGCTTCATCATCTATCTGAAAGTCTCTTAACGGAGGTTGAAAAACTTCAAGCACAGTTGCAAGATCGGGATGCTGAAATCTCCTTTTTGAGACAAGAGGTAACAAGATGTACTAATGATGCTCTTGTTGCAACTCAAACAAGCAACAGAAGTACAGAGGATATCAATGAGGTCATAACATGGTTTGACATGGTGGGAGCTCGGGCGGGGCTGTCTCATATAGGTCATAGTGACCAGGCAAACGAAGTTCATGAATGCAAGGAAGTTCTCAAGAAGAAGATAACATCAATCTTAAAAGAAATTGAGGATATTCAAGCAGCATCTCAAAGGAAGGACGAATTGTTACTGGTTGAAAAGAATAAGGTAGAAGAATTGAAACGCAAGGAATTGCAACTAAACTCGCTTGAAGATGTTGGAGATGATAATAAAGCAAGAAGTGCGGCCCCTGAAATCTTTGAATCCGAACCATTGGTGAGGTTTTTATTAGTAGCTCTTTTCTTTTCTTTTCATTTGATTATATTGTTTACTACATCCTGTAAAATTTTAAGTGAGGTTAGAAATGGGACTTGTCCTGTCGCTTTTACTTCAAACATTATGTCTTGCAATTTTGGATATTACACCGAATTTGTTGAAATAGGCTCGTGCATGTTTATACATTGCATTTTCCCCTCAACTAATTTATCAACTAAAATGTATATTTTTGCTTGGTTTGCCAGGATGGTAATGTTAAGACAGTGTTTTTCAATTCTCATTTTCTGTTTTGCTATGCAATAGAAGTAATTTAACTATTCATGCATTCTTGCGCAGATTAACAAATGGGCTGCAAGCAGTACTATTACACCTCAAGTTCGTAGCTTACGCAAAGGCAATACCGATCAAGTTGCAATTGCCATAGATGTGGATCCTGCTAGCAGTAGTAATAGATTAGAGGATGAAGATGACGACAAAGGTAAAACTCTAAACTAATGTGGTTACTTTCACCTTGCTTTTCAACTTTTTTCCTTTTCTAGACGGCTTAACTTACTGTTCCTTTGATCCTCGTCCATTTTAGTGCATGGTTTCAAGTCATTAGCTTCATCAAGACTTGTTCCAAAATTTTCAAGACGTGCAACAGACATGATTGATGGTCTTTGGTAGGCATTCAGATATTCATATTCACAGTCATTAAATTATGTAAATCCGAACAAGTATGTAACAAGATTTTGTATGCAATTTTGCTAGGGTATCTTGTGATCGGGCGCTGATGCGGCAGCCTGCATTACGACTGGGAATTATATTCTATTGGGCCATATTACATGCACTTGTTGCCACATTTGTAGTT

mRNA sequence

ATGGACAAGAACAAGAGCCGTTCCGATCTGCTCGCCGCAGGCAGGAAGAAGCTCCAGCAATTCCGTAAGAAGAAGGATAATAAGGGCAGTGGTAGCCAAGGAGGTTCATCAAGAAATACTAGTAAATTGGAACAGCATGATGCAGATGCAGACATTGGGATTGGTGCTGCTAAATCCACATCTGGTAGGTTTTCCAGTGATGAAGTACTTGCATCCAGTGTTGATCGCAATCCACATATTGTAGATTCTTCAGCATCATCTTCTACAGAACATTCCTTGGCAGCAGAGACTGATGATCATTCAACAGTTTCTGTTAAGCAAGAGATGGATTTAGCGGAAGCTTCAGCCATTGACCAGGGAGAGACTTCGATGCAGGAAGTGGGGTATAGGGAGGACTTCGAACACACAGTCCAAAATGTTGAGGCCTCTGGATTTGTATCATCTGGACCTTCCGTTCCTACTGATGTTGAAGGGAACGACAACCCTACTTCCAATTTGTCTTTCGCCGAATCATCTTCCCAAATTTCTTCTGCTTCTGTGGAGCAGCAAGGAAGAATAGTTGAAGTAGGGGGCGGATGTAGGGAAGAAGAGCTGTTGGTTTCACCGTCTACGTCTTTGTTGCAAGCAAGGGAAGATGTAGGTTGCATGGGGGATGCAGTGATGCAGCCTGGTCAAGTCCATGAAACAGAGATTGCAGGAGACAAGCAGCTAGACACTGGTGGCACAAGTGAGTCTGCAGCAGAGACTACTTTTAAAGAAACACGCTGCAATGAAGAAGAGGATATTGCAGCAGGAGTGGCATCTATATCTGTTGCTGTAACCAAATCAAATAATTATTCAATTTCTAGTCCGGGAGAAAATTTAGGCATGGAGAATAGTTCAAGTAGTAGTAGAGATGACTGGAAAGAAGAAAGACAAGTTCATGCTGAAGATACAATACATTCAAGCAGGTCTCAAGTAGAATCTATACCAGAAGATAATTTTGCAGATCTGTCTGAGGGTCACGGAAAGGCTTCACAAACAAGCGTGAAAGTTTCTGATGTGAGAGATGCCAATACTATCTCTCTTAATGCACATATGACCGCAACTTCAGATGCACAGTCAGAAACTTTTTCTTCCTTTAGACAAGATTGTAATTTTTTTGATTTACTGGAAAGAATGAAAGAAGAGTTGATAGTATCAAGTTGCTCCAAAGAAATCTTTAACATGCAAATTACTGAGCAGAATGAACTACAAATGGAGCTTGATAACCATCGTTCTAAATCAACCAAAGATGTGGCTCTGCTCAATACCTCCCTCAATGAAGTTGTTGAGAGAAATCAGAGCCTCGTCGATGAACTTTCACATTGCAGATCTGAACTTGAAGATGTTTCAACTGCAAAGGAGAAGCTCAGAGATCAGCTGCTAACGGCAGAGGCAGAGATAGAAAAGCTTTCTTCTAAAACAAGTGAGACAGAGAATAGCTTGGAAAAGTTACATGGAGATATGTTCAGATTGGCAAAAGAGTTGGATGACTGCAAGCATTTGGTGACAATGTTGGAAGGGGAGAAGGAAAGATTAAATGGTATTATCACCTTTGAAAATGAAAATAAAATAAAATTAGCTGAGGAAAAGGAGTTGTATAGTGATGAGAATCAAAAGATATTATCAGAGTTAAGTAGCTTAAAGAGTTTGAATGTGGCTCTGGAGGCTGAAAATTCTAAATTAATGGGGAGTTTGTCATCAGTAGCAGAGGCAAAAACAAAGCTTGAAGAAGAAAGAGAGCAGTTGTTTCAGGTGAATGGGACTCTGTCAGCTGAACTTGCCAATTGTAAAAACTTGGTTGCTACTCAACAAGAGGAAAATATGAACTTAACCAAGAACCTTGCACTGGTAACAGAAGATAGGACGAAGGTAGAAGAAGATAAGAACCATTTATTTCATAAGAATGAGACAATGGCGTCTGAGCTGCTTGTTCTTGATGAGATACTGTCAACTGAACATGAGAAACGTGTAAAGTT
TGAGGGTGACCTTAAAGATGCTTTGGCACAACTTGACCAACTCACTGAAGAAAATGTATTTCTCAGCAACGGTCTTGATATATATAAATTTAAAATTGAAGAACTTTGTGGCGAAATAATTTCTCTGCAAACGAGAACTAGAGAAGACGAGGACCGGGCTGAAAATGCAGGCTCTGACCAGTATCATGGAAATAATTTCCAAGAAAATGTTTCTTCCCAGATCACTTTCAAGAAATGTTTACCTAATCCTTCTTCTGTTCTTACTGGTGGGAAACCCTTCGAAGTGACTGAACAGGAAATCTTTGGTGATTCTCTTGGGTTTGTAACTTTGGGTCAACACTTGGAGGAAGCAGAACTCATGTTACAGAGACTTGAGAAGGAAATCACAGGGTTGCAGTCCAATTCTGCCTCTAGCAGGTCAGGTAGTAAAACGGCTGCACCTGCTATTTCTAAACTAATTCAAGCCTTTGAGTCGCAGGTAAATGTTGAAGAAGACGAGGTAGAGGCTGAAATCCAGTCACCTAACGATCCATATAAGTTATCAATTGAACTTGTGGAAAATTTGAGAGTATTGCTTCGCCAAGTGGTTGTGGACAGTGAGAATGCCAGTGTGTTGCTCAAGGGAGAGCGTGATCATCAGAATGTTGCTATATCAACATTGAACGAATTCAAGGACAAATTTGAAGCTTTGGAGAACTACAGCAACAATTGGGTGATGGCCAACATTGAGCACGGGGTTTTATTTGATTGCTTCAAACATCATTTGAACGATGCTGGTGATAAGATCTATGAACTTGAGATTCTTAACAAGTCTTTAAAGCAACAAGCCACGCACCACAAGAATTTTAATAGGGAGCTTGCTGAAAGGTTATGTGGATATGAATCAACACTTACTGAGTTGGAGCGTCAATTGTGCGATCTTCCTCAAAGCTCAAATGAGATGGTTTCTTTGATATGTAATCAGTTAGACAATTTGCAGGGGGGAGCAATTGAAAGGGCAATGACACTTGAGAAGGACTGGCACTCTTTCTTATTGGAGCTTGCTGAAACAATTGTTAAGCTTGATGAATCATTAGGGAAATCTGATACTCCAGCCATCAAATTTTGCACTAGTGACCAATTGCTTAGCTGCATTTCTGCCTCTGTCATAGATGCTGTCAAAACGATTAATGATCTGAGAGAGAGACTTCAAGCAACTGCTTCCAATGGCGAAGCATGTAGGATGTCATATGAAGAAGTAACTGAAAAATATGATAGTTTGTTTAGAAGGAATGAATTTACTGTTGATATGCTTCATAAGTTATATGGTGAATTGCAAAAACTTCATATTGCTTCTTGTGGATCTGTCAGCGGAAGTGATATGAACATGCAAATCAAGATGGTGGGTGATCCCTTAGATTACAGCAACTTTGAGGCCTTAATCAAATCGCTAGAGGATTGTATTACTGAGAAACTGCAACTTCAGTCTGTAAACGATAGACTTTGCACAGACTTGGAACGTAGGACAGTAGAATTTGTTGAGTTCAGAGAGAGATGCCTTGATTCGATTGGCATTGAAGAATTGATTAAAGATGTTCAAAGTGTGTTATCACTAGAAGACACTGAGAAGTATCATGCTGAAATACCCGCTATTTATTTGGAATCTATGGTATCATTGCTTTTACAAAAATACAGGGAGTCTGAGTTGCAATTAGGCCTATCTAGAGAAGAGTCTGAATCCAAAATGATGAAATTGACGGGACTGCAGGAAAGTGTGAATGACTTGAGCACCTTGATTCTTGATCATGAATGTGAAATTGTTCTTCTAAAAGAAAGCTTGAGCCAGGCGCAGGAAGCCTTAATGGCTTCTCGATCTGAACTAAAGGATAAAGTTAACGAACTGGAACAAACAGAGCAGCGAGTGTCTGCAATCAGAGAGAAGCTAAGCATAGCTGTTGCCAAGGGAAAAAGTTTGATTGTACAACGGGATAATTTGAAGCAGTTACTGGCACAGAATTCCA
GTGAACTGGAGAGGTGCTTGCAGGAGTTGCAGATGAAAGACACCAGGCTTAATGAGACTGAAATGAAACTTAAAACCTATTCAGAAGCAGGAGAGCGTGTTGAAGCACTGGAATCTGAGCTTTCGTACATTCGGAATTCTGCCACTGCACTAAGAGAATCATTTCTTCTTAAAGATTCAGTTCTTCAGAGGATAGAGGAGATTCTTGATGAACTAGATTTGCCAGAGAATTTTCATTCAAGAGACATAATTGATAAGATTGATTGGTTAGCGAAGTCAAGTATGGGTGAGAATTTACTCCATACGGACTGGGATCAAAGGAGTTCAGTTGCAGGAGGCTCAGGCTCTGATGCTAATTTTGTTATTACAGATGCCTGGAAAGATGAAGTGCAGCCGGATGCAAATGTTGGGGATGATTTGAGAAGAAAATATGAGGAGCTCCAAACAAAGTTTTATGGGCTTGCTGAACAAAATGAAATGCTTGAACAGTCATTAATGGAAAGGAATATTATAGTGCAAAGATGGGAAGAGCTTCTAGAAAAGATTGACATTCCTTCACACTTCCGGTCCATGGAGCCAGAAGATAAAATTGAATGGTTGCACAGATCCCTTTCGGAGGCTTGTCGTGATAGGGATTCTCTCCATCAGAGGGTCAATTACTTAGAGAACTATAGTGAGTCATTAACTGCAGATCTGGATGATTCACAGAAGAAAATTTCTCACATTGAGGCAGAGCTCCAGTCAGTCTTGCTTGAGAGAGAGAAGCTTTCTGAAAAGTTGGAAATAATCCATCATCATAATGACCATCTATCATTTGGAACTTTTGAGAAAGAAATTGAGAACATAGTATTACAAAATGAATTAAGCAATACACAGGATAAATTAATTTCTACTGAGCATAAAATAGGGAAATTGGAGGCTTTGGTAAGTAATGCATTGCGAGAGGAAGACATGAATGATTTGGTTCCTGGTAGCTGCAGCATTGAATTTCTTGAATTGATGGTGATGAAGCTAATTCAAAATTATTCAGCATCTTTGTCGGGGAATACTGTGCCTAGGAGCATTATGAATGGAGCTGATACTGAAGAAATGCTTGCCAGAAGCACAGAAGCGCAGGTTGCTTGGCAAAATGATATAAATGTTCTCAAGGAAGATCTAGAGGATGCAATGCATCAATTGATGGTTGTGACGAAGGAGAGAGATCAATATATGGAGATGCATGAATCTTTAATTGTCAAGGTTGAAAGTTTAGATAAAAAGAAGGACGAGTTGGAGGAACTGCTTAATCTAGAGGAGCAGAAGTCGACGTCTGTAAGAGAGAAACTAAATGTTGCTGTCCGGAAGGGAAAGTCTTTGGTTCAACAACGAGACACTCTGAAACAAACCATTGAAGAGATGACCACTGAGTTGAAACGACTGAGATCTGAGATGAAGTCTCAGGAAAATACTCTCGCTAGTTATGAGCAGAAGTTTAAGGATTTCTCTGTTTACCCAGGACGGGTGGAGGCCCTGGAATCTGAAAATCTGTCTTTGAAGAACCGGTTGACTGAAATGGAAAGCAATTTACAGGAAAAAGAATATAAATTGAGCTCAATTATCAGCACGTTAGATCAAATTGAAGTGAATATTGATGTTAACGAAACTGATCCTATTGAGAAACTGAAACATGTTGGAAAACTGTGCTTTGATCTGCGTGAAGCCATGTTTTTCTCTGAACAAGAGTCTGTGAAGTCCAGAAGAGCAGCAGAGTTGCTTCTGGCAGAATTGAATGAAGTTCAGGAAAGAAATGATGCTTTCCAAGAGGAGCTAGCAAAAGCTTCCGATGAGATTGCTGAAATGACCAGGGAAAGGGACTCAGCAGAGAGTTCCAAGCTTGAAGCTCTTTCAGAACTCGAAAAGTTATCTACTCTCCAATTGAAGGAAAGAAAGAACCAATTTTCTCAATTTATGGGATTAAAATCTGGCCTTGATCGACTAAAGGAGGCTTTGCATGAGATCAAT
AGCTTACTTGTGGATGCTTTCTCTAGGGATTTGGATGCTTTTTATAATCTGGAAGCTGCTATTGAGTCCTGTACTAAAGCTAACGAACCTACCGAGGTCAATCCTTCTCCTTCCACTGTGTCTGGTGCCTTTAAGAAGGACAAGGGGAGTTTTTTTGCTCTGGATTCCTGGTTGAACTCCTACACTAATTCTGCTATGGACGAAAAGGTTGCAACGGAAATACATAGTCAAATTGTGCATCAACTAGAAGAATCGATGAAGGAAATTGGGGATCTGAAAGAAATGATAGATGGCCATTCTGTGTCATTCCATAAACAATCTGATTCTCTATCTAAGGTACTGGGGGAGCTTTATCAAGAAGTTAATTCACAGAAAGAGTTGGTCCAAGCATTAGAGTCAAAGGTGCAACAGTGTGAATCAGTTGCAAAAGATAAAGAAAAGGAAGGCGATATCCTATGTAGAAGCGTTGACATGCTTCTTGAAGCATGCAGATCTACAATTAAGGAAGTTGACCAAAGAAAAGGGGAACTAATGGGAAATGATTTGACTAGTGAAAATTTGGGAGTGAATTTTATCTCCACAGCACCTGATCAACTTTCACGCACAGGAAGAACTCATTTATTGTCTGAGGAATATGTCCAGACAATTGCTGACAGGTTGCTGTTAACAGTAAGGGAATTTATAGGTCTGAAAGCTGAAATGTTTGATGGTAGTGTAACAGAAATGAAGATTGCAATAGCAAATTTGCAGAAGGAGCTTCAGGAAAAGGACATCCAGAAGGAAAGGATTTGCATGGATCTTGTTGGTCAAATCAAGGAAGCAGAAGGAACTGCAACTAGATATTCGCTTGATCTCCAAGCTTCAAAAGATAAGGTTCGTGAGTTGGAGAAAGTAATGGAACAAATGGACAATGAGAGGAAGGCCTTCGAGCAGAGATTAAGGCAGTTGCAAGATGGTTTGTTCATCTCAGATGAGTTACGGGAGAGGGTCAAATCACTCACAGATTTGCTCGCATCAAAAGACCAAGAAATTGAAGCTTTGATGCATGCACTTGATGAAGAAGAGGTACAGATGGAAGGTCTGACCAATAAGATCGAGGAGCTGGAAAAAGTCTTGAAGGAAAAGAATCACGAACTTGAGGGCATTGAAACTTCTCGGGGGAAGCTCACAAAAAAGCTCTCAATCACTGTGACAAAATTTGATGAGCTTCATCATCTATCTGAAAGTCTCTTAACGGAGGTTGAAAAACTTCAAGCACAGTTGCAAGATCGGGATGCTGAAATCTCCTTTTTGAGACAAGAGGTAACAAGATGTACTAATGATGCTCTTGTTGCAACTCAAACAAGCAACAGAAGTACAGAGGATATCAATGAGGTCATAACATGGTTTGACATGGTGGGAGCTCGGGCGGGGCTGTCTCATATAGGTCATAGTGACCAGGCAAACGAAGTTCATGAATGCAAGGAAGTTCTCAAGAAGAAGATAACATCAATCTTAAAAGAAATTGAGGATATTCAAGCAGCATCTCAAAGGAAGGACGAATTGTTACTGGTTGAAAAGAATAAGGTAGAAGAATTGAAACGCAAGGAATTGCAACTAAACTCGCTTGAAGATGTTGGAGATGATAATAAAGCAAGAAGTGCGGCCCCTGAAATCTTTGAATCCGAACCATTGATTAACAAATGGGCTGCAAGCAGTACTATTACACCTCAAGTTCGTAGCTTACGCAAAGGCAATACCGATCAAGTTGCAATTGCCATAGATGTGGATCCTGCTAGCAGTAGTAATAGATTAGAGGATGAAGATGACGACAAAGTGCATGGTTTCAAGTCATTAGCTTCATCAAGACTTGTTCCAAAATTTTCAAGACGTGCAACAGACATGATTGATGGTCTTTGGGTATCTTGTGATCGGGCGCTGATGCGGCAGCCTGCATTACGACTGGGAATTATATTCTATTGGGCCATATTACATGCACTTGTTGCCACATTTGTAGTT

Coding sequence (CDS)

ATGGACAAGAACAAGAGCCGTTCCGATCTGCTCGCCGCAGGCAGGAAGAAGCTCCAGCAATTCCGTAAGAAGAAGGATAATAAGGGCAGTGGTAGCCAAGGAGGTTCATCAAGAAATACTAGTAAATTGGAACAGCATGATGCAGATGCAGACATTGGGATTGGTGCTGCTAAATCCACATCTGGTAGGTTTTCCAGTGATGAAGTACTTGCATCCAGTGTTGATCGCAATCCACATATTGTAGATTCTTCAGCATCATCTTCTACAGAACATTCCTTGGCAGCAGAGACTGATGATCATTCAACAGTTTCTGTTAAGCAAGAGATGGATTTAGCGGAAGCTTCAGCCATTGACCAGGGAGAGACTTCGATGCAGGAAGTGGGGTATAGGGAGGACTTCGAACACACAGTCCAAAATGTTGAGGCCTCTGGATTTGTATCATCTGGACCTTCCGTTCCTACTGATGTTGAAGGGAACGACAACCCTACTTCCAATTTGTCTTTCGCCGAATCATCTTCCCAAATTTCTTCTGCTTCTGTGGAGCAGCAAGGAAGAATAGTTGAAGTAGGGGGCGGATGTAGGGAAGAAGAGCTGTTGGTTTCACCGTCTACGTCTTTGTTGCAAGCAAGGGAAGATGTAGGTTGCATGGGGGATGCAGTGATGCAGCCTGGTCAAGTCCATGAAACAGAGATTGCAGGAGACAAGCAGCTAGACACTGGTGGCACAAGTGAGTCTGCAGCAGAGACTACTTTTAAAGAAACACGCTGCAATGAAGAAGAGGATATTGCAGCAGGAGTGGCATCTATATCTGTTGCTGTAACCAAATCAAATAATTATTCAATTTCTAGTCCGGGAGAAAATTTAGGCATGGAGAATAGTTCAAGTAGTAGTAGAGATGACTGGAAAGAAGAAAGACAAGTTCATGCTGAAGATACAATACATTCAAGCAGGTCTCAAGTAGAATCTATACCAGAAGATAATTTTGCAGATCTGTCTGAGGGTCACGGAAAGGCTTCACAAACAAGCGTGAAAGTTTCTGATGTGAGAGATGCCAATACTATCTCTCTTAATGCACATATGACCGCAACTTCAGATGCACAGTCAGAAACTTTTTCTTCCTTTAGACAAGATTGTAATTTTTTTGATTTACTGGAAAGAATGAAAGAAGAGTTGATAGTATCAAGTTGCTCCAAAGAAATCTTTAACATGCAAATTACTGAGCAGAATGAACTACAAATGGAGCTTGATAACCATCGTTCTAAATCAACCAAAGATGTGGCTCTGCTCAATACCTCCCTCAATGAAGTTGTTGAGAGAAATCAGAGCCTCGTCGATGAACTTTCACATTGCAGATCTGAACTTGAAGATGTTTCAACTGCAAAGGAGAAGCTCAGAGATCAGCTGCTAACGGCAGAGGCAGAGATAGAAAAGCTTTCTTCTAAAACAAGTGAGACAGAGAATAGCTTGGAAAAGTTACATGGAGATATGTTCAGATTGGCAAAAGAGTTGGATGACTGCAAGCATTTGGTGACAATGTTGGAAGGGGAGAAGGAAAGATTAAATGGTATTATCACCTTTGAAAATGAAAATAAAATAAAATTAGCTGAGGAAAAGGAGTTGTATAGTGATGAGAATCAAAAGATATTATCAGAGTTAAGTAGCTTAAAGAGTTTGAATGTGGCTCTGGAGGCTGAAAATTCTAAATTAATGGGGAGTTTGTCATCAGTAGCAGAGGCAAAAACAAAGCTTGAAGAAGAAAGAGAGCAGTTGTTTCAGGTGAATGGGACTCTGTCAGCTGAACTTGCCAATTGTAAAAACTTGGTTGCTACTCAACAAGAGGAAAATATGAACTTAACCAAGAACCTTGCACTGGTAACAGAAGATAGGACGAAGGTAGAAGAAGATAAGAACCATTTATTTCATAAGAATGAGACAATGGCGTCTGAGCTGCTTGTTCTTGATGAGATACTGTCAACTGAACATGAGAAACGTGTAAAGTT
TGAGGGTGACCTTAAAGATGCTTTGGCACAACTTGACCAACTCACTGAAGAAAATGTATTTCTCAGCAACGGTCTTGATATATATAAATTTAAAATTGAAGAACTTTGTGGCGAAATAATTTCTCTGCAAACGAGAACTAGAGAAGACGAGGACCGGGCTGAAAATGCAGGCTCTGACCAGTATCATGGAAATAATTTCCAAGAAAATGTTTCTTCCCAGATCACTTTCAAGAAATGTTTACCTAATCCTTCTTCTGTTCTTACTGGTGGGAAACCCTTCGAAGTGACTGAACAGGAAATCTTTGGTGATTCTCTTGGGTTTGTAACTTTGGGTCAACACTTGGAGGAAGCAGAACTCATGTTACAGAGACTTGAGAAGGAAATCACAGGGTTGCAGTCCAATTCTGCCTCTAGCAGGTCAGGTAGTAAAACGGCTGCACCTGCTATTTCTAAACTAATTCAAGCCTTTGAGTCGCAGGTAAATGTTGAAGAAGACGAGGTAGAGGCTGAAATCCAGTCACCTAACGATCCATATAAGTTATCAATTGAACTTGTGGAAAATTTGAGAGTATTGCTTCGCCAAGTGGTTGTGGACAGTGAGAATGCCAGTGTGTTGCTCAAGGGAGAGCGTGATCATCAGAATGTTGCTATATCAACATTGAACGAATTCAAGGACAAATTTGAAGCTTTGGAGAACTACAGCAACAATTGGGTGATGGCCAACATTGAGCACGGGGTTTTATTTGATTGCTTCAAACATCATTTGAACGATGCTGGTGATAAGATCTATGAACTTGAGATTCTTAACAAGTCTTTAAAGCAACAAGCCACGCACCACAAGAATTTTAATAGGGAGCTTGCTGAAAGGTTATGTGGATATGAATCAACACTTACTGAGTTGGAGCGTCAATTGTGCGATCTTCCTCAAAGCTCAAATGAGATGGTTTCTTTGATATGTAATCAGTTAGACAATTTGCAGGGGGGAGCAATTGAAAGGGCAATGACACTTGAGAAGGACTGGCACTCTTTCTTATTGGAGCTTGCTGAAACAATTGTTAAGCTTGATGAATCATTAGGGAAATCTGATACTCCAGCCATCAAATTTTGCACTAGTGACCAATTGCTTAGCTGCATTTCTGCCTCTGTCATAGATGCTGTCAAAACGATTAATGATCTGAGAGAGAGACTTCAAGCAACTGCTTCCAATGGCGAAGCATGTAGGATGTCATATGAAGAAGTAACTGAAAAATATGATAGTTTGTTTAGAAGGAATGAATTTACTGTTGATATGCTTCATAAGTTATATGGTGAATTGCAAAAACTTCATATTGCTTCTTGTGGATCTGTCAGCGGAAGTGATATGAACATGCAAATCAAGATGGTGGGTGATCCCTTAGATTACAGCAACTTTGAGGCCTTAATCAAATCGCTAGAGGATTGTATTACTGAGAAACTGCAACTTCAGTCTGTAAACGATAGACTTTGCACAGACTTGGAACGTAGGACAGTAGAATTTGTTGAGTTCAGAGAGAGATGCCTTGATTCGATTGGCATTGAAGAATTGATTAAAGATGTTCAAAGTGTGTTATCACTAGAAGACACTGAGAAGTATCATGCTGAAATACCCGCTATTTATTTGGAATCTATGGTATCATTGCTTTTACAAAAATACAGGGAGTCTGAGTTGCAATTAGGCCTATCTAGAGAAGAGTCTGAATCCAAAATGATGAAATTGACGGGACTGCAGGAAAGTGTGAATGACTTGAGCACCTTGATTCTTGATCATGAATGTGAAATTGTTCTTCTAAAAGAAAGCTTGAGCCAGGCGCAGGAAGCCTTAATGGCTTCTCGATCTGAACTAAAGGATAAAGTTAACGAACTGGAACAAACAGAGCAGCGAGTGTCTGCAATCAGAGAGAAGCTAAGCATAGCTGTTGCCAAGGGAAAAAGTTTGATTGTACAACGGGATAATTTGAAGCAGTTACTGGCACAGAATTCCA
GTGAACTGGAGAGGTGCTTGCAGGAGTTGCAGATGAAAGACACCAGGCTTAATGAGACTGAAATGAAACTTAAAACCTATTCAGAAGCAGGAGAGCGTGTTGAAGCACTGGAATCTGAGCTTTCGTACATTCGGAATTCTGCCACTGCACTAAGAGAATCATTTCTTCTTAAAGATTCAGTTCTTCAGAGGATAGAGGAGATTCTTGATGAACTAGATTTGCCAGAGAATTTTCATTCAAGAGACATAATTGATAAGATTGATTGGTTAGCGAAGTCAAGTATGGGTGAGAATTTACTCCATACGGACTGGGATCAAAGGAGTTCAGTTGCAGGAGGCTCAGGCTCTGATGCTAATTTTGTTATTACAGATGCCTGGAAAGATGAAGTGCAGCCGGATGCAAATGTTGGGGATGATTTGAGAAGAAAATATGAGGAGCTCCAAACAAAGTTTTATGGGCTTGCTGAACAAAATGAAATGCTTGAACAGTCATTAATGGAAAGGAATATTATAGTGCAAAGATGGGAAGAGCTTCTAGAAAAGATTGACATTCCTTCACACTTCCGGTCCATGGAGCCAGAAGATAAAATTGAATGGTTGCACAGATCCCTTTCGGAGGCTTGTCGTGATAGGGATTCTCTCCATCAGAGGGTCAATTACTTAGAGAACTATAGTGAGTCATTAACTGCAGATCTGGATGATTCACAGAAGAAAATTTCTCACATTGAGGCAGAGCTCCAGTCAGTCTTGCTTGAGAGAGAGAAGCTTTCTGAAAAGTTGGAAATAATCCATCATCATAATGACCATCTATCATTTGGAACTTTTGAGAAAGAAATTGAGAACATAGTATTACAAAATGAATTAAGCAATACACAGGATAAATTAATTTCTACTGAGCATAAAATAGGGAAATTGGAGGCTTTGGTAAGTAATGCATTGCGAGAGGAAGACATGAATGATTTGGTTCCTGGTAGCTGCAGCATTGAATTTCTTGAATTGATGGTGATGAAGCTAATTCAAAATTATTCAGCATCTTTGTCGGGGAATACTGTGCCTAGGAGCATTATGAATGGAGCTGATACTGAAGAAATGCTTGCCAGAAGCACAGAAGCGCAGGTTGCTTGGCAAAATGATATAAATGTTCTCAAGGAAGATCTAGAGGATGCAATGCATCAATTGATGGTTGTGACGAAGGAGAGAGATCAATATATGGAGATGCATGAATCTTTAATTGTCAAGGTTGAAAGTTTAGATAAAAAGAAGGACGAGTTGGAGGAACTGCTTAATCTAGAGGAGCAGAAGTCGACGTCTGTAAGAGAGAAACTAAATGTTGCTGTCCGGAAGGGAAAGTCTTTGGTTCAACAACGAGACACTCTGAAACAAACCATTGAAGAGATGACCACTGAGTTGAAACGACTGAGATCTGAGATGAAGTCTCAGGAAAATACTCTCGCTAGTTATGAGCAGAAGTTTAAGGATTTCTCTGTTTACCCAGGACGGGTGGAGGCCCTGGAATCTGAAAATCTGTCTTTGAAGAACCGGTTGACTGAAATGGAAAGCAATTTACAGGAAAAAGAATATAAATTGAGCTCAATTATCAGCACGTTAGATCAAATTGAAGTGAATATTGATGTTAACGAAACTGATCCTATTGAGAAACTGAAACATGTTGGAAAACTGTGCTTTGATCTGCGTGAAGCCATGTTTTTCTCTGAACAAGAGTCTGTGAAGTCCAGAAGAGCAGCAGAGTTGCTTCTGGCAGAATTGAATGAAGTTCAGGAAAGAAATGATGCTTTCCAAGAGGAGCTAGCAAAAGCTTCCGATGAGATTGCTGAAATGACCAGGGAAAGGGACTCAGCAGAGAGTTCCAAGCTTGAAGCTCTTTCAGAACTCGAAAAGTTATCTACTCTCCAATTGAAGGAAAGAAAGAACCAATTTTCTCAATTTATGGGATTAAAATCTGGCCTTGATCGACTAAAGGAGGCTTTGCATGAGATCAAT
AGCTTACTTGTGGATGCTTTCTCTAGGGATTTGGATGCTTTTTATAATCTGGAAGCTGCTATTGAGTCCTGTACTAAAGCTAACGAACCTACCGAGGTCAATCCTTCTCCTTCCACTGTGTCTGGTGCCTTTAAGAAGGACAAGGGGAGTTTTTTTGCTCTGGATTCCTGGTTGAACTCCTACACTAATTCTGCTATGGACGAAAAGGTTGCAACGGAAATACATAGTCAAATTGTGCATCAACTAGAAGAATCGATGAAGGAAATTGGGGATCTGAAAGAAATGATAGATGGCCATTCTGTGTCATTCCATAAACAATCTGATTCTCTATCTAAGGTACTGGGGGAGCTTTATCAAGAAGTTAATTCACAGAAAGAGTTGGTCCAAGCATTAGAGTCAAAGGTGCAACAGTGTGAATCAGTTGCAAAAGATAAAGAAAAGGAAGGCGATATCCTATGTAGAAGCGTTGACATGCTTCTTGAAGCATGCAGATCTACAATTAAGGAAGTTGACCAAAGAAAAGGGGAACTAATGGGAAATGATTTGACTAGTGAAAATTTGGGAGTGAATTTTATCTCCACAGCACCTGATCAACTTTCACGCACAGGAAGAACTCATTTATTGTCTGAGGAATATGTCCAGACAATTGCTGACAGGTTGCTGTTAACAGTAAGGGAATTTATAGGTCTGAAAGCTGAAATGTTTGATGGTAGTGTAACAGAAATGAAGATTGCAATAGCAAATTTGCAGAAGGAGCTTCAGGAAAAGGACATCCAGAAGGAAAGGATTTGCATGGATCTTGTTGGTCAAATCAAGGAAGCAGAAGGAACTGCAACTAGATATTCGCTTGATCTCCAAGCTTCAAAAGATAAGGTTCGTGAGTTGGAGAAAGTAATGGAACAAATGGACAATGAGAGGAAGGCCTTCGAGCAGAGATTAAGGCAGTTGCAAGATGGTTTGTTCATCTCAGATGAGTTACGGGAGAGGGTCAAATCACTCACAGATTTGCTCGCATCAAAAGACCAAGAAATTGAAGCTTTGATGCATGCACTTGATGAAGAAGAGGTACAGATGGAAGGTCTGACCAATAAGATCGAGGAGCTGGAAAAAGTCTTGAAGGAAAAGAATCACGAACTTGAGGGCATTGAAACTTCTCGGGGGAAGCTCACAAAAAAGCTCTCAATCACTGTGACAAAATTTGATGAGCTTCATCATCTATCTGAAAGTCTCTTAACGGAGGTTGAAAAACTTCAAGCACAGTTGCAAGATCGGGATGCTGAAATCTCCTTTTTGAGACAAGAGGTAACAAGATGTACTAATGATGCTCTTGTTGCAACTCAAACAAGCAACAGAAGTACAGAGGATATCAATGAGGTCATAACATGGTTTGACATGGTGGGAGCTCGGGCGGGGCTGTCTCATATAGGTCATAGTGACCAGGCAAACGAAGTTCATGAATGCAAGGAAGTTCTCAAGAAGAAGATAACATCAATCTTAAAAGAAATTGAGGATATTCAAGCAGCATCTCAAAGGAAGGACGAATTGTTACTGGTTGAAAAGAATAAGGTAGAAGAATTGAAACGCAAGGAATTGCAACTAAACTCGCTTGAAGATGTTGGAGATGATAATAAAGCAAGAAGTGCGGCCCCTGAAATCTTTGAATCCGAACCATTGATTAACAAATGGGCTGCAAGCAGTACTATTACACCTCAAGTTCGTAGCTTACGCAAAGGCAATACCGATCAAGTTGCAATTGCCATAGATGTGGATCCTGCTAGCAGTAGTAATAGATTAGAGGATGAAGATGACGACAAAGTGCATGGTTTCAAGTCATTAGCTTCATCAAGACTTGTTCCAAAATTTTCAAGACGTGCAACAGACATGATTGATGGTCTTTGGGTATCTTGTGATCGGGCGCTGATGCGGCAGCCTGCATTACGACTGGGAATTATATTCTATTGGGCCATATTACATGCACTTGTTGCCACATTTGTAGTT

Protein sequence

MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKSTSGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQGETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASVEQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTGGTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDDWKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHMTATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKELYSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAELANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTEHEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRAENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQSPNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGYESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEIN
SLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEKNKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV
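
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), the usual choice for a plant nuclear gene; the function and variable names are illustrative, and only the first 24 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First eight codons of the CDS and first eight residues of the protein above.
cds_prefix = "ATGGACAAGAACAAGAGCCGTTCC"
protein_prefix = "MDKNKSRS"

translated = translate(cds_prefix)
print(translated, translated == protein_prefix)
```

Passing the full CDS to `translate` in the same way would allow the whole protein sequence to be compared, not just its start.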
Homology
BLAST of CSPI03G44700 vs. ExPASy Swiss-Prot
Match: Q15075 (Early endosome antigen 1 OS=Homo sapiens OX=9606 GN=EEA1 PE=1 SV=2)

HSP 1 Score: 65.9 bits (159), Expect = 8.1e-09
Identity = 281/1479 (19.00%), Positives = 597/1479 (40.37%), Query Frame = 0
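
The percentages in the HSP summary are the match counts divided by the alignment length (1479 columns, gaps included). A short sketch that recomputes them from the summary line is given below; the helper name is hypothetical and the label spelling in the example string is normalized.

```python
import re

# HSP summary line for the Q15075 match (label spelling normalized).
hsp_line = "Identity = 281/1479 (19.00%), Positives = 597/1479 (40.37%), Query Frame = 0"

def hsp_fractions(line):
    """Return {label: (matches, alignment_length, percent)} for each 'label = a/b' field."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        num, den = int(num), int(den)
        out[label] = (num, den, round(100 * num / den, 2))
    return out

stats = hsp_fractions(hsp_line)
for label, (num, den, pct) in stats.items():
    print(f"{label}: {num}/{den} = {pct:.2f}%")
```

With about 19% identity spread over a long, gapped alignment, a hit like this is more consistent with shared coiled-coil composition than with close homology, so the match description should be read with caution.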

Query: 1178 RCLDSIG-IEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSR- 1237
            +C+ S+G  +EL K  ++V    +   +  E         V+LL Q+ ++ +  L   + 
Sbjct: 45   QCMKSLGSADELFKHYEAVHDAGNDSGHGGESNLALKRDDVTLLRQEVQDLQASLKEEKW 104

Query: 1238 --EESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASRSELKDKVNE 1297
              EE + ++ K  GLQ+       L+ D   E+  L++ L +AQ         +K   + 
Sbjct: 105  YSEELKKELEKYQGLQQQEAKPDGLVTDSSAELQSLEQQLEEAQ----TENFNIKQMKDL 164

Query: 1298 LEQTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELERCLQE-LQMKDTRLNE 1357
             EQ   +++       IA  K K                  + ER L+E  + K TRL E
Sbjct: 165  FEQKAAQLAT-----EIADIKSK-----------------YDEERSLREAAEQKVTRLTE 224

Query: 1358 TEMKLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFH 1417
               K  T       ++ L++EL         + +  +LK  ++Q ++ ++D + L     
Sbjct: 225  ELNKEATV------IQDLKTELL----QRPGIEDVAVLKKELVQ-VQTLMDNMTLERERE 284

Query: 1418 SRDIIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDAWKDEVQPDANVGDD 1477
            S  + D+   L           T    RS +A G    A +V       E+Q   +  ++
Sbjct: 285  SEKLKDECKKLQSQYASSEA--TISQLRSELAKGPQEVAVYV------QELQKLKSSVNE 344

Query: 1478 LRRKYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEW 1537
            L +K + L        +    LE+   E ++  +  +  L + D+             + 
Sbjct: 345  LTQKNQTLTENLLKKEQDYTKLEEKHNEESVSKKNIQATLHQKDL-----------DCQQ 404

Query: 1538 LHRSLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEK 1597
            L   LS +      +H  ++     ++ L  +L + + K  H++AE + +  +RE     
Sbjct: 405  LQSRLSASETSLHRIHVELSEKGEATQKLKEELSEVETKYQHLKAEFKQLQQQRE----- 464

Query: 1598 LEIIHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMN 1657
                            EKE   + LQ+E++    KL+ TE ++G+    +    R+    
Sbjct: 465  ----------------EKEQHGLQLQSEINQLHSKLLETERQLGEAHGRLKEQ-RQLSSE 524

Query: 1658 DLVPGSCSIEFLELMVMKLIQNYSASLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDI 1717
             L+     +  L+L + +L +     ++ +T  +  ++   T++        Q +    +
Sbjct: 525  KLMDKEQQVADLQLKLSRLEEQLKEKVTNSTELQHQLD--KTKQQHQEQQALQQSTTAKL 584

Query: 1718 NVLKEDLEDAMHQLMVVTKERDQYMEMHESLIVK----VESLDKKKDEL----------E 1777
               + DLE  + Q+     ++DQ ++  E+L+ K    +  L+K++++L           
Sbjct: 585  REAQNDLEQVLRQI----GDKDQKIQNLEALLQKSKENISLLEKEREDLYAKIQAGEGET 644

Query: 1778 ELLNLEEQKSTSVREKLNVAVRK----GKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQE 1837
             +LN  ++K+ +++E++     K     +S  Q ++ L   ++E    L+  +  + S E
Sbjct: 645  AVLNQLQEKNHTLQEQVTQLTEKLKNQSESHKQAQENLHDQVQEQKAHLRAAQDRVLSLE 704

Query: 1838 NTLASYEQKFKDFSVYPGRVE---------------ALESENLSLKNRLTEMESNLQEKE 1897
             ++     +  +      +++               A  ++   L+N L   ++ LQ+K+
Sbjct: 705  TSVNELNSQLNESKEKVSQLDIQIKAKTELLLSAEAAKTAQRADLQNHLDTAQNALQDKQ 764

Query: 1898 YKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLRE--AMFFS------------- 1957
             +L+ I + LDQ+   +        +K +H  +L   L+E    + S             
Sbjct: 765  QELNKITTQLDQVTAKLQ-------DKQEHCSQLESHLKEYKEKYLSLEQKTEELEGQIK 824

Query: 1958 --EQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALS 2017
              E +S++ + + E  L +L + ++ N   +    +A++   ++  E++   S++L+   
Sbjct: 825  KLEADSLEVKASKEQALQDLQQQRQLNTDLE---LRATELSKQLEMEKEIVSSTRLDLQK 884

Query: 2018 ELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEAL----HEINSLLVDAFSRDLDAFYNLE 2077
            + E L +++ K  K +  + + LK   + L +       E+N+ +    +         E
Sbjct: 885  KSEALESIKQKLTKQEEEKKI-LKQDFETLSQETKIQHEELNNRIQTTVTELQKVKMEKE 944

Query: 2078 AAI-ESCTKANEPTEVNPSPSTVSGAFKKD--KGSFFALDSWLNSYTNSAMDEKVATEIH 2137
            A + E  T  ++ ++V+ S       F+K+  KG    LD            EK   E+ 
Sbjct: 945  ALMTELSTVKDKLSKVSDSLKNSKSEFEKENQKGKAAILDL-----------EKTCKELK 1004

Query: 2138 SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNS-QKELVQALESK 2197
             Q+  Q+E ++KE  +LK+ ++    + H           +L  E+NS Q++L+QA    
Sbjct: 1005 HQLQVQMENTLKEQKELKKSLEKEKEASH-----------QLKLELNSMQEQLIQA---- 1064

Query: 2198 VQQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFIST 2257
                ++  K  EKE   L  +++ L ++     K+++  +GEL         + V     
Sbjct: 1065 ----QNTLKQNEKEEQQLQGNINELKQSSEQKKKQIEALQGEL--------KIAV----- 1124

Query: 2258 APDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQ 2317
                L +T   + L ++  Q  A + L   +E I +    ++ S            K+LQ
Sbjct: 1125 ----LQKTELENKLQQQLTQ--AAQELAAEKEKISVLQNNYEKSQETF--------KQLQ 1184

Query: 2318 EKDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLR 2377
                 +E   +     +K  E   +    DL ++++++    K+++++   +   E    
Sbjct: 1185 SDFYGRESELLATRQDLKSVEEKLSLAQEDLISNRNQIGNQNKLIQELKTAKATLE---- 1244

Query: 2378 QLQDGLFISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKE 2437
              QD      +L+ER K+L D+   K  + + L++   +     E    + +E+ K+ +E
Sbjct: 1245 --QDSAKKEQQLQERCKALQDIQKEKSLKEKELVNEKSKLAEIEEIKCRQEKEITKLNEE 1304

Query: 2438 -KNHELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVE---------KLQAQLQDR 2497
             K+H+LE I+        K  +   K  EL   ++SL   VE         K Q + ++ 
Sbjct: 1305 LKSHKLESIKEITNLKDAKQLLIQQKL-ELQGKADSLKAAVEQEKRNQQILKDQVKKEEE 1355

Query: 2498 DAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEV 2557
            + +  F+ +E     +  +   +   +  E+ NE      +      L  +    Q+++ 
Sbjct: 1365 ELKKEFIEKEAK--LHSEIKEKEVGMKKHEE-NEAKLTMQITALNENLGTVKKEWQSSQR 1355

Query: 2558 HECKEVLKKKITSILKEIEDIQAASQRKD-------ELLLVEKNKVEELKRKELQLN--- 2569
               +  L+K+   +  EI  ++A  Q          E  L  + ++E+L+ K L+L    
Sbjct: 1425 RVSE--LEKQTDDLRGEIAVLEATVQNNQDERRALLERCLKGEGEIEKLQTKVLELQRKL 1355

BLAST of CSPI03G44700 vs. ExPASy Swiss-Prot
Match: Q13439 (Golgin subfamily A member 4 OS=Homo sapiens OX=9606 GN=GOLGA4 PE=1 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 2.2e-06
Identity = 372/1929 (19.28%), Positives = 744/1929 (38.57%), Query Frame = 0

Query: 300  DWKEERQVHAEDTIHSSRSQVESIPEDN-------FADLSEGHGKASQTSVKVSDVRDAN 359
            + KEE        I    +Q E + E         F +L     KA  T+ K  + R   
Sbjct: 378  EMKEEEIAQLRSRIKQMTTQGEELREQKEKSERAAFEELE----KALSTAQKTEEARRKL 437

Query: 360  TISLNAHMTATSDAQSETFSSFRQDCNFFDLLERMKEELI---VSSCSKEIFNMQITEQN 419
               ++  +        E   S +Q+      L R+K+E++     S  ++I  +Q   + 
Sbjct: 438  KAEMDEQIKTIEKTSEEERISLQQE------LSRVKQEVVDVMKKSSEEQIAKLQKLHEK 497

Query: 420  ELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLL 479
            EL  +      K           +   +E++QS   ++S  + + E ++  + +L+ + +
Sbjct: 498  ELARKEQELTKKLQTREREFQEQMKVALEKSQSEYLKISQEKEQQESLALEELELQKKAI 557

Query: 480  TAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDD----CKHLVTMLEGEKERLNGIIT 539
              E+E  KL     E E    ++      L K L +     K L   LE EK + N  IT
Sbjct: 558  LTESE-NKLRDLQQEAETYRTRILELESSLEKSLQENKNQSKDLAVHLEAEKNKHNKEIT 617

Query: 540  FENENKIKLAEEKELYSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEE 599
               E       + EL S ++Q+       L+ L    + E  KL        + K  L +
Sbjct: 618  VMVEK-----HKTELESLKHQQDALWTEKLQVLKQQYQTEMEKLR---EKCEQEKETLLK 677

Query: 600  EREQLFQVNGTLSAELANCKNL--VATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKN 659
            ++E +FQ +     E  N K L  +  +Q E  +L+  L+ V + R K+EE+ + L  + 
Sbjct: 678  DKEIIFQAH----IEEMNEKTLEKLDVKQTELESLSSELSEVLKARHKLEEELSVLKDQT 737

Query: 660  ETMASELLVLDEILSTEHEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELC 719
            + M  EL    +     H+++V       D++     + E  V +       K +I +L 
Sbjct: 738  DKMKQELEAKMDEQKNHHQQQV-------DSI-----IKEHEVSIQRTEKALKDQINQLE 797

Query: 720  GEIISLQTRTREDEDRAENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVT 779
              +       +E +   EN  +D        +  S+++   +   + +   T  K +E  
Sbjct: 798  LLLKERDKHLKEHQAHVENLEADIKRSEGELQQASAKLDVFQSYQSATHEQT--KAYE-- 857

Query: 780  EQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAF 839
            EQ           L Q L + E     L K++  ++    + +    T   A    +Q  
Sbjct: 858  EQ--------LAQLQQKLLDLETERILLTKQVAEVE----AQKKDVCTELDAHKIQVQDL 917

Query: 840  ESQVNVEEDEVEAEIQSPNDPYKLSIELVENLRVLLRQVVVDSENASVLLK-GERDHQNV 899
              Q+  +  E+E +++S    Y+  +E     +   +Q++V+ EN  + ++ G++    +
Sbjct: 918  MQQLEKQNSEMEQKVKSLTQVYESKLEDGNKEQEQTKQILVEKENMILQMREGQKKEIEI 977

Query: 900  AISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCFKHHLNDAGDKIYEL-EILNKSLKQ 959
                L+  +D    L          N E+   F   +  +     K  E+ E L K L  
Sbjct: 978  LTQKLSAKEDSIHIL----------NEEYETKFKNQEKKMEKVKQKAKEMQETLKKKLLD 1037

Query: 960  QATHHKNFNRELAERLCGYESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAM 1019
            Q    K   +EL            +   ++ ++ Q+++  +S   ++L+  Q   IE   
Sbjct: 1038 QEAKLK---KELENTALELSQKEKQFNAKMLEMAQANSAGISDAVSRLETNQKEQIESLT 1097

Query: 1020 TLEKD--------WHSFLLELAETIVKLDE-SLGKSDTPAIKF--------CTSDQLLSC 1079
             + +         W   L + AE + ++ E  L + +    +         C  +++   
Sbjct: 1098 EVHRRELNDVISIWEKKLNQQAEELQEIHEIQLQEKEQEVAELKQKILLFGCEKEEMNKE 1157

Query: 1080 ISASVIDAVK---TINDLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDMLHKL 1139
            I+    + VK   T+N+L+E+L+  +++  +  ++ +E                    KL
Sbjct: 1158 ITWLKEEGVKQDTTLNELQEQLKQKSAHVNS--LAQDET-------------------KL 1217

Query: 1140 YGELQKLHIASCGSVSGS----DMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSV 1199
               L+KL +    S+  +    +  +++KM+ +  D      L   L+    E   L+S 
Sbjct: 1218 KAHLEKLEVDLNKSLKENTFLQEQLVELKMLAEE-DKRKVSELTSKLKTTDEEFQSLKSS 1277

Query: 1200 NDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMV 1259
            +++    LE +++EF +  E    +I ++   K  +++L  +  E               
Sbjct: 1278 HEKSNKSLEDKSLEFKKLSEEL--AIQLDICCKKTEALLEAKTNE--------------- 1337

Query: 1260 SLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQ 1319
                         + +S  ++ + + +++  Q     +   +L   C +  L+  L Q  
Sbjct: 1338 ------------LINISSSKTNAILSRISHCQHRTTKVKEALLIKTCTVSELEAQLRQLT 1397

Query: 1320 EALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELE 1379
            E         +   ++LE+ E ++ +++  +   V + ++L  +  N +Q     +SE E
Sbjct: 1398 EEQNTLNISFQQATHQLEEKENQIKSMKADIESLVTEKEALQKEGGNQQQA----ASEKE 1457

Query: 1380 RCLQELQMK-DTRLNETEMKLKTYSEAGERVEALESELSYIRNSATALRESFLL--KDSV 1439
             C+ +L+ +    +N   +  +   E    + +L  +L+ +      L+ S  L  K++ 
Sbjct: 1458 SCITQLKKELSENINAVTLMKEELKEKKVEISSLSKQLTDLN---VQLQNSISLSEKEAA 1517

Query: 1440 LQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFV 1499
            +  + +  DE         +D+  K+D L+K  +       DW  + S            
Sbjct: 1518 ISSLRKQYDEEKCELLDQVQDLSFKVDTLSKEKISALEQVDDWSNKFS------------ 1577

Query: 1500 ITDAWKDEVQ----PDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLMERN-------- 1559
                WK + Q       N   +L+ + E    + Y   EQ  +L++ L ++N        
Sbjct: 1578 ---EWKKKAQSRFTQHQNTVKELQIQLELKSKEAYEKDEQINLLKEELDQQNKRFDCLKG 1637

Query: 1560 ------IIVQRWEELLEKIDIPSHFRSMEPED-------KIEWLHRSLSEACRDRDSLH- 1619
                    +++ E  LE        R ME ED       +IE L+  L    + +D  H 
Sbjct: 1638 EMEDDKSKMEKKESNLETELKSQTARIMELEDHITQKTIEIESLNEVLKNYNQQKDIEHK 1697

Query: 1620 ---QRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSF 1679
               Q++ + +   E     + ++++KI  +E ++ S+  E E   ++LE       H++ 
Sbjct: 1698 ELVQKLQHFQELGEEKDNRVKEAEEKILTLENQVYSMKAELETKKKELE-------HVNL 1757

Query: 1680 GTFEKEIENIVLQNEL-SNTQDKLISTEHKIGKLEALVSNAL---REEDMNDLVPGSCS- 1739
                KE E   L++ L S +  KL   + K  +  A +   L    EE       G+ S 
Sbjct: 1758 SVKSKEEELKALEDRLESESAAKLAELKRKAEQKIAAIKKQLLSQMEEKEEQYKKGTESH 1817

Query: 1740 --------------IEFLELMVMKLIQNYSASLSGNTVPRSIMN-GADTEEMLARSTE-A 1799
                          +  LE  +  +  + S +L    VPRS  N  A TE+  A S    
Sbjct: 1818 LSELNTKLQEREREVHILEEKLKSVESSQSETL---IVPRSAKNVAAYTEQEEADSQGCV 1877

Query: 1800 QVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH-------ESLIVKVESLDKKKDEL 1859
            Q  ++  I+VL+ +L +    L  V +E+++ +  H       +  ++K+E  + K+ E 
Sbjct: 1878 QKTYEEKISVLQRNLTEKEKLLQRVGQEKEETVSSHFEMRCQYQERLIKLEHAEAKQHED 1937

Query: 1860 EELL-----NLEEQ-KSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMK 1919
            + ++      LEE+ K  S+    +V    GK+ +Q +  L+   ++       ++  ++
Sbjct: 1938 QSMIGHLQEELEEKNKKYSLIVAQHVEKEGGKNNIQAKQNLENVFDD-------VQKTLQ 1997

Query: 1920 SQENTLASYEQKFKDFS---VYPGRVEALESENLSLK-------------NRLTE-MESN 1979
             +E T    EQK K+     V    V  +E E L+ K             N+ TE +E N
Sbjct: 1998 EKELTCQILEQKIKELDSCLVRQKEVHRVEMEELTSKYEKLQALQQMDGRNKPTELLEEN 2057

Query: 1980 LQEKEYK-------LSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQE 2039
             +EK          LS++ +  + +E  +   E +  +  K + +L  DLR      +QE
Sbjct: 2058 TEEKSKSHLVQPKLLSNMEAQHNDLEFKLAGAEREKQKLGKEIVRLQKDLRMLRKEHQQE 2117

Query: 2040 -----SVKSRRAAELLLAELNEVQ-ERNDAFQEELAKASDEIAEMTRERDSAESSKLEAL 2089
                     +   E +  E  +++ + N   ++ + + + ++A+  +E +      +   
Sbjct: 2118 LEILKKEYDQEREEKIKQEQEDLELKHNSTLKQLMREFNTQLAQKEQELEMTIKETINKA 2142

BLAST of CSPI03G44700 vs. ExPASy Swiss-Prot
Match: C9ZN16 (Flagellar attachment zone protein 1 OS=Trypanosoma brucei gambiense (strain MHOM/CI/86/DAL972) OX=679716 GN=TbgDal_IV3690 PE=3 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 2.7e-04
Identity = 211/1010 (20.89%), Positives = 411/1010 (40.69%), Query Frame = 0

Query: 1545 DSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLS 1604
            + L   +  +E  S      L +  +++   +AE++ +L   E+L E+LE +        
Sbjct: 739  ERLEAELRQMEEKSRLSEQGLSEMTQRLEEKQAEIEGLLENLEQLDEQLEALRAAEKSAQ 798

Query: 1605 FGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFL 1664
                 ++ E   LQ  L    D  I T   + +L    +N     D  +    +   +  
Sbjct: 799  AHIEARDREISDLQQRLEGEIDDHIKTTALLEELRKHYNNLEELFDKQEAELMAYREKRQ 858

Query: 1665 ELMVMKLIQNYSASLSGNTVPRSIMNGAD---TEEMLARSTEAQVAWQNDINVLKEDLED 1724
                ++ ++     +   T P   +  AD   +E +L+ + +      +  N  +++ + 
Sbjct: 859  NAHKVRSLEPTLRPIGTQTKPFQEVVSADEISSEPLLSVTLDEYNDHMHRSNQFQQENDL 918

Query: 1725 AMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSV-------REK 1784
               QL     ER+   +  E L+ + +SL ++   + E L  EE+  + V        E+
Sbjct: 919  LRQQLQQANDERENLHDRLEQLMAENQSLSEQLHNMHEELEREERDRSGVTLQNERLAEE 978

Query: 1785 LNVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRS--EMKSQENTLASYEQKFKDFSVYPG 1844
            +     + + LV + +  +  I  +  +++RL    E+K+ EN   + E + K       
Sbjct: 979  IQRKTAENEQLVLENNKSRSDIRNLNVQVQRLMEELELKAAENEKLAEELELK------- 1038

Query: 1845 RVEALESENLS--LKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHV 1904
               A E+E L+  L+ +  E E   +  + K +      +++E+ +  NE          
Sbjct: 1039 ---AAENEKLAEELELKAAENEKLAEALDLKAAENEKLAEELELKVAENE---------- 1098

Query: 1905 GKLCFDLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRER 1964
                  L E +     E+ K     EL  AE  ++ E  +    E  K ++E+     E 
Sbjct: 1099 -----KLAEELELKVAENEKLAEELELKAAENEKLAEELELKAAENEKLAEELELKAAEN 1158

Query: 1965 DS-AESSKLEALSELEKLSTLQLKERKNQ-FSQFMGLKSG-LDRLKEALHEINSLLVDAF 2024
            +  AE  +L+A    +    L LK  +N+  ++ + LK+   ++L E L           
Sbjct: 1159 EKLAEELELKAAENEKLAEALDLKAAENEKLAEELDLKAAENEKLAEEL----------- 1218

Query: 2025 SRDLDAFYNLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMD 2084
              +L    N + A E   KA E  ++           K  +    A +  L +  N  + 
Sbjct: 1219 --ELKVAENEKLAEELELKAAENEKLAEELE-----LKAAENEKLAEELELKAAENEKLA 1278

Query: 2085 EKVATEI--HSQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSL-------SKVLGELY 2144
            E++  ++  + ++  +LE    E   L E ++  +    K ++ L        K+  EL 
Sbjct: 1279 EELELKVAENEKLAEELELKAAENEKLAEELELKAAENEKLAEELELKAAENEKLAEELE 1338

Query: 2145 QEVNSQKELVQALESKVQQCESVAKD---KEKEGDILCRSVDMLLEACRSTIKEVDQRKG 2204
             +V   ++L + LE K  + E +A++   K  E + L   +++         +E++ +  
Sbjct: 1339 LKVAENEKLAEELELKAAENEKLAEELELKAAENEKLAEELELKAAENEKLAEELELKAA 1398

Query: 2205 ELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMF 2264
            E   N+  +E L +           +      L+EE     A+   L   E + LKA   
Sbjct: 1399 E---NEKLAEELEL-----------KAAENEKLAEELELKAAENEKLA--EELELKAAEN 1458

Query: 2265 DGSVTEMKIAIA---NLQKELQEKDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKV 2324
            +    E+++  A    L +EL+ K  + E++  +L  ++K AE       L+L+A++++ 
Sbjct: 1459 EKLAEELELKAAENEKLAEELELKAAENEKLAEEL--ELKAAENEKLAEELELKAAENEK 1518

Query: 2325 RELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASKDQEI-EALMHAL 2384
               E  ++  +NE+ A E  L+  ++         E++    +L A++++++ E L    
Sbjct: 1519 LAEELELKAAENEKLAEELELKAAEN---------EKLAEELELKAAENEKLAEELELKA 1578

Query: 2385 DEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKFDELHHLSESLL 2444
             E E   E L  K  E EK+ +E   EL+  E    KL ++L + V + ++L    E  +
Sbjct: 1579 AENEKLAEELELKAAENEKLAEEL--ELKAAENE--KLAEELELKVAENEKLAEELELKV 1638

Query: 2445 TEVEKLQAQLQDRDAEISFLRQEVT-RCTNDALVATQTSNRSTEDINEVITWFDMVGARA 2504
             E EKL  +L+ + AE   L +EVT R +   L+A  TS R  E                
Sbjct: 1639 AENEKLAEELELKVAENKRLAEEVTQRLSEKELLAEDTSARLLE---------------- 1651

Query: 2505 GLSHIGHSDQANEVHECK-EVLKKKITSILKEIEDIQAASQRKDELLLVE 2520
                   +D AN   +CK + L++K+T +  E E   A  + +   LL +
Sbjct: 1699 -------ADSANSALQCKVKHLEEKLTLLSSEKETALATLEAEIVDLLTQ 1651

BLAST of CSPI03G44700 vs. ExPASy TrEMBL
Match: A0A0A0LDV2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881880 PE=4 SV=1)

HSP 1 Score: 4949.0 bits (12836), Expect = 0.0e+00
Identity = 2657/2666 (99.66%), Positives = 2659/2666 (99.74%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST
Sbjct: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG
Sbjct: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV
Sbjct: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVG MGDAVMQPGQVHETEIAGDKQLDTG
Sbjct: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVG-MGDAVMQPGQVHETEIAGDKQLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKETRCNEEEDIAAGV SISVAVTKSNNYSISSPGENLGMENSSSSSRDD
Sbjct: 241  GTSESAAETTFKETRCNEEEDIAAGVTSISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHG ASQTSVKVSDVRDANTISLNAHM
Sbjct: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGMASQTSVKVSDVRDANTISLNAHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQ+NGTLSAE
Sbjct: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQMNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDE LSTE
Sbjct: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH
Sbjct: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS
Sbjct: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY
Sbjct: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTI+DLRERLQATASNGEACRMSYEEV
Sbjct: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTIDDLRERLQATASNGEACRMSYEEV 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES
Sbjct: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS
Sbjct: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH
Sbjct: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY
Sbjct: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL
Sbjct: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR
Sbjct: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGL ISDELRERVKSLTDLLASK
Sbjct: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLSISDELRERVKSLTDLLASK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT 2580
            NKVEELK KELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT
Sbjct: 2521 NKVEELKCKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT 2580

Query: 2581 DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL 2640
            DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL
Sbjct: 2581 DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL 2640

Query: 2641 MRQPALRLGIIFYWAILHALVATFVV 2667
            MRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 MRQPALRLGIIFYWAILHALVATFVV 2665

BLAST of CSPI03G44700 vs. ExPASy TrEMBL
Match: A0A5A7TAW3 (Centromere-associated protein E isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G006690 PE=4 SV=1)

HSP 1 Score: 4567.3 bits (11845), Expect = 0.0e+00
Identity = 2471/2667 (92.65%), Positives = 2549/2667 (95.58%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNK+RSDLLAAGRKKLQQFRKKKD+KGSGSQG SSRNTSKLEQ DAD DI  GAAKST
Sbjct: 1    MDKNKNRSDLLAAGRKKLQQFRKKKDSKGSGSQGSSSRNTSKLEQQDADVDIVTGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SGRFSSD VLASSVD NPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAE SAIDQG
Sbjct: 61   SGRFSSDGVLASSVDGNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAETSAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYRE+FEH +QN EA GFVSSGPS+PTD+E NDNPTSNLSF ESSSQISSASV
Sbjct: 121  ETSMQEVGYREEFEHPIQNAEAIGFVSSGPSLPTDIEENDNPTSNLSFPESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEV GGCREEELLVSPSTSLLQAREDVG MGDA+MQ GQVHETE+AGDK LDTG
Sbjct: 181  EQQGRIVEVWGGCREEELLVSPSTSLLQAREDVG-MGDALMQSGQVHETELAGDKLLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKET C++EEDIAA VAS+SVAVT+SN+YSISSPGENLGM+NSSSSSRDD
Sbjct: 241  GTSESAAETTFKETHCDKEEDIAAEVASVSVAVTESNSYSISSPGENLGMDNSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WK+ERQVHAEDTIHSSRSQVESIPED+FAD SEGHGKASQTSVKVSDVRDANTISLN HM
Sbjct: 301  WKDERQVHAEDTIHSSRSQVESIPEDDFADQSEGHGKASQTSVKVSDVRDANTISLNEHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQS TFSSF QDCNFFDLLERMKEELIVSS SKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSGTFSSFGQDCNFFDLLERMKEELIVSSFSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVS AKEK RDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSIAKEKFRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVT+LEGEKERLNGIITFENENK KLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTVLEGEKERLNGIITFENENKRKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDEN+KILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQVNGTLSAE
Sbjct: 541  YSDENEKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQVNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCK+LVATQQEENMNLTKNLALVTEDRTKV+EDKN LFH+NETMASELLVL+E LSTE
Sbjct: 601  LANCKDLVATQQEENMNLTKNLALVTEDRTKVDEDKNRLFHENETMASELLVLEERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQL EENVFLSNGL+I+KFK+EELCGEIISLQTR+ EDED+A
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLIEENVFLSNGLNIHKFKLEELCGEIISLQTRSTEDEDQA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENA  D+YHG+NFQENVSSQI+FKKCLP+ SSVL GGKPF V+EQEIF DSLGFVTLGQH
Sbjct: 721  ENADCDRYHGDNFQENVSSQISFKKCLPDTSSVLAGGKPFMVSEQEIFDDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEA LMLQRLEKEITGLQSNSASSRSGSK AAPA+SKLIQAFES VNVEE EVEAEIQ 
Sbjct: 781  LEEAALMLQRLEKEITGLQSNSASSRSGSKAAAPAVSKLIQAFESHVNVEEHEVEAEIQP 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDS+NASVLLKGERDHQNVAIST NEFK+KFEALENY
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSKNASVLLKGERDHQNVAISTSNEFKEKFEALENY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNN VMANIEH VLF+C KHH+NDAGDKIYELEILNKSLKQQATHHKNFNRELAERL GY
Sbjct: 901  SNNLVMANIEHRVLFECLKHHVNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLRGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELE QLCDLPQSSNEMVSL+CNQLDNLQ GAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELEYQLCDLPQSSNEMVSLVCNQLDNLQEGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDT AIKFCTSD+LLSCISASV+DAVKTI+DLRERLQ TASN EACRM YEE+
Sbjct: 1021 LDESLGKSDTSAIKFCTSDRLLSCISASVVDAVKTIDDLRERLQTTASNSEACRMLYEEI 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGEL KLHIASCGSVSGSDMNMQIKM  DPLDYSNF AL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELHKLHIASCGSVSGSDMNMQIKM-DDPLDYSNFVAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVND+L  DLE  +VEFVEFRERCLDSIGIE+LIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDKLRLDLEHTSVEFVEFRERCLDSIGIEKLIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAI+LESMVSLLLQKYRESELQL LSREESES MMKLTG QESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIHLESMVSLLLQKYRESELQLSLSREESESIMMKLTGQQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEA+MASRSELKDKVNELEQ EQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEAVMASRSELKDKVNELEQAEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQ SSELERCLQELQMKDTRLNETE KLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQTSSELERCLQELQMKDTRLNETETKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSS GENL+HTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSTGENLVHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQ DANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQLDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RN+IVQRWEELLEKIDIPSH RSMEPEDKIEWLHRSLSEAC DRDSL QRVN LENY ES
Sbjct: 1501 RNVIVQRWEELLEKIDIPSHLRSMEPEDKIEWLHRSLSEACHDRDSLLQRVNDLENYCES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEII+HHNDHL FGTFEKEIEN VL NE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIYHHNDHLLFGTFEKEIENTVLLNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSN QD LISTEH I KLEALVSNALREEDMNDLVPGSC I FLELMVMKLIQNYSAS S
Sbjct: 1621 LSNMQDNLISTEHNIVKLEALVSNALREEDMNDLVPGSCRIGFLELMVMKLIQNYSASSS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GN VP S MNGADTEEMLARST+ QVAWQNDINVLK+DLEDAMHQLM VTKERDQYMEMH
Sbjct: 1681 GNAVPGSAMNGADTEEMLARSTDEQVAWQNDINVLKKDLEDAMHQLMAVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TEL+RLRSEMKSQENTLASYEQKF+DFSVYPG+VEALESENLSLKNRL E ESNLQEKEY
Sbjct: 1801 TELERLRSEMKSQENTLASYEQKFRDFSVYPGQVEALESENLSLKNRLNETESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSII+TLD IEVN+DV+ETDPIEKLKHVGKLC DLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIINTLDHIEVNVDVHETDPIEKLKHVGKLCSDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAE+SKLEALSELEKLSTLQL+ERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAETSKLEALSELEKLSTLQLRERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKAN+P  VN SPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANDPIGVNCSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTN+A DE VATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNAAEDENVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELV+ALESKVQQCESVAKDKEKEGDILCRSV +LL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVEALESKVQQCESVAKDKEKEGDILCRSVAVLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EAC STIKEV++RKGELMGNDLTSENLGVN ISTAP QLSR+GRTHLLSEEYVQTIADRL
Sbjct: 2161 EACTSTIKEVEERKGELMGNDLTSENLGVNIISTAPGQLSRSGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVR+FIGLKAEMFDGSV EMKIA++NLQKELQEKDIQKERICMDLVGQIKEAEG  TR
Sbjct: 2221 LLTVRKFIGLKAEMFDGSVKEMKIAMSNLQKELQEKDIQKERICMDLVGQIKEAEGITTR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKV ELEKVMEQMDNERK  EQRLR+LQDGL ISDELRERV+SLTDLLA+K
Sbjct: 2281 YSLDLQASKDKVHELEKVMEQMDNERKVLEQRLRELQDGLSISDELRERVRSLTDLLAAK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELE +ETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELESVETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGARAGLS IGHSDQ NEVHE KE+LKKKITSILKEIED+QAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARAGLSRIGHSDQENEVHERKELLKKKITSILKEIEDLQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASST-ITPQVRSLRKGN 2580
            NKVEELKRK+LQLNSLEDVGDDNKA S APEIFESEPLIN WAASST +TPQVRSLRKGN
Sbjct: 2521 NKVEELKRKKLQLNSLEDVGDDNKASSVAPEIFESEPLINTWAASSTSVTPQVRSLRKGN 2580

Query: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640
            TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA
Sbjct: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640

Query: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2667
            LMRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2665

BLAST of CSPI03G44700 vs. ExPASy TrEMBL
Match: A0A1S3CS85 (centromere-associated protein E isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503751 PE=4 SV=1)

HSP 1 Score: 4557.7 bits (11820), Expect = 0.0e+00
Identity = 2466/2667 (92.46%), Positives = 2546/2667 (95.46%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNK+RSDLLAAGRKKLQQFRKKKD+KGSGSQG SSRNTSKLEQ DAD DI  GAAKST
Sbjct: 1    MDKNKNRSDLLAAGRKKLQQFRKKKDSKGSGSQGSSSRNTSKLEQQDADVDIVTGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SGRFSSD VLASSVD NPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAE SAIDQG
Sbjct: 61   SGRFSSDGVLASSVDGNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAETSAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYRE+FEH +QN EA GFVSSGPS+PTD+E NDNPTSNLSF ESSSQISSASV
Sbjct: 121  ETSMQEVGYREEFEHPIQNAEAIGFVSSGPSLPTDIEENDNPTSNLSFPESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEV GGCREEELLVSPSTSLLQAREDVG MGDA+MQ GQVHETE+AGDK LDTG
Sbjct: 181  EQQGRIVEVWGGCREEELLVSPSTSLLQAREDVG-MGDALMQSGQVHETELAGDKLLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKET C++EEDIAA VAS+SVAV +SN+YSISSPGENLGM+NSSSSSRDD
Sbjct: 241  GTSESAAETTFKETHCDKEEDIAAEVASVSVAVIESNSYSISSPGENLGMDNSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WK+ERQVHAEDTIHSSRSQVESIPED+FAD SEGHGKASQTSVKVSDVRDANTISLN HM
Sbjct: 301  WKDERQVHAEDTIHSSRSQVESIPEDDFADQSEGHGKASQTSVKVSDVRDANTISLNEHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQS TFSSF QDCNFFDLLERMKEELIVSS SKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSGTFSSFGQDCNFFDLLERMKEELIVSSFSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVS AKEK RDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSIAKEKFRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVT+LEGEKERLNGIITFENENK KLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTVLEGEKERLNGIITFENENKRKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDEN+KILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQVNGTLSAE
Sbjct: 541  YSDENEKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQVNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCK+LVATQQEENMNLTKNLALVTEDRTKV+EDKN LFH+NETMASELLVL+E LSTE
Sbjct: 601  LANCKDLVATQQEENMNLTKNLALVTEDRTKVDEDKNRLFHENETMASELLVLEERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQL EENVFLSNGL+I+KFK+EELCGEIISLQTR+ EDED+A
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLIEENVFLSNGLNIHKFKLEELCGEIISLQTRSTEDEDQA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENA  D+YHGNNFQENVSSQI+FKKCLP+ SSVL GGKPF V+EQEIF DSLGFVTLGQH
Sbjct: 721  ENADCDRYHGNNFQENVSSQISFKKCLPDTSSVLAGGKPFMVSEQEIFDDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEA LMLQRLEKEITGLQSNSASSRSGSK AAPA+SKLIQAFES VNVEE EVEAEIQ 
Sbjct: 781  LEEAALMLQRLEKEITGLQSNSASSRSGSKAAAPAVSKLIQAFESHVNVEEHEVEAEIQP 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDS+NASVLLKGERDHQNVAIST NEFK+KFEALE+Y
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSKNASVLLKGERDHQNVAISTSNEFKEKFEALEDY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNN VMANIEH VLF+C KHH+NDAGDKIYELEILNKSLKQQATHHKNFNRELAERL GY
Sbjct: 901  SNNLVMANIEHRVLFECLKHHVNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLRGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELE QLCDLPQSSNEMVSL+CN LDNLQ GAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELEYQLCDLPQSSNEMVSLVCNLLDNLQEGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDT AIKFCTSD+LLSCISASV+DAVKTI+DLRERLQ TASN EACRM YEE+
Sbjct: 1021 LDESLGKSDTSAIKFCTSDRLLSCISASVVDAVKTIDDLRERLQTTASNSEACRMLYEEI 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGEL KLHIASCGSVSGSD+NMQIKM  DPLDYSNF AL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELHKLHIASCGSVSGSDVNMQIKM-DDPLDYSNFVAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVND+L  DLE  +VEFVEFRERCLDSIGIE+LIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDKLRLDLEHTSVEFVEFRERCLDSIGIEKLIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAI+LESMVSLLLQKYRESELQL LSREESES MMKLTG QESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIHLESMVSLLLQKYRESELQLSLSREESESIMMKLTGQQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEA+MASRSELKDKVNELEQ EQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEAVMASRSELKDKVNELEQAEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQ SSELERCLQELQMKDTRLNETE KLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQTSSELERCLQELQMKDTRLNETETKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSS GENL+HTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSTGENLVHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQ DANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQLDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RN+IVQRWEELLEKIDIPSH RSMEPEDKIEWLHRSLSEAC DRDSL QRVN LENY ES
Sbjct: 1501 RNVIVQRWEELLEKIDIPSHLRSMEPEDKIEWLHRSLSEACHDRDSLLQRVNDLENYCES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEII+HHNDHL FGTFEKEIEN VL NE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIYHHNDHLLFGTFEKEIENTVLLNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSN QD LISTEH I KLEALVSNALREEDMNDLVPGSC I FLELMVMKLIQNYSAS S
Sbjct: 1621 LSNMQDNLISTEHNIVKLEALVSNALREEDMNDLVPGSCRIGFLELMVMKLIQNYSASSS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GN VP S MNGADTEEMLARST+ QVAWQNDINVLK+DLEDAMHQLM VTKERDQYMEMH
Sbjct: 1681 GNAVPGSAMNGADTEEMLARSTDEQVAWQNDINVLKKDLEDAMHQLMAVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEM+
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMS 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TEL+RLRSEMKSQENTLASYEQKF+DFSVYPG+VEALESENLSLKNRL E ESNLQEKEY
Sbjct: 1801 TELERLRSEMKSQENTLASYEQKFRDFSVYPGQVEALESENLSLKNRLNETESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSII+TLD IEVN+DV+ETDPIEKLKHVGKLC DLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIINTLDHIEVNVDVHETDPIEKLKHVGKLCSDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAE+SKLEALSELEKLSTLQL+ERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAETSKLEALSELEKLSTLQLRERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKAN+P  VN SPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANDPIGVNCSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTN+A DE VATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNAAEDENVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELV+ALESKVQQCESVAKDKEKEGDILCRSV +LL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVEALESKVQQCESVAKDKEKEGDILCRSVAVLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EAC STIKEV++RKGELMGNDLTSENLGVN ISTAP QLSR+GRTHLLSEEYVQTIADRL
Sbjct: 2161 EACTSTIKEVEERKGELMGNDLTSENLGVNIISTAPGQLSRSGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVR+FIGLKAEMFDGSV EMKIA++NLQKELQEKDIQKERICMDLVGQIKEAEG  TR
Sbjct: 2221 LLTVRKFIGLKAEMFDGSVKEMKIAMSNLQKELQEKDIQKERICMDLVGQIKEAEGITTR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKV ELEKVMEQMDNERK  EQRLR+LQDGL ISDELRERV+SLTDLLA+K
Sbjct: 2281 YSLDLQASKDKVHELEKVMEQMDNERKVLEQRLRELQDGLSISDELRERVRSLTDLLAAK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELE +ETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELESVETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGAR GLS IGHSDQ NEVHE KE+LKKKITSILKEIED+QAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARVGLSRIGHSDQENEVHERKELLKKKITSILKEIEDLQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASST-ITPQVRSLRKGN 2580
            NKVEELKRK+LQLNSLEDVGDDNKA S APEIFESEPLIN WAASST +TPQVRSLRKGN
Sbjct: 2521 NKVEELKRKKLQLNSLEDVGDDNKASSVAPEIFESEPLINTWAASSTSVTPQVRSLRKGN 2580

Query: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640
            TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA
Sbjct: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640

Query: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2667
            LMRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2665
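
The percentages in a hit's summary line follow directly from the reported counts (matches divided by alignment length). The sketch below re-derives them as a sanity check. It is only an illustration: the function name `check_summary` and the regular expression are assumptions made for this example, not part of any BLAST tooling, and the pattern is written to accept either spelling of the "Positives" label.

```python
import re

# Hypothetical pattern for the summary line shown in these reports.
# "Pos\w*ives" accepts both "Positives" and the misspelling "Postives".
SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*ives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_summary(line):
    """Recompute the identity and positives percentages from the counts."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not a BLAST identity summary line")
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = m.groups()
    # Percentage = matching positions / alignment length * 100.
    identity_pct = 100.0 * int(ident) / int(ident_len)
    positives_pct = 100.0 * int(pos) / int(pos_len)
    return {
        "identity_pct": identity_pct,
        "positives_pct": positives_pct,
        # Reported values are rounded to two decimals, so allow half a unit
        # in the last printed place.
        "identity_ok": abs(identity_pct - float(ident_rep)) < 0.005,
        "positives_ok": abs(positives_pct - float(pos_rep)) < 0.005,
    }


if __name__ == "__main__":
    line = "Identity = 2466/2667 (92.46%), Positives = 2546/2667 (95.46%), Query Frame = 0"
    print(check_summary(line))
```

Applied to the A0A1S3CS85 hit, both recomputed percentages agree with the reported 92.46% and 95.46%.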

BLAST of CSPI03G44700 vs. ExPASy TrEMBL
Match: A0A1S3CQW7 (centromere-associated protein E isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503751 PE=4 SV=1)

HSP 1 Score: 4216.0 bits (10933), Expect = 0.0e+00
Identity = 2272/2452 (92.66%), Positives = 2346/2452 (95.68%), Query Frame = 0

Query: 216  MGDAVMQPGQVHETEIAGDKQLDTGGTSESAAETTFKETRCNEEEDIAAGVASISVAVTK 275
            MGDA+MQ GQVHETE+AGDK LDTGGTSESAAETTFKET C++EEDIAA VAS+SVAV +
Sbjct: 1    MGDALMQSGQVHETELAGDKLLDTGGTSESAAETTFKETHCDKEEDIAAEVASVSVAVIE 60

Query: 276  SNNYSISSPGENLGMENSSSSSRDDWKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGH 335
            SN+YSISSPGENLGM+NSSSSSRDDWK+ERQVHAEDTIHSSRSQVESIPED+FAD SEGH
Sbjct: 61   SNSYSISSPGENLGMDNSSSSSRDDWKDERQVHAEDTIHSSRSQVESIPEDDFADQSEGH 120

Query: 336  GKASQTSVKVSDVRDANTISLNAHMTATSDAQSETFSSFRQDCNFFDLLERMKEELIVSS 395
            GKASQTSVKVSDVRDANTISLN HMTATSDAQS TFSSF QDCNFFDLLERMKEELIVSS
Sbjct: 121  GKASQTSVKVSDVRDANTISLNEHMTATSDAQSGTFSSFGQDCNFFDLLERMKEELIVSS 180

Query: 396  CSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELE 455
             SKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELE
Sbjct: 181  FSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELE 240

Query: 456  DVSTAKEKLRDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEG 515
            DVS AKEK RDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVT+LEG
Sbjct: 241  DVSIAKEKFRDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVTVLEG 300

Query: 516  EKERLNGIITFENENKIKLAEEKELYSDENQKILSELSSLKSLNVALEAENSKLMGSLSS 575
            EKERLNGIITFENENK KLAEEKELYSDEN+KILSELSSLKSLNVALEAENSKLMGSLSS
Sbjct: 301  EKERLNGIITFENENKRKLAEEKELYSDENEKILSELSSLKSLNVALEAENSKLMGSLSS 360

Query: 576  VAEAKTKLEEEREQLFQVNGTLSAELANCKNLVATQQEENMNLTKNLALVTEDRTKVEED 635
            VAE KTKLEEEREQLFQVNGTLSAELANCK+LVATQQEENMNLTKNLALVTEDRTKV+ED
Sbjct: 361  VAEEKTKLEEEREQLFQVNGTLSAELANCKDLVATQQEENMNLTKNLALVTEDRTKVDED 420

Query: 636  KNHLFHKNETMASELLVLDEILSTEHEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIY 695
            KN LFH+NETMASELLVL+E LSTEHEKRVKFEGDLKDALAQLDQL EENVFLSNGL+I+
Sbjct: 421  KNRLFHENETMASELLVLEERLSTEHEKRVKFEGDLKDALAQLDQLIEENVFLSNGLNIH 480

Query: 696  KFKIEELCGEIISLQTRTREDEDRAENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLT 755
            KFK+EELCGEIISLQTR+ EDED+AENA  D+YHGNNFQENVSSQI+FKKCLP+ SSVL 
Sbjct: 481  KFKLEELCGEIISLQTRSTEDEDQAENADCDRYHGNNFQENVSSQISFKKCLPDTSSVLA 540

Query: 756  GGKPFEVTEQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPA 815
            GGKPF V+EQEIF DSLGFVTLGQHLEEA LMLQRLEKEITGLQSNSASSRSGSK AAPA
Sbjct: 541  GGKPFMVSEQEIFDDSLGFVTLGQHLEEAALMLQRLEKEITGLQSNSASSRSGSKAAAPA 600

Query: 816  ISKLIQAFESQVNVEEDEVEAEIQSPNDPYKLSIELVENLRVLLRQVVVDSENASVLLKG 875
            +SKLIQAFES VNVEE EVEAEIQ PNDPYKLSIELVENLRVLLRQVVVDS+NASVLLKG
Sbjct: 601  VSKLIQAFESHVNVEEHEVEAEIQPPNDPYKLSIELVENLRVLLRQVVVDSKNASVLLKG 660

Query: 876  ERDHQNVAISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEIL 935
            ERDHQNVAIST NEFK+KFEALE+YSNN VMANIEH VLF+C KHH+NDAGDKIYELEIL
Sbjct: 661  ERDHQNVAISTSNEFKEKFEALEDYSNNLVMANIEHRVLFECLKHHVNDAGDKIYELEIL 720

Query: 936  NKSLKQQATHHKNFNRELAERLCGYESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGG 995
            NKSLKQQATHHKNFNRELAERL GYESTLTELE QLCDLPQSSNEMVSL+CN LDNLQ G
Sbjct: 721  NKSLKQQATHHKNFNRELAERLRGYESTLTELEYQLCDLPQSSNEMVSLVCNLLDNLQEG 780

Query: 996  AIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKT 1055
            AIERAMTLEKDWHSFLLELAETIVKLDESLGKSDT AIKFCTSD+LLSCISASV+DAVKT
Sbjct: 781  AIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTSAIKFCTSDRLLSCISASVVDAVKT 840

Query: 1056 INDLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGS 1115
            I+DLRERLQ TASN EACRM YEE+TEKYDSLFRRNEFTVDMLHKLYGEL KLHIASCGS
Sbjct: 841  IDDLRERLQTTASNSEACRMLYEEITEKYDSLFRRNEFTVDMLHKLYGELHKLHIASCGS 900

Query: 1116 VSGSDMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEF 1175
            VSGSD+NMQIKM  DPLDYSNF ALIKSLEDCITEKLQLQSVND+L  DLE  +VEFVEF
Sbjct: 901  VSGSDVNMQIKM-DDPLDYSNFVALIKSLEDCITEKLQLQSVNDKLRLDLEHTSVEFVEF 960

Query: 1176 RERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSR 1235
            RERCLDSIGIE+LIKDVQSVLSLEDTEKYHAEIPAI+LESMVSLLLQKYRESELQL LSR
Sbjct: 961  RERCLDSIGIEKLIKDVQSVLSLEDTEKYHAEIPAIHLESMVSLLLQKYRESELQLSLSR 1020

Query: 1236 EESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASRSELKDKVNELE 1295
            EESES MMKLTG QESVNDLSTLILDHECEIVLLKESLSQAQEA+MASRSELKDKVNELE
Sbjct: 1021 EESESIMMKLTGQQESVNDLSTLILDHECEIVLLKESLSQAQEAVMASRSELKDKVNELE 1080

Query: 1296 QTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELERCLQELQMKDTRLNETEM 1355
            Q EQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQ SSELERCLQELQMKDTRLNETE 
Sbjct: 1081 QAEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQTSSELERCLQELQMKDTRLNETET 1140

Query: 1356 KLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRD 1415
            KLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRD
Sbjct: 1141 KLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRD 1200

Query: 1416 IIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRR 1475
            IIDKIDWLAKSS GENL+HTDWDQRSSVAGGSGSDANFVITDAWKDEVQ DANVGDDLRR
Sbjct: 1201 IIDKIDWLAKSSTGENLVHTDWDQRSSVAGGSGSDANFVITDAWKDEVQLDANVGDDLRR 1260

Query: 1476 KYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHR 1535
            KYEELQTKFYGLAEQNEMLEQSLMERN+IVQRWEELLEKIDIPSH RSMEPEDKIEWLHR
Sbjct: 1261 KYEELQTKFYGLAEQNEMLEQSLMERNVIVQRWEELLEKIDIPSHLRSMEPEDKIEWLHR 1320

Query: 1536 SLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEI 1595
            SLSEAC DRDSL QRVN LENY ESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEI
Sbjct: 1321 SLSEACHDRDSLLQRVNDLENYCESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEI 1380

Query: 1596 IHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLV 1655
            I+HHNDHL FGTFEKEIEN VL NELSN QD LISTEH I KLEALVSNALREEDMNDLV
Sbjct: 1381 IYHHNDHLLFGTFEKEIENTVLLNELSNMQDNLISTEHNIVKLEALVSNALREEDMNDLV 1440

Query: 1656 PGSCSIEFLELMVMKLIQNYSASLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVL 1715
            PGSC I FLELMVMKLIQNYSAS SGN VP S MNGADTEEMLARST+ QVAWQNDINVL
Sbjct: 1441 PGSCRIGFLELMVMKLIQNYSASSSGNAVPGSAMNGADTEEMLARSTDEQVAWQNDINVL 1500

Query: 1716 KEDLEDAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKL 1775
            K+DLEDAMHQLM VTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKL
Sbjct: 1501 KKDLEDAMHQLMAVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKL 1560

Query: 1776 NVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVE 1835
            NVAVRKGKSLVQQRDTLKQTIEEM+TEL+RLRSEMKSQENTLASYEQKF+DFSVYPG+VE
Sbjct: 1561 NVAVRKGKSLVQQRDTLKQTIEEMSTELERLRSEMKSQENTLASYEQKFRDFSVYPGQVE 1620

Query: 1836 ALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCF 1895
            ALESENLSLKNRL E ESNLQEKEYKLSSII+TLD IEVN+DV+ETDPIEKLKHVGKLC 
Sbjct: 1621 ALESENLSLKNRLNETESNLQEKEYKLSSIINTLDHIEVNVDVHETDPIEKLKHVGKLCS 1680

Query: 1896 DLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAES 1955
            DLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAE+
Sbjct: 1681 DLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAET 1740

Query: 1956 SKLEALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFY 2015
            SKLEALSELEKLSTLQL+ERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFY
Sbjct: 1741 SKLEALSELEKLSTLQLRERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFY 1800

Query: 2016 NLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIH 2075
            NLEAAIESCTKAN+P  VN SPSTVSGAFKKDKGSFFALDSWLNSYTN+A DE VATEIH
Sbjct: 1801 NLEAAIESCTKANDPIGVNCSPSTVSGAFKKDKGSFFALDSWLNSYTNAAEDENVATEIH 1860

Query: 2076 SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKV 2135
            SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELV+ALESKV
Sbjct: 1861 SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVEALESKV 1920

Query: 2136 QQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTA 2195
            QQCESVAKDKEKEGDILCRSV +LLEAC STIKEV++RKGELMGNDLTSENLGVN ISTA
Sbjct: 1921 QQCESVAKDKEKEGDILCRSVAVLLEACTSTIKEVEERKGELMGNDLTSENLGVNIISTA 1980

Query: 2196 PDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQE 2255
            P QLSR+GRTHLLSEEYVQTIADRLLLTVR+FIGLKAEMFDGSV EMKIA++NLQKELQE
Sbjct: 1981 PGQLSRSGRTHLLSEEYVQTIADRLLLTVRKFIGLKAEMFDGSVKEMKIAMSNLQKELQE 2040

Query: 2256 KDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQ 2315
            KDIQKERICMDLVGQIKEAEG  TRYSLDLQASKDKV ELEKVMEQMDNERK  EQRLR+
Sbjct: 2041 KDIQKERICMDLVGQIKEAEGITTRYSLDLQASKDKVHELEKVMEQMDNERKVLEQRLRE 2100

Query: 2316 LQDGLFISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEK 2375
            LQDGL ISDELRERV+SLTDLLA+KDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEK
Sbjct: 2101 LQDGLSISDELRERVRSLTDLLAAKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEK 2160

Query: 2376 NHELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEV 2435
            NHELE +ETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEV
Sbjct: 2161 NHELESVETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEV 2220

Query: 2436 TRCTNDALVATQTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKI 2495
            TRCTNDALVATQTSNRSTEDINEVITWFDMVGAR GLS IGHSDQ NEVHE KE+LKKKI
Sbjct: 2221 TRCTNDALVATQTSNRSTEDINEVITWFDMVGARVGLSRIGHSDQENEVHERKELLKKKI 2280

Query: 2496 TSILKEIEDIQAASQRKDELLLVEKNKVEELKRKELQLNSLEDVGDDNKARSAAPEIFES 2555
            TSILKEIED+QAASQRKDELLLVEKNKVEELKRK+LQLNSLEDVGDDNKA S APEIFES
Sbjct: 2281 TSILKEIEDLQAASQRKDELLLVEKNKVEELKRKKLQLNSLEDVGDDNKASSVAPEIFES 2340

Query: 2556 EPLINKWAASST-ITPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLAS 2615
            EPLIN WAASST +TPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLAS
Sbjct: 2341 EPLINTWAASSTSVTPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLAS 2400

Query: 2616 SRLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV 2667
            SRLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2401 SRLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV 2451

BLAST of CSPI03G44700 vs. ExPASy TrEMBL
Match: A0A6J1F6C6 (major antigen-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442749 PE=4 SV=1)

HSP 1 Score: 3964.5 bits (10280), Expect = 0.0e+00
Identity = 2183/2668 (81.82%), Positives = 2356/2668 (88.31%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQG SS+NTSKLEQHD DADI   +AKS 
Sbjct: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNRGGGSQGNSSKNTSKLEQHDVDADIVTASAKSP 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SG  S+DE L+ SV R+P  VDSSAS S EHSLAAE  DHST SVKQEMDLAE SAIDQ 
Sbjct: 61   SGSCSTDEALSPSVYRDPDAVDSSASPSMEHSLAAEI-DHSTDSVKQEMDLAETSAIDQA 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            E  MQEVGY ED EH +QN EA+  +  G S+PTD E NDN   NLS  ESS QISSASV
Sbjct: 121  EVPMQEVGYSEDCEHPIQNTEAA--MPFGLSLPTDAEENDNHICNLSSTESSPQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRI EV GGCREEELL S S SLLQAREDVG M D +MQ  Q HETE +GDKQL+TG
Sbjct: 181  EQQGRIAEVWGGCREEELLPSQSASLLQAREDVG-MEDVLMQSVQAHETEFSGDKQLETG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            G +ESAAETTFK+  C+++E IAA V S+S A T+SN+Y ISSPGE LGM+NSSSSSRDD
Sbjct: 241  GMNESAAETTFKDRYCDKKEIIAADVKSVSGADTESNSYLISSPGEKLGMKNSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WKEE QVHAED I SSR +V+ +PEDNFAD SEGH  ASQT    SD  DAN IS NAHM
Sbjct: 301  WKEESQVHAEDMIQSSRCEVQYMPEDNFADQSEGHDMASQT----SDAGDANAISHNAHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            T+TSDA S TFSSF QD  F  LLERMKEELIV+S SK+IFN+QI+EQNELQ+ELDNH  
Sbjct: 361  TSTSDA-SGTFSSFEQDSKFLHLLERMKEELIVTSFSKDIFNLQISEQNELQLELDNHLH 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KST D+  LNTSL+EV+ERNQSLVDELSHCRSEL+DV T KE+LRDQLL AEAEIEKLSS
Sbjct: 421  KSTDDMTRLNTSLDEVLERNQSLVDELSHCRSELKDVLTTKEELRDQLLNAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            +TSETENSLEK HGDMFRL KELDDCKHLVT+LE E ERLNGIIT ENENK KLAEEKEL
Sbjct: 481  RTSETENSLEKFHGDMFRLGKELDDCKHLVTVLEEENERLNGIITSENENKRKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            Y +EN+KILSE+SS KSL +ALE ENSKLMGSLS V E KTKLEEERE L Q+NGTLS E
Sbjct: 541  YINENEKILSEISSFKSLKMALEVENSKLMGSLSEVVEEKTKLEEEREHLCQMNGTLSVE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            L+NCKNLVATQQEE  +L KNLAL TEDRTK+EEDKN LFH+NE +ASELLVLDE LSTE
Sbjct: 601  LSNCKNLVATQQEEITDLIKNLALATEDRTKLEEDKNRLFHENERIASELLVLDERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HE+RVK E DLKDALAQLDQLTEEN+FLSN LDI+ FKIEELCGEI+SLQTR+ +DED+A
Sbjct: 661  HEERVKLESDLKDALAQLDQLTEENIFLSNNLDIHIFKIEELCGEILSLQTRSVDDEDQA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            EN  S + HGN FQ N SSQITFK+ L   SSVL GGKPF VTEQEIF DSLG VTLGQH
Sbjct: 721  ENTDSGRRHGNKFQGNDSSQITFKENLHEISSVLAGGKPFIVTEQEIFDDSLGLVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSAS-SRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQ 840
            LEEA+LMLQ+LEKEI GLQSNSAS S SGSK AAPA+SKLIQAFES+VNVEE EV+AEIQ
Sbjct: 781  LEEADLMLQKLEKEIKGLQSNSASFSSSGSKMAAPAVSKLIQAFESKVNVEEQEVDAEIQ 840

Query: 841  SPNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALEN 900
              NDPYKLS ELV+NLRVLLRQVVVDSE ASVLLKGERDH+ VAISTLNEFKD+FE LEN
Sbjct: 841  LSNDPYKLSNELVDNLRVLLRQVVVDSEKASVLLKGERDHRKVAISTLNEFKDQFEDLEN 900

Query: 901  YSNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCG 960
            +SN+ VMANIEH +LF+C KHH+ DAGDKIYELEIL +SLKQQ  HHKN N ELA RLCG
Sbjct: 901  HSNDLVMANIEHSILFECLKHHVYDAGDKIYELEILKESLKQQGVHHKNSNCELAVRLCG 960

Query: 961  YESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIV 1020
            Y+  LTELE QLCD  Q SNE VSLICNQLDNLQ G IER MTLEKDWHSFLLELAETI 
Sbjct: 961  YKLKLTELESQLCDFHQGSNETVSLICNQLDNLQEGEIERGMTLEKDWHSFLLELAETIA 1020

Query: 1021 KLDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEE 1080
            KLDESLG S+T AIKFCT+DQL SCI+ SV +AV  I+DLRERLQATASNGEA RM YEE
Sbjct: 1021 KLDESLGNSNTSAIKFCTNDQLPSCIATSVKNAVNIIDDLRERLQATASNGEAFRMLYEE 1080

Query: 1081 VTEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEA 1140
            V EKYD+LFR  E +VDML ++YG+LQ L+IASCGSVSGSDMNMQIKM+GDPLDYSNFE 
Sbjct: 1081 VNEKYDNLFRSTELSVDMLRRIYGKLQNLYIASCGSVSGSDMNMQIKMLGDPLDYSNFET 1140

Query: 1141 LIKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLE 1200
            LIK LEDCITE+L+L+S+ND+L  DLE RTVEFV+FRERCLD IGI++LIK+VQSVL LE
Sbjct: 1141 LIKPLEDCITERLRLESLNDKLRLDLEHRTVEFVQFRERCLDPIGIQKLIKNVQSVLLLE 1200

Query: 1201 DTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLI 1260
            DTEK  AE+PA +LE+MVSL+LQKYRESELQLGLSREE  S MMKLT LQESV+DLSTLI
Sbjct: 1201 DTEKDRAEMPAFHLETMVSLVLQKYRESELQLGLSREECGSVMMKLTELQESVHDLSTLI 1260

Query: 1261 LDHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLI 1320
            LDHECEIVLLKESLSQAQEALMA RSELKDKV+ELEQ+EQRVSAIR+KLSIAVAKGK LI
Sbjct: 1261 LDHECEIVLLKESLSQAQEALMALRSELKDKVDELEQSEQRVSAIRDKLSIAVAKGKGLI 1320

Query: 1321 VQRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRN 1380
            VQRDNLKQLLAQ SSELERCLQELQMKDTRL+E E KL TYSEAGERVEALESELSYIRN
Sbjct: 1321 VQRDNLKQLLAQTSSELERCLQELQMKDTRLHEVETKLNTYSEAGERVEALESELSYIRN 1380

Query: 1381 SATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQ 1440
            SATALRESFLLKDSVLQRIEEILDELDLPENFHSRDII+KIDWLAKSS GEN+ HTDWDQ
Sbjct: 1381 SATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIEKIDWLAKSSAGENIPHTDWDQ 1440

Query: 1441 RSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLM 1500
            RSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLM
Sbjct: 1441 RSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLM 1500

Query: 1501 ERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSE 1560
            ERN  VQRWEELLEKIDI SH RSMEPEDKIEWL+RSLSEAC DRDSLHQRVNYLENY  
Sbjct: 1501 ERNNAVQRWEELLEKIDIHSHLRSMEPEDKIEWLNRSLSEACHDRDSLHQRVNYLENYCG 1560

Query: 1561 SLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQN 1620
            SLTADLDDS+KKIS IEAELQ VLLEREKLSEKLEII HHNDHLSFGTFE EIENIVLQN
Sbjct: 1561 SLTADLDDSRKKISDIEAELQLVLLEREKLSEKLEIIDHHNDHLSFGTFENEIENIVLQN 1620

Query: 1621 ELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASL 1680
            ELSN Q+KLISTE KI KLEALV N L++ D++DLV GS SIEFLELMVMKL+QNY+ SL
Sbjct: 1621 ELSNMQEKLISTELKIVKLEALVGNVLQDNDVHDLVSGS-SIEFLELMVMKLVQNYTFSL 1680

Query: 1681 SGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEM 1740
              + VP S  NG+ TEEMLARS +A VAWQNDINVLK+DLEDAMHQLMVVTKERD+YMEM
Sbjct: 1681 R-DAVPESTTNGSTTEEMLARSVDAHVAWQNDINVLKKDLEDAMHQLMVVTKERDRYMEM 1740

Query: 1741 HESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEM 1800
            HE L+VKVES+DKKKDEL+ELLNLEEQKSTS+REKLNVAVRKGKSLVQQRD+LKQ IEEM
Sbjct: 1741 HEYLVVKVESIDKKKDELQELLNLEEQKSTSIREKLNVAVRKGKSLVQQRDSLKQAIEEM 1800

Query: 1801 TTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKE 1860
            TTELK LRSEMKSQENTLASYEQK +DFSVY GRVEALESENLSLKN+LTE ++NLQEKE
Sbjct: 1801 TTELKNLRSEMKSQENTLASYEQKLRDFSVYTGRVEALESENLSLKNQLTETKNNLQEKE 1860

Query: 1861 YKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLL 1920
            +KLSSII+TL  +EVN+DV ETDPIEKLK VGKLC DLREAM  SEQESVKSRRAAELLL
Sbjct: 1861 FKLSSIINTLVHMEVNVDVYETDPIEKLKQVGKLCSDLREAMVSSEQESVKSRRAAELLL 1920

Query: 1921 AELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQF 1980
            AELNEVQERNDAFQEELAKASDEIAE+T+ERD AE+SKLEALSELEKLSTL LKERKNQF
Sbjct: 1921 AELNEVQERNDAFQEELAKASDEIAELTKERDLAETSKLEALSELEKLSTLHLKERKNQF 1980

Query: 1981 SQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPST 2040
            S+FMG KSGLD+LKEAL EIN LL DAFSRDLDAFYNLE AIESCTKAN+  EVNPSPST
Sbjct: 1981 SKFMGFKSGLDQLKEALREINCLLADAFSRDLDAFYNLEVAIESCTKANDLAEVNPSPST 2040

Query: 2041 VSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGH 2100
            VSG  KKDKGSFFALD+WLNSY NS +DE V TEIHSQI+  LEES+KEIG LKEMI GH
Sbjct: 2041 VSGVVKKDKGSFFALDTWLNSYANSPVDENVETEIHSQIMQHLEESIKEIGALKEMIGGH 2100

Query: 2101 SVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDML 2160
            SVSFHK+SDSLSKVLG LYQEV SQKELVQALE  VQQ ESVAKDKEKEGDILCR++ +L
Sbjct: 2101 SVSFHKRSDSLSKVLGSLYQEVLSQKELVQALELDVQQRESVAKDKEKEGDILCRNIAVL 2160

Query: 2161 LEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADR 2220
             EAC STIKE+DQRKGELMGNDLTSENLG++  S  PDQLS  G+THLLSEEYV+ IADR
Sbjct: 2161 SEACTSTIKEIDQRKGELMGNDLTSENLGMDINSPTPDQLSHIGKTHLLSEEYVRRIADR 2220

Query: 2221 LLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTAT 2280
            LL+TVREFIGLKAEMFDG V EMK AIANLQKELQEKDIQ ER+CM+LVGQIKEAE TAT
Sbjct: 2221 LLITVREFIGLKAEMFDGHVKEMKAAIANLQKELQEKDIQNERVCMELVGQIKEAEATAT 2280

Query: 2281 RYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLAS 2340
            RYSLDLQASKD++REL+KV EQM++ERK  EQRLR+++DGL ISDELRE V+ LTD LA+
Sbjct: 2281 RYSLDLQASKDEMRELQKVTEQMESERKILEQRLREMRDGLSISDELRETVRLLTDSLAA 2340

Query: 2341 KDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTK 2400
            KDQEIEALMHALDEEE QMEGLTNKIEE EKVLK+KN ELE IETSRGKLTKKLS+TVTK
Sbjct: 2341 KDQEIEALMHALDEEEEQMEGLTNKIEEQEKVLKQKNQELESIETSRGKLTKKLSLTVTK 2400

Query: 2401 FDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEV 2460
            FDELHHLSESLLTEVEKLQAQLQDRDAE+SFLRQEVTRCTNDALVATQTSNRSTEDINEV
Sbjct: 2401 FDELHHLSESLLTEVEKLQAQLQDRDAEVSFLRQEVTRCTNDALVATQTSNRSTEDINEV 2460

Query: 2461 ITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVE 2520
            ITWFDM+ AR GLSHIGH DQ N V ECKEVLKKKITSILKEIED+QA SQRKD LLL E
Sbjct: 2461 ITWFDMMEARVGLSHIGHDDQENGVRECKEVLKKKITSILKEIEDLQAVSQRKDALLLAE 2520

Query: 2521 KNKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASST-ITPQVRSLRKG 2580
            KNKVEELKRKELQLN LEDVGD N+A SAAPEIFESEPLINKWAASST +TPQV SLRKG
Sbjct: 2521 KNKVEELKRKELQLNLLEDVGDGNRASSAAPEIFESEPLINKWAASSTSVTPQVPSLRKG 2580

Query: 2581 NTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDR 2640
            NTDQVAIAID+DPASSSNRLEDEDDDKVHGFKSLASSR+VPKFSRRATDMIDGLWVSCDR
Sbjct: 2581 NTDQVAIAIDMDPASSSNRLEDEDDDKVHGFKSLASSRIVPKFSRRATDMIDGLWVSCDR 2640

Query: 2641 ALMRQPALRLGIIFYWAILHALVATFVV 2667
            ALMRQPALRLGIIFYWAILHAL+ATFVV
Sbjct: 2641 ALMRQPALRLGIIFYWAILHALLATFVV 2657

BLAST of CSPI03G44700 vs. NCBI nr
Match: XP_011652533.1 (centromere-associated protein E isoform X1 [Cucumis sativus])

HSP 1 Score: 4949.0 bits (12836), Expect = 0.0e+00
Identity = 2657/2666 (99.66%), Positives = 2659/2666 (99.74%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST
Sbjct: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG
Sbjct: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV
Sbjct: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVG MGDAVMQPGQVHETEIAGDKQLDTG
Sbjct: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVG-MGDAVMQPGQVHETEIAGDKQLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKETRCNEEEDIAAGV SISVAVTKSNNYSISSPGENLGMENSSSSSRDD
Sbjct: 241  GTSESAAETTFKETRCNEEEDIAAGVTSISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHG ASQTSVKVSDVRDANTISLNAHM
Sbjct: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGMASQTSVKVSDVRDANTISLNAHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQ+NGTLSAE
Sbjct: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQMNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDE LSTE
Sbjct: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH
Sbjct: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS
Sbjct: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY
Sbjct: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTI+DLRERLQATASNGEACRMSYEEV
Sbjct: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTIDDLRERLQATASNGEACRMSYEEV 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES
Sbjct: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS
Sbjct: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH
Sbjct: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY
Sbjct: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL
Sbjct: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR
Sbjct: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGL ISDELRERVKSLTDLLASK
Sbjct: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLSISDELRERVKSLTDLLASK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT 2580
            NKVEELK KELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT
Sbjct: 2521 NKVEELKCKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT 2580

Query: 2581 DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL 2640
            DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL
Sbjct: 2581 DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL 2640

Query: 2641 MRQPALRLGIIFYWAILHALVATFVV 2667
            MRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 MRQPALRLGIIFYWAILHALVATFVV 2665

BLAST of CSPI03G44700 vs. NCBI nr
Match: KGN60307.2 (hypothetical protein Csa_002649 [Cucumis sativus])

HSP 1 Score: 4885.5 bits (12671), Expect = 0.0e+00
Identity = 2629/2666 (98.61%), Positives = 2631/2666 (98.69%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST
Sbjct: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            S                            EHSLAAETDDHSTVSVKQEMDLAEASAIDQG
Sbjct: 61   S----------------------------EHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV
Sbjct: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVG MGDAVMQPGQVHETEIAGDKQLDTG
Sbjct: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVG-MGDAVMQPGQVHETEIAGDKQLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKETRCNEEEDIAAGV SISVAVTKSNNYSISSPGENLGMENSSSSSRDD
Sbjct: 241  GTSESAAETTFKETRCNEEEDIAAGVTSISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHG ASQTSVKVSDVRDANTISLNAHM
Sbjct: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGMASQTSVKVSDVRDANTISLNAHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQ+NGTLSAE
Sbjct: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQMNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDE LSTE
Sbjct: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH
Sbjct: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS
Sbjct: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY
Sbjct: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTI+DLRERLQATASNGEACRMSYEEV
Sbjct: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTIDDLRERLQATASNGEACRMSYEEV 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES
Sbjct: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS
Sbjct: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH
Sbjct: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY
Sbjct: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL
Sbjct: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR
Sbjct: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGL ISDELRERVKSLTDLLASK
Sbjct: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLSISDELRERVKSLTDLLASK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT 2580
            NKVEELK KELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT
Sbjct: 2521 NKVEELKCKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASSTITPQVRSLRKGNT 2580

Query: 2581 DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL 2640
            DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL
Sbjct: 2581 DQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRAL 2637

Query: 2641 MRQPALRLGIIFYWAILHALVATFVV 2667
            MRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 MRQPALRLGIIFYWAILHALVATFVV 2637

BLAST of CSPI03G44700 vs. NCBI nr
Match: XP_004136448.1 (centromere-associated protein E isoform X2 [Cucumis sativus])

HSP 1 Score: 4567.3 bits (11845), Expect = 0.0e+00
Identity = 2443/2451 (99.67%), Positives = 2445/2451 (99.76%), Query Frame = 0

Query: 216  MGDAVMQPGQVHETEIAGDKQLDTGGTSESAAETTFKETRCNEEEDIAAGVASISVAVTK 275
            MGDAVMQPGQVHETEIAGDKQLDTGGTSESAAETTFKETRCNEEEDIAAGV SISVAVTK
Sbjct: 1    MGDAVMQPGQVHETEIAGDKQLDTGGTSESAAETTFKETRCNEEEDIAAGVTSISVAVTK 60

Query: 276  SNNYSISSPGENLGMENSSSSSRDDWKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGH 335
            SNNYSISSPGENLGMENSSSSSRDDWKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGH
Sbjct: 61   SNNYSISSPGENLGMENSSSSSRDDWKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGH 120

Query: 336  GKASQTSVKVSDVRDANTISLNAHMTATSDAQSETFSSFRQDCNFFDLLERMKEELIVSS 395
            G ASQTSVKVSDVRDANTISLNAHMTATSDAQSETFSSFRQDCNFFDLLERMKEELIVSS
Sbjct: 121  GMASQTSVKVSDVRDANTISLNAHMTATSDAQSETFSSFRQDCNFFDLLERMKEELIVSS 180

Query: 396  CSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELE 455
            CSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELE
Sbjct: 181  CSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSLNEVVERNQSLVDELSHCRSELE 240

Query: 456  DVSTAKEKLRDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEG 515
            DVSTAKEKLRDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEG
Sbjct: 241  DVSTAKEKLRDQLLTAEAEIEKLSSKTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEG 300

Query: 516  EKERLNGIITFENENKIKLAEEKELYSDENQKILSELSSLKSLNVALEAENSKLMGSLSS 575
            EKERLNGIITFENENKIKLAEEKELYSDENQKILSELSSLKSLNVALEAENSKLMGSLSS
Sbjct: 301  EKERLNGIITFENENKIKLAEEKELYSDENQKILSELSSLKSLNVALEAENSKLMGSLSS 360

Query: 576  VAEAKTKLEEEREQLFQVNGTLSAELANCKNLVATQQEENMNLTKNLALVTEDRTKVEED 635
            VAE KTKLEEEREQLFQ+NGTLSAELANCKNLVATQQEENMNLTKNLALVTEDRTKVEED
Sbjct: 361  VAEEKTKLEEEREQLFQMNGTLSAELANCKNLVATQQEENMNLTKNLALVTEDRTKVEED 420

Query: 636  KNHLFHKNETMASELLVLDEILSTEHEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIY 695
            KNHLFHKNETMASELLVLDE LSTEHEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIY
Sbjct: 421  KNHLFHKNETMASELLVLDERLSTEHEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIY 480

Query: 696  KFKIEELCGEIISLQTRTREDEDRAENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLT 755
            KFKIEELCGEIISLQTRTREDEDRAENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLT
Sbjct: 481  KFKIEELCGEIISLQTRTREDEDRAENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLT 540

Query: 756  GGKPFEVTEQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPA 815
            GGKPFEVTEQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPA
Sbjct: 541  GGKPFEVTEQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPA 600

Query: 816  ISKLIQAFESQVNVEEDEVEAEIQSPNDPYKLSIELVENLRVLLRQVVVDSENASVLLKG 875
            ISKLIQAFESQVNVEEDEVEAEIQSPNDPYKLSIELVENLRVLLRQVVVDSENASVLLKG
Sbjct: 601  ISKLIQAFESQVNVEEDEVEAEIQSPNDPYKLSIELVENLRVLLRQVVVDSENASVLLKG 660

Query: 876  ERDHQNVAISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEIL 935
            ERDHQNVAISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEIL
Sbjct: 661  ERDHQNVAISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEIL 720

Query: 936  NKSLKQQATHHKNFNRELAERLCGYESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGG 995
            NKSLKQQATHHKNFNRELAERLCGYESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGG
Sbjct: 721  NKSLKQQATHHKNFNRELAERLCGYESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGG 780

Query: 996  AIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKT 1055
            AIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKT
Sbjct: 781  AIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKT 840

Query: 1056 INDLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGS 1115
            I+DLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGS
Sbjct: 841  IDDLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGS 900

Query: 1116 VSGSDMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEF 1175
            VSGSDMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEF
Sbjct: 901  VSGSDMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEF 960

Query: 1176 RERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSR 1235
            RERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSR
Sbjct: 961  RERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSR 1020

Query: 1236 EESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASRSELKDKVNELE 1295
            EESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASRSELKDKVNELE
Sbjct: 1021 EESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASRSELKDKVNELE 1080

Query: 1296 QTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELERCLQELQMKDTRLNETEM 1355
            QTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELERCLQELQMKDTRLNETEM
Sbjct: 1081 QTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELERCLQELQMKDTRLNETEM 1140

Query: 1356 KLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRD 1415
            KLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRD
Sbjct: 1141 KLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRIEEILDELDLPENFHSRD 1200

Query: 1416 IIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRR 1475
            IIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRR
Sbjct: 1201 IIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRR 1260

Query: 1476 KYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHR 1535
            KYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHR
Sbjct: 1261 KYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHR 1320

Query: 1536 SLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEI 1595
            SLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEI
Sbjct: 1321 SLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEI 1380

Query: 1596 IHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLV 1655
            IHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLV
Sbjct: 1381 IHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLISTEHKIGKLEALVSNALREEDMNDLV 1440

Query: 1656 PGSCSIEFLELMVMKLIQNYSASLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVL 1715
            PGSCSIEFLELMVMKLIQNYSASLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVL
Sbjct: 1441 PGSCSIEFLELMVMKLIQNYSASLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVL 1500

Query: 1716 KEDLEDAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKL 1775
            KEDLEDAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKL
Sbjct: 1501 KEDLEDAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKL 1560

Query: 1776 NVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVE 1835
            NVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVE
Sbjct: 1561 NVAVRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVE 1620

Query: 1836 ALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCF 1895
            ALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCF
Sbjct: 1621 ALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCF 1680

Query: 1896 DLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAES 1955
            DLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAES
Sbjct: 1681 DLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAES 1740

Query: 1956 SKLEALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFY 2015
            SKLEALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFY
Sbjct: 1741 SKLEALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFY 1800

Query: 2016 NLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIH 2075
            NLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIH
Sbjct: 1801 NLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIH 1860

Query: 2076 SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKV 2135
            SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKV
Sbjct: 1861 SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKV 1920

Query: 2136 QQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTA 2195
            QQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTA
Sbjct: 1921 QQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTA 1980

Query: 2196 PDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQE 2255
            PDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQE
Sbjct: 1981 PDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQE 2040

Query: 2256 KDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQ 2315
            KDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQ
Sbjct: 2041 KDIQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQ 2100

Query: 2316 LQDGLFISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEK 2375
            LQDGL ISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEK
Sbjct: 2101 LQDGLSISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEK 2160

Query: 2376 NHELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEV 2435
            NHELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEV
Sbjct: 2161 NHELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEV 2220

Query: 2436 TRCTNDALVATQTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKI 2495
            TRCTNDALVATQTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKI
Sbjct: 2221 TRCTNDALVATQTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKI 2280

Query: 2496 TSILKEIEDIQAASQRKDELLLVEKNKVEELKRKELQLNSLEDVGDDNKARSAAPEIFES 2555
            TSILKEIEDIQAASQRKDELLLVEKNKVEELK KELQLNSLEDVGDDNKARSAAPEIFES
Sbjct: 2281 TSILKEIEDIQAASQRKDELLLVEKNKVEELKCKELQLNSLEDVGDDNKARSAAPEIFES 2340

Query: 2556 EPLINKWAASSTITPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASS 2615
            EPLINKWAASSTITPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASS
Sbjct: 2341 EPLINKWAASSTITPQVRSLRKGNTDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASS 2400

Query: 2616 RLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV 2667
            RLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2401 RLVPKFSRRATDMIDGLWVSCDRALMRQPALRLGIIFYWAILHALVATFVV 2451

BLAST of CSPI03G44700 vs. NCBI nr
Match: KAA0038751.1 (centromere-associated protein E isoform X1 [Cucumis melo var. makuwa] >TYK31364.1 centromere-associated protein E isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 4567.3 bits (11845), Expect = 0.0e+00
Identity = 2471/2667 (92.65%), Positives = 2549/2667 (95.58%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNK+RSDLLAAGRKKLQQFRKKKD+KGSGSQG SSRNTSKLEQ DAD DI  GAAKST
Sbjct: 1    MDKNKNRSDLLAAGRKKLQQFRKKKDSKGSGSQGSSSRNTSKLEQQDADVDIVTGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SGRFSSD VLASSVD NPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAE SAIDQG
Sbjct: 61   SGRFSSDGVLASSVDGNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAETSAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYRE+FEH +QN EA GFVSSGPS+PTD+E NDNPTSNLSF ESSSQISSASV
Sbjct: 121  ETSMQEVGYREEFEHPIQNAEAIGFVSSGPSLPTDIEENDNPTSNLSFPESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEV GGCREEELLVSPSTSLLQAREDVG MGDA+MQ GQVHETE+AGDK LDTG
Sbjct: 181  EQQGRIVEVWGGCREEELLVSPSTSLLQAREDVG-MGDALMQSGQVHETELAGDKLLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKET C++EEDIAA VAS+SVAVT+SN+YSISSPGENLGM+NSSSSSRDD
Sbjct: 241  GTSESAAETTFKETHCDKEEDIAAEVASVSVAVTESNSYSISSPGENLGMDNSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WK+ERQVHAEDTIHSSRSQVESIPED+FAD SEGHGKASQTSVKVSDVRDANTISLN HM
Sbjct: 301  WKDERQVHAEDTIHSSRSQVESIPEDDFADQSEGHGKASQTSVKVSDVRDANTISLNEHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQS TFSSF QDCNFFDLLERMKEELIVSS SKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSGTFSSFGQDCNFFDLLERMKEELIVSSFSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVS AKEK RDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSIAKEKFRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVT+LEGEKERLNGIITFENENK KLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTVLEGEKERLNGIITFENENKRKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDEN+KILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQVNGTLSAE
Sbjct: 541  YSDENEKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQVNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCK+LVATQQEENMNLTKNLALVTEDRTKV+EDKN LFH+NETMASELLVL+E LSTE
Sbjct: 601  LANCKDLVATQQEENMNLTKNLALVTEDRTKVDEDKNRLFHENETMASELLVLEERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQL EENVFLSNGL+I+KFK+EELCGEIISLQTR+ EDED+A
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLIEENVFLSNGLNIHKFKLEELCGEIISLQTRSTEDEDQA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENA  D+YHG+NFQENVSSQI+FKKCLP+ SSVL GGKPF V+EQEIF DSLGFVTLGQH
Sbjct: 721  ENADCDRYHGDNFQENVSSQISFKKCLPDTSSVLAGGKPFMVSEQEIFDDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEA LMLQRLEKEITGLQSNSASSRSGSK AAPA+SKLIQAFES VNVEE EVEAEIQ 
Sbjct: 781  LEEAALMLQRLEKEITGLQSNSASSRSGSKAAAPAVSKLIQAFESHVNVEEHEVEAEIQP 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDS+NASVLLKGERDHQNVAIST NEFK+KFEALENY
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSKNASVLLKGERDHQNVAISTSNEFKEKFEALENY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNN VMANIEH VLF+C KHH+NDAGDKIYELEILNKSLKQQATHHKNFNRELAERL GY
Sbjct: 901  SNNLVMANIEHRVLFECLKHHVNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLRGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELE QLCDLPQSSNEMVSL+CNQLDNLQ GAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELEYQLCDLPQSSNEMVSLVCNQLDNLQEGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDT AIKFCTSD+LLSCISASV+DAVKTI+DLRERLQ TASN EACRM YEE+
Sbjct: 1021 LDESLGKSDTSAIKFCTSDRLLSCISASVVDAVKTIDDLRERLQTTASNSEACRMLYEEI 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGEL KLHIASCGSVSGSDMNMQIKM  DPLDYSNF AL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELHKLHIASCGSVSGSDMNMQIKM-DDPLDYSNFVAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVND+L  DLE  +VEFVEFRERCLDSIGIE+LIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDKLRLDLEHTSVEFVEFRERCLDSIGIEKLIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAI+LESMVSLLLQKYRESELQL LSREESES MMKLTG QESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIHLESMVSLLLQKYRESELQLSLSREESESIMMKLTGQQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEA+MASRSELKDKVNELEQ EQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEAVMASRSELKDKVNELEQAEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQ SSELERCLQELQMKDTRLNETE KLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQTSSELERCLQELQMKDTRLNETETKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSS GENL+HTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSTGENLVHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQ DANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQLDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RN+IVQRWEELLEKIDIPSH RSMEPEDKIEWLHRSLSEAC DRDSL QRVN LENY ES
Sbjct: 1501 RNVIVQRWEELLEKIDIPSHLRSMEPEDKIEWLHRSLSEACHDRDSLLQRVNDLENYCES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEII+HHNDHL FGTFEKEIEN VL NE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIYHHNDHLLFGTFEKEIENTVLLNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSN QD LISTEH I KLEALVSNALREEDMNDLVPGSC I FLELMVMKLIQNYSAS S
Sbjct: 1621 LSNMQDNLISTEHNIVKLEALVSNALREEDMNDLVPGSCRIGFLELMVMKLIQNYSASSS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GN VP S MNGADTEEMLARST+ QVAWQNDINVLK+DLEDAMHQLM VTKERDQYMEMH
Sbjct: 1681 GNAVPGSAMNGADTEEMLARSTDEQVAWQNDINVLKKDLEDAMHQLMAVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TEL+RLRSEMKSQENTLASYEQKF+DFSVYPG+VEALESENLSLKNRL E ESNLQEKEY
Sbjct: 1801 TELERLRSEMKSQENTLASYEQKFRDFSVYPGQVEALESENLSLKNRLNETESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSII+TLD IEVN+DV+ETDPIEKLKHVGKLC DLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIINTLDHIEVNVDVHETDPIEKLKHVGKLCSDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAE+SKLEALSELEKLSTLQL+ERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAETSKLEALSELEKLSTLQLRERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKAN+P  VN SPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANDPIGVNCSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTN+A DE VATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNAAEDENVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELV+ALESKVQQCESVAKDKEKEGDILCRSV +LL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVEALESKVQQCESVAKDKEKEGDILCRSVAVLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EAC STIKEV++RKGELMGNDLTSENLGVN ISTAP QLSR+GRTHLLSEEYVQTIADRL
Sbjct: 2161 EACTSTIKEVEERKGELMGNDLTSENLGVNIISTAPGQLSRSGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVR+FIGLKAEMFDGSV EMKIA++NLQKELQEKDIQKERICMDLVGQIKEAEG  TR
Sbjct: 2221 LLTVRKFIGLKAEMFDGSVKEMKIAMSNLQKELQEKDIQKERICMDLVGQIKEAEGITTR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKV ELEKVMEQMDNERK  EQRLR+LQDGL ISDELRERV+SLTDLLA+K
Sbjct: 2281 YSLDLQASKDKVHELEKVMEQMDNERKVLEQRLRELQDGLSISDELRERVRSLTDLLAAK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELE +ETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELESVETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGARAGLS IGHSDQ NEVHE KE+LKKKITSILKEIED+QAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARAGLSRIGHSDQENEVHERKELLKKKITSILKEIEDLQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASST-ITPQVRSLRKGN 2580
            NKVEELKRK+LQLNSLEDVGDDNKA S APEIFESEPLIN WAASST +TPQVRSLRKGN
Sbjct: 2521 NKVEELKRKKLQLNSLEDVGDDNKASSVAPEIFESEPLINTWAASSTSVTPQVRSLRKGN 2580

Query: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640
            TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA
Sbjct: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640

Query: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2667
            LMRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2665

BLAST of CSPI03G44700 vs. NCBI nr
Match: XP_008466297.1 (PREDICTED: centromere-associated protein E isoform X1 [Cucumis melo])

HSP 1 Score: 4557.7 bits (11820), Expect = 0.0e+00
Identity = 2466/2667 (92.46%), Positives = 2546/2667 (95.46%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGGSSRNTSKLEQHDADADIGIGAAKST 60
            MDKNK+RSDLLAAGRKKLQQFRKKKD+KGSGSQG SSRNTSKLEQ DAD DI  GAAKST
Sbjct: 1    MDKNKNRSDLLAAGRKKLQQFRKKKDSKGSGSQGSSSRNTSKLEQQDADVDIVTGAAKST 60

Query: 61   SGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAEASAIDQG 120
            SGRFSSD VLASSVD NPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAE SAIDQG
Sbjct: 61   SGRFSSDGVLASSVDGNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDLAETSAIDQG 120

Query: 121  ETSMQEVGYREDFEHTVQNVEASGFVSSGPSVPTDVEGNDNPTSNLSFAESSSQISSASV 180
            ETSMQEVGYRE+FEH +QN EA GFVSSGPS+PTD+E NDNPTSNLSF ESSSQISSASV
Sbjct: 121  ETSMQEVGYREEFEHPIQNAEAIGFVSSGPSLPTDIEENDNPTSNLSFPESSSQISSASV 180

Query: 181  EQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCMGDAVMQPGQVHETEIAGDKQLDTG 240
            EQQGRIVEV GGCREEELLVSPSTSLLQAREDVG MGDA+MQ GQVHETE+AGDK LDTG
Sbjct: 181  EQQGRIVEVWGGCREEELLVSPSTSLLQAREDVG-MGDALMQSGQVHETELAGDKLLDTG 240

Query: 241  GTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSISSPGENLGMENSSSSSRDD 300
            GTSESAAETTFKET C++EEDIAA VAS+SVAV +SN+YSISSPGENLGM+NSSSSSRDD
Sbjct: 241  GTSESAAETTFKETHCDKEEDIAAEVASVSVAVIESNSYSISSPGENLGMDNSSSSSRDD 300

Query: 301  WKEERQVHAEDTIHSSRSQVESIPEDNFADLSEGHGKASQTSVKVSDVRDANTISLNAHM 360
            WK+ERQVHAEDTIHSSRSQVESIPED+FAD SEGHGKASQTSVKVSDVRDANTISLN HM
Sbjct: 301  WKDERQVHAEDTIHSSRSQVESIPEDDFADQSEGHGKASQTSVKVSDVRDANTISLNEHM 360

Query: 361  TATSDAQSETFSSFRQDCNFFDLLERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRS 420
            TATSDAQS TFSSF QDCNFFDLLERMKEELIVSS SKEIFNMQITEQNELQMELDNHRS
Sbjct: 361  TATSDAQSGTFSSFGQDCNFFDLLERMKEELIVSSFSKEIFNMQITEQNELQMELDNHRS 420

Query: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSS 480
            KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVS AKEK RDQLLTAEAEIEKLSS
Sbjct: 421  KSTKDVALLNTSLNEVVERNQSLVDELSHCRSELEDVSIAKEKFRDQLLTAEAEIEKLSS 480

Query: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKEL 540
            KTSETENSLEKLHGDMFRLAKELDDCKHLVT+LEGEKERLNGIITFENENK KLAEEKEL
Sbjct: 481  KTSETENSLEKLHGDMFRLAKELDDCKHLVTVLEGEKERLNGIITFENENKRKLAEEKEL 540

Query: 541  YSDENQKILSELSSLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAE 600
            YSDEN+KILSELSSLKSLNVALEAENSKLMGSLSSVAE KTKLEEEREQLFQVNGTLSAE
Sbjct: 541  YSDENEKILSELSSLKSLNVALEAENSKLMGSLSSVAEEKTKLEEEREQLFQVNGTLSAE 600

Query: 601  LANCKNLVATQQEENMNLTKNLALVTEDRTKVEEDKNHLFHKNETMASELLVLDEILSTE 660
            LANCK+LVATQQEENMNLTKNLALVTEDRTKV+EDKN LFH+NETMASELLVL+E LSTE
Sbjct: 601  LANCKDLVATQQEENMNLTKNLALVTEDRTKVDEDKNRLFHENETMASELLVLEERLSTE 660

Query: 661  HEKRVKFEGDLKDALAQLDQLTEENVFLSNGLDIYKFKIEELCGEIISLQTRTREDEDRA 720
            HEKRVKFEGDLKDALAQLDQL EENVFLSNGL+I+KFK+EELCGEIISLQTR+ EDED+A
Sbjct: 661  HEKRVKFEGDLKDALAQLDQLIEENVFLSNGLNIHKFKLEELCGEIISLQTRSTEDEDQA 720

Query: 721  ENAGSDQYHGNNFQENVSSQITFKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQH 780
            ENA  D+YHGNNFQENVSSQI+FKKCLP+ SSVL GGKPF V+EQEIF DSLGFVTLGQH
Sbjct: 721  ENADCDRYHGNNFQENVSSQISFKKCLPDTSSVLAGGKPFMVSEQEIFDDSLGFVTLGQH 780

Query: 781  LEEAELMLQRLEKEITGLQSNSASSRSGSKTAAPAISKLIQAFESQVNVEEDEVEAEIQS 840
            LEEA LMLQRLEKEITGLQSNSASSRSGSK AAPA+SKLIQAFES VNVEE EVEAEIQ 
Sbjct: 781  LEEAALMLQRLEKEITGLQSNSASSRSGSKAAAPAVSKLIQAFESHVNVEEHEVEAEIQP 840

Query: 841  PNDPYKLSIELVENLRVLLRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENY 900
            PNDPYKLSIELVENLRVLLRQVVVDS+NASVLLKGERDHQNVAIST NEFK+KFEALE+Y
Sbjct: 841  PNDPYKLSIELVENLRVLLRQVVVDSKNASVLLKGERDHQNVAISTSNEFKEKFEALEDY 900

Query: 901  SNNWVMANIEHGVLFDCFKHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGY 960
            SNN VMANIEH VLF+C KHH+NDAGDKIYELEILNKSLKQQATHHKNFNRELAERL GY
Sbjct: 901  SNNLVMANIEHRVLFECLKHHVNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLRGY 960

Query: 961  ESTLTELERQLCDLPQSSNEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVK 1020
            ESTLTELE QLCDLPQSSNEMVSL+CN LDNLQ GAIERAMTLEKDWHSFLLELAETIVK
Sbjct: 961  ESTLTELEYQLCDLPQSSNEMVSLVCNLLDNLQEGAIERAMTLEKDWHSFLLELAETIVK 1020

Query: 1021 LDESLGKSDTPAIKFCTSDQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEV 1080
            LDESLGKSDT AIKFCTSD+LLSCISASV+DAVKTI+DLRERLQ TASN EACRM YEE+
Sbjct: 1021 LDESLGKSDTSAIKFCTSDRLLSCISASVVDAVKTIDDLRERLQTTASNSEACRMLYEEI 1080

Query: 1081 TEKYDSLFRRNEFTVDMLHKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEAL 1140
            TEKYDSLFRRNEFTVDMLHKLYGEL KLHIASCGSVSGSD+NMQIKM  DPLDYSNF AL
Sbjct: 1081 TEKYDSLFRRNEFTVDMLHKLYGELHKLHIASCGSVSGSDVNMQIKM-DDPLDYSNFVAL 1140

Query: 1141 IKSLEDCITEKLQLQSVNDRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLED 1200
            IKSLEDCITEKLQLQSVND+L  DLE  +VEFVEFRERCLDSIGIE+LIKDVQSVLSLED
Sbjct: 1141 IKSLEDCITEKLQLQSVNDKLRLDLEHTSVEFVEFRERCLDSIGIEKLIKDVQSVLSLED 1200

Query: 1201 TEKYHAEIPAIYLESMVSLLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLIL 1260
            TEKYHAEIPAI+LESMVSLLLQKYRESELQL LSREESES MMKLTG QESVNDLSTLIL
Sbjct: 1201 TEKYHAEIPAIHLESMVSLLLQKYRESELQLSLSREESESIMMKLTGQQESVNDLSTLIL 1260

Query: 1261 DHECEIVLLKESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIV 1320
            DHECEIVLLKESLSQAQEA+MASRSELKDKVNELEQ EQRVSAIREKLSIAVAKGKSLIV
Sbjct: 1261 DHECEIVLLKESLSQAQEAVMASRSELKDKVNELEQAEQRVSAIREKLSIAVAKGKSLIV 1320

Query: 1321 QRDNLKQLLAQNSSELERCLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNS 1380
            QRDNLKQLLAQ SSELERCLQELQMKDTRLNETE KLKTYSEAGERVEALESELSYIRNS
Sbjct: 1321 QRDNLKQLLAQTSSELERCLQELQMKDTRLNETETKLKTYSEAGERVEALESELSYIRNS 1380

Query: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQR 1440
            ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSS GENL+HTDWDQR
Sbjct: 1381 ATALRESFLLKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSTGENLVHTDWDQR 1440

Query: 1441 SSVAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500
            SSVAGGSGSDANFVITDAWKDEVQ DANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME
Sbjct: 1441 SSVAGGSGSDANFVITDAWKDEVQLDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLME 1500

Query: 1501 RNIIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSES 1560
            RN+IVQRWEELLEKIDIPSH RSMEPEDKIEWLHRSLSEAC DRDSL QRVN LENY ES
Sbjct: 1501 RNVIVQRWEELLEKIDIPSHLRSMEPEDKIEWLHRSLSEACHDRDSLLQRVNDLENYCES 1560

Query: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNE 1620
            LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEII+HHNDHL FGTFEKEIEN VL NE
Sbjct: 1561 LTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIYHHNDHLLFGTFEKEIENTVLLNE 1620

Query: 1621 LSNTQDKLISTEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNYSASLS 1680
            LSN QD LISTEH I KLEALVSNALREEDMNDLVPGSC I FLELMVMKLIQNYSAS S
Sbjct: 1621 LSNMQDNLISTEHNIVKLEALVSNALREEDMNDLVPGSCRIGFLELMVMKLIQNYSASSS 1680

Query: 1681 GNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLEDAMHQLMVVTKERDQYMEMH 1740
            GN VP S MNGADTEEMLARST+ QVAWQNDINVLK+DLEDAMHQLM VTKERDQYMEMH
Sbjct: 1681 GNAVPGSAMNGADTEEMLARSTDEQVAWQNDINVLKKDLEDAMHQLMAVTKERDQYMEMH 1740

Query: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMT 1800
            ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEM+
Sbjct: 1741 ESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEEMS 1800

Query: 1801 TELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEKEY 1860
            TEL+RLRSEMKSQENTLASYEQKF+DFSVYPG+VEALESENLSLKNRL E ESNLQEKEY
Sbjct: 1801 TELERLRSEMKSQENTLASYEQKFRDFSVYPGQVEALESENLSLKNRLNETESNLQEKEY 1860

Query: 1861 KLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLA 1920
            KLSSII+TLD IEVN+DV+ETDPIEKLKHVGKLC DLREAMFFSEQESVKSRRAAELLLA
Sbjct: 1861 KLSSIINTLDHIEVNVDVHETDPIEKLKHVGKLCSDLREAMFFSEQESVKSRRAAELLLA 1920

Query: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFS 1980
            ELNEVQERNDAFQEELAKASDEIAEMTRERDSAE+SKLEALSELEKLSTLQL+ERKNQFS
Sbjct: 1921 ELNEVQERNDAFQEELAKASDEIAEMTRERDSAETSKLEALSELEKLSTLQLRERKNQFS 1980

Query: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTV 2040
            QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKAN+P  VN SPSTV
Sbjct: 1981 QFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLEAAIESCTKANDPIGVNCSPSTV 2040

Query: 2041 SGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100
            SGAFKKDKGSFFALDSWLNSYTN+A DE VATEIHSQIVHQLEESMKEIGDLKEMIDGHS
Sbjct: 2041 SGAFKKDKGSFFALDSWLNSYTNAAEDENVATEIHSQIVHQLEESMKEIGDLKEMIDGHS 2100

Query: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLL 2160
            VSFHKQSDSLSKVLGELYQEVNSQKELV+ALESKVQQCESVAKDKEKEGDILCRSV +LL
Sbjct: 2101 VSFHKQSDSLSKVLGELYQEVNSQKELVEALESKVQQCESVAKDKEKEGDILCRSVAVLL 2160

Query: 2161 EACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRL 2220
            EAC STIKEV++RKGELMGNDLTSENLGVN ISTAP QLSR+GRTHLLSEEYVQTIADRL
Sbjct: 2161 EACTSTIKEVEERKGELMGNDLTSENLGVNIISTAPGQLSRSGRTHLLSEEYVQTIADRL 2220

Query: 2221 LLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKDIQKERICMDLVGQIKEAEGTATR 2280
            LLTVR+FIGLKAEMFDGSV EMKIA++NLQKELQEKDIQKERICMDLVGQIKEAEG  TR
Sbjct: 2221 LLTVRKFIGLKAEMFDGSVKEMKIAMSNLQKELQEKDIQKERICMDLVGQIKEAEGITTR 2280

Query: 2281 YSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTDLLASK 2340
            YSLDLQASKDKV ELEKVMEQMDNERK  EQRLR+LQDGL ISDELRERV+SLTDLLA+K
Sbjct: 2281 YSLDLQASKDKVHELEKVMEQMDNERKVLEQRLRELQDGLSISDELRERVRSLTDLLAAK 2340

Query: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITVTKF 2400
            DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELE +ETSRGKLTKKLSITVTKF
Sbjct: 2341 DQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELESVETSRGKLTKKLSITVTKF 2400

Query: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460
            DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI
Sbjct: 2401 DELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDINEVI 2460

Query: 2461 TWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQRKDELLLVEK 2520
            TWFDMVGAR GLS IGHSDQ NEVHE KE+LKKKITSILKEIED+QAASQRKDELLLVEK
Sbjct: 2461 TWFDMVGARVGLSRIGHSDQENEVHERKELLKKKITSILKEIEDLQAASQRKDELLLVEK 2520

Query: 2521 NKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAASST-ITPQVRSLRKGN 2580
            NKVEELKRK+LQLNSLEDVGDDNKA S APEIFESEPLIN WAASST +TPQVRSLRKGN
Sbjct: 2521 NKVEELKRKKLQLNSLEDVGDDNKASSVAPEIFESEPLINTWAASSTSVTPQVRSLRKGN 2580

Query: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640
            TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA
Sbjct: 2581 TDQVAIAIDVDPASSSNRLEDEDDDKVHGFKSLASSRLVPKFSRRATDMIDGLWVSCDRA 2640

Query: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2667
            LMRQPALRLGIIFYWAILHALVATFVV
Sbjct: 2641 LMRQPALRLGIIFYWAILHALVATFVV 2665

BLAST of CSPI03G44700 vs. TAIR 10
Match: AT4G31570.1 (CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G24460.1); Has 194354 Blast hits to 66887 proteins in 3244 species: Archae - 3688; Bacteria - 38556; Metazoa - 84828; Fungi - 17265; Plants - 10589; Viruses - 805; Other Eukaryotes - 38623 (source: NCBI BLink). )

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 1031/2821 (36.55%), Positives = 1570/2821 (55.65%), Query Frame = 0

Query: 1    MDKNKSRSDLLAAGRKKLQQFR---------KKKDNKGSGSQGGSSRNTSKLEQHDADAD 60
            MDK K+R+D LAAGR+KLQQFR         +KKD+KGS SQG SS+ ++K E+H+   D
Sbjct: 1    MDKKKNRADPLAAGRQKLQQFRQKKADKGTDQKKDSKGSTSQGKSSKKSNKSEKHERKPD 60

Query: 61   IGIGAAKSTSGRFSSDEVLASSVDRNPHIVDSSASSSTEHSLAAETDDHSTVSVKQEMDL 120
                + ++ +    +     S V+    +VDS  +SS +         H + S    +  
Sbjct: 61   TSAVSDEAQAPSPVTVGGATSHVNVAEEVVDSPQTSS-DTKAHEYVSVHGSSSEPDALQP 120

Query: 121  AEASAIDQGETSMQEVGYREDFEHTV----QNVEA--SGFVSSGPSVPTDVEGNDNPTSN 180
               ++ D  E   + V    D   ++    +NV++  SG   +  S+ +D   ++   ++
Sbjct: 121  GHTTSNDGSEARKEVVNSENDISKSLSTEEENVKSINSGVAGTVDSLISDPADSEKGVTH 180

Query: 181  LSFAESSSQISSASVEQQGRIVEVGGGCREEELLVSPSTSLLQAREDVGCM---GDAVMQ 240
               +      +++    +G  VEV GG    E    PS SL +   DV  +   GD V  
Sbjct: 181  DDASNVDGIFAASGNIAEGEGVEVEGGSGNVEKPHQPS-SLQEYIPDVSLIRARGDQVTD 240

Query: 241  PGQVHETEIAGDKQLDTGGTSESAAETTFKETRCNEEEDIAAGVASISVAVTKSNNYSIS 300
             G++ E ++    +L      +  A T  ++T      D +A  +  S   + + + ++ 
Sbjct: 241  VGEMQEEDMEQFSELSAKAGVDKIA-TEERQTSYPAVVDSSASPSHFSEGSSVAFD-TVE 300

Query: 301  SPGENLGMENSSSSSRDDWKEERQVHAEDTIHSSRSQVESIPED-NFADLSEGHGKASQT 360
              G N    +       +  EE+   + D  ++    + + PE+ + A+++         
Sbjct: 301  LEGINGNFRSQQIREAAELNEEKPETSIDFPNNRDHVLSAEPEESSVAEMASQLQLPESV 360

Query: 361  SV----KVSDVRDANTISLNAHMTATSDAQSETFSSFR---------QD-----CNFFDL 420
            S+       + R  +T++L+A +T+    +  + S  +         QD     CN  + 
Sbjct: 361  SISGVLSHEETRKIDTLNLSAELTSAHVHEGRSVSFLQLMDIVKGLGQDEYQILCNAREA 420

Query: 421  ----------LERMKEELIVSSCSKEIFNMQITEQNELQMELDNHRSKSTKDVALLNTSL 480
                      LER++EEL VSS  ++I ++Q+TEQ+ LQ+E D+  ++   +++ L  S 
Sbjct: 421  ASSTEPGTSSLERLREELFVSSTMEDILHVQLTEQSHLQIEFDHQHNQFVAEISQLRASY 480

Query: 481  NEVVERNQSLVDELSHCRSELEDVSTAKEKLRDQLLTAEAEIEKLSSKTSETENSLEKLH 540
            + V ERN SL +ELS C+S+L   +++   L +QLL  EA++E  ++K +E + SLEK  
Sbjct: 481  SAVTERNDSLAEELSECQSKLYAATSSNTNLENQLLATEAQVEDFTAKMNELQLSLEK-- 540

Query: 541  GDMFRLAKELDDCKHLVTMLEGEKERLNGIITFENENKIKLAEEKELYSDENQKILSELS 600
                    +L + K     L+ E + L  +I+  N+ K +L EEKE  + E + + SEL 
Sbjct: 541  -----SLLDLSETKEKFINLQVENDTLVAVISSMNDEKKELIEEKESKNYEIKHLSSELC 600

Query: 601  SLKSLNVALEAENSKLMGSLSSVAEAKTKLEEEREQLFQVNGTLSAELANCKNLVATQQE 660
            + K+L   L+AE  +   ++  + + K  L EE+  L      L  ELANCK +V  Q+ 
Sbjct: 601  NCKNLAAILKAEVEQFENTIGPLTDEKIHLVEEKYSLLGEAEKLQEELANCKTVVTLQEV 660

Query: 661  ENMNLTKNLALVTEDRTKVEE--------------------------------------- 720
            EN N+ + L+L+T  +T  EE                                       
Sbjct: 661  ENSNMKETLSLLTRQQTMFEENNIHLREENEKAHLELSAHLISETYLLSEYSNLKEGYTL 720

Query: 721  ----------DKNHLFHKNETMASELLVLDEILSTEHEKRVKFEGDLKDALAQLDQLTEE 780
                      +K HL  +N+ +  ELL L E +ST  E+R   E +L++A+A+LD+L EE
Sbjct: 721  LNNKLLKFQGEKEHLVEENDKLTQELLTLQEHMSTVEEERTHLEVELREAIARLDKLAEE 780

Query: 781  NVFLSNGLDIYKFKIEELCGEIIS--LQTRTREDEDRAENAGSDQYHGNNFQENVSSQIT 840
            N  L++ + + K ++ +     +S  +     E   R+   G  +    +F EN  +Q T
Sbjct: 781  NTSLTSSIMVEKARMVDNGSADVSGLINQEISEKLGRSSEIGVSK-QSASFLEN--TQYT 840

Query: 841  FKKCLPNPSSVLTGGKPFEVTEQEIFGDSLGFVTLGQHLEEAELMLQRLEKEITGLQSNS 900
                                  +E+   +  F  L ++LE+ E M+Q LE+ I  + ++S
Sbjct: 841  --------------------NLEEVREYTSEFSALMKNLEKGEKMVQNLEEAIKQILTDS 900

Query: 901  ASSRSGSKTAAPAISKLIQAFESQVNVEEDEVE----AEIQSPNDPYKLSIELVENLRVL 960
            + S+S  K A PA+SKLIQAFES+   EE E E     +  S  D +      + NLR L
Sbjct: 901  SVSKSSDKGATPAVSKLIQAFESKRKPEEPESENAQLTDDLSEADQFVSVNVQIRNLRGL 960

Query: 961  LRQVVVDSENASVLLKGERDHQNVAISTLNEFKDKFEALENYSNNWVMANIEHGVLFDCF 1020
            L Q+++++  A +      D +      L E   +F + +++ N      IE  V F+  
Sbjct: 961  LDQLLLNARKAGIQFNQLNDDRTSTNQRLEELNVEFASHQDHINVLEADTIESKVSFEAL 1020

Query: 1021 KHHLNDAGDKIYELEILNKSLKQQATHHKNFNRELAERLCGYESTLTELERQLCDLPQSS 1080
            KH+  +   K ++LE+L  SLK +  +    N EL ++L      + ELE QL +L Q+ 
Sbjct: 1021 KHYSYELQHKNHDLELLCDSLKLRNDNISVENTELNKKLNYCSLRIDELEIQLENLQQNL 1080

Query: 1081 NEMVSLICNQLDNLQGGAIERAMTLEKDWHSFLLELAETIVKLDESLGKSDTPAIKFCTS 1140
               +S +  QL  LQ  + ERAM +E +  S + E  E +V+LD+ L +S T      T 
Sbjct: 1081 TSFLSTMEEQLVALQDES-ERAMMVEHELTSLMSEFGEAVVRLDDCLLRSGTSGAH--TG 1140

Query: 1141 DQLLSCISASVIDAVKTINDLRERLQATASNGEACRMSYEEVTEKYDSLFRRNEFTVDML 1200
              +   IS SV  AV  I DL+E+L+A     E+    YEE+ + +++LF +NEFT   +
Sbjct: 1141 LDMTKRISGSVDVAVNVIEDLKEKLEAAYVKHESTSNKYEELKQSFNTLFEKNEFTASSM 1200

Query: 1201 HKLYGELQKLHIASCGSVSGSDMNMQIKMVGDPLDYSNFEALIKSLEDCITEKLQLQSVN 1260
             K+Y +L KL   SCGS   + + ++   V DP    +FE L++++   ++E+L+LQSV 
Sbjct: 1201 QKVYADLTKLITESCGSAEMTSLEVENVAVFDPFRDGSFENLLEAVRKILSERLELQSVI 1260

Query: 1261 DRLCTDLERRTVEFVEFRERCLDSIGIEELIKDVQSVLSLEDTEKYHAEIPAIYLESMVS 1320
            D+L +DL  ++ +  E  +R LDS  + EL++ V+ +L LE    +  E P+  +E +VS
Sbjct: 1261 DKLQSDLSSKSNDMEEMTQRSLDSTSLRELVEKVEGLLELESGVIF--ESPSSQVEFLVS 1320

Query: 1321 LLLQKYRESELQLGLSREESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQE 1380
             L+QK+ E E    L R++ E+K  +L  ++ES       +L H+ +I  L+ESL+QA+E
Sbjct: 1321 QLVQKFIEIEELANLLRKQLEAKGNELMEIEES-------LLHHKTKIAGLRESLTQAEE 1380

Query: 1381 ALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLAQNSSELER 1440
            +L+A RSEL+DK NELEQ+EQR+ + REKLSIAV KGK LIVQRDN+KQ LA+ S++L++
Sbjct: 1381 SLVAVRSELQDKSNELEQSEQRLLSTREKLSIAVTKGKGLIVQRDNVKQSLAEASAKLQK 1440

Query: 1441 CLQELQMKDTRLNETEMKLKTYSEAGERVEALESELSYIRNSATALRESFLLKDSVLQRI 1500
            C +EL  KD RL E E KLKTY EAGERVEALESELSYIRNSATALRESFLLKDS+L RI
Sbjct: 1441 CSEELNSKDARLVEVEKKLKTYIEAGERVEALESELSYIRNSATALRESFLLKDSLLHRI 1500

Query: 1501 EEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGSDANFVITDA 1560
            EEIL++LDLPE+FH+RDI++K++WLA+S+ G +   + WDQ+SS  G     A FV+++ 
Sbjct: 1501 EEILEDLDLPEHFHARDILEKVEWLARSANGNSSRPSGWDQKSSDGG-----AGFVLSEP 1560

Query: 1561 WKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLMERNIIVQRWEELLEKIDIP 1620
            W+++VQ   +  DDLR K+EEL+ KFYGLAEQNEMLEQSLMERN +VQRWE+LLE IDIP
Sbjct: 1561 WREDVQTGTSSEDDLRIKFEELKGKFYGLAEQNEMLEQSLMERNTLVQRWEKLLENIDIP 1620

Query: 1621 SHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSESLTADLDDSQKKISHIEAE 1680
                SME E+KIEWL  +++EA  DRD+L Q+++ LE Y +S+T DL+ SQK++  +E  
Sbjct: 1621 PQLHSMEVENKIEWLASTITEATHDRDNLQQKIDNLEVYCQSVTTDLEVSQKQVGDVEGN 1680

Query: 1681 LQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNELSNTQDKLI--------- 1740
            LQS + ER  LSE+LE +   ++ LS      E+EN  LQN++ +  +KL+         
Sbjct: 1681 LQSCVSERVNLSERLESLIGDHESLSARGIHLEVENEKLQNQVKDLHEKLVEKLGNEEHF 1740

Query: 1741 -STEHKIGKLEALVSNALREEDMNDLVPGSCSIEFLELMVMKLIQNY----SASLSGNT- 1800
             + E  +  L  ++ + ++E+ + DL   S S E L+ ++ KLI  Y     +SL G T 
Sbjct: 1741 QTIEGDLLSLRYMIDDVIQEDGLQDLALASNS-ENLDGVLRKLIDYYKNLVKSSLPGETD 1800

Query: 1801 --------VPRSIMNG-----------------ADTEEMLARSTEAQVAWQNDINVLKED 1860
                        + +G                 +D+  + A S +  V    D+  L +D
Sbjct: 1801 DNVCETRPSDADVRSGESLGAHGATSHGQHFELSDSNVVEATSRDIAVVETPDVASLTKD 1860

Query: 1861 LEDAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVA 1920
            L+ A+H   +  +ERD YM   +SL+ + E+LDKK  EL+E L  EEQKS SVREKLNVA
Sbjct: 1861 LDQALHVQKLTREERDLYMAKQQSLVAENEALDKKIIELQEFLKQEEQKSASVREKLNVA 1920

Query: 1921 VRKGKSLVQQRDTLKQTIEEMTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALE 1980
            VRKGK+LVQQRD+LKQTIEE+  EL RL+SE+  ++  L   E+KF++   Y  RVE+LE
Sbjct: 1921 VRKGKALVQQRDSLKQTIEEVNAELGRLKSEIIKRDEKLLENEKKFRELESYSVRVESLE 1980

Query: 1981 SENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLR 2040
            SE   LK    E E  LQE+   LS  ++ L+ I++  + +  DP+ KL+ + +L   + 
Sbjct: 1981 SECQLLKIHSQETEYLLQERSGNLSMTLNALNSIDIGDEGDINDPVMKLQRISQLFQTMS 2040

Query: 2041 EAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKL 2100
              +  +EQES KSRRAAELLLAELNEVQE ND+ QE+L+K + EI +++RE+D+AE++K+
Sbjct: 2041 TTVTSAEQESRKSRRAAELLLAELNEVQETNDSLQEDLSKFTYEIQQLSREKDAAEAAKV 2100

Query: 2101 EALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEINSLLVDAFSRDLDAFYNLE 2160
            EA+S  E LS +  +E+   ++Q +   + ++ L++ L   NS L D F  D++  ++L+
Sbjct: 2101 EAISRFENLSAVSNEEKNKLYAQLLSCGTSVNSLRKILAGTNSCLADIFIMDMEFLHHLK 2160

Query: 2161 AAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALD-SWLNSYTNSAMDEKVATEIHSQ 2220
            A +E C K    T+++  P  +S     DK  F  L  +W N   +         EI   
Sbjct: 2161 ANMELCAK-KTGTDLSGLPQ-LSTENLVDKEIFARLSAAWSNINLHETSSGGNIAEICGS 2220

Query: 2221 IVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQ 2280
            +   L++ +  +  L+E +  H  ++H Q + +S  +   +                   
Sbjct: 2221 LSQNLDQFVVGVSHLEEKVSKHLATWHDQINIVSNSIDTFF------------------- 2280

Query: 2281 CESVAKDKEKEGDILCRSVDMLLEACRSTIKEVDQRKGELMGNDLTSENLGVNFISTAPD 2340
             +S+    + E   L   + +L  AC S + E+++RK EL+GND    N+ ++ +     
Sbjct: 2281 -KSIGTGTDSEVAALGERIALLHGACSSVLVEIERRKAELVGND--DFNMSLHQVD---- 2340

Query: 2341 QLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKAEMFDGSVTEMKIAIANLQKELQEKD 2400
                       S E V+++ +RL   V+E +   AE  + +  EMK+ IANLQ+EL EKD
Sbjct: 2341 -------EDFSSMESVRSMVNRLSSAVKELVVANAETLERNEKEMKVIIANLQRELHEKD 2400

Query: 2401 IQKERICMDLVGQIKEAEGTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQ 2460
            IQ  R C +LVGQ+KEA+  A  ++ DLQ++  ++R+++  +  +  ER + ++R+++L 
Sbjct: 2401 IQNNRTCNELVGQVKEAQAGAKIFAEDLQSASARMRDMQDQLGILVRERDSMKERVKELL 2460

Query: 2461 DGLFISDELRERVKSLTDLLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNH 2520
             G     EL+E+V SL+DLLA+KD EIEALM ALDEEE QME L  ++ ELE+ +++KN 
Sbjct: 2461 AGQASHSELQEKVTSLSDLLAAKDLEIEALMQALDEEESQMEDLKLRVTELEQEVQQKNL 2520

Query: 2521 ELEGIETSRGKLTKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTR 2580
            +L+  E SRGK++KKLSITV KFDELHHLSE+LL E+EKLQ Q+QDRD E+SFLRQEVTR
Sbjct: 2521 DLQKAEASRGKISKKLSITVDKFDELHHLSENLLAEIEKLQQQVQDRDTEVSFLRQEVTR 2580

Query: 2581 CTNDALVATQT-SNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKIT 2640
            CTN+AL A+Q  + R +E+I  V++WFD + +  G+     +D  + ++   E  +K+I 
Sbjct: 2581 CTNEALAASQMGTKRDSEEIQTVLSWFDTIASLLGIEDSLSTDADSHINHYMETFEKRIA 2640

Query: 2641 SILKEIEDIQAASQRKDELLLVEKNKVEELKRKELQLNS--LEDVGDDNKARSAAPEIFE 2667
            S+L EI++++   Q KD LL  E+++V EL++KE  L    LE     + + S+  EI E
Sbjct: 2641 SMLSEIDELRLVGQSKDVLLEGERSRVAELRQKEATLEKFLLEKESQQDISTSSTSEIVE 2700

BLAST of CSPI03G44700 vs. TAIR 10
Match: AT1G24460.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31570.1). )

HSP 1 Score: 176.4 bits (446), Expect = 3.0e-43
Identity = 350/1538 (22.76%), Positives = 646/1538 (42.00%), Query Frame = 0

Query: 1271 ESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLA 1330
            E +++ +E   + R+E +    ELE  + + +  +EKLS+AV KGK+L+  RD LK  L+
Sbjct: 288  EQVNREKEMCESMRTEFEKLKAELELEKTKCTNTKEKLSMAVTKGKALVQNRDALKHQLS 347

Query: 1331 QNSSELERCLQELQMKDTRLNETE-MKLKTYSEAGERVEALESELSYIRNSATALRESFL 1390
            + ++EL   L ELQ K+  L  +E MK +      E+ + LE   + + + + +L    L
Sbjct: 348  EKTTELANRLTELQEKEIALESSEVMKGQLEQSLTEKTDELEKCYAELNDRSVSLEAYEL 407

Query: 1391 LKDSVLQ-------RIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQRSS 1450
             K  + Q        +EE L +L        +  +DK + LAKS    + +   + +  S
Sbjct: 408  TKKELEQSLAEKTKELEECLTKLQEMSTALDQSELDKGE-LAKS----DAMVASYQEMLS 467

Query: 1451 VAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLMERN 1510
            V      +   ++++ +  E     ++ + +R            LAE+ + L     E N
Sbjct: 468  VRNSIIENIETILSNIYTPEEGHSFDIVEKVR-----------SLAEERKELTNVSQEYN 527

Query: 1511 IIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSESLT 1570
                R ++L+  ID+P        E ++ WL  S  +    +D ++   N +E+ S SL+
Sbjct: 528  ----RLKDLIVSIDLPEEMSQSSLESRLAWLRESFLQG---KDEVNALQNRIESVSMSLS 587

Query: 1571 ADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNELS 1630
            A++++     S+I  EL  +    +K+ E  E           G+ E+E           
Sbjct: 588  AEMEEK----SNIRKELDDLSFSLKKMEETAE----------RGSLERE----------- 647

Query: 1631 NTQDKLISTEHKIGKLEALVSNALRE---EDMNDLVPGSCSIEFLELMVMKLIQNYSASL 1690
                       ++ +   L++  + +    D+N LV  S         + K I++ S S 
Sbjct: 648  -------EIVRRLVETSGLMTEGVEDHTSSDINLLVDRSFD------KIEKQIRDSSDSS 707

Query: 1691 SGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDL-EDAMHQLMVVTKERDQYME 1750
             GN            EE+             + ++ KE L E  +    V     +  + 
Sbjct: 708  YGN------------EEIFEAFQSLLYVRDLEFSLCKEMLGEGELISFQVSNLSDELKIA 767

Query: 1751 MHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEE 1810
              E   VK E +  +KD     L   E+KS  +R+KL++A++KGK LVQ R+  K  ++E
Sbjct: 768  SQELAFVKEEKIALEKD-----LERSEEKSALLRDKLSMAIKKGKGLVQDREKFKTQLDE 827

Query: 1811 MTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEK 1870
              +E+++L  E++    T+  Y+ +    S    R + LE+E ++ K    +++ +L   
Sbjct: 828  KKSEIEKLMLELQQLGGTVDGYKNQIDMLSRDLERTKELETELVATKEERDQLQQSLSLI 887

Query: 1871 EYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELL 1930
            +  L  ++ +++ I + +D+   DP EK+  +     +++ A    ++E  K +   + L
Sbjct: 888  DTLLQKVMKSVEIIALPVDLASEDPSEKIDRLAGYIQEVQLARVEEQEEIEKVKSEVDAL 947

Query: 1931 LAELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQ 1990
             ++L E Q      ++ L+ A D I+ +T E  + +++K  A  EL+K +        ++
Sbjct: 948  TSKLAETQTALKLVEDALSTAEDNISRLTEENRNVQAAKENAELELQK-AVADASSVASE 1007

Query: 1991 FSQFMGLKSGLD-RLKEALHEINSLL---VDAFSRDLDAFYNLE-AAIESCTKANEPTEV 2050
              + +  KS L+  L +A   I+ ++    +A  R   A    E    E+  + N+ TE 
Sbjct: 1008 LDEVLATKSTLEAALMQAERNISDIISEKEEAQGRTATAEMEQEMLQKEASIQKNKLTEA 1067

Query: 2051 NPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQL----EESMKEI 2110
            +   ST++      + +    +S ++S +    D+KV T      + +L    E    ++
Sbjct: 1068 H---STINSL----EETLAQTESNMDSLSKQIEDDKVLTTSLKNELEKLKIEAEFERNKM 1127

Query: 2111 GDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQC---------ES 2170
             +    I  H  +  K  +SLS + GE+   V ++ E +  L SK+  C          S
Sbjct: 1128 AEASLTIVSHEEALMKAENSLSALQGEM---VKAEGE-ISTLSSKLNVCMEELAGSSGNS 1187

Query: 2171 VAKDKE------------KEGDIL-------------CRSVDMLLEACRSTIKEVDQRKG 2230
             +K  E            K+G ++              R VD++       I E     G
Sbjct: 1188 QSKSLEIITHLDNLQMLLKDGGLISKVNEFLQRKFKSLRDVDVIARDITRNIGENGLLAG 1247

Query: 2231 ELMGN--DLTSENLGV-----NFISTAP----------DQLSRT-----------GRTHL 2290
            E MGN  D ++E   +     N ++T P          D++S +            +T  
Sbjct: 1248 E-MGNAEDDSTEAKSLLSDLDNSVNTEPENSQGSAADEDEISSSLRKMAEGVRLRNKTLE 1307

Query: 2291 LSEEYVQTIADRLLLT-----------VREFIGLKAEM------FDGSVTEMKIAIANLQ 2350
             + E   T  D L+ T           V   +G  + +       +  V E +  I+ LQ
Sbjct: 1308 NNFEGFSTSIDTLIATLMQNMTAARADVLNIVGHNSSLEEQVRSVENIVREQENTISALQ 1367

Query: 2351 KEL-----------QEKDIQKERICMDLVGQIK-----EAEGTATRYSLDLQASKDKVRE 2410
            K+L           +E  ++ +   ++LV   +     E E T     L +     +++E
Sbjct: 1368 KDLSSLISACGAAARELQLEVKNNLLELVQFQENENGGEMESTEDPQELHVSECAQRIKE 1427

Query: 2411 LEKVMEQMDNERKAFEQR-------LRQLQDGLFISDELRERVKSLTDLLASKDQEIEAL 2470
            L    E+     K FE         +R +++ L  +    E+     +    K+ E+  L
Sbjct: 1428 LSSAAEKACATLKLFETTNNAAATVIRDMENRLTEASVALEKAVVKEEKWHEKEVELSTL 1487

Query: 2471 MHAL--DEEEVQ--------MEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSITV 2530
               L   E+E +        M  L +KI  +E    +    + G++       KKL   V
Sbjct: 1488 YDKLLVQEQEAKENLIPASDMRTLFDKINGIEVPSVDL---VNGLDPQSPYDVKKLFAIV 1547

Query: 2531 TKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTEDIN 2590
                E+ H  + L    ++L + L ++D EI  L++     +   L   +  N    +++
Sbjct: 1548 DSVTEMQHQIDILSYGQKELNSTLAEKDLEIQGLKKATEAESTTELELVKAKN----ELS 1607

Query: 2591 EVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQAASQR------ 2650
            ++I+  + +      ++       +E     + L+KKITS+L E E  ++ +Q       
Sbjct: 1608 KLISGLEKLLGILASNNPVVDPNFSESWTLVQALEKKITSLLLESESSKSRAQELGLKLA 1667

Query: 2651 -----KDELLLVEKNKVEELKRKELQLNSLEDVGDDNKARSAAPEIFESEPLINKWAAS- 2659
                  D+L L  K   E+L+ K +Q + +++       R  AP   E   + +K A   
Sbjct: 1668 GSEKLVDKLSLRVKEFEEKLQTKAIQPDIVQERSIFETPR--APSTSEISEIEDKGALGI 1724

BLAST of CSPI03G44700 vs. TAIR 10
Match: AT1G24460.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31570.1); Has 181008 Blast hits to 85359 proteins in 3551 species: Archae - 3290; Bacteria - 48304; Metazoa - 70793; Fungi - 13943; Plants - 10118; Viruses - 785; Other Eukaryotes - 33775 (source: NCBI BLink). )

HSP 1 Score: 158.7 bits (400), Expect = 6.6e-38
Identity = 352/1609 (21.88%), Positives = 652/1609 (40.52%), Query Frame = 0

Query: 1271 ESLSQAQEALMASRSELKDKVNELEQTEQRVSAIREKLSIAVAKGKSLIVQRDNLKQLLA 1330
            E +++ +E   + R+E +    ELE  + + +  +EKLS+AV KGK+L+  RD LK  L+
Sbjct: 288  EQVNREKEMCESMRTEFEKLKAELELEKTKCTNTKEKLSMAVTKGKALVQNRDALKHQLS 347

Query: 1331 QNSSELERCLQELQMKDTRLNETE-MKLKTYSEAGERVEALESELSYIRNSATALRESFL 1390
            + ++EL   L ELQ K+  L  +E MK +      E+ + LE   + + + + +L    L
Sbjct: 348  EKTTELANRLTELQEKEIALESSEVMKGQLEQSLTEKTDELEKCYAELNDRSVSLEAYEL 407

Query: 1391 LKDSVLQ-------RIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQRSS 1450
             K  + Q        +EE L +L        +  +DK + LAKS    + +   + +  S
Sbjct: 408  TKKELEQSLAEKTKELEECLTKLQEMSTALDQSELDKGE-LAKS----DAMVASYQEMLS 467

Query: 1451 VAGGSGSDANFVITDAWKDEVQPDANVGDDLRRKYEELQTKFYGLAEQNEMLEQSLMERN 1510
            V      +   ++++ +  E     ++ + +R            LAE+ + L     E N
Sbjct: 468  VRNSIIENIETILSNIYTPEEGHSFDIVEKVR-----------SLAEERKELTNVSQEYN 527

Query: 1511 IIVQRWEELLEKIDIPSHFRSMEPEDKIEWLHRSLSEACRDRDSLHQRVNYLENYSESLT 1570
                R ++L+  ID+P        E ++ WL  S  +    +D ++   N +E+ S SL+
Sbjct: 528  ----RLKDLIVSIDLPEEMSQSSLESRLAWLRESFLQG---KDEVNALQNRIESVSMSLS 587

Query: 1571 ADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENIVLQNELS 1630
            A++++     S+I  EL  +    +K+ E  E           G+ E+E           
Sbjct: 588  AEMEEK----SNIRKELDDLSFSLKKMEETAE----------RGSLERE----------- 647

Query: 1631 NTQDKLISTEHKIGKLEALVSNALRE---EDMNDLVPGSCSIEFLELMVMKLIQNYSASL 1690
                       ++ +   L++  + +    D+N LV  S         + K I++ S S 
Sbjct: 648  -------EIVRRLVETSGLMTEGVEDHTSSDINLLVDRSFD------KIEKQIRDSSDSS 707

Query: 1691 SGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDL-EDAMHQLMVVTKERDQYME 1750
             GN            EE+             + ++ KE L E  +    V     +  + 
Sbjct: 708  YGN------------EEIFEAFQSLLYVRDLEFSLCKEMLGEGELISFQVSNLSDELKIA 767

Query: 1751 MHESLIVKVESLDKKKDELEELLNLEEQKSTSVREKLNVAVRKGKSLVQQRDTLKQTIEE 1810
              E   VK E +  +KD     L   E+KS  +R+KL++A++KGK LVQ R+  K  ++E
Sbjct: 768  SQELAFVKEEKIALEKD-----LERSEEKSALLRDKLSMAIKKGKGLVQDREKFKTQLDE 827

Query: 1811 MTTELKRLRSEMKSQENTLASYEQKFKDFSVYPGRVEALESENLSLKNRLTEMESNLQEK 1870
              +E+++L  E++    T+  Y+ +    S    R + LE+E ++ K    +++ +L   
Sbjct: 828  KKSEIEKLMLELQQLGGTVDGYKNQIDMLSRDLERTKELETELVATKEERDQLQQSLSLI 887

Query: 1871 EYKLSSIISTLDQIEVNIDVNETDPIEKLKHVGKLCFDLREAMFFSEQESVKSRRAAELL 1930
            +  L  ++ +++ I + +D+   DP EK+  +     +++ A    ++E  K +   + L
Sbjct: 888  DTLLQKVMKSVEIIALPVDLASEDPSEKIDRLAGYIQEVQLARVEEQEEIEKVKSEVDAL 947

Query: 1931 LAELNEVQERNDAFQEELAKASDEIAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQ 1990
             ++L E Q      ++ L+ A D I+ +T E  + +++K  A  EL+K +        ++
Sbjct: 948  TSKLAETQTALKLVEDALSTAEDNISRLTEENRNVQAAKENAELELQK-AVADASSVASE 1007

Query: 1991 FSQFMGLKSGLD-RLKEALHEINSLL---VDAFSRDLDAFYNLE-AAIESCTKANEPTEV 2050
              + +  KS L+  L +A   I+ ++    +A  R   A    E    E+  + N+ TE 
Sbjct: 1008 LDEVLATKSTLEAALMQAERNISDIISEKEEAQGRTATAEMEQEMLQKEASIQKNKLTEA 1067

Query: 2051 NPSPSTVSGAFKKDKGSFFALDSWLNSYTNSAMDEKVATEIHSQIVHQL----EESMKEI 2110
            +   ST++      + +    +S ++S +    D+KV T      + +L    E    ++
Sbjct: 1068 H---STINSL----EETLAQTESNMDSLSKQIEDDKVLTTSLKNELEKLKIEAEFERNKM 1127

Query: 2111 GDLKEMIDGHSVSFHKQSDSLSKVLGELYQEVNSQKELVQALESKVQQC---------ES 2170
             +    I  H  +  K  +SLS + GE+   V ++ E +  L SK+  C          S
Sbjct: 1128 AEASLTIVSHEEALMKAENSLSALQGEM---VKAEGE-ISTLSSKLNVCMEELAGSSGNS 1187

Query: 2171 VAKDKE------------KEGDIL-------------CRSVDMLLEACRSTIKEVDQRKG 2230
             +K  E            K+G ++              R VD++       I E     G
Sbjct: 1188 QSKSLEIITHLDNLQMLLKDGGLISKVNEFLQRKFKSLRDVDVIARDITRNIGENGLLAG 1247

Query: 2231 ELMGNDLTSENLGV-------------------NFISTAP----------DQLSRT---- 2290
            E+   ++T+  L                     N ++T P          D++S +    
Sbjct: 1248 EMGNAEVTAVLLITLLYFQDDSTEAKSLLSDLDNSVNTEPENSQGSAADEDEISSSLRKM 1307

Query: 2291 -------GRTHLLSEEYVQTIADRLLLT-----------VREFIGLKAEM------FDGS 2350
                    +T   + E   T  D L+ T           V   +G  + +       +  
Sbjct: 1308 AEGVRLRNKTLENNFEGFSTSIDTLIATLMQNMTAARADVLNIVGHNSSLEEQVRSVENI 1367

Query: 2351 VTEMKIAIANLQKEL-----------QEKDIQKERICMDLVGQIK-----EAEGTATRYS 2410
            V E +  I+ LQK+L           +E  ++ +   ++LV   +     E E T     
Sbjct: 1368 VREQENTISALQKDLSSLISACGAAARELQLEVKNNLLELVQFQENENGGEMESTEDPQE 1427

Query: 2411 LDLQASKDKVRELEKVMEQMDNERKAFEQR-------LRQLQDGLFISDELRERVKSLTD 2470
            L +     +++EL    E+     K FE         +R +++ L  +    E+     D
Sbjct: 1428 LHVSECAQRIKELSSAAEKACATLKLFETTNNAAATVIRDMENRLTEASVALEKAVLERD 1487

Query: 2471 LLASKDQEIEALMHALD----------------EEEVQMEGLTNKI-------------- 2530
            L  +K    EA + +L+                E+EV++  L +K+              
Sbjct: 1488 LNQTKVSSSEAKVESLEELCQDLKLQVKEEKWHEKEVELSTLYDKLLVQEQGNFYLLLSL 1547

Query: 2531 -----------------------EELEKVLKEKN-----HELEGIETSRGKL-------- 2590
                                   E  E ++   +      ++ GIE     L        
Sbjct: 1548 ISLNLHHIITTILKCHVLLLRIAEAKENLIPASDMRTLFDKINGIEVPSVDLVNGLDPQS 1607

Query: 2591 ---TKKLSITVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVAT 2650
                KKL   V    E+ H  + L    ++L + L ++D EI  L++     +   L   
Sbjct: 1608 PYDVKKLFAIVDSVTEMQHQIDILSYGQKELNSTLAEKDLEIQGLKKATEAESTTELELV 1667

Query: 2651 QTSNRSTEDINEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKKKITSILKEIEDIQ 2659
            +  N    +++++I+  + +      ++       +E     + L+KKITS+L E E  +
Sbjct: 1668 KAKN----ELSKLISGLEKLLGILASNNPVVDPNFSESWTLVQALEKKITSLLLESESSK 1727

BLAST of CSPI03G44700 vs. TAIR 10
Match: AT1G65010.1 (Plant protein of unknown function (DUF827) )

HSP 1 Score: 50.8 bits (120), Expect = 1.9e-05
Identity = 241/1174 (20.53%), Positives = 482/1174 (41.06%), Query Frame = 0

Query: 1236 EESESKMMKLTGLQESVNDLSTLILDHECEIVLLKESLSQAQEALMASR---SELKDKVN 1295
            E+  S+++K T LQ  +N +   +   + +I LLK+  ++A + L  S     E  +K+ 
Sbjct: 71   EKVHSRLVKGTELQTQLNQIQEDLKKADEQIELLKKDKAKAIDDLKESEKLVEEANEKLK 130

Query: 1296 ELEQTEQRVS---------AIREKLSIAVAKGKSLIVQRDNLKQLLAQN----------S 1355
            E    ++R           A+  + +   A  K  +  ++ L+ + +Q+          +
Sbjct: 131  EALAAQKRAEESFEVEKFRAVELEQAGLEAVQKKDVTSKNELESIRSQHALDISALLSTT 190

Query: 1356 SELERCLQELQM----KDTRLNETEMKLKTYSEAGERVEALESELSYIRNSATALRESFL 1415
             EL+R   EL M    K+  L+  E   K      E+ E L SEL  ++    +  E   
Sbjct: 191  EELQRVKHELSMTADAKNKALSHAEEATKIAEIHAEKAEILASELGRLKALLGSKEEKEA 250

Query: 1416 LKDSVLQRIEEILDELDLPENFHSRDIIDKIDWLAKSSMGENLLHTDWDQRSSVAGGSGS 1475
            ++ + +  + ++  E++L      R  ++K+  L  S   +  L          A  + S
Sbjct: 251  IEGNEI--VSKLKSEIEL-----LRGELEKVSILESSLKEQEGLVEQLKVDLEAAKMAES 310

Query: 1476 DANFVITDAWKDEV-QPDANVGDDLRRK---YEELQTKFYGLAEQNEMLEQSLMERNIIV 1535
              N  + + WK++V + +  V +  R K    E +++    LAE N +L ++  + N   
Sbjct: 311  CTNSSV-EEWKNKVHELEKEVEESNRSKSSASESMESVMKQLAELNHVLHETKSD-NAAQ 370

Query: 1536 QRWEELLEK------IDIPSHFRSM----EPEDKIEWLHRSLSEACRDRDSLHQRVNYLE 1595
            +   ELLEK       D+  + R +    E   K+E L  S+        S  ++   L+
Sbjct: 371  KEKIELLEKTIEAQRTDLEEYGRQVCIAKEEASKLENLVESIKSEL--EISQEEKTRALD 430

Query: 1596 NYSESLTADLDDSQKKISHIEAELQSVLLEREKLSEKLEIIHHHNDHLSFGTFEKEIENI 1655
            N  ++ T+++ +   + + +  EL+   +E EK  + +E +       S  + E +   +
Sbjct: 431  N-EKAATSNIQNLLDQRTELSIELERCKVEEEKSKKDMESLTLALQEASTESSEAKATLL 490

Query: 1656 VLQNELSNTQDKL----ISTEHKIGKLEALVSNALREED---------MNDLVPGSCSIE 1715
            V Q EL N + ++    ++++    K E ++ +A  E D          N+        E
Sbjct: 491  VCQEELKNCESQVDSLKLASKETNEKYEKMLEDARNEIDSLKSTVDSIQNEFENSKAGWE 550

Query: 1716 FLELMVMKLIQNYSA--SLSGNTVPRSIMNGADTEEMLARSTEAQVAWQNDINVLKEDLE 1775
              EL +M  ++      S S   V R +    ++EE      E + + +N++ V + +++
Sbjct: 551  QKELHLMGCVKKSEEENSSSQEEVSRLVNLLKESEEDACARKEEEASLKNNLKVAEGEVK 610

Query: 1776 DAMHQLMVVTKERDQYMEMHESLIVKVESLDKKKDELEELLNLEE------QKSTSVREK 1835
                 L    + + + M++ ESL+ K E L     E+  L   E       ++ + V+E 
Sbjct: 611  YLQETL---GEAKAESMKLKESLLDKEEDLKNVTAEISSLREWEGSVLEKIEELSKVKES 670

Query: 1836 LNVAVRKGKSLVQQRDTLK-------QTIEEMTTELKRLRSEMKSQENTLASYEQKFKDF 1895
            L     K +S+ Q+ + LK       + IEE++T    L  E    ++ +   E   +  
Sbjct: 671  LVDKETKLQSITQEAEELKGREAAHMKQIEELSTANASLVDEATKLQSIVQESEDLKEKE 730

Query: 1896 SVYPGRVEALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNET--DPIE 1955
            + Y  ++E L   N SL + +T+++S +QE +      ++ L +IE     NE+  D   
Sbjct: 731  AGYLKKIEELSVANESLADNVTDLQSIVQESKDLKEREVAYLKKIEELSVANESLVDKET 790

Query: 1956 KLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASD--EI 2015
            KL+H+ +            E E ++ R A+   L ++ E+ + N+   + +A   +  E 
Sbjct: 791  KLQHIDQ------------EAEELRGREASH--LKKIEELSKENENLVDNVANMQNIAEE 850

Query: 2016 AEMTRERDSAESSKLEALSE-----LEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHE 2075
            ++  RER+ A   K++ LS       + ++ LQ    +N+  +    ++ L +  E L E
Sbjct: 851  SKDLREREVAYLKKIDELSTANGTLADNVTNLQNISEENK--ELRERETTLLKKAEELSE 910

Query: 2076 INSLLVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWL 2135
            +N  LVD  S+           +++  + NE      +                      
Sbjct: 911  LNESLVDKASK-----------LQTVVQENEELRERET---------------------- 970

Query: 2136 NSYTNSAMDEKVATEIHSQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVLGELY 2195
             +Y     +     EI S    +L+ S  E  +LKE       ++ K+ + LSKV  +L 
Sbjct: 971  -AYLKKIEELSKLHEILSDQETKLQISNHEKEELKE----RETAYLKKIEELSKVQEDLL 1030

Query: 2196 QEVNSQKELVQALESKVQQCESVAKDKEKE-----GDILCRSVDMLLEACR--------- 2255
             + N    +V  +E  ++  +S+A+ K +E       +L +  ++    C          
Sbjct: 1031 NKENELHGMVVEIED-LRSKDSLAQKKIEELSNFNASLLIKENELQAVVCENEELKSKQV 1090

Query: 2256 STIKEVDQRKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRLLLTV 2312
            ST+K +D+     +   L  +   +       ++L       L   E +  +   L+   
Sbjct: 1091 STLKTIDELSD--LKQSLIHKEKELQAAIVENEKLKAEAALSLQRIEELTNLKQTLIDKQ 1150

BLAST of CSPI03G44700 vs. TAIR 10
Match: AT2G32240.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to cadmium ion; LOCATED IN: plasma membrane; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G05320.3); Has 470429 Blast hits to 168274 proteins in 4282 species: Archae - 6896; Bacteria - 131956; Metazoa - 175525; Fungi - 33166; Plants - 25441; Viruses - 2243; Other Eukaryotes - 95202 (source: NCBI BLink). )

HSP 1 Score: 47.4 bits (111), Expect = 2.1e-04
Identity = 174/860 (20.23%), Positives = 335/860 (38.95%), Query Frame = 0

Query: 1716 KEDLEDAMHQLMVVTKE------RDQYMEMHESLIVKVESLDKKKDELEELLNLEEQKST 1775
            K+D E A H  +   KE           E+HES   K + L+ + + +   L   E ++T
Sbjct: 59   KDDAEKADHVPVEEQKEVIERSSSGSQRELHESQ-EKAKELELELERVAGELKRYESENT 118

Query: 1776 SVREKLNVAVRKGKSLVQQRDTL----KQTIEEMTTELKRLRSEMKSQENTLASYEQKFK 1835
             ++++L  A  K +   ++   L    K+  E++    +R  S++KS E+ L S++ K K
Sbjct: 119  HLKDELLSAKEKLEETEKKHGDLEVVQKKQQEKIVEGEERHSSQLKSLEDALQSHDAKDK 178

Query: 1836 DFSVYPGRVEALESENLSLKNRLTEMESNLQEKEYKLSSIISTLDQIEVNIDVNETDPI- 1895
            + +      +AL  E  S + +L E+E  L+    +         Q   + D      + 
Sbjct: 179  ELTEVKEAFDALGIELESSRKKLIELEEGLKRSAEEAQKFEELHKQSASHADSESQKALE 238

Query: 1896 --EKLKHVGKLCFDLREAMFFSEQESVKSRRAAELLLAELNEVQERNDAFQEELAKASDE 1955
              E LK   +   ++ E M   +QE           + ELNE    N+  +  L  ++ E
Sbjct: 239  FSELLKSTKESAKEMEEKMASLQQE-----------IKELNEKMSENEKVEAALKSSAGE 298

Query: 1956 IAEMTRERDSAESSKLEALSELEKLSTLQLKERKNQFSQFMGLKSGLDRLKEALHEINSL 2015
            +A +  E   ++S  LE   ++     L + E   +  Q    K+   R KE L  +  L
Sbjct: 299  LAAVQEELALSKSRLLETEQKVSSTEAL-IDELTQELEQ---KKASESRFKEELSVLQDL 358

Query: 2016 LVDAFSRDLDAFYNLEAAIESCTKANEPTEVNPSPSTVSGAFKKDKGSFFALDSWLNSYT 2075
              DA ++ L A  + +  I S   A E  E     S      +K + +        N   
Sbjct: 359  --DAQTKGLQAKLSEQEGINS-KLAEELKEKELLESLSKDQEEKLRTA--------NEKL 418

Query: 2076 NSAMDEKVATEIH-SQIVHQLEESMKEIGDLKEMIDGHSVSFHKQSDSLSKVL---GELY 2135
               + EK A E + +++   +    +   +L+E +     +F K    LS+ L    EL 
Sbjct: 419  AEVLKEKEALEANVAEVTSNVATVTEVCNELEEKLKTSDENFSKTDALLSQALSNNSELE 478

Query: 2136 QEVNSQKELVQALESKVQQCESVAKDKEKEGDILCRSVDMLLEACRSTIKEVD------Q 2195
            Q++ S +E    L S+     + A  K  E + + RS     E  +S IKE++      +
Sbjct: 479  QKLKSLEE----LHSEAGSAAAAATQKNLELEDVVRSSSQAAEEAKSQIKELETKFTAAE 538

Query: 2196 RKGELMGNDLTSENLGVNFISTAPDQLSRTGRTHLLSEEYVQTIADRLLLTVREFIGLKA 2255
            +K   +   L    L  +       +LS        + E  +    +    ++E+   KA
Sbjct: 539  QKNAELEQQLNLLQLKSSDAERELKELSEKSSELQTAIEVAEEEKKQATTQMQEY-KQKA 598

Query: 2256 EMFDGSVTEMKIAIANLQKELQ-----------------EKDIQKERICMDLVGQIKEAE 2315
               + S+T+     + L+++L+                 ++ I+ E +C     + ++AE
Sbjct: 599  SELELSLTQSSARNSELEEDLRIALQKGAEHEDRANTTHQRSIELEGLCQSSQSKHEDAE 658

Query: 2316 GTATRYSLDLQASKDKVRELEKVMEQMDNERKAFEQRLRQLQDGLFISDELRERVKSLTD 2375
            G      L LQ  K +++ELE+ +  ++ +                   E     K    
Sbjct: 659  GRLKDLELLLQTEKYRIQELEEQVSSLEKKH-----------------GETEADSKGYLG 718

Query: 2376 LLASKDQEIEALMHALDEEEVQMEGLTNKIEELEKVLKEKNHELEGIETSRGKLTKKLSI 2435
             +A    E+++ + A   +   +E   N   E EK L E    L  + + + KL   +  
Sbjct: 719  QVA----ELQSTLEAFQVKSSSLEAALNIATENEKELTE---NLNAVTSEKKKLEATVDE 778

Query: 2436 TVTKFDELHHLSESLLTEVEKLQAQLQDRDAEISFLRQEVTRCTNDALVATQTSNRSTED 2495
               K  E  +L ES+  E+   Q +L+  + ++     + +        A ++  +   +
Sbjct: 779  YSVKISESENLLESIRNELNVTQGKLESIENDLKAAGLQESEVMEKLKSAEESLEQKGRE 838

Query: 2496 INEVITWFDMVGARAGLSHIGHSDQANEVHECKEVLKK------KITSILKEIEDIQAAS 2530
            I+E  T       R  L  +  S   +  H  ++ +++      + +S+ +++ D++   
Sbjct: 839  IDEATT------KRMELEALHQSLSIDSEHRLQKAMEEFTSRDSEASSLTEKLRDLEGKI 856

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
Q15075      8.1e-09  19.00     Early endosome antigen 1 OS=Homo sapiens OX=9606 GN=EEA1 PE=1 SV=2
Q13439      2.2e-06  19.28     Golgin subfamily A member 4 OS=Homo sapiens OX=9606 GN=GOLGA4 PE=1 SV=1
C9ZN16      2.7e-04  20.89     Flagellar attachment zone protein 1 OS=Trypanosoma brucei gambiense (strain MHOM...

Match Name  E-value  Identity  Description
A0A0A0LDV2  0.0e+00  99.66     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881880 PE=4 SV=1
A0A5A7TAW3  0.0e+00  92.65     Centromere-associated protein E isoform X1 OS=Cucumis melo var. makuwa OX=119469...
A0A1S3CS85  0.0e+00  92.46     centromere-associated protein E isoform X1 OS=Cucumis melo OX=3656 GN=LOC1035037...
A0A1S3CQW7  0.0e+00  92.66     centromere-associated protein E isoform X2 OS=Cucumis melo OX=3656 GN=LOC1035037...
A0A6J1F6C6  0.0e+00  81.82     major antigen-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442749 PE=4...

Match Name      E-value  Identity  Description
XP_011652533.1  0.0e+00  99.66     centromere-associated protein E isoform X1 [Cucumis sativus]
KGN60307.2      0.0e+00  98.61     hypothetical protein Csa_002649 [Cucumis sativus]
XP_004136448.1  0.0e+00  99.67     centromere-associated protein E isoform X2 [Cucumis sativus]
KAA0038751.1    0.0e+00  92.65     centromere-associated protein E isoform X1 [Cucumis melo var. makuwa] >TYK31364....
XP_008466297.1  0.0e+00  92.46     PREDICTED: centromere-associated protein E isoform X1 [Cucumis melo]

Match Name   E-value  Identity  Description
AT4G31570.1  0.0e+00  36.55     CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053); BEST Arabidopsis tha...
AT1G24460.2  3.0e-43  22.76     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G24460.1  6.6e-38  21.88     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G65010.1  1.9e-05  20.53     Plant protein of unknown function (DUF827)
AT2G32240.1  2.1e-04  20.23     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to cadmium ion; ...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term     Source Description   Alignment
None      No IPR available  COILS        Coil            Coil                 coord: 2114..2148
None      No IPR available  COILS        Coil            Coil                 coord: 2239..2259
None      No IPR available  COILS        Coil            Coil                 coord: 774..801
None      No IPR available  COILS        Coil            Coil                 coord: 2285..2319
None      No IPR available  COILS        Coil            Coil                 coord: 601..621
None      No IPR available  COILS        Coil            Coil                 coord: 545..565
None      No IPR available  COILS        Coil            Coil                 coord: 2323..2385
None      No IPR available  COILS        Coil            Coil                 coord: 664..691
None      No IPR available  COILS        Coil            Coil                 coord: 1558..1592
None      No IPR available  COILS        Coil            Coil                 coord: 1712..1732
None      No IPR available  COILS        Coil            Coil                 coord: 2519..2539
None      No IPR available  COILS        Coil            Coil                 coord: 1834..1861
None      No IPR available  COILS        Coil            Coil                 coord: 1273..1314
None      No IPR available  COILS        Coil            Coil                 coord: 1740..1774
None      No IPR available  COILS        Coil            Coil                 coord: 1322..1346
None      No IPR available  COILS        Coil            Coil                 coord: 447..488
None      No IPR available  COILS        Coil            Coil                 coord: 2488..2508
None      No IPR available  COILS        Coil            Coil                 coord: 1915..1963
None      No IPR available  COILS        Coil            Coil                 coord: 1789..1823
None      No IPR available  COILS        Coil            Coil                 coord: 2407..2434
None      No IPR available  GENE3D       1.10.287.1490   -                    coord: 1759..1889; e-value: 2.1E-5; score: 25.9
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 144..181
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 59..92
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 277..322
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 296..315
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 277..295
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 1..118
None      No IPR available  PANTHER      PTHR43939:SF50  NUCLEOPORIN          coord: 538..2666
None      No IPR available  PANTHER      PTHR43939       FAMILY NOT NAMED     coord: 538..2666

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G44700.1  CSPI03G44700.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006336      DNA replication-independent chromatin assembly
biological_process  GO:0042981      regulation of apoptotic process
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005634      nucleus
molecular_function  GO:0031491      nucleosome binding