Csa3G000010 (gene) Cucumber (Chinese Long) v2

Name: Csa3G000010
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: MAU2 chromatid cohesion factor homolog; contains IPR019440 (Cohesin loading factor)
Location: Chr3 : 4499 .. 25313 (+)
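
As an illustration of how the location field can be read programmatically, the following Python sketch splits a location string of the form shown above into chromosome, start, end and strand, and derives the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Pattern for strings such as "Chr3 : 4499 .. 25313 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Parse a 'Chrom : start .. end (strand)' string into a dict.

    Assumption: start and end are 1-based, inclusive coordinates,
    so the span length is end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("Chr3 : 4499 .. 25313 (+)"))
```

Under the inclusive-coordinate assumption, the gene spans 20815 bases on the plus strand of Chr3.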
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
TAATATTATATGAAAATATTCCTTCATTGTCGTCCGAGCTTTGGCATCCCCTTCCTGGCGACTAAAAATTGGCACCAATCTAATTGACCCCCCTACTGAAAACTAGAAATCTACTTAATTTTGGCGCGGGGAGGACGAAAGCATCCTTTCAACTGAACTTCATTTCACGACATGGAAGCAGTGGCGGAGGGATTGTGGAGGCTGGCAGATTACCACGAGAAGCAGGGGGAATTAGGGAAGGCAATAAAGTGTTTGGAAGCCATATGCCAAAGTCCCGTCTCTTTCTTTCCAGTCCTTGAGGTCAAGACTCGGCTTCGAATTGCCACCCTTCTGCTCACTTATTCTCACAACGTCAATCATGCCAAATCCCATCTCGAGCGTTCTGTATGATTTATTTATTTATTTAGCTTTTTTTTTATTTCGCACAAATAATGACGTAGAGTTAGCTATTTATGAACGATGGGTTTTCCCGGCAGCAATTGTTACTGAAGTCTATTCCATCATGCTTCGAATTGAAGTGCCGGGCTTATAGCTTATTAAGCCAATGCTATCATCTCGTTGGTGCTATTCCTCCCCAGAAACAACTTCTTTATAAGGGTCTTGATCTCACAAATTCTGCTGGTCACGAGTAAGCATTTATCATAGTTCCGTAAGCCTAATGAATCATTTCTACATTTCTTTCGCCAAGTAACGTAATTATTTATTTTTTCTTTTTTGGTTTGTTTGTCGCCCTCTCATTTTTTATTGTTGGTAATAATGGATCATCCTGCAATTTTTTTGATTTAGTGCTACTTTATCAACTTTACTTATTGTGTTTTTCTTCATGCACTCATTTAAGACTGTCGGTGAAACTATGGTCTTGCAACTTCAATTCACAGCTAGCAAATGCTTTGATTATTGAAGGAGACTATCAAAATTCAATCTCTGCCTTGGAGTCGGGATACGTATTTTCAGCTGAGATATGCTATCCTGAACTGCAGGTTGGCATCTAAATTTTTGTTTTCGTGCTCACTGCCATATTTCTTTATTCTTCTGGAAGCATATCTGTTTGTTTTTTCTTTATATATATATATTGATAGGAAACATAAAATTTCATTGATAAGATGAAATCACTAAAAAGGATCCTATCCAATGGATCACAAAAAACTTCTCCAATTGGTCATGGGATTAGATAAGCTACAATTATCCTTATATATGTGCGAGGAAAGATATTATCCCCAAAAGTATAAAATTTCATGGCTCAAAGAGGGGGATGAAAACACAAGTTTGTTTCACCGAATGCTGTCAGCAAAGGAAAGAAAGTGTTCAATGTCAGAATTGCATTCTACGGGCACTTCTTTGCTGTCTTTTAATGAGATTGGGGAGGAAATAATCAGTTTTCTTAATTCAGTGTACCAGAACATGTCTGGAAATAGATATATGCCTTTTAACTCGGTTTGGAATACAGTTTTGGGCTTTTAGATTCACAAAATTCCAAGCCCATTACGCCCTTTATGGATGAAAAAATTCGAATTGGAATTAAGTGACTTGGTAAAAACGCCCTTGGATCAGACGGTTTACCAAATAATTCTACATAAATTTTGGAACTTTTGCATAAGAGATTGCCAAAAGCTATGGAGTGAATTCTTAGGAAGCAAAAAAATGCTGTTGAGTGAATTTCATCGAAATGGCCATGTTGAGATTGCCAAAAGCTGTTGGGCGAATGCCTGCTTTGCTTGAGAGCAAATTTTGTTTGTCTTGTTAAAAAACCAAGACAGGTACCTTTGTAAAGGACTTTAGACCAATCAGTTTGACTTCCTCAATTTCCAAGATCCAGGCTAAAGTCCTGGCAGAAAGGATTAAGGAAGTTATGCCTTTTATCATCTCTCCCTGCCAAAGTGCGGTTGTAGAAGGAAGATAAATATTTGACCCTCTCTTGATTGGGAAATGAAGTTGTGGAAGACCATAAAGTTGAAAAGAAGAAATGGTGGATTTTGTAACTAGATTTTGAAAAGGCATCA
ATTGTGTGGTTTGGGATTTGCTAGAAGCAGTTATACAACAAAAAAGCTTCGATGAACAGCTAGCAACAAGGATTTTCTGGAAGCAGTTCAGTGTCAAACAATCTCCATATTTACTATATGTTTATTTTCCTCATGCTCTCTTCTGCAGCTTGGATTTAGAATGGAATGTGGTATGCAAAATTTCTTTCGGGAAGGAACTATTGCAGCAAATTAAATAATTTGGTTTGATGGAATTTGGCTTCTCAATCCAAGCTTCATGGAGGTTTAGGAATCAAGCTTTCAATCATAAAAATTTGACTTGGCTTGCCAAATGGATTTGGAGATTTTTTCATGAGCCAATCTTCTTCCTCATCGCTTTTCTAAAATCGAATAAGCCGTAGCATTTTTCTATGTCTAATTCAGTGCGAAGAATCAGATTCAATACTCTGAGAAATACTTTGATGATATCTATGAGTACAGGCATGTGGTACTTTCTCTTGAAGTGGCAAAACTTCTCCCCAAGAATCGCCTTCTTTTCGAAAACGAATGGCCAGTGATTGGTGTTCAACAAAGCCGTGGATGGGTTCATCATGCACTTCATCGCCCAAAGCAACACATATTGCTATTCAAAAGGCCTCTCAACTATAAACATTAACAAGAGAATCAAGCACAATAACAGATTCTCGCCAAGTAATTAGTTGATTATTGTTTCCTTTTTTAAAGGGAAACAAGTCTCTTTTATTGAAGATTATCACAAGACAAAAGCTCAAGAGAGTTATACAATAAGCAAAATAACCTAGGAATTAGTAGACGCACCCGGACATCTCAACTAGGTTGACACCCCCTTAGCATCCTTATCATATTTGTTCAGTTCTAATTTTAGTAAGGTTGTTCATATATTATTGAAAGAAGTTTACTTTCTGATGTAGTGTTTAACTATTTTTGGAGCTTTAGAAGATTTACCACCCTTGGAAGAGTTAAAATTATTAAAGGAACAAAACCATCTAGCAAACTGGGGGCTTTGCTGTGATTAGGAAGGTTTAGAAATTCAAAATTGCCGAAATGCATAAATATCTTACCCCTCTTTAGGTCTGTAATTTCTGAACAAAACCACATAAATTTTTCTTAACCTGAATTCGGGCTTCGCTACTGTTTGTTAGATTCCATGTTTCAGTGGCTATGTCAATAAGATCTCCAAAATGATCTTCAATTACCTCAAACATGCTTCTACACCAATAGTTCAAAGGAAGATTCTCGATTTTAAGCCAACCTCCGTATCCTTTTAAAACCAGTGGCCTACTGTGTTTCATAGTCACATTTTTCAAAATTCTGGAATTTGAGATGGTGATTCCCCCAGGCCTGCCACTTTCCTTCCTCACTGGTGAAGTCATTGATTTAACCTTGATTGAAACTAATGAGAGCGTTTTCATAAAAGAGAGGGATTATATCAATATTGGTTTGAAATATCTTTCCCAGCCGCTTGCGGATCGATCTCCAGTCATCAGATGTAAATAATTTGGATATAATCCATAGACTCGAAATTAACCTTGGCCACTTCATGATTCTTGATTAACCATTGTTGTTTGCCTTTTTTGGCTAAAGACCTGGCAGGGTTTCTTTTTTTCAGGCCTTGTAACAAGGGTTTTAACACTACTAGACTTGACCATCTCTACGTAGCTTAGATTTTCTACCGACGACAGAGGGGGAATCTTGGTTGTGTTGCCCGAGAACCATTTGATATAATCCACTTTTTCTACAAATTTTTCCAGCATCTTGCAAAACGACATCAACCTTGTTGATTTTCTCTGGAGCAGACTCTTAAAATGGAGTGACCTTCTGAATAAAGCCAATGATTACAACTAAGGACCCACCCGGAACTTTGCTTGAAACTTGGAAACCTTCAGCCTTCCTCTGTCAATGTCAGCATATTTTTTGAAAAATCGATCAATTGGAGTTCTGAGGAGTTTTGATATAGCAGATTGAGTCCATCGGAGTTTATCATCAGAAATAGAAATATATCT
GTTGGCTTCCACATCTTCAATGATACAACTACCATTCTCTTTCCAAATACAGTTATGCGACTGATTGATTTTACAACGAACAACCTCCATTCTGAACTTTGATGCTCGTCGGAAAGGACAGACTACTGAAAAGGAATGAGGAGAGAGGGGAGAGGGGAGAGGGGAAAGGAATGATGTCCTTGAGTTGTTTCTGGAGGAAGGTATAGAAGAAGAAGAACCTCTCCATAAATTCAGAGAAGAAATCCCACAGCACTTGCATGGCATCATTGAAGCTTGTGGTATATCTTTCCTTTGAGTTCTACAAAATTTATCTCCAGCAGTTAAAGTAGTCTTGGATGTGGTAGTGCATTGAAAGTCCTTTCGTAGAATACAAGGGACCTAGGAGTTAATAGAAAAGAATGGTAGTTAGGAATTTGATGCAATGAGCTGTGGTTGTTTGGATGGTTAGGATTTCTCTGTTTTAGGATTGATAGATCTCCTAGCTGAAATGTTCGAGTGCGCCTCTTGATCCCTCTAGGACTCTCTTGATTATAACTTCTTTATTACTCTCTTTGTATAAATGTCTTTTACTTTGAGTTCATATTAATAAAGAGGTTTTGACTCCGTTTCAAAAAAAAGAATTGATAGATCTCCTGTTTTATGTTTATTTTGTTTAGTTTAAGTAGTTACCATGTGGTAACTCCTTGTATAACTTTTTCTGTAATTTACAGTTGAGTTAGCCTCTAATTAAGGGCCGTAATTTCCAGTTGATTTGGCCTCTAATTAAGGGCCAGCCAACTTCATTAATGCATTATGAGGTATTTTTCTTAAAGGTTCTCTCTTTCACCAACTTGTCTACACAATTTTTTTTTGAGAAAAACCCGGAACAATACAGAATTCACCGCAGCAGACTAAAATCTTCTAAAATGGCACCAGGAAAATAATTAAATTATCAAACCCTCGAGGTTTCTGAAAAAGTTGCTTTTGGGATCCTTTTTTAGTGCTCTGTTATAATTTAAATGTTTTTAATATAGGACGATCCCTTCATTCTTTCTTTTTTTTTTTGGGTAGACGCAGCTTGCAACTTCTATTACTAATTAAGGAGAATCAGTTTATGAAACACTTGTAAACTCAACTACCAAAAGGACTTATATTGACATATGAAACTTCCTTTTGGGTTGAATTTCTTGATTGAACTTCTTTGTGTTAGTTTTGCCTCCTTTCACTAGTAAAATTCATTCGAAAAATTCAGTGGCTTTCATACACATCCAACTAATTATCTTGTTTTTATTTGTGAAAGATGTTTTTTGCTACTTCTATCCTTCATGTGCACCTTATGCAATGGTACGATGACAATTCTGTTCAGCAAGCTGTTAACAAATGTGATGAGGTCTGGGAATCTATTGAACCTGAGAAAGTAAGTGGATTTCTTCCCTTTTATGTTCTCCATAGTTGTTTGTCAGCAACTCAACATCTAAAGATGACTAGGCTTATCTATCTTTAACTGTAATTTTTTTGCTCTGATTAGAATTATTTGTTCACGTTTACTTACTCTTTCTTCACTCCTTGGTCTCCGTCTTACCTCTCCAATAAAGAGACAGCAGTGTGTTGGTTTATTATTTTACAATGAGCTGCTGCACATATTTTATCGACTTCGGATCTGTGACTACAAGAATGCTGCTCAGCATTTAGATAAATTAGATGCTGCTATGAAGGCTGATCTGCAACAAACTCAGTACATAGAAGATCTGAATAAAGAAATGAATGCCTTAAATCAGAGTCTATCCAGGTCTGATCTACATTATAAAGATAGGTTGGCTCTCACTGGAAAACATGCTCAGCTTCAGGAACAGCTTAGAAGTATAACTAGACCAACTTCCTTGAGCAAAGAATCTTTGGAACCTGGGCATTTTGGAAACGTGAGAAGAACATATAGAGACAAACTTGAGTTGGCACCATATCCTATTGATGGGGAGTGGCTACCAAAAAGTGCTGTGTACGCGCTTGTGGATCTTATGGTT
GTTATCTTTAGCCGGCCTAAAGGTCTCTTCAAAGAGTGCACAAAACGAATTTTATCTGGAATGCTTACTATACAAGGTAAATATTTTAGAATGTTCACACCTTAACAACTACGAAACTGAATATACATTTGTCCTAGATTAACTGATGTAGCCTAAAGAATAAAATGAGAGAAATCTCCTAGAATAAAGTCAAAATCAGCCGCCTGGTGTACTTTGACAAAGGAGTTTAGCGCCTATACTATTCAAGACTTGTGTTTGAACTGGCCAGCCTTCACCACTTGAGTTTTTTGGCTTTTCTTTGTTCCATTTCTCTGTTTTTTGTTTCCTAATGTATTTCTGTTTTACGTAGGAATGTATTTATGAATTTCTTTGGATACTAGTTTCTTGATTGGAAATGATAAGGTAGCTAACGACCTTACTTTTTTGAAATGGAGACAAACCTATTTATTAATAATAAATGAGACTAATGCTCAAAGTACAAGAGTACTAAGAGCAAAATAAGAAAAAGACAGTCATCAAAGATGAACCCAAACAAAACATATGGGAAAAAGACCAACAAACAATCACAAAGTAGAAACAACAAAAGAACCAAAGCAACAAAGAGTTGAGACACTAGTTAAGATCTTAATACAAAGCAAATATATATACTTTCATGAAAAGCACCCACTTCAGATTGAAATTTTGAATACCACATGCTTCAATTCGAAAGGAGAAGGGATCCATCGGTTGCTTCAATTAGAAATTTTTGCTCTTTCTTGAATATGAGAGAATTAGTTCTGAGTAAGTGGTCAAGAGACTAGAATCAACACTTTATCTGCAACCTTCAAACCTTACGAAGTACCTGATTATTGGACCAAAAGATGGCTGAAATCAGATCTCGAAAGAAAACTAATTTGCATTTTTCTACTATTGAGAAATTATCTTCATTGACCTAGATGAGAAGGAGGAAGGTTATCAAATTTTCATTAAACCAAAATGATTTCACAATCATTGACTAAGGGCAGCAGATCCTTAGGCAATTCCAAGGGAACTTTGATGGGAACAACACTGGCTTCATCAAATAGAAAAGAGAGATCAGCCCCTTCAATTTTTTCATTATTCTTGATATCAAAACCATTTGTAAACCCTAGGGCCTCCTCACTACTGACACTAAATGGGAACTCCAAGGTCTTGGAGGTCGGTTTGTCAATGTCGGCACGACAAGGGGAGTCTTGCAACAAATTAACTTTAGAGTTAAAAATGGGGAGTTTTGAATATTTGAAATGCTTGACAGGAGAGCCTGAGTTTATGGACCTGGAAAGACGATTAGAAGTTTGGATTTTGGAGCTGCAAACCTCCAAAAGGTCAGGATTGTTTGCTGAGATAGAATGCTTTGAATTATGCCGCCGAGAGGGGGCTGAAGCAAAAACTGAAGGAGAAGGATCCTTGAATTATGCTTCCAAACTAATTAGTCTTGTTGCAGAAACCAGAAACTGAAGCTGCGGAGGCCATAATAGGAAGTTGCCGTTGGTTGGAGGAAAAAATGTAATCTTTAATTACTTTTGAGGAAATCTCATAATTCTGCATGATAGTACTTAAAGGAGATTTACCCTTTTTTCTTTCACTACTTAGATTTACATCTGCCGAATTAATGGGAAGTTGGTTAAAATCCATGTGCCCTAGTTTCTTTTTAATTCAATTGTGACCAATAAGGTTTTCATTAATTAGAAGTGGAGACTATTCAGATTCTCGCCCATGGCTGCCACCTGTGTAATTGAACCTTTCATCTTCTCCAGCGTCATACGGCTGGCCGGAAAATGTTGCAGAGATTGACAAGGGTTTCTACATTTTGGTAAGGTTGCAAAAACTGATCTTGGTACATTAAGTACAAGTGGGAAGGAGGAGAGATCACATCCCTCATCGAGCAAAGTTTCTCTTACCCTTAGTTGGTCAATGGAGTTGCTGTAAACGTCTTGTACACTGGGAGATTTTCTGGAAAACGGAGGGTTTGGAAACTCAAA
ATCTCCAAATGTAGAAATATGTTTTCCCTCTTTTGGTTAGAAATTTCGATCGTTAAAGGAACGAATCCACACTGGTTTTTCTTGACCTTAATACGAGCTTCACTACAATTGGTAAAATTAAGGGTTGGCTTTTTCTGAATTTTCTGTTCATTGTTTTACAGTTTTTTCTGGTCCCTTGTAATAGGCTGATTGTTATTAGGTTGCTCCTCTGTTGGTTTTTGTGTTGATGTTTTCTTTCTCGGGATATGATGATTGGCGTTATGGAGGTGTCAACCTAGTTTAGATGTCTGGGTGCGCCTTCTGGTCCTATAGGTTCCCCCTTTCTTCTTTATTGCTCTCTTTGTATTAATCTCTTATACTTAGAGTTCTTATTAATAAAGAAGTTTTTGTCTCCGTTTCAAAAAAAGGGTTTCAGAGGCTATTTCTATGAATCCTCCAAAATGGTCTTTGATTGTTTCGAAAATACTTCTACACCAGAAATCAAGAGGAAGGTTCTTCACTTTAATCCAACCTCCATAACCTTTCAACACCAATGGCCTACTATGTTTTGCACTGGCCCATTTTTCAAATTTTAGGTGGAAATTGTCCCATTCTTGCCATCCTAGTTGAGATGTTCGAGTGCATCCCCTGACCCCTAAACCTCTTGTACTTTGAGTTCTGAATATCAATAAAGAGGTTGTCTCCTTCTAAAAAAATACTCATGTACTTTGAGCTTTAGTCTCATTATCATTAATAAAGAAGCTTGTTTATGTTTTAAAAAAATATCGAGGACAACAAGGTTTTATCAAGTGCTTCCAAACTGAATTGCAAAGTGGACAGCATTCCGATAATGTACCTTTGTCTTCCTTTGTGTGGCTATCTAGTTGAGATGTTTGGGTGCACATGCTGATCTCATAGGCTTAGATCTTGTCTCTTCATTTTACTCAACACTACTTTGTCTATTCATTTTCTTAATGAAGAGGTTCATTTTCCTTTTTTTTTCTAAAAAAATCATAACATTTTGGTGACTTTCCTCTCTGCAAACTTCAGAAAGATATCATTTTTTTTAAAAAGGATACGAGTCTCACTATTATTAATATAAATAAAGAGACAAAGCTCAATGTACATGAGGGTTATACAAAGAGCAATAGGGGGGAGGGAGGATCAACGGGCGCACCCGGGCATCTCAACTAGGTTGACACTCCCATAGCGCCCTCATCACATTCAAAAAACGACAATACAAGAATCAAACAAAAAGCCACAAATACAAGGGCAAACAAAATGATAAAAATAATCTGTCCAAAGCAAAATCATAAATCATAGGCAGCAGGCCGAGAACAAAGGCATCATTAATAGTGGTTATAGTGTTTTGCTATCCTGCCTCTAACCTCATAAAACTCGACACTGAGACCTGCAAGAAATTTGTAAGTACAGCCAGTTTCCACAGTTTTTTTTAATGCTTTTGATCCTCTTTGACTTCCACTCATACGTACCAAAGAGGTCAAGTTCTTGCCAAATTCTTAAGAGTGAAAATTCTGGGTAACTTAATTACTTCCCTATTGTATATCACCCAATTCAAGGTTTGGCATCCCACTTGGCAAACAAAGGATTTTTTTATCTCTGGTGCAATTTTTTCTTTCATAATGTAACCAATCTCTCCATGTCCACGAATATACATCTGAACACTTTGGGACCAACAAAGAAAATTATTCCCATTAAGCTGGATGGAGGTTATTTGGATGGTAGAACCGTTGGAATGGATTCAATTGTTTAAAACTTTAGCAATAGATGACTTGTTATCGGACATTATTGCTGGAAGTAGAGGAAAAACGAACTAAAACCCAATAACCAGAGAACCAAAAGAAGGAAAGAAGGGAAAAGCTAGCCGCGATAGCCTTGTTTGATCTCTACCTCCGTCAAAAGGAAAGGTATCATCTTAGGCAAACCCATCACTGATTACACAAGTTTTTTCTGCCATATCTTTGTTCGTTACATCAACCTCTCATCTGTTCGTGTAGCTT
TCAGATCGACCGCCGACCTTTGGTGGGTCTCTACTAGGAGTTGTTTCCAAATCCATCGTCTGTTGTTTGTCTTAGCTCTGTTGCCTCAAATTCTTGTGCACCCTCAAACCCGCATAGTCCTCATATGCGCCGCTTCATCCTACAAATCCATCTGCTGTGTTTAGATTTGTTTAAATCTAGATTCGTCGTTGATCCATCTGCTTCATCCTTCATCGTTTCAGTCTAGATTCAATGAAGCCTCGACCTCTGATCCGTTGGTGGCTTCACTTGCACCGATCACCATGGGTTTCATTTGTCAGCACCACCAAACTCGTTGTTTGGTGTGTGCTTTTTTGTTCGTGCATTAAATTTTTCCCAGTTTGTAAAAGATTATCATGTACCCTCTCAATTATTCTGAATATTCAATATATTATATAAAATGATCAAATCTGAATTGAGAACCCTATAAAATCCATGGAAAAAGAAATTCTGGAAGAAGACTTGAGACAATTCAAAACAACCGAGGCAAGATAACTTTATTGGGCATGCTAATCGGCTGGAATCCCAAATTTTCTTTTTGAATTAATGTCTTCTTGAAAAAACATGTTTTCTGTCATGCACTAAAAGATGAGATTATTTTCAAAGAAGCTGCAAATAAGTTAACGCCACAATCAACTTACTTGCCATGTTTCTACAAAGAAGAGATCTTCGAACCATATACAGAAAAATAAATTATTTGACTTTACAACTTCCAATCTCCATATCCTTAACACTAGAGAAATGAAAACGCAGAGGAAGGAGGCTAAGCAACAACTTACTAACCAGAGAAAGTAGTAATGTTAACTGGAGAATTCGGCAAAAGCAATTGGAGATACTGACTAAAGAGAGAGCGAGAAGAGAGAAGGGTGAGGAGAGAACAAACATACCTCTAGAATTCTTTTGATTTAAAATGCAAATGATATTTTATGTGCTCATATGAGTATTATAACAATGTTGTAAATATCTTTTCAAGTTGTTTTTTATTTTATTTTATCTTAATATTTTAAACATTTTTGTTATATTGTTCTTTATGGTCATTTACCTTTTCCTACTTAGGTTACTCTTATACACATGTATATATATGTATGTGTCTTCTGTGAACGGAATAGATTATTTTGATTCTCTCAAACCTTAGAGTTTTACCATCACTTTCTTATCATCCCTCGAGAGTGAGAATTAGCTACTAGTTGTTGGTAAGAGTCGATTGACAAATCAATAACTTAGCAAAAAATCAAGAGAATGAATTTAGGTGGCTCCTCAATTAACTTGATATCGAATGTTTTATACAATTTAGTTTTGGCCGGACTATGATTCACAGGCTCTACTTTTGTTTGTTTTCGGAACTCAAATTTGCTCATAGAAATTTTTCTTGCAGAGGAATTGGTGAAGCTTGGGATAGCTGATGGCGTAAGAGGTAACACTTGGAAGTTAGAATACTATCCATACTTCTACTTCCCAATGTTGAATTTTGATTGTTTCTTTTCAATGCATATTTTGTGTGCATGTATAAATGCATATTTTGTAAAAAGAATGGGAAAAAGATCCTTCCATGGAAATTGGTGACCGTTTAACTGGTGAAATTGTTGGTTAATTTCAAATATACTGTTGAGTTTATACTTCATGTCATATTTTCTTTGAGAAGAAAAGAGTTCTTAAACAAAAGGAACACTCAATACAAATCAAGAAAATACAAAATCAGTAAATATAATATAATAAAGACAAAAATGGGAAATTGCTTTAAGTGACAAAATTATTGAACATATTCACATATATATCAAAATGTCACGTCTATCAGTCTATCAACGATAGACCGTGAAATTTGCTATATTTGTAAAAGAAATTGTTCATTTTGCTCTATTTGAAAACAACCCTACAAAAAATATATATTTAACATAAAAGGAAGCAATATATACTCCCCTTATGTGACAACAAGATATCGTTTATTGCAACCTCGTCAATTGAACTGTTAAATTGCCATTTTA
GGGGACCTTTAGTTAACACATAACAATTTTTTTTTTTTAACACAACAATTTGTTCTATTGTTGGGAAAGTAAGGAATGCAGATTACTCTAGCAGAAATATTCTCACACGCCCTCATATATTCCTTCCTCTCGCCGGCCACTAACAATTCTCTCCTTCCACCTCCGACGATGGGCAGCACAACCGACAACGCTTCATTGGGATCCTTGTTCCTCTTTGATATATTGCAAAGATCTTCGTTGTGGGTCACCATGGCCTCGTCGGTTCCACCTGTATTCTAGAATTCAGAGAAGTATATTTCTTATACATATTGAGTCAGAGAAAAGTTACATATTTATATAGAGAATAAAATAAACCCTAGACACTATGTACAATTACAATAAAGGACATAGTTATAACTATATATCATAACACTCCCCCCAAGCTGGAGCAAATGTGTCGATCATGCCCAGCTTGTTGCACAGATAGCTTATCCTTGCTCCATTTAAAGCTTTAGTCAAAATATCTCCTAATTGTTCTCCAATCTTTACGTATCCGGTGGACACCAACCCATCTTGGATTTTCTCACGAATGAAGTGACCATCCACCTCAATATGTTTAGTTCGTTCATGAAATATTGGATTAGATGCAATGTGAAGTGCAACTTGATTATCACACCATAATTTAGCTAGCACTGTAATACTAAAGCCTATCTCAGATAATAATTGGTGAACCCATATTATTTCGCACACAGATTGTGCCATAGCTCGATATTCTGACTCAACACTCGAACGAGAAACAACATTTTGTTTCTTACTCTTCCATGATACTAAGTTTCCACCTACAAAGACACAATATCCAGAAGTTGATCTCTTATCCTCACAAGATTCTGCCCAATCAGCATCAGAAAAACATTCAACTCTCGTATGTCCATGATCTTTGTGAAAGATCCCACATCTAGGAGCAACTTTTGGATAACACAAAATCTGCTCTACTGCAGCCCAATGATCCATTGTAGGGGAAGACATGAACTGACTTACAACAGTTACAGAATAAGCAATGTTTGGTCGAGTTACTGTTAAATAGTTCAACTTCCCAACTAATCTCCTATATCTCTCAGGATCTTTACATAATTCTCCTTCTTTAACAAGTTGCTGATTGTAGATTCAACATGGTGGGTTGAGGTGTTGTAAAGGAAGTACAACCTTTTGTTCCGCTTTTGGGTTTGGACTTCTTTGGCAAGATGGTTTTGAGATCACCGTCTGCATCTTTTTGCTAGTCTCTAGATCCAAATGGTTTCCTTTTGAGAAAGTTATTTTGTTATGCTTTGTATTTAGTTCTCTTCGTTTTGTTTTAGTTTTAACTTCATTTTTTCTGTTGGATTTCGCCTTTTTAGCCTATTCTTTGCTTTTTGTTTGTTTTGGTCGTTTTCCCTTGTTTTGGCTGAGATCTTTTTTGGGCAGTTTGTTTGGTTGTTTTTGTCTTTTCTTGTTTTGCTCTTTGTATAATTCTCTTATACTTTGAGCATTAGTCTCATTTTCCTTTATTAATAAAGAAACTTGTCTCCGTTTCAAAAAAACAAGTTGCTGATTTGGCATCATTGGAGTGCCACTTGGTTTGGCGCCTAATTTTCCTGTCTCGGACAATAAATCAAGTACATATTTTCGTTGAGACAAATAAATACCTTTCTTGCTTCTCATCACTTCAATGCCCAAAAAATATTTCAATTGGCCCAAATCTTTTGTATAAAACTGACCCTGAAGGAAAGTTTTGAGAGATGAAATACCCGAGCATCATTTCTAGTAATAACAATATCATCAACATATACAACTAGTAGAACTATACCATTATCAGATCGGCGATAGAAAACTGAATGATCAGATGTACTCTTCTGCATACCAAAGCATACAAGAGCTTAACTAAACTTACCAAACCACGCACGAGGACTCTGTTTCAAACCATACAAAGATTTTCGAAGGTGACATACTTTATCTCTCCCTCTGAGCAACAAACCCAGGTGGTT
GTTCCATATAAACTTCCTCTTGAAGATCACCGTGAAGAAAAGCATTCTTAATGTCAAGTTGATGCAACGACCATTTATTGGTAGCAGCCATGGAAAGAAATAGGCGAATGGAAGTTAACTTGGCAACCGGAGAGAATGTATCTGAATAATCAGTGCCATAGATTTGAGCATAACCTTTGGCAACAAGGTGACCTTCCTCACTCAGGTTTGTTCTAGCACTCAAGTAATGGAACTCTTCAATATAATGGAACTCTTCAATATATTTGGCTATCGTTTTAGTACCTTGGCGGCAATTCTGGTACTGGTTATATAAAGCTTGTTCATAAGTCGGAGGTAAGAACCAAGCCTTCAACAACTTCTTCATCTTCTCCCAAGAACGGATGGGCTGCTTCCCACATCTTTGTCTATTGATCTCTAGCTGACCTTACCATGCCAAAGCACCAGCTCTGAGTTTCAAGGCCACTAAGTGCATCTTTTCTTGTTCAGAAGTATCCATGTAGTTGAAAAAATTCTCAATGTTCTTTTTTTTTTTGCAACGGAGACAAAATTCTCAATGTTCTTTATCCAATCTAAGTGCATCTTCCTCCCGTTTGCCACTATACGTTGGAAGATCAATCTTCATCTTGTAATCATGATGTACCTCTCTTCGTGCATCAAACCTCCTATTGTTTCGTCTCATTCGCACTTCATCATTATGGTTCCATAGATTTCCTTGTTCATTGTTGCCCTTCGTACCATCTTCTTGTAGTTCTTCCCATACTTCTTGGTCCTCTTGAATATCTTCTTCAAGGTACTGTGGCGGGTCGTAACCGGGCCTTCCTCTTTGGGCGTGTCATTGGTTGTGGAGGTTTTTGTAATTCCTTCTTGCTCGTCGACCTCGTCTGTCACTGAAATTATAGTCGTTTGCCACATTTGCTTCTTTTCGCGGCGGCGGCGGCAGCTCATCCATCCTTCTGTTCAGCAACTCGAAGTTCTCCATCATTCTATCAAACTTGTTGTGAAGATCTCCCAAGGATTCCTCGACGGCCAGCAAGCGAACCGTTGATGTCTCTGGAGAGAGGGCGGCGATTTCCTCCACTTCTTCTTGCACACGATCGTCCCCCGCGGCTGGGTTTATTCCTCTCCGGCCAGCAAGGTTAGTTATGAGAAAGTTGTTACAAAACAGTTAAGTATTCTAACTGTTTTAACTACCATGCAGTTTGGCTAACTCTTTAGACTATCTCTGTTATTATTCTTATTATATCAATATGTCTTTCCCTTATTCACAGCATCACAATAGTCTCACTTAAATAGAAGAAAGACCTAAAATAGAAAGGGAAAAAATCATATCGTAATAATCACATTCTAATTTCAGCAAATATAATAACACTAATTAAGGGATAAATCTAAAGTTGTATACACTTCACAATTTCATGTTGTAACAAACTAAAAAACTCTAATTAATTAACTCTATGAAACTTTGTTTTTAGCAGAAGTCAGTTTGCAGCACTCTGCCATATGGATGGCTGGCGTTTATTTAATGCTCATTATGCAACTTCTTGAAAACAAAGTAGCCATTGAGTTGACACGTTCTGAATTTGTTGAGGCTCAAGAGGTAAGAACTAGGAAAGTTTACTTATCTGTGAGTGTACTTTTTTTCAATTGTCATCCCTCTCTCACCCTCTATTTGTGTCTTGTTGACTTAGTCTTTTCCTTTTCCTTGTGGTGTTCTAAAGGTTTTCGTGATATTCAGCACACACTTTTATCCTCTTTTGCAAGAAGTATCTCCAAATGATACAAGCTAATGTTCGTTAGTTGTATTTTGTTTGGTCGTTTTGCTGGAGTTTGTTCTGTAGCTTCAGAACTTTATAACTAGGGTATATATATATATTACCAAAATAAATAAATAATAGAAACAACAAAGGTATTAAAATTGGTATTAAAGTTTGGTTATACCTCATGGCCAGCATCATACTTCCTGTGACCTAAGAGCTTTGCAAAAGTTTCTTTTGCTCAAAAAC
ATTGTTTAGATGTAAATTTTTTTGAAATAGAGACAAACTTCTTTATTAATAATGAAACTTAAAGTACAAGAGAATTATACAATGAGAGTAAATAAAAAAATCCTATAAGAGGTAACCTTGGGGATCAAGAGGCGCACTCGGACATCTCAACTAGGTGGACACCCCCTAGCGCCAATCATCATATTCCAAAGAAACGAAACAAAACCAAACTAAAATTATTTGATGTCCAGATTAATACAACAGAGTCGAGACAAAAGAAGCTATCAGAAAAGAAAACAATGCAAACCAGAGAAAACATCTAACACCTGTGAAGGCCAAAACAAGAAATACATGAAAATTCTATGGCTAAAGTAAGAAAAAAATGAACCAATCCACCAAGACTTCAAAGCGTAGAAAGAGGCCTGCTGTAAGAATCTTTCAATCTGAATATTCTGCAGTCGAAATCACCAACTATAATGCTGGTTGAGAGAGAAAAACATCCCAGGCATATGTCTTGAATGGAGTAATTCTCAAATTCCTTATTTAAAGAACACCAAGCTGTAGCATTACGTTAGGCAGACAATTTCTGACCTTGGTCTTTCCTTATCATGAAAAATCCATTTGTTACGCTCGAGCCATAATTCCGTAAGGAGAGCTTTAGACCTGTTTACCCATATTAAATGTGGTTTCTTAGATAGAAATGGACCCATCAAAAGCTGAAGCACATTTGAACTTAGGGACCCATTAAAAACCCAATCAGTTTTAAAGATAGAGAATATGCTACCCCAGCAGAAGGATGAAAAGGGACAGGTCATAAAGAGATGAAGAAGGTCCTCATTAGCCTTCATACAGAGAGGACGGGCTGAAGAAGAAGGTACTTGTTAGTTTATTTCCTTTGCCAGATTTCTGAACTGTTGAGGAACCCAAAAACCATAATCCACATTAGAATATTAATTCTCTGAGGACTGCTTGAATTCCACATAGCTTTAAATAAGTGGTTATCCATCGGGGAGGCAGTTTGTAGATAAACCCAAAGAGACTTAACAGTGTAATGGCCTTGGTGGTCAATGGACCAATATCTATAATCTTCAGAATCTCACTTTTTCAGACAAGGGGAGGTATAATAAGGATTAAAACTCTGCAATTTCATCTTCTTTCAAACATCTTCTAAAAGATAAGAACCAAGATAAAGTTTCAATGTCCCAATGTGTTGCAACAGAGCCGTTGAGCAATAGAGCAATTCTAAAAAGCTTGGAAAATTAATCCTTGAGAGGGATGACATCAACTCAAAGATCTATCCAGAATCCGATTCTACAACCATTGCCAAGTTTAAATAAAGCTAGTGATTCCACCGACCTCCACACTCTTGAAATGCTTACCTATGGACTCCTTAGGCTATTACCAAACTTTTCCATTGCAAACCAGTCAAATGCCTCTTTGCCGTGGATGCTTCTTTCTATTTGCCTACATAAAGCTGTCTTCCTTTTAAAGTTTCCACCCTCACTTAGCAAGCAAAGCAATGTTATGTACTTTAATGCCACCTAAGTCGAGACCCCATCCTTTATTGATAGTGTAACCTTGCTGCAATTCACAAGTTGATTGATTTTACTGCCAGCATAACCTTCCCAAAAGAAATTCCTCATAATTTTCTCCAAAGAGGCAGCCACATTTTTCGGGGATAGAAAACAAAGATTGATAGTATGTGGGGAGACTAGCCAAGACCGAATTGGACAAAGTATGTCTACCTCCTCTAGAAATGTTAAATTTTTTCCATCTATCTAATTTTTTGTGAACTCGATCAATAACTGGTTGCCAAAACTCTTTTTGTCGAGGGTAGCCTTCCAAGGGGAGACCGAAATATAAAAAAGGTAACTTTTCTGCCTTGAAACCTAACCGGTTTGCTGTTTGAAACAAGACTTCCTCAACCACATTGACACCGCTAAGAGCTGATTTTACCCAATTTACTTTCTGCTCCAAACACCATTCGAAAAGCTTTGGCAGTGTTTAAAGTAGGAAATGGC
AGAAGAACAGCCTTCAGGTCAGATCTGTGGGTGGTTAAAATTCCTCTAAAATTCGCATTCCCACGTTTGTTCAGAAGTGCACGCATTTCGGAAGGATCTACTGCTGACCATTGGGATTTCTATACTTCTTCTTGGTCTTTAACCTTCATAAGATTGCGGAAAGAAGAAGAAATTAATGATTTTCAGGCTCTTTTTTCTTTGTTGGCAGGAAAAGAATTACCGAGTGCCCCAGATAAAAGAATGTGGTCTTTGGAAACAAGCGGTGCCTTCTCTATTAAATCCCTTGTTAATCACCTTTCTTCTGCTACACCCCTTGAAAATTCACCGGAAGTTTGCATCTGGAAATCAAACAGCTCGTGAAGAGTGAATATTACTCTTTGGATTATGCTTTCTGGGAATTTGAATTGTGCTGCCGTTATGCAAAGGAAACTCCCCTCGGTCCTTTAGGCCCCTCCAAACCCAGACAAAGGTAGGTTTCTCTCTCTTCTCTCTCCTCTCTCCTCCCCCTCTCCCATCCTTCTCCTTCTCTAGCAGTCAATATCGACGACTCAATTTTTCTTTTTCGTTCTCCTCCTAGCGTTTCTTAATCCTTCAGTCAGAAGTTTTGAAATGATGGAAGTGATAAGATGTCGAATCCTCCAAGCACACTGCTGTATCTGGTTGGAAGATGGAAAGTTCCAAGTAGAAGATGTGGAAGCGCAAAGAAGGTTGTCAGTTTCAAAGGCACAGCTGAACCATGAAATCTATAGCGGTCCTTATGAGAGGTACAAAAGATGGTTTCTTTCATAATTCTGGGGATTTGGATCGAGGAAGAATGAAAGTGTCGAAATTCTGGTCCAAAGTTGGATGGATGCTATGCTGTGATTTTTGGCCTTTGTCAAGTGGTCACTCCAACATTCGAATGCGTGCAGGGGAGGACAGGCAGAGGTGGTTAGATTTCCATAAAATGTTAAAGAGTTTCCTGTCTAAGGTAGACTATACAAGATGGTTTGCAAATACAAGGCCCACGTCTAATTTTCCATCTGAGAGCAAGAGTTTAAGCTACGCAGACAAGGTTAGGACGCTAAAAAGGTGTCAACCTAGTTGAGATGTCCAAATGTGTCCCCTGATCCTTAGGTCTCTCTTTTGGTACTCTTAGTATATAACTTACTTTGAGTTTGAATATTAATGAAGAGGTTCTCCTTTTCAAAAAGCAAAGGAAACTCCTCTCTCATTGCTTATTGCCTCATATCTGCCCCCTTTGTTTAAATAATCTGGAGGATTTACAGCACCATCCCTTTGATTGTGACTATGCTAAGAGATGCTGTTTCAGATTGCTTCAAACCTTTAATCTATCTTGGGTTTTTGGAAATGATTTTAGGAGTAATGTCCAACATACTTTGGTTGGTCCAGCCCTCAAGAAAAGGCCTTTTTTTATTTGAAGCAACGCGGTAAAAGCCCTACTATCTGGATTATGGTTTGAAAGAAACCAATGAGTCTTCCACGATAAAATTTTCCCTTGGCTAGACCACTTTGAGCTTCCTCATTTAAACGCTTCTTCATGGTGTATGCTTCCAAATATTTTTGAAGACTTTTCAATTCAGGAGATCAATCTGAATTGGCAGGCTTTCATTTTCCCTCCTAAGTAGCAGAAGACTTTTTGTGTACTTATTTTTCTGTCTTAGGAGTCATTTTGTATTTGTATCACTGCTTGTTATTTTCTTGAGTTCTTGGTCTTAATGTTCCTATATTTTTTTCTTTGGACATTGAGTTTGGGATGTATTTCTTATTTCTTATTTTGACTATTCTCCTTGTTCGGATATGATGAGAGCGCTAAGGGTTGTCAACCTGGTTGAGATGTTGTATATTTTCTTGTATTTTGTATAGTGACTCTGTTTCTTTTCATTAATCAATGAAAAGTGTGTTTCCTTTTTTAAAAAAAAAAATTGATTATAATACACGAGGAATCCTAGGTTTGGAATTAATTATAGAAGAACAGGACATCTAAAAATAATTATAAG
ATGTAAATGACTTCACAAATCTACATCTTGGTATTCCATTCTTTGGATGTTTCAGGTTTTGTCTTCTTACTCTTCATTGGAGTATATATTCTTTGAACATTTTTGCTTCTTTTCATTAAAAAAAAATTTCATGTTAAAAAATGTGACAATATCCCAAAGAAGAAAGTGGAATGTTTGCCTCTCCGTCGGTAGGCGCATCCTTTGTCTAAATACCTGTATTGTATTATGTAGGCCTTAGTGCAGATGAAGAATTGGTTCTTGCGCTTTCCTACAATCTTACAGGCATGTGAGAGTATGATTGAGATGCTTAGAGGCCAGTATGCTCACTACGTTGGTTGTTACCATGAAGCAACTTTCCATTATATCGAAGCTGCAAAGGTCTTGCAATTTTAGTTTCCATGCTTCTGATACTGCTTGAAATCTCACACTGTGCTTATGGTTTGTTATTTGTGTCAAATTTTGTTGATTCTCTTGTCTGATTGTGTAGCTTACAGAGAGCAAATCAATTCAAGCAATGTGCCAAGTTTATGCTGCTGTCTCTTATATCTGTATTGGCGATGCTGAATCATCTACTCTTGTAACATTTTCTCCTGCTTAGAATTTATATACAAGATGAAATATACTAGCTTCCACATTTTTTAGTTTGATTGATTGAAATAGCATTACAGGCACTTGACTTGATTGGACCAGTTTACAGCATGATGGATTCTTTTGTTGGTGTTCGAGAGAAAACAAGTGTTCTTTTTGCGTATGGCCTCTTACTGATGAAGCAACATGATTTACAAGAAGCAAGGTATCTTTTTCCATTTAGATAG

mRNA sequence

ATGGAAGCAGTGGCGGAGGGATTGTGGAGGCTGGCAGATTACCACGAGAAGCAGGGGGAATTAGGGAAGGCAATAAAGTGTTTGGAAGCCATATGCCAAAGTCCCGTCTCTTTCTTTCCAGTCCTTGAGGTCAAGACTCGGCTTCGAATTGCCACCCTTCTGCTCACTTATTCTCACAACGTCAATCATGCCAAATCCCATCTCGAGCGTTCTCAATTGTTACTGAAGTCTATTCCATCATGCTTCGAATTGAAGTGCCGGGCTTATAGCTTATTAAGCCAATGCTATCATCTCGTTGGTGCTATTCCTCCCCAGAAACAACTTCTTTATAAGGGTCTTGATCTCACAAATTCTGCTGGTCACGAACTGTCGGTGAAACTATGGTCTTGCAACTTCAATTCACAGCTAGCAAATGCTTTGATTATTGAAGGAGACTATCAAAATTCAATCTCTGCCTTGGAGTCGGGATACGTATTTTCAGCTGAGATATGCTATCCTGAACTGCAGATGTTTTTTGCTACTTCTATCCTTCATGTGCACCTTATGCAATGGTACGATGACAATTCTGTTCAGCAAGCTGTTAACAAATGTGATGAGGTCTGGGAATCTATTGAACCTGAGAAAAGACAGCAGTGTGTTGGTTTATTATTTTACAATGAGCTGCTGCACATATTTTATCGACTTCGGATCTGTGACTACAAGAATGCTGCTCAGCATTTAGATAAATTAGATGCTGCTATGAAGGCTGATCTGCAACAAACTCAGTACATAGAAGATCTGAATAAAGAAATGAATGCCTTAAATCAGAGTCTATCCAGGTCTGATCTACATTATAAAGATAGGTTGGCTCTCACTGGAAAACATGCTCAGCTTCAGGAACAGCTTAGAAGTATAACTAGACCAACTTCCTTGAGCAAAGAATCTTTGGAACCTGGGCATTTTGGAAACGTGAGAAGAACATATAGAGACAAACTTGAGTTGGCACCATATCCTATTGATGGGGAGTGGCTACCAAAAAGTGCTGTGTACGCGCTTGTGGATCTTATGGTTGTTATCTTTAGCCGGCCTAAAGGTCTCTTCAAAGAGTGCACAAAACGAATTTTATCTGGAATGCTTACTATACAAGAGGAATTGGTGAAGCTTGGGATAGCTGATGGCGTAAGAGAAGTCAGTTTGCAGCACTCTGCCATATGGATGGCTGGCGTTTATTTAATGCTCATTATGCAACTTCTTGAAAACAAAGTAGCCATTGAGTTGACACGTTCTGAATTTGTTGAGGCTCAAGAGGCCTTAGTGCAGATGAAGAATTGGTTCTTGCGCTTTCCTACAATCTTACAGGCATGTGAGAGTATGATTGAGATGCTTAGAGGCCAGTATGCTCACTACGTTGGTTGTTACCATGAAGCAACTTTCCATTATATCGAAGCTGCAAAGCTTACAGAGAGCAAATCAATTCAAGCAATGTGCCAAGTTTATGCTGCTGTCTCTTATATCTGTATTGGCGATGCTGAATCATCTACTCTTGCACTTGACTTGATTGGACCAGTTTACAGCATGATGGATTCTTTTGTTGGTGTTCGAGAGAAAACAAGTGTTCTTTTTGCGTATGGCCTCTTACTGATGAAGCAACATGATTTACAAGAAGCAAGGTATCTTTTTCCATTTAGATAG

Coding sequence (CDS)

ATGGAAGCAGTGGCGGAGGGATTGTGGAGGCTGGCAGATTACCACGAGAAGCAGGGGGAATTAGGGAAGGCAATAAAGTGTTTGGAAGCCATATGCCAAAGTCCCGTCTCTTTCTTTCCAGTCCTTGAGGTCAAGACTCGGCTTCGAATTGCCACCCTTCTGCTCACTTATTCTCACAACGTCAATCATGCCAAATCCCATCTCGAGCGTTCTCAATTGTTACTGAAGTCTATTCCATCATGCTTCGAATTGAAGTGCCGGGCTTATAGCTTATTAAGCCAATGCTATCATCTCGTTGGTGCTATTCCTCCCCAGAAACAACTTCTTTATAAGGGTCTTGATCTCACAAATTCTGCTGGTCACGAACTGTCGGTGAAACTATGGTCTTGCAACTTCAATTCACAGCTAGCAAATGCTTTGATTATTGAAGGAGACTATCAAAATTCAATCTCTGCCTTGGAGTCGGGATACGTATTTTCAGCTGAGATATGCTATCCTGAACTGCAGATGTTTTTTGCTACTTCTATCCTTCATGTGCACCTTATGCAATGGTACGATGACAATTCTGTTCAGCAAGCTGTTAACAAATGTGATGAGGTCTGGGAATCTATTGAACCTGAGAAAAGACAGCAGTGTGTTGGTTTATTATTTTACAATGAGCTGCTGCACATATTTTATCGACTTCGGATCTGTGACTACAAGAATGCTGCTCAGCATTTAGATAAATTAGATGCTGCTATGAAGGCTGATCTGCAACAAACTCAGTACATAGAAGATCTGAATAAAGAAATGAATGCCTTAAATCAGAGTCTATCCAGGTCTGATCTACATTATAAAGATAGGTTGGCTCTCACTGGAAAACATGCTCAGCTTCAGGAACAGCTTAGAAGTATAACTAGACCAACTTCCTTGAGCAAAGAATCTTTGGAACCTGGGCATTTTGGAAACGTGAGAAGAACATATAGAGACAAACTTGAGTTGGCACCATATCCTATTGATGGGGAGTGGCTACCAAAAAGTGCTGTGTACGCGCTTGTGGATCTTATGGTTGTTATCTTTAGCCGGCCTAAAGGTCTCTTCAAAGAGTGCACAAAACGAATTTTATCTGGAATGCTTACTATACAAGAGGAATTGGTGAAGCTTGGGATAGCTGATGGCGTAAGAGAAGTCAGTTTGCAGCACTCTGCCATATGGATGGCTGGCGTTTATTTAATGCTCATTATGCAACTTCTTGAAAACAAAGTAGCCATTGAGTTGACACGTTCTGAATTTGTTGAGGCTCAAGAGGCCTTAGTGCAGATGAAGAATTGGTTCTTGCGCTTTCCTACAATCTTACAGGCATGTGAGAGTATGATTGAGATGCTTAGAGGCCAGTATGCTCACTACGTTGGTTGTTACCATGAAGCAACTTTCCATTATATCGAAGCTGCAAAGCTTACAGAGAGCAAATCAATTCAAGCAATGTGCCAAGTTTATGCTGCTGTCTCTTATATCTGTATTGGCGATGCTGAATCATCTACTCTTGCACTTGACTTGATTGGACCAGTTTACAGCATGATGGATTCTTTTGTTGGTGTTCGAGAGAAAACAAGTGTTCTTTTTGCGTATGGCCTCTTACTGATGAAGCAACATGATTTACAAGAAGCAAGGTATCTTTTTCCATTTAGATAG

Protein sequence

MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHNVNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVHLMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHLDKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITRPTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLFKECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELTRSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLTESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLLMKQHDLQEARYLFPFR*
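
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below is a minimal illustration, assuming the standard genetic code; the function name `translate` and the table construction are not part of this record. It translates only the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # First 30 and last 21 nucleotides of the CDS listed above.
    print(translate("ATGGAAGCAGTGGCGGAGGGATTGTGGAGG"))
    print(translate("TATCTTTTTCCATTTAGATAG"))
```

The two fragments translate to `MEAVAEGLWR` and `YLFPFR*`, which match the start and end of the protein sequence shown above.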
BLAST of Csa3G000010 vs. Swiss-Prot
Match: SCC4_HUMAN (MAU2 chromatid cohesion factor homolog OS=Homo sapiens GN=MAU2 PE=1 SV=2)

HSP 1 Score: 58.5 bits (140), Expect = 2.6e-07
Identity = 52/215 (24.19%), Positives = 95/215 (44.19%), Query Frame = 1

Query: 25  IKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHNVNHAKSHLERSQLLLKSIPSCFEL 84
           + CL+A+   P      +E +T L++ ++L  ++ N   A+SHLE++ L+ + IP   ++
Sbjct: 48  VHCLQAVF--PFKPPQRIEARTHLQLGSVLYHHTKNSEQARSHLEKAWLISQQIPQFEDV 107

Query: 85  KCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHELSVKLWSCNFNSQLANALIIEG 144
           K  A SLLS+ Y    ++   K LL K + ++    +      W C    QLA    +E 
Sbjct: 108 KFEAASLLSELYCQENSVDAAKPLLRKAIQISQQTPY------WHCRLLFQLAQLHTLEK 167

Query: 145 DYQNSISALESGYVFSAEICYPELQMFFATSILHVHLMQWYDDNSVQQAVNKCDEVWESI 204
           D  ++   L  G  ++  +     +  F  S   + LM+      V   +  C ++ E+ 
Sbjct: 168 DLVSACDLLGVGAEYARVVGSEYTRALFLLSKGMLLLME-RKLQEVHPLLTLCGQIVENW 227

Query: 205 EPEKRQQCVGLLFYNELLHIFYR-LRICDYKNAAQ 239
           +    Q+        E L +F+  L++  Y +A Q
Sbjct: 228 QGNPIQK--------ESLRVFFLVLQVTHYLDAGQ 245
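
The percentages in an HSP summary line are ratios of matching columns to alignment length. The following sketch recomputes them from the counts as a consistency check; the function name `check_hsp_line` and the regular expression are illustrative assumptions about this page's text layout, not part of any BLAST tool.

```python
import re

# Matches the summary line of an HSP; tolerates the "Postives" spelling
# that appears in some renderings of this page.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_line(line):
    """Recompute identity and positives percentages from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    consistent = (
        abs(identity - float(ident_pct)) < 0.005
        and abs(positives - float(pos_pct)) < 0.005
    )
    return {"identity": identity, "positives": positives, "consistent": consistent}


if __name__ == "__main__":
    print(check_hsp_line(
        "Identity = 52/215 (24.19%), Positives = 95/215 (44.19%), Query Frame = 1"
    ))
```

For the hit above, 52 identical and 95 positive columns over an alignment of 215 columns give 24.19% and 44.19%, in agreement with the reported values.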

BLAST of Csa3G000010 vs. Swiss-Prot
Match: SCC4_MOUSE (MAU2 chromatid cohesion factor homolog OS=Mus musculus GN=Mau2 PE=1 SV=3)

HSP 1 Score: 58.5 bits (140), Expect = 2.6e-07
Identity = 52/215 (24.19%), Positives = 95/215 (44.19%), Query Frame = 1

Query: 25  IKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHNVNHAKSHLERSQLLLKSIPSCFEL 84
           + CL+A+   P      +E +T L++ ++L  ++ N   A+SHLE++ L+ + IP   ++
Sbjct: 54  VHCLQAVF--PFKPPQRIEARTHLQLGSVLYHHTKNSEQARSHLEKAWLISQQIPQFEDV 113

Query: 85  KCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHELSVKLWSCNFNSQLANALIIEG 144
           K  A SLLS+ Y    ++   K LL K + ++    +      W C    QLA    +E 
Sbjct: 114 KFEAASLLSELYCQENSVDAAKPLLRKAIQISQQTPY------WHCRLLFQLAQLHTLEK 173

Query: 145 DYQNSISALESGYVFSAEICYPELQMFFATSILHVHLMQWYDDNSVQQAVNKCDEVWESI 204
           D  ++   L  G  ++  +     +  F  S   + LM+      V   +  C ++ E+ 
Sbjct: 174 DLVSACDLLGVGAEYARVVGSEYTRALFLLSKGMLLLME-RKLQEVHPLLTLCGQIVENW 233

Query: 205 EPEKRQQCVGLLFYNELLHIFYR-LRICDYKNAAQ 239
           +    Q+        E L +F+  L++  Y +A Q
Sbjct: 234 QGNPIQK--------ESLRVFFLVLQVTHYLDAGQ 251

BLAST of Csa3G000010 vs. Swiss-Prot
Match: SCC4_XENTR (MAU2 chromatid cohesion factor homolog OS=Xenopus tropicalis GN=mau2 PE=2 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 3.4e-07
Identity = 52/215 (24.19%), Positives = 95/215 (44.19%), Query Frame = 1

Query: 25  IKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHNVNHAKSHLERSQLLLKSIPSCFEL 84
           + CL+A+ Q   S    +E +T L++ ++L  ++ N   A+ HLE++ L+ + IP   ++
Sbjct: 40  VHCLQAVFQFKPS--QRIEARTHLQLGSVLYHHTKNSELARQHLEKAWLISQQIPQFEDV 99

Query: 85  KCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHELSVKLWSCNFNSQLANALIIEG 144
           K  A SLLS+ Y    ++   K LL K + ++    +      W C    QLA    +E 
Sbjct: 100 KFEAASLLSELYCQENSVDAAKPLLRKAIQISQQTPY------WHCRLLFQLAQLHTLEK 159

Query: 145 DYQNSISALESGYVFSAEICYPELQMFFATSILHVHLMQWYDDNSVQQAVNKCDEVWESI 204
           D  ++   L  G  ++  +     +  F  S   + LM+      V   +  C ++ E+ 
Sbjct: 160 DLVSACDLLGVGAEYARVVGSEYTRALFLLSKGMLLLME-RKLQEVHPLLTLCGQIVENW 219

Query: 205 EPEKRQQCVGLLFYNELLHIFYR-LRICDYKNAAQ 239
           +    Q+        E L +F+  L++  Y +A Q
Sbjct: 220 QGNPIQK--------ESLRVFFLVLQVTHYLDAGQ 237

BLAST of Csa3G000010 vs. Swiss-Prot
Match: SCC4_XENLA (MAU2 chromatid cohesion factor homolog OS=Xenopus laevis GN=mau2 PE=1 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 1.7e-06
Identity = 52/217 (23.96%), Positives = 95/217 (43.78%), Query Frame = 1

Query: 25  IKCLEAICQSPVSFFPV--LEVKTRLRIATLLLTYSHNVNHAKSHLERSQLLLKSIPSCF 84
           + CL+A+ Q    F P   +E +T L++ ++L  ++ N   A+ HLE++  + + IP   
Sbjct: 43  VHCLQAVFQ----FKPPQRIEARTHLQLGSVLYHHTKNSELARQHLEKAWFISQQIPQFE 102

Query: 85  ELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHELSVKLWSCNFNSQLANALII 144
           ++K  A SLLS+ Y    ++   K LL K + ++    +      W C    QLA    +
Sbjct: 103 DVKFEAASLLSELYCQENSVDAAKPLLRKAIQISQQTPY------WHCRLLFQLAQLHTL 162

Query: 145 EGDYQNSISALESGYVFSAEICYPELQMFFATSILHVHLMQWYDDNSVQQAVNKCDEVWE 204
           E D  ++   L  G  ++  +     +  F  S   + LM+      V   +  C ++ E
Sbjct: 163 EKDLVSACDLLGVGAEYARVVGSEYTRALFLLSKGMLLLME-RKLQEVHPLLTLCGQIVE 222

Query: 205 SIEPEKRQQCVGLLFYNELLHIFYR-LRICDYKNAAQ 239
           + +    Q+        E L +F+  L++  Y +A Q
Sbjct: 223 NWQGNPIQK--------ESLRVFFLVLQVTHYLDAGQ 240

BLAST of Csa3G000010 vs. Swiss-Prot
Match: SCC4_AEDAE (MAU2 chromatid cohesion factor homolog OS=Aedes aegypti GN=AAEL011819 PE=3 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 1.7e-06
Identity = 57/233 (24.46%), Positives = 104/233 (44.64%), Query Frame = 1

Query: 11  LADYHEKQG--ELGKAIKCLEAICQSPVSFFPVLEV--KTRLRIATLLLTYSHNVNHAKS 70
           LA+Y        + K I+CL+A+     +F P L+V  +T L++  +L+ Y+ N + A++
Sbjct: 15  LAEYFRTSSPPNIKKCIQCLQAL----FTFKPPLKVEARTHLQLGQILMAYTKNTDLARN 74

Query: 71  HLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHELSVK 130
           HLE++ +L ++I +  ++K    SLL+Q Y         K +L K ++L+    H +   
Sbjct: 75  HLEQAWMLSENINNFDDVKFDTASLLAQLYQQQEQSSLAKPVLRKAIELSQ---HNV--- 134

Query: 131 LWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVHLMQWYD 190
            W C    QLA     + +Y  +   L  G   + E     L+  F  S   + +++   
Sbjct: 135 YWHCKLLFQLAQTHATDKEYALASELLAVGVESTDETNATYLKTLFLLSRAMIMMIE-RK 194

Query: 191 DNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHI-FYRLRICDYKNAAQ 239
              V   +N+   + ++         V  +   E L + FY L++C Y    Q
Sbjct: 195 TGDVLTILNQAGTMIDN--------AVQNIHLKEYLKVFFYVLQVCHYLQLGQ 228

BLAST of Csa3G000010 vs. TrEMBL
Match: M5WND8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003720mg PE=4 SV=1)

HSP 1 Score: 906.7 bits (2342), Expect = 1.3e-260
Identity = 443/550 (80.55%), Positives = 504/550 (91.64%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLW LADY E++GE+GKA+KCLEAICQS VSFFP++EVKTRLRIATLLL +SHN
Sbjct: 1   MEAVAEGLWGLADYQEQRGEIGKAVKCLEAICQSDVSFFPIVEVKTRLRIATLLLKHSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLER+QLLLKSIPSCF+LKCRAYSLLSQCYHLVGAIPPQKQ+L+K L+L+ SAG
Sbjct: 61  VNHAKSHLERAQLLLKSIPSCFDLKCRAYSLLSQCYHLVGAIPPQKQVLHKALELSVSAG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           HE++VKLWSCNFNSQLANALIIEGDY++SISALE+G+  + EICYPELQMFFAT +LHVH
Sbjct: 121 HEITVKLWSCNFNSQLANALIIEGDYRSSISALEAGFACATEICYPELQMFFATCMLHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQW D+N+VQ AV KCDEVWES++P+KRQQC+GLLFYNELLHIFYRLRICDYKNA  H+
Sbjct: 181 LMQWDDENTVQLAVTKCDEVWESLDPQKRQQCLGLLFYNELLHIFYRLRICDYKNATPHV 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           ++LDAAMKADLQQ Q+++ L +E++A+NQSLSRSDLH+++R AL+ K A+LQ QL S++ 
Sbjct: 241 ERLDAAMKADLQQMQHVQQLARELDAVNQSLSRSDLHHRERSALSEKQARLQHQLSSLST 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
            +S +K SLEP +FGN++RTY DKLELAP PIDGEWLPKSAVYALVDLM+V   RPKG F
Sbjct: 301 WSSTAKGSLEPAYFGNMKRTYGDKLELAPPPIDGEWLPKSAVYALVDLMMVASGRPKGNF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KEC KRI SGMLTIQEELVKLGI DGVREV+LQHSAIWMAGVYLML+MQ LENKVA+ELT
Sbjct: 361 KECAKRIQSGMLTIQEELVKLGITDGVREVNLQHSAIWMAGVYLMLLMQFLENKVAMELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           RSEFVEAQEALVQMKNWF+RFPTILQ CES+IEMLRGQYAH VGCY+EA FHYIEAAKLT
Sbjct: 421 RSEFVEAQEALVQMKNWFMRFPTILQTCESIIEMLRGQYAHSVGCYNEAAFHYIEAAKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           ESKS+QA+ Q+YAAVSYICIGD+ESST ALDLIGPVY MMDSFVGVREKT+ LFAYGLLL
Sbjct: 481 ESKSMQAIYQIYAAVSYICIGDSESSTQALDLIGPVYRMMDSFVGVREKTTALFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQ DLQEAR
Sbjct: 541 MKQQDLQEAR 550
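
The percentages on each HSP summary line follow directly from the raw counts, for example 443/550 = 80.55% identity for the Prunus persica hit above. As a minimal sketch (the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of BLAST itself), the counts can be parsed and the percentages recomputed like this:

```python
import re

# Hypothetical helper: the pattern mirrors the HSP summary lines on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling that some
# report pages print.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


line = "Identity = 443/550 (80.55%), Positives = 504/550 (91.64%), Query Frame = 1"
print(hsp_percentages(line))
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when these reports are scraped in bulk.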

BLAST of Csa3G000010 vs. TrEMBL
Match: V7BAU7_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G243600g PE=4 SV=1)

HSP 1 Score: 906.4 bits (2341), Expect = 1.8e-260
Identity = 443/550 (80.55%), Positives = 499/550 (90.73%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLW LA+YHEK+GE+GKA+KCLEAICQS VSFFP++EVKTRLRIATLLL +SHN
Sbjct: 1   MEAVAEGLWGLAEYHEKRGEIGKAVKCLEAICQSEVSFFPIVEVKTRLRIATLLLHHSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFE+KCRAYSLLSQCYHLVGAIPPQKQ+L+KGL+LT S G
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFEIKCRAYSLLSQCYHLVGAIPPQKQVLHKGLELTASVG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           +E+S+KLWSCNFNSQLANAL IEGDYQ SISALE GYV + E+C PELQMFFATSILHV 
Sbjct: 121 YEISMKLWSCNFNSQLANALSIEGDYQGSISALECGYVCATEVCLPELQMFFATSILHVR 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQW DDN V+QAVNKC+E+WESI+P+KR+QC GLLFYNELLHIFYRLR+CDYKNAA H+
Sbjct: 181 LMQWDDDNLVEQAVNKCNEIWESIDPDKRRQCPGLLFYNELLHIFYRLRLCDYKNAAPHV 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           D LDAAMK D+QQTQ+I++L KE++ L+QSLSRSDLHY+DR AL+ K   ++EQL S+T 
Sbjct: 241 DNLDAAMKFDMQQTQHIQELVKELDVLDQSLSRSDLHYRDRTALSRKQTMIKEQLSSMTG 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
              + +E+L+P +FGNVRRT  DKL+LAP PIDGEWLPKSAVYALVDL+VV+F RPKGLF
Sbjct: 301 LNLIGQETLQPVYFGNVRRTIGDKLQLAPPPIDGEWLPKSAVYALVDLIVVVFGRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KEC KRI SGM  IQ+ELVKLGI DGVREV LQHS+IWMAGVYLML++Q LENKVAIELT
Sbjct: 361 KECAKRIQSGMHIIQDELVKLGITDGVREVDLQHSSIWMAGVYLMLLVQFLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           R+EFVEAQEALVQMKNWF+RFPTILQACE +IEMLRGQYAH VGCY+EA FHYIEA KLT
Sbjct: 421 RAEFVEAQEALVQMKNWFMRFPTILQACECIIEMLRGQYAHSVGCYNEAAFHYIEAVKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           +SKS+QAMCQVYAAVSYICIGDAESS+ ALDLIGPVY +MDSFVGVREKT VLFAYGLLL
Sbjct: 481 DSKSMQAMCQVYAAVSYICIGDAESSSQALDLIGPVYGVMDSFVGVREKTGVLFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQ DLQEAR
Sbjct: 541 MKQQDLQEAR 550

BLAST of Csa3G000010 vs. TrEMBL
Match: D7SJG2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g08370 PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 2.3e-260
Identity = 450/550 (81.82%), Positives = 496/550 (90.18%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           ME VAEGLW LAD HEK+GE+GKA+KCLEA+CQS VSF P+LE+KTRLRIATLLL +SHN
Sbjct: 1   METVAEGLWGLADMHEKKGEIGKAVKCLEALCQSQVSFLPILEIKTRLRIATLLLKHSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           +NHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQ+L K L+LT S+G
Sbjct: 61  LNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQILNKALELTASSG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
              +VKLW CNFNSQLANALIIEGDYQNSISALE G+  + EICY ELQMFFATSILHVH
Sbjct: 121 DGFAVKLWFCNFNSQLANALIIEGDYQNSISALERGFNCATEICYIELQMFFATSILHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQW D N V++AVNKC+EVW+SIEP+KRQQ +GLLFYNELLHIFYRLRICDYKNAAQH+
Sbjct: 181 LMQWDDVNLVERAVNKCNEVWDSIEPDKRQQSLGLLFYNELLHIFYRLRICDYKNAAQHV 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           DKLDAAMKADLQQ Q+I++L KE++ALNQSLSR DLHY DR AL+ K AQ+QEQLR +TR
Sbjct: 241 DKLDAAMKADLQQMQHIQELTKELDALNQSLSRHDLHYTDRSALSEKQAQVQEQLRRVTR 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
             S  KESLE  +FGNV+R + DKL+LAP PIDGEWLPKSAVY L+DLMVVIF RPKG F
Sbjct: 301 LGSSGKESLESAYFGNVKRAWGDKLDLAPPPIDGEWLPKSAVYGLIDLMVVIFGRPKGNF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KEC KRI SG+ TIQEEL+KLGI+D VREV LQHSAIWMAGVYLML+MQ LENKVA+ELT
Sbjct: 361 KECGKRIQSGLRTIQEELMKLGISDSVREVDLQHSAIWMAGVYLMLLMQFLENKVAVELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           RSEFVEAQEALVQM+NWFLRFPTILQACES+IEMLRGQYAH VGC+ EA FH+IEAAKLT
Sbjct: 421 RSEFVEAQEALVQMRNWFLRFPTILQACESIIEMLRGQYAHSVGCFSEAAFHFIEAAKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           ESKS+QAMCQVYAAVSYICIGDAESS+ A DLIGPVY MMDSFVGVREKTSVLFAYGLLL
Sbjct: 481 ESKSMQAMCQVYAAVSYICIGDAESSSQAFDLIGPVYRMMDSFVGVREKTSVLFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQH+LQEAR
Sbjct: 541 MKQHNLQEAR 550

BLAST of Csa3G000010 vs. TrEMBL
Match: K7KAE5_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_02G238400 PE=4 SV=1)

HSP 1 Score: 905.2 bits (2338), Expect = 3.9e-260
Identity = 441/550 (80.18%), Positives = 497/550 (90.36%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLW LA+YHEK+GE+GKA+KCLEAICQS  SFFP++EVKTRLRIATLLL +SHN
Sbjct: 1   MEAVAEGLWGLAEYHEKRGEIGKAVKCLEAICQSDASFFPIVEVKTRLRIATLLLQHSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQ+L+KGL+LT S G
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQVLHKGLELTASVG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           +E+S+KLW CNFNSQLANAL IEGDYQ SISALE GY  + E+C+PELQ+FFATSILHV 
Sbjct: 121 YEISMKLWFCNFNSQLANALSIEGDYQGSISALECGYACATEVCFPELQLFFATSILHVR 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQW DDN V+QAVN+C+++WESI+P+KR+QC GLLFYNELLHIFYRLR+CDYKNAA H+
Sbjct: 181 LMQWDDDNLVEQAVNRCNQIWESIDPDKRRQCPGLLFYNELLHIFYRLRLCDYKNAAPHV 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           D LDAAMK D+QQTQ I++L  E+NAL+QSLSRSDLHY+DR AL+ K   +QEQL+S+T 
Sbjct: 241 DNLDAAMKIDMQQTQRIQELVNELNALDQSLSRSDLHYRDRTALSKKQTMIQEQLKSMTG 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
             S+ +ESL+P +FGNVRR   DKL+LAP PIDGEWLPKSAVYALVDL+VV+F RPKGLF
Sbjct: 301 LCSIGQESLQPVYFGNVRRIIGDKLQLAPPPIDGEWLPKSAVYALVDLIVVVFGRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KEC KRI SGM  IQ+ELVKLGI DGVREV LQHS+IWMAGVYLML++Q LENKVAIELT
Sbjct: 361 KECAKRIQSGMNIIQDELVKLGITDGVREVDLQHSSIWMAGVYLMLLIQFLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           R+EFVEAQEALVQMKNWF+RFPTILQACE +IEMLRGQYAH VGCYHEA FH+IEA KLT
Sbjct: 421 RAEFVEAQEALVQMKNWFMRFPTILQACECIIEMLRGQYAHSVGCYHEAAFHFIEAVKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           +SKS+QAMCQVYAAVSYICIGDAESS+ ALDLIGPVY +MDSFVGVREKT VLFAYGLLL
Sbjct: 481 DSKSMQAMCQVYAAVSYICIGDAESSSQALDLIGPVYGVMDSFVGVREKTGVLFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQ DLQEAR
Sbjct: 541 MKQQDLQEAR 550

BLAST of Csa3G000010 vs. TrEMBL
Match: A0A0B2S4C3_GLYSO (MAU2 chromatid cohesion factor like OS=Glycine soja GN=glysoja_016910 PE=4 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 6.7e-260
Identity = 441/550 (80.18%), Positives = 496/550 (90.18%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLW LA+YHEK+GE+GKA+KCLEAICQS  SFFP++EVKTRLRIATLLL +SHN
Sbjct: 1   MEAVAEGLWGLAEYHEKRGEIGKAVKCLEAICQSDASFFPIVEVKTRLRIATLLLHHSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQ+L+KGL+L  S G
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQVLHKGLELAASVG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           +E+S+KLWSCNFNSQLANAL IEGDYQ SISALE GYV + E+C+PELQMFFATSILHV 
Sbjct: 121 YEISMKLWSCNFNSQLANALSIEGDYQGSISALECGYVCATEVCFPELQMFFATSILHVR 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQW DDN V+QAVN+C+++WESI P+KR+QC GLLFYNELLHIFYRLR+CDYKNAA H+
Sbjct: 181 LMQWDDDNLVEQAVNRCNQIWESIAPDKRRQCPGLLFYNELLHIFYRLRLCDYKNAAPHV 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           D LDAAMK D+QQTQ I++L KE+N L+QSLSRSDLHY+DR AL+ K   +QEQL+S+T 
Sbjct: 241 DNLDAAMKIDMQQTQRIQELVKELNTLDQSLSRSDLHYRDRTALSKKQTMIQEQLKSMTG 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
             S+ +ESL+P +FGNVRR   DKL+LAP PIDGEWLPKSAVYALVDL+VV+F RPKGLF
Sbjct: 301 LCSIGQESLQPVYFGNVRRIIGDKLQLAPPPIDGEWLPKSAVYALVDLIVVVFGRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KEC KRI SGM  IQ+EL+KLGI DGVREV LQHS+IWMAGVYLML++Q LENKVAIELT
Sbjct: 361 KECAKRIQSGMNIIQDELLKLGITDGVREVDLQHSSIWMAGVYLMLLIQFLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           R+EFVEAQEALVQMKNWF+RFPTILQACE + EMLRGQYAH VGCYHEA FH+IEA KLT
Sbjct: 421 RAEFVEAQEALVQMKNWFMRFPTILQACECIFEMLRGQYAHSVGCYHEAAFHFIEAVKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           +SKS+QAMCQVYAAVSYICIGDAESS+ ALDLIGPVY +MDSFVGVREKT VLFAYGLLL
Sbjct: 481 DSKSMQAMCQVYAAVSYICIGDAESSSQALDLIGPVYGVMDSFVGVREKTGVLFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQ DLQEAR
Sbjct: 541 MKQQDLQEAR 550

BLAST of Csa3G000010 vs. TAIR10
Match: AT5G51340.1 (AT5G51340.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 727.2 bits (1876), Expect = 7.4e-210
Identity = 355/548 (64.78%), Positives = 442/548 (80.66%), Query Frame = 1

Query: 3   AVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHNVN 62
           AVAEGLW LAD+H+K GE+GK IKCLEAICQS +SF P++EVK+RLR+A LLL YSHNVN
Sbjct: 5   AVAEGLWGLADHHQKLGEIGKTIKCLEAICQSQISFLPLVEVKSRLRLAALLLRYSHNVN 64

Query: 63  HAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAGHE 122
           HAKSHLERS LLLKSIPS ++LK + YSLLS CYHL+ + PPQ+ LL K L+L +S   +
Sbjct: 65  HAKSHLERSLLLLKSIPSSYDLKFQNYSLLSHCYHLLASFPPQRNLLVKALELASSVPQD 124

Query: 123 LSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVHLM 182
           +S  LWSCNFNSQLAN  II+ D+ +S+SALESG++ ++ IC+PELQMFF  S+LHVH+M
Sbjct: 125 ISAYLWSCNFNSQLANTFIIQADFPSSLSALESGFLSASHICFPELQMFFTASMLHVHIM 184

Query: 183 QWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHLDK 242
           QW DD SV++AV +CDE+W++I  +K  +C GL FYNE+LH+FYRLR+CDYKNA  H+D+
Sbjct: 185 QWTDDYSVEKAVQRCDEIWQTISSDKTDRCPGLFFYNEMLHVFYRLRLCDYKNAQHHVDR 244

Query: 243 LDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITRPT 302
           LD AM A   + Q I+ L  E+++LN SLSR DL  ++R AL+ + +QLQ+++ +++ P+
Sbjct: 245 LDQAMNAHSHKMQEIQQLLDELSSLNLSLSRYDLPSRERSALSARQSQLQDRVNALS-PS 304

Query: 303 SLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLFKE 362
           S +  SLEP +FGN+ R + +KL L+P PIDGEWLPKSA+ ALV LMVVI  RPKGLFKE
Sbjct: 305 SSTVNSLEPAYFGNIDRGWTEKLLLSPSPIDGEWLPKSAIDALVHLMVVISGRPKGLFKE 364

Query: 363 CTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELTRS 422
           C+KRI SG+  IQ+EL+KLGI D VRE  L+H+AIWM+ V+LML MQ LEN+VA+ELTRS
Sbjct: 365 CSKRIESGLQIIQDELIKLGITDEVREADLRHTAIWMSRVFLMLQMQFLENRVALELTRS 424

Query: 423 EFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLTES 482
           ++VEA+EALV MKNWF RFPTILQA E MIEMLRGQY+H VGCY EA FH IEA KLTES
Sbjct: 425 DYVEAEEALVDMKNWFTRFPTILQASECMIEMLRGQYSHSVGCYSEAAFHCIEATKLTES 484

Query: 483 KSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLLMK 542
            S+QA CQ +AAVSY+ IGDAESS+ ALDLIGP+  M +S  GVRE+ S+LFAYGLLLMK
Sbjct: 485 ISMQASCQAFAAVSYLTIGDAESSSKALDLIGPLNGMTNSLSGVREEASILFAYGLLLMK 544

Query: 543 QHDLQEAR 551
           Q DLQEAR
Sbjct: 545 QRDLQEAR 551

BLAST of Csa3G000010 vs. NCBI nr
Match: gi|700200444|gb|KGN55577.1| (hypothetical protein Csa_3G000010 [Cucumis sativus])

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 556/556 (100.00%), Positives = 556/556 (100.00%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN
Sbjct: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH
Sbjct: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL
Sbjct: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR
Sbjct: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
           PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF
Sbjct: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT
Sbjct: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT
Sbjct: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL
Sbjct: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540

Query: 541 MKQHDLQEARYLFPFR 557
           MKQHDLQEARYLFPFR
Sbjct: 541 MKQHDLQEARYLFPFR 556

BLAST of Csa3G000010 vs. NCBI nr
Match: gi|449456905|ref|XP_004146189.1| (PREDICTED: MAU2 chromatid cohesion factor homolog isoform X1 [Cucumis sativus])

HSP 1 Score: 1099.7 bits (2843), Expect = 0.0e+00
Identity = 550/550 (100.00%), Positives = 550/550 (100.00%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN
Sbjct: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH
Sbjct: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL
Sbjct: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR
Sbjct: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
           PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF
Sbjct: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT
Sbjct: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT
Sbjct: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL
Sbjct: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQHDLQEAR
Sbjct: 541 MKQHDLQEAR 550

BLAST of Csa3G000010 vs. NCBI nr
Match: gi|659095143|ref|XP_008448423.1| (PREDICTED: LOW QUALITY PROTEIN: MAU2 chromatid cohesion factor homolog [Cucumis melo])

HSP 1 Score: 1089.3 bits (2816), Expect = 0.0e+00
Identity = 544/550 (98.91%), Positives = 548/550 (99.64%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN
Sbjct: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH
Sbjct: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQWYDDNSV+QAVNKCDEVWES+EPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL
Sbjct: 181 LMQWYDDNSVEQAVNKCDEVWESMEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           DKLDAAMKADLQQTQYIEDL KEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR
Sbjct: 241 DKLDAAMKADLQQTQYIEDLTKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
           PTS+SKESLEPGHFGNVRRT RDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF
Sbjct: 301 PTSMSKESLEPGHFGNVRRTSRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KEC+KRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT
Sbjct: 361 KECSKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT
Sbjct: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL
Sbjct: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540

Query: 541 MKQHDLQEAR 551
           MKQHDLQEAR
Sbjct: 541 MKQHDLQEAR 550

BLAST of Csa3G000010 vs. NCBI nr
Match: gi|778674503|ref|XP_011650234.1| (PREDICTED: MAU2 chromatid cohesion factor homolog isoform X2 [Cucumis sativus])

HSP 1 Score: 1066.6 bits (2757), Expect = 1.5e-308
Identity = 537/550 (97.64%), Positives = 537/550 (97.64%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN
Sbjct: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG
Sbjct: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH
Sbjct: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL
Sbjct: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR
Sbjct: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300

Query: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360
           PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF
Sbjct: 301 PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGLF 360

Query: 361 KECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420
           KECTKRILSGMLTIQ             EVSLQHSAIWMAGVYLMLIMQLLENKVAIELT
Sbjct: 361 KECTKRILSGMLTIQ-------------EVSLQHSAIWMAGVYLMLIMQLLENKVAIELT 420

Query: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480
           RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT
Sbjct: 421 RSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKLT 480

Query: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 540
           ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL
Sbjct: 481 ESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLLL 537

Query: 541 MKQHDLQEAR 551
           MKQHDLQEAR
Sbjct: 541 MKQHDLQEAR 537

BLAST of Csa3G000010 vs. NCBI nr
Match: gi|1009169949|ref|XP_015865941.1| (PREDICTED: uncharacterized protein LOC107403551 [Ziziphus jujuba])

HSP 1 Score: 916.0 bits (2366), Expect = 3.2e-263
Identity = 455/551 (82.58%), Positives = 503/551 (91.29%), Query Frame = 1

Query: 1   MEAVAEGLWRLADYHEKQGELGKAIKCLEAICQSPVSFFPVLEVKTRLRIATLLLTYSHN 60
           MEAVAEGLW LAD+HE++GE+ KA+KCLEAICQS VSFFP++EVKTRLRIATLLL +S+N
Sbjct: 1   MEAVAEGLWGLADHHERKGEIAKAVKCLEAICQSHVSFFPIVEVKTRLRIATLLLKHSYN 60

Query: 61  VNHAKSHLERSQLLLKSIPSCFELKCRAYSLLSQCYHLVGAIPPQKQLLYKGLDLTNSAG 120
           VNHAKSHLER+QLLLKSIPSCF+LK RAYSLLSQCYHLVGAIPPQKQ+L+K LDLT SAG
Sbjct: 61  VNHAKSHLERAQLLLKSIPSCFDLKFRAYSLLSQCYHLVGAIPPQKQILHKALDLTASAG 120

Query: 121 HELSVKLWSCNFNSQLANALIIEGDYQNSISALESGYVFSAEICYPELQMFFATSILHVH 180
           +E++VKLW CNFNSQLANALIIEGDY NSISAL+ G++ +A+ICYPELQMFFATSILHVH
Sbjct: 121 NEIAVKLWCCNFNSQLANALIIEGDYPNSISALQCGFLCAAQICYPELQMFFATSILHVH 180

Query: 181 LMQWYDDNSVQQAVNKCDEVWESIEPEKRQQCVGLLFYNELLHIFYRLRICDYKNAAQHL 240
           LMQW D N V+ AVNKCD+VWESI PEKRQ C+GLLFYNELLHIFYRLRICDYKNAAQH+
Sbjct: 181 LMQWEDPNLVEGAVNKCDQVWESIAPEKRQHCLGLLFYNELLHIFYRLRICDYKNAAQHI 240

Query: 241 DKLDAAMKADLQQTQYIEDLNKEMNALNQSLSRSDLHYKDRLALTGKHAQLQEQLRSITR 300
           D LD AMKADLQQTQ++++L KE++ALNQSLSRSDLHY+DR AL+ K A LQE+L S+TR
Sbjct: 241 DILDTAMKADLQQTQHVQELTKELDALNQSLSRSDLHYRDRSALSEKQALLQERLSSMTR 300

Query: 301 -PTSLSKESLEPGHFGNVRRTYRDKLELAPYPIDGEWLPKSAVYALVDLMVVIFSRPKGL 360
              S  K+ LEP +FGNVRRT  DKLELAP PIDGEWLPKSAVYALVDLMVVIF RPKGL
Sbjct: 301 FSNSSRKDFLEPAYFGNVRRTSGDKLELAPPPIDGEWLPKSAVYALVDLMVVIFGRPKGL 360

Query: 361 FKECTKRILSGMLTIQEELVKLGIADGVREVSLQHSAIWMAGVYLMLIMQLLENKVAIEL 420
           FKEC KRI SGM TIQEELVKLGI DGVREV+LQHSAIWMAGVYLML+MQ LENKVA++L
Sbjct: 361 FKECGKRIQSGMHTIQEELVKLGITDGVREVNLQHSAIWMAGVYLMLLMQFLENKVAVDL 420

Query: 421 TRSEFVEAQEALVQMKNWFLRFPTILQACESMIEMLRGQYAHYVGCYHEATFHYIEAAKL 480
           TRSEFVEAQEALVQMKNWF+RFPTILQACES+IEMLRGQYAH  GCY EA FHYIEAA+L
Sbjct: 421 TRSEFVEAQEALVQMKNWFIRFPTILQACESVIEMLRGQYAHCFGCYSEAAFHYIEAARL 480

Query: 481 TESKSIQAMCQVYAAVSYICIGDAESSTLALDLIGPVYSMMDSFVGVREKTSVLFAYGLL 540
           TE+KS+QA+CQVYAAVSYICIGDAESS+ ALDLIGPVY MMDSFVGVREKT VLFAYGLL
Sbjct: 481 TENKSMQAICQVYAAVSYICIGDAESSSQALDLIGPVYRMMDSFVGVREKTGVLFAYGLL 540

Query: 541 LMKQHDLQEAR 551
           LMKQHDLQEAR
Sbjct: 541 LMKQHDLQEAR 551

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
SCC4_HUMAN  2.6e-07  24.19         MAU2 chromatid cohesion factor homolog OS=Homo sapiens GN=MAU2 PE=1 SV=2
SCC4_MOUSE  2.6e-07  24.19         MAU2 chromatid cohesion factor homolog OS=Mus musculus GN=Mau2 PE=1 SV=3
SCC4_XENTR  3.4e-07  24.19         MAU2 chromatid cohesion factor homolog OS=Xenopus tropicalis GN=mau2 PE=2 SV=1
SCC4_XENLA  1.7e-06  23.96         MAU2 chromatid cohesion factor homolog OS=Xenopus laevis GN=mau2 PE=1 SV=1
SCC4_AEDAE  1.7e-06  24.46         MAU2 chromatid cohesion factor homolog OS=Aedes aegypti GN=AAEL011819 PE=3 SV=1

Match Name        E-value   Identity (%)  Description
M5WND8_PRUPE      1.3e-260  80.55         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003720mg PE=4 SV=1
V7BAU7_PHAVU      1.8e-260  80.55         Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G243600g PE=4 SV=1
D7SJG2_VITVI      2.3e-260  81.82         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g08370 PE=4 SV=...
K7KAE5_SOYBN      3.9e-260  80.18         Uncharacterized protein OS=Glycine max GN=GLYMA_02G238400 PE=4 SV=1
A0A0B2S4C3_GLYSO  6.7e-260  80.18         MAU2 chromatid cohesion factor like OS=Glycine soja GN=glysoja_016910 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G51340.1  7.4e-210  64.78         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                         E-value   Identity (%)  Description
gi|700200444|gb|KGN55577.1|        0.0e+00   100.00        hypothetical protein Csa_3G000010 [Cucumis sativus]
gi|449456905|ref|XP_004146189.1|   0.0e+00   100.00        PREDICTED: MAU2 chromatid cohesion factor homolog isoform X1 [Cucumis sativus]
gi|659095143|ref|XP_008448423.1|   0.0e+00   98.91         PREDICTED: LOW QUALITY PROTEIN: MAU2 chromatid cohesion factor homolog [Cucumis ...
gi|778674503|ref|XP_011650234.1|   1.5e-308  97.64         PREDICTED: MAU2 chromatid cohesion factor homolog isoform X2 [Cucumis sativus]
gi|1009169949|ref|XP_015865941.1|  3.2e-263  82.58         PREDICTED: uncharacterized protein LOC107403551 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR019440  MAU2
Vocabulary: Biological Process
Term        Definition
GO:0007064  mitotic sister chromatid cohesion
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0007064      mitotic sister chromatid cohesion
biological_process  GO:0000956      nuclear-transcribed mRNA catabolic process
biological_process  GO:0006487      protein N-linked glycosylation
biological_process  GO:0034088      maintenance of mitotic sister chromatid cohesion
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0000785      chromatin
cellular_component  GO:0032116      SMC loading complex
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003690      double-stranded DNA binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU086804      cucumber EST collection version 3.0  transcribed_cluster
CU128416      cucumber EST collection version 3.0  transcribed_cluster
CU132039      cucumber EST collection version 3.0  transcribed_cluster
CU157095      cucumber EST collection version 3.0  transcribed_cluster
CU166484      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa3G000010.1  Csa3G000010.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU132039      CU132039     transcribed_cluster
CU128416      CU128416     transcribed_cluster
CU157095      CU157095     transcribed_cluster
CU166484      CU166484     transcribed_cluster
CU086804      CU086804     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                 Source   Source Term    Source Description                      Alignment
IPR019440  Chromatid cohesion factor MAU2  PANTHER  PTHR21394      UNCHARACTERIZED                         coord: 1..258, score: 7.5E-189; coord: 298..552, score: 7.5E
IPR019440  Chromatid cohesion factor MAU2  PFAM     PF10345        Cohesin_load                            coord: 24..494, score: 3.
None       No IPR available                unknown  Coil           Coil                                    coord: 257..277, scor
None       No IPR available                PANTHER  PTHR21394:SF0  MAU2 CHROMATID COHESION FACTOR HOMOLOG  coord: 1..258, score: 7.5E-189; coord: 298..552, score: 7.5E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None