Sed0009983 (gene) Chayote v1

Overview
Name: Sed0009983
Type: gene
Organism: Sechium edule (Chayote v1)
Description: AT-rich interactive domain-containing protein 2
Location: LG07: 13580843 .. 13603238 (-)
RNA-Seq Expression: Sed0009983
Synteny: Sed0009983
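
As a small worked example, the Location value in the overview can be split into its parts and the gene span computed. This is only a sketch: it assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG07: 13580843 .. 13603238 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("LG07: 13580843 .. 13603238 (-)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 22,396 bp on the minus strand of LG07.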
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCTCTCCTTTCCTTGTTCAGAAGAACCACTTCAGCATCCATTCCTCCCTCTGTTTCCTTTTTCTCTTCTCTTCTTTCTGCTTCCGGCGAGCAATACTGATTCTGGCGACCGCTCTCTCCGTTCAACGACGAGCATTCCGGAGAAGGCGGATTCCGGTGCAGCGTTCGAGCAGGTGGCAGCAGCTCTCAGCTCCGTTCACACGACGGCGACCGTCCGGAGGCGGCGGGTTCTGGCGGAGCCGGCGGCAGCAGCTCCATGCTCCTCCGTTCATCTTCCTTTTCACGCCGGTGAATTTTCTTTAATTTTGAAGTAGGTGTTTGAGTTCATGGGGAAATGGCCTATTTCATCCAATGATTCAGTTTTAGATTGCAATAATAAAGATGTTGATCCTTGTCACAGTAATGGTTTTTTTATATCCTCTGATTGTTTGGTAGAGGAAAGTCATGTGAATGTTGATTATGATGATTGCAAAGCGACGGTTAGAAGCTCTTTTGAGAAGATTCTTTCGGTTTTTCTAAAGGAAATAGGTCGTAGAGGAATTGTTAGGCCAGTGCCAGCATTACTAGGTGAAGGGGGATCTTTGGATTTGTTTGAGCTGTTCATGATTGTAAGAGAGAAAGGTGGTTATCTAGTTGTTTCTGAAAATCAATTATGGTCCTCAGTGGTTGTGGAATTAGGTTTGGATCTTCAACTTTCTGCTTCTGTGAAATTGATTTATTCCAAGTATTTAAGTGAGCTAGAGAAATGGCTTATGAAGAGACGTGGTGGTACCAAACTGCTAAATGGGAACTCTAGTTATCACTATGAGATTAGTTTTCCATTTTTGTCGGAACTGGAGGGGAAGATTAAGGGTATGGTATATGGTCTGCTGAGACAAAAGAATGCATATCATGATCGTTCTGGACTCAAATCTAACAAACAGAATGGGAACGTCAATGTTGATGTCACTGCAGAGGAGGAAATAAAATCACCGAGAATAAAGAAAGAAGAACATAGTATATATGGGGGTGTTGAAGAAATTAAAAAAAATTGTAATGACACACTTCGGGATGATGACGAAAAGGATAGAATCCATGTTATTGAAGATGATAGAAGTTTGGATGCTGTTTGTGTTAATGTTGAAAAGGAAATAGACTCCCTTGGGAGATATCGAAAATCTTTATTACGAATGTTGAGGTGGGCAAGAAAGAGTGCAAAGAATCCTGCAGATCCATCTACTGGTATAATACCAAAGTCATCTAAGTGGAAAGGGTTTGATGACAACAATGCATTTTGGCTTCAAGTAATCAGGGCAAAAGATGCAGTTTTAATTAGGAAGGATGTTGACAAAAATGCTGAGAAAGATCTTTTAATACAGGTAGATTTCATACACTTGTTTTTCTTTTCAGCTAGTTCTCCCTATTTTCCTTGACCGATGGACACGAAAACTGTAATTCTTGCTTTGAGCGGCAATTATGTGCGCAAACTGACTTATGCAGCCTGCAGGTATCATGATCTTGGATTTCTTTTTCAGGATAGATGTCTCAGCACCCTTTTCATTCATATATAGAATGGGAGAAGTTACCTCCTGCTATAAGAACCTCGGGTTTTAATCGTTGTAAGCAAGCAGCATATGACTACTATGCCACCCTGTCTTGCTTCCTACGTGAACCTGGGCACAATGGTGAAAACTTAGATCCTTGTCATTGCGTCTACTATCATCGCCTATAATTAGGGATGTACTTATTTGAAATAGAACAAGGTTTGCCCTTGTTTATGGGTTTTGTTTCACCTTTGAACTCAGTGTTAGGAGTCTACATCTCTCAGTTTTTAAGTGTCTGAAAAGAAAATAAGTTTTTGTCACTACCCTAAGAATATGAATTTTACAGTCTTAGAGATTGGCAGGATCAAAGGGATGATAGTTTCATAAAAAATTCCGAGGATAGAAGAAGGCTTATGAGATAGCTGGAGGAGTTGCATACTAGGGAAAGGTCTAGCCAGAAACAAAATTCCCTAGTC
TAATGGATTTAGGAGGGCGATGCTTACAGTGCCTGGAGGAATTGTTCAAAAAAAGATGCTCATAGTGCTTGGAGGAATATCAGGCCATGGTGGCCACTTACCTTGAATTCGATATCCTCAATTACCTTGACAACTTAATGTAGAAGGATTAGGTGTTTTGTTTGCATATGAAAATAGCTGAGGAGCTTGCAAGTTGACTCAGACATGAGAGAACTACCTCTGGTTGGCTTATGTATGAAACTACATTTACCTCTATGGTGGGTATTATTTTCTACCGACATATGACTACCTAGTACCTCATGGATTAGAACTCCCTTGAAAGGATCCTATAGGATGACGGGTTGCTGCTGTGGTTAGTTGGATGGGGATTCATCTCACATCCTTGAGTGTTTGTGTTTTACTTTGCACAACATTGGTTTTAGATATTTAGGGTTGTTAGTTGTTACACTGAGGACTAATCAATTGTTTTCCTTCTTCCTTTTACCTGTGAATGCTCTTAATCATCTACAAGAAATGTGTTACGCAATATTTTGGGACTCAGAGAAGTCGTACTTTTTATTGATATAACTATGTCCATATATATATACAAGGAGAATGATGCAAATCCTAACAACTCACTAAATATAGAATAATACTAAGAATGATGCAAATCCTAACAACTCACTAAATATAGAATAATACTAAGAATGATGCAAATCCTAACAACTCACTAAATATAGAATAATACTAACACTCCCCCTCAAGCTGGAGCAAAGATATCTATCATGCCCAACTTGCTACAAAGATAATGTATTATTGTTCCATTCAGTGCCTTTGTGAAGATATCTCCCAATTGTTCTCCGGTCTTTACATAACTTGTAGAGATTAACCCTCGATGTATCTTTTCTCGAATGAAATGACAGTCCACTTCGATGTGTTTAGTTCTCTCATGAAACACTGGATTCGATGCTATGTGAATAGCTGCTTGGTTATCACACCATGATCTAGCGGGTATAACATTTTCTATCCCAAGTTTAGTCAACAATTGGAGCAACCACACTATCTCACATGTGGCTTGAGCCATAGCCCGTATTCGACTCGCACTTGAACGGAAACGACATTACTGCTTCTTGGTCTTCCAAGACACCAAGTTCCTCCCACAAATACACAATATCCCGAAGTAGATCTCCGTCCTCTTTTGACCAGTGCCCAATCAAAGATCCGAATAACATTCCACCTCATATGACCATGACTCTCATACACTATTCCTCGATCAAGAGCAGATTTCAAGTAACTCAAAGATCCGTTCCACACCCACCCAATGATCCACGATGTAGGAGATGACATATATTGACTTACTACACTTACAGTATGCTATATCCGGCCTCATGTAATGATCGGATAATTTAATTTCCCAACCATTCTTGATGCAAGTTCGGGATCTTTAAAAGGTTCTCCTTCTTTGACAAGTTGTACGTTTGGTATCATCGAGTATCTTGCAAGGTTTCACCCCCAACTTTCCGGCTTCAGATAATGAGTCAAGGACATACTTAGTTGGGACAAAAAGATACCTTTCTTGTTTCGCATCACCTCGATACCTAAAAATATTTGAGAGTTCCCGGATCTTTAGTATGAAACTCGTTGTGAAGGAATTCTTTCAGCAAAACTATACAGTATACCGGTTGTGTCATCGCCAGTGATTATAATATCATCAACATATACCACCAACAAAAGAATGTTATTATTGAATCTCCTGTAAAACACCGAGTGGTCAGACGAACTCTTTTTCAATCCAAACTTTTCAAGGGCTTGACTAAATTTCCCAAACCACGCTCTCGGACTGCTTTGTGCCATATAAGGATTTACGAAGCCGACAAACCTTGCCATTCTCCCCCTGAGCAACAAACTCAAGTGGTTGCTCAAGATACACTTCTTCTTGAAGATCGCCATGAAGGAACGCATTCTTAATATCGAGCTGATGTAACGGCCATTCTCGAGAAGCTGCCATCGAGATAAACAAT
CGAATCGATGTCATCTTGGCCACAGGAGAAAAAGTATCACAATAATCGATGCCATATGACTGAGCATAGCCCTTGGCCACGAGACGAGCTTTCAAACGAGCCACGGACCCATCCGAGTTAACTTTAACTGCAAATACCCACTTACACCCAATTACATTCTTACCTGCAGGATGAGCCACTAGCTCCCAAGTGTCATTAGCATCTAAAGCAATCATTTCTTCAATCATTGCCTGGCGCCAACTAGAATGAGACAAGACTTCATGAGTATTCTTAGGAATAGGTACAGAATCAAGAAATGTAAGGAGAGAGTACGTGGAAGGCAACAAATGAGAGTAAGAAACATAAGAAGAAATGGGATAAGTACATGAACGCTTACCTTTGCGAAGGGCAATAGGCAGGTCTAGGTCACTAGCTGCTTGCGGTTTAGATGACGAAACATTTTCTGGTGGAGGACATGAGTCAGCAGACTGTTGTTGTACCCGTCTGGAATAAACTTGAGTAATCGGAGGACAAGACGAAGGAGGCTCCGTTGGAGGCAAACTAGGAGTCGGGAAGGAAATAACTTCGTAGATGAACAAGTTATCATCTTCCAGACTCGAAGTGGACGACACTAGAGTGAAAGGTACATTCTCAAAGAACTGAACATCTGATGACACAAGATATCGATTCGAGCTTGGACAATAACACCAATAACCCTTCTGAACACGAGAATAACCTAGGAAGATACACTTCAACGATTTTGGGTCTAACTTTGTGCGATGAGGGTTTATATCACGAACAAAACAAGTGCAACCAAAAATCTTGGGTGCAATTGGGAACAGTTCCTTTGTAGGAAACAAGACATGAAACGGAGTTTTACCACTGAGTACGGAAGAAGGCATTCTATTGATCAAAAAACAAGCTGTAGATACCGCATCTACCCAAAAGTGTTTTGGAACATTCATTTGAAACGACAATGCCCTTGCAGTTTCAAGTAAATGTCGATTCTTCCGTTCTGCTACTCCATTTTGAGATGGAGTATCAGCACAAGACGACTGATGAAGGATTCCTTTGTCATGTAAGTAAGATCCGAGGAGATTAGAGAAGTATTCTCCCCCATTATCAGTGCGTAAAATCTTAATAGAAGTTTTAAACTGATTTCGAATTTCAGCATTGAATGTACGAAAAATTGAAAACAACTCAGAATGATTTTTCATTAGATAAATCCACGTCAGACGGGAAAAATCATCCACAAAGGTTACAAAATAACGAAATCCTGTTTTAGATGTTACAGGACTTGGACCCTAAACGTCGGAATGAACCAAATCAAAGGGAGCATCAGCTCTTTTATGAACCCTAGGACTCGAACTAAGACGATGAAATTTAGCAAATTGACAAGACTCACAATCTAAAGATGACAAAGTACTGAAAGATGGATAAAGTTTCTTCAAAACTGACAAGGACGGATGACCTAACCGACAATGAATGTCAAACAAAGACTCAGTGCTCGAACATGCAACCTCATTCCCTGGTTGTTGATCGAGTATGTAGAGACCGCCAGACTCATATCCTCTACCAATAATCCTCTTCTTCATATGATCCTGAAACAACAAGCGTGCTTGGGAAAGAAAGACACAAAGCAATTTAGATCATGGGTAAGTTTGCTAATTGAGATAAGATTATATGTAAGATTTGGTAGACATAATACTGAAGATAAAGACATTGTGGGCGTAACATTTATCAGACCAGACAGTGTTTTAAAAAGCCCCATAAAGCCCCAAAAAGGCACAGAAAGGCACAAGGCGCCATGCTTGTGCCTCGCCTTGCTCAGGCGCAGCGCATAAGATAAGGCGCGCGCCTTACCACGCCTTTCCTCGAGGCTCCCTGGCGCAAAAGGCGCGTGCCTCAAGGGCTTTTTCTTTTCTTTTTTTTTACATCCTCAAAATGTAACCAAACCAAAGAAAGAAGAAGAAGAAGATGATGAAATGTCATAAAAAGATGAAGTGAAAAGTAAGTTTTGT
ATAAAATAAAAGAAAATAGAAAGGTGACATTGATGAAAATTATTGATTTGACCACTCACAATTGATACATATTTGATTCAAATAGAATTTATTTACATTTAAAATTTGATATCTCTCTCAATTCTTTCCATAGTTGTTTAATTTACTTAATTAAATATTATCATTATGACCTATTTTCATTAGATACCATTTTAACCACAATATTATTATTAAGATGTCTTTCACAATTTTCAACTAAAATTTTATTTATTATATATTTAAAATTTCAATGTACAATATATTATATACGTATTAATAAATATTTTATTCATTATGTGTCCCCTTCTTTTATATGTCTTTTTGTTTGAACTAAATTTAATAGTTATTGAACTTATTTACTTATTGCTTTTATATTTTATTGGATATTTTAAACTTTAGAGACTTAATTGATGTTTTTATAATTTAATAGATTAACATATATGTTTTGTGGTAAATATAATTTTTCAAAATATTTTTTTATATATGGTGCGCCTAGAATATAAAGCCCGCGCCTTTTTTGCGCCTTGCGCCTCAGGCTCCAGAGAGCTTTTGCGCCTTTGTGCGCCTTTAAAAACACTGACCAGACCCTTGAACCGAAGATGATGACCCATCTGCCAAGGTGACAGCTGGGAACGGGGCCGGGGATAATGGATTTACAAACAAATTACGGTTACCTGTCATATGTGATGTAGCCCCAGAGTCGATGACCCATTTCAAACTCAGGACCTAAACCCTGTAGAAAACTAATAACACCCATGTTCTCTCTCTGACTCTGTTGAACTTTTATATCTGCATCAAAAGGCATAAGAGCCGTGAGCTCGGCACTGTTCCTCTTGTGCTGCATATAGTACGCCATTATTGATTGTCCCTTTCGATCAGCATGGAAGTAGGATGTGCAAACATCATTTATTCTTGTCACCTGCTCCTTTCTAGAATATAAAAACTCCAGAAATTTCATCAATTCCTTGACGCTATTACAATGAGTCACCAGGGCGACCACATCATCTTCTATTGAGTTTTGAATCTGATTATATAGTCGAGCATCATCTTTAAGCCATATTCTTTTCTCTGACTCATCCGCTGGTGCATCATCCTTAATATGACTCTCCATCTCCATGCTCAGAAGATGAAACCGAATTGTTTTATTCCAATCGCGATAATTGAGGCCATTCAACTTACGCTTCGTGATTTCCGACGAGTGAGAAATAACACTCGTGTGCTTAGAATTAGCCATATATTCTTCCAAGCGTAAAAAACCAATCTAAAATCTGGAAATCAGAGAATCAACGTCGAATTAAACAAAACACAGCAGGAAATTATAAAAAAAAACGCTCTAAACCTCAATCCCGAGACCAAATATCGCCGAAAACCAAAAACAAACCACACAGACACGAAGAGAGGCTTGTCGCGCTCAAAGCCCAGGAAAAATCTCGTCGGAAAACCCTTCCACGCGCTCTCACGCGCCTCCGTCTGCGGCGGCGATCTGGAACTTCCGGCGGCGCGTGTGAGGTCCCTGTTCGTCGTGCGGCGGCGATCTTCTAGATGCAGGTAGATCGGCTGGCAGGCTTTCGAATGACGGTGGCGGTGACAACCAACTCCGACAAAAGCGCGCGGCGGCGTGGTTCCGACAGATTCGGCGGTCTGACAAGTAGATTTTCGACGGGCTATCTCGAACAGCAAGAGTACTATTTGAATACCCCGAGAAGCACACACCCAAACCCTAAAATTCAATATTTTAAAACCCTAAGCATGGCTCTGATACCATGTTACGCAATATTTTGGGACTCTGAGAAGTCGTACTTTTTATTGATATAACTATGTCCATATATATATACAAGGAGAATGATACAAATCCTAACAACTCACTAAATATAGAATAATACTAAGAATGATGCAAATCCTAACAACTCACTAAATATAGAATAATACTAACAAAATAATCCTCCACCCAAAATATTTTTGGGTGGTAGAGGCTGAGTCCA
AATCCATCTTCCATAATTGATTCTCTATGTTGTCAATGGAAGAAATTGATCAAGCAGAAGAATGGCTTCTATGAAGGAGGAGGCTTACCCTTATCTAGTTGATGGTTTCTATATTCCTCTCTTTATATATATGTATATTTTGAGCTTCTCTTGGTTCTGCTTGGAGGACTAAATAACGTGTTATGTGGAATCTATCGTGGAATGGGCGAAGGATCATTTTGATTCTCACCTGATTAGTTGGAACAGGGTAGTGTGGTTTTTTTGCTTTAGTTGTTATTTCATATTTGTTATAGGTATCAGAAACTTGAATGCCAAGAACATTATTTTTCTGAATAAGTGGCTTAGAACTTAAAAAAAGTGGCTTTAGAGGCTCTAGTTTCGTGGCCTTTGCTTTGGCATAAAATTAAGGGTAGTCCATTTTTGGGGCTTTTGTTATCTTTGTCCTTGGAGGATGGAGAGTACGTTTCTAGAAGCACAAATGGTCTTTTGGAAGCCCTTGATTGATTTTATACGAAAAAAAAGAGAGAAATTGATGAGGCTAAACCTATATATCTACAAGGAGAGTAGTAAACACTCACTAATACTGGGCCTAAAGGGCCTGACCTATATATTACAAAATTATACATAAAGATAGGGAATATATCAATAACTCTAACTCTCCCCCTCCGACTGAAGCAAATATGTCTATCATGCCCAACTTGTTGCAGAGATAAGAAATCCTTATTCCATTCTAAGCCTTCGTGAAGATATCTCTCAATTGTTCTCCAGTCTTTACATAGCTTGTCAAAATCAAACCTCGATGTATCTTCTCTCGAATAAAATGACAGTCCACTTCAATGTGCTTAGTTCTCTCATGGAACACTGGATTTGATGCTATGTGAATAGCTGGTTGGTTATTGCACCATAGTCTCCCAATTTCTGTCAACAGCTGGCACAACCATACTATCTCGCAAGTTGCTTGTGCCATAGCCCAATATTTAGATTCTGCACTCGAATGGGACATCGCATTCTGTTTTATGCTCTTCCACAACACCAGGTTCCCTCCTACAAAGACACAATATTCAGAGGTAGATCTTCTGTCCTCTTTGGACCTTGCTCAATTAGCATCTGAATAGCACCCAATTTCGTATGACCCTGATTCTCATACACGATCCGAGTGATATATGGTAGGAAATGACATATACTGACTTACCACGCTTACTGAGTAAGCTATATCAAGTGTAGTGACAATTAGGTAATTTTGTTTCCCAACCATCCTCCGGTATCTTTCAAGATCTCTAAAGGATTCCCCTTTTGTGATTAAGTGAAGGTTAGGTACCATAGGGGGCACCACTTGGTTTCACTCCTAACTTTCTTGCTCCGGACAATAAGTCAAGAACATATGTTCTTTGAGATAGAAATATACCTCTCTTTCTCCTCATCACTTCAATGCCTAAGAAATATTTGAGCGTTCCCAGATCTTTACTATGAAACTAGCCATGAAGATCATTGTTTACTTGTGGTTCCCATGACGAAGAGTCTCCTAGTAAAGGACATGAATTTGTAGTCTGTTGTCGTACTCGTCTGGGGTAAACTTGAGTAATTGGAGGGCATGATGGTGCCGACTCCGACGAATCAAGAGGAGATGAAGAGACAACTTCATAAATAAATAAGTTATCCTCCTCGTGACAAGGACCTATTGACACTGATTTGAAATGTATAGTTCCAAAGAATTGAACATATGAAGAGGCGAGATATCTATTCGAGAGTGGACAGTAACAACAATACCCTTTATGAACACGGTAACAACCTTAGAAGATACACTTCATTGATTTTGGGTCCAACTTTGTACGATGAGGACTGATATCATGAACGAAACAAGTAACACCCAAAAACCTTGGGTGCAAGAGGGAACGACTCTTTTGTGGGAAACAAGACTTGGAATAGAACACTACCATTAAGCACCGATGATGCCATTCTATTGATTAAAAAACAAGTAGTAGACACAACATCCACTCAAA
AGTGCTGGAACATTAATTTGGAATGACAATGTCCTTGCAGTTTCGAATAAATGTCGATTCTTTCTTTCAACAAATCCATTTTTAGATTGAGTACCAGGACATGATGACTGATGCATGATTTCTTTGTCAGGTTGGTAAGACCCTAGAATGGTAGGAAAATATTTACTGCCATTATCTATTCTCAAAATTTTAATGGTAGTATTGAATTGTTTTCTAATTTTCGCATTATAAGTGCCAAAGTGAGAGAACAAACTTAGAACGATTGAAAATTAGATAAACCAAAGTTATGTGAGAATAATCATCAACGAAAGTGACAAAATAACGAAAATCCATTTTTGACAATACGGTACTCAGACCCCAAACATCTGAATGAACTAAATCAAAAGGAGCATCTGCTCGTTTATTGACCCTAGGACTAGAAATATTACGATGAAATTTAGCAAATTGACAAGACTCGTAATTTAAAGACGACTGACTATTGAAAGTTGGATACATTTTCTTCAAAACTAATAAGGACGGATGACCTAGCCAACAATGAATTTCACATAAAGACTCGGTGCTTGAACATGCGAAAACATTTCCTGGTTGTTGATCGAGTATGTATAGGCCCTCGAACTCATATCCTCTACCAGTAATCTTCTTCGTCATATGATCCTATAACAAACAATAGCCAGGAAAGAAAGACACAAAGAAATTTAGGTCACGAGTAAGTTGGCTAATTGAAATAAGGTTGAATGCAAGGTTAGGTAGACATAAAACCAAAGATAAAGACATTATATGTGTAATATCCATCAACGCAAATTCCTGAATCGGTGATGATGACTCATCTGCCAAAAGGACAGTTGGATATGGTGCCGGTGATAGTGAATTTAAAAGGGATTGCGGTTACCTCTTATATGTGATGTAGCCCTAGAGTCTATGACCCATTTCGGAGAAGATGCAAGAAGGCCACGATTTTTATTACCTGTCTCGACTAACATTGCTGTAGGAAGATTCGAAGAGGATGATTATAAAGATGCTTGTGACACTTGAAGTTTGGCAAAATCACTAGTGAAAGCATGAATTGAGCGTTCTAAAACATCATTCGAAACAACATTCGCTAACTGTGAGCTCTGATTCTTATACTTGAGCTTCCTACAATCCCGTTTTACATGACCTGGTTAAGACAATAATAGCAAACCATCTCCCCTCTTGATTCCGGTCGCCTGCTATCTTCATTGGACTTTTGATGCTGATCATAACTTGTCTTTGGTGTGTTAGCCTCACCCTTTACGAAGCGAGACTCGTCTCTTCCCATAAGTGCACTACTCAGTTGTGACATTGATAAATCTAGATCAGTGTTTTAAAAAGCCCAAGGCGTAGTAAGGCGCAAGGCTTTTTCTACATTAGGCGCACTATATAAAAAAACACGAATGTTTTTATATATATGTGTGTATACACAATAAAAACATTTCCATAGACATAAATGAAGTTTTAACCAAGAATAATATATAAAACATACATTAAATTTAACTATTAACCTTCATTTCATGGATAGAAATGAAGTTTAACCAAGGATCTATATAAAGCATACATAAGATTCAACTACTAACCTTTCATTTTAAGCAAATGCTGTCTTTGTGAGTTGAAGAATTAAATTATTTAAGGTATTGAAATGTCTTGAATCAGATATCTCGCGCTTCGAATGACTAGACGGGGTCTTTTGACGCCCGGGGCTTCACGTAAGGCGCCAAAAGGCGCGCGCCTTTTAAAACACTGATCTAGATGGTGGCTTTCAGTACGATGGTGCCTTATGTATACATCCTCTGAACTAGGGATCTCGAATCTCGAGAGGATCTGTGCCCTTGCCATCTTGTACTCCAAACCCGGACTTAGGAGAAAATTGATAACACCCATCTGTGCTTTTTGACTTTGATGAATTTTTATATCAACATCAAATGACATAAGTGCATTAAGCTCGACACTATTCTTTTTGTGCTACGTATGATAAGCCATTAACG
GTTGTCTCTTTCTGTCGGCATAGAAATATAACGTACAAACATCATATATTTTTGTCACATGTTCTTTTCCTTAATTAAAAAACTCCATGAACTTCAGAAGTTCCTTCACACTGTTACAATGGCTAATCAAGTCAACCAAATCATCTCCTATGGAGTTTATAATCTTATTGTACAACCAAGCATCATCCTGAGCCCATATCTTTTTCTTTAAATCGTCTTTCACCAAATTTTTTTCAATACATTCCTCCATCTCCATGCTTAACAAATGAAACTGGACTCTTTGATTCTAATTACTGTTAAGAGTATATGTGACATATGTATAGGCCTTACTCTGTAATCATTAGGCCCACAAAGGCCCGTTAGTACCTAGGGTTTTCAACTATCCTCTTGTATAAATATAGTCACCGGTTACATAATAAAACGGGGTGTTTTTCGGGTTCCTATATTGTATTTCATGGTATCAGAGCCAGGGTTAGCGAAATTAGGGTAGGAGTCGTGTCGCCTCCGTCGCCGCCATTCGCCGTCGCCGCGCGTTGTCGACACCGCCATTCGCCGCCGCTACGCGTTGTCGACACCGCCGCCTCCATTGGAAGCGCCTGCCGCTGATCTACTAGTTGTCTGAGCCTCGCCGTCATCCGACCACCGGAGTCGTCTCACGCGCCGCCGGAAGCAGGGAGGCTCGTGGCTGCGTATTTTTGGCCGATTTCGACTGGGTTTTTCTTGCGCCGGGGTGCTTTTCAAGTTCGGGTAATTTGTTTTTGATTTTGGCAACGTTTTGACTTGAGTTTGACCGTTAGACCGTTTTTTTCGGGATTTTTCCTGCTGCATTCTGTTTGGTTTTGTTCTGGTTTTTGTCCCAGATCTGTTAAAAATTGGTGATTGACTGTGGTTGACATATTTCATGGCGGATTCTAAGCATACAAGTGTAATTGCTCACTCGTCGGAAATCACGAAGCGAAAGCTAAATGGTTTAAATTATCGTGAGTGGAATAAAATAATTCGCTTTCACCTGCTAAGCACGGAGATGGAGGATCATATTAAGGATGATGCACCTGAGGATGAGCCAGTGAAGAAAATATGGCTTCGTGATGATGCACGATTATATAATCAAATTCAAAATTCTATAGATTCTATAGAAGATGAAGTGGTTGATCTGGTAAGTCATTGTAATAGTGTCAAAGAACTCATGAAGTTTTTGGAGTTCTTGTATTCAGGCAAGGAACAAAAAACAAGAATATACGATGTGTGCACATCCTACTTTCATGCTGATCGAAAGGGACAATCGTTAACAACATACTTTATGCAGCATAAGAGGAACAATGCTGAGTTGACTGCTCTTATGCCATTTGATACCGATGTCAAGGTTCAACAGAGTCAGAGGGAGAAAATGGGTGTTATGAGTTTCCTCTAGGGATTAGGTCCTGAATTTGAAATGGCTAAGTCACAGATTTTGTCTGGGACGGAAGTTCTAAGTTTAGATGATGTTTACAAGCATCTCCTTCGCATTGAGAATCATCTTATGGATCCTTTAGCCTCACAATCTAACAGTGCCCTCGTAGGGAGAAAAGATCAGCGTTCTGTAAAGGGTGCTAGTACTTATGCACCTAGGGCTAATTGTGACCGACGACGAGAATCTAGCAGTGACAGGAGGCGATCCGATTCGAGAGGAACCATTGAGTGCTATTATTGCCATAAATCGGGACATCTGAAAAGGGATTGTAGAAGACTGCAATATCAGAATCAGAAGCTTCAAGCTGCTCATGTTGCATCTACTGATGATCCGGAACACTTAATTCATGCATCTGCTGATGAGTTTGCTAAACTTCAAGTGTCACAGGCATCTTCGTCACACAATCCCACTGCAACCCTTGCTGAGATAGGTAATAAGAATCGCTGCTTCCTTACATCTTCTCCAAAATGGGTCATCGACTCTTGGGCTATAGCTCATATGACAGGTAACCATAATCTCTTTGCAAATCCAGTCTCCGGCTTCGTTTCTG
GCTGTTACATTGGCAGATGGGTCATCATCCCCGATTCAGGGGTCTGGTCAAATAAGTGTTACACCTTCCATGTCTTTATCTTCAGTATTATGCCTACCAAATTTTGCTTTTAACTTAATCTCGATCAGTAAACTTACTCGTGATCTAAACTGTTTTGTCTCTTTCTTTCCTGGCTACTGTTTGTTTCAGGATCATATAACGAGAGGATTATTGGTAGAGGATATGAGTCAGATGGTCTTTACATACTTGACCAACAACTAGAGAATGCTGTTGCGTGCTTGAGTACCGAATCTTTGTTCGAAGTCCATTGCCGATTGGTCATCCGTCGTTGTCAGTTTTGAAGAAAATCTATCCAGCTTTCAGTAGTGTGTCATCTTTAAATTGTGAGTCATGTCAGTTTGCTAAATTTCATCGTCTTAGTTCGAGTCCTAGAGTTCATAAACGAGCTGATGCTCCCTTTGATTTAGTTCATTCAGATGTTTGGGGTCCGAGTCCTATAGTGTAAAAAATAAGATTTCGTTATTTTGTGACTTTTGTTGATGACTATTCTTGTTTTACTTGGGTTTATTTAATGAAGAATCGTTCTGAGTTATTTTCCATTTTCTGTGCCTTCAATGCTGAAATACGAGATCAATTCAAAGCTTCCATTAAAATTTTACGTACTGATAATGGTGGGGAATATTTCTCTACTCTTCTTGGATCTTACTTACGTGATCATGGTATTCTTCATCAGTCATCTTGTGCTGACACTTCATCACAGAACGGGGTTGCTGAACGGAAGAACCGACATTTACTTGAAACGGCAAGGGCGTTGGCTTTTCAAATGAATGTTCCTAAACATTTTTGGGGTGATGCTGTATCTACAACCTGTTTTCTAATTAATCGAATGCCTTCATCTGTGCTCAGTGGTCAAACTTTGTTTCATGCCTTGTTTCCTACAAAAGAATTGTTTCCAGTTGCCCCAAAAATCTTTGGTTGCACATGTTTTGTCCGAGATATTAACCCACGTCGCACAAAGTTAGACCCTAAGTTGTTGAAATGTATCTTTCTGGGCTATTCTCGAGTTCAGAAGGGTTATCGGTGCTATTGTCCTGAGTCTAACAGATATCTTGTATCCTTCGATGTTCAATTCTTTGAAGGTGTGCCTTTCACTAAAGGATCGTCTAGTTCGTGTTAAGAGGAGGATAATGTGTTTATCTATGAGGTCGTTCAGTCCTCTACTCCCGATTCGTCTCAAACGGAGTCTTCATCGCCGTGTCCCCCTATCACTCAGGTTTACTCTCGACGCATCCGACAACAACCCTCTGACTCATATCCCATACTGGAAAAAGCTTCGTCATCTGAACCACAAGCAGCTAGCGATTTTGACCTACCTATTGCCCTCTGGAAAGGTAAACGCTCTTGCACATATCCGATTTCTTCCTGTGTTTCATACCAAAGGTTATTGCCTTCCACCTACTCTTTTCTTACTTCTCTTGAATCCATATCTATTCCTAAGACTATTCCTGCAGCATTGTCTCACTCGGGGTGGTGTCAAGCAATGATTGATGAAATGACTGCTTTGGATGCTAATGACACTTGGGAACTAGCAAATTGTCCTGCAGGCAAGAATGTTATTGGGTGTAAATGAGTTTTTGCTATCAAAGTAAATTTTGATGGGTCTGTGGCTGGATTAAAAGCCCGTCTGGTGGCTAAGGACTATGTTCAAACGTATGGAGTGGACTATGGTGACACATTTTCTCTAGTCGCAAAGATGACTTCGATCAGATTGTTTATCTTGATAGTGGCGTCTAAAGATTGGCCGTTATATCAACTGGATATTAAGAATGCATTCCTCCACGGAGATCTTTAGGAGGAGGTGTACCTTGAGCAACCACCTGAGTTTGTTGCTCAAGGGGAGAGTGATAAGGTTTGTCGGCTTCGTAAATCATTATATGGCCTCAAACAGAGTCCGAGAGCTTGGTTTGGGAAATTTAGTGAAGCCCTTGAGAAATTTGGTA
TGAAGAAGAGTTCGTTTGACCACTCCGTATTTTATAGAGATATGAAGGATCGTATTATCCTATTGGTGGTTTATGTTGACGACATCATAATCACTGGAGATGACATAGCCGGTATAGAATCGTTGAAAGTTTTCCTGCATAATGAATTTCATACTAAAGATCTGGGAACCCTCAAATATTTTTTAGGTGTTGAAGTGATGAGGAGCAAGAAAGGCATTATTTTGTCTCAACGAAAGTATGTTCTTGATCTGTTATTAGAAGCAGAGAAGTTGGGCGTGAAACCATGCAGTACTCCTATGGTTACAAACTTACATCTTGTCAAAGAAGGAGAAGCTTTCAAAGATCCTAAGAGATACAAAAGAATGGTTGGGATATTAAACTATCTAACAGTTACTAGACTTGATATAGCATACGCGGTTAGTGTGGTAAGTCAATATATGTCATCTCCCACCGTTGATCATTGGGTGGTCGTAGAACAGATATTAAGTTATTTGAAATCAGCTCCGGGATAGGAAATCTTGTATGGGAGTCATGGACATATACGAGTGGAATGTTTTTCTGATGCCGATTGGGCAGGATCAAAAGATGACAAGAGATCTACCTCGGGATATATTGTGTATTTGTAGGAGGAAATTTGGTATCTTGGAGGAGCAAGAAACAGAATGTGGTCTCTCGTTCAAGTGCAGAATCTGAATACCGGGCTATGGCTCAAGCAACATGTGAGATAGTATGGTTGCTCCAATTAATGACAGAGATCGGGATAGAAAGTGTTATACCTGCAAGGTTATGATGTGATAACCAATCAGCTATTCTCATAGCTTCAAATCCAGTATTTCATGAGAGAACTAAGCACATTGAGGTAGATTGTCATTTCATTCGAGAAAAGGTACTCCAAGGAGTAATCTCAACGAGCTACATCAAGACTGGAGAACAGTTGGGAGATATCTTCATGAAGGCACTTCATGGAACTAGAATAAATTATCTGTGTAGCAAGCTAGGCATGACTGACATTTTCGCTCCAGCTTGAGGGGGAGTGTTAAGAGTATATGTGACATATGTATAGGCCTTACTCTGTAATCATTAGGCCCACAAAGGTCCGTTAGTACCTAGGGTTTCCAGCTATCCTCTTGTATAAATATAGTCACCGGTTACATAATAAAACGGGGTGTTTTTCGGGTCCCTATATTGTATTACAATTACGATAGTTCGAGCCATTTAACTTATGCTGCGTATTTTCCATAGAGTGAGAAATCACATTTGTTCGCTTCGTTTCAGCTATATATTATCAAACTCAACTAATAAATTAGCAAAAAGCAATACTTTTTTTTTAAAAAAATCAATTCTGAAAACAGTAACAGAAAGTGCCTAAAACAGCTGCAATCCTCAATCTTGAGACAAATAGATGTCAGAAACCAAAAACGAACTACTCCTGCAGCTATTGTGGACTAACGGCTGGTCAAAATCCTTCAATATTTGGACAGATAGTAGAACGGGACGTAAGCTATCAGATTCCGACGGCTGTGACAAACAACTCCAGTGGTGGCAGCATCGGAGGTGCTCCGCCGACGGCAGTTTGACATTGATGGTGACCCTCTGATGACTCCCTTGACTAGCAAAAACCCAAACCTAGGAACTTTGCTCTGATACCATGTAGTCGTTATTGAGATAAAATAGTTGTCACATTTTTCATTGATGAGGCTAAGCCTATATATTTACAATGAGAGTAGTAAACCCTAACTAATTCTAGGCTTAATGGGCCTGACTTATATATTACATAATTATTTATAAAGATGTTATAGGGAATATATCAATAACTCTAACAATGAGCACCACTAGGTTTGTGGTGACTCTTATTAAATATGCAGAATTGGACGACGAGTGGGAGTAATCTTTTAGGAGTAATCTTTGATAAAGGACAAAAGGGAAACCTTATTTTGGACTGTTTGATCCATTTCAGCTTGGTATTAAGCAGTTTGGACCAGAACAGACCGAATCGTTTT
GGAGTTTTCAGAACCCAATCGAATCAACTGGATTTGACTTATTCAATACTTCTTCCTATTTCCCTGTGTAAATGTTATATTTGAGAGAAAGACTTCATGTCTCTTCTATTCATTAAAGAGTTATATTTATACAAATGTATAGGGTTAACTTAACTCCTAATGACCCTAATGATAAAAAAAATAAGATTACAATATTTTCATAATTAATTATTCTAACACTCCCTCACAAGTTGGAACATATATGTTGACCATCCCAACTTGTTTGAGAGATAATCTATCTGAGTTCCATTCAATGCTTTAGTGAAGATATCTCCTACTTGCTCTTCAGTCTTCTCATATCCCGTGGATATCACACATTGTTGTATCTTTCTCTCTTACAAAATGCCAATCAACCTCTATGTGTTTTGTCTCTATGTGTTCTGTCCGTTCATGGAAAATTGGGTTAGACGCAATATGGGTGGCTGCTTGATTATCACACCATAACTTTGTAGCGTCTTGACCTCAAATCCTAGCTCTATGAGAAGCTGATGTAACCCGACCAGTTCACATACAAACTGTGCCATTGCTCTATATTCTGATTCAACACTTGATCGCGAAACCACATTATGTTTCTTACTCTTCCAAGAGACTAAGTTACCACCCACAAAATAGTATCCTAAGGTTAATCTCCAGTCTTCTTTAGACCCAACCCAATCTGTGTCTGAAAATCCCTTAATGTCAGTGTGACCATTATCTTTATACAACACACCATGGCCAAGAGACCAGTGATTAACTGTGGGACATGACATATACGAGATCACAATAATGACTTAATATGCTATATCTAGCCTTGTCACAGTCAAATAATTAAATTTTCCAACCAGTCTTGTATACCTTTTTGGATTCTCCAATAGTTCCCCATCTTTCGTAAGCTGTGAATTGGGCATTATTGAAGTACTGCAAAACTTAGCTCCTAAATTTTTTGTCTCGGTTAGCAAGTCCAATGCATACTTTCTTTGAGACAACATAATTCCTTTCTTGCTCCTCAACACCTCAATGCCTAAAAAATACTTAAGCAAACTCAAGTCTTTCGTATTGAACCGACTATGAAGAAATGTCTTAAGAGACTGGATACCTAATGCATCATCACTGGTAATCGCAATATCATAAATGTATACAATTAACAAGATGATGCCAGAGGCAGACTGTTTAAAGAAAATTGAGTGATTAGATGTGATCTTTTACATCCCAAATTCCTCAACCACTTGGCTAAACTTTCCAAACTAGGCCCTAGGACTTTGTTTCAGTCCATACAAAGATTTACAAATGCGCAAAATTTACCTGTCTCCTTTTGAACAACAAACCCAGGTGGTTGTTCCATATACACTTCTTCTTGCAGGTTGCCGTGAAGAAATGCATTTTTGATATCTAGCTGATGCAGAGGCCAATGATAAACAAAAGCCAAAGATATAAAGAGGCAGACCAAAGTCATTTTCGCAACTGGAGAAAATGTATCGTCATAATCAACTCCATATGTCTGAGCATATCTTTTTGCTACTAGGAAAGCTTTCAATTGAGCAACAGACCCATCGGGATTGATCTTAACTTCAGACACCCACTTACAACCAATAGATGTCTTCCCTGCTTGAAGACTAACTAACTCCCAAGTACCATTGGCATCCAAGGTAGTCAATCTCCTCTACCATCGTAGCACACCAACCAGGATGAGACAAAGTTTCGCAAACTGTTTTAGGACAAACATAGAATCCAAGGACGCAACGAACAAACACTTCGACTGTGATAAATGATCGTAAGAAACAAATGATGATATAAGATAAGTACACGAGCACTTACCTTTACGAAGGGCAATGAGAAGATCATCGCTTGTTCCTGGATCTACTGATGAAGGGACTACGGATAAGGGGCACGAATCTTTGTGGTAAATTTCTAAAACCCTATTTTCACGGTTTGGTTGGCTACACATCTTCTTTCTTCCTTTTTGGCATTATCTCTCAGCCTTA
TCCGACGTAGATATTCCTAGTATACGAGTCATGTCTCCCAGATTATGTAGCTGTACACGGGATAATGTTTTTGGATTCACAAGCCTTCTTTGGATTTTAAGTTGGATGTTCAATTCTCGATTAGATTTGTTTTACCTTGTTGGCTCATTATGGCCCTCTCCTCTCCTCTATTAACTATTTCATTTAAAACACCTCAGATTAGGCAATTTGACATGACGTGTATTGGGTAGGCATTAACTTTTCTGTCTTTGATCGTTCATGCTTCTTTTACTTTCCTTATTGAGAGTTTGTGAAACACTATGAGCTCTTTATGGCTGCTCCTTTTAATAATTGGCTGCATGTTAAAATTATAATTATTAGTTTCTCTCTTATATCCAATGTTTTAAAAGCGGCAGTATAGTCCCCTTGAATAATTTCTCCTGACAGACATGTTTGAACACTTGGACACGACAAATAAACCTAATCAATAATAACAAGCTTTGTTTGTGAAACATTTTTTGTTGTAACATGTTTCCATTATTCTATTTGGTTTTAACATCCTTTATTAATCTAAACCGTGAAGGGTTACTTTCACACTCACACACATATCTCTCTGCTTTTGACATATATTAGTTTATAAACCTCATTCGAACAGTTGAATATTCCTCTAATCTTTTCATCTAGTACTTTGTTTTCCTAAATGTCGATGTGGCTATTCTAGTGTCATTTTCATGTTGATAGTAGGTGGTACCTCTTGTTCTTTCTCCTTATGTTGGAAAACACAAGTCTTAGCTACAAAAATAAATGCATATCTATCAACGGATTCTTGTTTAAAGTTGAACAAAATTAAATGATCTACTGAGCTTGAGAAAAAAAGATTCTGGTTTAAACTTGCAGAAGAAAGTAAAGATGCATCCATCCATTTATGAGGATGATTTTGGTAGTCATCACCTCTCTACTGAAAGGATCAGATGTGGCAAATCTGCATTGGTCTCATGTAGTAGCTCATGTCCAGCCGTTCAAAATAACTTGCTTGGTAGTCCAACAACAGAAATTGGGAAGAAATTTGACAATCAAACACTATTGAATGGTGACTTATCATCTGAAATTGTAGACGATCAGCCGAATGAAGATTCAGTTGATAAATCAGTTCCCGTGGGTGCTTTATTTCAAGCAGTTGTGCCTGAATGGACTGGTAATATTTCTGACAGTGACTCGAAATGGCTAGGGATGCGGTCATGGCCTTCTAAGCACAGAAATAGTCATTCCATAAAGAATAGGAATCCCATTGGCCGAGGGAGACCAGATTCATGTGGGTGTCAATTTCCAAGTTCTGTTGAATGTTTTAGATTTCACATTGCTGAAGCGAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTTTGATTGGAGATTTCATCACATGGGGGAGGAGATATCTCTACAGTGGACTTCTGAAGAAGAGAAGAAATTTAAGGAGTTGACAATGTCAAGTTTTAACAACCAGAGTAAGTGCTTTTGGAACTCTTCACTGAGGTGCTTTCCAAAGAAGTCAAGGAAAATTTTGATCAGCTATTACTTCAATGTGTTTCTTATACGGCAGAGAAGCTATCAGAATCGCGTGACTCCAAATAGTATTGATAGTGACGATGAAGATGTCGAGTTTGGTTGCGTTAGTGGGGATTTTGGGGAGAAGGCAATGGAAATTTTAGGCACAAAGTCTCTAGAATGTTCTGAAAATAGACAGTTCGCAGATTTGGAGTAGTGGGAAGTAGAATCCTTCGAGGGTCAAGGAGATAAATTCAAGTATGTAATATAATAAAACTCGAAACTAACATGCTGCAGCCTAGTAGAATCTGCAGTATCCAAGAAGAGGGGAAAACCCAATTTTGATAAGAACTCTGAGTTACTGTTTTAGCTATACAACACCGATATCGGCACCATATGGTGTTAGCTGCTGTAACTTTTATTTTGATCTGTATGATATCAGCTGGTGAGAAATATGAAGACCAGAGAAAG
CCAGTGGTATATTTGTTTGTTTTGAATATGTCAATGTGTGGAACTCAATCTGATAGTTCTTGGTTTTGTATGTGGAGTTTGTTGTAATCATTTGAAGTGGCTCCGAGGAAAGGTCTACCAAGACTAGAAAGGAAAGGTTAGAACTATTTTTGTTAATTACTTGTATTATGTATGGGGTTTAGATCCTCAAAACTGAGTTTGTAAGTTGTGTGTAACTTCAAAATCCAGGTTCCGAGAAAAGGACCTCAAAAGTGCTGTTTGTTGAGACCACCTCTGAGTAACACAGACTTGGGAGTTGTGTTCAAATTTACAGGTAAAGCTTAATCAGTTGACAGTTTCTTGTAACACTGACTTAGAGTTTGTGTGTTGAACTTACAGCTGAACTTAATGAGAAAT

mRNA sequence

TCTCTCTCCTTTCCTTGTTCAGAAGAACCACTTCAGCATCCATTCCTCCCTCTGTTTCCTTTTTCTCTTCTCTTCTTTCTGCTTCCGGCGAGCAATACTGATTCTGGCGACCGCTCTCTCCGTTCAACGACGAGCATTCCGGAGAAGGCGGATTCCGGTGCAGCGTTCGAGCAGGTGGCAGCAGCTCTCAGCTCCGTTCACACGACGGCGACCGTCCGGAGGCGGCGGGTTCTGGCGGAGCCGGCGGCAGCAGCTCCATGCTCCTCCGTTCATCTTCCTTTTCACGCCGGTGAATTTTCTTTAATTTTGAAGTAGGTGTTTGAGTTCATGGGGAAATGGCCTATTTCATCCAATGATTCAGTTTTAGATTGCAATAATAAAGATGTTGATCCTTGTCACAGTAATGGTTTTTTTATATCCTCTGATTGTTTGGTAGAGGAAAGTCATGTGAATGTTGATTATGATGATTGCAAAGCGACGGTTAGAAGCTCTTTTGAGAAGATTCTTTCGGTTTTTCTAAAGGAAATAGGTCGTAGAGGAATTGTTAGGCCAGTGCCAGCATTACTAGGTGAAGGGGGATCTTTGGATTTGTTTGAGCTGTTCATGATTGTAAGAGAGAAAGGTGGTTATCTAGTTGTTTCTGAAAATCAATTATGGTCCTCAGTGGTTGTGGAATTAGGTTTGGATCTTCAACTTTCTGCTTCTGTGAAATTGATTTATTCCAAGTATTTAAGTGAGCTAGAGAAATGGCTTATGAAGAGACGTGGTGGTACCAAACTGCTAAATGGGAACTCTAGTTATCACTATGAGATTAGTTTTCCATTTTTGTCGGAACTGGAGGGGAAGATTAAGGGTATGGTATATGGTCTGCTGAGACAAAAGAATGCATATCATGATCGTTCTGGACTCAAATCTAACAAACAGAATGGGAACGTCAATGTTGATGTCACTGCAGAGGAGGAAATAAAATCACCGAGAATAAAGAAAGAAGAACATAGTATATATGGGGGTGTTGAAGAAATTAAAAAAAATTGTAATGACACACTTCGGGATGATGACGAAAAGGATAGAATCCATGTTATTGAAGATGATAGAAGTTTGGATGCTGTTTGTGTTAATGTTGAAAAGGAAATAGACTCCCTTGGGAGATATCGAAAATCTTTATTACGAATGTTGAGGTGGGCAAGAAAGAGTGCAAAGAATCCTGCAGATCCATCTACTGGTATAATACCAAAGTCATCTAAGTGGAAAGGGTTTGATGACAACAATGCATTTTGGCTTCAAGTAATCAGGGCAAAAGATGCAGTTTTAATTAGGAAGGATGTTGACAAAAATGCTGAGAAAGATCTTTTAATACAGAAGAAAGTAAAGATGCATCCATCCATTTATGAGGATGATTTTGGTAGTCATCACCTCTCTACTGAAAGGATCAGATGTGGCAAATCTGCATTGGTCTCATGTAGTAGCTCATGTCCAGCCGTTCAAAATAACTTGCTTGGTAGTCCAACAACAGAAATTGGGAAGAAATTTGACAATCAAACACTATTGAATGGTGACTTATCATCTGAAATTGTAGACGATCAGCCGAATGAAGATTCAGTTGATAAATCAGTTCCCGTGGGTGCTTTATTTCAAGCAGTTGTGCCTGAATGGACTGGTAATATTTCTGACAGTGACTCGAAATGGCTAGGGATGCGGTCATGGCCTTCTAAGCACAGAAATAGTCATTCCATAAAGAATAGGAATCCCATTGGCCGAGGGAGACCAGATTCATGTGGGTGTCAATTTCCAAGTTCTGTTGAATGTTTTAGATTTCACATTGCTGAAGCGAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTTTGATTGGAGATTTCATCACATGGGGGAGGAGATATCTCTACAGTGGACTTCTGAAGAAGAGAAGAAATTTAAGGAGTTGACAATGTCAAGTTTTAACAACCAGAGTAAGTGCTTTTGGAACTCTTCACTGAG
GTGCTTTCCAAAGAAGTCAAGGAAAATTTTGATCAGCTATTACTTCAATGTGTTTCTTATACGGCAGAGAAGCTATCAGAATCGCGTGACTCCAAATAGTATTGATAGTGACGATGAAGATGTCGAGTTTGGTTGCGTTAGTGGGGATTTTGGGGAGAAGGCAATGGAAATTTTAGGCACAAAGTCTCTAGAATGTTCTGAAAATAGACAGTTCGCAGATTTGGAGTAGTGGGAAGTAGAATCCTTCGAGGGTCAAGGAGATAAATTCAAGTATGTAATATAATAAAACTCGAAACTAACATGCTGCAGCCTAGTAGAATCTGCAGTATCCAAGAAGAGGGGAAAACCCAATTTTGATAAGAACTCTGAGTTACTGTTTTAGCTATACAACACCGATATCGGCACCATATGGTGTTAGCTGCTGTAACTTTTATTTTGATCTGTATGATATCAGCTGGTGAGAAATATGAAGACCAGAGAAAGCCAGTGGTATATTTGTTTGTTTTGAATATGTCAATGTGTGGAACTCAATCTGATAGTTCTTGGTTTTGTATGTGGAGTTTGTTGTAATCATTTGAAGTGGCTCCGAGGAAAGGTCTACCAAGACTAGAAAGGAAAGGTTCCGAGAAAAGGACCTCAAAAGTGCTGTTTGTTGAGACCACCTCTGAGTAACACAGACTTGGGAGTTGTGTTCAAATTTACAGGTAAAGCTTAATCAGTTGACAGTTTCTTGTAACACTGACTTAGAGTTTGTGTGTTGAACTTACAGCTGAACTTAATGAGAAAT

Coding sequence (CDS)

ATGGGGAAATGGCCTATTTCATCCAATGATTCAGTTTTAGATTGCAATAATAAAGATGTTGATCCTTGTCACAGTAATGGTTTTTTTATATCCTCTGATTGTTTGGTAGAGGAAAGTCATGTGAATGTTGATTATGATGATTGCAAAGCGACGGTTAGAAGCTCTTTTGAGAAGATTCTTTCGGTTTTTCTAAAGGAAATAGGTCGTAGAGGAATTGTTAGGCCAGTGCCAGCATTACTAGGTGAAGGGGGATCTTTGGATTTGTTTGAGCTGTTCATGATTGTAAGAGAGAAAGGTGGTTATCTAGTTGTTTCTGAAAATCAATTATGGTCCTCAGTGGTTGTGGAATTAGGTTTGGATCTTCAACTTTCTGCTTCTGTGAAATTGATTTATTCCAAGTATTTAAGTGAGCTAGAGAAATGGCTTATGAAGAGACGTGGTGGTACCAAACTGCTAAATGGGAACTCTAGTTATCACTATGAGATTAGTTTTCCATTTTTGTCGGAACTGGAGGGGAAGATTAAGGGTATGGTATATGGTCTGCTGAGACAAAAGAATGCATATCATGATCGTTCTGGACTCAAATCTAACAAACAGAATGGGAACGTCAATGTTGATGTCACTGCAGAGGAGGAAATAAAATCACCGAGAATAAAGAAAGAAGAACATAGTATATATGGGGGTGTTGAAGAAATTAAAAAAAATTGTAATGACACACTTCGGGATGATGACGAAAAGGATAGAATCCATGTTATTGAAGATGATAGAAGTTTGGATGCTGTTTGTGTTAATGTTGAAAAGGAAATAGACTCCCTTGGGAGATATCGAAAATCTTTATTACGAATGTTGAGGTGGGCAAGAAAGAGTGCAAAGAATCCTGCAGATCCATCTACTGGTATAATACCAAAGTCATCTAAGTGGAAAGGGTTTGATGACAACAATGCATTTTGGCTTCAAGTAATCAGGGCAAAAGATGCAGTTTTAATTAGGAAGGATGTTGACAAAAATGCTGAGAAAGATCTTTTAATACAGAAGAAAGTAAAGATGCATCCATCCATTTATGAGGATGATTTTGGTAGTCATCACCTCTCTACTGAAAGGATCAGATGTGGCAAATCTGCATTGGTCTCATGTAGTAGCTCATGTCCAGCCGTTCAAAATAACTTGCTTGGTAGTCCAACAACAGAAATTGGGAAGAAATTTGACAATCAAACACTATTGAATGGTGACTTATCATCTGAAATTGTAGACGATCAGCCGAATGAAGATTCAGTTGATAAATCAGTTCCCGTGGGTGCTTTATTTCAAGCAGTTGTGCCTGAATGGACTGGTAATATTTCTGACAGTGACTCGAAATGGCTAGGGATGCGGTCATGGCCTTCTAAGCACAGAAATAGTCATTCCATAAAGAATAGGAATCCCATTGGCCGAGGGAGACCAGATTCATGTGGGTGTCAATTTCCAAGTTCTGTTGAATGTTTTAGATTTCACATTGCTGAAGCGAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTTTGATTGGAGATTTCATCACATGGGGGAGGAGATATCTCTACAGTGGACTTCTGAAGAAGAGAAGAAATTTAAGGAGTTGACAATGTCAAGTTTTAACAACCAGAGTAAGTGCTTTTGGAACTCTTCACTGAGGTGCTTTCCAAAGAAGTCAAGGAAAATTTTGATCAGCTATTACTTCAATGTGTTTCTTATACGGCAGAGAAGCTATCAGAATCGCGTGACTCCAAATAGTATTGATAGTGACGATGAAGATGTCGAGTTTGGTTGCGTTAGTGGGGATTTTGGGGAGAAGGCAATGGAAATTTTAGGCACAAAGTCTCTAGAATGTTCTGAAAATAGACAGTTCGCAGATTTGGAGTAG

Protein sequence

MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKILSVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLDLQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYGLLRQKNAYHDRSGLKSNKQNGNVNVDVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCNDTLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPSTGIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDFGSHHLSTERIRCGKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDLSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIKNRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTSEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNSIDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE
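
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)

# First eight codons of the CDS shown above.
print(translate("ATGGGGAAATGGCCTATTTCATCC"))
# Last five codons of the CDS, ending in the TAG stop codon.
print(translate("GCAGATTTGGAGTAG"))
```

The two fragments translate to `MGKWPISS` and `ADLE`, matching the start and end of the listed protein sequence.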
Homology
BLAST of Sed0009983 vs. NCBI nr
Match: XP_008452043.1 (PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo])

HSP 1 Score: 919.8 bits (2376), Expect = 1.2e-263
Identity = 464/642 (72.27%), Positives = 536/642 (83.49%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+WPISSNDS+LDC NKDVDP  SNG+ I+ DCLVE S  NVD+DDCKAT+R  FEKIL
Sbjct: 1   MGRWPISSNDSILDC-NKDVDPNPSNGYCIAPDCLVEGSRANVDHDDCKATIRCYFEKIL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKEI RRG +RPVPALLGEGGSLDLFELFM+VR+KGGY VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKEICRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSS-YHYEISFPFLSELEGKIKGMVY 180
           L LSASVKLIY KYLSELEKWLM RRGGTKL NGNS  Y+Y  SFP L+ELE KIK M+Y
Sbjct: 121 LGLSASVKLIYFKYLSELEKWLMVRRGGTKLENGNSDYYYYRKSFPCLAELEAKIKDMLY 180

Query: 181 GLLRQKNAYHDRSGLKSNKQNGNVNV-DVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           G+LRQK+ Y +R G KSNK NGNVNV +  AE+EIK P+I+K+EH ++  V  I++NC +
Sbjct: 181 GVLRQKSIYDERPGFKSNKPNGNVNVAETAAEKEIKFPKIEKKEHDLHEDVTPIQQNCTE 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T R + E ++IHVI D RSLDA  VNVE E DS GR R+SLLRML+W RK+AK+PA+PS 
Sbjct: 241 TPRVNGETNQIHVIGDCRSLDA--VNVETETDSHGRSRESLLRMLKWVRKTAKHPANPSN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G +P+SSKWK +  ++A WLQVI+AKDA+L RKDVDK AEK LLIQKKV+MHP IYED+ 
Sbjct: 301 GTVPESSKWKAYASDDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNI 360

Query: 361 -GSHHLSTERIRC-------GKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGD 420
             +HHLSTERI C        KS LV+ ++SCP V++N +GS TTEIGK   NQ LLNGD
Sbjct: 361 DDNHHLSTERICCSRRSNALAKSELVASNNSCPPVRSNQIGSLTTEIGKGLKNQALLNGD 420

Query: 421 LSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIK 480
           L+SE+ D+Q NEDSV+K VPVGALFQA +PEWTGNISDSDSKWLG R WPS+H N+ S+ 
Sbjct: 421 LASEMEDNQANEDSVEKPVPVGALFQAAIPEWTGNISDSDSKWLGTRLWPSQHENNKSVS 480

Query: 481 NRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWT 540
           NRNPIGRGR DSC CQFP SVEC+RFHIAEARMRLKLELGLTF+DWRFH MGEEISLQWT
Sbjct: 481 NRNPIGRGRLDSCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540

Query: 541 SEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPN 600
           +EEEK+FKEL +SSFNNQ++CFWN SL+ FP KSRK LISYYFNVFL+RQRSYQNRVTPN
Sbjct: 541 AEEEKRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600

Query: 601 SIDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADL 633
            IDSDDEDVEFGC+SGDFG KAMEILG+KS+ECSEN+QF D+
Sbjct: 601 DIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENKQFIDI 639
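
The percentages in the HSP header of this hit follow directly from the reported counts: matching (or positively scoring) columns divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, with `percent` as a hypothetical helper name:

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to 2 places."""
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the HSP header of the Cucumis melo hit (XP_008452043.1).
alignment_length = 642
identity = percent(464, alignment_length)   # identical residues
positives = percent(536, alignment_length)  # identical or conservative residues
print(identity, positives)
```

This reproduces the 72.27% identity and 83.49% positives values given in the header.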

BLAST of Sed0009983 vs. NCBI nr
Match: XP_038893741.1 (AT-rich interactive domain-containing protein 2 [Benincasa hispida])

HSP 1 Score: 902.9 bits (2332), Expect = 1.6e-258
Identity = 452/636 (71.07%), Positives = 528/636 (83.02%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+WPISSN S++DC NKDVDP  SNG  I+ DCLVE S+ NV+YDDCKAT+R  FEKIL
Sbjct: 7   MGRWPISSNASIVDC-NKDVDPNPSNGCCIAPDCLVEGSYANVNYDDCKATIRCYFEKIL 66

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKEIGRRG +RPV ALLGEGGSLDLFELFM+VR+KGGY VVSE +LWSSVV+ELGLD
Sbjct: 67  WVFLKEIGRRGSIRPVAALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLD 126

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYG 180
           L LSASVKLIYSKYLS+LEKWLM R GGTKL NGNS YHY  SFPFLSELE K+K M+  
Sbjct: 127 LGLSASVKLIYSKYLSDLEKWLMVRSGGTKLENGNSDYHYRKSFPFLSELEAKVKCML-- 186

Query: 181 LLRQKNAYHDRSGLKSNKQNGNVNVDVTA-EEEIKSPRIKKEEHSIYGGVEEIKKNCNDT 240
                  Y + SG KSNK NGNVNV   A E+EIK P++KKEEH ++G V  I++NC +T
Sbjct: 187 -------YDECSGFKSNKPNGNVNVATAALEKEIKFPKLKKEEHDLHGDVTPIQQNCTET 246

Query: 241 LRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPSTG 300
            RD+ E D+IHVIED RSL A  VN+E E+D+ GRYR+SLLRML+WARK+AK+P +PS  
Sbjct: 247 PRDNGETDQIHVIEDCRSLAA--VNIETELDTHGRYRESLLRMLKWARKTAKHPGNPSNC 306

Query: 301 IIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDFG 360
            +P +SKWK +  ++A WLQVIRAKDA+L RKDVD+ AEK LLIQKK +MHPSIYED+  
Sbjct: 307 TVPGASKWKAYASDDALWLQVIRAKDALLTRKDVDRIAEKRLLIQKKTRMHPSIYEDNID 366

Query: 361 SHHLSTERIRCGKSALVS-CSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDLSSEIVDD 420
           +H LSTERI C K +  S C++S P +Q+N + S TTEIGK  +NQ L NGDL S++ D+
Sbjct: 367 NHQLSTERICCSKKSNASACNNSHPTIQSNCISSLTTEIGKGLENQALSNGDLPSKMEDN 426

Query: 421 QPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHS-IKNRNPIGR 480
           QPNEDSV+K VP GALFQAV+PEWTGNISDSDSKWLG +SWPS+H N +S + ++NPIG+
Sbjct: 427 QPNEDSVEKPVPTGALFQAVIPEWTGNISDSDSKWLGTQSWPSQHGNINSVVSDKNPIGK 486

Query: 481 GRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTSEEEKKF 540
           GRPDSC CQFP SVECFRFHIAEARM LKLELGLTF+DWRFHHMGEEISLQWT+EEEK+F
Sbjct: 487 GRPDSCSCQFPGSVECFRFHIAEARMGLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRF 546

Query: 541 KELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNSIDSDDE 600
           KEL +SSFNNQS+CFWN SL+ FP KSRK LISYYFNVFL+RQRSYQNR TPNSIDSDDE
Sbjct: 547 KELAVSSFNNQSRCFWNYSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRATPNSIDSDDE 606

Query: 601 DVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE 634
           D+EFGC+SGDFG KAMEILG+KS+EC+ENRQF D+E
Sbjct: 607 DLEFGCISGDFGAKAMEILGSKSVECAENRQFTDVE 630

BLAST of Sed0009983 vs. NCBI nr
Match: XP_004146560.2 (AT-rich interactive domain-containing protein 2 [Cucumis sativus] >KGN53331.1 hypothetical protein Csa_015265 [Cucumis sativus])

HSP 1 Score: 892.5 bits (2305), Expect = 2.1e-255
Identity = 450/639 (70.42%), Positives = 525/639 (82.16%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+WPISSNDS+LDC NKDVDP  S G+ I+ DCLVE S  NVD+DDCKAT+R  FEK+L
Sbjct: 1   MGRWPISSNDSILDC-NKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKE  RRG +RPVPALLGEG SLDLFELFM+VR+KGGY VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSS-YHYEISFPFLSELEGKIKGMVY 180
           L LSASVKLIY KYLS+LEKWLM RRGGTKL NGNS  Y+Y  +FP L+ELE KIK ++Y
Sbjct: 121 LGLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILY 180

Query: 181 GLLRQKNAYHDRSGLKSNKQNGNVNV-DVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           G+LRQK+ Y +RSG KSNK NGNVNV +  AE+EIKSP+I+K+EH ++  V  I++NC +
Sbjct: 181 GVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTE 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T RD+ + ++IHVI D RS DA  VNVE E DS G  R+SL RML+W RK+AK+PA+PS 
Sbjct: 241 TPRDNGKTNQIHVIGDCRSSDA--VNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G +P SSKWK +   +A WLQVI+AKDA+L RKDVDK AEK LLIQKKV+MHP IYED+ 
Sbjct: 301 GTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNI 360

Query: 361 -GSHHLSTERIRC-------GKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGD 420
             +HHLSTERI C        KS  V+C++SCP VQ+N +GS TTEIGK   NQ LLNGD
Sbjct: 361 DDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGD 420

Query: 421 LSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIK 480
           L+SE+ D+Q NEDSV+K VPVGA FQAV+PEWTGNISDSDSKWLG RSWPS+H N+ S+ 
Sbjct: 421 LASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVS 480

Query: 481 NRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWT 540
           +RNPI RGR D C CQFP SVEC+RFHIAEARMRLKLELGLTF+DWRFH MGEEISLQWT
Sbjct: 481 DRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540

Query: 541 SEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPN 600
           +EEE +FKEL +SSFNNQ++CFWN SL+ FP KSRK LISYYFNVFL+RQRSYQNRVTPN
Sbjct: 541 AEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600

Query: 601 SIDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQF 630
            IDSD EDVEFGC+SGDFG KAME+LG+K +ECSEN+QF
Sbjct: 601 DIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQF 636

BLAST of Sed0009983 vs. NCBI nr
Match: XP_022136609.1 (AT-rich interactive domain-containing protein 2 [Momordica charantia])

HSP 1 Score: 879.0 bits (2270), Expect = 2.4e-251
Identity = 445/642 (69.31%), Positives = 519/642 (80.84%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCH-SNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKI 60
           MG+WPISSN S+LDC +KD+DP + SNG  I+ DCLVEES+ +VDY DCKA +R SFEKI
Sbjct: 1   MGRWPISSNASILDC-SKDIDPSYSSNGCCIAPDCLVEESYASVDYGDCKAALRCSFEKI 60

Query: 61  LSVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGL 120
           LS FLKEIGRRGIVRPVPALLGEGGSLDLFELFM+VR+KGGY VVSE +LWSSVVVELGL
Sbjct: 61  LSFFLKEIGRRGIVRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGL 120

Query: 121 DLQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVY 180
           DL+LSASVKL+YSKYLSELEKWLM R GG KL NGNS YH E SFPF SEL  KIKGM+Y
Sbjct: 121 DLELSASVKLVYSKYLSELEKWLMVRCGGKKLENGNSDYHCEKSFPFSSELAAKIKGMLY 180

Query: 181 GLLRQKNAYHDRSGLKSNKQNGNVNV-DVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           G+L+QK+ Y + SG  S+KQ GN+NV     E++IK P + K+E  + G V + ++  + 
Sbjct: 181 GVLKQKSLYDECSGFISSKQIGNINVASAAVEKKIKLPEVNKKEPDLNGSVTQSQEKLSK 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T +DDD  + I V ED RSL    VNVE + DS    R+SLLRML+WAR+ AK PADPS 
Sbjct: 241 TPQDDDGIEHIRVNEDCRSL--ASVNVETKTDSCESCRESLLRMLKWARQIAKYPADPSN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G IP  SKWK +  NN FWLQV+RAKDA+LIRKDV++N EK LL QKKVK+HPSIYED+ 
Sbjct: 301 GTIPGPSKWKEYTSNNTFWLQVVRAKDALLIRKDVEENNEKRLL-QKKVKVHPSIYEDNI 360

Query: 361 GSHHLSTERIRCG-------KSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDL 420
            +HHLSTERIRC        KS LVSC+SSCP V +NL+ SPTTEIGK  D Q  +NGDL
Sbjct: 361 DNHHLSTERIRCSKRFNALVKSILVSCNSSCPTVGSNLISSPTTEIGKGLDKQAFMNGDL 420

Query: 421 SSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIKN 480
            SE +D Q NEDSV+K V VG+ FQAVVPEWTG IS+SDSKWLG RSW  +H NS+S+ +
Sbjct: 421 PSERIDTQQNEDSVEKPVLVGSSFQAVVPEWTGEISESDSKWLGTRSWTFQHENSNSVSD 480

Query: 481 RNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTS 540
           RNPIGRGRPDSC C +P SVECFRFHIAEAR+RLKLELGL F++WRFHHMGEEISLQWT+
Sbjct: 481 RNPIGRGRPDSCRCWYPGSVECFRFHIAEARLRLKLELGLAFYEWRFHHMGEEISLQWTT 540

Query: 541 EEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNS 600
           EEEK FKEL  SSFN+QSKCFWN S++ FP KSRK LISYYFNVF++RQRSYQNRVTPNS
Sbjct: 541 EEEKTFKELAKSSFNSQSKCFWNYSMKWFPMKSRKNLISYYFNVFVLRQRSYQNRVTPNS 600

Query: 601 IDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE 634
           +DSDD++VEFGC+SGDFG KAM++LGTKSLECSEN+Q  D+E
Sbjct: 601 VDSDDDEVEFGCLSGDFGGKAMQVLGTKSLECSENKQCTDVE 638

BLAST of Sed0009983 vs. NCBI nr
Match: KAG7015291.1 (AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 878.6 bits (2269), Expect = 3.2e-251
Identity = 449/642 (69.94%), Positives = 516/642 (80.37%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+W +SSN S+LDC NKDVDP  SNG  I+SDCLVE S+ NVDYDDCKA +R  FEKIL
Sbjct: 1   MGRWHVSSNASILDC-NKDVDPNPSNGCCIASDCLVEGSYENVDYDDCKARIRRYFEKIL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKEIGRRG VRP+PAL+GEGG+LDLFELF++VR+KGG  VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYG 180
           L LSASVKLIYSKYLS+LEKWLM R G TKL NG+S Y Y+ S PFLSEL  KI GM+YG
Sbjct: 121 LGLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYG 180

Query: 181 LLRQKNAYHDRSGLKSNKQNGNVNVDVTA--EEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           + RQ + Y +  G KSNKQNGNVNV   A  E+EIK P IKK+EH ++G V  I+++C +
Sbjct: 181 VPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTPIQQDCTE 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T         IHVIED +SLDA  VNVE EI+SLG+YR+SLLRML+W RK+AK+P DP  
Sbjct: 241 T-------HPIHVIEDGQSLDA--VNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G IP +S+WKG+  ++A WLQVIRAKDA+LIRK VDK AEK LLIQKKVKMHPSIYED  
Sbjct: 301 GTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDYI 360

Query: 361 GSHHLSTERIRCGK-------SALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDL 420
            +HHLSTERI C K       S L  CS+SCP V++N + S TTE+GK   NQ +LNGD+
Sbjct: 361 DNHHLSTERISCSKRSKASTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDI 420

Query: 421 SSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIKN 480
            SE+ DD PNEDS +++VPVGAL QA +PEWTGN SDSDSKWLG RSWP +HRNS+S+++
Sbjct: 421 PSEMEDDHPNEDSAEETVPVGALCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRD 480

Query: 481 RNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTS 540
           R  IGRGRPDSCGCQFP SVECFRFHIAEARMRLKLELG TFF WRFH MGEEISLQWT 
Sbjct: 481 RRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTV 540

Query: 541 EEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNS 600
           EEEK+FKEL MS FNN ++CFW+ SLR FP KSRK LISYYFNVFL+R RSYQNRVTPNS
Sbjct: 541 EEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNS 600

Query: 601 IDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE 634
           IDSDDED EFG VSG FG+KAMEILG+KSLECS NRQ  D+E
Sbjct: 601 IDSDDEDFEFGRVSGGFGDKAMEILGSKSLECSINRQVTDVE 632

BLAST of Sed0009983 vs. ExPASy Swiss-Prot
Match: Q9LDD4 (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=ARID2 PE=1 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 1.0e-90
Identity = 229/626 (36.58%), Positives = 330/626 (52.72%), Query Frame = 0

Query: 27  GFFISSDCLVEESHVNVD---YDDCKATVRSSFEKILSVFLKEIGRRGIVRPVPALLGEG 86
           G F  + C    S+V+V+    D+C+  +R  F++ L VFL+E    G ++P+PA++G+G
Sbjct: 2   GSFNDTSC----SYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDG 61

Query: 87  GSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLDLQLSASVKLIYSKYLSELEKWLM 146
            ++DLF+LF++VRE+ G+  VS  +LW  V  +LG D  L  S+ LIY KYL+ +EKW +
Sbjct: 62  KNVDLFKLFVLVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAV 121

Query: 147 KRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYGLLRQKNAYHDRSGLKSNKQNGNV 206
           +    ++++N ++           SE +G   GM++ L          +G KS   NG  
Sbjct: 122 EE---SRIVNWDNKD---------SEKKGCYSGMLHEL---------GNGFKSLLDNG-- 181

Query: 207 NVDVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCNDTLR--------DDDEKDRIHVIEDD 266
                        + +K   ++  G   ++++C++  R        DDD+K         
Sbjct: 182 -------------KCQKRNRAVAFGCNHMEESCSEFDRSRKRFRESDDDDKGVGLSSVVI 241

Query: 267 RSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPSTGIIPKSSKWKGFDDNNA 326
           R    VC   E   D     R  L  ML+W    A +P DP+ G+IP SSKWK ++ N  
Sbjct: 242 REETVVCAVEEGLSDFSLEKRDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKC 301

Query: 327 FWLQVIRAKDAVLIRKDVDKNAEKDLLIQ----KKVKMHPSIYEDDFGSHHLSTERIRCG 386
            WLQV RAK+++L+++D   NAE           +   HPS+YEDD  S       IR  
Sbjct: 302 -WLQVARAKNSLLVQRD---NAELRYRYHPFRGHQNIHHPSMYEDDRKSIGRLRYSIR-P 361

Query: 387 KSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDLSSEIVDDQPNEDSVDKSVP- 446
            +    CSSSC     + L S +     K    T++  + +                +P 
Sbjct: 362 PNLSKHCSSSC--CNGSSLVSLSKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPR 421

Query: 447 ----VGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNS-HSIKNRNPIGRGRPDSCGC 506
               VG   QA V EWT +  DSDSKWLG R WP ++  +       + +G+GRPDSC C
Sbjct: 422 RCIKVGHQHQAQVDEWTESGVDSDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSC 481

Query: 507 QFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTSEEEKKFKELTMSSF 566
           +    VEC R HIAE RM LK ELG  FF WRF+ MGEE+ L+WT EEEK+FK++ ++  
Sbjct: 482 ELSGFVECTRLHIAEKRMELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA-- 541

Query: 567 NNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNSIDSDDEDVEFGCVS 626
               + FW ++ + FPKK R+ L+SYYFNVFLI +R YQNRVTP SIDSDDE   FG V 
Sbjct: 542 --DPQSFWTNAAKNFPKKKREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGA-FGSVG 572

Query: 627 GDFGEKAMEILGTKSLECSENRQFAD 632
           G FG  A+   G+  + C++NRQ  D
Sbjct: 602 GSFGRDAVTSSGSDVMICAQNRQCED 572

BLAST of Sed0009983 vs. ExPASy Swiss-Prot
Match: Q84JT7 (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=ARID1 PE=2 SV=1)

HSP 1 Score: 246.5 bits (628), Expect = 8.0e-64
Identity = 184/551 (33.39%), Positives = 270/551 (49.00%), Query Frame = 0

Query: 54  SSFEKILSVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSV 113
           S F  +L  FL E        P+PA+ GEG ++DLF LF+ V  KGG+  VSEN  W  V
Sbjct: 47  SLFRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEV 106

Query: 114 VVELGLDLQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGK 173
           V E GL+   SAS KLIY KYL    +WL       +++ G++          +S +E  
Sbjct: 107 VQESGLESYDSASAKLIYVKYLDAFGRWL------NRVVAGDTD---------VSSVE-- 166

Query: 174 IKGMVYGLLRQKNAYHDRSGLKSNKQNGNVNVDVTAEEEIKSPRIKKEEHSIYGGVEEIK 233
           + G+   L+ + N +      K   + G    ++ AE +    + K+     + G E   
Sbjct: 167 LSGISDALVARLNGFLSEVKKKYELRKGRPAKELGAELKWFISKTKRRYDKHHVGKESAS 226

Query: 234 KNCNDTLRDDDEKDRIHVIEDDRSLDAVCV--NVEKEIDSLG-RYRKSLLRMLRWARKSA 293
              ND +++            +R L+ + +  +V +E  S G R R+  L  L+W    A
Sbjct: 227 ---NDAVKEFQGSKLA-----ERRLEQIMILESVTQECSSPGKRKRECPLETLKWLSDVA 286

Query: 294 KNPADPSTGIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMH 353
           K+P DPS GI+P  S+W  +      W Q++  + +   R + D   EK    QK  KMH
Sbjct: 287 KDPCDPSLGIVPDRSEWVSYGSEEP-WKQLLLFRAS---RTNNDSACEKTW--QKVQKMH 346

Query: 354 PSIYEDDFGSHHLSTERIRCGKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGD 413
           P +Y+D  G+ +   ER+                         + E  K+   +T    D
Sbjct: 347 PCLYDDSAGASYNLRERL-------------------------SYEDYKR--GKTGNGSD 406

Query: 414 LSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWP--SKHRNSHS 473
           + S   +D+P          VG+ FQA VPEWTG   +SDSKWLG R WP   +   ++ 
Sbjct: 407 IGSSDEEDRP-------CALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANL 466

Query: 474 IKNRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQ 533
           +  R+ IG+GR D CGC  P S+EC +FHI   R +LKLELG  F+ W F  MGE     
Sbjct: 467 LIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQY 526

Query: 534 WTSEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVT 593
           WT  E KK K L MSS  + S  F + +    P KSR  ++SY++NV L++ R+ Q+R+T
Sbjct: 527 WTDLELKKIKSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRIT 531

Query: 594 PNSIDSDDEDV 600
           P+ IDSD + +
Sbjct: 587 PHDIDSDTDQI 531

BLAST of Sed0009983 vs. ExPASy TrEMBL
Match: A0A1S3BSW2 (AT-rich interactive domain-containing protein 2 OS=Cucumis melo OX=3656 GN=LOC103493169 PE=4 SV=1)

HSP 1 Score: 919.8 bits (2376), Expect = 6.0e-264
Identity = 464/642 (72.27%), Positives = 536/642 (83.49%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+WPISSNDS+LDC NKDVDP  SNG+ I+ DCLVE S  NVD+DDCKAT+R  FEKIL
Sbjct: 1   MGRWPISSNDSILDC-NKDVDPNPSNGYCIAPDCLVEGSRANVDHDDCKATIRCYFEKIL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKEI RRG +RPVPALLGEGGSLDLFELFM+VR+KGGY VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKEICRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSS-YHYEISFPFLSELEGKIKGMVY 180
           L LSASVKLIY KYLSELEKWLM RRGGTKL NGNS  Y+Y  SFP L+ELE KIK M+Y
Sbjct: 121 LGLSASVKLIYFKYLSELEKWLMVRRGGTKLENGNSDYYYYRKSFPCLAELEAKIKDMLY 180

Query: 181 GLLRQKNAYHDRSGLKSNKQNGNVNV-DVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           G+LRQK+ Y +R G KSNK NGNVNV +  AE+EIK P+I+K+EH ++  V  I++NC +
Sbjct: 181 GVLRQKSIYDERPGFKSNKPNGNVNVAETAAEKEIKFPKIEKKEHDLHEDVTPIQQNCTE 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T R + E ++IHVI D RSLDA  VNVE E DS GR R+SLLRML+W RK+AK+PA+PS 
Sbjct: 241 TPRVNGETNQIHVIGDCRSLDA--VNVETETDSHGRSRESLLRMLKWVRKTAKHPANPSN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G +P+SSKWK +  ++A WLQVI+AKDA+L RKDVDK AEK LLIQKKV+MHP IYED+ 
Sbjct: 301 GTVPESSKWKAYASDDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNI 360

Query: 361 -GSHHLSTERIRC-------GKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGD 420
             +HHLSTERI C        KS LV+ ++SCP V++N +GS TTEIGK   NQ LLNGD
Sbjct: 361 DDNHHLSTERICCSRRSNALAKSELVASNNSCPPVRSNQIGSLTTEIGKGLKNQALLNGD 420

Query: 421 LSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIK 480
           L+SE+ D+Q NEDSV+K VPVGALFQA +PEWTGNISDSDSKWLG R WPS+H N+ S+ 
Sbjct: 421 LASEMEDNQANEDSVEKPVPVGALFQAAIPEWTGNISDSDSKWLGTRLWPSQHENNKSVS 480

Query: 481 NRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWT 540
           NRNPIGRGR DSC CQFP SVEC+RFHIAEARMRLKLELGLTF+DWRFH MGEEISLQWT
Sbjct: 481 NRNPIGRGRLDSCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540

Query: 541 SEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPN 600
           +EEEK+FKEL +SSFNNQ++CFWN SL+ FP KSRK LISYYFNVFL+RQRSYQNRVTPN
Sbjct: 541 AEEEKRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600

Query: 601 SIDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADL 633
            IDSDDEDVEFGC+SGDFG KAMEILG+KS+ECSEN+QF D+
Sbjct: 601 DIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENKQFIDI 639

BLAST of Sed0009983 vs. ExPASy TrEMBL
Match: A0A0A0KZM1 (ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G047920 PE=4 SV=1)

HSP 1 Score: 892.5 bits (2305), Expect = 1.0e-255
Identity = 450/639 (70.42%), Positives = 525/639 (82.16%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+WPISSNDS+LDC NKDVDP  S G+ I+ DCLVE S  NVD+DDCKAT+R  FEK+L
Sbjct: 1   MGRWPISSNDSILDC-NKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKE  RRG +RPVPALLGEG SLDLFELFM+VR+KGGY VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSS-YHYEISFPFLSELEGKIKGMVY 180
           L LSASVKLIY KYLS+LEKWLM RRGGTKL NGNS  Y+Y  +FP L+ELE KIK ++Y
Sbjct: 121 LGLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILY 180

Query: 181 GLLRQKNAYHDRSGLKSNKQNGNVNV-DVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           G+LRQK+ Y +RSG KSNK NGNVNV +  AE+EIKSP+I+K+EH ++  V  I++NC +
Sbjct: 181 GVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTE 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T RD+ + ++IHVI D RS DA  VNVE E DS G  R+SL RML+W RK+AK+PA+PS 
Sbjct: 241 TPRDNGKTNQIHVIGDCRSSDA--VNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G +P SSKWK +   +A WLQVI+AKDA+L RKDVDK AEK LLIQKKV+MHP IYED+ 
Sbjct: 301 GTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNI 360

Query: 361 -GSHHLSTERIRC-------GKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGD 420
             +HHLSTERI C        KS  V+C++SCP VQ+N +GS TTEIGK   NQ LLNGD
Sbjct: 361 DDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGD 420

Query: 421 LSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIK 480
           L+SE+ D+Q NEDSV+K VPVGA FQAV+PEWTGNISDSDSKWLG RSWPS+H N+ S+ 
Sbjct: 421 LASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVS 480

Query: 481 NRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWT 540
           +RNPI RGR D C CQFP SVEC+RFHIAEARMRLKLELGLTF+DWRFH MGEEISLQWT
Sbjct: 481 DRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540

Query: 541 SEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPN 600
           +EEE +FKEL +SSFNNQ++CFWN SL+ FP KSRK LISYYFNVFL+RQRSYQNRVTPN
Sbjct: 541 AEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600

Query: 601 SIDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQF 630
            IDSD EDVEFGC+SGDFG KAME+LG+K +ECSEN+QF
Sbjct: 601 DIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQF 636

BLAST of Sed0009983 vs. ExPASy TrEMBL
Match: A0A6J1C4T3 (AT-rich interactive domain-containing protein 2 OS=Momordica charantia OX=3673 GN=LOC111008274 PE=4 SV=1)

HSP 1 Score: 879.0 bits (2270), Expect = 1.2e-251
Identity = 445/642 (69.31%), Positives = 519/642 (80.84%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCH-SNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKI 60
           MG+WPISSN S+LDC +KD+DP + SNG  I+ DCLVEES+ +VDY DCKA +R SFEKI
Sbjct: 1   MGRWPISSNASILDC-SKDIDPSYSSNGCCIAPDCLVEESYASVDYGDCKAALRCSFEKI 60

Query: 61  LSVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGL 120
           LS FLKEIGRRGIVRPVPALLGEGGSLDLFELFM+VR+KGGY VVSE +LWSSVVVELGL
Sbjct: 61  LSFFLKEIGRRGIVRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGL 120

Query: 121 DLQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVY 180
           DL+LSASVKL+YSKYLSELEKWLM R GG KL NGNS YH E SFPF SEL  KIKGM+Y
Sbjct: 121 DLELSASVKLVYSKYLSELEKWLMVRCGGKKLENGNSDYHCEKSFPFSSELAAKIKGMLY 180

Query: 181 GLLRQKNAYHDRSGLKSNKQNGNVNV-DVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           G+L+QK+ Y + SG  S+KQ GN+NV     E++IK P + K+E  + G V + ++  + 
Sbjct: 181 GVLKQKSLYDECSGFISSKQIGNINVASAAVEKKIKLPEVNKKEPDLNGSVTQSQEKLSK 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T +DDD  + I V ED RSL    VNVE + DS    R+SLLRML+WAR+ AK PADPS 
Sbjct: 241 TPQDDDGIEHIRVNEDCRSL--ASVNVETKTDSCESCRESLLRMLKWARQIAKYPADPSN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G IP  SKWK +  NN FWLQV+RAKDA+LIRKDV++N EK LL QKKVK+HPSIYED+ 
Sbjct: 301 GTIPGPSKWKEYTSNNTFWLQVVRAKDALLIRKDVEENNEKRLL-QKKVKVHPSIYEDNI 360

Query: 361 GSHHLSTERIRCG-------KSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDL 420
            +HHLSTERIRC        KS LVSC+SSCP V +NL+ SPTTEIGK  D Q  +NGDL
Sbjct: 361 DNHHLSTERIRCSKRFNALVKSILVSCNSSCPTVGSNLISSPTTEIGKGLDKQAFMNGDL 420

Query: 421 SSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIKN 480
            SE +D Q NEDSV+K V VG+ FQAVVPEWTG IS+SDSKWLG RSW  +H NS+S+ +
Sbjct: 421 PSERIDTQQNEDSVEKPVLVGSSFQAVVPEWTGEISESDSKWLGTRSWTFQHENSNSVSD 480

Query: 481 RNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTS 540
           RNPIGRGRPDSC C +P SVECFRFHIAEAR+RLKLELGL F++WRFHHMGEEISLQWT+
Sbjct: 481 RNPIGRGRPDSCRCWYPGSVECFRFHIAEARLRLKLELGLAFYEWRFHHMGEEISLQWTT 540

Query: 541 EEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNS 600
           EEEK FKEL  SSFN+QSKCFWN S++ FP KSRK LISYYFNVF++RQRSYQNRVTPNS
Sbjct: 541 EEEKTFKELAKSSFNSQSKCFWNYSMKWFPMKSRKNLISYYFNVFVLRQRSYQNRVTPNS 600

Query: 601 IDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE 634
           +DSDD++VEFGC+SGDFG KAM++LGTKSLECSEN+Q  D+E
Sbjct: 601 VDSDDDEVEFGCLSGDFGGKAMQVLGTKSLECSENKQCTDVE 638

BLAST of Sed0009983 vs. ExPASy TrEMBL
Match: A0A6J1ETI2 (AT-rich interactive domain-containing protein 2-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111437591 PE=4 SV=1)

HSP 1 Score: 878.2 bits (2268), Expect = 2.0e-251
Identity = 447/642 (69.63%), Positives = 516/642 (80.37%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+W +SSN S+LDC NKDVDP  SNG  I+SDCLVE S+ NVDYDDCKA +R  FEKIL
Sbjct: 1   MGRWHVSSNASILDC-NKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKIL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKEIGRRG VRP+PAL+GEGG+LDLFELF++VR+KGG  VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYG 180
           L LSASVKLIYSKYLS+LEKWLM R G TKL NG+S Y Y+ S PFLSEL  KI GM+YG
Sbjct: 121 LGLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYG 180

Query: 181 LLRQKNAYHDRSGLKSNKQNGNVNVDVTA--EEEIKSPRIKKEEHSIYGGVEEIKKNCND 240
           + RQ + Y +  G KSNKQNGNVNV   A  E+EIK P IKK+EH ++G V  I+++C +
Sbjct: 181 VPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTE 240

Query: 241 TLRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPST 300
           T         IHVIED +SLDA  VNVE EI+SLG+YR+SLLRML+W RK+AK+P DP  
Sbjct: 241 T-------HPIHVIEDGQSLDA--VNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLN 300

Query: 301 GIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDF 360
           G IP +S+WKG+  ++A WLQVIRAKDA+LIRK VDK AEK LLIQKKVKMHPSIYED+ 
Sbjct: 301 GTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNI 360

Query: 361 GSHHLSTERIRCGK-------SALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDL 420
            +HHLSTERI C K       S L  CS+SCP V++N + S TTE+GK   NQ +LNGD+
Sbjct: 361 DNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDI 420

Query: 421 SSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIKN 480
            SE+ DD PNEDS +++VPVGA+ QA +PEWTGN SDSDSKWLG RSWP +HRNS+S+++
Sbjct: 421 PSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRD 480

Query: 481 RNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTS 540
           R  IGRGRPDSCGCQFP SVECFRFHIAEARMRLKLELG TFF WRFH MGEEISLQWT 
Sbjct: 481 RRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTV 540

Query: 541 EEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNS 600
           EEEK+FKEL MS FNN ++CFW+ SLR FP KSRK LISYYFNVFL+R RSYQNRVTPNS
Sbjct: 541 EEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNS 600

Query: 601 IDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE 634
           IDSDDED EFG VSG FG+KAMEILG+ SLECS NRQ  D+E
Sbjct: 601 IDSDDEDFEFGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 632

BLAST of Sed0009983 vs. ExPASy TrEMBL
Match: A0A6J1J644 (AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482924 PE=4 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 2.7e-248
Identity = 443/642 (69.00%), Positives = 515/642 (80.22%), Query Frame = 0

Query: 1   MGKWPISSNDSVLDCNNKDVDPCHSNGFFISSDCLVEESHVNVDYDDCKATVRSSFEKIL 60
           MG+W +SSN S+LDC NKDVDP  SNG  I+SDCLVE ++ NVDYDDCKA +R  FEKIL
Sbjct: 1   MGRWHVSSNASILDC-NKDVDPNPSNGCCIASDCLVEGTYANVDYDDCKARIRCYFEKIL 60

Query: 61  SVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLD 120
            VFLKEIGRRG VRP+PAL+GEGG+LDLFELF++VR+KGG  VVSE +LWSSVVVELGLD
Sbjct: 61  WVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLD 120

Query: 121 LQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYG 180
           L LSASVKLIYSKYLS+LEKWLM R G TKL NG+S Y Y+ S PFLSEL  KI GM+YG
Sbjct: 121 LGLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYG 180

Query: 181 LLRQKNAYHDRSGLKSNKQNGNVNVDVTA-EEEIKSPRIKKEEHSIYGGVEEIKKNCNDT 240
           + RQ + Y +  G KSNKQNGNVNV   A E+EIK   IKK+EH ++G V  I+++C +T
Sbjct: 181 VPRQNSIYDECFGFKSNKQNGNVNVAAAAVEKEIKFSEIKKKEHDLHGDVTPIQQDCTET 240

Query: 241 LRDDDEKDRIHVIEDDRSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPSTG 300
                    IHVIED +SLDA  VNVE EI+SLG+YR+SLLRML+W RK+AK+P DP  G
Sbjct: 241 -------HPIHVIEDGQSLDA--VNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNG 300

Query: 301 IIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMHPSIYEDDFG 360
            I  +S+WKG+  ++A WLQVI AKDA+LIRK VDK AEK LLIQKKVKMHPSIYED+  
Sbjct: 301 TILGASRWKGYSSDDALWLQVISAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNID 360

Query: 361 SHHLSTERIRCGK-------SALVSCSSSCPAVQNNLLGSP-TTEIGKKFDNQTLLNGDL 420
           +H LSTERI C K       S   +CS+SCP V++N + S  TTE+GK   NQ +LNGD+
Sbjct: 361 NHRLSTERISCSKRFKASTESVFATCSNSCPTVRSNCISSSLTTEVGKGLKNQAVLNGDI 420

Query: 421 SSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNSHSIKN 480
            SE+ DD PNEDS +++VPVGAL QA +PEWTGN SDSDSKWLG R WP +HRNS+S+++
Sbjct: 421 PSEMEDDHPNEDSAEETVPVGALCQADLPEWTGNNSDSDSKWLGTRLWPLQHRNSNSVRD 480

Query: 481 RNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTS 540
           R  IGRGRPDSCGCQFP SVECFRFHIAEARMRLKLELG TFF WRFH MGEEISLQWT+
Sbjct: 481 RRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTA 540

Query: 541 EEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNS 600
           EEEK+FKEL MSSFNN ++CFW+ SLR FP KSRK LISYYFNVFL+R RSYQNRVTPNS
Sbjct: 541 EEEKRFKELAMSSFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNS 600

Query: 601 IDSDDEDVEFGCVSGDFGEKAMEILGTKSLECSENRQFADLE 634
           IDSDDED EFGCVSG FG+KAME+LG+KSLECS NRQ  D+E
Sbjct: 601 IDSDDEDFEFGCVSGGFGDKAMEVLGSKSLECSINRQVTDVE 632

BLAST of Sed0009983 vs. TAIR 10
Match: AT4G11400.1 (ARID/BRIGHT DNA-binding domain; ELM2 domain protein)

HSP 1 Score: 335.9 bits (860), Expect = 7.1e-92
Identity = 229/626 (36.58%), Positives = 330/626 (52.72%), Query Frame = 0

Query: 27  GFFISSDCLVEESHVNVD---YDDCKATVRSSFEKILSVFLKEIGRRGIVRPVPALLGEG 86
           G F  + C    S+V+V+    D+C+  +R  F++ L VFL+E    G ++P+PA++G+G
Sbjct: 2   GSFNDTSC----SYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDG 61

Query: 87  GSLDLFELFMIVREKGGYLVVSENQLWSSVVVELGLDLQLSASVKLIYSKYLSELEKWLM 146
            ++DLF+LF++VRE+ G+  VS  +LW  V  +LG D  L  S+ LIY KYL+ +EKW +
Sbjct: 62  KNVDLFKLFVLVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAV 121

Query: 147 KRRGGTKLLNGNSSYHYEISFPFLSELEGKIKGMVYGLLRQKNAYHDRSGLKSNKQNGNV 206
           +    ++++N ++           SE +G   GM++ L          +G KS   NG  
Sbjct: 122 EE---SRIVNWDNKD---------SEKKGCYSGMLHEL---------GNGFKSLLDNG-- 181

Query: 207 NVDVTAEEEIKSPRIKKEEHSIYGGVEEIKKNCNDTLR--------DDDEKDRIHVIEDD 266
                        + +K   ++  G   ++++C++  R        DDD+K         
Sbjct: 182 -------------KCQKRNRAVAFGCNHMEESCSEFDRSRKRFRESDDDDKGVGLSSVVI 241

Query: 267 RSLDAVCVNVEKEIDSLGRYRKSLLRMLRWARKSAKNPADPSTGIIPKSSKWKGFDDNNA 326
           R    VC   E   D     R  L  ML+W    A +P DP+ G+IP SSKWK ++ N  
Sbjct: 242 REETVVCAVEEGLSDFSLEKRDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKC 301

Query: 327 FWLQVIRAKDAVLIRKDVDKNAEKDLLIQ----KKVKMHPSIYEDDFGSHHLSTERIRCG 386
            WLQV RAK+++L+++D   NAE           +   HPS+YEDD  S       IR  
Sbjct: 302 -WLQVARAKNSLLVQRD---NAELRYRYHPFRGHQNIHHPSMYEDDRKSIGRLRYSIR-P 361

Query: 387 KSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGDLSSEIVDDQPNEDSVDKSVP- 446
            +    CSSSC     + L S +     K    T++  + +                +P 
Sbjct: 362 PNLSKHCSSSC--CNGSSLVSLSKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPR 421

Query: 447 ----VGALFQAVVPEWTGNISDSDSKWLGMRSWPSKHRNS-HSIKNRNPIGRGRPDSCGC 506
               VG   QA V EWT +  DSDSKWLG R WP ++  +       + +G+GRPDSC C
Sbjct: 422 RCIKVGHQHQAQVDEWTESGVDSDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSC 481

Query: 507 QFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQWTSEEEKKFKELTMSSF 566
           +    VEC R HIAE RM LK ELG  FF WRF+ MGEE+ L+WT EEEK+FK++ ++  
Sbjct: 482 ELSGFVECTRLHIAEKRMELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA-- 541

Query: 567 NNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNSIDSDDEDVEFGCVS 626
               + FW ++ + FPKK R+ L+SYYFNVFLI +R YQNRVTP SIDSDDE   FG V 
Sbjct: 542 --DPQSFWTNAAKNFPKKKREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGA-FGSVG 572

Query: 627 GDFGEKAMEILGTKSLECSENRQFAD 632
           G FG  A+   G+  + C++NRQ  D
Sbjct: 602 GSFGRDAVTSSGSDVMICAQNRQCED 572

BLAST of Sed0009983 vs. TAIR 10
Match: AT2G46040.1 (ARID/BRIGHT DNA-binding domain; ELM2 domain protein)

HSP 1 Score: 246.5 bits (628), Expect = 5.7e-65
Identity = 184/551 (33.39%), Positives = 270/551 (49.00%), Query Frame = 0

Query: 54  SSFEKILSVFLKEIGRRGIVRPVPALLGEGGSLDLFELFMIVREKGGYLVVSENQLWSSV 113
           S F  +L  FL E        P+PA+ GEG ++DLF LF+ V  KGG+  VSEN  W  V
Sbjct: 47  SLFRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEV 106

Query: 114 VVELGLDLQLSASVKLIYSKYLSELEKWLMKRRGGTKLLNGNSSYHYEISFPFLSELEGK 173
           V E GL+   SAS KLIY KYL    +WL       +++ G++          +S +E  
Sbjct: 107 VQESGLESYDSASAKLIYVKYLDAFGRWL------NRVVAGDTD---------VSSVE-- 166

Query: 174 IKGMVYGLLRQKNAYHDRSGLKSNKQNGNVNVDVTAEEEIKSPRIKKEEHSIYGGVEEIK 233
           + G+   L+ + N +      K   + G    ++ AE +    + K+     + G E   
Sbjct: 167 LSGISDALVARLNGFLSEVKKKYELRKGRPAKELGAELKWFISKTKRRYDKHHVGKESAS 226

Query: 234 KNCNDTLRDDDEKDRIHVIEDDRSLDAVCV--NVEKEIDSLG-RYRKSLLRMLRWARKSA 293
              ND +++            +R L+ + +  +V +E  S G R R+  L  L+W    A
Sbjct: 227 ---NDAVKEFQGSKLA-----ERRLEQIMILESVTQECSSPGKRKRECPLETLKWLSDVA 286

Query: 294 KNPADPSTGIIPKSSKWKGFDDNNAFWLQVIRAKDAVLIRKDVDKNAEKDLLIQKKVKMH 353
           K+P DPS GI+P  S+W  +      W Q++  + +   R + D   EK    QK  KMH
Sbjct: 287 KDPCDPSLGIVPDRSEWVSYGSEEP-WKQLLLFRAS---RTNNDSACEKTW--QKVQKMH 346

Query: 354 PSIYEDDFGSHHLSTERIRCGKSALVSCSSSCPAVQNNLLGSPTTEIGKKFDNQTLLNGD 413
           P +Y+D  G+ +   ER+                         + E  K+   +T    D
Sbjct: 347 PCLYDDSAGASYNLRERL-------------------------SYEDYKR--GKTGNGSD 406

Query: 414 LSSEIVDDQPNEDSVDKSVPVGALFQAVVPEWTGNISDSDSKWLGMRSWP--SKHRNSHS 473
           + S   +D+P          VG+ FQA VPEWTG   +SDSKWLG R WP   +   ++ 
Sbjct: 407 IGSSDEEDRP-------CALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANL 466

Query: 474 IKNRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTFFDWRFHHMGEEISLQ 533
           +  R+ IG+GR D CGC  P S+EC +FHI   R +LKLELG  F+ W F  MGE     
Sbjct: 467 LIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQY 526

Query: 534 WTSEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVT 593
           WT  E KK K L MSS  + S  F + +    P KSR  ++SY++NV L++ R+ Q+R+T
Sbjct: 527 WTDLELKKIKSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRIT 531

Query: 594 PNSIDSDDEDV 600
           P+ IDSD + +
Sbjct: 587 PHDIDSDTDQI 531

BLAST of Sed0009983 vs. TAIR 10
Match: AT5G04110.1 (DNA GYRASE B3 )

HSP 1 Score: 127.5 bits (319), Expect = 3.8e-29
Identity = 76/205 (37.07%), Positives = 111/205 (54.15%), Query Frame = 0

Query: 408 NGDLSSEIVDD--QPNEDSVDKSVPVGALFQAVVPEWT---------GNISDSDS-KWLG 467
           N D+S++   D      +    ++P+G  FQA +P W          G+  DS++ +WLG
Sbjct: 338 NKDVSNKTSKDVITHGSNKTRPAIPIGPRFQAEIPVWIAPTKKGKFYGSPGDSNTLRWLG 397

Query: 468 MRSWP--SKHRNSHSIKNRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGLTF 527
              WP  S  +  HS K    +G GR DSC C  P S  C + H  EA+  L+ E+   F
Sbjct: 398 TGVWPTYSLKKTVHSKK----VGEGRSDSCSCASPRSTNCIKRHKKEAQELLEKEINRAF 457

Query: 528 FDWRFHHMGEEISLQ-WTSEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYY 587
             W F  MGEEI L+ WT++EE++F+ L   +  + S  FW  +   FP+KS+K L+SYY
Sbjct: 458 STWEFDQMGEEIVLKSWTAKEERRFEALVKKNPLSSSDGFWEFASNAFPQKSKKDLLSYY 517

Query: 588 FNVFLIRQRSYQNRVTPNSIDSDDE 598
           +NVFLI++         N+IDSDD+
Sbjct: 518 YNVFLIKRMRLLKSSAANNIDSDDD 538

BLAST of Sed0009983 vs. TAIR 10
Match: AT1G26580.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: ELM2 domain-containing protein (TAIR:AT2G03470.1); Has 161 Blast hits to 161 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 4; Plants - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 90.1 bits (222), Expect = 6.8e-18
Identity = 68/206 (33.01%), Positives = 101/206 (49.03%), Query Frame = 0

Query: 427 KSVPVGALFQAVVPEW----TGNISDS-------------DSKWLGMRSWPSKHRNSHSI 486
           K VP+G   QA +PEW    TGNI  S               K  G    P     + + 
Sbjct: 133 KQVPIGPGHQAEIPEWEGSQTGNIETSGMSVQNHISGCADGEKLFGTSVIPMPGLTTVAH 192

Query: 487 KNRNPIGRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELG-LTFFDWRFHHMGEEISLQ 546
            + + +G+GR   C C+   SV C   HI EAR  L    G  TF +     MGE+ +L+
Sbjct: 193 ID-DIVGKGR-KFCVCRDRDSVRCVCQHIKEAREELVKTFGNETFKELGLCEMGEKGALK 252

Query: 547 WTSEEEKKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVT 606
           W+ E+ + F E+  S+     + FW      F  +++K ++S+YFNVF++R+R+ QNR  
Sbjct: 253 WSDEDAQLFHEVVYSNPVTLGQNFWRHLEAAFCSRTQKEIVSFYFNVFVLRRRAIQNRAF 312

Query: 607 PNSIDSDDEDVEFGCVSGDFGEKAME 615
              IDSDD++   GC  G  G + +E
Sbjct: 313 ILDIDSDDDEWH-GCYGGSSGTRYVE 335

BLAST of Sed0009983 vs. TAIR 10
Match: AT2G03470.1 (ELM2 domain-containing protein )

HSP 1 Score: 89.0 bits (219), Expect = 1.5e-17
Identity = 49/124 (39.52%), Positives = 70/124 (56.45%), Query Frame = 0

Query: 476 GRGRPDSCGCQFPSSVECFRFHIAEARMRLKLELGL-TFFDWRFHHMGEEISLQWTSEEE 535
           G+GR + C C    S+ C R HI EAR  L   +G   F +     MGEE++  WT EEE
Sbjct: 174 GQGRKE-CLCLDKGSIRCVRRHIIEARESLVETIGYERFMELGLCEMGEEVASLWTEEEE 233

Query: 536 KKFKELTMSSFNNQSKCFWNSSLRCFPKKSRKILISYYFNVFLIRQRSYQNRVTPNSIDS 595
             F ++  S+  +  + FW      FP ++ K L+SYYFNVF++R+R  QNR     +DS
Sbjct: 234 DLFHKVVYSNPFSAGRDFWKQLKGTFPSRTMKELVSYYFNVFILRRRGIQNRFKALDVDS 293

Query: 596 DDED 599
           DD++
Sbjct: 294 DDDE 296
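The identity and positives percentages in the HSP summary lines above are simple ratios over the alignment length (for example, 184/551 for the AT2G46040.1 hit). The following is a minimal sketch, not part of the original record, that parses such a summary line and recomputes both percentages. The names `HSP_RE` and `parse_hsp` are hypothetical, and the pattern assumes the line layout used on this page.

```python
import re

# Pattern for an HSP summary line as printed in this report (assumed layout).
# "Posi?tives" also accepts the misspelled label "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(line):
    """Parse one HSP summary line and recompute its percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return {
        "identity_pct": identity_pct,
        "positives_pct": positives_pct,
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # True when the recomputed values agree with the printed ones.
        "consistent": (
            abs(identity_pct - float(ident_pct)) < 0.01
            and abs(positives_pct - float(pos_pct)) < 0.01
        ),
    }


if __name__ == "__main__":
    line = "Identity = 184/551 (33.39%), Positives = 270/551 (49.00%), Query Frame = 0"
    print(parse_hsp(line))
```

Recomputing the ratios this way is a quick check that a copied summary line was not corrupted during extraction.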

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_008452043.1 | 1.2e-263 | 72.27 | PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo]
XP_038893741.1 | 1.6e-258 | 71.07 | AT-rich interactive domain-containing protein 2 [Benincasa hispida]
XP_004146560.2 | 2.1e-255 | 70.42 | AT-rich interactive domain-containing protein 2 [Cucumis sativus] >KGN53331.1 hy...
XP_022136609.1 | 2.4e-251 | 69.31 | AT-rich interactive domain-containing protein 2 [Momordica charantia]
KAG7015291.1 | 3.2e-251 | 69.94 | AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma...

Match Name | E-value | Identity (%) | Description
Q9LDD4 | 1.0e-90 | 36.58 | AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 ...
Q84JT7 | 8.0e-64 | 33.39 | AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 ...

Match Name | E-value | Identity (%) | Description
A0A1S3BSW2 | 6.0e-264 | 72.27 | AT-rich interactive domain-containing protein 2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KZM1 | 1.0e-255 | 70.42 | ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G047920 PE=4 S...
A0A6J1C4T3 | 1.2e-251 | 69.31 | AT-rich interactive domain-containing protein 2 OS=Momordica charantia OX=3673 G...
A0A6J1ETI2 | 2.0e-251 | 69.63 | AT-rich interactive domain-containing protein 2-like isoform X3 OS=Cucurbita mos...
A0A6J1J644 | 2.7e-248 | 69.00 | AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita max...

Match Name | E-value | Identity (%) | Description
AT4G11400.1 | 7.1e-92 | 36.58 | ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT2G46040.1 | 5.7e-65 | 33.39 | ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT5G04110.1 | 3.8e-29 | 37.07 | DNA GYRASE B3
AT1G26580.1 | 6.8e-18 | 33.01 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT2G03470.1 | 1.5e-17 | 39.52 | ELM2 domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001606 | ARID DNA-binding domain | SMART | SM00501 | bright_3 | coord: 51..144; e-value: 5.9E-15; score: 65.7
IPR001606 | ARID DNA-binding domain | PFAM | PF01388 | ARID | coord: 55..139; e-value: 3.8E-9; score: 37.1
IPR001606 | ARID DNA-binding domain | PROSITE | PS51011 | ARID | coord: 50..143; score: 18.665234
None | No IPR available | SMART | SM01014 | ARID_2 | coord: 47..139; e-value: 1.0E-9; score: 48.3
None | No IPR available | PANTHER | PTHR46410 | AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2 | coord: 1..632
None | No IPR available | PANTHER | PTHR46410:SF2 | AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2 | coord: 1..632
None | No IPR available | CDD | cd16100 | ARID | coord: 53..139; e-value: 2.33943E-15; score: 69.694
IPR036431 | ARID DNA-binding domain superfamily | GENE3D | 1.10.150.60 | - | coord: 36..152; e-value: 3.5E-15; score: 57.5
IPR036431 | ARID DNA-binding domain superfamily | SUPERFAMILY | 46774 | ARID-like | coord: 47..148
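
The domain-level sources above place the ARID domain at slightly different coordinates on the 632-residue protein. As an illustrative sketch only (the functions `parse_coord` and `core_and_envelope` are hypothetical helpers, and the choice to exclude the whole-protein PANTHER family hits is an assumption), the span shared by every source and the span covered by any source can be derived from the `coord:` fields like this:

```python
import re

# Matches the "coord: START..END" field used in the InterPro listing.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' field."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError("no coord field in %r" % text)
    return int(match.group(1)), int(match.group(2))


def core_and_envelope(spans):
    """Return (core, envelope) for a list of (start, end) spans.

    core     = region covered by every span (None if they do not all overlap)
    envelope = smallest region containing every span
    """
    starts = [start for start, _ in spans]
    ends = [end for _, end in spans]
    core = (max(starts), min(ends))
    if core[0] > core[1]:
        core = None
    envelope = (min(starts), max(ends))
    return core, envelope


# ARID-domain hits copied from the listing; the two PANTHER rows (1..632)
# describe the whole protein family and are left out here by assumption.
arid_coords = [
    "coord: 51..144",  # SMART SM00501
    "coord: 55..139",  # PFAM PF01388
    "coord: 50..143",  # PROSITE PS51011
    "coord: 47..139",  # SMART SM01014
    "coord: 53..139",  # CDD cd16100
    "coord: 36..152",  # GENE3D 1.10.150.60
    "coord: 47..148",  # SUPERFAMILY 46774
]

spans = [parse_coord(c) for c in arid_coords]
core, envelope = core_and_envelope(spans)

if __name__ == "__main__":
    print("core:", core)
    print("envelope:", envelope)
```

With these inputs the shared core is residues 55..139 and the envelope is 36..152, both lying in the N-terminal part of the protein.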

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sed0009983.1 | Sed0009983.1 | mRNA
Sed0009983.2 | Sed0009983.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003677 | DNA binding