Clc11G00120 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc11G00120
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: squamous cell carcinoma antigen recognized by T-cells 3
Location: ClcChr11: 130692 .. 152799 (+)
RNA-Seq Expression: Clc11G00120
Synteny: Clc11G00120
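
The Location field can be split into chromosome, coordinates, and strand, and the genomic span computed from it. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr11: 130692 .. 152799 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 22108 bp on the plus strand of ClcChr11, introns included.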
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGGTTTGCGAATTTGTGCTATCCCTCTGGAATGATTGACAAACTTTCTTGTTTTATGTTTGGCCATTTTGAGAAGGGGAATCTGATGTTATGGGTTACTTTATGGTGGAGGAGGTGAATCTGAGTTCTCCATTTCGAGAAGGGGCAAAGTTTTTGTGGCATGCTAGTTTCTTTGTCGTTTTGTAGGCATTTGGCTCGAGAGGAATAACATGATCTTTAGAGAGGCAGAGAAATTGTGTGAGGAAGTTTGAGAGGCTCTTGATAAAAGCTTGGTTTCATCACTATATATATATATAGAAAATTGGAATATCAATCAGATAACATTTTTCATTTAATTTTGATTGGATGGATTTCTATTATAATATAAAACTTTTTCTTGATGTGTTAGTGTTATATTCATAACATGCTTCCTTCACTATTTCTTTACTTTTAAGATGTATGCAAATATTGTCATCTTTGCAACTTATGTGGCTGGCAGCTAATTTTGCAGGTGCATTCCTGCTAGCTGAATCCAAGCTTGGAGTTCTGTAGCACTCATTCCCTTTTACCTTTTTATGTGCAATAAAAAAAGTAAAGCTTTTGCTGGACAAAGTGCTTGAAGCTTGTGAATGTACTTGATGAGGTTGGGATTGAAATTTCCATCACAACTGAAATGTTTAATTTTATAATCTAGTGGTTCTAGAAATTCATTTTGCTCACGGCTAATGATAAGATTTGTTTTTATTTCATCTCATTTATGATTTTAACCGACATTGTTCCCTGAGCTAATGTCTAGCAGAATGATGAAACATAGTTGGAAATTAGAGTTGTTGCCGGAAGTACCCTTGTGTTTGCTTACCTCCAGGAAGCTTATTGGTAGCACGTAGTTGAAATATAATATGGTTGGAGGTTGTAGTTATTTGAAGCTGTTTTTCGATCAAGAGGCAAAGGTTCTACTCCGACTTCTCCATTCTTAAAACAGTAATTTTAATAACTAATATTGAGCTGTCATCTCAGTGCCTACTCACCGTAGTTCTTTTATCTTTTAGGCTTTACACCTGATAAGGCCAATGAGTAGCAAATTAAAAATTCTAAGTAGATGTACTAGATATTCAAACCAGCATAGTGGGATAAATTGTTATTAAGTTGGGTATGTGCTCTTAGTTGTTTGTATGTTTCTTTTATTTAAGGCTTCATTGCTTAGCAGTTTGTTTCTTAATGTAAAAACATCTGTGGGGAGAAATTTGAGAGGAGACGAAGCTGTTTACGTCTGGAAATCCAAACACAAGAAGAAACCATTGGTCAAGAACATGAATCTACTAGAAAAGTGTACAAAAAATGAAGAACTTCTGCAAATGGTCGCTGGACTTGACCTACAACATTATCAAAGGAGTCTACCGTTCGCAAAAGTTCCTTTTTTTGCCTTCATATCTAGATCTCGTAGTATGAAGAGTACGAATACATTCAAACTCTACTTAAAAATAGGTCGAATCATAGACATGATCATATCAAACAAGCTTTTCAACCTATTCAACAACCGACAACTCATAAAGAGATCTCTACTGTCGTGAGCCATTGCCTGGAATCGATTTTGCCCTTCTGCGGGGTAGGTAAAACTACCTAGGGCTTTTTTATGTCGTCTTCAATTGATGAGTGAACAACATGTGCGTGGATCAACTTAGATCTCTTGACCCGATCCAATAGAGGATCTGCAGGATGAAAAAGATCAACCACCCAGGGTGAGCTAGAGCAAGCAGGTGCAAGGGCAGCTTCAATGTTCTTAATCATATCATGTTAGAGCAAGTTAACCCCTACCCGGTGGTGGTTTGTCGATGCGTTCCCTTTTGACTGGAAGACTCATTGAACATTATCAAGTGTTATGTCAGATGTAATATAGTGTCTATGACATTTTTTACATAACGTGATTGACCTGGGCAACCCAATGGGTGACGAACTTTGACGGTTTTTGCCATTTGGGAAAAGAGAGTACTAAAAATAAGAGTAATTGCTAGTTTAGATC
AATAAAAAGAGCTTGTATTGAAGCATCTTGACAAAAATTGGATGAATGAGGACTTTGCCCAAACGCCTATGTCTAAAGCTTTCATTGCATGGTGAAGGGTTCGTTCTTATCAATATGGACTCTTTTATAAATACGACAGGAAAGCTTTAACATATATTCCTCTATAAAAGGTATTAACAACTCCTTCACAAAAATATTTCAAATATATATATATATATATATCCTTTCACAAAGGTAATGCAATCTTAATTAAACTTAGCTTGATAACATGTCGAAAGGGAGATTAACTTAAACTTCAGAGTGTATATGGTCCAGCACTACATCTATGTAGAGGTTTTATAGTATGTCTTTTAGGTATCGACTAGAGCTCGCCATGAATGTCATGTCAATGTTCAACCACGTTGAGAATTGTGCATCAACAATTTACAACATTCGATAACATTTTGATTTGTAGTTTTCTATTTTTTAAAATTTTGCTTAATTTCTTACCATTTATTTACTATAGTTTTCATCTTTCTTGCAAAAACATTTGAAATTCTAGCAAAATCTAAAAACAAAAACTTAAAAACTTCTTAGACCCCATTTGACAATTAGTTAGTTTTTGAAAATTAAGGTTATAAACACTGCTTTCATCTATAAATTTATTTGTTTGGTAATACTCTTTGTATCAATGTTTTCAAAAACCAAATAAAGTTTTGAAAACTAAAAAGAATAGTTAAAAAACTTATTCTTGTTTTCAAACATTGACCTATGAATTCAAATATTCATTTAAAAATAAAATAAAATAAAACCATAGTAAGGAAATTGAGAGTAAACAAGCATAAATTTAGAAAATATAAAAATAAAAGCAAAATTGCTATCAAACGGGACCATAGTTTTCAAGAACTTGGCTTTGACTTTGAAATTTGTTTTGGAAGGTAGATAACAGAATAAAAAACTTTTGTGTAGAAGTAGTATTTACAAGTATAGTTTTTGATAAATCAAGAAACAAAAAATCAAAACTATAACTTTTAACAAATATCTCATCTCGCTTTAGAATAGTTGCATTTGGTTTTTAAGTTTTCAAAATGTATCTTTCTAGTCCAAATTTTTTAAGAATAGGTTTGAAAGGTTACTAGAGTATTTTTTTTTTTTTGTGTTATTTTATATAATCATATGACATGTTGAATTGCAAAAATAAGTTAAAAATATGGTTGTTTTCAAATAGAAAGAGCGATAGGATTCTATTACGGTCTATCGCTATAATTTTTCTATATTCATAAATAATTTGATATTTTTTCATTTATAATAATTTTTTTAAAAATAATAAATTTCGAGAGAAGAATTAATTTTTTCAAAATTATTTTTCTAATTTAAAATAATAAAAAATTGTTGTAGGGAACCTTTAAACATATTTTTAAAACTCAAGTGGTAAAAAAAAATACACCTATTTTAAAAAACTCGAAGATAAAAAATATTTTTGAATTTTAGAGACCAAAACTCAAAAAAATAAATTTTTTCCTGAGAAAAGAAAAAGAATTGGTCGCTACGGTAAAATTAAAATATCTTTTTATTCCAAATCATGGACCGGCGGAGGCAGTGGTTCACTTCCGCTCCTTGGCTAGAGCAGACTGTTCCCTTCCTTTCCCGTTTAGGTCCGACGCCATGGACGAAGCCATTGAATCCAACGCCGAAATTCCACTTCCAACTAACTCAGGCGAAGGAGATATTGTAAATGGAGACGAACCAATGCCGGACGTTCTTCAGAACCCCACCTCCGCCGCCTCCGATTCTTCTTCCTCCGACTCTGATTCTGAAGACGACGAATCCGACCAAAAGCTCCAACTCCAGTCCCTTCAATCTCAGCTCTCATCCGACCCCTCCAACTACGATGCTCATGTCCAAGTGATATTCCTCTTTCTTTCTTTAATGCTTTTTTCTCATCATTTCTTCACCTCATTTCTGCATTTACTAACTTTACTTTGATTTTGATACCCTTTCCTGGTTCTCCTACTCTGTCT
TGCTCATCCCCATTACTTCAGTGTATAAAGCTTTTGAGGAAAATTGGTGACATTGACAACCTCAGGAAAGCAAGGGAAGCAATGAGTGAGATATTCCCATTGACCCCATCAATGTGGCAGGAATGGGCCAAGGACGAAGCATCTATCAATACTGGGTAATCTCGTCGATTTGTACCTTTTTAGATGTCCCGTTGTTTCCTCTCTTGGGGTGGGGTAAAAGTTTCTGAAAGTGTTTTATTTCTTTTACTATGCCGTTCTTAACATCTTCTGGGTTGGATTTGTTCCTTAGAGAACTGTAAATTTCGAACTACTAAGCTCTTGAAATTTTTATGTCTTGTGAAATTTTATGGAAAAAAAAAGGATATTGGATTTCTCTCAATACTTCTCTTGGTGGCAGCTGGCTGTTTAACACTGATTAATTTTGCTCTCTATGCTTTTGCAGACCCGAGGCTCTTGCTGCAATTGAGAGGCTATATGAGCGAGGGGTGTTTGATTATCTGGTTTGTATTGTCGTGTCATTTGGAGATTAACCCTTGGTTTTGCGAGGCGTGTTACATATTAGACCAAAGAATGTTAGCATATTACCAATTTTACATCTGCTGCAAGCCAATGGTTTCACCACATTTTCCCTTTTATTCTTACGTGTAACTTGAAAACCCATATATTAATCTATAAACTTCATCTATTGTCGATTTACCAATTGGATCTTGGACTTTTATGTCCAAACAGTTGTTATTGTTGTTGTTTTCTTATTTATTATTTTTTATTTTCAATGACTAATTACTTGCTTTGAAAATAAATTTGGTTGTCTATGAAACTCAGTTTTCTGCTCGAGTTTCTACGTTTAAATGCATAAAACAAGTCCTTTTCAAGTTTGAGAACCATGGTTTGCTTTGTAATTATCATTGTTTTTTTGAGGATGACAATATTCTGAACTATGAAGTTGTAGAGGGTGTCCCGATGTTTTCAGATTTTGTTTTGTTGATCATGATCACGTGAGATGGTTAACTTTTCACTATGTTATGCAGTCGGTTTCTCTTTGGTGTGACTACTTAAATTTTGTTCAAGAGTATGATCCGTTGGTGCAAGATTGTGCAACTTCTGGAATTAAAAAGGCTAGAGATTTATTTGAACGTGCTCTCACAGCTGCTGGTTTGCATTTCACCGAGGCTGAAAAATTATGGGAAGCATATAGAGATTTTGAGAAAGCCATATACCAAACCATTGCTGAAACCGATACTCAGGTCTGAACCCTCTTTCATAATTTCTTTTTATGTTTATGGTCATCTTATATGAGGTCTCATTTTTGATGTAACTTTGTTATATGGCTTGAAGCTAATATGGCTTAGAGCTATATATACGTGTCATATGCTACATTTTGTGGAACATATAATGTGCAGATTCGAATTCTTGAAATGCATTTAAATTTACTACCTAACAAAGATATTTCTGAAATATTGTGCTGAATGCAACATTTTGTGGAATATATAGTGTAAAGATTTGAATTCTCAAAATGCTTTTAAATTTTCTACAAAACAAAAGATATTTATGAATTATTTTGCTTAGCATACGCAAAGCAGAATGATTGGGCCATATTGAAGGATCATAACATAGCGGCCAAGCACAAAGTACCAAAAAGACAAATAAACCAAAGAAATACATGATACTCAGTGCTCACGAAGGGCAAAAATGGTAATTGCATAACCCTAAAGCAACTAGTATTTTAATGGAATATTATGGTTTAACTTCCCTCCGCAAGTTGACTGGGCAGTTGGGTACAATGAGCTTGATTTGGAAATTGGAAGGCACGTGCTTATTGAATATCCTTTCGCAAACAAAGCGTTAGTCAATTTCAATATGCTTAGTTCTTTGATGCATAATAGGGTTAGGGTTTGTAGTAAGATAGGTGAAATTGCCACGAAGAATTACTGGGGTGAACCAAGAAGGAACCAGTGCTCACGTCGAATGTCTTGAAGCTAGATTACCTTAACAGTTGCATT
GGCAAGGCCCTCGTTGAATTCATCACTAGATCGTGAAACAACCTTCTATTTGGTTTAGCCAATGACCGAGAGATATGATTATGGTTGAGGAAGGTATTCTTGCCATTGGTGCTACGACAATCTTTCGGACAACTTTTCCAATCTGCATCAATAAAGCAAACAATACCATGGTAAATAGTGCCCTTGAGGTATCGATGAATCCTTTCCACAGTAGCCGAATGTGAAGAAGTACGAGCATTTAAAAAATGACATACCTTGTTGCAACATACCAAATATTAGGGTGCGTGATGGTACAACACTGAAGAGCTTTGAAAATGTTTTGATATTTAGATGGGTTTTTCGGTGGTGTATAATCATATAAAGGTTGGCTTGTAACAACAGGAGTTGGTAGTTGCTTGTTGTCCCACATATCAAAGATTCAAAACGCTGAAGTAACTCTAGGATAGATATACTTGGAGAAAGATTTTTTTTTTGGGGTAAGCAATTGAATTGAAAAAAATTAAAGAATACAAGAATTAAAAAACTGGCTCCCAAAACACAGGAGCCTCGCCCTCAACCTCTGCGACCTAGGACAACCTTACTTTTTTATGCCCAACTAATAACCCACCACTTAGAACGATGTTTCACATACATATACGCAAGTCAGAGACAGAGGGAGAGAATACAAGCACAAGTTGGTGGCAATTTCGCCTAATCACTTTTATTGCTAAGTAACAAATCGATGTTTCACTTTTATTGGTCTCAAGGATTTCTCCTATGGCAAGGACTGCATTTGGACAAGTCCCAAGGTAATTCTTTTATATGCATGGATGAGGAGTTGGTTGGAGAGGGTGGACGAGTGGGCTAGTGGGGATGGACGACCACCTATGTTCCTTTCATCTAGGAGTCCACTTTCCCGTGATCTAGTAACCACTCGATACCTTAGGCACTAAGGTACTACCCATGTTCTGGATAACTCCCTCGAGCAACTACCTACTTTCACTACCAGTACTTTGAGTGACCATCCACATTTCAGGCAACTACTCTACTCTCCACTATTTTAGGTAACTACCCACATTTCAGGCAATTACCCATTCATCCAGTATTTTAGCCACACAAGTGCTCTTGGTGTAGAGAAGTGCAAATGAGGCGAGTCGCACAACAGGCTGATTGGAGGTTATGGTCTAACAAAAGTTATATAGCTGTTCTCCTTGGAGTTCGACTTTTTGGGACCGTGTTGTGGAGAAAATGCAAAAGAGTATTTCTGCCTAGAAGAGGAGTTTCTTTTTGAAAGAAGAAATGGTTACCCTGATTCAGTTGATTATGAGTGGAATCCTTGTGTAATCTCCTTATTCAAGATCCTTCAATCAGAAAATCTGTAGAGTTGGGCATGAGGAACTTCTTTGGGAGGAGGGGGTTGGGAAGGTAAAGGGATTCAACTCTTGAATTGGAATGTGGAGTCAAAGCCAATAAAGCTTGGGGCTAAGTGTTGTTAATCTTTGAATGCACAACAATTCCCTTTTGATTAGGTGGCTGTGGCGTTTCCCCCAAGATTCTAATTCCCTTTGTTTCATGGGAATCCGATTCCCAAAGAATGAGCATCTAGATTATCTCGTTTATTTTGTGTTAATGCAAGACGCCCATACAGAGAGCATCTCAAGAATTTTGGATACTTTCTTGAAACATGAGTTTTCGAGATCCCCATGTTGAGGGCATCATCTTTTTACCTTTTTACCCTTCCTTTTTTTCCCCTCATGTATGTATATAAATATCACTTTTCTTTATTTGCATTTTATAACTAAAATCATACTGTCAACCATATATTTCATTCTTCTTTTATATTGTAGTTGTTTTTTTTAAAGATATTCAGATTTAAATTCAAATTAATATTTGTTTTATAAAAAATCTTTAATTAATATAGATAAATTTGTTGAGAAAAATAACATATAAAAATAATACAAACTATATTAATTTAGAAAGAAATTCAAATTAATAAAAACAAATGTCAATATGTTATATAA
ATTCAAATTATATTAAATTAATCATAAATTAATGAATAGCCGTTAGGGCATTTTTGGAACACTGTAAATTTTTGACATTTCTGAGTAGAAATCCTATGAGACAAACACAGTTATCAAAGTATTTCTAGTTATTTGAAAAAATATTCCTACAACACACATGGTTATCTAAAGATGTTAGATATTTGTGATTCCAAGCATATATAATTTCGGGGAACTAAAGATTTCGGTTATCTACATTAACATTCCCAAAAAACAAACTTAAAGAGTATAATTACAAAAGGGCTTGGTCACAAAATCTCAAAGAGAAACAAAGAGAAATGTAGAATCGCATAAGGGATCAACATCTCTCAATGACCTCTCTCGACCCCTTCATATCATTTCTCTCAGTCGCTGTTTCTTTGTATTTCTTTCTCTTTCTCCTTTTCTACTCCTTTTGGAGTTTTTATCTTTTGAGCATTAGTTTATCTTTTCCTCTCTTCAACTGTTGAACAAAAATAGAAGTAATAACAACCAAATGAAGGCAAAGGAGCACATTTTTTGTTGGTTCATTCCTTGAATACATTAATGGTAGAAGGTCATGTTTGGTAAGTTTAAGATCGTTTTCATAAATATTTTTCTCTGATGTGGATGACCAATTAAAATGATTTTTTAACTTGGATAGAAGTTATACTCACAGTCAGAGCTGCATATTTTTCTGTAGGCAAAGGAAAAACAAGTCCAGCTAATCCGGAGCATTTTCCATCGCCAGTTGTCACTTCCTCTATCTAACATGAATTCAACACTTGAGGCCTATAAAACATGGGAGATGGAAATGAAACAAGGATGCGTTCTTGATACAGAATCTAATTATTCTGATGGGGTTCCCTCTCAAGTGGCCTCAGCTTATCGAAGAGCATTGGATATGTATAATGCCCGTGTTCAATTGGAAGAACAAATATCAAAGCAGGATTTAACTGATACTGAAAGACTTCACCAGTATATCGTATGTTGCACTTACAAGTTATTATGCCTACCAGATTTCTTTGGAGTTGATTCATGATTCATCCTAAAGCTTCTCTTTGCCAGTTTTGAAATACAACTTCTTAATTTTTATTTTTCCTCCAGATTTATCTGAAGTTTGAACAATCTGCTGGAGATCCAGCAAGGGTCCAAGTTCTGTTTGAGCGTGCGATTGCTGATTTTCCAGTATCGGTTGATCTTTGGCTAGATTATACCCGTTACATGGACAAAACTTTGAAGGTTTCTTCTTCCAGTTTCCTTTGAACTATGGATATGTCTTAATATATACTTATACGTGAGGTTTGGGTTTTCAGGTGGGCAATATTGTGAGGAACGTTTACTCCAGGGCAACTAGGAATTGTCCATGGATAGGTGATCTTTGGGTTCAGTATTTGCTTGCTTTGGAACGTGCTCGTGCTTCCGAAGGTGAAATTGCTTCAGTAAGCATTGGCCAACACACCCTTTCTCTGGAGCTGTTAAATTAAGTACAGTTTGGTCTGACTTCATTTGTCATGATGATTATTTTACATTTTGTCCCAAGCAAATCCTCCCTATTTGGTCTTGGCTAAAATTAACTTACTTAATGCTGTTGAGATGGTTTTTACTGCTGAAGTTCCAGAAGTTTGCCTTCCTTTTATTGGAAAAGTAATTGAGAGCAATGAAAAAGTTGAAGTAAATAAATGAAAATTAAGTGCTTTATTTGAGAAGTAATTTTATCTTGTTTGGCTTTTGCCTTTTGTTTTGGCTCCTCTTTGTGTGATCCGACTTGTTTTCCATCCTAAACGAAATTGTAGGGATCAGGTGGATAGCTTTTCTTTTTTTTTTTTTTTTTTTTTTGATTTGTTAAAGAAAAACTAGTATAAATTTGGAATAAGCTATAATTACAAAAGAATTTAGAAGTGAAGGACCATGTAGAAGCAAAAAGAATAACCAAACTCCAAAAAGAGTCACTATTAGTTTCTTGATCGCGAAAGGTTCTAGAGTTTCTTTCAAACCACAAC
CTCTAAAGGAGGGTGAAGATCCCATTTGACCAAAGGATTTTTGCCTGTGCATTTAATTGAGACCTTCAACTAACTGTAAAAGGGCATTTGATGCTTTCTTGGGCCTTACCGTTGGAGTTTGTACCTCTTTTTTGTATTTATTTATTTCTATTTTCTGTGTGTGAACAGATCGTATAATAAATTTTACTTCTACACGCTATTTGAGATCAGAATTATTTTTCTTGATGCAATGCACTGTACAGGTCTTTGAGAAGTCTTTGCAGTGCTCATTCTCAACCTTGGATGAGGTATGTAGTTTTATTTTTCTGGATATATGCCTAATTATTCTTGTTCTCCTAATGCAATCTATTCTGGTTCCTATATAGTAGGTCTGCAGTACTCCCCGTAGAGAACGGGATTTTAGTTTTAGGAGAATGATCATATATTATTTTTTTTTGAAACAGAAACAAGACCTTTCATTGAAATAATGAATGAGTCTAATGCTCAATGTACGTAAGATGAAACAAAAATGGAATTCAAACTTGCCAATAAAGGCTACAGCCACAAGACTATGTACAGAAACAAGACTTTTCATTGAAATAATGAATGAGTCTAATGCTCAAAGTATGTAAGATGAAACAAAAATGGAATTTAAACTTGCCAATGAAGGCTACAGCCGCAGGACTAAGTAAAGATCTAAATAATATAAGAGGGAATAAAAACATTCCAGATTATACAAATATCTTGGATGGAGAAAACAGCCTGAATCAAATGAAGACAACTGACTTTTTGTGGTAGGACAGTGAACCCTTCGACTGAAGTTGATTGAAGAATCTGATAGAGAGATGATTGAAGTTTTGAAACTCTGATAAGAATGGATCAAATCTGAAGGGCCCAACTGCTTTGAGGATGATCCTGATGACTGATTATTAGCACAACTTATTTCAAATGCAACATCGTTGATCTTTTGTAGTTTAGAGAAGACGGGGGAGGCTTTTGTCTTGGGTAATGAAATGGATGTGGTGGAGAAAGAGAACCAACAGGTTCTTTTCTTTTCTTCACATCAAAGCTGAAGTCTTCTTTACTCCTTCCAAACAAAGCTTTAAAACTGCCTTTCTACCTTCCAAATTAAGTGATGAACTATCATTCTTTTTGAGTAATGAGGGAGAGTTGTGATTTCCAATTGCTGTCTGATGGAATTCCTCAATTATATTATCATTGTCTTTTATTTGATAAGAGATGATTGCATTGAACAGATTGGATAGAATAAAAAGATATGCCAACTTCTATCATTGTTCTACTTGGCATTTTCATAAAAGTTATGCATGAGACAAATTCACAAATGAGAACTGAAATCTTAAACTATCATTCCAGTTTGATGATATTGCATATTTTACCTTGTTGTAGTACTTGGATTTGTTTCTTACACGAATTGATGGCTTGAGGCGAAGAATCTCTTCTGCAGTTCAGCTAGAGAATGTATTGGAATATTCGTTGATTAAAGAGACCTTTCAGGTTTTTTTATATGTTTTTCCATGTTCTTGGATTCCATCTGCTGTGTAAACTGTTGATTACCAATCTGCTTATATTGAATGTCTCCATCACACATATCAACTGAGCTGTTAACATTGTTTCTTCTGTTCCTTCTTATTCAGTTGGACATTCACATGAACTAATACTTAATGGTTTTATTTGTAGCGTGCATCAGATTACTTATCACCGCACCTAAAGAATTCAGAAATTTTGTTGCGTTTGTATGCTTATTGGGCCCGTTTAGAGATAAACTTGGGGAAAGATCTAGATGCTGCCCGTGGAGTTTGGGAAAGTTTGCTTAAGATCTGGTTTTCCTCATTGCCTCTTTATGATTATCACTTTCTTGCTTCCATGTTACCGCATTATGAATTACTTTTCTTCTTTGTTTACCTTTTCTGGGAGTCATTTGTTTTTCTGTTGCTGCTCCGTTGGCAGTGGCTCATTATTGGCGGCATGGGAGGGTTATATAGCAATGGAAATTGAGTTGA
ACCATATAAATAATGCCAGATCCATATATAAGAGATGTTACAGCAAAAGATTTCCAGGGAGTGGTTCAGAGGTAAAATTACTTTGAGTTGGTGACATAATCTCCATCGACATTCTTTTTCCTTTTACTAGTTTTTGTTGCAAGTTTTATACCTGCCGTCAATTGGATACCTACTAATACCTCTGGGGTTAACCTCTAAAAACAATGGCGTATTTAGTATTTTTAACATGTCTGAATTGCTCCTCTTTAAATTTTTATTATAAAAATGTCAATAATACAATTTTTTCTTTTTCTTTTTTTTTTTTCAAAACAAGAAACAAACTGTTCATTGATAGATGAAAAGATTAGAAAAATGCACTTCTAATGAGGAACAAGAACACAATAATATTATTCCAATATGATAACTTTTTTATTTTCCTGGTAAAATGTAGAAAGCTTCTTTTGGAATCAGTCCCTTACACATTATCTAATACATTTTGGATTAGATGACACCTTCAATTTCTAACCGGAAAAAAAATTGTTCAGGATATTTGCCATTCTTGGTTGCGTTTTGAGAGGGAATATGGTAGTCTCGAAGATTTTGATCATGCTGCTCGAAAGGTACAGTTCTTTAGTAATTTTCCAGCTATGTTGATATTTGTTTCATGGAAATTAAGTGCATGCATAGCTGATCAAAACAAAAAGTTCAGTTTCTCATTGGGAGAAATTTTTTAATTTTGGATTTCAGTACGCCATGTTCTTTGGGTAGAATCAATTTATCTTTTTGTTGATATGGAAAGAGTATTCAGTGTACTCCGTACTTTTGGCAAATTAGAGATCATTTATTCAGAATAGAGATACATATATGGCAAGAAACCCTAATCTAGGAAATGTACAATTACAATAAAGGACGCCTATACATAATAATATAAATACATATTATAACATTCACCCTCAAGCTGGAGCAAATATGTCAATCATGTCCAGCTTGTTGCATTGATAGTTTATTCTTGCTTCGTTTACAACTTTGGTAAGGATATCTCCCAATTGTTTTCCAATCTTCACATATCTTGCAGACACCCATCCTTGTATTTTCGCGTGAATAAAATGACAATCCACTTCAATAATAGGTAATCATTAGGGTTGTTTTCAAATATAGAAAATGAGCCAAACTATTTACAAATATAGAAAAATTTCATTGTCTATCAGCGATAGACAACAAATATACGTCTATCGCTTGAGCTATAGATTGCGGGATAGAAGTTTATCGCGATCGCTGATAGATAGTGAAATTTTTCTATATTTGTAAATAGTTTGATATTTTTTTCTATTTATAATAGTTTTCCTAATCATTACCTATAGACAATTCTCCCACTCCTTCATTTATCTTGACTTTTCATCATCTTTTTATTATTTTTATTTTTTGCTTTCAAGATTTCCTTTATGGAGTCTATCATAGCTACTGATTGGTAACTCTATGACAACATTGTCACTCATCTTTTCTTCTATTATCTTCTCAATGACAACAAATTTCCCTTCTGAAAACCGATAAAAACTGGCCAACTATCCGTGTGGTTGGACGAATAGTGTCTCCACTTGGAAACCAATGTGTCGGTGGCTGGCGGCGATTTTTGCCCAAACTCGATGATCACCCCTACCAATCAACATTTGAAAAACATTCAACCCTTGTATGACCATGATCTTTATATAAGATCCCACACCTAGGTGCAACTTTTAGATAACATAGAATTTGTTCTACTATAATTCAATAATCCACTGTAGGAGAAGAGATAAATTGACTATGGTTGTGTCACTATTAAATAGTTCAACTTTCCAACTAATCTTTTATATCTCTCAGGATCTTTATATAATTCTCCTTCTTTAGCAAGTTGCATATTTGGTATCATTGGAGTACTACATGATTTAACTCCTAGTTTTCCTGTCTCAAACAACAAATCAAGTACATGTTTTCATTGTGACAAATAAATACCTTTCTTGCTTCTCATTACTTTAATACCCAAA
AGATACTTCAATTGTCCCAAATCTTTAGTATGAAATGAACCCTGAAGGATGTCTTGAGAGAAGATATACTTGATACATCATTTTCAGTGATAACGATATCATCAACATATACAACAAGCAAAGTAATACCATTGTCATCTTGCAATAGAAAATAGAATGATCATATGTACTTTTTTTTTTTTTTTATATTGAAACGCTCGAGTGCTTGACTAAACTTACCAAACCATGCACATGGACTCTGTTTCAGTCCATACAAAGGTTTTCCAAAGGCGATATACCTTATCACTCTTCCCCTGAGCAACAAACCCAGATGATTGCTCCACATAAACTTCCTCCTGAAGATCATAGCATTGTTTATCACTTTATGTCAAGTTGATGCAAAGGCCAATTGTGAGTAGCAACCATGGAAAGAATTAGTCGGATGGATTTTAATAATATAAACGTTGAATTTCCTTTTGGTATAAAAATGATATATATCTTAAGAAATGTATATTTTAATAAATGTGTATTTGTCGTGTCATGTCCTAGATTTTAAAAAATGACGTGTCGTTGTGTCCCTGGCTTGTCGTATCCATGTCTGAATTTGTATCTATGCTTCTTAGGACATGATTTATATTTCTAATTGTGATGCCACCACCTTATATAACTTGCAAAAGTTTGAAGACATCTTTAGTATCGTTTTTACTTTACAGTCCATTGTTCATGACGCTAAAATCAAGTTTCAAATCAACTTTATTTCTGTCTTATTAATATTAGAAGTCAGTTCTAGAAATTTGAATGCTCTCCCTGAGTTGAGCAATTGAACATGAAAATAGGATTGATTTCTTAGCAATGAGCAGTAGTTTTTTGTATGTCAAGACAGAAGATCAAAAGATTCATTTTGTAGAAATTTCTATAACTATTATTCTAGCTGCAAAAGACCTTAGCCTATATTTATCATTAATTTTTGAAACTACCAATTCTGTAGTTTGTTTCTTTAATGAACTACTAATACATTCTAGTGGATGATTTCTTTATCTTAGGTTAACCCTCGTCTGGAAGAGTTAAAGTCCTATAAGTTACAGATGGATGAGTCAGAAAATCCTGTAAAGCAGACTGATCATGGTAAGCGAAAATTGGGTGGAGATGCACCCAATGTAGAGTCTCCTGCTAAGAAACTAAAAGATTCTGCTCATGGGCCCAAGAAAGTTACTGAAAAGGGTAAAGCGCAATTACAAAACATAGATGATCAAACAGGAGACATCAAAGAAAGAGTTAAAAAACCTGATGACACAAGTGACCAACAAATGAAAGATTCCGTTCAAGAAAAAGGGAAAGTATATAATGACCAATGCACTGCATTCATTTCCAACCTCAACCTCAAGGCAAGTTATTTCTGAAGTGCCTTCTATTTTTAACTCACTCTTCTTCATTTTCGTGGATGAGCACATGGTTTATGCTGCTTCATTACAACTCTGGCAGGTTACCTATGAACACTTGCGGGACTTCTTCCAAGATGTTGGTGGAGTTGTTGCAATTCGAATATTGCATGATAAGTTTACTGGAAAATCACGGGTCCACTCTCTTCTTTTCACCCCATATTTTCTTTCATTTATTGTCATTGTATATATATGTAACAAGAAATTACTGTCTTCTAATTTTACTTGATGACAAGTTCTTACTAGTTACATACTGATACTGCACTTATCTTACTTGAGATTCAGCCCTGGATTCAAAAGGAAAAAAAGAAAAAAAAATACTAGTTACATACTGCGATATTTTCGGGAATTCTTAATGCCTACTTATAATTTTGATAAGAAATTGTCGAGAATTGGTAGTCTGTATGCTTGTATGCTTGAAAGGACAAATGTCATTGTGAATTCCCAAAATGCTTGTCTAAATTAGGATCCTTATATCTTTCCTTTGTAATTTGCATTCTTTTGTTATCTTTTGTATTTTTTTGAGCTTTGTCCCTTAAAAAAAAAAAAAAAAAGAAAAAAAAAAATCCCAAAATGCTTGTA
CATGGGCAGCAACGTTGAAAGTTAGATAGATGGGTTCAGTTTGGTGGTGTTTTTGATTTCTTAGTTTCCTCTTGGCCTTCTTTGAGTCATATTGTGAACTTTCTTGAAGCAGTAAATAAATTGAAGCATCCCTTGGACTTTAATTTAGGTCATTGTTTTTCTCGTGGATCCAATCCCAACAATCTCATTTCTCAACATTCAGTGGGTTTTCTTTGATTCTTTTAAGGAAATCATCTTACATCTACTGATTTGTCCAGCCAGTCAAAAGTCTTACTGTCTAACACAGTGAAGGTTTTGTTATCTGAGTTGTAGTTGGAATGAAATGATAGGGTTTCTCATAGTAAACATCTTTCTTGGGATGATTGGTTTGGCTCTTCTCTTCTGAAGCCCTCTTCATGGTGTTCTTTCTAAAAGCATTGCAAGATGTACCGTTCAGGATATTTGTCTCAATTTTTTTTTGAAAGAAAACAAGACTAATAAATGAAAAGAGGCCAATGCTTAAAATACAAGAAAACGATACGAAACAAAAACACTTTTATGTTGCTTTCATAATCCTGTTACGTTATATTGTTTTATTTCACTTTGTATTATGTTTTTGTATCATTTTTTTGGGTTTTTTGTATTTTTGAGCATTAGTCTCTTTTCATCCTATTAATGAATTGTGGTTGTTTCCTAAAAAAAAAATCTTCAGAATCCAATAACTCAAAATTGCCTCAAACTCCACAAAAGAATAAGAATAAGAACTAAGTGACTCTTCATCTTCCAAACAGATCGAACTCCTTATTGCATCAAGCAGATTGGGTTAAGACCACACATTAGGATAGCCACAAAAGGTTGATTCCTATTAATTTTAAGACCATGATACTAAAAATTTGGGAGCCACTCTTTATATTGATCACAATATTCCGAAGGTCTTGGAATACAATCTTACAGAACTGATATTCTCACAAACCTTACCCTTCCAAACTTCTCCTACTAATCTCCCTACTTATTGTATTTCACTATTTGAACAGCCTCCAAAGGTTACAATGCGAATAGAAACTTTATTCCGAATCATATGAGGAAATCCATCAGAGGGATACCACAAACACCTTGTCAAATGGGAAAGGATTATTCATCCTATTGATAAAAGTGGTTTGGGCATTCGTGCCATTCAAGACAAAAGCCAGGTTCTCGCAAAATGGGTGTGGAGATATCAAAAGGGGAAAAAAACCGTTTGGATGGATTGTCGATGCCAATTTAGTACCAAACCTCTCAACAACCATCCGGGCAGCTTCTCCCAACACTCCTCTAAAAGGCCATGGAAGTCCATTTTCATCACCAAACCTTAATTCGCATTCTGTTTTGACCAAACTTGGCAATGGTTTTTCTACTGATTTTGGCACGATAGATGGGTGGGCGACCATCCTTTTTATATCAACATTTCCTAATTTTTTGTTAGGTACTTAAGTACCTTGGAGAATCGCACACCAAAAACCAACTATTAAGGTGGAAGAGCCAAGCCACTTAAGTACCATATTGGTCATCCCATTCTAATCAATGTGGGACAAAGGTAACCCATACTACTTTGGTTCCTGACAATACCTCATTCCTGGAAAGCCGACGACCCCGCGGCTACTCCGACGGTATTTAGTTCGGACACACTTGGTCAGATCCTTCCACCAATTCCACTCCAGAACGTCAACAACCGACTTGATACCGTTGTTAGGTACTTAAGCACCTTGGAGAACCACACCCCAAAATCCAAGTATTGAGGTGGGAGAGCCAATCTACTTAAATACCACATTAGCCATTCCATTCTAACCGATGTGGGACAAAGGTAACCCATACTACCTTGGTTCCTAACATTTTTCTCCTTATCCAACAAAAATCATTCTTCTATCAAATATGTGTGTAGTGTTGTTGAGGCGACCCCGGGCGCGCGCCTAAGACAAGAGGCGAGGCGCTCAAGGGCCTTTGCACCTCTTGTAATGCCTAGGCGAGGCTCTTCAAAGAGGCGG
TCGCCTTTTGGCGCCTCGGTGCGCCTTCACCTTTGTGATAGGCGAGTACCTAATTAAATTTTATTTAGTTGAGGTAGTCGATTCTTCATAGCGAAACCAATGGTACCATCAGTAAAAAATATTTGATCCACATCCCACATTAAGACTTTCAATTACTTAACCTCAAGAATCGACGGCCTCCTTCAAGAATTCGGCGGCTTCCTTCAACTTCCACCGCTTCAATTTCTTCTTCTTTTTTCTGTATTAGCTTCGATTTCTTCAAGTTCTTTTTGCAGCTGTTTCGAGTTCAGAGGGTGTGGCCTTGTGGGTTTCATTTTCCGTTTGATTCTGATTTTCTGATTTGTGTAAAGGTAATATAAATTTATTATATTCCTTTTTTTATATGCTGATTTCTTGAAGCTTCAGTAAAAAGTCAGGTCGGTTGGTGGTGTTAAGTGGGTCTAGGTGGATCAAGATATTCTTTTTAGTTTTTGCATATTATATATTTAGCGGATGGGTGAGAATTACATATTTTTCCTATTCACCTTATAATTTATATATTATTATTATTAATATATTATATATTATTATATTCATATCATATTCAGTGGGGGTGAGATGAGATATTTTTGAACTTACATTAATTTTTTTCTCTTTGTATGTTGCTTCTTCGGAAATCTAGCATCTAAATGATTCTTAGTTTCTATTATTTTATAATAGGATGGGGAAAAGAGAACAAAAATGCAAGTTTCTATGATATATTTATTCATTGCGCCTCGCTTCGCTCAGGCGTCGCCTTTTTGTCGCCTCTCACCTTGAGGCAATCAAGGGACTTGTCGTCTTGAGTTGTGCCTTGCACTTTGAAACACTGCACACTTTGAAAACACTGCATGTGCGAACACTAATTTTGCCTTCTGGGGTTTGCAAATTAGAAGAATTTTGAAGGAAGTCCTTGAATAGGCTGAACTGAGCTCCGTTTTATCCGGCATAAAATTGACATTCCAATCAGTAGTAGATGTAAGACGTCTTCCTTAAAAGAATTAAAAAAAACTCACTGGATGTTGAAAAATGAGAATAGCTTCCATCACCAATTGCTTGAATACGAGCAGTAAATGAGTGATCGTTAATGACATAGTGGGAAAATCTGATTTTGGTAGGACAATGAGTGATTACTTCCCCCTCCAATCTCCCTCATGAAAGGTGATCTTTCGAACAAGCCAATCGAGGACTTAACTATACAGTAGAAAAAACAAAAAGCAGCTTTAAATGTGACAGATGACATATGAATTCATAATAAGCAATCCTCACACATTAAGTATAGCTCATAGTAAACCTCTATGATGAATTATTGACGTGGATTATTTTGTTACTCGTCTTGCTCTGTTTAAGGTGAACTATTTTCTTACTTTTTGTTAGAATTGGAATTCCAGCACCCTTTTTCAGCTTTGCCTGCTAAATCTATTTATTAATTCTGGACTTGATTTTATGTATTCTGTTCTATTGTTAAGTGCATTCACATGTTCATGTACATTTGTAAGAGTATTTTGACCGAGCTGTGGCACCATTATTAGATTGTAAAATGATGGCATGATTTCTTGTATGACTTATGGATTTCTACCGTACTGATGGGTTGAATTTTCCTTATTTTTAGGGACTAGCATATGTAGATTTCTCTGATGATGCGCATCTTGATGCTGGAGTTGCAAAAAACAAGCAGTTGTTGCTTGGGAAAAGGATAAGCATTGCGCGGTCAGATCCTAAGAAAGGTATACCAAAGAAAAGTTGAAGATGAAAATTAGATAACATCCAATCGAATACATTTAGCGCAATATACCCCAAATGTGGTATGCTTGGTAAATTAGTTAAAATTGATGCCTTGGAATTACTAAAATAGGGCCGGAGTAGCCTGTGGAAGTGGGAGCACATTTTTCAAAAGAAGAGAAAAACACACAAGTTATTAGGAAGTATGCATTTTTACCACCATTTAGCTAATGGAAATGTTTTATCTAGTTTAACAGGAA
CATAATGGTCAATAGTTGAAACAGGGTATGCTCGTCAATATTTAATATGGGTTTCTGTTGTAGCCTAAACATTCCAAAGATCCTCAATGATAAATCTTAATTAACGAATGAAATGATAAGTCAATGCAAAGGTAGAATCTTTGTTTGTAGAGAATTAGAAGTAGAACTACTTACTAAAACTCCTTAACCAGCAAAATCTTCAACTAAAAAAGTAATTGCAATATTGTTCTTTAATACTCTACATGAGCTCCTTAAAAAATTAACATGGCCTTGGGTTTAAAATTAATGGAGTCAGAATGTTTTGATTGAAAAATTCTAAGGAAATGCTATTTTCAGCCCCATGTAATCTACCATAGAGTAAAATGGTAAAGGAAACAAAACTTTTCATTGAAGTGAAAAATTACAAAAGTTGGAAAAGATTATACAATTTCTGAATATCAGCAAACTTGAAAAGACGAAACTAAAAAGTTACTAATTTACACATCAACCTAAGCATATCTCAACTAGTTGAGACATATCATTGACTAAAAGGTCAGAAGTTTGAATCCTTATCCTTGATTGAACTAAAAAAGAAGCTATCATACACATTACATACAAGAACGGTCTAACTTAAAAGATCTACAAAAGCTTCTCTATAGTTCTTGTACGTTGTACCAACAAAATATTCAACTAAAAGGAAATTCCAGTATTGTTCTATAATACCCTACATCCATTTTGAAATCGTCAAAGGTCATGTTTAACGTCACTTTAACTTGAGTAAATTTGTGATGTAGCAACTTCAGTTCTTAAATTAGGCTTTGTCGATATGCTACAACCTACAGTGGATACTTCAGAACCGCTTAAAACTCAGTGAAGATATGAGAACCTTTCTACAAGTAAACCATGAGCATATTCATGGAGTAGAGCATTTGTGCTATTTTGTATTTTGTTGGTTGAAATTTTAGCTTCTGAAAATGGGGTCTTACAGCGGGCACATTATCGAGTCTTAGTTGCAATATGGCCCCTGCGCAAGGATGACACGCACAAATTGAGTTGCAATATTTCAGTTATTTGGATTGGTAGAGTATGCTCTGTCAGTGTCGTCAAACTAGAAAGTTGAATTGGACATACCTTTGGCAGATTTTATGTTCATTAGTCATAACTTTTCTACTTGGAAAAAAGATATATGTTTTTTTGTTCATTTTTACTCAAATAAGAAGTTTGCATGCATGGTTTCTGCTATGCATAGTGATTGATTACCCACTACATATATACATCTACCAGTCGCATGGTCAGAACTCATTTTTATGCCAGCTAGATTCTATTGCCACATTTGGGATTTGTCATCAGATTTAGAAAGTTACAGCCTTCTTCATGATTTAATCTGTATATACCGTTAACGGGAATCATAGGGAATGCCATGTAGATAACTTCAAAAATATTAGTATGTAGTTCTAAAATGTTGGCTCACAACTTAACGAAAAATTCTTGGGTCAATAGACACTGTTTTGTTGTCTCATTAGCTTTATTATTGTCATAATGTAATACCAAGTTCATCTCTCTGAATGAGACAAAAGCTAACAACTGTTCTATTATTTTCTCTTGTAGGAGGCCATTCTACAGATCGAGCTGGTGGAGGCAAGAGATTTGAGTCAAGAAGTTCTTCAAAGGAGCCTCAGAAAGCGAATGAGCAACCACGTGGGGTGAGGAAGCATGGAGGGAACAATGTTGAGCTCAAGGGAAAGAACACGTTTGCAGTGCCTAGAAATGTTAGAGCACTTGGTTGGACTGCAGATAAACCAAAAACCGTGGAACAAGATGATGAAAAGCCAAAAACCAACGATGAATTCAGGAAACTGTATTTTAAAGGCTGAAGCTTTCTTTTGTTGTAAAGATTATTATGGACTCAAGATTCTGATGTAGTTTAGAGTGTATCCCTAGACATACTATACCACAAGGGGATTTTTTTTCTTCTTTTTGTAATCTCTTACAAATAGATTTTTGTTTAGGTATATACTATG
GTATCCTGTAGTTTGGATACCTGAATTTACTCTGTTGTATATATAGTTCAGTTGAGATTTCAAGCCTATTTATAAAAGCGAGCAGTGAACTACATGATTGTGCTTTGA

mRNA sequence

ATGGAAAAAGAATTGGTCGCTACGGTAAAATTAAAATATCTTTTTATTCCAAATCATGGACCGGCGGAGGCAGTGGTTCACTTCCGCTCCTTGGCTAGAGCAGACTGTTCCCTTCCTTTCCCGTTTAGGTCCGACGCCATGGACGAAGCCATTGAATCCAACGCCGAAATTCCACTTCCAACTAACTCAGGCGAAGGAGATATTGTAAATGGAGACGAACCAATGCCGGACGTTCTTCAGAACCCCACCTCCGCCGCCTCCGATTCTTCTTCCTCCGACTCTGATTCTGAAGACGACGAATCCGACCAAAAGCTCCAACTCCAGTCCCTTCAATCTCAGCTCTCATCCGACCCCTCCAACTACGATGCTCATGTCCAATGTATAAAGCTTTTGAGGAAAATTGGTGACATTGACAACCTCAGGAAAGCAAGGGAAGCAATGAGTGAGATATTCCCATTGACCCCATCAATGTGGCAGGAATGGGCCAAGGACGAAGCATCTATCAATACTGGACCCGAGGCTCTTGCTGCAATTGAGAGGCTATATGAGCGAGGGGTGTTTGATTATCTGTCGGTTTCTCTTTGGTGTGACTACTTAAATTTTGTTCAAGAGTATGATCCGTTGGTGCAAGATTGTGCAACTTCTGGAATTAAAAAGGCTAGAGATTTATTTGAACGTGCTCTCACAGCTGCTGGTTTGCATTTCACCGAGGCTGAAAAATTATGGGAAGCATATAGAGATTTTGAGAAAGCCATATACCAAACCATTGCTGAAACCGATACTCAGGATTTCTCCTATGGCAAGGACTGCATTTGGACAAGTCCCAAGGCAAAGGAAAAACAAGTCCAGCTAATCCGGAGCATTTTCCATCGCCAGTTGTCACTTCCTCTATCTAACATGAATTCAACACTTGAGGCCTATAAAACATGGGAGATGGAAATGAAACAAGGATGCGTTCTTGATACAGAATCTAATTATTCTGATGGGGTTCCCTCTCAAGTGGCCTCAGCTTATCGAAGAGCATTGGATATGTATAATGCCCGTGTTCAATTGGAAGAACAAATATCAAAGCAGGATTTAACTGATACTGAAAGACTTCACCAGTATATCATTTATCTGAAGTTTGAACAATCTGCTGGAGATCCAGCAAGGGTCCAAGTTCTGTTTGAGCGTGCGATTGCTGATTTTCCAGTATCGGTTGATCTTTGGCTAGATTATACCCGTTACATGGACAAAACTTTGAAGGTGGGCAATATTGTGAGGAACGTTTACTCCAGGGCAACTAGGAATTGTCCATGGATAGGTGATCTTTGGGTTCAGTATTTGCTTGCTTTGGAACGTGCTCGTGCTTCCGAAGGTGAAATTGCTTCAGTCTTTGAGAAGTCTTTGCAGTGCTCATTCTCAACCTTGGATGAGACGGGGGAGGCTTTTGTCTTGGGTAATGAAATGGATGTGGTGGAGAAAGAGAACCAACAGTACTTGGATTTGTTTCTTACACGAATTGATGGCTTGAGGCGAAGAATCTCTTCTGCAGTTCAGCTAGAGAATGTATTGGAATATTCGTTGATTAAAGAGACCTTTCAGCGTGCATCAGATTACTTATCACCGCACCTAAAGAATTCAGAAATTTTGTTGCGTTTGTATGCTTATTGGGCCCGTTTAGAGATAAACTTGGGGAAAGATCTAGATGCTGCCCGTGGAGTTTGGGAAAGTTTGCTTAAGATCTGGTTTTCCTCATTGCCTCTTTATGATTATCACTTTCTTGCTTCCATTGGCTCATTATTGGCGGCATGGGAGGGTTATATAGCAATGGAAATTGAGTTGAACCATATAAATAATGCCAGATCCATATATAAGAGATGTTACAGCAAAAGATTTCCAGGGAGTGGTTCAGAGGATATTTGCCATTCTTGGTTGCGTTTTGAGAGGGAATATGGTAGTCTCGAAGATTTTGATCATGCTGCTCGAAAGGTTAACCCTCGTCTGGAAGAGTTAAAGTC
CTATAAGTTACAGATGGATGAGTCAGAAAATCCTGTAAAGCAGACTGATCATGGTAAGCGAAAATTGGGTGGAGATGCACCCAATGTAGAGTCTCCTGCTAAGAAACTAAAAGATTCTGCTCATGGGCCCAAGAAAGTTACTGAAAAGGGTAAAGCGCAATTACAAAACATAGATGATCAAACAGGAGACATCAAAGAAAGAGTTAAAAAACCTGATGACACAAGTGACCAACAAATGAAAGATTCCGTTCAAGAAAAAGGGAAAGTTACCTATGAACACTTGCGGGACTTCTTCCAAGATGTTGGTGGAGTTGTTGCAATTCGAATATTGCATGATAAGTTTACTGGAAAATCACGGGGACTAGCATATGTAGATTTCTCTGATGATGCGCATCTTGATGCTGGAGTTGCAAAAAACAAGCAGTTGTTGCTTGGGAAAAGGATAAGCATTGCGCGGTCAGATCCTAAGAAAGGAGGCCATTCTACAGATCGAGCTGGTGGAGGCAAGAGATTTGAGTCAAGAAGTTCTTCAAAGGAGCCTCAGAAAGCGAATGAGCAACCACGTGGGGTGAGGAAGCATGGAGGGAACAATGTTGAGCTCAAGGGAAAGAACACGTTTGCAGTGCCTAGAAATGTTAGAGCACTTGGTTGGACTGCAGATAAACCAAAAACCGTGGAACAAGATGATGAAAAGCCAAAAACCAACGATGAATTCAGGAAACTGTATTTTAAAGGCTGAAGCTTTCTTTTGTTGTAAAGATTATTATGGACTCAAGATTCTGATGTAGTTTAGAGTGTATCCCTAGACATACTATACCACAAGGGGATTTTTTTTCTTCTTTTTGTAATCTCTTACAAATAGATTTTTGTTTAGGTATATACTATGGTATCCTGTAGTTTGGATACCTGAATTTACTCTGTTGTATATATAGTTCAGTTGAGATTTCAAGCCTATTTATAAAAGCGAGCAGTGAACTACATGATTGTGCTTTGA

Coding sequence (CDS)

ATGGAAAAAGAATTGGTCGCTACGGTAAAATTAAAATATCTTTTTATTCCAAATCATGGACCGGCGGAGGCAGTGGTTCACTTCCGCTCCTTGGCTAGAGCAGACTGTTCCCTTCCTTTCCCGTTTAGGTCCGACGCCATGGACGAAGCCATTGAATCCAACGCCGAAATTCCACTTCCAACTAACTCAGGCGAAGGAGATATTGTAAATGGAGACGAACCAATGCCGGACGTTCTTCAGAACCCCACCTCCGCCGCCTCCGATTCTTCTTCCTCCGACTCTGATTCTGAAGACGACGAATCCGACCAAAAGCTCCAACTCCAGTCCCTTCAATCTCAGCTCTCATCCGACCCCTCCAACTACGATGCTCATGTCCAATGTATAAAGCTTTTGAGGAAAATTGGTGACATTGACAACCTCAGGAAAGCAAGGGAAGCAATGAGTGAGATATTCCCATTGACCCCATCAATGTGGCAGGAATGGGCCAAGGACGAAGCATCTATCAATACTGGACCCGAGGCTCTTGCTGCAATTGAGAGGCTATATGAGCGAGGGGTGTTTGATTATCTGTCGGTTTCTCTTTGGTGTGACTACTTAAATTTTGTTCAAGAGTATGATCCGTTGGTGCAAGATTGTGCAACTTCTGGAATTAAAAAGGCTAGAGATTTATTTGAACGTGCTCTCACAGCTGCTGGTTTGCATTTCACCGAGGCTGAAAAATTATGGGAAGCATATAGAGATTTTGAGAAAGCCATATACCAAACCATTGCTGAAACCGATACTCAGGATTTCTCCTATGGCAAGGACTGCATTTGGACAAGTCCCAAGGCAAAGGAAAAACAAGTCCAGCTAATCCGGAGCATTTTCCATCGCCAGTTGTCACTTCCTCTATCTAACATGAATTCAACACTTGAGGCCTATAAAACATGGGAGATGGAAATGAAACAAGGATGCGTTCTTGATACAGAATCTAATTATTCTGATGGGGTTCCCTCTCAAGTGGCCTCAGCTTATCGAAGAGCATTGGATATGTATAATGCCCGTGTTCAATTGGAAGAACAAATATCAAAGCAGGATTTAACTGATACTGAAAGACTTCACCAGTATATCATTTATCTGAAGTTTGAACAATCTGCTGGAGATCCAGCAAGGGTCCAAGTTCTGTTTGAGCGTGCGATTGCTGATTTTCCAGTATCGGTTGATCTTTGGCTAGATTATACCCGTTACATGGACAAAACTTTGAAGGTGGGCAATATTGTGAGGAACGTTTACTCCAGGGCAACTAGGAATTGTCCATGGATAGGTGATCTTTGGGTTCAGTATTTGCTTGCTTTGGAACGTGCTCGTGCTTCCGAAGGTGAAATTGCTTCAGTCTTTGAGAAGTCTTTGCAGTGCTCATTCTCAACCTTGGATGAGACGGGGGAGGCTTTTGTCTTGGGTAATGAAATGGATGTGGTGGAGAAAGAGAACCAACAGTACTTGGATTTGTTTCTTACACGAATTGATGGCTTGAGGCGAAGAATCTCTTCTGCAGTTCAGCTAGAGAATGTATTGGAATATTCGTTGATTAAAGAGACCTTTCAGCGTGCATCAGATTACTTATCACCGCACCTAAAGAATTCAGAAATTTTGTTGCGTTTGTATGCTTATTGGGCCCGTTTAGAGATAAACTTGGGGAAAGATCTAGATGCTGCCCGTGGAGTTTGGGAAAGTTTGCTTAAGATCTGGTTTTCCTCATTGCCTCTTTATGATTATCACTTTCTTGCTTCCATTGGCTCATTATTGGCGGCATGGGAGGGTTATATAGCAATGGAAATTGAGTTGAACCATATAAATAATGCCAGATCCATATATAAGAGATGTTACAGCAAAAGATTTCCAGGGAGTGGTTCAGAGGATATTTGCCATTCTTGGTTGCGTTTTGAGAGGGAATATGGTAGTCTCGAAGATTTTGATCATGCTGCTCGAAAGGTTAACCCTCGTCTGGAAGAGTTAAAGTC
CTATAAGTTACAGATGGATGAGTCAGAAAATCCTGTAAAGCAGACTGATCATGGTAAGCGAAAATTGGGTGGAGATGCACCCAATGTAGAGTCTCCTGCTAAGAAACTAAAAGATTCTGCTCATGGGCCCAAGAAAGTTACTGAAAAGGGTAAAGCGCAATTACAAAACATAGATGATCAAACAGGAGACATCAAAGAAAGAGTTAAAAAACCTGATGACACAAGTGACCAACAAATGAAAGATTCCGTTCAAGAAAAAGGGAAAGTTACCTATGAACACTTGCGGGACTTCTTCCAAGATGTTGGTGGAGTTGTTGCAATTCGAATATTGCATGATAAGTTTACTGGAAAATCACGGGGACTAGCATATGTAGATTTCTCTGATGATGCGCATCTTGATGCTGGAGTTGCAAAAAACAAGCAGTTGTTGCTTGGGAAAAGGATAAGCATTGCGCGGTCAGATCCTAAGAAAGGAGGCCATTCTACAGATCGAGCTGGTGGAGGCAAGAGATTTGAGTCAAGAAGTTCTTCAAAGGAGCCTCAGAAAGCGAATGAGCAACCACGTGGGGTGAGGAAGCATGGAGGGAACAATGTTGAGCTCAAGGGAAAGAACACGTTTGCAGTGCCTAGAAATGTTAGAGCACTTGGTTGGACTGCAGATAAACCAAAAACCGTGGAACAAGATGATGAAAAGCCAAAAACCAACGATGAATTCAGGAAACTGTATTTTAAAGGCTGA

Protein sequence

MEKELVATVKLKYLFIPNHGPAEAVVHFRSLARADCSLPFPFRSDAMDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQLQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYHFLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDSAHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGKVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLLLGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGKNTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG
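
The protein sequence is the conceptual translation of the coding sequence. The sketch below shows how the first and last codons of the CDS map onto the start and end of the protein. It assumes the standard nuclear genetic code, and the helper name `translate` is hypothetical rather than the database's own tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Unknown codons become 'X'; translation stops at the first stop codon
    when to_stop is True. Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last seven codons of the CDS listed above.
print(translate("ATGGAAAAAGAATTGGTCGCTACGGTA"))
print(translate("AAACTGTATTTTAAAGGCTGA"))
```

The two fragments translate to MEKELVATV and KLYFKG, matching the first nine and last six residues of the protein sequence; the final TGA codon is the stop.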
Homology
BLAST of Clc11G00120 vs. NCBI nr
Match: XP_038876547.1 (squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 [Benincasa hispida])

HSP 1 Score: 1479.5 bits (3829), Expect = 0.0e+00
Identity = 768/882 (87.07%), Positives = 793/882 (89.91%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE  ESNAEIP+ TN+ EG++VNGDEPMPD+ QNPTS ASDSSSSDSDSEDDESDQKLQ
Sbjct: 1   MDEPTESNAEIPVTTNADEGNMVNGDEPMPDLPQNPTS-ASDSSSSDSDSEDDESDQKLQ 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTP MWQEWAKDEA
Sbjct: 61  LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPLMWQEWAKDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SI+TGPEALAAIERLYERGVFDYLSVSLW DYLNFVQEYD LVQ+CATSG+KKARDLFER
Sbjct: 121 SIDTGPEALAAIERLYERGVFDYLSVSLWRDYLNFVQEYDSLVQNCATSGVKKARDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQ              AKEKQVQLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQ--------------AKEKQVQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNMNSTLEAYK WE+E+KQGCVLDTESNYSDGVPSQV+SAYRRALDMYN
Sbjct: 241 SIFHRQLSLPLSNMNSTLEAYKAWEVEVKQGCVLDTESNYSDGVPSQVSSAYRRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALERARASEGEIASVFEKSLQCS
Sbjct: 361 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERARASEGEIASVFEKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRIS A QLE+VLEYSLIKET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISFADQLEDVLEYSLIKET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQRASDYLSPHLKNSE+L RLYAYWARLEINLGKDLDAARGVWESLLKI           
Sbjct: 481 FQRASDYLSPHLKNSEVLFRLYAYWARLEINLGKDLDAARGVWESLLKI----------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                GSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPG+GSEDICHSWLRFERE+G
Sbjct: 541 ----CGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGNGSEDICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLED DHA RK+NPRLEELKSYKLQMDESENPVKQ DHGKRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDLDHAVRKINPRLEELKSYKLQMDESENPVKQNDHGKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKG------------ 766
           AHG KKVTEKGKAQ QN+DDQTGDI+ RVKKPDDTSDQQMKDS+QEKG            
Sbjct: 661 AHGLKKVTEKGKAQSQNVDDQTGDIRGRVKKPDDTSDQQMKDSIQEKGKVYNDQCTAFIS 720

Query: 767 ----KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 826
               KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL
Sbjct: 721 NLNLKVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 780

Query: 827 LGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGK 886
           LGK+ISIARSDPKKGGH+TDRAGGGKRFESRSS+K PQKA+EQP G RKHGGNN+ELKGK
Sbjct: 781 LGKKISIARSDPKKGGHTTDRAGGGKRFESRSSAKAPQKAHEQPPGARKHGGNNIELKGK 832

Query: 887 NTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           NTFAVPRNVRALGWTADKPKTVEQ DEKPKTNDEFRKLYFKG
Sbjct: 841 NTFAVPRNVRALGWTADKPKTVEQADEKPKTNDEFRKLYFKG 832

BLAST of Clc11G00120 vs. NCBI nr
Match: XP_008460582.1 (PREDICTED: squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 [Cucumis melo])

HSP 1 Score: 1429.5 bits (3699), Expect = 0.0e+00
Identity = 738/882 (83.67%), Positives = 777/882 (88.10%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE I SNAE PLPTNS +G+ VNGDEPMPD+ QNP    SDSSSSDSDSEDDES+Q LQ
Sbjct: 1   MDEPISSNAETPLPTNSDQGNSVNGDEPMPDLPQNP----SDSSSSDSDSEDDESNQNLQ 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           L SLQSQLSS+ S+YDAHVQ IKLLRK+GDIDNLRKAREAMSEIFPL+PSMWQEWAKDEA
Sbjct: 61  LHSLQSQLSSNASDYDAHVQYIKLLRKVGDIDNLRKAREAMSEIFPLSPSMWQEWAKDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SINTGPEALAAIERLYERGVFDYLSVSLW DYLNF++EYDPLVQDCATSGIKK RDLFER
Sbjct: 121 SINTGPEALAAIERLYERGVFDYLSVSLWLDYLNFIREYDPLVQDCATSGIKKVRDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETD Q              AKEKQ+QLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDAQ--------------AKEKQIQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNM+STLE YK WE+E+KQGC LDTESNYSDGVP+ VAS YRRALDMYN
Sbjct: 241 SIFHRQLSLPLSNMSSTLETYKAWELEVKQGCALDTESNYSDGVPTLVASTYRRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLE+QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEDQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           T Y+DKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALER+ ASEGEIASVFEKSLQCS
Sbjct: 361 TCYIDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERSHASEGEIASVFEKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRISS VQLE+ LEYSLIKET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISSGVQLEDALEYSLIKET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQRASDYLSP LKNSE+L+RLYAYWARLEINLGKDLD+ARGVWESLLKI           
Sbjct: 481 FQRASDYLSPQLKNSEVLVRLYAYWARLEINLGKDLDSARGVWESLLKI----------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                GSLLAAWEGYIAME+ELNHINNARSIYKRCYSKRF GSGSEDICHSWLRFERE+G
Sbjct: 541 ----CGSLLAAWEGYIAMEVELNHINNARSIYKRCYSKRFTGSGSEDICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLEDFDHA RKVNPRLEELKSYKLQ+D+SENPVKQ+DH KRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDFDHAVRKVNPRLEELKSYKLQIDDSENPVKQSDHSKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKG------------ 766
           AHGPKKVTEKGKAQLQN+DDQTGDI+ RVKKPDD+SDQQM DSVQ KG            
Sbjct: 661 AHGPKKVTEKGKAQLQNLDDQTGDIRGRVKKPDDSSDQQMNDSVQGKGKVYNDQCTAFIS 720

Query: 767 ----KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 826
               KVTY+HLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+AGVAKNKQLL
Sbjct: 721 NLNLKVTYDHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLEAGVAKNKQLL 780

Query: 827 LGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGK 886
           LGK+ISIARSDPK+GGH  DRAGGGKRFESRSSSKEP K NEQP G+RKHGGN+VELKGK
Sbjct: 781 LGKKISIARSDPKRGGHGMDRAGGGKRFESRSSSKEPHKVNEQPSGLRKHGGNSVELKGK 829

Query: 887 NTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           NTFAVPRNVRALGWTADKPKT+EQDDE PK+NDEFRKLYFKG
Sbjct: 841 NTFAVPRNVRALGWTADKPKTLEQDDENPKSNDEFRKLYFKG 829

BLAST of Clc11G00120 vs. NCBI nr
Match: XP_004142811.2 (squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 [Cucumis sativus] >KAE8648565.1 hypothetical protein Csa_009024 [Cucumis sativus])

HSP 1 Score: 1404.0 bits (3633), Expect = 0.0e+00
Identity = 726/882 (82.31%), Positives = 770/882 (87.30%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE I S AE PLPTNS +G  VNGDEPMPD+ QNP    +DSSSSDSDSEDDES+Q L 
Sbjct: 1   MDEPISSKAETPLPTNSDQGYSVNGDEPMPDLPQNP----ADSSSSDSDSEDDESNQNLH 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           LQSLQSQLSS+PS+YDAHVQ IK+LRK+GDIDNLRKAREAMSEIFPLTPSMWQEWA+DEA
Sbjct: 61  LQSLQSQLSSNPSDYDAHVQYIKILRKVGDIDNLRKAREAMSEIFPLTPSMWQEWAEDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SI+TGPEALAAIERLYERGVFDYLSVS W DYLNF++EYDPLVQDCATSGIKK RDLFER
Sbjct: 121 SISTGPEALAAIERLYERGVFDYLSVSFWLDYLNFIREYDPLVQDCATSGIKKVRDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRD EK+IYQTIAETD Q              AKEKQVQLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDLEKSIYQTIAETDAQ--------------AKEKQVQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNM+STLEAYK WEME+KQ C LDTESNYSDGVP+QVA+ Y+RALDMYN
Sbjct: 241 SIFHRQLSLPLSNMSSTLEAYKAWEMEVKQECALDTESNYSDGVPTQVATTYQRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLE+QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEDQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           T YMDKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALER+ ASEGEIASVF KSLQCS
Sbjct: 361 TCYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERSHASEGEIASVFGKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRISS VQLE+ LEYSLI+ET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISSGVQLEDALEYSLIRET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQRASDYLSPHLKNSE+L+RLYAYWARLEIN+GK+LD+ARGVWESLLKI           
Sbjct: 481 FQRASDYLSPHLKNSEVLVRLYAYWARLEINMGKNLDSARGVWESLLKI----------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                GSL AAWEGYIAME+ELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFERE+G
Sbjct: 541 ----CGSLSAAWEGYIAMEVELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLEDFDHA RKVNPRLEELKSYKLQ+D+SENPVKQ D  KRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDFDHAVRKVNPRLEELKSYKLQIDDSENPVKQNDRSKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKG------------ 766
           AHGPKKVTEKGKAQL+N+DDQTGDI+ RVKK DD SDQQM DS+QEKG            
Sbjct: 661 AHGPKKVTEKGKAQLENVDDQTGDIRGRVKKLDDISDQQMNDSIQEKGKVYNDQCTAFIS 720

Query: 767 ----KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 826
               KVTY+HLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+AGVAKNKQLL
Sbjct: 721 NLNLKVTYDHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLEAGVAKNKQLL 780

Query: 827 LGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGK 886
           LGK+ISIARSDPKKGGH TD+AG GKRFESR SSKE  K NEQP GVR+HGGN+V+LKGK
Sbjct: 781 LGKKISIARSDPKKGGHGTDKAGAGKRFESR-SSKESHKGNEQPSGVRRHGGNSVDLKGK 828

Query: 887 NTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           NTFAVPRNVRALGWT DKPKT+EQDDEKPKTNDEFRKLYFKG
Sbjct: 841 NTFAVPRNVRALGWTTDKPKTLEQDDEKPKTNDEFRKLYFKG 828

BLAST of Clc11G00120 vs. NCBI nr
Match: XP_011655427.1 (squamous cell carcinoma antigen recognized by T-cells 3 isoform X2 [Cucumis sativus])

HSP 1 Score: 1397.5 bits (3616), Expect = 0.0e+00
Identity = 725/882 (82.20%), Positives = 769/882 (87.19%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE I S AE PLPTNS +G  VNGDEPMPD+ QNP    +DSSSSDSDSEDDES+Q L 
Sbjct: 1   MDEPISSKAETPLPTNSDQGYSVNGDEPMPDLPQNP----ADSSSSDSDSEDDESNQNLH 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           LQSLQSQLSS+PS+YDAHVQ IK+LRK+GDIDNLRKAREAMSEIFPLTPSMWQEWA+DEA
Sbjct: 61  LQSLQSQLSSNPSDYDAHVQYIKILRKVGDIDNLRKAREAMSEIFPLTPSMWQEWAEDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SI+TGPEALAAIERLYERGVFDYLSVS W DYLNF++EYDPLVQDCATSGIKK RDLFER
Sbjct: 121 SISTGPEALAAIERLYERGVFDYLSVSFWLDYLNFIREYDPLVQDCATSGIKKVRDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRD EK+IYQTIAETD Q              AKEKQVQLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDLEKSIYQTIAETDAQ--------------AKEKQVQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNM+STLEAYK WEME+KQ C LDTESNYSDGVP+QVA+ Y+RALDMYN
Sbjct: 241 SIFHRQLSLPLSNMSSTLEAYKAWEMEVKQECALDTESNYSDGVPTQVATTYQRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLE+QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEDQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           T YMDKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALER+ ASEGEIASVF KSLQCS
Sbjct: 361 TCYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERSHASEGEIASVFGKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRISS VQLE+ LEYSLI+ET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISSGVQLEDALEYSLIRET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQRASDYLSPHLKNSE+L+RLYAYWARLEIN+GK+LD+ARGVWESLLKI           
Sbjct: 481 FQRASDYLSPHLKNSEVLVRLYAYWARLEINMGKNLDSARGVWESLLKI----------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                GSL AAWEGYIAME+ELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFERE+G
Sbjct: 541 ----CGSLSAAWEGYIAMEVELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLEDFDHA RKVNPRLEELKSYKLQ+D+SENPVKQ D  KRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDFDHAVRKVNPRLEELKSYKLQIDDSENPVKQNDRSKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKG------------ 766
           AHGPKKVTEKGKAQL+N+DDQTGDI+ RVKK DD SDQQM DS+QEKG            
Sbjct: 661 AHGPKKVTEKGKAQLENVDDQTGDIRGRVKKLDDISDQQMNDSIQEKGKVYNDQCTAFIS 720

Query: 767 ----KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 826
               KVTY+HLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+AGVAKNKQLL
Sbjct: 721 NLNLKVTYDHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLEAGVAKNKQLL 780

Query: 827 LGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGK 886
           LGK+ISIARSDPKK GH TD+AG GKRFESR SSKE  K NEQP GVR+HGGN+V+LKGK
Sbjct: 781 LGKKISIARSDPKK-GHGTDKAGAGKRFESR-SSKESHKGNEQPSGVRRHGGNSVDLKGK 827

Query: 887 NTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           NTFAVPRNVRALGWT DKPKT+EQDDEKPKTNDEFRKLYFKG
Sbjct: 841 NTFAVPRNVRALGWTTDKPKTLEQDDEKPKTNDEFRKLYFKG 827

BLAST of Clc11G00120 vs. NCBI nr
Match: KAG6596153.1 (Squamous cell carcinoma antigen recognized by T-cells 3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1377.1 bits (3563), Expect = 0.0e+00
Identity = 723/886 (81.60%), Positives = 771/886 (87.02%), Query Frame = 0

Query: 44  SDAMDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQ 103
           SDAMD     NA++PL TNS + ++VNGD+PMPD+ QNPTS ASDSSSSDSDSE DESDQ
Sbjct: 62  SDAMDAI---NAQLPLSTNSDDENLVNGDDPMPDLPQNPTSPASDSSSSDSDSEVDESDQ 121

Query: 104 KLQLQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAK 163
           KLQLQ+LQS+LSS+PSNYDAHVQ IKLLRK+GDIDNLRKAREAMSE+FPLTPSMWQEWAK
Sbjct: 122 KLQLQTLQSELSSNPSNYDAHVQYIKLLRKVGDIDNLRKAREAMSEMFPLTPSMWQEWAK 181

Query: 164 DEASINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDL 223
           DEASI+TGPE +AAIERLYERGVFDYLSVSLWCDYLNFVQEYDP+V+D ATS IKKARDL
Sbjct: 182 DEASISTGPEDVAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPMVRDRATSRIKKARDL 241

Query: 224 FERALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQ 283
           FERALTAAGLHFTEAEKLWEAYR+FEK+IYQ+I ETDT+              AKEKQVQ
Sbjct: 242 FERALTAAGLHFTEAEKLWEAYREFEKSIYQSIDETDTE--------------AKEKQVQ 301

Query: 284 LIRSIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALD 343
           LIRSIFHRQLSLPLSNMNSTLEAYK WEME+KQGC+LD +SN SDGVPSQVASAY+RALD
Sbjct: 302 LIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDGKSNDSDGVPSQVASAYQRALD 361

Query: 344 MYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLW 403
           MYNARVQ EEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERA+ADFPV VDLW
Sbjct: 362 MYNARVQFEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAVADFPVVVDLW 421

Query: 404 LDYTRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSL 463
           LDYTRYMDKTLKV NIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKS+
Sbjct: 422 LDYTRYMDKTLKVSNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSV 481

Query: 464 QCSFSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLI 523
            CSFSTL E                    YLDLFLTRIDGLRRRISSAV+LE+VL YSLI
Sbjct: 482 LCSFSTLYE--------------------YLDLFLTRIDGLRRRISSAVELEDVLGYSLI 541

Query: 524 KETFQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLY 583
           KETFQRASDYLSPHLKNSE L+RLYAYWARLEINLGKDL AARGVWESLLK         
Sbjct: 542 KETFQRASDYLSPHLKNSEDLVRLYAYWARLEINLGKDLVAARGVWESLLK--------- 601

Query: 584 DYHFLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFER 643
                 + GSLLAAWEGYIAMEIE NHINNARSIYKRCYSKRFPGSGSEDICHSWLRFER
Sbjct: 602 ------NSGSLLAAWEGYIAMEIESNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFER 661

Query: 644 EYGSLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKL 703
           E+GSLEDFDHAARKVNPRLEELKSYKLQMDESENP K ++HGKRKLGGDAP+VESPAK+L
Sbjct: 662 EFGSLEDFDHAARKVNPRLEELKSYKLQMDESENPAKPSEHGKRKLGGDAPDVESPAKRL 721

Query: 704 KDSAHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGKV------- 763
           KD+AHGPKKV EKGKAQLQ++D QTG+ K + KKPDDTS+QQMKD VQEKGKV       
Sbjct: 722 KDAAHGPKKVNEKGKAQLQSLDGQTGNSKAKAKKPDDTSNQQMKDFVQEKGKVYNDQCTA 781

Query: 764 ---------TYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNK 823
                    TYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+A VAKNK
Sbjct: 782 FVSNLNLKATYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLNAAVAKNK 841

Query: 824 QLLLGKRISIARSDPKK-GGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVE 883
           QLLLGK+ISIARSDPKK GGH+TDRA GGK  ES SS+K P+KANEQP   RK GGN VE
Sbjct: 842 QLLLGKKISIARSDPKKGGGHTTDRASGGKTSESVSSAK-PRKANEQPPAERKDGGNKVE 894

Query: 884 LKGKNTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           LKGKNTFAVPRNVRALGWTADKP+T EQ+DEKPKTNDEFRKLYFKG
Sbjct: 902 LKGKNTFAVPRNVRALGWTADKPRTAEQEDEKPKTNDEFRKLYFKG 894

BLAST of Clc11G00120 vs. ExPASy Swiss-Prot
Match: B3DJT0 (Squamous cell carcinoma antigen recognized by T-cells 3 OS=Danio rerio OX=7955 GN=sart3 PE=2 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 2.0e-76
Identity = 227/824 (27.55%), Positives = 364/824 (44.17%), Query Frame = 0

Query: 59  LPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQLQSLQSQLSSDP 118
           LP    E + +  +    D  +            + D+ +DE + + ++Q L+ QLS + 
Sbjct: 12  LPDIEEEAEGMEREMESEDDEEEGMGVEHSEEEDEEDTSEDERENEAEIQRLEEQLSINA 71

Query: 119 SNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAI 178
            +Y+ HV  IKLLR+ G +  LRKAR+ MSE+FPLT  +W +W KDE  I         +
Sbjct: 72  FDYNCHVDLIKLLRQEGKLHRLRKARQKMSELFPLTEEIWLDWLKDEIRITEDESDREKV 131

Query: 179 ERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEA 238
             L+ER + DY+   +W +Y+ +      +    A  GI++ R +FERALTA GLH T+ 
Sbjct: 132 YELFERAIKDYVCPEIWLEYVQY-----SIGGMGAQGGIERVRSIFERALTAVGLHMTKG 191

Query: 239 EKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLS 298
             +WEAYR+FE  I  T+          G        +    Q++ I ++F RQL++PL 
Sbjct: 192 ASIWEAYREFEIVILSTVQPPP------GTVPSQEQQELLSAQLERIHTLFRRQLAVPLM 251

Query: 299 NMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQ 358
           +M  T   Y  W                 DGVP  V   YRRAL         EE +   
Sbjct: 252 DMEGTYAEYSDWA---------------DDGVPETVTHQYRRALQQMEKGKPYEEALL-- 311

Query: 359 DLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGN 418
            +++  +L +Y  Y+ FE   GDPARVQ++FERA+A+  +  DLW+ YT Y+D+ LK+ +
Sbjct: 312 -VSEPPKLAEYQSYIDFEIKEGDPARVQIIFERALAENCLVPDLWIKYTTYLDRQLKIKD 371

Query: 419 IVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDETGEAFV 478
           +V + + RA RNCPW   LW  YLLALER  A    +  VFEK+L   F           
Sbjct: 372 LVLSAHERAVRNCPWTMGLWKSYLLALERHGADHQTVKDVFEKALNAGFI---------- 431

Query: 479 LGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHL 538
                     +   Y++++ + +D LRRR+  + +    L+   ++  F R+ +YL   +
Sbjct: 432 ----------QATDYVEIWQSYLDYLRRRVDFSKEWSRELDE--LRAAFSRSLEYLKQDV 491

Query: 539 -----KNSEILLRLYAYWARLEINLGKDLDAARGVWESLL---KIWFSSLPLYDYHFLAS 598
                ++ ++   L   WAR+E    K++  AR +W+S++      ++++ L  Y+   S
Sbjct: 492 EERFSESGDLSCTLMQIWARIEALHCKNMQKARELWDSIMTKGNAKYANMWLEYYNLERS 551

Query: 599 IGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLED 658
            G              +  H   A     +C S  +P    E +C   L FER  GSLED
Sbjct: 552 YG--------------DAAHCRKALHRAVQCTSD-YP----EHVCDVLLNFERVEGSLED 611

Query: 659 FDHAARKVNPRLEELKSYKLQMDESE----NPVKQTDHGKRKLGGD-------------- 718
           +D A +K   +L  +   + ++ E E       ++    +RK+  D              
Sbjct: 612 WDAAVQKTETKLNRVCEQRARVAEKEALHARQEEEKAEQRRKVKADKKAQKKGQKANRTG 671

Query: 719 -------------APNVESPAKKLKDSAHGPKKVTEK----------------GKAQLQN 778
                            E P+K+L+        VTE+                 K +   
Sbjct: 672 DKRKAEDDDEEEWGEEAELPSKRLRGEDDFDSTVTEELMETESGLFGRRAPPARKTEPPG 731

Query: 779 IDDQTGDIKERVKKPDDTSDQQMKD-SVQEKGKVTYE------HLRDFFQDVGGVVAIRI 819
                    E  ++P D   +Q KD +      +T+        LR  FQ  G +  +R 
Sbjct: 732 FRKNQQGAPEPQRQPHDMPKEQRKDENCVFVSNLTFNMEDPEGKLRTLFQGCGTIQQVRP 762

BLAST of Clc11G00120 vs. ExPASy Swiss-Prot
Match: Q5REG1 (Squamous cell carcinoma antigen recognized by T-cells 3 OS=Pongo abelii OX=9601 GN=SART3 PE=2 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 3.0e-72
Identity = 193/639 (30.20%), Positives = 316/639 (49.45%), Query Frame = 0

Query: 84  SAASDSSSSDSDSEDDESDQK--LQLQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLR 143
           +++++SS  + + E DE ++K  L+++ L+ QLS +  +Y+ HV  I+LLR  G++  +R
Sbjct: 73  ASSAESSPGEYEWEYDEEEEKNQLEIERLEEQLSINVYDYNCHVDLIRLLRLEGELTKVR 132

Query: 144 KAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLYERGVFDYLSVSLWCDYLNF 203
            AR+ MSEIFPLT  +W EW  DE S+         +  L+E+ V DY+  ++W +Y  +
Sbjct: 133 MARQKMSEIFPLTEELWLEWLHDEISMAQDGLDREHVYDLFEKAVKDYICPNIWLEYGQY 192

Query: 204 VQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDT 263
                 + Q     G++K R +FERAL++ GLH T+   LWEAYR+FE AI +  A    
Sbjct: 193 --SVGGIGQ---KGGLEKVRSVFERALSSVGLHMTKGLALWEAYREFESAIVE--AARPV 252

Query: 264 QDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLD 323
             F    D      +  + Q++ + S+F RQL++PL +M +T   Y+ W           
Sbjct: 253 AGFLSPFD----REQTFDSQLEKVHSLFRRQLAIPLYDMEATFAEYEEWS---------- 312

Query: 324 TESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGD 383
                 D +P  V   Y +AL         EE + +    +  RL +Y  Y+ FE   GD
Sbjct: 313 -----EDPIPESVIQNYNKALQQLEKYKPYEEALLQ---AEAPRLAEYQAYIDFEMKIGD 372

Query: 384 PARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQY 443
           PAR+Q++FERA+ +  +  DLW+ Y++Y+D+ LKV ++V +V++RA RNCPW   LW +Y
Sbjct: 373 PARIQLIFERALVENCLVPDLWIRYSQYLDRQLKVKDLVLSVHNRAIRNCPWTVALWSRY 432

Query: 444 LLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRI 503
           LLA+ER       I+  FEK+L   F                     +   Y++++   +
Sbjct: 433 LLAMERHGVDHQVISVTFEKALNAGFI--------------------QATDYVEIWQAYL 492

Query: 504 DGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHL-----KNSEILLRLYAYWARLEI 563
           D LRRR+    + ++  E   ++  F RA +YL   +     ++ +    +   WAR+E 
Sbjct: 493 DYLRRRVD--FKQDSSKELEELRAAFTRALEYLKQEVEERFNESGDPSCVIMQNWARIEA 552

Query: 564 NLGKDLDAARGVWESLL---KIWFSSLPLYDYHFLASIGSLLAAWEGYIAMEIELNHINN 623
            L  ++  AR +W+S++      ++++ L  Y+   + G              +  H   
Sbjct: 553 RLCNNMQKARELWDSIMTRGNAKYANMWLEYYNLERAHG--------------DTQHCRK 612

Query: 624 ARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVNPRLEELKSYKLQMD 683
           A     +C S  +P    E +C   L  ER  GSLED+D A +K   RL  +   +++  
Sbjct: 613 ALHRAVQCTSD-YP----EHVCEVLLTMERTEGSLEDWDIAVQKTETRLARVNEQRMKAA 640

Query: 684 ESENPVKQTDHGKRKLGGDAPNVESPAKKLKDSAHGPKK 713
           E E  + Q +  K +    A   E  A K K    GP+K
Sbjct: 673 EKEAALVQQEEEKAEQRKRA-RAEKKALKKKKKIRGPEK 640

BLAST of Clc11G00120 vs. ExPASy Swiss-Prot
Match: Q15020 (Squamous cell carcinoma antigen recognized by T-cells 3 OS=Homo sapiens OX=9606 GN=SART3 PE=1 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 7.4e-71
Identity = 189/639 (29.58%), Positives = 311/639 (48.67%), Query Frame = 0

Query: 84  SAASDSSSSDSDSEDDESDQK--LQLQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLR 143
           +++++SS  + + E DE ++K  L+++ L+ QLS +  +Y+ HV  I+LLR  G++  +R
Sbjct: 73  ASSAESSPGEYEWEYDEEEEKNQLEIERLEEQLSINVYDYNCHVDLIRLLRLEGELTKVR 132

Query: 144 KAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLYERGVFDYLSVSLWCDYLNF 203
            AR+ MSEIFPLT  +W EW  DE S+         +  L+E+ V DY+  ++W +Y  +
Sbjct: 133 MARQKMSEIFPLTEELWLEWLHDEISMAQDGLDREHVYDLFEKAVKDYICPNIWLEYGQY 192

Query: 204 VQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDT 263
                 + Q     G++K R +FERAL++ GLH T+   LWEAYR+FE AI +       
Sbjct: 193 --SVGGIGQ---KGGLEKVRSVFERALSSVGLHMTKGLALWEAYREFESAIVEA------ 252

Query: 264 QDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLD 323
                              +++ + S+F RQL++PL +M +T   Y+ W           
Sbjct: 253 ------------------ARLEKVHSLFRRQLAIPLYDMEATFAEYEEWS---------- 312

Query: 324 TESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGD 383
                 D +P  V   Y +AL         EE + +    +  RL +Y  Y+ FE   GD
Sbjct: 313 -----EDPIPESVIQNYNKALQQLEKYKPYEEALLQ---AEAPRLAEYQAYIDFEMKIGD 372

Query: 384 PARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQY 443
           PAR+Q++FERA+ +  +  DLW+ Y++Y+D+ LKV ++V +V++RA RNCPW   LW +Y
Sbjct: 373 PARIQLIFERALVENCLVPDLWIRYSQYLDRQLKVKDLVLSVHNRAIRNCPWTVALWSRY 432

Query: 444 LLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRI 503
           LLA+ER       I+  FEK+L   F                     +   Y++++   +
Sbjct: 433 LLAMERHGVDHQVISVTFEKALNAGFI--------------------QATDYVEIWQAYL 492

Query: 504 DGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHL-----KNSEILLRLYAYWARLEI 563
           D LRRR+    + ++  E   ++  F RA +YL   +     ++ +    +   WAR+E 
Sbjct: 493 DYLRRRVD--FKQDSSKELEELRAAFTRALEYLKQEVEERFNESGDPSCVIMQNWARIEA 552

Query: 564 NLGKDLDAARGVWESLL---KIWFSSLPLYDYHFLASIGSLLAAWEGYIAMEIELNHINN 623
            L  ++  AR +W+S++      ++++ L  Y+   + G              +  H   
Sbjct: 553 RLCNNMQKARELWDSIMTRGNAKYANMWLEYYNLERAHG--------------DTQHCRK 612

Query: 624 ARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVNPRLEELKSYKLQMD 683
           A     +C S  +P    E +C   L  ER  GSLED+D A +K   RL  +   +++  
Sbjct: 613 ALHRAVQCTSD-YP----EHVCEVLLTMERTEGSLEDWDIAVQKTETRLARVNEQRMKAA 622

Query: 684 ESENPVKQTDHGKRKLGGDAPNVESPAKKLKDSAHGPKK 713
           E E  + Q +  K +    A   E  A K K    GP+K
Sbjct: 673 EKEAALVQQEEEKAEQRKRA-RAEKKALKKKKKIRGPEK 622

BLAST of Clc11G00120 vs. ExPASy Swiss-Prot
Match: Q9JLI8 (Squamous cell carcinoma antigen recognized by T-cells 3 OS=Mus musculus OX=10090 GN=Sart3 PE=1 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 4.5e-68
Identity = 193/698 (27.65%), Positives = 335/698 (47.99%), Query Frame = 0

Query: 70  NGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQK--LQLQSLQSQLSSDPSNYDAHVQC 129
           +GDE      ++  +++++SS+ + + E DE ++K  L+++ L+ QLS +  +Y+ HV+ 
Sbjct: 66  DGDE------EDAMASSAESSAGEDEWEYDEEEEKNQLEIERLEEQLSINGYDYNCHVEL 125

Query: 130 IKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLYERGVF 189
           I+LLR  G++  +R AR+ MSE+FPLT  +W EW  DE S+         +  L+ER V 
Sbjct: 126 IRLLRLEGELSRVRAARQKMSELFPLTEELWLEWLHDEISMAMDGLDREHVYELFERAVK 185

Query: 190 DYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLWEAYRD 249
           DY+  ++W +Y  +      + Q     G++K R +FERAL++ GLH T+   +WEAYR+
Sbjct: 186 DYICPNIWLEYGQY--SVGGIGQ---KGGLEKVRSVFERALSSVGLHMTKGLAIWEAYRE 245

Query: 250 FEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLSNMNSTLEAY 309
           FE AI +                          +++ + S+F RQL++PL  M +T   Y
Sbjct: 246 FESAIVEA------------------------ARLEKVHSLFRRQLAIPLYEMEATFAEY 305

Query: 310 KTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTDTERLH 369
           + W  E                +P  V  +Y++AL         EE + +    +  RL 
Sbjct: 306 EEWSEE---------------PMPESVLQSYQKALGQLEKYKPYEEALLQ---AEAPRLA 365

Query: 370 QYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRNVYSRA 429
           +Y  Y+ FE   GDPAR+Q++FERA+ +  +  DLW+ Y++Y+D+ LKV ++V +V+SRA
Sbjct: 366 EYQAYIDFEMKIGDPARIQLIFERALVENCLVPDLWIRYSQYLDRQLKVKDLVLSVHSRA 425

Query: 430 TRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNEMDVVE 489
            RNCPW   LW +YLLA+ER       I++ FE +L   F                    
Sbjct: 426 VRNCPWTVALWSRYLLAMERHGLDHQTISATFENALSAGFI------------------- 485

Query: 490 KENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHL-----KNSE 549
            +   Y++++   +D LRRR+    + ++  E   ++  F RA +YL   +     ++ +
Sbjct: 486 -QATDYVEIWQVYLDYLRRRVD--FRQDSSKELEELRSMFTRALEYLQQEVEERFSESGD 545

Query: 550 ILLRLYAYWARLEINLGKDLDAARGVWESLL---KIWFSSLPLYDYHFLASIGSLLAAWE 609
               +   WAR+E  L  ++  AR +W+S++      ++++ L  Y+   + G       
Sbjct: 546 PSCLIMQSWARVEARLCNNMQKARELWDSIMTRGNAKYANMWLEYYNLERAHG------- 605

Query: 610 GYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVN 669
                  +  H   A     +C S  +P    E +C   L  ER  G+LED+D A +K  
Sbjct: 606 -------DTQHCRKALHRAVQCTSD-YP----EHVCEVLLTMERTEGTLEDWDLAIQKTE 665

Query: 670 PRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDSAHGPKKVTEKGKA 729
            RL  +   +++  E E  + Q +  K +        +   KK K +    K+  ++ + 
Sbjct: 666 TRLARVNEQRMKAAEKEAALVQQEEEKAEQRKKVRAEKKALKKKKKTRGADKRREDEDEE 669

Query: 730 QLQNIDDQTGDIKERVKKPDDTSDQ--QMKDSVQEKGK 756
                +++    K R  +    S +   MK+  +  GK
Sbjct: 726 NEWGEEEEEQPSKRRRTENSLASGEASAMKEETELSGK 669

BLAST of Clc11G00120 vs. ExPASy Swiss-Prot
Match: Q9USY2 (Uncharacterized RNA-binding protein C1861.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC1861.04c PE=4 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 5.4e-29
Identity = 170/745 (22.82%), Positives = 301/745 (40.40%), Query Frame = 0

Query: 93  DSDSEDDESDQKLQLQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFP 152
           D D ++    + L L  L       P NYD+H++ I+ L+++     L +AR+    IFP
Sbjct: 15  DIDPQEQNEKKPLLLDELTKFTILHPYNYDSHIKLIEELKRLDKKKELSEARKTFQSIFP 74

Query: 153 LTPSMWQEWAKDEASINTGPEALAAIERLYERGVFDYLSVSLWCDYLNF---VQEYDPLV 212
           L+  +W ++  DE       +    I+ L++  V DYLS+ +WC YL F   + +     
Sbjct: 75  LSEDLWVDYLLDECKNCRTLDDYVRIKTLFDLAVQDYLSIKIWCMYLEFTLNLMDTSSFE 134

Query: 213 QDCATSG----IKKARDLFERALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFS 272
           Q+ +       +  A  LFERA      HF++++ +W  Y +F +     + E + +   
Sbjct: 135 QEQSELNGVITLTDAHSLFERAYQTCKFHFSKSQCVWTLYLEFLQTFGDALFEGEEEQEI 194

Query: 273 YGKDCIWTSPKAKEKQVQLIRSIFH-RQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTES 332
             K+ I+                FH  +L LP   +  T  +  T+             +
Sbjct: 195 VFKNKIYD---------------FHIDRLKLPHEQIEETFTSLSTF-----------VTN 254

Query: 333 NYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTDTERLHQYIIYL-----KFEQSA 392
           N+S   PS+      ++  +Y   ++   +I  ++L      H    Y+     +  +S 
Sbjct: 255 NWS---PSEYEDVMVKSNKVYETTLKRNAKIFNKELLLNSANHSLEAYMDLINDESRRST 314

Query: 393 GDPARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWV 452
            +   +  L+ERAI  +P+  +LWL YT ++ K     +   +V  RATRNC WIG +W 
Sbjct: 315 AELQYITTLYERAIVLYPLIPELWLQYTAWLSKVDFSSSQASSVAERATRNCSWIGRIWS 374

Query: 453 QYLLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNEMDVVEKENQQYLDLFLT 512
             L  +  + AS    ++V E+  +C  S L        L N  +V+        D F  
Sbjct: 375 IKLTYMTLSGAS---TSAVCEEKDRCLNSNL--------LVNFDEVI--------DFFSG 434

Query: 513 RIDGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHLKNSEILLRLYAYWARLEINLG 572
            +       S+  + +  L++ +      +  DYL  + K S+         AR+ I L 
Sbjct: 435 FLKACLYLSSNEDKPQEFLKHQI-----HKVEDYLRKNHKGSKD--------ARMRIELS 494

Query: 573 K-DLDAARGVWESLLKIWFSSLPLYDYHFLASIGSLLAAWEGYIAMEIELNHINNARSIY 632
           K  L +    +ES+ K W  S   +D+   A        W       ++ N    A ++ 
Sbjct: 495 KIYLYSEISDFESVEKCW--SDMFHDFQNQA------LYWISRYISTMKYNPELAAETLK 554

Query: 633 KRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVNPRLEELK-----SYKLQMD 692
           K  Y         +++    L F+  Y S+ D ++        L ++      S+K Q+D
Sbjct: 555 KSLY---------KNVDQPQLLFQ-FYQSIMDLNNDCFTNTSHLYDVLNAQRISFKRQLD 614

Query: 693 ESENPVKQTDHGKRKLG-GDAPNVESPAKKLKDSAHGPKKVTEKGKAQLQNIDDQTGDIK 752
                 KQT      L    A +  + +KK K    G      K   Q +N ++ T  + 
Sbjct: 615 SFAEETKQTVENTEPLKVPQADDTAALSKKRKPGQEGDVFKKSKPIEQHRNREELTVLV- 660

Query: 753 ERVKKPDDTSDQQMKDSVQEKGKVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVD 812
                P D S+ ++K                FF+D G ++ I IL D    K   +A ++
Sbjct: 675 --TNLPSDISENELK---------------IFFKDCGNIIRIFILED--NQKDVKVAQIE 660

Query: 813 FSDDAHLDAGVAKNKQLLLGKRISI 818
           FS+ + + A   ++ + + G  IS+
Sbjct: 735 FSETSEVLAAKTRDLKSIRGHEISV 660

BLAST of Clc11G00120 vs. ExPASy TrEMBL
Match: A0A1S3CCC2 (squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499370 PE=4 SV=1)

HSP 1 Score: 1429.5 bits (3699), Expect = 0.0e+00
Identity = 738/882 (83.67%), Positives = 777/882 (88.10%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE I SNAE PLPTNS +G+ VNGDEPMPD+ QNP    SDSSSSDSDSEDDES+Q LQ
Sbjct: 1   MDEPISSNAETPLPTNSDQGNSVNGDEPMPDLPQNP----SDSSSSDSDSEDDESNQNLQ 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           L SLQSQLSS+ S+YDAHVQ IKLLRK+GDIDNLRKAREAMSEIFPL+PSMWQEWAKDEA
Sbjct: 61  LHSLQSQLSSNASDYDAHVQYIKLLRKVGDIDNLRKAREAMSEIFPLSPSMWQEWAKDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SINTGPEALAAIERLYERGVFDYLSVSLW DYLNF++EYDPLVQDCATSGIKK RDLFER
Sbjct: 121 SINTGPEALAAIERLYERGVFDYLSVSLWLDYLNFIREYDPLVQDCATSGIKKVRDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETD Q              AKEKQ+QLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDAQ--------------AKEKQIQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNM+STLE YK WE+E+KQGC LDTESNYSDGVP+ VAS YRRALDMYN
Sbjct: 241 SIFHRQLSLPLSNMSSTLETYKAWELEVKQGCALDTESNYSDGVPTLVASTYRRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLE+QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEDQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           T Y+DKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALER+ ASEGEIASVFEKSLQCS
Sbjct: 361 TCYIDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERSHASEGEIASVFEKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRISS VQLE+ LEYSLIKET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISSGVQLEDALEYSLIKET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQRASDYLSP LKNSE+L+RLYAYWARLEINLGKDLD+ARGVWESLLKI           
Sbjct: 481 FQRASDYLSPQLKNSEVLVRLYAYWARLEINLGKDLDSARGVWESLLKI----------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                GSLLAAWEGYIAME+ELNHINNARSIYKRCYSKRF GSGSEDICHSWLRFERE+G
Sbjct: 541 ----CGSLLAAWEGYIAMEVELNHINNARSIYKRCYSKRFTGSGSEDICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLEDFDHA RKVNPRLEELKSYKLQ+D+SENPVKQ+DH KRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDFDHAVRKVNPRLEELKSYKLQIDDSENPVKQSDHSKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKG------------ 766
           AHGPKKVTEKGKAQLQN+DDQTGDI+ RVKKPDD+SDQQM DSVQ KG            
Sbjct: 661 AHGPKKVTEKGKAQLQNLDDQTGDIRGRVKKPDDSSDQQMNDSVQGKGKVYNDQCTAFIS 720

Query: 767 ----KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 826
               KVTY+HLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+AGVAKNKQLL
Sbjct: 721 NLNLKVTYDHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLEAGVAKNKQLL 780

Query: 827 LGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGK 886
           LGK+ISIARSDPK+GGH  DRAGGGKRFESRSSSKEP K NEQP G+RKHGGN+VELKGK
Sbjct: 781 LGKKISIARSDPKRGGHGMDRAGGGKRFESRSSSKEPHKVNEQPSGLRKHGGNSVELKGK 829

Query: 887 NTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           NTFAVPRNVRALGWTADKPKT+EQDDE PK+NDEFRKLYFKG
Sbjct: 841 NTFAVPRNVRALGWTADKPKTLEQDDENPKSNDEFRKLYFKG 829

BLAST of Clc11G00120 vs. ExPASy TrEMBL
Match: A0A6J1FHD1 (squamous cell carcinoma antigen recognized by T-cells 3 OS=Cucurbita moschata OX=3662 GN=LOC111445777 PE=4 SV=1)

HSP 1 Score: 1375.5 bits (3559), Expect = 0.0e+00
Identity = 720/876 (82.19%), Positives = 765/876 (87.33%), Query Frame = 0

Query: 54  NAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQLQSLQSQ 113
           NA++PL TNS + ++VNGD+PMPD+ QNPTS ASDSSSSDSDSE DESDQKLQLQ+LQS+
Sbjct: 5   NAQLPLSTNSDDENLVNGDDPMPDLPQNPTSPASDSSSSDSDSEVDESDQKLQLQTLQSE 64

Query: 114 LSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPE 173
           LSS+PSNYDAHVQ IKLLRK+GDIDNLRKAREAMSE+FPLTPSMWQEWAKDEASI+TGPE
Sbjct: 65  LSSNPSNYDAHVQYIKLLRKVGDIDNLRKAREAMSEMFPLTPSMWQEWAKDEASISTGPE 124

Query: 174 ALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGL 233
            +AAIERLYERGVFDYLSVSLWCDYLNFVQEYDP+V+D ATS IKKARDLFERALTAAGL
Sbjct: 125 DVAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPMVRDRATSRIKKARDLFERALTAAGL 184

Query: 234 HFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQL 293
           HFTEAEKLWEAYR+FEK+IYQ+I ETDTQ              AKEKQVQLIRSIFHRQL
Sbjct: 185 HFTEAEKLWEAYREFEKSIYQSIDETDTQ--------------AKEKQVQLIRSIFHRQL 244

Query: 294 SLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEE 353
           SLPLSNMNSTLEAYK WEME+KQGC+LD +SN SDGVPSQVASAY+RALDMYNARVQ EE
Sbjct: 245 SLPLSNMNSTLEAYKAWEMEVKQGCILDGKSNDSDGVPSQVASAYQRALDMYNARVQFEE 304

Query: 354 QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKT 413
           QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERA+ADFPV VDLWLDYTRYMDKT
Sbjct: 305 QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAVADFPVVVDLWLDYTRYMDKT 364

Query: 414 LKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDET 473
           LKV NIVRNVYSRATRNCPWIGDLWVQYLLALERA ASEGEIASVFEKS+ CSFSTL E 
Sbjct: 365 LKVSNIVRNVYSRATRNCPWIGDLWVQYLLALERAHASEGEIASVFEKSVLCSFSTLYE- 424

Query: 474 GEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDY 533
                              YLDLFLTRIDGLRRRISSAV+LE+VL YSLIKETFQRASDY
Sbjct: 425 -------------------YLDLFLTRIDGLRRRISSAVELEDVLGYSLIKETFQRASDY 484

Query: 534 LSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYHFLASIGS 593
           LSPHLKNSE L+RLYAYWARLEINLGKDL AARGVWESLLKI                GS
Sbjct: 485 LSPHLKNSEDLVRLYAYWARLEINLGKDLVAARGVWESLLKI---------------SGS 544

Query: 594 LLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDH 653
           LLAAWEGYIAMEIE NHINNARSIYKRCYSKRFPGSGSEDICHSWLRFERE+GSLEDFDH
Sbjct: 545 LLAAWEGYIAMEIESNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREFGSLEDFDH 604

Query: 654 AARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDSAHGPKKV 713
           AARKVNPRLEELKSYKLQMDESENP K ++HGKRKLGGDAP+VESPAK+LKD+AHGPKKV
Sbjct: 605 AARKVNPRLEELKSYKLQMDESENPAKPSEHGKRKLGGDAPDVESPAKRLKDAAHGPKKV 664

Query: 714 TEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGKV----------------T 773
            EKGKAQLQ++D QTG+ K + KKPDDTSDQQMKD VQEKGKV                T
Sbjct: 665 NEKGKAQLQSLDGQTGNSKAKAKKPDDTSDQQMKDFVQEKGKVYNDQCTAFVSNLNLKAT 724

Query: 774 YEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLLLGKRISI 833
           YEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+A VAKNKQLLLGK+ISI
Sbjct: 725 YEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLNAAVAKNKQLLLGKKISI 784

Query: 834 ARSDPKK-GGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGKNTFAVP 893
           ARSDPKK GGH+TDRA GGK  ES SS+K P+KANEQP   RK GGN VELKGKNTFAVP
Sbjct: 785 ARSDPKKGGGHTTDRASGGKTSESVSSAK-PRKANEQPPAERKDGGNKVELKGKNTFAVP 830

Query: 894 RNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           RNVRALGWTADKP+T EQ+DEKPKTNDEFRKLYFKG
Sbjct: 845 RNVRALGWTADKPRTAEQEDEKPKTNDEFRKLYFKG 830

BLAST of Clc11G00120 vs. ExPASy TrEMBL
Match: A0A6J1I352 (squamous cell carcinoma antigen recognized by T-cells 3 OS=Cucurbita maxima OX=3661 GN=LOC111470118 PE=4 SV=1)

HSP 1 Score: 1350.9 bits (3495), Expect = 0.0e+00
Identity = 710/876 (81.05%), Positives = 756/876 (86.30%), Query Frame = 0

Query: 54  NAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQLQSLQSQ 113
           N ++PL TNS + ++VNGD+PMPD+ QNPTS ASDSSSSDSDSE DESDQKLQLQ+LQS+
Sbjct: 5   NDQLPLSTNSDDENLVNGDDPMPDLPQNPTSPASDSSSSDSDSEVDESDQKLQLQTLQSE 64

Query: 114 LSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPE 173
           LSS+PSNYDAHVQ IKLLRK+GDIDNLRKAREAMSE+FPLTPSMWQEWAKDEASI+TGPE
Sbjct: 65  LSSNPSNYDAHVQYIKLLRKVGDIDNLRKAREAMSEMFPLTPSMWQEWAKDEASISTGPE 124

Query: 174 ALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGL 233
            +AAIERLYERGVFDYLSVSLWCDYLNFVQEYDP+V+D ATS IKK RDLFERALTAAGL
Sbjct: 125 DVAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPMVRDRATSRIKKVRDLFERALTAAGL 184

Query: 234 HFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQL 293
           HFTEAE LWEAYR+FEK+IYQ+I ETD+Q              AKEKQVQLIRSIFHRQL
Sbjct: 185 HFTEAETLWEAYREFEKSIYQSIDETDSQ--------------AKEKQVQLIRSIFHRQL 244

Query: 294 SLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEE 353
           SLPLSNMNSTLEAYK WEME+KQGC+LD +SN SDGVPSQVAS Y+RALDMYNARVQ EE
Sbjct: 245 SLPLSNMNSTLEAYKAWEMEVKQGCILDGKSNDSDGVPSQVASTYQRALDMYNARVQFEE 304

Query: 354 QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKT 413
           QISKQ+LTDTERLHQYIIYLKFEQSAGDPARVQVLFERA+ADFPV VDLWLDYTRYMDKT
Sbjct: 305 QISKQELTDTERLHQYIIYLKFEQSAGDPARVQVLFERAVADFPVVVDLWLDYTRYMDKT 364

Query: 414 LKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDET 473
           LKV NIVRNVYSRATRNCPWIGDLWVQYLLALERA ASEGEIASVFEKS+ CSFSTL E 
Sbjct: 365 LKVSNIVRNVYSRATRNCPWIGDLWVQYLLALERAHASEGEIASVFEKSVLCSFSTLYE- 424

Query: 474 GEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDY 533
                              YLDLFLTRIDGLRRRISSAV LE+VL YSLIKETFQRASDY
Sbjct: 425 -------------------YLDLFLTRIDGLRRRISSAVDLEDVLGYSLIKETFQRASDY 484

Query: 534 LSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYHFLASIGS 593
           LSPHLKNSE L+RLYAYWARLEINLGKDL AARGVWESLLKI                GS
Sbjct: 485 LSPHLKNSEDLVRLYAYWARLEINLGKDLVAARGVWESLLKI---------------SGS 544

Query: 594 LLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDH 653
           LLAAWEGYIAMEIE NHI NARSIYKRCYSKRFPGSGSEDICHSWLRFERE+GSLEDFDH
Sbjct: 545 LLAAWEGYIAMEIESNHITNARSIYKRCYSKRFPGSGSEDICHSWLRFEREFGSLEDFDH 604

Query: 654 AARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDSAHGPKKV 713
           AARKVNPRLEELKSYK QMDESENP K ++HGKRKLGGDAP+VES AK+LKD+AHGPKKV
Sbjct: 605 AARKVNPRLEELKSYKFQMDESENPAKPSEHGKRKLGGDAPDVESSAKRLKDAAHGPKKV 664

Query: 714 TEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGKV----------------T 773
            EKGKAQLQ++D QTG+ K + KKPDDTSDQQMKD VQEKGKV                T
Sbjct: 665 NEKGKAQLQSLDGQTGNSKAKAKKPDDTSDQQMKDFVQEKGKVYNDQCTAFVSNLNPKAT 724

Query: 774 YEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLLLGKRISI 833
           YEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+A VAKNKQLLLGK+ISI
Sbjct: 725 YEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLNAAVAKNKQLLLGKKISI 784

Query: 834 ARSDPKK-GGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGKNTFAVP 893
           ARSDPKK GGH+TDRA GGK  ES SS+K P+KANEQP   R  GGN VELKGKNTFAVP
Sbjct: 785 ARSDPKKGGGHTTDRASGGKTSESVSSAK-PRKANEQPPAER-DGGNKVELKGKNTFAVP 829

Query: 894 RNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           RNVRALGWTADKP+T EQ+DEKPKTNDEFRKLYFKG
Sbjct: 845 RNVRALGWTADKPRTAEQEDEKPKTNDEFRKLYFKG 829

BLAST of Clc11G00120 vs. ExPASy TrEMBL
Match: A0A5D3CUH2 (Squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45G001220 PE=4 SV=1)

HSP 1 Score: 1261.5 bits (3263), Expect = 0.0e+00
Identity = 661/829 (79.73%), Positives = 696/829 (83.96%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE I SNAE PLPTNS +G+ VNGDEPMPD+ QNP    SDSSSSDSDSEDDES+Q LQ
Sbjct: 1   MDEPISSNAETPLPTNSDQGNSVNGDEPMPDLPQNP----SDSSSSDSDSEDDESNQNLQ 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           L SLQSQLSS+ S+YDAHVQ IKLLRK+GDIDNLRKAREAMSEIFPL+PSMWQEWAKDEA
Sbjct: 61  LHSLQSQLSSNASDYDAHVQYIKLLRKVGDIDNLRKAREAMSEIFPLSPSMWQEWAKDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SINTGPEALAAIERLYERGVFDYLSVSLW DYLNF++EYDPLVQDCATSGIKK RDLFER
Sbjct: 121 SINTGPEALAAIERLYERGVFDYLSVSLWLDYLNFIREYDPLVQDCATSGIKKVRDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETD Q              AKEKQ+QLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDAQ--------------AKEKQIQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNM+STLE YK WE+E+KQGC LDTESNYSDGVP+ VAS YRRALDMYN
Sbjct: 241 SIFHRQLSLPLSNMSSTLETYKAWELEVKQGCALDTESNYSDGVPTLVASTYRRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLE+QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEDQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           T Y+DKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALER+ ASEGEIASVFEKSLQCS
Sbjct: 361 TCYIDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERSHASEGEIASVFEKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRISS VQLE+ LEYSLIKET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISSGVQLEDALEYSLIKET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQRASDYLSP LKNSE+L+RLYAYWARLEINLGKDLD+ARGVWESLLKI           
Sbjct: 481 FQRASDYLSPQLKNSEVLVRLYAYWARLEINLGKDLDSARGVWESLLKI----------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                GSLLAAWEGYIAME+ELNHINNARSIYKRCYSKRF GSGSEDICHSWLRFERE+G
Sbjct: 541 ----CGSLLAAWEGYIAMEVELNHINNARSIYKRCYSKRFTGSGSEDICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLEDFDHA RKVNPRLEELKSYKLQ+D+SENPVKQ+DH KRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDFDHAVRKVNPRLEELKSYKLQIDDSENPVKQSDHSKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGK----------- 766
           AHGPKKVTEKGKAQLQN+DDQTGDI+ RVKKPDD+SDQQM DSVQ KGK           
Sbjct: 661 AHGPKKVTEKGKAQLQNLDDQTGDIRGRVKKPDDSSDQQMNDSVQGKGKVYNDQCTAFIS 720

Query: 767 ---------------------------------------VTYEHLRDFFQDVGGVVAIRI 826
                                                  VTY+HLRDFFQDVGGVVAIRI
Sbjct: 721 NLNLKASYYQSAFYFNSLSGVCIFVDEHIAYAASSQLWQVTYDHLRDFFQDVGGVVAIRI 776

BLAST of Clc11G00120 vs. ExPASy TrEMBL
Match: A0A1S4E2W5 (squamous cell carcinoma antigen recognized by T-cells 3 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499370 PE=4 SV=1)

HSP 1 Score: 1225.7 bits (3170), Expect = 0.0e+00
Identity = 656/882 (74.38%), Positives = 691/882 (78.34%), Query Frame = 0

Query: 47  MDEAIESNAEIPLPTNSGEGDIVNGDEPMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQ 106
           MDE I SNAE PLPTNS +G+ VNGDEPMPD+ QNP    SDSSSSDSDSEDDES+Q LQ
Sbjct: 1   MDEPISSNAETPLPTNSDQGNSVNGDEPMPDLPQNP----SDSSSSDSDSEDDESNQNLQ 60

Query: 107 LQSLQSQLSSDPSNYDAHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEA 166
           L SLQSQLSS+ S+YDAHVQ IKLLRK+GDIDNLRKAREAMSEIFPL+PSMWQEWAKDEA
Sbjct: 61  LHSLQSQLSSNASDYDAHVQYIKLLRKVGDIDNLRKAREAMSEIFPLSPSMWQEWAKDEA 120

Query: 167 SINTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFER 226
           SINTGPEALAAIERLYERGVFDYLSVSLW DYLNF++EYDPLVQDCATSGIKK RDLFER
Sbjct: 121 SINTGPEALAAIERLYERGVFDYLSVSLWLDYLNFIREYDPLVQDCATSGIKKVRDLFER 180

Query: 227 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIR 286
           ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETD Q              AKEKQ+QLIR
Sbjct: 181 ALTAAGLHFTEAEKLWEAYRDFEKAIYQTIAETDAQ--------------AKEKQIQLIR 240

Query: 287 SIFHRQLSLPLSNMNSTLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYN 346
           SIFHRQLSLPLSNM+STLE YK WE+E+KQGC LDTESNYSDGVP+ VAS YRRALDMYN
Sbjct: 241 SIFHRQLSLPLSNMSSTLETYKAWELEVKQGCALDTESNYSDGVPTLVASTYRRALDMYN 300

Query: 347 ARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 406
           ARVQLE+QISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY
Sbjct: 301 ARVQLEDQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDY 360

Query: 407 TRYMDKTLKVGNIVRNVYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCS 466
           T Y+DKTLKVGNIVRNVYSRATRNCPWIGDLWV+YLLALER+ ASEGEIASVFEKSLQCS
Sbjct: 361 TCYIDKTLKVGNIVRNVYSRATRNCPWIGDLWVRYLLALERSHASEGEIASVFEKSLQCS 420

Query: 467 FSTLDETGEAFVLGNEMDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKET 526
           FSTLDE                    YLDLFLTRIDGLRRRISS VQLE+ LEYSLIKET
Sbjct: 421 FSTLDE--------------------YLDLFLTRIDGLRRRISSGVQLEDALEYSLIKET 480

Query: 527 FQRASDYLSPHLKNSEILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYH 586
           FQ                                                          
Sbjct: 481 FQ---------------------------------------------------------- 540

Query: 587 FLASIGSLLAAWEGYIAMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYG 646
                                                         DICHSWLRFERE+G
Sbjct: 541 ----------------------------------------------DICHSWLRFEREFG 600

Query: 647 SLEDFDHAARKVNPRLEELKSYKLQMDESENPVKQTDHGKRKLGGDAPNVESPAKKLKDS 706
           SLEDFDHA RKVNPRLEELKSYKLQ+D+SENPVKQ+DH KRKLGGDAPNVESPAKKLKDS
Sbjct: 601 SLEDFDHAVRKVNPRLEELKSYKLQIDDSENPVKQSDHSKRKLGGDAPNVESPAKKLKDS 660

Query: 707 AHGPKKVTEKGKAQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKG------------ 766
           AHGPKKVTEKGKAQLQN+DDQTGDI+ RVKKPDD+SDQQM DSVQ KG            
Sbjct: 661 AHGPKKVTEKGKAQLQNLDDQTGDIRGRVKKPDDSSDQQMNDSVQGKGKVYNDQCTAFIS 720

Query: 767 ----KVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLL 826
               KVTY+HLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHL+AGVAKNKQLL
Sbjct: 721 NLNLKVTYDHLRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLEAGVAKNKQLL 740

Query: 827 LGKRISIARSDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHGGNNVELKGK 886
           LGK+ISIARSDPK+GGH  DRAGGGKRFESRSSSKEP K NEQP G+RKHGGN+VELKGK
Sbjct: 781 LGKKISIARSDPKRGGHGMDRAGGGKRFESRSSSKEPHKVNEQPSGLRKHGGNSVELKGK 740

Query: 887 NTFAVPRNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFKG 913
           NTFAVPRNVRALGWTADKPKT+EQDDE PK+NDEFRKLYFKG
Sbjct: 841 NTFAVPRNVRALGWTADKPKTLEQDDENPKSNDEFRKLYFKG 740

BLAST of Clc11G00120 vs. TAIR 10
Match: AT4G24270.2 (EMBRYO DEFECTIVE 140)

HSP 1 Score: 756.1 bits (1951), Expect = 3.2e-218
Identity = 430/875 (49.14%), Positives = 563/875 (64.34%), Query Frame = 0

Query: 68  IVNGDEPMPDV-LQNPTSA----ASDSSSSDSDSEDDESDQKLQLQSLQSQLSSDPSNYD 127
           + N D+ M D   +NP  A    + DS  SDSDSE DE++   Q+ +L+S+LS++P NYD
Sbjct: 9   VSNTDQKMEDASAENPARADPPSSDDSGDSDSDSE-DEAESNQQIVTLESELSANPYNYD 68

Query: 128 AHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLY 187
           A+VQ IKLLRK  +++ LR+AREAMS IFPL+PS+W EWA+DEAS+    E +  I  LY
Sbjct: 69  AYVQYIKLLRKTANLEKLRQAREAMSAIFPLSPSLWLEWARDEASL-AASENVPEIVMLY 128

Query: 188 ERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLW 247
           ERG+ DY SVSLWCDYL+F+ E+DP V+   + GI K R LFERA+ AAG H TE  ++W
Sbjct: 129 ERGLSDYQSVSLWCDYLSFMLEFDPSVRGYPSEGISKMRSLFERAIPAAGFHVTEGNRIW 188

Query: 248 EAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLSNMNS 307
           E YR+FE+ +  TI E D ++              + KQ+Q IRSIFHR LS+PL N++S
Sbjct: 189 EGYREFEQGVLATIDEADIEE--------------RNKQIQRIRSIFHRHLSVPLENLSS 248

Query: 308 TLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTD 367
           TL AYKTWE+E  QG  LD  S+    V  QVA A ++A  MY+ R  LEE ISKQDL+D
Sbjct: 249 TLIAYKTWELE--QGIDLDIGSDDLSKVSHQVAVANKKAQQMYSERAHLEENISKQDLSD 308

Query: 368 TERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRN 427
           TE+  +++ Y+KFE+++GDP RVQ ++ERA+A++PVS DLW+DYT Y+DKTLKVG  + +
Sbjct: 309 TEKFQEFMNYIKFEKTSGDPTRVQAIYERAVAEYPVSSDLWIDYTVYLDKTLKVGKAITH 368

Query: 428 VYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNE 487
            YSRATR+CPW GDLW +YLLALER  ASE EI  VFEKSLQC+FS+ +E          
Sbjct: 369 AYSRATRSCPWTGDLWARYLLALERGSASEKEIYDVFEKSLQCTFSSFEE---------- 428

Query: 488 MDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHLKNSE 547
                     YLDL+LTR+DGLRRR+ S   LE  L+YSLI+ETFQ+ASDYL+PH++N++
Sbjct: 429 ----------YLDLYLTRVDGLRRRMLSTRMLE-ALDYSLIRETFQQASDYLTPHMQNTD 488

Query: 548 ILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYHFLASIGSLLAAWEGYI 607
            LL L+ YWA LE+N+GKDL  ARGVW+S               FL   G +LAAW  YI
Sbjct: 489 SLLHLHTYWANLELNIGKDLAGARGVWDS---------------FLKKSGGMLAAWHAYI 548

Query: 608 AMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVNPRL 667
            ME+ L HI  ARSIY+RCY+++F G+GSEDIC  WLRFERE+G LE FD A +KV PRL
Sbjct: 549 DMEVHLGHIKEARSIYRRCYTRKFDGTGSEDICKGWLRFEREHGDLEHFDLAVQKVMPRL 608

Query: 668 EELKSYKLQMDESENPVKQT----DHGKRKLGGDAPNVESPAKKLKDSAHGPKKVTEKGK 727
           EEL+  +LQ + +  PVK +    +H  +K   +  NVE  +   +      K+V   G+
Sbjct: 609 EELQLMRLQQEST--PVKPSAGLKEHSSQKRKAE-QNVEEESLAKRQKRKSQKEVDLGGQ 668

Query: 728 AQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGKV----------------TYEHLR 787
           +        T ++K    K  D+  ++ +D+   K KV                  E +R
Sbjct: 669 SATV---PATKNVKAENGKTADSDKEETEDAKPLKPKVYRDECTAFISNLSVKAQEEDIR 728

Query: 788 DFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLLLGKRISIARSDP 847
            FF D GGV +IRILH K TGK RGLAY DF DD HL A +AKN+++  GK+ISIARS+P
Sbjct: 729 KFFGDDGGVDSIRILHHKDTGKPRGLAYADFVDDEHLAAAIAKNRKMFFGKKISIARSNP 788

Query: 848 KKGGHSTDRAGGGKRFESRSSSKEPQKANEQ---PRGVRKHG---GNNVELKGKNTFAVP 907
           KKG     R G     +   +SK+P   +E+   P G    G   GN VE++GKNTFAVP
Sbjct: 789 KKGKKEFTRRG---NVDGSGNSKDPSLISEKAKAPLGGETEGERKGNEVEVRGKNTFAVP 816

Query: 908 RNVRALGWTADKPKTVEQDDEKPKTNDEFRKLYFK 912
           RNV+ LG+T  KP      DE PK+NDEFR ++ K
Sbjct: 849 RNVKPLGYTTPKPSA----DETPKSNDEFRNMFLK 816

BLAST of Clc11G00120 vs. TAIR 10
Match: AT4G24270.1 (EMBRYO DEFECTIVE 140)

HSP 1 Score: 755.7 bits (1950), Expect = 4.2e-218
Identity = 430/872 (49.31%), Positives = 561/872 (64.33%), Query Frame = 0

Query: 68  IVNGDEPMPDV-LQNPTSA----ASDSSSSDSDSEDDESDQKLQLQSLQSQLSSDPSNYD 127
           + N D+ M D   +NP  A    + DS  SDSDSE DE++   Q+ +L+S+LS++P NYD
Sbjct: 9   VSNTDQKMEDASAENPARADPPSSDDSGDSDSDSE-DEAESNQQIVTLESELSANPYNYD 68

Query: 128 AHVQCIKLLRKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLY 187
           A+VQ IKLLRK  +++ LR+AREAMS IFPL+PS+W EWA+DEAS+    E +  I  LY
Sbjct: 69  AYVQYIKLLRKTANLEKLRQAREAMSAIFPLSPSLWLEWARDEASL-AASENVPEIVMLY 128

Query: 188 ERGVFDYLSVSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLW 247
           ERG+ DY SVSLWCDYL+F+ E+DP V+   + GI K R LFERA+ AAG H TE  ++W
Sbjct: 129 ERGLSDYQSVSLWCDYLSFMLEFDPSVRGYPSEGISKMRSLFERAIPAAGFHVTEGNRIW 188

Query: 248 EAYRDFEKAIYQTIAETDTQDFSYGKDCIWTSPKAKEKQVQLIRSIFHRQLSLPLSNMNS 307
           E YR+FE+ +  TI E D ++              + KQ+Q IRSIFHR LS+PL N++S
Sbjct: 189 EGYREFEQGVLATIDEADIEE--------------RNKQIQRIRSIFHRHLSVPLENLSS 248

Query: 308 TLEAYKTWEMEMKQGCVLDTESNYSDGVPSQVASAYRRALDMYNARVQLEEQISKQDLTD 367
           TL AYKTWE+E  QG  LD  S+    V  QVA A ++A  MY+ R  LEE ISKQDL+D
Sbjct: 249 TLIAYKTWELE--QGIDLDIGSDDLSKVSHQVAVANKKAQQMYSERAHLEENISKQDLSD 308

Query: 368 TERLHQYIIYLKFEQSAGDPARVQVLFERAIADFPVSVDLWLDYTRYMDKTLKVGNIVRN 427
           TE+  +++ Y+KFE+++GDP RVQ ++ERA+A++PVS DLW+DYT Y+DKTLKVG  + +
Sbjct: 309 TEKFQEFMNYIKFEKTSGDPTRVQAIYERAVAEYPVSSDLWIDYTVYLDKTLKVGKAITH 368

Query: 428 VYSRATRNCPWIGDLWVQYLLALERARASEGEIASVFEKSLQCSFSTLDETGEAFVLGNE 487
            YSRATR+CPW GDLW +YLLALER  ASE EI  VFEKSLQC+FS+ +E          
Sbjct: 369 AYSRATRSCPWTGDLWARYLLALERGSASEKEIYDVFEKSLQCTFSSFEE---------- 428

Query: 488 MDVVEKENQQYLDLFLTRIDGLRRRISSAVQLENVLEYSLIKETFQRASDYLSPHLKNSE 547
                     YLDL+LTR+DGLRRR+ S   LE  L+YSLI+ETFQ+ASDYL+PH++N++
Sbjct: 429 ----------YLDLYLTRVDGLRRRMLSTRMLE-ALDYSLIRETFQQASDYLTPHMQNTD 488

Query: 548 ILLRLYAYWARLEINLGKDLDAARGVWESLLKIWFSSLPLYDYHFLASIGSLLAAWEGYI 607
            LL L+ YWA LE+N+GKDL  ARGVW+S               FL   G +LAAW  YI
Sbjct: 489 SLLHLHTYWANLELNIGKDLAGARGVWDS---------------FLKKSGGMLAAWHAYI 548

Query: 608 AMEIELNHINNARSIYKRCYSKRFPGSGSEDICHSWLRFEREYGSLEDFDHAARKVNPRL 667
            ME+ L HI  ARSIY+RCY+++F G+GSEDIC  WLRFERE+G LE FD A +KV PRL
Sbjct: 549 DMEVHLGHIKEARSIYRRCYTRKFDGTGSEDICKGWLRFEREHGDLEHFDLAVQKVMPRL 608

Query: 668 EELKSYKLQMDESENPVKQT----DHGKRKLGGDAPNVESPAKKLKDSAHGPKKVTEKGK 727
           EEL+  +LQ + +  PVK +    +H  +K   +  NVE  +   +      K+V   G+
Sbjct: 609 EELQLMRLQQEST--PVKPSAGLKEHSSQKRKAE-QNVEEESLAKRQKRKSQKEVDLGGQ 668

Query: 728 AQLQNIDDQTGDIKERVKKPDDTSDQQMKDSVQEKGKV----------------TYEHLR 787
           +        T ++K    K  D+  ++ +D+   K KV                  E +R
Sbjct: 669 SATV---PATKNVKAENGKTADSDKEETEDAKPLKPKVYRDECTAFISNLSVKAQEEDIR 728

Query: 788 DFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGVAKNKQLLLGKRISIARSDP 847
            FF D GGV +IRILH K TGK RGLAY DF DD HL A +AKN+++  GK+ISIARS+P
Sbjct: 729 KFFGDDGGVDSIRILHHKDTGKPRGLAYADFVDDEHLAAAIAKNRKMFFGKKISIARSNP 788

Query: 848 KKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHG---GNNVELKGKNTFAVPRNV 907
           KKG     R G      S+  S   +KA + P G    G   GN VE++GKNTFAVPRNV
Sbjct: 789 KKGKKEFTRRGNDGSGNSKDPSLISEKA-KAPLGGETEGERKGNEVEVRGKNTFAVPRNV 815

Query: 908 RALGWTADKPKTVEQDDEKPKTNDEFRKLYFK 912
           + LG+T  KP      DE PK+NDEFR ++ K
Sbjct: 849 KPLGYTTPKPSA----DETPKSNDEFRNMFLK 815

BLAST of Clc11G00120 vs. TAIR 10
Match: AT5G41770.1 (crooked neck protein, putative / cell cycle protein, putative)

HSP 1 Score: 51.6 bits (122), Expect = 3.9e-06
Identity = 150/684 (21.93%), Positives = 269/684 (39.33%), Query Frame = 0

Query: 74  PMPDVLQNPTSAASDSSSSDSDSEDDESDQKLQLQSLQSQL--SSDPSNYDAHVQCIKLL 133
           P P  ++N T A    ++     E  E  Q+ +++  + ++  S++ S+Y       +L 
Sbjct: 25  PRPTRVKNKTPAPIQITAEQILREARER-QEAEIRPPKQKITDSTELSDY-------RLR 84

Query: 134 RKIGDIDNLRKAREAMSEIFPLTPSMWQEWAKDEASINTGPEALAAIERLYERGVFDYLS 193
           R+    D +R+AR  +         +W ++A+ E S      A +  ER  E    DY +
Sbjct: 85  RRKEFEDQIRRARWNI--------QVWVKYAQWEESQKDYARARSVWERAIEG---DYRN 144

Query: 194 VSLWCDYLNFVQEYDPLVQDCATSGIKKARDLFERALTAAGLHFTEAEKLWEAYRDFEKA 253
            +LW  Y  F         +     +  AR++++RA+T         ++LW  Y   E+ 
Sbjct: 145 HTLWLKYAEF---------EMKNKFVNSARNVWDRAVTL----LPRVDQLWYKYIHMEEI 204

Query: 254 IYQTIAETDT----QDFSYGKDCIWTS---PKAKEKQVQLIRSIFHRQLSLPLSNMNSTL 313
           +              D+S  +   W S    + +  +++  R+I+ R +       +  +
Sbjct: 205 LGNIAGARQIFERWMDWSPDQQG-WLSFIKFELRYNEIERARTIYERFVL-----CHPKV 264

Query: 314 EAYKTW-EMEMKQGCVLDTESNY-------SDGVPSQV-----ASAYRRALDMYNARVQL 373
            AY  + + EMK G V    S Y       +D   +++     A    R  ++  AR   
Sbjct: 265 SAYIRYAKFEMKGGEVARCRSVYERATEKLADDEEAEILFVAFAEFEERCKEVERARFIY 324

Query: 374 EEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQ--------VLFERAIADFPVSVDLW 433
           +  +        E L  Y  ++ FE+  GD   ++          +E  +   P + D W
Sbjct: 325 KFALDHIPKGRAEDL--YRKFVAFEKQYGDKEGIEDAIVGKRRFQYEDEVRKSPSNYDSW 384

Query: 434 LDYTRYMDKTLKVGNIVRNVYSRATRNCPWIGD---------LWVQYLL-------ALER 493
            DY R +++++   + +R +Y RA  N P   +         LW+ Y L        +ER
Sbjct: 385 FDYVR-LEESVGNKDRIREIYERAIANVPPAEEKRYWQRYIYLWINYALFEEIETEDIER 444

Query: 494 AR-----------ASEGEIASVFEKSLQCSFSTLDETGEAFVLGNEMDVVEKEN--QQY- 553
            R            S+   A ++  + Q     L+ TG   +LGN +    K+   ++Y 
Sbjct: 445 TRDVYRECLKLIPHSKFSFAKIWLLAAQFEIRQLNLTGARQILGNAIGKAPKDKIFKKYI 504

Query: 554 -LDLFLTRIDGLRRRISSAVQL--ENV--------LEYSLIKETFQRASDYLS---PHLK 613
            ++L L  +D  R+     ++   EN         LE SL++    RA   L+   P L 
Sbjct: 505 EIELQLGNMDRCRKLYERYLEWSPENCYAWSKYAELERSLVETERARAIFELAISQPALD 564

Query: 614 NSEILLRLYAYWARLEINLGKDLDAARGVWESLL------KIWFSSLPLYDYHFLASIGS 668
             E+L + Y      EI+ G +L+  R ++E LL      K+W S        F AS   
Sbjct: 565 MPELLWKAY---IDFEISEG-ELERTRALYERLLDRTKHYKVWVSFA-----KFEASAAE 624

BLAST of Clc11G00120 vs. TAIR 10
Match: AT3G26420.1 (RNA-binding (RRM/RBD/RNP motifs) family protein with retrovirus zinc finger-like domain)

HSP 1 Score: 48.5 bits (114), Expect = 3.3e-05
Identity = 30/102 (29.41%), Positives = 51/102 (50.00%), Query Frame = 0

Query: 761 LRDFFQDVGGVVAIRILHDKFTGKSRGLAYVDFSDDAHLDAGV-AKNKQLLLGKRISIAR 820
           LRD F+  G +V  +++ DKF+G+SRG  ++ F +   +D  + A N   L G+ I++ +
Sbjct: 23  LRDAFEKYGHLVEAKVVLDKFSGRSRGFGFITFDEKKAMDEAIAAMNGMDLDGRTITVDK 82

Query: 821 SDPKKGGHSTDRAGGGKRFESRSSSKEPQKANEQPRGVRKHG 862
           + P +GG   D  G       R   +   +   +P G R  G
Sbjct: 83  AQPHQGGAGRDNDG------DRGRDRGYDRDRSRPSGGRGGG 118

BLAST of Clc11G00120 vs. TAIR 10
Match: AT2G16940.1 (Splicing factor, CC1-like)

HSP 1 Score: 45.4 bits (106), Expect = 2.8e-04
Identity = 34/110 (30.91%), Positives = 56/110 (50.91%), Query Frame = 0

Query: 730 DIKERVKKP--DDTSDQQMKDSVQEKGKVTYEHLRDFFQDVGGVVAIRILHDKFTGKSRG 789
           D KE   +P  D   DQ+   + Q   + T   + +FF   G V  +RI+ D+ + +SRG
Sbjct: 165 DKKEDKVEPEADPERDQRTVFAYQIALRATERDVYEFFSRAGKVRDVRIIMDRISRRSRG 224

Query: 790 LAYVDFSDDAHLDAGVAKNKQLLLGKRISIARSDPKKG--GHSTDRAGGG 836
           + YV+F D   +   +A + Q LLG+ + +  S+ +K     +T  AG G
Sbjct: 225 IGYVEFYDTMSVPMAIALSGQPLLGQPVMVKPSEAEKNLVQSTTAAAGAG 274

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038876547.1 | 0.0e+00 | 87.07 | squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 [Benincasa hi...
XP_008460582.1 | 0.0e+00 | 83.67 | PREDICTED: squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 [C...
XP_004142811.2 | 0.0e+00 | 82.31 | squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 [Cucumis sati...
XP_011655427.1 | 0.0e+00 | 82.20 | squamous cell carcinoma antigen recognized by T-cells 3 isoform X2 [Cucumis sati...
KAG6596153.1 | 0.0e+00 | 81.60 | Squamous cell carcinoma antigen recognized by T-cells 3, partial [Cucurbita argy...

Match Name | E-value | Identity (%) | Description
B3DJT0 | 2.0e-76 | 27.55 | Squamous cell carcinoma antigen recognized by T-cells 3 OS=Danio rerio OX=7955 G...
Q5REG1 | 3.0e-72 | 30.20 | Squamous cell carcinoma antigen recognized by T-cells 3 OS=Pongo abelii OX=9601 ...
Q15020 | 7.4e-71 | 29.58 | Squamous cell carcinoma antigen recognized by T-cells 3 OS=Homo sapiens OX=9606 ...
Q9JLI8 | 4.5e-68 | 27.65 | Squamous cell carcinoma antigen recognized by T-cells 3 OS=Mus musculus OX=10090...
Q9USY2 | 5.4e-29 | 22.82 | Uncharacterized RNA-binding protein C1861.04c OS=Schizosaccharomyces pombe (stra...

Match Name | E-value | Identity (%) | Description
A0A1S3CCC2 | 0.0e+00 | 83.67 | squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 OS=Cucumis me...
A0A6J1FHD1 | 0.0e+00 | 82.19 | squamous cell carcinoma antigen recognized by T-cells 3 OS=Cucurbita moschata OX...
A0A6J1I352 | 0.0e+00 | 81.05 | squamous cell carcinoma antigen recognized by T-cells 3 OS=Cucurbita maxima OX=3...
A0A5D3CUH2 | 0.0e+00 | 79.73 | Squamous cell carcinoma antigen recognized by T-cells 3 isoform X1 OS=Cucumis me...
A0A1S4E2W5 | 0.0e+00 | 74.38 | squamous cell carcinoma antigen recognized by T-cells 3 isoform X2 OS=Cucumis me...

Match Name | E-value | Identity (%) | Description
AT4G24270.2 | 3.2e-218 | 49.14 | EMBRYO DEFECTIVE 140
AT4G24270.1 | 4.2e-218 | 49.31 | EMBRYO DEFECTIVE 140
AT5G41770.1 | 3.9e-06 | 21.93 | crooked neck protein, putative / cell cycle protein, putative
AT3G26420.1 | 3.3e-05 | 29.41 | RNA-binding (RRM/RBD/RNP motifs) family protein with retrovirus zinc finger-like...
AT2G16940.1 | 2.8e-04 | 30.91 | Splicing factor, CC1-like
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000504 | RNA recognition motif domain | SMART | SM00360 | rrm1_1 | coord: 718..818; e-value: 9.9E-5; score: 31.7
IPR000504 | RNA recognition motif domain | PFAM | PF00076 | RRM_1 | coord: 756..815; e-value: 2.3E-9; score: 36.9
IPR000504 | RNA recognition motif domain | PROSITE | PS50102 | RRM | coord: 749..822; score: 12.065728
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 135..167; e-value: 40.0; score: 11.0
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 380..412; e-value: 3.4E-5; score: 33.3
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 415..447; e-value: 1.2; score: 18.2
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 173..204; e-value: 32.0; score: 11.7
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 609..644; e-value: 97.0; score: 8.2
IPR003107 | HAT (Half-A-TPR) repeat | SMART | SM00386 | hat_new_1 | coord: 215..251; e-value: 0.059; score: 22.5
IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | GENE3D | 3.30.70.330 | - | coord: 741..863; e-value: 1.9E-15; score: 59.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 360..682; e-value: 3.9E-46; score: 159.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 111..322; e-value: 1.5E-47; score: 163.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 112..673
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 105..120
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 727..752
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 55..120
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 818..865
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 818..912
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 669..752
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 669..691
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 889..912
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 698..719
None | No IPR available | PANTHER | PTHR17204 | PRE-MRNA PROCESSING PROTEIN PRP39-RELATED | coord: 74..907
None | No IPR available | PANTHER | PTHR17204:SF25 | EMBRYO DEFECTIVE 140 | coord: 74..907
IPR035979 | RNA-binding domain superfamily | SUPERFAMILY | 54928 | RNA-binding domain, RBD | coord: 756..840

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc11G00120.2 | Clc11G00120.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006396 | RNA processing
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003723 | RNA binding
molecular_function | GO:0003676 | nucleic acid binding
molecular_function GO:0003676 nucleic acid binding