Cucsa.155960 (gene) Cucumber (Gy14) v1

Name: Cucsa.155960
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Aminopeptidase
Location: scaffold01139 : 138507 .. 160155 (-)
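
The location field can be split into scaffold, coordinates, and strand for downstream use. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotations), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'scaffold01139 : 138507 .. 160155 (-)'.

    Hypothetical helper for illustration; assumes 1-based inclusive coordinates.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Span on the scaffold, counting both end positions
        "length": end - start + 1,
    }

loc = parse_location("scaffold01139 : 138507 .. 160155 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 21,649 bp on the minus strand of scaffold01139.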
The following sequences are available for this feature:

Gene sequence (with intron)

TCGATAGGTGTGAACAAAACAAAACGAAACAACATTTCCCTCCGGAGCCGGCGGGGATCGTTCTGTTTCATGAGAACTTTTTTCTGGTTGAGAAGAAAGACAATTTTTTTTTAGATTCAAAACCTAAATTGCTTTATATATTGTCTTTATCGATAAAATAATTGCTTCTATTCATCGTAATTATGATTTGATTTTGTTTTGGAATCGACTTGGGACTGTTCTTGACACTGTTTCAGACGTTACAGTTTGCTGATTTGGAGGTTTCTTCTTTCCTTTTTCTGCTGCCTTGTTTTGGAAGGCATAAATTCAATATTCTCATCTAAAAAACCTCCGTTCTATACCAAAGAATCTTACTGTATTTTGGTCGTTCCCTCAGGTACTATTATTGTCTCGCCTTTCTCTTTCTTAGGCCAGTGGCGTACCCTCGTTCAGCTTCTGAACAATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGCTTATAATGACCTTGGCCAATCTAATTGGTTCGACCACTTCTCCCTTTGGGTGATTTATGAATTTGGGGTTCTTTCTTTTTACGAATTTGAGTGGTTCTATGAGTTTCTAGGATTGCGGTATTCATAATGGTTATCCATGTAATTAATTATGTCAGATATTCTGCTGTGCTATAGTGTGATTCACTGCCTGCGTTTCTCCGCAATCCAAACTACCTGTTTTTGTCAATTTTGTAACATCCATTCGGATGGCCTGGTGCTAGTGTTCTATTTGTACTGCTATTATTATATTCTACACTGCTTTTGATGATAACCTCTTTTCCTTGGTCCATGACTTTTCGTTTTGTATTATAAGAGTATCCGCTAGTAGAGGTATTAAGAATGGTCGCTATTCATGAACTTTTTTCCTTACCTGCTTGCTGAACTGAGATGATAAATCCTAGAGAATTATGTTCCCTTTGTAATTGGGGCTACTGATTATTTCATTGTTTTCTAGAACATAGTTCTGATCGGGCAGAGTTCTTAAAACTTTGCAAGAGAATTGAGTACACAATTCGAGCTTGGTATCTTCTTCAATTTGAGGATTTGATGGTACTATTCTTCATGAATGAAAGGTCTTCATTTCACTGTTACTATTCTGCTTGTGAACCTGAATTGAGACAGATCTACTTTGATTTCAGCAACTCTACTCTCTCTTTGATCCTGTTCATGGAGCCCAGAAGCTGGAGCAGCAGAATCTATCCTCTGATGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACCTGTTTCAGGTTTGTACAAGTTCTCTGTTTCTTTTATTTTGTTCATAGTGTTAATTATTTTCTAAAGTAACTTCCCAATCATCATGGATTGACTTATTGACCATAGAACATAAAAAAGAAAAATGGTTTTGGGGCAATAAGCTTAAACCTTGTGATGGCCACCTACATAGGAGATATTGAATTCCTAGGAGTTCTATGACAACTGAGATTCGGAAAACCAAATTTTGTCCAGATTGAAATCAATAATATGAGGAGTGGGAGATGCATGTTTAGCAAATATTGTGAAGGAGGATAAAAATAAGAGAGAGAGAGAAAGAGAGAGGGAGAGAGAGAGAGAGTATAAGTGCATTATTGAATGTTATGTACCAAACCAACAAGTGGGGATTTCTCAGATGGTTAATTGGGACAATTACAAATGTGGCATTTTGACTAATGGTTGAAGTTGGACAAAAATACATTGTTTTTATTAAGAAGCACATTGACCTGTTACTTTGGTTCTTGCTTCACTCAATTGATATCTTTGGCATCGGCTTTCGGTATAAATTGCACCGACATTTAACTTAACTTAATGGTTTCCTTCAGAACACTATTCTAAATGTTAACATCAAAAGCTTTTTGACAATTGACTTATTTTTTCTTGATTCATCCAGGAGGATTTATGTGATTCTTATTCTGAGCATTTACCCTTTGCTGTTTTCAAACTTGTACC
TGTTATTTTCTGCATGACTTTGACAAAGTACAGGGCCTCGTAGTTTGCTCGTTTACCAATGAGATATCTTTTGCAAAAGTGATGGAGAAGTATTTCTCCTCACCATGAGATTGAAATTTAAAACCAAACAACAGTTTCAAACAAACACATCACCTAACCAATTTGAGAGGGTTAGTAATTTCTTCATACAAAAGTACTGTAACATTGACAAAATAAAATCTAAACCTTCCTTACAAATCAACGTACCCTTGGTACAGCTTTTTGCTCCCTAGCTCTAATACACTTCCAATGTTTCCCACTAATTAAACACACTAACTACCATATTGTCTTTTCCCTTATTCTCCTCCTTGCCTGCGTATCCATAGTAGAAGCCCAAACAAAATAGTAGGCACAGTTTAACTGTAACCTTCCTCCCTTAGGTACAATAATTGTACCTTCCCTGATGTACTGGGAACCATAGGTTTTATCGCCTTGTCAGACCTTATAGCCCTCATATAAAAATGGAACATAAAAGCAAATAACTTGAGTAGGCAGAGAGTATCAACCAGGTATAGACATATGTAAAACATTCTTGAAATCTTTACTTTGAAAGCACACTTATATAAATACTTGCTTTAACCCAAAACATTCTAAATAGCTTGGAAACATGCTTGCTTAAACATTCTTTACAAATCTCCTGCTTTATAAGACATGCTTTGGAAATAGCTTATAAAACATGCTTAACATAACTCTTTATTAAACTCATGCTTTGCAAATCATTTACTAAATCAGTCAAGAATATCTTTCAGAAATCTCAAAGCATATTTAGAAACTTTTAGCTAGAAACTCATGCTTAGAAATTGTTATTTAAATGCATTAGTCACTCACAACTTGTAGCCCAACTCCATGGCCTAAGAATTATCCTTTTACTCTTCTTGGTCTGAAAGAAGGAATTAATATCAAAATTAATCTTCTCATACTTAAAAATATCAACAAACCACTTAAAATTCAACAAAAATCCAAAAAAATCATTTTTAGCATAAAGCCACAGCACATGCCACAACACTGTCATGGGCATGCTGCGTGTGCACATAGGCAGGCCAATGACATGGCCTTGGGCAGCACTGGGTAACCTCACAGCCACATGCCTCAGGCGCCAGCCTTGTGCCCAACACCTGCTCTGCCTTGCCGCAACAAACACCTTGCTGCGTGCCACTAATCTCTCAAAAGGCTGTGACTCAGATCAGAAGAAAGACAAAGTAGTTTAAGAGAAAGAGGTCTGAAATTCGATGAAAAATGTCTGCGAAGCGAAGGCACAAAGAATATATAGAGGACTAGCCTTGCCGCCAGTTTCAACGCCCAACTTTGACACTCACGGCCTGAAGCTTCAGATTACCTTCGAAAATCTGGACTCGACTTTAAAAATTCATAACTTAAAATCCATAACTCAAATCAAGAAATTTCCTCCTACAAACTTGTTTCTTATCCTCTTAACTACATTACCTGAAAATTCCAGCTCTAGGTTCGAAATAATGAGCTTTGTAGCCTCTGGAGAATCAACATTGCTTTAGTTTACCCGAGAAATGACTTTCAAGGCTTACCTTCAATGGATGAATGTCTTCCGCACGTTCTCTTTGTTCAGCAGCCTAACCCATTAAACTAGACTCATTTTGGGAGCTTTTGCAATTGGAATTACAAAAAACGCATACCATGCCACATTATTAAAGTTACCTTATATATTGGTCCTTGCAATGCCTTAAGCCATTTCCTGCTTGCCATTTTTTAATTCTTTGCCTGCCTAGACATCTAGAATCCACTCCACTGGACACTATTTGAACGCCTAGAATTTCATTTGGTCATATGGCATCCCACATCATCTCGCAAAGCCTTTGACTAGACTTTGCCGCCTAGCCTCTCGGCACACACACACCATGCCGCCTAGCCATAGGTGCATGGCTTGTCTCAACTTTTCCGTTAGAGGTTCTTTTGGCATAACTTGGCTGTATCATGCAAACCCTTAG
CTGCCTAGCATGCTTTCCTCGCCTAGGGTGTTTTTTACTGCCCAACTCTTTGCCCTTAGAGGTTTTCTGGACATGGTGGCTGTTCAACCATGCTCAATGCCTAGCACTCATGTCATTCTTTGGCCGCCTAGGACATGACCTTTTGAGGTTTTTTGCCGCCTAGCACACTCAAAACGCTTAGTCTTCCATGTCCAACACTTGGCCAAAATTTTGAACTTCTCTTCCTTTCGTTTTCCTTTTCCTTTGGCCGCCCACTATGAATTCTCCCTTCCAACATTTAAACTAATTACCTCTCAAGCTTCTCTTTCTGAAACCACAAACACAACCAATATACTAAACCACAACTCTCTTGGTATTCTAGGGTTTACACTACTTCCATAAGGTGAAGGTTTTTTCGTTTGACAACCCCATTTTGCTACGCACAAGAACTCTGGTGGACAATCCCTTTGGAGGACAGGAACCCATGAAGGGTATGATTTTGGAACTCACGACCGTTGTCTCTCCATACAATTGCAATTTTGGCATTGAATTGCGTTTGTACTCTGTGATGAAAATCTTGAAATATGGAATTGACCTTAGATTTGTTAGATGAGGTAAACTCATGTAAGGCGGGTATGATCATCAATAAAGGTCACAAACCACCGTTTCTCAGAAGAAGTGGTAACCCTGAACGGTCCCTAAACATCACTATGGATTAGGGTAAATGGTTAGGTTGGTTTATACGATTGTGAGGGAAAAGAGGCCCTATGTTGTTTAGCACGAATACATACATCACACGATAAAGTAAAGATCAACTTTCGAGAAAAGGTGGAGAAATAAAAATTTCATTTATTAAAAGTTTGGGTGGCCCAAACAGAAATGCTACAAGATACAATCTTTCTTAGAAGTAGTAAAATACAAAGATAATAAACTGGTCCTAGAGGTACTTCTAAAGGAAGCATCATCGTCAAGGAGGTGACTCTCCTATTGTGTCGAGCATTGCTAATCATCCTCCCGAGCTCAAGTCTGAAAAGATACAAAATCGGGTAAGAAAGTTGTTTTGCAGTTAAGCTTGAGTTGCAGTTAAGCTTGCGGGTAATCTTGCTAATAGATAACAAATTATAAAAAAAATTTGGCACATGCAACACATTCTGTAGAGGTAGGCTCCTCGAGGGAAAACATGCCCCTCCCCGATAATTGGAAGCAAGGAATCATATGCTATTCTTATTTTCGCGTTATCTACACAAGGAAGATATGACAAAATTCTCAGAAGAACCTATCAAGTGATCTGTAACACCCGAATTCAGAATCTATGGATTCTTCCCATCAATATTGATAAAACTGGGGAACTAAGGATACCTAACTGAGCAATAGCTCCGAGAGTGGGAGGACTAGGATCCTGGTTTCCTGTGGAGCTGTATGGTTGAGATGTTCTAGCATATTCACTCACATATGCACAACCTGAATTTTGTTTGTCATTTGGCAGACGTTTCTTACCACTTGGTGGACGACCATGCAATTTCCAATTGTGTTCTTTTGTGTGCTATGTTTTCTTGCAATGCTCACAAACTGGAACAAGATTTCCGTTGTGCTTCTCACCATCAGGAGTAGAGGACCTTGCATTGAAGGTAGCAAAATCAATGGCAGGAGTGGTTAGAATATTCATAGCACTCATGCGGTCCTCCTCTAGATGGACTTCAGAACAGATCTCCATCAAGGAGGGAATAAGTCTTTGGCCTAGTATGTGACCACAAACCACATCAAACTTGGAGTTAAGGCCAACAATAAAGTCAACTTCTTCAATTCTAGAATGTGTATGCCATCACTGGGACACTTTCAGATAATCTTCTACATAAATCCATTTTCCCTCAAATGAGGGAAAGTTTATTAATACAAGCTGTTACATCCATAGTCCCCTGCTTGCATTTATGGACTGTTCTCAGTGTGTCGAGACGAGAAGCATTCTAACGCTTAGATACAACTTCTGAGCTGTGTCCCATACTTAGTAGTAGTAGCAAATA
ATAAAGGCTTGCCAATCTGTGGCTCTATATTGTTAATCAACATAGACCAAATGAGATAATCCTGTCCTCCTCAGAGATGTTTCTAAGGATCACCTGGTTGAGGTCGTGGTATCTCTCCTACCAGAAAGCCAAAGTTGTCATTTTAACAGATTGAAATCAAGAAAAGTAATTCTGGCCATTCAACTTCTCCTGTAGATTGTGCCAAGGATTCACCACCTTCAAATTTCGACTGAGCTAGGGTTGCGGCACTTGGGTTAGGGTTTTTAGAATGGTGTACGGTTGCAGCAGAGCTTGGTGGCACTTGGAGTTGCCGCAGCTGAACAACGGCACTAACCCCGATTGTTGTCTCGCCGACAATGGGGTCCTCCATCATTTGCTCCTTTGTTTGGGTTTCGTCAATGGTAGTTTCTAGGGTTTCGTCATCACCTCACTCTGATACTATATTAAAAGAGACGATAAGAGAGCACACCAAGATACGTGGAAACCCCTGTACAGGGAGAAAAACTATGATAACTATGATAGAGACTTTTTCTTATATGAACCAGAAAAATACAAAGGGGAAAAATAAATAGGCAACGTGAGCCTTAACAAAAATAGGGTATTTACTAATCATCCAAGTAGGATCGAGCCCTGTTCTCAATCTGTGTTGAACTCTGTAATGTTTCTCTTCTAAATAACTAAATAGGGTTAAGCACAAGGGCTATGATTACAAAGGAGATCAGCAGGTTGAATGGAGTAATTAGAGATCCTATAACAAGTATGCATGTCTGTATGTCGTGGAATATTCTTATAATTATACTTGCAAGTGCATACACGTTGAAAAATGAGACAGTAGTATCGATTGATCATTTTGGGAAGCATTCAATTTTATACAGGTTATGGAAAAGAGCAATTTCAAAATAGCAAGTGATGAGGAGATTGAAATTGCACTTTCAGGGCAGTATCTCTTAAATCTTCCTATTACAGTTGATGAATCCAAGGTTTGTATTCAGTTATGCATGCTTTAAGATCTCAAATGGGAAATGTACGTGTAGAAAATGCAAAAATATATTTCCAGATTTGAGTATCATATTCTTTGTACTTCATTCTTTCTTCAAGATATAGTATTTATTATGTTTCTAAAGAATATCTTGGTGGACTTGTTTAATTCATACGTGAGCAATATTCTTCATATCGACTTCTCTGGTCTCATGTTTATCTTTTTAATTTGTAGCTTGACAAGGTTCTTTTAAAGAAATATTTTGCGACACATCCTCAAGCCAACCTACCAGACTTTGTTGACAAGGTATGTACTCTATTTGTACATAGCTTTGGAAACCTTAAAGTTTCTTCAACTCATTAAAAGATCGTTTTTTGACTCTTTGATTTTTAAAGCTCAAGATGCGATAAAGTGCAATAGTCTTTCGGAGCTTAAGCGTAAGACATAAAGAAAAAGCTTGGGTCTTTTTTTTAAGGAGCACTATATATGAATGTTCTATATTAAAAACAGAAAGCATGGTTGAAGAGAAAATATGAAAAACCAGATCTTTAAACACAAGAAATTTTGCATTTAGACTTGGGTAAATTTGAATTTTTAAAAAATAATAAAAAGGCCCAAGGCGCATTGCTTTGAGCCTTAATCTTTGTCAAACCTGCTCTTGGAGTTTTCCACTTATTTGGTTTGCTAGTCTTTCAACATGTGCTATTCTGATTTTTTTAGTTTATTAAATGAATGGAGAAAATTTGAAGTCAATAGGAAAGAATCATATGACAACAAAATTTAACAAATCTAATTTTAAAAAATACAAAAGGTACATCATTCGAGAAAAAGGAATAAAAGATACATAAATTTAACAAATCTAAGTTTAAAAAATAAAAAAGATACATCATTCAAGAAAAATGTATAAAAGATACATTATTGTTATCATTAAAATTAATTAATAATGAGTTTATTGAAGAAAAATGTTTGCATAAAAAAAATATTCCAAAATGAAAGTGGAAAATTGGGAGATGAATAAAAA
TCATTTTAATATTCCAAAATGAAAGTGGAAAATTGGGAGATGATTAAAAATCATTTTTATGTTAATGTTTTAATGTTTAATGTAGGTGAATTTTCAAAAATAGCAAATTTTACAAAATACTTACCATCTATAGCAAATTCTATCATAGTTATTGATATGCTATAGTGATAGAAGACTATTATTGATAGAATTCAAAATTTTGCTATAACTTGTAAATATTTTAATTTCTTTTGCTATTTTTAAAAATGTCCTTTAGTGTATTCTGTAAAATATAATATATTAAAAAATAATCTTATAATCAATGTTAAATTATTGAATTATAACCGAACTTTGCTTTTTAACAAAATATACTACTTTCATTTAAAGATAACAAAATACTTACCATCTATAGCAAATTCTATCATAGTTATTGATATGAATTTCTTTTCAAAGTTTACATTATTTCATGTCATAAGAAGAACGTATTCATCAAAGAAAACATGGAAGTATCTATTGGATGGAAGAGATTAGTACACGCACTTGAAATAATTATATTGTCATAGATAGCAAAAAAAAAAAAAAGAAACTACATACGAAGTATAGCAAAAACATTCAAATGAGCTATAGAGATTTTGATAGATTTGTCTATATTTTAGAAATAATTTCAAGAAAAATTTACAAATAGAAGAAAATGGAAAGTATCCGTATCCTATTGAAAAAAATAAAAAATTGATACTTTGGGTTTGAGTTAAAATTTTAGGTTAAAATTTTAGTTATTCATTTTTAGTTATGCATTTTGTAGGGGCAGTAAATATTTGCCAATTCTTCTATTTTTAAAAACTTCCCTAATTTCACTATTTTGCTATTTTCAATATGCCCACTTTTAATCTACTGTGGATGTTTAAAATCAAACTAAAAATTACTTTTTCAAATATAACATTACACATTACTTTATAAAAAACTTAGTACAATTATCTAATATCCCATGTCATGGGAAAGGAATAAATTTGGTGTGGAAAAATAAACTTATATAACTTTTTATAGGAAAAATGTGATTCATCAAACAACTAATTTTGGAGAAAAAGGTTGAGTGTGGAAAAAAAATAAATAGCATACACAAGATTTTTATAAAAAATAGGTACAGAGTGATGATTATTTAAATTTCAAGTGTTGAGTAAGGTTGCTAGATTATTTAACATTTCAAACTTTTCAAATTCTAAAATTTATTGCAAAATAATAAATTTGTTATGAATTTTTAATAGTTATTGGATAAAACTATTTAAATTCTCAAACTTTTAAAATTCTAAAATTAATTAAATTTGAAGTTGGAAAATTATCAACACTAGAACATTTGACAAAATATTTACCCTACATAAAAAATCAAGTGCAATAAATTTTGTCTTTTAGTGGTTTTGTTCTATGCAGGGTAAATAATTTGTCAAAATTATCTATTTTTGAAAAAACCCCTTTAAAGTCATTATTTATTAGGACAAAAATGAAGAAAGTACTTTCATATTTCTACGAAAAGCATCCTATTACTTTTAATTATAATTAATAATTAGATTCCAACAAAAGTTACCTAATTCAAACAAAATAAAAATATTAACCACACTTTGCCATTTTGTATCATTTTACAATGTCCAACATCCCATATGTTACATTCTAATTAAAAAATAAATTCCATGGCTACATAATGTAAACATCCCATCTAGAAATACATGCACGAACATATTTAGATTCCATGTTTACATTCTAACAAAAAAAAGCCTAAGATGAATACAACAAAAGCAAAAGGTTGAAAGAGAATGATCATTTGATTTCTTCTTGTGTAACTTATGAGCATGAATGGAGAAAAGATAAGATATTTATACCCAAAAATTTCAAAGGGAGGAAATTGAAAGGTTGGGAGATAGAGAAGAGTTAATAAGAGAAAAGGGAATGTGAATAGTTGGGAGATAAAAGGTTAAGAGAAAGATGAAAAGAATGGATGAGGAAGAGAGGAGTTTTGAATTAATTAGAGA
GAAATGTGAGTTTTTTCTAGAGGTGGGGGAGAGTTATTTTTGTCCATCTAGAGAGTGAGGAGTAGTGGAAGGAGAGAATAATAAATGAATAAATTAAACAACAGAAAAGTAAAATATAAATCATCACATCACTCAACCATTTAGTAAAATATAAATCATCACATCACTCAATCATTTTATTAATAAACAACTAAAAATAAATTAATTTAATGAAAAAGAATTAATTTAATTTAGCAATTAACTACATTAATTTTGGATTTAAATGACATATTTGCCACCAAAGTAGTAATGAAAAACCTTATAATGAAAGGACATAATTTGAACAAAAAGTTTCTGCTCATCCATGCTTTATATATACTATAGATTAGATATTGATTATTTATGATTCAGGTGCCAAAGTCTGGCTCTATTTCCTCCTTTCAATCCCTCCATTTTCTTAGGTCAAAGTTTCAATGCAGAGATTCATAGACAGAAATAAAGCTAGGATTATAGTTTTAAACTTCTTTGCTGTACTTTGGTCATGTTCACAACGCTGGCCACAAAGGTTCTTGTCGTATGTTACACGAGTCTGTCATTGGTACTAAAAATGAATGTTTGTAGGAAAACTAGATATAATTGGTGGTCGGAATTATTGTTTTCAGTTCTGTGCATGGGAAATTCATGAGTGGTATAATCATTTTCAACTTCATAACCTTAAACTCACCCTAAATGGACACAGGCCTCTCTTTGTCTCACTCATCTGTACCGTGTTTCTTTCGTCAGTATGTTATTTTCAGGCGAGGAACTGGAATTGACCAAACGAGTGATTTTTTTTTCATGGAGAAAGTAGACATGCTTATCGGAAGGTTTTGGGCATATCTGTTAAGGTTAACGAGGTAAGTTAGTACTTGAAATCAGTGAACATTTTATTTCATTTACTATCAGTTGTAGGCCTGTGAATAATTATATGAATTATTTTACTTTGGCTAGTCAAACATCAAATCAAAGAGTTATGTTGCATAAACTTGTTATAGACCAAAGATTTTTTGCCATCATTGGCAGTTTCTAGTTGGACATCATGAAAAGGAGATTTCTGAGCACATCGCATTTACTTTTACAATAAAACGGCTTCTGAGCAACCTTCAGAGTATGAAACAGCATCTGTTTGAGCACCATCCAAACTATGTTTTTCTGCTTTTGATTTGTGTGGTTGCAGGTTAGAAAAGATTCTCTGCAGACGGCCAATTTCACGATCTACGGAGGATAGGAAGAAGAATGATGAGATTCCCCCTGATGCTGATCAAGACCTAGATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGGTTGTGCATAGTTTCTTACAGTTGATGACTGACTGCAGTTGGATCATGCTTATACAAGTTTTACGTTTGATGGAATGAATAAAAAATGAAAAATTAAAGCAAAAGATTCATTATATGTTATTCTGGGGTTTAGTTGGTAATAGGACTAATAAGAAAAATGCTGTTCACTCATTGAATTTTCTAGTTATTTTCAATACTACTTTAAACAGACATGTCTTAATTTTATGAATGGTCAGGATTAGTTGACTTAGAAAAGCAAGATTAACCAACCTCCCCCCTCCTTTGTTTTCACAGCGCTTCCAATTTGCTGGGCAAGGTTACCATTCAAGAACCTACCTTTGATAGGATTATTGTAGTTTACAGGTACTGCTCTATTAACTATGCATGCCCCATTATATTTTGAAGCTTCATTTTATTAGATTATGTAGGATCTGATTACAAAACTTGATTCTTCTTTTGGCTAAATGAATTTGAACTGTTGGTTAGAGGTCAGACTGAGGGGGCTATTTAATTACATTGTGGAGTTTGTGGTACTTAATGACAGGAAGGTTATATATGGATGATTTTGTCTTACTATTGTAGCTGTCTTGTTCGTTATTGGTGCATATTAGTGGGAATTTCATTTTACTTTTATTACTAACAATCAACCAGCTTTCAGTTTTTCTAGTCAGAG
AGTATCTATTGCCATCTGAGCTGATTGTTCATCAGAAAATATTTTTTTCCAATTATTCTAAAGTACAGTTCGTTGACATCTAAAAGGAATGGTAATTCTTCAGGCGAGCAAGTACAAAGTCTAAACCTGAACGTGGAATATATGTCAAGCATTTTAAAAACATTCCAATGGCTGATATGGAAATAGTACTTGTAAGTTAATTCTCTATTTTAATGATCATTCTTTTTGCTTGTTGAAAAGTTTGTATGAGAAGATTGGTGACCGAATTTTTTGTGACAGCCTGAAAAGAAGAATCCAGGACTGACTCCAATGGACTGGGTTAAGTTTATTGTATCTGCTATAGTTGGGCTGGTTAGTATTAAGTATGTTTATTTTACTGCACTATTGTTGTCATTAAGTTCTTAATATTTTGTTTACTTGTTTTTACTGTGTGTGTATTAATAGGTTGCCCTTGTGGGTTCAATTGAGATGCCCAAGGCTGACTTTTGGGTCATTTTCGCTGTTCTCTCAACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTAGTACCATTCTCTCTTTGATGCATTCTTTAAATTAATCATTAGCTTCTGTAGACCAATTCTGTAAATTATTCAATAATTTCCAAATTTATAAGATTGGAACGAAAATGTGACTAGGAGGCTGATAATCAAAGACAAATTTAGAGTCGACCTTCTTCGAGAAGAGGCAGATGAATGTTTCATTTAATGATGCATTGATAAAACCATAAGAATAGATGTCAATAATCATGTTCTTGATGTCTTGCTTGAGAGAAGGCTAGAAGAATTTAAAAAATTCTATGGGGAAACCATTGGGGTTAGGGGACTTACTAACTCCCAAAGAGGAAACTGCGCAACTCAACTGACACCTGAGGATAAGAGGTCCCGGGTTGAAATCCCACCACCCCCAAGTTGTATTACAGTACCATTTAAAAAGAATTCCCAAAGAGGAAACTACATAAACAAGCTCTACCTCAGTGAAAGGGTGTTCCAATTCATATGCTTCCTCAACTCTAATAGGGCACATAACAACCAATATTAGTGGGCAGAAAGCCCTGATGACCATTCTTAGAGAATAAATTGGAACAAACATGAAGAAATTCAACTTCGATATCAGATGCTGTCACCAGACTCGGCCCTTCTTTGGAAAAAAATTTGGTGAATGAACTCTTATGCTTATGGGCAACAAAAATACGGTGGAAAAAGCGGGTTTTTTTATCACCATTTCATTGCATCCCATATACATCTTGTGCAATCAACTGTTCAAATATGTTCAATCTGATCTATATCTATCCATTATCCTCCAAAGTGTCCAAAGCGGCAAGTTGTGAAATCAGGGAAGGGAGACTCCTGGCCTCAAGATGATAGTTTTGCTCCCAATCACAAAGAAAAAACTTTAGTTCTTTCAGTTTCGTCATCAATCCATGGTCTGGCCAACTAGAAGTGGGGTGATTAGTCCACCAATTCTTCACCAAGAACGGAATTTGTAACCATGAATTTTCAAAGTGAAAGGGGCGTGGACGCTAATCAATACTACCAGAGGATAAAGCCAGAGGAAAATGATGAGACATAATTCGATCTAGATGGTGGAAGGTCACAACAATGAATTTGGGAAGCCAAGCGTTTGCAACAAGGAATTTGTCCAATAAGGAAAGATAAGAGGTTGGTCCTGAGAAAGACAAGTATAACAGCCATCGTTCAACAGGATATTACACAACTTATAATTGGAGCTCCACTGATTAAAAGCTCCCATACTAGCAGAGATTGAGTGATCATGTAATTTCTCTCATGACTAGCAGATCAACTAAAATCACCGTCAGTAATCGAGAGTCACCACTAAGTCCAGCCACATCATCAAGTTCCTGCCAATATTCTGGATGGTAACATCCATAGACTGCTGTAAGCCAAAAAGAAAACCCATTAGCCAAAAAAACTTGAATGGACATCATATTTGTGTTTCCTAAGTTTACCTCTGT
TGTTTCAGTTATAGGCTGTATTTTTCCTTAGTCCTTTTTATTGAAAAGAGAGACAACGAGTTCACACCAATTTACGTGGAAACCCGAGTACCGGGAGAAAAACCACGATTGTTTATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTGCTCTTAGTATAGTACTCCTGTCCTTTGAGCATCGGTCTCTTTTATTATTATTAACAAAGAGGCTCGTTTCCATTTAAAAAAATAAAAATACTTGAATGGACATAGTAAACATGCCTTGAATAACGTCTTGAATCCTGTAACCAGGTGCTTTCCAATGTATAAGAATGCCCCCAAAGAGTCAATGGTGTCAACTGTCTAGAGCTATGCACAGCCAATAAAAGAAGAGCTACAAATAGATTTAACCAAAAAGCTATCCACTGACAGCTTTCTTTCCTGAAGGAGGACAATACTAGGGCTCTGTTTTTGGATCACCTTCTTAACCAAAGCTTGCTTCTTCCAGGAACCTAAACCACGGACATTCCACGTGAGAAACTTCATGGGTGAAAAGTGGTACCCTTCTGTTAGTCAACGTGGCTGTTTTGTCATAGTGGATTGATGTTTGTAGTCCCTGTAGTTCAATTTGCAACTTATTTTTTCACAGAAGTGGAGTATGTTTTTTGGGATTTTAATTTTAGTAGGAGAGGTGGAATGGCCATAATACAAAGGCCATGTTCACCGAGGATATTGGCAATGCCTTGAAGGTTAAAAGAGAGTTGAGTAGTAATTGAGGCGGTGGGTTGGGTATTGGCTGCAGACTATCATAATGGTTTATGTTTGGGGTATAGAGTAATTTACTTTGAGCAGAAATAATAAACCTACTGTAGCAAAGAGGGAAGAGAGAATGAGTAGGAAACAATGAACATGTTAACAAAGAGAGATTTGAAATAGTAATCATGTAGTTAATTGTAGGTTTTATTGGTTGGAAACACCAACTACCACAAGTGAAGCCCAAACATGGTCTGGGCTATAATAACCCACTCCAACCACTGGGAGGTAAACAACCCCTTTCTGTTTATACTTGAATAATGTGAAACTAGGAGGCTATTCCTAATAATTAATTTGTGATTATCAGTTTTGTTGATTACTCTAATGCATGTATACTCTGGCCATTTTTGTTTTTAAATCTATAGGTTTCAGCAAAACATGGCTACATATCAGAACTTAATTACACAATCCATGTATGACAAACAGCTGGATAGTGGAAGGGGAACACTTCTTCATTTGTGTGATGATGTGATCCAACAGGAAGTAAGTATATTATGAATTTCTTGTCTGTCAAATATGAATATTGTATCCAGATATAAATTTCTAAGTTTGAGAAAAAGGATTAAAATGCATGCTATGTTTGTAATTGGACTTTATTTTTACTTGTCTCTTTCTGAATCTCCAAAAGGAGTCTGGAACTCTCGAATGATAGTGTGGTTCCCTTTTCATTTTCCAGCTGGGACCTGGAAGGCCAATGAGTTCCAGTCACTGTCACAAAATTACCCTCTAGTTTTGGCATTTGTTGGTTAAGAATCCAATCATTTAATGAATTATTGTTATTGAATCATTGTAGGGTTATGTTTCTGTACGGCAGTGCTTTGCTCATTTTTCAAGAATCTTGGTTTTCTGTGCGATTTTTCCTGTACTGTATGTATTGTTTAGTCCTTCTGCCAGCCTGATTGATAGATAGAAAAATGGAGCAATCTGTGGATAAATGGTGAATAATCTAGCAACAAAAGTTGATCTTTTAGTCAGCCACTGTTTTGTTTAAGTACGAGCCATGGTAATTTGTAATTGAATAATTGAAGGATATGTTTTCTTAGAGGCTAGTGTTTGGTCATGTTGCAAGTAGCTCACGTGAAAA
TTGGGATAATACATGGGAAAATAGTGAACATATTATGACTCACTGTTTTTGCTACAGTTTGCATATGCATCTGTATTTGGTGATTAGAGGCCTGTTGATTTAGTTATTACTTTGATTGCTTTTGCTTTAACCATTTAATGGTGAACTTTGAGCAACAAATAGAGAAAATGATGAATTTTGAATTTGCATACACAACAATTGCTTTTCTGAAATCAATATTTAGATGCCTATTGATTTAGTTTTTCTTTGGGTTTGGTTTAACCCACAAATTGTTGGATCCTACGTTTGGGGTAATATAATGAAGTTGAAGATGAAAACCTTCCATACTGCTGTATACGATGGTAATCATCTTCACCTTTTCTTGATTAAGAGCATCATTATGTTAAGAAACATGACAGGATTTCTAAGGCGCTTTAGAAATGCTGATGAAATTAAATATTAGCTCCTTGCTCTATTTCTCTAGCTGTTTTTTGGAGATTCATTTTTATATGATATCTCTACAAACTAGGCTGGATTTTGTTCTGCGGTATGCAAGTGGCATACCCTTATGCATATACTGTTAGTCAATACTTATAATCCGTTTTTTCCCTTTGTGTGAAGGTTAAAGAGGTGATCATTTCCTTCTTTATACTGATGGAGCAAGGCAAAGCTACTTTAGAAGTAATATCTTCTGCCGCTGCTTTGTTTTCTTGCTTTCTTTGATTCTGTTTGAAAACATTGATGTGGATTTTCTGAAATTTACGGTGAAAAGTGATTTTATGATGATGGCAAGTAGGATCTTGATCTACGGTGTGAGGAGTTGATCAAAGAAGAGTTTGGAGAACACTGCAACTTCGAAGTGGACGATGCAGTTCAAAAGTTGGAGAAATTGGGAATCATTTCTCGGGTTAGTTATAAGCGTTTTTCTTACTGACGAAAATTGATCTAAAAAATTTATAAAAGCAATTTAGAAAAAGTTGTGAAAGCAATTTGAGGAAAAGTAAAAGAATATCGTATATTACTCTGTTTCTAAGTGAGGAAGGGATCTGTACGTAAATTGCATTGATTATATTGAATGCATCAGGATTCTTCTGTGTACGGGCTTATTGACTAAATGGAATATTTGATGATTGCTTAGCATGGTTTTAATGAATATTTATATATATAGTATTTGAAGATAAGTTGATAGTGTGTGGTGCATGTGGAAGGTGTCCATGATCTTCATCTGATTCTTTCTCTGTTGGCTTTAAATTTAGGATACAATTGGGCGTTATTACTGTGTTGGATTAAAACGCGCCAATGAGATCATAGGACTGACTACAGAGGAGCTTGTCCTGAAAGCAAGGCAGGGTGTCAATCCTTGAGAAATGGAAAGGCCAAACCTATTCCTGCTTTTTCTCAGAGTGTATCTTAGCGCTAAAGCACAAAAACATGAAATTCAACTGTAGATAGTGTTGGCGGTGTGCAAATGAAGAAAAATTGCTGCTCTAGAAAGTTTTCCACTTCAACAGTTTAGTGATGCAAGCTTAGTAGTTTGCTTAATGCATTTACCTCTTCTTCGTAGAAATGCTGCTCTTTGGTGTGTTTCCTAAATTGATTCGAATCTGGAATATAGTTGTATTTTTCTTGCAAGATTTATTGCATCAGTTTTGTTTCTTTTGGAAATCTCA

mRNA sequence

TCGATAGGTGTGAACAAAACAAAACGAAACAACATTTCCCTCCGGAGCCGGCGGGGATCGTTCTGTTTCATGAGAACTTTTTTCTGGTTGAGAAGAAAGACAATTTTTTTTTAGATTCAAAACCTAAATTGCTTTATATATTGTCTTTATCGATAAAATAATTGCTTCTATTCATCGTAATTATGATTTGATTTTGTTTTGGAATCGACTTGGGACTGTTCTTGACACTGTTTCAGACGTTACAGTTTGCTGATTTGGAGGTACTATTATTGTCTCGCCTTTCTCTTTCTTAGGCCAGTGGCGTACCCTCGTTCAGCTTCTGAACAATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGCTTATAATGACCTTGGCCAATCTAATTGAACATAGTTCTGATCGGGCAGAGTTCTTAAAACTTTGCAAGAGAATTGAGTACACAATTCGAGCTTGGTATCTTCTTCAATTTGAGGATTTGATGCAACTCTACTCTCTCTTTGATCCTGTTCATGGAGCCCAGAAGCTGGAGCAGCAGAATCTATCCTCTGATGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACCTGTTTCAGGTTATGGAAAAGAGCAATTTCAAAATAGCAAGTGATGAGGAGATTGAAATTGCACTTTCAGGGCAGTATCTCTTAAATCTTCCTATTACAGTTGATGAATCCAAGCTTGACAAGGTTCTTTTAAAGAAATATTTTGCGACACATCCTCAAGCCAACCTACCAGACTTTGTTGACAAGTATGTTATTTTCAGGCGAGGAACTGGAATTGACCAAACGAGTGATTTTTTTTTCATGGAGAAAGTAGACATGCTTATCGGAAGGTTTTGGGCATATCTGTTAAGGTTAACGAGGTTAGAAAAGATTCTCTGCAGACGGCCAATTTCACGATCTACGGAGGATAGGAAGAAGAATGATGAGATTCCCCCTGATGCTGATCAAGACCTAGATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGCGCTTCCAATTTGCTGGGCAAGGTTACCATTCAAGAACCTACCTTTGATAGGATTATTGTAGTTTACAGGCGAGCAAGTACAAAGTCTAAACCTGAACGTGGAATATATGTCAAGCATTTTAAAAACATTCCAATGGCTGATATGGAAATAGTACTTCCTGAAAAGAAGAATCCAGGACTGACTCCAATGGACTGGGTTAAGTTTATTGTATCTGCTATAGTTGGGCTGGTTGCCCTTGTGGGTTCAATTGAGATGCCCAAGGCTGACTTTTGGGTCATTTTCGCTGTTCTCTCAACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTTCAGCAAAACATGGCTACATATCAGAACTTAATTACACAATCCATGTATGACAAACAGCTGGATAGTGGAAGGGGAACACTTCTTCATTTGTGTGATGATGTGATCCAACAGGAAGTTAAAGAGGTGATCATTTCCTTCTTTATACTGATGGAGCAAGGCAAAGCTACTTTAGAAGATCTTGATCTACGGTGTGAGGAGTTGATCAAAGAAGAGTTTGGAGAACACTGCAACTTCGAAGTGGACGATGCAGTTCAAAAGTTGGAGAAATTGGGAATCATTTCTCGGGATACAATTGGGCGTTATTACTGTGTTGGATTAAAACGCGCCAATGAGATCATAGGACTGACTACAGAGGAGCTTGTCCTGAAAGCAAGGCAGGGTGTCAATCCTTGAGAAATGGAAAGGCCAAACCTATTCCTGCTTTTTCTCAGAGTGTATCTTAGCGCTAAAGCACAAAAACATGAAATTCAACTGTAGATAGTGTTGGCGGTGTGCAAATGAAGAAAAATTGCTGCTCTAGAAAGTTTTCCACTTCAACAGTTTAGTGATGCAAGCTTAGTAGTTTGCTTAATGCATTTACCTCTTCTTCGTAGAAATGCTGCTCTTTGGTGTGTT
TCCTAAATTGATTCGAATCTGGAATATAGTTGTATTTTTCTTGCAAGATTTATTGCATCAGTTTTGTTTCTTTTGGAAATCTCA

Coding sequence (CDS)

ATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGCTTATAATGACCTTGGCCAATCTAATTGAACATAGTTCTGATCGGGCAGAGTTCTTAAAACTTTGCAAGAGAATTGAGTACACAATTCGAGCTTGGTATCTTCTTCAATTTGAGGATTTGATGCAACTCTACTCTCTCTTTGATCCTGTTCATGGAGCCCAGAAGCTGGAGCAGCAGAATCTATCCTCTGATGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACCTGTTTCAGGTTATGGAAAAGAGCAATTTCAAAATAGCAAGTGATGAGGAGATTGAAATTGCACTTTCAGGGCAGTATCTCTTAAATCTTCCTATTACAGTTGATGAATCCAAGCTTGACAAGGTTCTTTTAAAGAAATATTTTGCGACACATCCTCAAGCCAACCTACCAGACTTTGTTGACAAGTATGTTATTTTCAGGCGAGGAACTGGAATTGACCAAACGAGTGATTTTTTTTTCATGGAGAAAGTAGACATGCTTATCGGAAGGTTTTGGGCATATCTGTTAAGGTTAACGAGGTTAGAAAAGATTCTCTGCAGACGGCCAATTTCACGATCTACGGAGGATAGGAAGAAGAATGATGAGATTCCCCCTGATGCTGATCAAGACCTAGATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGCGCTTCCAATTTGCTGGGCAAGGTTACCATTCAAGAACCTACCTTTGATAGGATTATTGTAGTTTACAGGCGAGCAAGTACAAAGTCTAAACCTGAACGTGGAATATATGTCAAGCATTTTAAAAACATTCCAATGGCTGATATGGAAATAGTACTTCCTGAAAAGAAGAATCCAGGACTGACTCCAATGGACTGGGTTAAGTTTATTGTATCTGCTATAGTTGGGCTGGTTGCCCTTGTGGGTTCAATTGAGATGCCCAAGGCTGACTTTTGGGTCATTTTCGCTGTTCTCTCAACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTTCAGCAAAACATGGCTACATATCAGAACTTAATTACACAATCCATGTATGACAAACAGCTGGATAGTGGAAGGGGAACACTTCTTCATTTGTGTGATGATGTGATCCAACAGGAAGTTAAAGAGGTGATCATTTCCTTCTTTATACTGATGGAGCAAGGCAAAGCTACTTTAGAAGATCTTGATCTACGGTGTGAGGAGTTGATCAAAGAAGAGTTTGGAGAACACTGCAACTTCGAAGTGGACGATGCAGTTCAAAAGTTGGAGAAATTGGGAATCATTTCTCGGGATACAATTGGGCGTTATTACTGTGTTGGATTAAAACGCGCCAATGAGATCATAGGACTGACTACAGAGGAGCTTGTCCTGAAAGCAAGGCAGGGTGTCAATCCTTGA

Protein sequence

MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKVDMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQDLDVERVRLENMELSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQGVNP*
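
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below is an illustration only, assuming the standard genetic code (NCBI translation table 1); the helper `translate` is hypothetical and is applied here just to the first 36 and last 15 bases of the CDS shown above rather than to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

# First 36 and last 15 bases of the CDS listed above.
first = translate("ATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGT")
last = translate("GGTGTCAATCCTTGA")
print(first, last)  # MGKNKEVIRLER GVNP*
```

These fragments agree with the start (`MGKNKEVIRLER`) and end (`GVNP*`) of the protein sequence, where `*` marks the stop codon.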
BLAST of Cucsa.155960 vs. TrEMBL
Match: A0A067LBJ5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26883 PE=4 SV=1)

HSP 1 Score: 815.8 bits (2106), Expect = 2.7e-233
Identity = 404/481 (83.99%), Positives = 444/481 (92.31%), Query Frame = 1

Query: 3   KNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDL 62
           K K+VIRLE+ESVIP+LKPKLIMTLANLIEH+SDRAEFLKLCKR+EYTIRAWYLLQFEDL
Sbjct: 5   KKKDVIRLEKESVIPILKPKLIMTLANLIEHTSDRAEFLKLCKRVEYTIRAWYLLQFEDL 64

Query: 63  MQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSG 122
           MQLYSLFDPV GAQKL+QQNLS +EI+VLEQNFL+YLFQVM+KSNFKI +DEEI++AL G
Sbjct: 65  MQLYSLFDPVSGAQKLQQQNLSPEEIDVLEQNFLTYLFQVMDKSNFKITTDEEIDVALCG 124

Query: 123 QYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKV 182
           QYLLNLPI+VDESKLDK LL+KYFA HP  NLPDF DKYVIFRRG GID+T+DFF MEKV
Sbjct: 125 QYLLNLPISVDESKLDKNLLRKYFAEHPHENLPDFADKYVIFRRGIGIDRTTDFFIMEKV 184

Query: 183 DMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQD-LDVERVRLENMEL 242
           DMLI RFWAY+LRLTR+EKIL RR   R   D KKNDEI  +AD+D   VER+RLENMEL
Sbjct: 185 DMLIARFWAYILRLTRVEKILRRRQSRRHNNDPKKNDEINSEADRDDFTVERIRLENMEL 244

Query: 243 SASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPG 302
           S  NLL + TIQEPTFDRIIVVYRRA TKSKP+RGIYVKHFKNIPMADMEIVLPEK+NPG
Sbjct: 245 SVKNLLTRTTIQEPTFDRIIVVYRRAGTKSKPDRGIYVKHFKNIPMADMEIVLPEKQNPG 304

Query: 303 LTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQ 362
           LTPMDWVKF+ SA+VGLVA+VGS+EMPKAD WVIFAVLSTVIGY AKTYFTFQQN+ATYQ
Sbjct: 305 LTPMDWVKFLASAVVGLVAVVGSVEMPKADLWVIFAVLSTVIGYIAKTYFTFQQNLATYQ 364

Query: 363 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELI 422
           NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATL+DLDLRCEELI
Sbjct: 365 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLQDLDLRCEELI 424

Query: 423 KEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQGV 482
           +EEFGE CNF+VDDAV KLEKLGI+SRD++GRY+CVGLKRANEIIG TTEEL+LKA+QGV
Sbjct: 425 QEEFGESCNFDVDDAVHKLEKLGIVSRDSLGRYFCVGLKRANEIIGTTTEELILKAKQGV 484
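
The percentages in an HSP statistics line are simply the match counts divided by the alignment length. As a minimal sketch (the function name `parse_hsp_fractions` is hypothetical, and the regular expression assumes the `Label = n/m` layout seen in this report), the figures for the first hit can be recomputed as follows:

```python
import re

def parse_hsp_fractions(line):
    """Extract 'Label = n/m' pairs and recompute each percentage.

    Returns {label: (matches, alignment_length, percent)}.
    """
    out = {}
    for name, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        n, d = int(num), int(den)
        out[name] = (n, d, round(100 * n / d, 2))
    return out

stats = parse_hsp_fractions(
    "Identity = 404/481 (83.99%), Positives = 444/481 (92.31%), Query Frame = 1"
)
print(stats)
```

The recomputed values, 83.99% identity and 92.31% positives over 481 aligned positions, match those reported for the Jatropha curcas hit.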

BLAST of Cucsa.155960 vs. TrEMBL
Match: A0A061EDG0_THECC (Disease resistance protein OS=Theobroma cacao GN=TCM_017362 PE=4 SV=1)

HSP 1 Score: 809.3 bits (2089), Expect = 2.5e-231
Identity = 400/479 (83.51%), Positives = 443/479 (92.48%), Query Frame = 1

Query: 3   KNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDL 62
           K KEVIRLERESVIP+LKPKLIMTLANLIE  SDRAEFLK CKR+EYTIRAWYLLQFEDL
Sbjct: 9   KKKEVIRLERESVIPILKPKLIMTLANLIELRSDRAEFLKFCKRVEYTIRAWYLLQFEDL 68

Query: 63  MQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSG 122
           MQLYSLFDPVHGAQKL+QQNLSS+EI+VLEQNFL+YLFQVMEKSNFKIA+D+EI++ALSG
Sbjct: 69  MQLYSLFDPVHGAQKLQQQNLSSEEIDVLEQNFLTYLFQVMEKSNFKIATDDEIDVALSG 128

Query: 123 QYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKV 182
           QYLLNLPITVDESK+D+ LLK+YF+ HPQ NLPDF  KY+IFRRG GID+T+D+FF+EKV
Sbjct: 129 QYLLNLPITVDESKIDQSLLKRYFSEHPQENLPDFAVKYIIFRRGIGIDRTTDYFFLEKV 188

Query: 183 DMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDAD-QDLDVERVRLENMEL 242
           DM+I R WAYLLRLTRL+K+L RR   +   + KK+DEI P+AD +DL VER+RLENM+L
Sbjct: 189 DMIIARLWAYLLRLTRLDKLLARRSRRQHKTEPKKDDEINPEADSEDLFVERIRLENMDL 248

Query: 243 SASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPG 302
           S  NLL K TIQEPTFDRIIVVYRRAST+S  ERG+YVKHFKNIPMAD+EIVLPEKKNPG
Sbjct: 249 SIPNLLSKTTIQEPTFDRIIVVYRRASTESNKERGVYVKHFKNIPMADLEIVLPEKKNPG 308

Query: 303 LTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQ 362
           LTPMDWVKF+ SA+VGLVA+ GS+EMPKAD WVIFA+LSTVIGYCAKTYFTFQ NMA YQ
Sbjct: 309 LTPMDWVKFLASAVVGLVAVFGSLEMPKADLWVIFAILSTVIGYCAKTYFTFQANMAAYQ 368

Query: 363 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELI 422
           NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKAT+EDLD+RCEELI
Sbjct: 369 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATMEDLDIRCEELI 428

Query: 423 KEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 481
           KEEFGE CNF+VDDAV+KLEKL IISRD+IGRYYCVGLKRANEIIG+TTEELVLKARQG
Sbjct: 429 KEEFGESCNFDVDDAVEKLEKLKIISRDSIGRYYCVGLKRANEIIGVTTEELVLKARQG 487

BLAST of Cucsa.155960 vs. TrEMBL
Match: B9RGW6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1445010 PE=4 SV=1)

HSP 1 Score: 806.2 bits (2081), Expect = 2.2e-230
Identity = 405/479 (84.55%), Positives = 441/479 (92.07%), Query Frame = 1

Query: 3   KNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDL 62
           K KEVIRLERESVIP+LKPKL+MTLANLIEH+SDRAEFLKLCKRIEYTIRAWYLLQFEDL
Sbjct: 7   KKKEVIRLERESVIPILKPKLVMTLANLIEHTSDRAEFLKLCKRIEYTIRAWYLLQFEDL 66

Query: 63  MQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSG 122
           MQLYSLFDPV GAQKL+QQNLS +EI+VLEQNFL+YLFQVM+KSNFKIA++EEIE+A SG
Sbjct: 67  MQLYSLFDPVSGAQKLQQQNLSPEEIDVLEQNFLTYLFQVMDKSNFKIATEEEIEVAHSG 126

Query: 123 QYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKV 182
           QYLLNLPI+VDESKLDK +LKKYFA HP+ +LPDFVDKYVIFRRG GID+T+D+F MEKV
Sbjct: 127 QYLLNLPISVDESKLDKEVLKKYFAAHPREDLPDFVDKYVIFRRGIGIDRTTDYFIMEKV 186

Query: 183 DMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQ-DLDVERVRLENMEL 242
           DMLI RFWAY+LRLTR+EK+L RR   R  +D KKNDEI  +ADQ DL VER+RLENMEL
Sbjct: 187 DMLIARFWAYILRLTRVEKLLRRRSSMRCKKDPKKNDEINSEADQNDLCVERIRLENMEL 246

Query: 243 SASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPG 302
           S  NLL   TIQEPTFDRIIVVYRRAS+K K ERGIYVKHFKNIPMADMEIVLPEKKNPG
Sbjct: 247 SVRNLLSSTTIQEPTFDRIIVVYRRASSKLKKERGIYVKHFKNIPMADMEIVLPEKKNPG 306

Query: 303 LTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQ 362
           LTPMDWVKF+ SAIVGLVALV S+EMPKAD WVIFAVLS VIGY AKTYFTFQ N+A YQ
Sbjct: 307 LTPMDWVKFLGSAIVGLVALVSSLEMPKADLWVIFAVLSAVIGYFAKTYFTFQANLAAYQ 366

Query: 363 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELI 422
           NLITQSMYDKQLDSG+GTLLHLCDDVIQQEVKEVIISFFILMEQGKAT++DLDLRCEELI
Sbjct: 367 NLITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIISFFILMEQGKATMQDLDLRCEELI 426

Query: 423 KEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 481
           +EEFGE CNF+VDDAV KLEKLGI++RDTIGRYYCVGLKRANEIIG TTEELVLKA+QG
Sbjct: 427 QEEFGESCNFDVDDAVLKLEKLGIVARDTIGRYYCVGLKRANEIIGTTTEELVLKAKQG 485

BLAST of Cucsa.155960 vs. TrEMBL
Match: W9R8L8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007561 PE=4 SV=1)

HSP 1 Score: 805.1 bits (2078), Expect = 4.8e-230
Identity = 400/477 (83.86%), Positives = 441/477 (92.45%), Query Frame = 1

Query: 5   KEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQ 64
           KEVIR+ERESVIP+LKPKLIMTLANLIEHSSDRAEFLKLCKR+EYTIRAWYLLQFEDLMQ
Sbjct: 21  KEVIRIERESVIPILKPKLIMTLANLIEHSSDRAEFLKLCKRVEYTIRAWYLLQFEDLMQ 80

Query: 65  LYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSGQY 124
           LYSLFDPVHGAQKLEQQ+LS +EI+VLEQNFL+YLFQVMEKSNFKIA+D+EIE+ALSGQY
Sbjct: 81  LYSLFDPVHGAQKLEQQSLSPEEIDVLEQNFLTYLFQVMEKSNFKIATDDEIEVALSGQY 140

Query: 125 LLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKVDM 184
           LLNLPITVD+SKLDK LL+KYF  HP  NLPDF DKYVIFRRG G+D+T+D+FFMEK+DM
Sbjct: 141 LLNLPITVDDSKLDKKLLRKYFEDHPCPNLPDFSDKYVIFRRGIGLDKTTDYFFMEKMDM 200

Query: 185 LIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQ-DLDVERVRLENMELSA 244
           LIGRFW+YLLRLTRL+K+L RR   +  +D KK+DEI PDADQ DL VER RLEN+ELS 
Sbjct: 201 LIGRFWSYLLRLTRLDKLLSRRS-GKKKKDPKKDDEIIPDADQEDLYVERKRLENLELSL 260

Query: 245 SNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPGLT 304
            NLL K+TIQEPTFDRIIVVYRRAS K K ERGI+VKHFKNIPMADMEIVLPEKKNPGLT
Sbjct: 261 RNLLSKITIQEPTFDRIIVVYRRASAKEKTERGIFVKHFKNIPMADMEIVLPEKKNPGLT 320

Query: 305 PMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNL 364
           PMDWVKF++SA+VGLVA+V S+EMPKAD WVI A+LSTVIGYCAKTYFTFQQNMA YQNL
Sbjct: 321 PMDWVKFLISAVVGLVAVVTSVEMPKADLWVIIAILSTVIGYCAKTYFTFQQNMAAYQNL 380

Query: 365 ITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELIKE 424
           ITQSMYDKQLDSG+GTLLHLCDDVIQQEVKEVIISFFILMEQGKAT +DLD  CE+LIKE
Sbjct: 381 ITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIISFFILMEQGKATRQDLDRWCEDLIKE 440

Query: 425 EFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 481
           EF E CNF+VDDAVQKLEKLGI++RD++GRYYCVGLKRANEIIG TTEELVLK +QG
Sbjct: 441 EFNESCNFDVDDAVQKLEKLGIVARDSVGRYYCVGLKRANEIIGTTTEELVLKVKQG 496

BLAST of Cucsa.155960 vs. TrEMBL
Match: A0A068V8E4_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00019232001 PE=4 SV=1)

HSP 1 Score: 802.7 bits (2072), Expect = 2.4e-229
Identity = 393/479 (82.05%), Positives = 442/479 (92.28%), Query Frame = 1

Query: 3   KNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDL 62
           KNKEVIRLE ESVIPVLKPKLIMTLANLIEHSSDR+EFLKLCKR+EYTIRAWYLLQFEDL
Sbjct: 13  KNKEVIRLEPESVIPVLKPKLIMTLANLIEHSSDRSEFLKLCKRVEYTIRAWYLLQFEDL 72

Query: 63  MQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSG 122
           MQLYSLFDPVHGAQKL+QQNLS DEI++LEQNFL+YLFQVMEKSNFKIA+D+EI+IA SG
Sbjct: 73  MQLYSLFDPVHGAQKLQQQNLSLDEIDILEQNFLTYLFQVMEKSNFKIATDDEIDIAHSG 132

Query: 123 QYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKV 182
           QYLLNLPITVDESKLDK LLK+YF  HP  NLP+F DKYVIFRRG GIDQT+D+FF+EKV
Sbjct: 133 QYLLNLPITVDESKLDKKLLKRYFEEHPHENLPEFADKYVIFRRGIGIDQTTDYFFLEKV 192

Query: 183 DMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQD-LDVERVRLENMEL 242
           DM+I R W + LR TRLE+   RR +SR   D+KK+DE   D ++D + VER+RLENME+
Sbjct: 193 DMIIARLWTWFLRKTRLERSFSRRSVSRQKSDQKKSDEKTADNEEDCIFVERIRLENMEI 252

Query: 243 SASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPG 302
           S  +LL K+TIQEPTFDRIIV+YR+A T+ KPERGI+VKHFKNIPMADMEIVLPEKKNP 
Sbjct: 253 SFRSLLSKITIQEPTFDRIIVIYRQAGTQLKPERGIFVKHFKNIPMADMEIVLPEKKNPS 312

Query: 303 LTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQ 362
           LTPMDWVKF++SA+VGLVA+VGS+E+PKAD WVIFA++STV+GYCAKTYFTFQQNMATYQ
Sbjct: 313 LTPMDWVKFLISAVVGLVAVVGSLEVPKADLWVIFAIVSTVLGYCAKTYFTFQQNMATYQ 372

Query: 363 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELI 422
           +LITQSMYDKQLDSG+GTLLHLCDDVIQQEVKEVIISFFILMEQGKATL+DLDLRCEELI
Sbjct: 373 SLITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLQDLDLRCEELI 432

Query: 423 KEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 481
           K+EFGE CNF+V+DAVQKLEKLGI++RDTIGRYYC+GLKRANEIIG TTEELVLKARQG
Sbjct: 433 KDEFGETCNFDVNDAVQKLEKLGIVARDTIGRYYCIGLKRANEIIGTTTEELVLKARQG 491

BLAST of Cucsa.155960 vs. TAIR10
Match: AT3G19340.1 (AT3G19340.1 Protein of unknown function (DUF3754))

HSP 1 Score: 787.3 bits (2032), Expect = 5.2e-228
Identity = 385/481 (80.04%), Positives = 439/481 (91.27%), Query Frame = 1

Query: 4   NKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLM 63
           NKEVIRLE ESVIP+LKPKLIMTLANLIEHS+DR EFLKLCKRIEYT+RAWYLLQFEDLM
Sbjct: 6   NKEVIRLEPESVIPILKPKLIMTLANLIEHSNDRQEFLKLCKRIEYTVRAWYLLQFEDLM 65

Query: 64  QLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSGQ 123
           QLYSLFDPVHGAQK++QQNL+S EI+VLEQNFL+YLFQVMEKSNFKI S+EE+E+A SGQ
Sbjct: 66  QLYSLFDPVHGAQKIQQQNLTSQEIDVLEQNFLAYLFQVMEKSNFKITSNEEMEVAHSGQ 125

Query: 124 YLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKVD 183
           YLLNLPI VDESKLDK LLK+YF  HP  N+PDF DKYVIFRRG G+D+T+D+FFMEK+D
Sbjct: 126 YLLNLPIKVDESKLDKKLLKRYFEEHPHENIPDFSDKYVIFRRGIGLDKTTDYFFMEKLD 185

Query: 184 MLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQD-LDVERVRLENMELS 243
           ++I RFW++L+R+TRLEK+  +R  S + +D KK+DE  PD D D L VER+RLEN +LS
Sbjct: 186 VIISRFWSFLMRITRLEKLRAKRSSSLNKKDPKKDDEPNPDTDNDELYVERIRLENSKLS 245

Query: 244 ASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPGL 303
             + L K+TIQEPTFDR+IVVYRRAS+K+  ERGIYVKHFKNIPMADMEIVLPEK+NPGL
Sbjct: 246 FKSFLSKLTIQEPTFDRMIVVYRRASSKTNLERGIYVKHFKNIPMADMEIVLPEKRNPGL 305

Query: 304 TPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQN 363
           TPMDWVKF++SA+VGLVA++ S+EMPK+D WVI A+LSTV+GYCAKTYFTFQQNMATYQN
Sbjct: 306 TPMDWVKFLISAVVGLVAVLTSVEMPKSDPWVIIAILSTVLGYCAKTYFTFQQNMATYQN 365

Query: 364 LITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELIK 423
           LITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEV+I F+ILMEQGKATLEDLDLRCEELIK
Sbjct: 366 LITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVMICFYILMEQGKATLEDLDLRCEELIK 425

Query: 424 EEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQGVN 483
           EEFG  CNF+V+DAVQKLEKLGI++RDTIGRYYC+GLKRANEIIG TTEELVLKA+QGV 
Sbjct: 426 EEFGARCNFDVEDAVQKLEKLGIVARDTIGRYYCMGLKRANEIIGTTTEELVLKAKQGVT 485

BLAST of Cucsa.155960 vs. TAIR10
Match: AT5G13940.1 (AT5G13940.1 aminopeptidases)

HSP 1 Score: 561.2 bits (1445), Expect = 6.1e-160
Identity = 278/452 (61.50%), Positives = 358/452 (79.20%), Query Frame = 1

Query: 31  IEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQNLSSDEIEV 90
           I+   +R EFL+ C+R+E TIRAWY L FEDLMQLYSLF+PV GA +L QQNLS+ EI+ 
Sbjct: 321 IKDKWEREEFLRFCQRVECTIRAWYHLHFEDLMQLYSLFEPVRGAHRLNQQNLSTREIDA 380

Query: 91  LEQNFLSYLFQVMEKSNFKIASDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHP 150
           LE  FL +LFQVMEKSNFK+ ++EEI++ALS QY LNLPI V+E+KLD  LL +YF+  P
Sbjct: 381 LEDQFLLHLFQVMEKSNFKVITNEEIQVALSAQYRLNLPIVVNEAKLDTKLLTRYFSKFP 440

Query: 151 QANLPDFVDKYVIFRRGTGIDQTSDFFFMEKVDMLIGRFWAYLLRLTRLEKILCRRPISR 210
           + +LP F DKY+IFRRG GID    +FF+ K+D ++ R W +LL +T L++++  +   +
Sbjct: 441 RDDLPHFADKYIIFRRGFGIDHMKAYFFLAKIDTILVRIWHFLLTITCLKRLVYGK---K 500

Query: 211 STEDRKKNDEIPPDADQD-LDVERVRLENMELSASNLLGKVTIQEPTFDRIIVVYRRAST 270
           +     +  +I  + ++D L +ER+R+E ++LS SNL+ K+TIQEPTF+RIIVVYRR S 
Sbjct: 501 NDVGLSEQIDISIETEKDSLYIERIRIEKLKLSLSNLMKKITIQEPTFERIIVVYRRVSG 560

Query: 271 KSKPERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFIVSAIVGLVALVGSIEMPK 330
           K + ER IYVKHFK IPMADMEIVLPEKKNPGLTP+DWVKF+VSA +GLV +V S+ + K
Sbjct: 561 KKESERNIYVKHFKTIPMADMEIVLPEKKNPGLTPLDWVKFLVSAAIGLVTVVSSVSLKK 620

Query: 331 ADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQ 390
           AD  VI A+LSTV+ YC KTYFTFQ+N+  YQ+LIT+S+YDKQLDSGRGTLLHLCD+VIQ
Sbjct: 621 ADIRVIAAILSTVVAYCVKTYFTFQRNLVDYQSLITRSVYDKQLDSGRGTLLHLCDEVIQ 680

Query: 391 QEVKEVIISFFILMEQGKAT-LEDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISR 450
           QEVKEVIISFF+L+++G  T  E+LD++ E  IKEEF E CNF+VDDA+ KLEKLG++SR
Sbjct: 681 QEVKEVIISFFMLIKKGCPTSKEELDMKSEAFIKEEFNESCNFDVDDAITKLEKLGLVSR 740

Query: 451 DTIGRYYCVGLKRANEIIGLTTEELVLKARQG 481
           D+  +Y CV +K ANEI+G TTEE+VLKARQG
Sbjct: 741 DSEDKYRCVEMKEANEIMGTTTEEMVLKARQG 769

BLAST of Cucsa.155960 vs. NCBI nr
Match: gi|449455934|ref|XP_004145705.1| (PREDICTED: uncharacterized protein LOC101204725 [Cucumis sativus])

HSP 1 Score: 954.5 bits (2466), Expect = 7.0e-275
Identity = 483/483 (100.00%), Positives = 483/483 (100.00%), Query Frame = 1

Query: 1   MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE 60
           MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE
Sbjct: 1   MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE 60

Query: 61  DLMQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL 120
           DLMQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL
Sbjct: 61  DLMQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL 120

Query: 121 SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFME 180
           SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFME
Sbjct: 121 SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFME 180

Query: 181 KVDMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQDLDVERVRLENME 240
           KVDMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQDLDVERVRLENME
Sbjct: 181 KVDMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQDLDVERVRLENME 240

Query: 241 LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP 300
           LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP
Sbjct: 241 LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP 300

Query: 301 GLTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY 360
           GLTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY
Sbjct: 301 GLTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY 360

Query: 361 QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL 420
           QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL
Sbjct: 361 QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL 420

Query: 421 IKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 480
           IKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG
Sbjct: 421 IKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 480

Query: 481 VNP 484
           VNP
Sbjct: 481 VNP 483

BLAST of Cucsa.155960 vs. NCBI nr
Match: gi|659098120|ref|XP_008449987.1| (PREDICTED: uncharacterized protein LOC103491708 isoform X1 [Cucumis melo])

HSP 1 Score: 934.5 bits (2414), Expect = 7.5e-269
Identity = 471/483 (97.52%), Positives = 479/483 (99.17%), Query Frame = 1

Query: 1   MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE 60
           MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE
Sbjct: 1   MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE 60

Query: 61  DLMQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL 120
           DLMQLYSLFDPVHGAQKLEQQNLSSDEI+VLEQNFLSYLFQVMEKSNFKIASDEEIEIAL
Sbjct: 61  DLMQLYSLFDPVHGAQKLEQQNLSSDEIDVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL 120

Query: 121 SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFME 180
           SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGID+TSDFFF+E
Sbjct: 121 SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDRTSDFFFIE 180

Query: 181 KVDMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQDLDVERVRLENME 240
           KVDMLIGRFW+YLLRLTRLEKILCRRP SRS EDRKKNDEIP DA+QDLDVERVRLENME
Sbjct: 181 KVDMLIGRFWSYLLRLTRLEKILCRRPSSRSMEDRKKNDEIPTDAEQDLDVERVRLENME 240

Query: 241 LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP 300
           LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP
Sbjct: 241 LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP 300

Query: 301 GLTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY 360
           GLTPMDWVKFIVSAIVGLVA+VGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY
Sbjct: 301 GLTPMDWVKFIVSAIVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY 360

Query: 361 QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL 420
           QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL
Sbjct: 361 QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL 420

Query: 421 IKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 480
           IKEEFGEHCNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIG TTEELVLKARQG
Sbjct: 421 IKEEFGEHCNFEVDDAVQKLEKLGIVSRDTIGRYYCVGLKRANEIIGPTTEELVLKARQG 480

Query: 481 VNP 484
           +NP
Sbjct: 481 LNP 483

BLAST of Cucsa.155960 vs. NCBI nr
Match: gi|659098124|ref|XP_008449989.1| (PREDICTED: uncharacterized protein LOC103491708 isoform X2 [Cucumis melo])

HSP 1 Score: 878.2 bits (2268), Expect = 6.4e-252
Identity = 448/483 (92.75%), Positives = 456/483 (94.41%), Query Frame = 1

Query: 1   MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE 60
           MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE
Sbjct: 1   MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFE 60

Query: 61  DLMQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL 120
           DLMQLYSLFDPVHGAQKLEQQNLSSDEI+VLEQNFLSYLFQVMEKSNFKIASDEEIEIAL
Sbjct: 61  DLMQLYSLFDPVHGAQKLEQQNLSSDEIDVLEQNFLSYLFQVMEKSNFKIASDEEIEIAL 120

Query: 121 SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFME 180
           SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGID+TSDFFF+E
Sbjct: 121 SGQYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDRTSDFFFIE 180

Query: 181 KVDMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQDLDVERVRLENME 240
           KVDMLIGRFW+YLLRLTRLEKILCRRP SRS EDRKKNDEIP DA+QDLDVERVRLENME
Sbjct: 181 KVDMLIGRFWSYLLRLTRLEKILCRRPSSRSMEDRKKNDEIPTDAEQDLDVERVRLENME 240

Query: 241 LSASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP 300
           L                       RRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP
Sbjct: 241 L-----------------------RRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNP 300

Query: 301 GLTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY 360
           GLTPMDWVKFIVSAIVGLVA+VGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY
Sbjct: 301 GLTPMDWVKFIVSAIVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATY 360

Query: 361 QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL 420
           QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL
Sbjct: 361 QNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEEL 420

Query: 421 IKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 480
           IKEEFGEHCNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIG TTEELVLKARQG
Sbjct: 421 IKEEFGEHCNFEVDDAVQKLEKLGIVSRDTIGRYYCVGLKRANEIIGPTTEELVLKARQG 460

Query: 481 VNP 484
           +NP
Sbjct: 481 LNP 460

BLAST of Cucsa.155960 vs. NCBI nr
Match: gi|802564642|ref|XP_012067365.1| (PREDICTED: uncharacterized protein LOC105630221 [Jatropha curcas])

HSP 1 Score: 815.8 bits (2106), Expect = 3.9e-233
Identity = 404/481 (83.99%), Positives = 444/481 (92.31%), Query Frame = 1

Query: 3   KNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDL 62
           K K+VIRLE+ESVIP+LKPKLIMTLANLIEH+SDRAEFLKLCKR+EYTIRAWYLLQFEDL
Sbjct: 5   KKKDVIRLEKESVIPILKPKLIMTLANLIEHTSDRAEFLKLCKRVEYTIRAWYLLQFEDL 64

Query: 63  MQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSG 122
           MQLYSLFDPV GAQKL+QQNLS +EI+VLEQNFL+YLFQVM+KSNFKI +DEEI++AL G
Sbjct: 65  MQLYSLFDPVSGAQKLQQQNLSPEEIDVLEQNFLTYLFQVMDKSNFKITTDEEIDVALCG 124

Query: 123 QYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKV 182
           QYLLNLPI+VDESKLDK LL+KYFA HP  NLPDF DKYVIFRRG GID+T+DFF MEKV
Sbjct: 125 QYLLNLPISVDESKLDKNLLRKYFAEHPHENLPDFADKYVIFRRGIGIDRTTDFFIMEKV 184

Query: 183 DMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDADQD-LDVERVRLENMEL 242
           DMLI RFWAY+LRLTR+EKIL RR   R   D KKNDEI  +AD+D   VER+RLENMEL
Sbjct: 185 DMLIARFWAYILRLTRVEKILRRRQSRRHNNDPKKNDEINSEADRDDFTVERIRLENMEL 244

Query: 243 SASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPG 302
           S  NLL + TIQEPTFDRIIVVYRRA TKSKP+RGIYVKHFKNIPMADMEIVLPEK+NPG
Sbjct: 245 SVKNLLTRTTIQEPTFDRIIVVYRRAGTKSKPDRGIYVKHFKNIPMADMEIVLPEKQNPG 304

Query: 303 LTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQ 362
           LTPMDWVKF+ SA+VGLVA+VGS+EMPKAD WVIFAVLSTVIGY AKTYFTFQQN+ATYQ
Sbjct: 305 LTPMDWVKFLASAVVGLVAVVGSVEMPKADLWVIFAVLSTVIGYIAKTYFTFQQNLATYQ 364

Query: 363 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELI 422
           NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATL+DLDLRCEELI
Sbjct: 365 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLQDLDLRCEELI 424

Query: 423 KEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQGV 482
           +EEFGE CNF+VDDAV KLEKLGI+SRD++GRY+CVGLKRANEIIG TTEEL+LKA+QGV
Sbjct: 425 QEEFGESCNFDVDDAVHKLEKLGIVSRDSLGRYFCVGLKRANEIIGTTTEELILKAKQGV 484

BLAST of Cucsa.155960 vs. NCBI nr
Match: gi|590647944|ref|XP_007032040.1| (Disease resistance protein [Theobroma cacao])

HSP 1 Score: 809.3 bits (2089), Expect = 3.7e-231
Identity = 400/479 (83.51%), Positives = 443/479 (92.48%), Query Frame = 1

Query: 3   KNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDL 62
           K KEVIRLERESVIP+LKPKLIMTLANLIE  SDRAEFLK CKR+EYTIRAWYLLQFEDL
Sbjct: 9   KKKEVIRLERESVIPILKPKLIMTLANLIELRSDRAEFLKFCKRVEYTIRAWYLLQFEDL 68

Query: 63  MQLYSLFDPVHGAQKLEQQNLSSDEIEVLEQNFLSYLFQVMEKSNFKIASDEEIEIALSG 122
           MQLYSLFDPVHGAQKL+QQNLSS+EI+VLEQNFL+YLFQVMEKSNFKIA+D+EI++ALSG
Sbjct: 69  MQLYSLFDPVHGAQKLQQQNLSSEEIDVLEQNFLTYLFQVMEKSNFKIATDDEIDVALSG 128

Query: 123 QYLLNLPITVDESKLDKVLLKKYFATHPQANLPDFVDKYVIFRRGTGIDQTSDFFFMEKV 182
           QYLLNLPITVDESK+D+ LLK+YF+ HPQ NLPDF  KY+IFRRG GID+T+D+FF+EKV
Sbjct: 129 QYLLNLPITVDESKIDQSLLKRYFSEHPQENLPDFAVKYIIFRRGIGIDRTTDYFFLEKV 188

Query: 183 DMLIGRFWAYLLRLTRLEKILCRRPISRSTEDRKKNDEIPPDAD-QDLDVERVRLENMEL 242
           DM+I R WAYLLRLTRL+K+L RR   +   + KK+DEI P+AD +DL VER+RLENM+L
Sbjct: 189 DMIIARLWAYLLRLTRLDKLLARRSRRQHKTEPKKDDEINPEADSEDLFVERIRLENMDL 248

Query: 243 SASNLLGKVTIQEPTFDRIIVVYRRASTKSKPERGIYVKHFKNIPMADMEIVLPEKKNPG 302
           S  NLL K TIQEPTFDRIIVVYRRAST+S  ERG+YVKHFKNIPMAD+EIVLPEKKNPG
Sbjct: 249 SIPNLLSKTTIQEPTFDRIIVVYRRASTESNKERGVYVKHFKNIPMADLEIVLPEKKNPG 308

Query: 303 LTPMDWVKFIVSAIVGLVALVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQ 362
           LTPMDWVKF+ SA+VGLVA+ GS+EMPKAD WVIFA+LSTVIGYCAKTYFTFQ NMA YQ
Sbjct: 309 LTPMDWVKFLASAVVGLVAVFGSLEMPKADLWVIFAILSTVIGYCAKTYFTFQANMAAYQ 368

Query: 363 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATLEDLDLRCEELI 422
           NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKAT+EDLD+RCEELI
Sbjct: 369 NLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVIISFFILMEQGKATMEDLDIRCEELI 428

Query: 423 KEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGLTTEELVLKARQG 481
           KEEFGE CNF+VDDAV+KLEKL IISRD+IGRYYCVGLKRANEIIG+TTEELVLKARQG
Sbjct: 429 KEEFGESCNFDVDDAVEKLEKLKIISRDSIGRYYCVGLKRANEIIGVTTEELVLKARQG 487
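
The percentages on each HSP line follow directly from the residue counts: identity is identical residues divided by aligned length, and positives is conservative-or-identical residues divided by the same length. A minimal Python sketch of that arithmetic is given here. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions and are not part of the database or of BLAST itself; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical pattern for the HSP statistics line shown on this page.
# "Posi?tives" tolerates the misspelled "Postives" variant as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Recompute identity and positives percentages from an HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return {
        "aligned_length": ident_len,
        "identity_pct": round(100.0 * identical / ident_len, 2),
        "positives_pct": round(100.0 * positive / pos_len, 2),
    }


# Statistics line of the Theobroma cacao hit (NCBI nr) from this page.
stats = parse_hsp_stats(
    "Identity = 400/479 (83.51%), Positives = 443/479 (92.48%), Query Frame = 1"
)
print(stats)
```

Recomputing the values this way is one possible check that the identity column of the summary tables agrees with the per-hit alignment headers.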

The following BLAST results are available for this feature:
Cucsa.155960 vs. TrEMBL
Match Name        E-value   Identity  Description
A0A067LBJ5_JATCU  2.7e-233  83.99     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26883 PE=4 SV=1
A0A061EDG0_THECC  2.5e-231  83.51     Disease resistance protein OS=Theobroma cacao GN=TCM_017362 PE=4 SV=1
B9RGW6_RICCO      2.2e-230  84.55     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1445010 PE=4 SV=1
W9R8L8_9ROSA      4.8e-230  83.86     Uncharacterized protein OS=Morus notabilis GN=L484_007561 PE=4 SV=1
A0A068V8E4_COFCA  2.4e-229  82.05     Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00019232001 PE=4 SV=1

Cucsa.155960 vs. TAIR10
Match Name   E-value   Identity  Description
AT3G19340.1  5.2e-228  80.04     Protein of unknown function (DUF3754)
AT5G13940.1  6.1e-160  61.50     aminopeptidases

Cucsa.155960 vs. NCBI nr
Match Name                        E-value   Identity  Description
gi|449455934|ref|XP_004145705.1|  7.0e-275  100.00    PREDICTED: uncharacterized protein LOC101204725 [Cucumis sativus]
gi|659098120|ref|XP_008449987.1|  7.5e-269  97.52     PREDICTED: uncharacterized protein LOC103491708 isoform X1 [Cucumis melo]
gi|659098124|ref|XP_008449989.1|  6.4e-252  92.75     PREDICTED: uncharacterized protein LOC103491708 isoform X2 [Cucumis melo]
gi|802564642|ref|XP_012067365.1|  3.9e-233  83.99     PREDICTED: uncharacterized protein LOC105630221 [Jatropha curcas]
gi|590647944|ref|XP_007032040.1|  3.7e-231  83.51     Disease resistance protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR022227  DUF3754
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0005886 plasma membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.155960.1  Cucsa.155960.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                      Source   Source Term    Source Description   Alignment
IPR022227  Protein of unknown function DUF3754  PFAM     PF12576        DUF3754              coord: 257..378; score: 2.0
None       No IPR available                     unknown  Coil           Coil                 coord: 403..423; scor
None       No IPR available                     PANTHER  PTHR33645      FAMILY NOT NAMED     coord: 5..479; score: 3.9E
None       No IPR available                     PANTHER  PTHR33645:SF3  SUBFAMILY NOT NAMED  coord: 5..479; score: 3.9E
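
The `coord:` values in the alignment column are residue ranges on the protein, so the length of each annotated region is the end position minus the start position plus one. The short Python sketch below works that out for the three distinct ranges listed above. The function name `span_length` is a hypothetical helper chosen for illustration, and treating the ranges as 1-based and inclusive is an assumption based on the usual convention for such coordinates.

```python
import re


def span_length(coord):
    """Return the inclusive length of a 'coord: start..end' range."""
    match = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", coord.strip())
    if match is None:
        raise ValueError("unrecognised coordinate string: %r" % coord)
    start, end = int(match.group(1)), int(match.group(2))
    # Assumed 1-based inclusive coordinates, hence the +1.
    return end - start + 1


# The three distinct ranges reported in the InterPro annotation table.
lengths = {
    "PF12576": span_length("coord: 257..378"),
    "Coil": span_length("coord: 403..423"),
    "PTHR33645": span_length("coord: 5..479"),
}
print(lengths)
```

Under that assumption the DUF3754 Pfam match covers 122 residues, the predicted coil 21 residues, and the PANTHER family match 475 residues.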