CmUC05G085890 (gene) Watermelon (USVL531) v1

Overview
NameCmUC05G085890
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionMitochondrial intermediate peptidase
LocationCmU531Chr05: 4734091 .. 4756401 (+)
RNA-Seq ExpressionCmUC05G085890
SyntenyCmUC05G085890
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTTCTCATATCTAGTGAGAAGCATTTGAAATTTCATTTTCCTTTCTTTTTTCCTCTACTTTTTTGGTGGAAACTAAATTGCTTTGTTTCCTTGACTATGTCGTAGGATCAAAGATGTTAGTCGATGCTTTATGTAAATGTAACTTCATAGAATCAGATTTCTGTCTATTTGGTGCTTTCTTTTTCTGGCAAGAGAACTGGTGAAGACCAATGTTGAAGAACTCTACTTAGACAACATGCTGATTCACATATATTATATATCAGGAAAAGAAAATCTCATTTCAATAAGATTGTATACTTAAGAGAATTGGTCACCAGTAGTCCTTACAAGAACACAAACCGTCTCTGTAATGGTGAAATCTACTTCTGTCCCTAATGATGATCTCTCTGGCATATACTTTTGAGGCTAATTTCATAAAAGCATGACAATCCTCACAAATTCGAAGGTTCTTTATAATGCAAATGCGTGGCCCTTCATTCATGAGGGCATAGCAAAGTGCCAATTTCTCGCTGTGCCAGAGGACTAATTCCTTCTTTTCCTCTTCGTCTAAATCAACGAGCACATAATTTGTCTGTGGCGTATAACCAGCCAGATTCAACTTTTGAACTACCTCATCTAATTTCTGATGTATTTGATCTGCTTGCTTGTGATTTCTATCTGCCATTTGAAATTCATGGACCTCATTGTTCAATTCAATTCTACTGCATCCTCTCTCTTTGGAAACGCCCATCTTGGTCATTAGTTTTCTAACTTCCCCAACGTCTTCCCATCTTCTTTCTTTAGCGTATATGTTTGATAAGACGACAAGGGCCCCATCATGATCAGGCTCGAGCTTGAGAACTTGTTTAGCAGCAAATTCTCCTAACTCAGTCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTGCACGGCCAAAGAGGTCAACCATGCAACCAAAGTGTTCATGCTTGGGACTTATGCCATACTCATCGGTCATTGAATGAAATATTCTTCGGCCCTCCTCAACTAGACCTCCGTGGCTACAAGCATAAAGCACCCCTACAAATGTGATCCAATTAGGCTCAACATTTTCAACTTTCATTTGATGAAATAAGCTTAAAGCATTAGGAGCATCTCCATGCATTGCAAGAGCATGGATCATACTTGTCCAAGATATTACATTTTTCTTTGGCATCTTTCCAAAGACTTTTCTTGCTCCTTCTAGACTCCCACATTTGGCATACATATCAATGAGTGCATTATTGATAGATAATGCCTTGCCAAACCCATTTTTATCAACATAAGTTTGTATCCATTTGCCTTGATCTAATGCGCCAAGATGAGCACAAGCTGAAATAACACTCAATATGGTGACTACATCAGGTTTCATTCCCTGCTGTTGCATTTTCTTGAATAATACAAGAGCCTCTTGAGGGCAGTCACTCTCTGTATAGCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACGTAGCGAGCTTCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACAGCTCGCATACATTGTGATGAGAGCACTTTGTAAATGAGGATCCATGACAATATTCTTCTTAGTAATGAACTCGTGTATTTTTGTTCCAAAATCCAAATTTCCAGCACGAGCGCATGCAGAAAGAACTGTAGAAAGAATCATCTCATCTGGTTCCAACTCTGTTCTCTTCATTTCTTCAAAGAGTTGAAAGGCAAGATCATAAAAGCCACTTAAGCAATACCTGTATATTTGAGGATAAGGGGCAAATATAAGTTCCGCAGTAATTTGTCTCCTAAACAGCCATGCATTTAAACTGGAAAATGATAAGATAGAAAGATACTACCATAACAATACAACGTCCAAAGTAAATTTATAACTCTTTCTCATTTGGAAGACTACTAGAATTGGCCACAATATCCATGGTTATAACATAGTAAATTGAAAGTCTGAAACAAAATTTCCATGAAAATCTAAAATCAACTAAATTTTACCACATTAACCTTCACTAGTAAATGCAGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACATCCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAACTTCGACGCGAGCCCATGAATCTCCATCCCCGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCTCATCTTCTCGTATACAAAAAGCGTAAACTCCGGCTCAGAACCTCGTGATAATTGGCGCAGAAGCTTGTTGCAGAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGCTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTTCAAAAAGAAGGGAATTGGAATCACAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGAGCGGTGGGTCTGGTGGGGTATGTGTGAAGTTGAAGAGGGAGGACGGAGGTTGAGTGGCTGAGCATTTCCATAGTGTTTGGGTTAAATTATATGTAGAAGATGGAGCAATGTTGAGTTTGGAGTAGTCTGACCGACATTGAAGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGGTATAAGTTTTACCTATTTCTTTTGTGGCATACCCTTTTTCAAACATATAAGATTTTGGAAATAGTTTTTAACTTTCTGCATGGTCTTAGAAATACTCCACTTTTAAATATTTCAAAAATAACCTTAAACTAAAAATATAAAAGTTTTAAAATACTCTTATTATTATTTTTAGATGGAAATGGTTAGTATTTTATTTCAAAAGTATATTTGATTTTTTTAAAGTTTCAAAAATATCCTTGAACTTTAGAAAAAAAAAACTTCAAAAGTATCTTATACTTAGTATGTGAACACAAACCGTTAATACTTGTTTAAGAAATATTCTTGAACTTTCAAAAGTTGTATTAATACTTTTAACCTTATAAAAAGAAATTTTTTAATTTCTCAAATCGTTAGTATAGTGATCTTTGTTTGGAGTTTTCTTGAATTTGAAAATAACCAATTTTAAAACAAATTATATTAGATATCATTGTTTTTATATAGATACTCTTATTTTTCTCATATTTGTCCAATTCTAACACCAACTTTCTTATAACAAAATATAAACAAAAAGTTTCGAGTTAAAAGCATAAAATCTCACAAATATCTAAAATCTAAAATCTTTTCTTAAACTTATTAAAACCATATATAGTCAAATACAAAAGAACATATTTGGTTATTAACACATATGCAACTACAAACATATTAAAACAATTAAGCCTTAAGGTCTCTTTAGCGTTTTCATTATTATCTTTCTTTTTGTTGTAATCTATATATGACACTTTTTTTTTCTTTAATGAAATAAACACATATTGAATAGTTAACATAAGGGGGGAAATAGATAAGTCTTAGGTAATATGTTAAAGTAGGGGGAAAGAGAGGAAATGAAAGGGTGAGGTGATAGGGACATAGACAAATGGGCGGTGAGGGAGATTGAGTAGAGTTTTTACCCATTATTTTCTCTCTGTCTTCAAATTGAACTCTCATGTGCTTATTATAGGTGTGCAAAAAGTGATCTCATGACTATTTGACAACTTTTCAATTAATTTCATGTCCCACTGCACAGACTTTATCACGTCATAGATTCATTGACTTTGTCGTGGCCCGATCAAATTAGTCATCCACTTGCATATTGACACAACAGCAACCGCCCATGCGTCCTATGACTCTTTTTCTTCTCGACGTCGGGTTCTTTAACGCATTGATCAGTGGAACTATATAGTCTTGGTGTTTCTGGCCGCATCTTTTCCTTGTCCTTTTTGGTTTCTTGTCAAATATTGCGTTCAAAAGATATAAAGCTCATAATTTCTAGGATAGAAGCTTTTGTAAACATCATGAACGTAAAATTATTCATATTACATTATTTCCTTCTCAACCTCATGCGTTCATCTTCTAAATTACTTTAATTATTCCAAATAAAGAGTACAATAACTTACTTGTATTTCTACAGGTTATCAAAACTATTAGTTTTGTGTTTCAAAAATATCCTTAAACTTTTAAAAAGAAATACTTTAATAAATGAACATAAACTCTTAAGGCCTCATTGCAAAAATATCTATAAACTTTCAAAAGTTGCATGAGTACCCTTACCTTAAAAAAAAAAAAAAAGTTAAAAATGCTATTACAGAGCGAGGATCCATTCATTCCATCTTTGTAAACTATATCAAACAACCTTCATTTCAAGCCTCCAAGAAGCAAGATGTTGTCATTGGTGGCTTGACGTTATTCATTCTCATTTCATTTCCATTTCCAACTTTCTTATTTGTACTGAGATTGTGACATCGAAAGACACATATTTATTTAATATAAATTTGACTGATTCTATATTTCCATCCATTATCATCTTTATCGTTTCACATTATGGGTAACTAAATTTTCTAATGGGTTGAGTGGATGAAACGAACATTTTCAAAGTTTTATCTAAGTTGCATTTATCTTGTTTAAATGTGTTCTCAAAACCTTAGCATGAATCTGTCAAACAAATAGGTTTAAGTAAGTATAATTAACTGCTTAAGAGAGTATGTAATTAACACTTGTTGTTTAAACAAGGAAAGAATTAACTTTAGAAATAAGGTGAATCTTTTGTTAAGAAGAAACTCATCATGCATTTTCGAAATAAGAAAAATGTGGTGTCTAAGAATCGAGAGATGTGTTTTCAATTTTGAGTTTGCAACATGAACGCCTACTTAAGTATTATTTAGAAATATGTTATACTTAAGTATTACTTAGAAATATGTTTGTGAAGTATTCTCAACCATACTTTCTCCATTGATTTCTCCATTTTAACGTTGGTTGTTTGATTGTTTGTACTTGACACCTTCATTCATCATCCTCTGCGCATAATTATCCCACATGTATGAACATCATTCTTATACTTGTAAATTCATTTTTAGTTTAATGTAAAAACACCTCAACACATTCTTTTCGCATATATTATTTTGTATAGCAAATTGCATTAAATGAAAGAATTTTTCTTAACATCAAATTGTTGCAAATTTATGAATTCGACCATGGATTCTTTAGAAAACCTATTTTAAACTTACACTTGGATTTTAATAGGAAAATTTGCATTTTACTAAACTAATTTTACATAACATTAACACGACATAAGACCAAATCAACGCATTATTTTCAATTGATCAGCAACGCAACAAGGTTCCATCCAAAACTAACGATAATCGCATCTTTTAACTCTTTTTAAAAATTGAAGTATAATTTTGAAACTTTGGAAAGTTTAGAGATATTTTGAAACTAATTACAAAGTTTAAAGACACTTTTTATAATTTAGCCTAAAATTCTGATCTTATACCTAATTGATGAAACTGTTGACCTCAATTTTATAGTTAATTGTACAAGTTTTACGTGGGTTGAACTATTTTTCATTATGAAAAATATATATCAATGTAGGTTTTAACTTTTATGAATGACAAAAATTAAAACAGATTTGAAAATATACTTTCAGTGTCGTTTCAAAGTAGATTTGCCTAAATTTTAATATGGGCCGAAAGCTTTCCATGAATACGATAAACGAAGGCCCAAGATTGTAGGCCCAGTAGTCTGTACCGTCCTCAGAGGGATTTGGTTCTTGGTATCTCCAGTTTGATCCGTTTAGTCGCGGAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGGTATTTCACTTTCCACTTCCTCTCCCATTCTGAAACTTGAACATTATCAATTTCATTTGAGTTCCGCTACATAGCATATCTAAGTTTTTCTCGAATTGGTTGTTGTCAAGCATCCACATAGAACCCACCGAATCTCTTTTTCCATTTGTTTCTTGTTTTGATTCATCATCTAGCTCTTATTCGGCATTAAATTGGCTGCTAGTTGAACTGCTGAACTAATTTTATTTTGAGACTAGAATTGCTATTCTTTTGCTCGTCGGTTGATTTCAAGATGTGGTTTCGCTTTCTGCTGTTGAAGTCGATTTCTACGCTTTTACTTGTTTTTTTTTTTTCAACTTCGTTTAACTTAATGCGTCTACTCATTTCTGCAACATAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAACGTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGACTCCTTGGAGGTGGCGTCACATTGGCAGGTATGCTGTAAAATGCGGCAGAATTTATCTTCCTTCAATCTGAAAATACTATACAATGAGGATACCTAGAGGCCAGAGGGAGCTTTTTAATTTTCAATATTTATGTGTTTATGCCCATAGAGGCTAGAACTTTTCGGCCAGATCGTTTCTATTTTTCAGTTTTATACGACCCTTTTTATTTAGTAGTCCATTTCCGCTTGCTCCATTCAACATCTGTAGTTCCCTTCCGGATGAATTTATGTTGCTAGTGCTCAAGTTCTAATTATCTTCACTTTACAAATCAGTATATAGAACCATAACTTCGTATAAAGTACTATTTGTTTGATTTTAGGTGATATTCTTTTTAAGACAGGCAGCAAAAAGATACTTGTGGATTTCTTGTAGATAATACTCAATTTTAAGATCCCAAAGGGTGCTTGCTGCCTGGTGGTAAAGGCTTGGGATATTTGGAGATACCATCGAAGGTCTTGGGTTTAAGCCCTTGGTTTAAAAAATCTTTGTTGAGCAGGCATTGGAGTGGGCAATTTGTTCTTAGGGATTAGCTAGGCAAGCCATGAGCATAATCCAAAATGTAGAGTCCTTGACACTCGAGATATATTAACAAAAAAATGATGATAGAATAACTTATTACCTCATTATCAAATAAAGATAAACTAGCTTAAAGCTATCACAGCTTGTACCAAAAGAAACTGTGTTGCTTTAGTAGTGCACACACTCAGATATATATCAGTTAAATGTTTGTTGAATCTTAAAGCTGTAATTCCTCCTGTTCCCTCCGTCAATTATTCCTTTAATCACTCTTTAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGGTCTCCACTCGGCTACATATTTTTGCAGGTAAGATGAACTCTCTCTCTCGCACACACTTTTGAAAAATTAATTGCTCCATTTTAGGTCGTGCAGAAATTACGTTTTAATTCATTTCTTAAGGATTGTATTTTGGCAGGAGCTGGTACTCTATGTGGATTATGGAGATTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATGTAGGCCACTCGCCTTTATTAATTTTGGAATTTTCTAATTAGTGCCATGCTAGTAAACAAATATTGATGTTCATGACGAACCAGGCTAAAACAACGTTACGTGTCAGCTAAAATATCTGAACGATGTTGCCGCAGTGAGCTATGACTATGAATTAAAATCTAGCTATATTAATATGCCCTTTCGAGGGGTGAAGAGGTCATGTGATGAGGTTCGGGAGGTGATGAGGTTTAATGCCTCCATGTAGGCATTATAGTCATTACGCCTTTTTGTAATTATGACATTGGATTTCAGTTCCTTCTTGTAGGTTGTGTTTTTGTGACTCCCTTTTGTTAGGCTTCTTTTTTGTATGCGTTTGTATATTCTTTCATTTGCATCATGAATTTTTTTTTCGGATGGAACAAACTTTCATTCAACATGAATGAAAGAACACAAGGGCATATGAAAAAGACAAGCCCTCAAAAGGAAGCCCCTTAAAGGAAATGGCTCCAACTGTGCAAAATAGTACCAATAGAATAATTACTGAAGCTCTTCAAAATCAAAACCTAAAGAGCAACGTGAAATCTAATGAGAGACCAAACATCTGAAGGTTCCCTCTCCAAACCTTAAACACTTTGTTATTCCTCTCTCTCCCTAAATATCCTACTAGATAGCACACACCCTTGCAATCCATAAGACACCAACTATTTTCATGAAAAGTAGATGAACTTCTCGATCACATTCCTAGTAACCCCATGATGGGCTAGCATGAAACGGACTGTCTAAAAGCTCCTTTTTGTTATGAAAAAAAAGTTGTATATGCATGTTCTTTATTTGTTATATGATTGCTAGAATTATTGATGTTTTCTATCTATTTACTTGTTTTGTATAATGAGCACATTAACAGAAGCCAGCAGTGCTATAATCTGTTTAAATTTGACAAAGTAGGTGTAGGTGTGTCTGTGGTCACTGTTCCCGCATAGTGGCCAAGCGATAGGTGTCGTTATGCCACTCAGTTGCTGCATGAGCGCTGAATATACCTTCATGCAGGCTCAATTGGGGAAGCAAGGGGGCTAACCAAGCTAGTGAGTATCTTTTGGGCACTTGAGTAGCTGCTCGAGTGCTGACGTACTTGAAGCTGATTTAATTAGAGAAAGAAGAGGGTTATCTACGTATATTAGTTTACTATGACTACGAGTTGTAATAACTCGTCAAGTGTTTTTTAATGCTGTCATAGGCTAGTAAATTTCGGTGATACTGATCTTGGAATCTTGTCCTATTTTTAATGTCAATACATGGCGAGTTCTCTTGTACATGACCAATATTTATTTTCTGATTGCAGTATAGTGACGAGGTATCCCAATAATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAGTGATATTGTTCATGCTCAGAGGACGAATGACAATGACCATAACGAAAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATTCCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGAGTTCAAGCCAGTCCTTGTAAGTTCAATAATTCAGTTTCTGTCCTCATCTGACTAATACTTCTTCAATTCCTTTACGATCGTTTATTCATAGTTTCTGCTTTTTTCCTTAAAGAATTTTCTAGGCATGTCCAGAAAGTGTCCCAAATGTGTTGCACGTATCTGAAATAAAAACAGGACACAGAAATTTGTGTGTCGGACACGTTTCGAAAGTGTGTCTAGAGCGCGTTTGAATCAGACAATCTGCCTAACATGGAGTGTTGTGCGTCATATCTCAAAATCATATTATCTCGAGTACTGTGATTATAAAGTTCCTGAAGTTGAAAAGTATGTACTAGAGATGCAAGGAAATATTTTCCCGGTTCTCCGCAAGTTGCCTTGCACCAAAGGAAATTGAGTATGGTTGATTCAAATAGGATGATTCACAGTAAGGCTGGAATGTAGGTGTTATATTATGTTGCCTTCTTGGTCAAACTAGCTTTTCTTGTGCAAATAGATGCACATGTCGCATACCTGAGTGACAAAACTTCGCACTTGATAAGAATTTGAACATCCAAGTGTCTTTCTATCATTTTGTCTGATTTGAACTTTTGTTATATGCAGATTAAGCGTGGCACCGATGCTGCGACCGCGGACCCTCTAAATTGTATTTTTGGTACTTTGGCCAAAGAAGAAGAAATTCAACACTCCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAATGCCAACAAACTTTGAACATGTGTAACTTCAGGTTCTCTATCAATGCTAAGAAGATTTTATTTTCTCATTTGAAGAAATAAAAAGAGACATTTTTGTACCTAACCATACAATTGAAGAAGGAATGTTAAGTGTCGGCATCGAAGAGACAACAAGACGATGCCAACGAACTTTGAACATGTGTAATTTTAGGTTCTATCCATTCTAAGAAGATTTTATTTTCTCATTTGAAGAAATAAAATGAGACATTTTTGTACCTAACCATACAATTGAAGAAGGAATGTTTAAGACATTTATGCACTTCGAGTGCAGTGTTGTCATTTGCCTATCATAGGAAATAAAAATGGTGTATAATCGTTTCCATCGGGGTCTTTTATGGTTTTTCCTAACCATTTGTGAATTAATGGATAGGCTATCCAAAGATGCCATAGCCATCCATAGTGCAGAGACAGTGCTCATGAGGCAGTTGTGTGGTGGTTGTTGAAATGGAAGAGCCATATTTCAGAGAACACAATAGTGGTTGAAGGTAGGACATTCCAGTCTTTTCTCCATTTGCTTAATTCTCTAACATCCTCAGGAAGATTCGTGGGGAAAAATCTTTCATCATCCAGACCGTGTGGCGTGTTGAAGGCCTTACACAAGAAAGACGACTGGGATTACGCGTCAAGAATGGTAAATGCCTTTTGTCAGACCTTTAATCCTATAATAACTAACTAGAAGCTTCATGCTTTGTCCTTTTTGCATTAATATGAAATTCTTCCCCCTTTTTAGTGGTTTATATATATATATATATATATTCTCTAGTGGGATTGGGGGGGTTTAGATGAGATGCTTTTTGTTGGTACCCATCATTTTTAATCTTTATACTGTGGGGTGAAAACATTTGTTGGGTTCTTAGACTGAGCAGTTGAAAGCAAAAGGAGTTGATTTAAAGTGAAAATATTTGTTGGGTTCTGGGATTGAGCGGTTGAAAGCAAAAGCAGTGATTTGAAATTAATTTTTTTTTCCCTTTTTTTTTTGGAAAGAAATTAAAATTGAAACTTCCATCAATTTGGAATCTTTGAGCTTGATTGCGGCCCCTACCGCACGCATAGTGCCGCAATAGATATGAGACAACTCATAAACCCTACTCTTTTCTCGTTTCCTTTTTTGATTCAAACCTAAAGCTTCAATAGGTTCAAGTTTGGTATGTAGATATAATAATTGTAGATATAGTCATTTATTGTGTTTGTGTGTCTGTGTTTTTTTATAATAAATTTTTGTCGGACGGGGTTCTATTTTCCGATGGATGGTGGCTCGGTAGCTGAAATTTTGTTGATTTGAAGTTTTTGTTCGTCTCACTCACCTCAATTTTCACTCAAGGAATCAAATTGGTTAAAAATTGGAGGGTTTTTTTTTTTTTTTTTTTTTCCCTAATTTTCATATATGTCAAGTGGTATGGTTTCATTTCATTGCCCTAAAACAAATTTTGCAATTAAATTATTTAAAAAGGATGGTAATTTTTATCATCTCATAAAATTGTGTGTCAAAATGCATTAAAAAATATTATCATATGAGTTCAATGAGTATAAAACATTAATATAGTAAAAAATTAATAGGACTAGAAAAAAATGACATTAAAGCTAAATTTTCGTTAGAAATGTAAGGTGTGGTTAAGCTTTCAATCGGTTGTGCATTTTGATTCAAATTTAAATTTGGGCAAAACTTTTGAGTTTATAGGAATTTACTAGTTTGTGGAGCTTTTAACATAGCTTTTTGTTAAAATTAGTTAGCAAATTAGTGAAAATTGAATGCAGGTTCAAAATTGTAACGATTGTTGAAGTGCATAGTCTTTTTTGATGAAATTGAAAGTTATAAGTTTATTGTGAAGATTAATACAATTTGATAAATTTAAGAGTATAAATTAATTTTTTAAGGAAAAATTTAGAATCAAGTTTTGAATTTATTAGTAATTTTTATAAATAAATAAATAAAAAGATGCAGCGTGAGACTAGCTGGGATGCCATGAAACCATTTTATCATTTATGTTTATGTTAACTTGCTTGTATTACCAAAAGTCTCAATTATCGTTTTTGTGTTTTAATGTATTAATTCGTATGATATTTATCTGAGTAAAGGTTAAATAAGAACGTTAGGGTGTTTTGTTAAGCTGACATTGATGTTACTAGCTAACAACTTTTATCACTAATTAATTATATTAGTTATTTGAAATAAAAAAATTGACGATAGTTAAATTAAAGTACATGATAGGGTATTGAATTTCTACCTAACAATTTAGTTCCTATTTGGTAACTATTTGATTATTTATTTTTTTTAAAATTAAGTCTATAAACACTAATTTCACTTCCAAATTTCTTCCTTTGTTATCTATTTTTTTACCAATTGTTTAAAAAATCAAGCAAAAAAATTGAAAACTAAAAAAAATGTAACTATTAAAAACTTGTTTTTGTTTTTGGAATTTGGTTAAGAATTCAACCATTATACTTAAGAAAGATGCAAATCATTAGAAATTGGGAGGAAATAAGCTTAATTTTCAAAAATGCATAACCAACAACAAAATTGTTATTGGGTCTTAGCTTTTGAACTTTAATTGGTAATAATATAATCCCTATACTTTTAAATTTGTAATAGTTTAGTCATTGAACAACATTAATAAGTAACAATTTAGTACATGTACTTTTATAATTTGTATTGATTTAGTCCTTATCTAAAATATATTATAAAAATTTAATGAAATTTTTTGCTCATGTAAATTGATAAACTAATCAAGGATCAAATCTTTTTACAAATTATAAAGACTAAGACACCACTTAATAAAAATTTGCACTATTAGATTCTTTTTTTTTTTTTTTTTGTGTATAGAAACTGAATTGATATTCATTAAGAGTATGTTTGGAATACATTTTCAAATGTTTAATTTAAAAAATAAATTATTTTGGAAACAATTGGAGTGTTTGGTAGAAGTGTATTTTAAATAGATTTTATCAAAATAGTTTAAATAAAAATGAGATTTTTGTTCTCAAGTCAATTCAAACAAGCCCTAAACTTCGATGACTAAATTGTTTTAACTTTGAAAGTATAAAGACTAAAACATTACCAATTAAAGTTTAGGGACTAAATTAATAATTTCATGAAAGTTCAAGGATCACGGGTGTTTATTAATTTTTTTGACGAAATATCATCAAGGTAGAAATAATATTAATAACAATAAATTTAAGAAAAATTATTTTAAATAAAAAAACTGCTGAAAATATTTACAAATATAACAAAATTGTGATAGATGTATATCTTGATCTATTACAGTTTATCATTGATAGAAAATAAAATTTTGCTATATTTGTAAATATTTTAAACTCATTTAAGGAAGGTGAAATGTATTGTGTGAATAGTTTGGTTTTATTTCTATAAATATTAATTGACATTATAATTTCATTAATATTTTTTCACAAAATTCATACTTCAACAATTTCATCATTCTCTACCCATATGTAATATGATTAAATAAAATTTTAATATTTTTAATGTTATCCTATCGTGTTATGGGAAAATAATAATAACTTGCATAAAGCATGTACCATCAATAAATGTTCAAATCTTTTATTCTTATATCATTGAATTAAAAACTAACCGGTTATAAATCTATATGATTTGATAATAGCAACCAAATTATTTGACTAAATTAAAAATTTTTAAACAATAATAATGACCAAATTTATAATTCAAGCGTAAAGTATACCAATTAAATTTATTCAAACTTTAAATACGACTAAATCATTACTTACATCATTACATGTAAAATCCTCGAATGTAAGGAAGCTCAAAGCGAATTGATGCTTCTTAACCTAAATTTTATCCAAAAGTCCCCACATGTAATTTTTTTTTCAGAAAAAAAAAAAATAATAATAACATGACCGACTTTGTAATTTCATCAAAATTAAAGGTTGAATATAGAAATAACATATTTCCCTTCCTTTTTTTATCTGTTGAGCGATTCTTCCTCTCTGGTCTTTGCTCTCACGGCTCTTCCTGCATTCTCTTCCTCCAATGAATGTCTTTCTCCTTCCTTCCCTTCGATTCATTTTTTCAATTCTCTCAAATCCGTCATTCAATTAATTTCCCTTTTTCAGTCCCCATTTTCGAATATTAGGGTTCTCATTCGATTTCTGTTTGTTTTTACACGAAACAATTCGTTTTATTCCTTTATGATTTCCCCTTTCCGTCCTCTGAACACCAATCCCATCAAGATTTCGATTCCATTTGAAATTCGTCCGTCGTTTTTCACTACGGCATCTTTGATTTTCCATTTGATTTGGGTCTTAACCTTTGTTGTTGTTGTTGTTTCTCCCGCCCCTTCAAAACCTGTTGCGACTGTGAAGAACGAGCTCGAAATCAACACCACCGCCGCCATGAAGGTCCACCCATTGCCGAGGAAGCGCAATATCGCCGTCAGGAATAACCCCACTTCGAGAAACTCTCTTGAAGATCAATCCCTTCTGAACAACCACAAGAAACTCAGGAGATTACCTCATATCTTCAGTCGGGTCCTTGAGCTTCCGTTTCGATCTGATGCGGATGTTTTGGTGGAGGAAAATCCCGATTGTTTCCGATTCATTGCTGAAACTGACGGTAACATTAGCGATGGAGTAAGAGCTCATGCTGTGGAAATCCATCCTGGGGTTATTAAGATCGTTGTCCGTGAGAATGAATCGTTGGAAATGTCAATGGATGAGCTCGAATTGGACATGTGGAGGTTTCGGCTACCGGAGACGACGCGACCGGAGCTTGCGAGTGCGGCGTTTGTTGATGGAGAGCTTATAGTTACTGTTCCAAAGGGGAATGAGGAAGAGAATTCTGAAGATGGTGGAGGAGATATCTGGGGAGATGGGAACGAGAGCTTCAGAGATGAAATGGAAGGTCGGCTTGTTCTTGTACAGTAAATTGAATCCTTTCTCTTTTTTTCATTTTGGTTTTTGTACTAACTGTTGGAAATTTCCTTGTTCATCATTATTAATGTTGTTCCAGATTTCCTCTTAGTTTCCCATTGTTGAATCAGTTGATCCATTAGCTTAAAAGAAACTTAGAGAGTAAAACCAGATTGTGCTTCCTATCTGAATGATATCCATCACAAACATAGAACTTGATAACATAATACCCAACTATGAAGCATTAATCCCTTTTCCAATTGTAATCTGCTTTAAGTAGCTTCTAGTAATTTGAACTCCTTTTCTTTTCTCCCCACATCATTACAACCTGACCCCAGTATGATCTTAAAACAGAATCCTGTATTTCAAATATGGGTTTTTTAGGCATTAGTTTATATCAGTATTGTTATCTGTTGTTGGCTTGAATGAAAAACAATTGATAACACAACCGAATAGTAGTCCTTCATACTTTCCCTTTGCTTTCATTTCTTAATTGTTAATGTACAGTCCTGAGATTTTGATTGTGAATGCATTTCTTTAGTGGTTTAGCCATTTTCTAGACATCATTTTTTTTTCTTTTTTTTTTCTTTTTTTTTTGTTTCTTTTGCGTGTACCTCAAACACTTGTCCTTCTCTGTGGTTTGGAAAGTATTGCTGCTTTCTTTCTTGTCCATTTATGGGTCCTATATGTGGCCATATGTTCTTTACTTTTTGTGTTTAAAACATATAATGTGGCACTTTTCATGGTAGCCAAAGATAAAATAAAACCTTCTTTTAAGCCTTATTTTTCTCGTATCTGCCACGTTTAAGTTAGCAAATTCTTATTCCTAAATCCTAACTGCTTTGCCTATTCTCATTCCTTAGATCTCCCTTTTTCCTTTTCATGGTTGACTTGTTCATTTGAGCAATAGCTACCTTCATGCTCTTTGGTATCTTTATGTTTTTTAATCTTTTCTATGGGTCATGTTTGGTGTTGCTTTAATAGCGGTTTCCACTAGGGGTTGAGGATAATGATTCATTTTCTTAGTTAAATAAGAAATTTGAACATCTAACCTCTTAATCAATAGTATATATTGAAACTATTTGAAATAGTTATGTTGATAGTAAATCGTAAACCTTAAATATAACAACTAAATATCGGGTGGCCTCGTATATCTTAAAATAATTCAATAATTTGCTTGTAACATGGAATGATATTAGATACATTTAAGGGCTATTTAGGGTCTAATAGAATGAGTTTGTAATGTAAGGTAATTCAAAATCCGTGTCCAATATTTGTTAGATTGAGGGTCCAATTTTAATATTTGAGTTCTATCATTTTTGGATTCAATTTCTATCATTATTTTAAGGGTCCAATTTTAATATTTATAAGAATTTTAAGAGCTCAATTATTACAATTAAAAATTGAAAATTTGAGAGTGTAACTACAACTACTATCATACTTCAAGTGGTTTTTATAATTTACCTAAATTAAAATTAAGAGAGAAAAATGAGAGGATTGTTGGTGTGTATATACAATATGACAAACAAAAAAAAGAGAAGAGTCGAATTCCATGTAGCTGACCATGTGAATCAGGCGTCAATCCAAACTTTCATCATTTTATTGGCTTCCATGGAATTATTGCATCGCAGCGGCAGGGGCCTACAATCCTCTCCCCATTATTATGACTAATTATTACGGTAGAAAATGTTGGTTCAAGATAATCGTAAATCCTTTTCTAACCTTAAATAAAAATAATAAGAAGAATAAAAAATAAAAATAAGGGGCATTTTAACCTGTGCCTCACCGACTAATGAATTCTGCTTACGTTGGGTTAAGGATGATTTTGTCTATCGTAATCCAAACGTGATGGCCATGTCTATCACAATCTTCATTTTTAGTGGACAAAAATAGAAATTGCAATAGACATAGCCACCACATCAAGAATACGATAGACATGGTTGTCCCTAATACAATATTTTATAAAACGTAAAACATGGAAGTAAACTCCATTTTAACTGTATAATTGGATTTTCAAGACGTGTAGTATAAGAGGACTTTACAAGTTTCTATTTTCATAACATTGATTCATAATCAACACATAATCTCACAAATTTAGATTTGATTTGTCAACCGAACACATTATCGAATATTTTGGCAAAATCCACTAATCCTATCTATCACCAATCATTTTTCGTTATGAATAAAAATGGGAAGAGATTTAGATTTGTTCTATAAATATAAGAGGATCCAACAACCAAGTTGATCTCTTTACTCTTTATTTCGTAGCACAAAACTAGATTTCCCTGGCTTCATATTCAGCTAAGTTTTTAGTTTATAAAATTTAAGAAAAATAAGGAGACTTCATTCACACATTATGGAGGCATTTTCCTAGCCATTATTGGGGGCTAATTATAACTAGTCAAAAGGCCTGTAATAAAAAGGAAAAAAGAAAGAGAATACCAAAAACAAATCGCTTTTTGATTAAACAGCTAGCTAAAAGGAACTTAATAACGTCCTTGGTTGAGGGAAAAATAAAACTTGCATTATCACTACGGTCTCAGAGGACAACTACAAAAATACATTCGTAAGGATGGAAAAGCCAAATGACAAAGTAAGGAGCCGACCTTTTTTGTCGTGAACAGAATCACACGTCAACAGTAGGTGCAACAATCATCATCATCATCCTGATAAAGAAAATGGCGGTTTCACCACATTCCATGAAGCTTCCTTTCTATCTAATAAGTGGGATTATGTCTTCTCTTAGCATTGTAATCAAAAGAAAGTACCTCTCTTCTGCATTTAGTGATAAGAAGTCTGAACCAGACAGACTGGATCCCTACTGAGCTCCACAGAAGCCATAAAAAGCAATCCGTCAAGCGACCAATGGGAACGAGGAAGTTTATCTGAGAAGAGTTCCAATATTCACAAGAATTCAAAGCATAAATAATGGTTTTAAGTGTCTATCTTCAAGCAATGCTTGTTTCTAACTCTCATTTGTTCATCATCCATTCGATATATAAGTAACAAAGAGAACAATTAGGAGCAAAGCTGATAGTTTGGTCCAGTAGAAATTATAAAATGCATGGTGTTGAAAAATTGATATGAAGGTTTACCGTGAAAACTTGTTGCAGAGGGTTAGTTTCCAATAGACCAAAATCTCTCTACTCTACATTCCTTCTGGCAGCTCCGCAACTGCATAGGTTCAGTATTTTGCTTCCTGAATCCAGCTATCTTCATAAACCAATTAAAACCTCGATATGATACACTCAGCCTCATGTGAGGCCATATGATACACAGTAAAAACTACAAAAACCCATATAAGGCTAAATGAACCAACAAAATTCAGTGAAATTTTGTGAGACATCATATGTTTGTGCTTGTGTCTCTACTCAACATCCAAATCGAAGTCCTCCTTGAGGATATCTTTTGGAGTCATTCCAAATCCATCAGCAACCTTTTCCATTCTCTTGAGTCTCTTTACTTCTCGTTTTATTTGGTCCACTCTGTTTGTTACGGCATTTTGTTCCGTTGATGATGTTTGATGTCTGAGTTCAGTTTGCTCCTTGTCAAGTTGCTGAAAGAGATTATGTAGATAATTTTTATGTAACAGAAGCTTTTCTTCTGCCAAATTATACTCGAACTCTTGTGATCTTTTCAGTCCCTGCAAAACCTGATCAATCTCGTCTTCAAGTTTTAGGGACTCAATCCAATGATCAAAAGGGGTGGACATAGTCCATTCTGAGCTTATAATGGAGTCTGAAGTGTCATGAGAACCTTCTGTAGAATCAGCATTATCAGGTGCATCTGTGCATAAAATACAAAATCATTCCTTTATCTTTCTTTCTTTTTTTTTTTTTTTTTTTCCTTTTTCCTATAAGAGTATACTTAATCACTCTTAACAGGCAAGAGTACGGAAGATTGATGGTGAAAATGGCTGTTGAACATACTAAATAGTTGGCAGGTTATATACATTAACCACTTTAGAGACATTGTTTCTAAGAAGAGTTTGTTCGAGTATGTAGACTAATCAGACCCTCAAGTGATGAAATGAAATCTTTCTCCATAGTTATCAAGGGAGGGATTTGACAAAAAAATATTGCCAGTGCAATTAACTGAGCCCCTAATAAATGGTATTACAATTTCTCGTGCTCATACCACTGCGAACTTATCCATACTATTTGATTTGAAGAAAAATTACCAGTGCAATTCGCTGAGCTGTCTTCCTCCATCTTCCAAATCTCTTCCAAGCAAGTCCCAGATTTAAGCTGAAGCAAGCATTCAAAGTCAAATAGCATGGTAAAGTTCACAGAGCAGCATAGGATATTGTAAAATTATTTAGTACTTGTAGAAAATACCAGTATTACCTTTTCAATGTTCAATTCAATATTTCTAAACAGCTCCTTTGCTCTCATTTTGTGTGAACCACGCAAAATGCAGAAACCAAGGCTTAAGATCTCCTCAATATCATCTCGACAGTCAGCTGATTGACATGACTGCAAAAATCTTTCTACATGTGATACCAAATCCGTTCTGGCATCACAACGTCGACAATAATACTCAGCATCCAATCCAATGCTTCCTCCAACTGTCCCAGCTGTATATGATTTAAGACCACATTTTATATGAGCATGATGTCCACAAATATAACCATCACCCACCACTGCTTTACATTTTATGTAGCTATAACTTTCTGTGGTCGTGTCTATAATCTTGCTGCATAGTATACAGCAGCAATCACGGCAAAACCGAGGTTCGCTGCAGCAAATATCACAGGACATGGATTTTAATGAAGATGGATTCTCTGCAACAGATAAACTATTACAGTTCTTATTTCCAGCCTTGCAACCCACTCTATCAATCTGGGACTCAGATGCAGAGCATTCTTCCATCTCTTTTGAAGGTAGAGGGCATGAAATTTGTTTTACTTGAATACCTATATAAGATAAAACCGTATATTAGTGATCTCTAGAAAACAATAGAATAAGATTGTGAAAGCAATCAATCTCCCAATGAATGTGTAACCTGACAAACACATGCACACAAGTTGATTTTATATTAATGATAGAGAAATAATTATGTAAAACAATGCTTTCGACAACATGACAATAGAACTTCAACAAGTCCTGCTGAGTCCTGCCACTTAGTTGGAATCTATCTAAACTTCGAAATTTTCAGAATCTGGAGTCCTAACCTGTATAGCTTGTTAGCCATTAGCCCAATAAAAATGAAAAACCCAACGAATAAGATCTAAATTCTGATAAGGAAGGGAAAGACAAGAGTTTCTGAGACACTGGACATAAACCAGATCTAAAATATTAGGTGCAATGATCCAACCAAAAGAATTTATCATAAGTTTATGAGAGTATAGAGACTACCTGTAACAAGAAAGACAAATCCAATAAAACATAAATTCAATGCATTACATAATCATCAGAAACCTTTTTCATTTAAGGAAACTATTTCAACGTTATGAATATTTCGGCTTAAATAGGCATTTCATTTCAACCAACAGAACTGTCATTTCAGATAGAACAGGCAAGATGAGAATGCTAGAATTGAGATTTACAATAAAGTTAATGGAGCTACAGCAGCTAAATTTCAGGTGTAGGATGAGAAGAGGATTGTCAAATTTATTAAGCTAACCTTGCGCTAAAGATGACTTTTTTGCTGGTATCTTCCAACTGAATGAGGCAAAAAATGCATCAATGTCTGCATTGGGAAACTCAGACTGGATATATCTTTCAACTGAAAGCTTGCTTGCAAAACCATGCCCTTTACGAGCTGAGTTCTCAGAAGTGCCAATACCACGAGGAGAATAAAGGTACCTATCCAGAAAATGGCCAGTTATAGCAACTCTCTTCCCCACCCTCCAACTCCAGTTATCACCAGGATTGGGCCAATTTTCAGGAGCATATGGCAAGCCCTCCCCAGATTCATCTTGAGAAACTGGCCTAAGGATCAGTTCATTTTTCTTCGCCCTAGGTGTACAGCCATTTGTATCCTCAAGAACTTTAGTCTCCACAGGATCCCCCGACATCTAACAATGGTTGATGAACGAACCTATAAGCATAGGCAATACTTTTTTTTTCCTAATGAATATAAGCATAGGCAATATTATTAACATTCATTATTAGAAACTACCAACCCATAAGGGAAAACGATACAGATAGCTCACAATTCTCATGATCAAGTTCCAGAACAAAGAAAAGGTATTAAGAGGGATGGATAGAGAAAACAAAAGGAAAAGAAAATTATAGGAATATGATACCTGTGAATTGAAAGACTATGTTTGACAGGTAGGAATGGAATCTATAAAAACTTATAACATCTGGTTTTATTTTTACAAAATAATTTATCTCGTATAATTTTTATTAGATTTATTGACAATATTTGTTTAATAAATTACTTCGCACTCAGGGTAATTGTTTTAAATGGTAAAACCACTTTAAATATTTTCAAATATAGCAAAGTATGATAGAAAGTGACTTATTGTTATATTTGTAAATAAATTGACTCATTTTGCTTTATTTGAAAACAGGTCCTGAATACGTATTTAACTTAACATGAATTATTTCTAATCTATTTAAGATATTGGTGCTCATTATAACATGAGTTAGGATGTCAATGCTCATTAGTAACTTGCAGGTGATGCTGATTTCCTTAATAACACATAACAAGCGCCTCTAAATCACTGGTTTGCAGTATTTAAGGTAGCAAACCGATAGCTTGAACAATAAAAATAATGACCGAGCACGTTTTTTGTTTTTTCTTTCTTCTAAATCATGAAAGGATAAACACGAGATGAGAAATACGAAACTACCAAACCGGCCCTAAATAATTTTCCGCCCCTAGTTCTGAAAAAGAGAATTTACTAATCAACGAAGATCGACAAGATAATAAAGCTCACCTCGCTCGAAGAATTTAATAAGACATCCAGAGAACTTTAGGGTTTTCTCTGTCCAGTAGTCGTCTCTTAATCTGCAAAACAAGGAGGAAAGTAAGGAAGTGCAATACAGAAGTGCCTGAAGCGGGAAGTGAATCGAGACAGAAAAGTAACATAATCATAACGAAAACAGTGCTCACGTTGTAGAGAAACTAGAGAGAATGAGAAAGTAGAGAGATTAAGCAGATGAATTTTTTCGCGGGAATGAGAAGATGCTGCACAAGAACGAATGCCGAAGCATAGGAAATCGAGAAAAAGGTTCTTTCCGCCGCACAACGGGAAGAAAAAGGAGGCTCGACGCCTTTGATGGACCGACGAAGTACTGGACCTGAAGGCCCAATACGATAGGTCCAACTGATCAATGGATTTTTGTTCTTGTTTTTTGTTTTTTAAATATTTACACACAAAAAAAAGTGAAC

mRNA sequence

ATGGATCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTGCACGGCCAAAGAGAGCCTCTTGAGGGCAGTCACTCTCTGTATAGCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACGTAGCGAGCTTCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACAGCTCGCATACATTGTGATGAGAGCACTTTACTACTAGAATTGGCCACAATATCCATGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACATCCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAACTTCGACGCGAGCCCATGAATCTCCATCCCCGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCTCATCTTCTCGTATACAAAAAGCAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGCTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTTCAAAAAGAAGGGAATTGGAATCACAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGAGCGGTGGGTCTGGTGGGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGTCGCGGAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAACGTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGACTCCTTGGAGGTGGCGTCACATTGGCAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGGTCTCCACTCGGCTACATATTTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATTATAGTGACGAGGTATCCCAATAATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAGTGATATTGTTCATGCTCAGAGGACGAATGACAATGACCATAACGAAAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATTCCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGAGTTCAAGCCAGTCCTTATTAAGCGTGGCACCGATGCTGCGACCGCGGACCCTCTAAATTGTATTTTTGGTACTTTGGCCAAAGAAGAAGAAATTCAACACTCCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAATGCCAACAAACTTTGAACATGTGTAACTTCAGGTTCTCTATCAATGCTAAGAAGATTTTATTTTCTCATTTGAAGAAATAAAAAGAGACATTTTTGCTATCCAAAGATGCCATAGCCATCCATAGTGCAGAGACAGTGCTCATGAGGCAGTTGTGTGGTGGTTGTTGAAATGGAAGAGCCATATTTCAGAGAACACAATAGTGGTTGAAGGTAGGACATTCCAGTCTTTTCTCCATTTGCTTAATTCTCTAACATCCTCAGGAAGATTCGTGGGGAAAAATCTTTCATCATCCAGACCGTGTGGCGTGTTGAAGGCCTTACACAAGAAAGACGACTGGGATTACGCGTCAAGAATGAACGAGCTCGAAATCAACACCACCGCCGCCATGAAGGTCCACCCATTGCCGAGGAAGCGCAATATCGCCGTCAGGAATAACCCCACTTCGAGAAACTCTCTTGAAGATCAATCCCTTCTGAACAACCACAAGAAACTCAGGAGATTACCTCATATCTTCAGTCGGGTCCTTGAGCTTCCGTTTCGATCTGATGCGGATGTTTTGGTGGAGGAAAATCCCGATTGTTTCCGATTCATTGCTGAAACTGACGGTAACATTAGCGATGGAGTAAGAGCTCATGCTGTGGAAATCCATCCTGGGGTTATTAAGATCGTTGTCCGTGAGAATGAATCGTTGGAAATGTCAATGGATGAGCTCGAATTGGACATGTGGAGGTTTCGGCTACCGGAGACGACGCGACCGGAGCTTGCGAGTGCGGCGTTTGTTGATGGAGAGCTTATAGTTACTGTTCCAAAGGGGAATGAGGAAGAGAATTCTGAAGATGGTGGAGGAGATATCTGGGGAGATGGGAACGAGAGCTTCAGAGATGAAATGGAAGGTCGGCTTGTTCTTTTTTAGGGACTCAATCCAATGATCAAAAGGGGTGGACATAGTCCATTCTGAGCTTATAATGGAGTCTGAAGTGTCATGAGAACCTTCTGTAGAATCAGCATTATCAGGCAAGAGTACGGAAGATTGATGGTGAAAATGGCTGTTGAACATACTAAATAGTTGGCAGGTTATATACATTAACCACTTTAGAGACATTGTTTCTAAGAAGAGTTTGTTCGAGTATGTAGACTAATCAGACCCTCAAGTGATGAAATGAAATCTTTCTCCATAGTTATCAAGGGAGGGATTTGACAAAAAAATATTGCCAGTGCAATTAACTGAGCCCCTAATAAATGGTATTACAATTTCTCGTGCTCATACCACTGCGAACTTATCCATACTATTTGATTTGAAGAAAAATTACCAGTGCAATTCGCTGAGCTGTCTTCCTCCATCTTCCAAATCTCTTCCAAGCAAGTCCCAGATTTAAGCTGAAGCAAGCATTCAAAGTCAAATAGCATGGTAAAGTTCACAGAGCAGCATAGGATATTGTAAAATTATTTAGTACTTGTAGAAAATACCAGTATTACCTTTTCAATGTTCAATTCAATATTTCTAAACAGCTCCTTTGCTCTCATTTTGTGTGAACCACGCAAAATGCAGAAACCAAGGCTTAAGATCTCCTCAATATCATCTCGACAGTCAGCTGATTGACATGACTGCAAAAATCTTTCTACATGTGATACCAAATCCGTTCTGGCATCACAACGTCGACAATAATACTCAGCATCCAATCCAATGCTTCCTCCAACTGTCCCAGCTGTATATGATTTAAGACCACATTTTATATGAGCATGATGTCCACAAATATAACCATCACCCACCACTGCTTTACATTTTATGTAGCTATAACTTTCTGTGGTCGTGTCTATAATCTTGCTGCATAGTATACAGCAGCAATCACGGCAAAACCGAGGTTCGCTGCAGCAAATATCACAGGACATGGATTTTAATGAAGATGGATTCTCTGCAACAGATAAACTATTACAGTTCTTATTTCCAGCCTTGCAACCCACTCTATCAATCTGGGACTCAGATGCAGAGCATTCTTCCATCTCTTTTGAAGGTAGAGGGCATGAAATTTGTTTTACTTGAATACCTTGCGCTAAAGATGACTTTTTTGCTGGTATCTTCCAACTGAATGAGGCAAAAAATGCATCAATGTCTGCATTGGGAAACTCAGACTGGATATATCTTTCAACTGAAAGCTTGCTTGCAAAACCATGCCCTTTACGAGCTGAGTTCTCAGAAGTGCCAATACCACGAGGAGAATAAAGGTACCTATCCAGAAAATGGCCAGTTATAGCAACTCTCTTCCCCACCCTCCAACTCCAGTTATCACCAGGATTGGGCCAATTTTCAGGAGCATATGGCAAGCCCTCCCCAGATTCATCTTGAGAAACTGGCCTAAGGATCAGTTCATTTTTCTTCGCCCTAGGTGTACAGCCATTTGTATCCTCAAGAACTTTAGTCTCCACAGGATCCCCCGACATCTCGCTCGAAGAATTTAATAAGACATCCAGAGAACTTTAGGGTTTTCTCTGTCCAGTAGTCGTCTCTTAATGTTGTAGAGAAACTAGAGAGAATGAGAAAGTAGAGAGATTAAGCAGATGAATTTTTTCGCGGGAATGAGAAGATGCTGCACAAGAACGAATGCCGAAGCATAGGAAATCGAGAAAAAGGTTCTTTCCGCCGCACAACGGGAAGAAAAAGGAGGCTCGACGCCTTTGATGGACCGACGAAGTACTGGACCTGAAGGCCCAATACGATAGGTCCAACTGATCAATGGATTTTTGTTCTTGTTTTTTGTTTTTTAAATATTTACACACAAAAAAAAGTGAAC

Coding sequence (CDS)

ATGGATCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTGCACGGCCAAAGAGAGCCTCTTGAGGGCAGTCACTCTCTGTATAGCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACGTAGCGAGCTTCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACAGCTCGCATACATTGTGATGAGAGCACTTTACTACTAGAATTGGCCACAATATCCATGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACATCCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAACTTCGACGCGAGCCCATGAATCTCCATCCCCGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCTCATCTTCTCGTATACAAAAAGCAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGCTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTTCAAAAAGAAGGGAATTGGAATCACAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGAGCGGTGGGTCTGGTGGGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGTCGCGGAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAACGTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGACTCCTTGGAGGTGGCGTCACATTGGCAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGGTCTCCACTCGGCTACATATTTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATTATAGTGACGAGGTATCCCAATAATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAGTGATATTGTTCATGCTCAGAGGACGAATGACAATGACCATAACGAAAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATTCCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGAGTTCAAGCCAGTCCTTATTAAGCGTGGCACCGATGCTGCGACCGCGGACCCTCTAAATTGTATTTTTGGTACTTTGGCCAAAGAAGAAGAAATTCAACACTCCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAATGCCAACAAACTTTGAACATGTGTAA

Protein sequence

MDLTVDLTSSHKGSPNNSIRSKWHCLNHLKSFSQKICTAKESLLRAVTLCIARNHCAPTYQVLLYHLIKHVASFSNLSTFCKPRNHGSRNNHVLGGNLFIEIPSQVHGATARIHCDESTLLLELATISMSNNISFIPINHDAPSNDIPVRHFIKHQPSFHYPSTGCVHSNQTRLHKWVRPKSQLRREPMNLHPRSQGKIPRSSFQQRREAVPIQTQTLRPHLLVYKKQTGLGLGNLIKHREGIVEARRESTRGKNKFKKKGIGITAFEFGAKDLSVDLFEVEEAGGGGESGGESGGSGGEIGMQSNGGGEVDGENGGKYQDEDVEFLYVAEFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
Homology
BLAST of CmUC05G085890 vs. NCBI nr
Match: XP_038878005.1 (uncharacterized protein LOC120070209 isoform X1 [Benincasa hispida])

HSP 1 Score: 345.9 bits (886), Expect = 6.9e-91
Identity = 191/267 (71.54%), Postives = 207/267 (77.53%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A  HL  VL SKQN LTI+EANLLQTC SKAVRD+T GGL+GGGVT A        +
Sbjct: 1   MGEALFHLEQVLRSKQNSLTIEEANLLQTCRSKAVRDFTLGGLIGGGVTWAGTWRLNKFI 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F  SL S VD ILAL GSRMQKELANI+VTRY N+PR MQ I K
Sbjct: 61  RLNLSGGAAALCGLWRFSLSLTSCVDHILALHGSRMQKELANIVVTRYHNDPRAMQLISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HFYYEEVFDDSTLDRPK   R RNFFS D+ HAQRT DND  +N+HGNSHHDSSNRDSSA
Sbjct: 121 HFYYEEVFDDSTLDRPKIRWRSRNFFSDDVAHAQRTPDNDPKDNLHGNSHHDSSNRDSSA 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAK--EEEIQHSSASNPSP 578
           YQSDSYG+PDDKGNALE KPVL K GTDA T DPL+CIFGTLA+  EEEIQHSSAS+PSP
Sbjct: 181 YQSDSYGDPDDKGNALELKPVLTKPGTDATTTDPLDCIFGTLAREEEEEIQHSSASSPSP 240

Query: 579 KSHSRSRRYNRRHRRDKKTMPTNFEHV 595
           KSHSRSRRYNRRHR+  +TMPTNFEHV
Sbjct: 241 KSHSRSRRYNRRHRKGNQTMPTNFEHV 267

BLAST of CmUC05G085890 vs. NCBI nr
Match: XP_022956077.1 (uncharacterized protein LOC111457878 [Cucurbita moschata])

HSP 1 Score: 334.7 bits (857), Expect = 1.6e-87
Identity = 184/265 (69.43%), Postives = 206/265 (77.74%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V
Sbjct: 1   MGEALFELEQVLRSKQNSLTIEEANVLQTCKSKAVRDFTFGFLVGGGVTWAGTWRLNKFV 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F RSL+S VD ILALDGSRMQKELANI+VT+Y N+PRTMQHI K
Sbjct: 61  RLNLSGGAGALFGLRRFSRSLSSCVDHILALDGSRMQKELANIVVTKYHNDPRTMQHISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+ 
Sbjct: 121 HFFYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHYNDPKDNLHGN-HHDSSNRDSNP 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKS 578
            QSDSYG+PDDKGNA EF PVL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKS
Sbjct: 181 NQSDSYGDPDDKGNAFEFTPVLTKPGADAATADPLDYIFGTLTREEEIQHSSASSPSPKS 240

Query: 579 HSRSRRYNRRHRRDKKTMPTNFEHV 595
           H RS+RYNRRHRR  +TMPT+FEHV
Sbjct: 241 HHRSKRYNRRHRRHNQTMPTDFEHV 264

BLAST of CmUC05G085890 vs. NCBI nr
Match: XP_023527180.1 (uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 334.0 bits (855), Expect = 2.7e-87
Identity = 183/265 (69.06%), Postives = 206/265 (77.74%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V
Sbjct: 1   MGEALFELEQVLRSKQNSLTIEEANVLQTCKSKAVRDFTFGFLVGGGVTWAGTWRLNKFV 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F RSL+S VD ILALDGSRMQKELANI+VT+Y N+PRTMQHI K
Sbjct: 61  RLNLSGGAGALFGLRRFSRSLSSCVDHILALDGSRMQKELANIVVTKYHNDPRTMQHISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+ 
Sbjct: 121 HFFYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHYNDPKDNLHGN-HHDSSNRDSNP 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKS 578
            QSDSYG+PDDKGNA EF PVL K G DAATADPL+ IFGT+ +EEEIQHSSAS+PSPKS
Sbjct: 181 NQSDSYGDPDDKGNAFEFTPVLTKPGADAATADPLDYIFGTMTREEEIQHSSASSPSPKS 240

Query: 579 HSRSRRYNRRHRRDKKTMPTNFEHV 595
           H RS+RYNRRHRR  +TMPT+FEHV
Sbjct: 241 HHRSKRYNRRHRRHNQTMPTDFEHV 264

BLAST of CmUC05G085890 vs. NCBI nr
Match: KAG6582303.1 (hypothetical protein SDJN03_22305, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 332.0 bits (850), Expect = 1.0e-86
Identity = 185/270 (68.52%), Postives = 208/270 (77.04%), Query Frame = 0

Query: 334 NRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA---- 393
           +RSS M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A    
Sbjct: 54  DRSSDMGEALFELEQVLKSKQNSLTIEEANVLQTCKSKAVRDFTFGFLVGGGVTWAGTWR 113

Query: 394 ----VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTM 453
               V    +  A  L     F RSL+S VD ILALDGSRMQKELANI+VT+Y N+PRTM
Sbjct: 114 LNKFVRLSLSGGAGVLFGLRRFSRSLSSCVDHILALDGSRMQKELANIVVTKYHNDPRTM 173

Query: 454 QHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSN 513
           Q I KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSN
Sbjct: 174 QLISKHFFYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHYNDPKDNLHGN-HHDSSN 233

Query: 514 RDSSAYQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASN 573
           RDS+  Q DSYG+PDDKGNA EF PVL K G DAATADPL+ IFGTL +EEEIQHSSAS+
Sbjct: 234 RDSNPNQRDSYGDPDDKGNAFEFTPVLTKPGADAATADPLDYIFGTLTREEEIQHSSASS 293

Query: 574 PSPKSHSRSRRYNRRHRRDKKTMPTNFEHV 595
           PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Sbjct: 294 PSPKSHHRSKRYNRRHRRHNQTMPTDFEHV 322

BLAST of CmUC05G085890 vs. NCBI nr
Match: XP_022980008.1 (uncharacterized protein LOC111479542 [Cucurbita maxima])

HSP 1 Score: 330.1 bits (845), Expect = 3.9e-86
Identity = 183/265 (69.06%), Postives = 204/265 (76.98%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V
Sbjct: 1   MGEALFELEQVLRSKQNSLTIEEANVLQTCKSKAVRDFTFGFLVGGGVTWAGTWRLNKFV 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F RSL+S VD ILALDGSRMQKELANI+VT+  N+PRTMQHI K
Sbjct: 61  RLNLSGGAGALFGLRRFSRSLSSCVDHILALDGSRMQKELANILVTKNHNDPRTMQHISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQR + ND  +N+HGN HHDSSNRDS+ 
Sbjct: 121 HFFYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHYNDPKDNLHGN-HHDSSNRDSNP 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKS 578
            QSDSYGEPDDKGNA EF PVL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKS
Sbjct: 181 NQSDSYGEPDDKGNAFEFTPVLTKPGADAATADPLDYIFGTLTREEEIQHSSASSPSPKS 240

Query: 579 HSRSRRYNRRHRRDKKTMPTNFEHV 595
           H RS+RYNRRHRR  +TMPT+FEHV
Sbjct: 241 HHRSKRYNRRHRRHNQTMPTDFEHV 264

BLAST of CmUC05G085890 vs. ExPASy TrEMBL
Match: A0A6J1GVC2 (uncharacterized protein LOC111457878 OS=Cucurbita moschata OX=3662 GN=LOC111457878 PE=4 SV=1)

HSP 1 Score: 334.7 bits (857), Expect = 7.7e-88
Identity = 184/265 (69.43%), Postives = 206/265 (77.74%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V
Sbjct: 1   MGEALFELEQVLRSKQNSLTIEEANVLQTCKSKAVRDFTFGFLVGGGVTWAGTWRLNKFV 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F RSL+S VD ILALDGSRMQKELANI+VT+Y N+PRTMQHI K
Sbjct: 61  RLNLSGGAGALFGLRRFSRSLSSCVDHILALDGSRMQKELANIVVTKYHNDPRTMQHISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+ 
Sbjct: 121 HFFYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHYNDPKDNLHGN-HHDSSNRDSNP 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKS 578
            QSDSYG+PDDKGNA EF PVL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKS
Sbjct: 181 NQSDSYGDPDDKGNAFEFTPVLTKPGADAATADPLDYIFGTLTREEEIQHSSASSPSPKS 240

Query: 579 HSRSRRYNRRHRRDKKTMPTNFEHV 595
           H RS+RYNRRHRR  +TMPT+FEHV
Sbjct: 241 HHRSKRYNRRHRRHNQTMPTDFEHV 264

BLAST of CmUC05G085890 vs. ExPASy TrEMBL
Match: A0A6J1IXZ4 (uncharacterized protein LOC111479542 OS=Cucurbita maxima OX=3661 GN=LOC111479542 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 1.9e-86
Identity = 183/265 (69.06%), Postives = 204/265 (76.98%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V
Sbjct: 1   MGEALFELEQVLRSKQNSLTIEEANVLQTCKSKAVRDFTFGFLVGGGVTWAGTWRLNKFV 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F RSL+S VD ILALDGSRMQKELANI+VT+  N+PRTMQHI K
Sbjct: 61  RLNLSGGAGALFGLRRFSRSLSSCVDHILALDGSRMQKELANILVTKNHNDPRTMQHISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQR + ND  +N+HGN HHDSSNRDS+ 
Sbjct: 121 HFFYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRAHYNDPKDNLHGN-HHDSSNRDSNP 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKS 578
            QSDSYGEPDDKGNA EF PVL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKS
Sbjct: 181 NQSDSYGEPDDKGNAFEFTPVLTKPGADAATADPLDYIFGTLTREEEIQHSSASSPSPKS 240

Query: 579 HSRSRRYNRRHRRDKKTMPTNFEHV 595
           H RS+RYNRRHRR  +TMPT+FEHV
Sbjct: 241 HHRSKRYNRRHRRHNQTMPTDFEHV 264

BLAST of CmUC05G085890 vs. ExPASy TrEMBL
Match: A0A0A0L4T1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G146600 PE=4 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 1.3e-82
Identity = 181/275 (65.82%), Postives = 203/275 (73.82%), Query Frame = 0

Query: 329 VAEFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTL 388
           V  F ++SS M      L +VL SK N LTI+EA LLQTC SKAVRD+TFGG+LGGG+T 
Sbjct: 16  VPVFGDQSSDMGEILNELEYVLRSKPNGLTIEEAILLQTCRSKAVRDFTFGGILGGGLTW 75

Query: 389 AVCSGRN--------MEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPN 448
           A     N        + A  L     F RSLNS VD ILALDGSRMQKELANI+VTRY N
Sbjct: 76  AGAWRLNKFTRLNLSVGAASLCGFWRFSRSLNSCVDYILALDGSRMQKELANIVVTRYHN 135

Query: 449 NPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSH 508
           +P  MQ+I KHFYYEEVFDDST DRPK   RYRNFFS D+ H+QRT+ ND+  NVH NSH
Sbjct: 136 DPHAMQYISKHFYYEEVFDDSTSDRPKIRWRYRNFFSDDVAHSQRTHGNDN--NVHENSH 195

Query: 509 HDSSNRDSSAYQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQH 568
                RDSSAYQ DSYG+PDD GNA EFKPVL K GTDAATADPL+CIFGTLA++EEIQ+
Sbjct: 196 -----RDSSAYQGDSYGDPDDNGNAHEFKPVLTKPGTDAATADPLDCIFGTLARKEEIQN 255

Query: 569 SSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV 595
           S+ S PSPK HSRSRRYNRRHR+D  T  TNFEHV
Sbjct: 256 STPSIPSPKPHSRSRRYNRRHRKDNHTKSTNFEHV 283

BLAST of CmUC05G085890 vs. ExPASy TrEMBL
Match: A0A1S3AWL2 (uncharacterized protein LOC103483703 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103483703 PE=4 SV=1)

HSP 1 Score: 311.6 bits (797), Expect = 7.0e-81
Identity = 175/258 (67.83%), Postives = 198/258 (76.74%), Query Frame = 0

Query: 346 LHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRNM--------E 405
           L +VL SK N LTI+EA LLQTC SKAVRD+TFGG+LGGG+T A     N          
Sbjct: 8   LEYVLRSKPNGLTIEEAILLQTCRSKAVRDFTFGGILGGGLTWAGTWRLNKFTRLNLSGG 67

Query: 406 AEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEV 465
           A  L     F RSLNS VD IL+LDGSRMQKELANI+VTRY N+PR MQ+I KHF+YEEV
Sbjct: 68  AAALCGFWRFSRSLNSCVDYILSLDGSRMQKELANIVVTRYHNDPRAMQYISKHFFYEEV 127

Query: 466 FDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYG 525
           FDDST DRPK   RYRNFFS D+ H+QRT+ ND+  NVH NSH     RDSSA+Q DSYG
Sbjct: 128 FDDSTSDRPKIRWRYRNFFSDDVAHSQRTHGNDN--NVHENSH-----RDSSAHQRDSYG 187

Query: 526 EPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRY 585
           + DDKGNA EFKPVL K GTD+ATADPL+CIFGTLA+EEEIQHS+ S PSPK HSRSRRY
Sbjct: 188 DSDDKGNAHEFKPVLTKTGTDSATADPLDCIFGTLAREEEIQHSTPSAPSPKPHSRSRRY 247

Query: 586 NRRHRRDKKTMPTNFEHV 595
           NRRHR+D +T PTNFE+V
Sbjct: 248 NRRHRKDNQTKPTNFEYV 258

BLAST of CmUC05G085890 vs. ExPASy TrEMBL
Match: A0A6J1C8I6 (uncharacterized protein LOC111009363 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111009363 PE=4 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 6.6e-79
Identity = 171/257 (66.54%), Postives = 193/257 (75.10%), Query Frame = 0

Query: 339 MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------V 398
           M  A   L  VL SKQN LTI+EA LLQTC SKAVRD+TFG L GGGVT A        +
Sbjct: 1   MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALAGGGVTWAGTWRLNKFI 60

Query: 399 CSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPRTMQHIFK 458
               +  A  L     F RSLNS VD ILALDGSRMQKELANI+VT+Y N+PRTMQHI K
Sbjct: 61  RLNLSGGAAALFGLWRFSRSLNSCVDHILALDGSRMQKELANIVVTKYHNDPRTMQHISK 120

Query: 459 HFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSA 518
           HFYYE+VFDDSTLDRP+   RYRNFFS D+ H QRT+DND   N+HGNSHH SSN DS++
Sbjct: 121 HFYYEKVFDDSTLDRPRIRWRYRNFFSDDVAHGQRTHDNDTKNNLHGNSHHYSSNHDSNS 180

Query: 519 YQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKS 578
            Q+ SY EPDDKGNALEFKPVL K GTD ATADPL+C+FG LAK EEIQHS++S  + KS
Sbjct: 181 NQNYSYDEPDDKGNALEFKPVLTKPGTD-ATADPLDCLFGPLAKGEEIQHSNSSTTTFKS 240

Query: 579 HSRSRRYNRRHRRDKKT 587
           HSRSRRY+RRHRR  +T
Sbjct: 241 HSRSRRYHRRHRRHNQT 256

BLAST of CmUC05G085890 vs. TAIR 10
Match: AT1G05430.1 (unknown protein; Has 31 Blast hits to 31 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 51.2 bits (121), Expect = 3.3e-06
Identity = 75/271 (27.68%), Postives = 115/271 (42.44%), Query Frame = 0

Query: 341 AAFIHLHHVLMSK--QNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCS------ 400
           AA   L  VL SK  Q  +T +E+  + +C  KA+    F   +GGG+T  V        
Sbjct: 4   AALNQLFVVLASKPEQEKITPEESRAIVSCHFKALWTAGFASGVGGGLTWQVTKKLKKPK 63

Query: 401 --GRNMEAEGLHSATYF-------CRSLNSSVDDILALDGSRMQKELANIIVTRYPNNPR 460
              R   A G+ ++T+         +   SS+D IL+ D +RMQKEL N++V        
Sbjct: 64  GLERVALAAGVAASTFVVAWNWSSSKYAVSSLDHILSQDATRMQKELVNVLVRSNRGEAW 123

Query: 461 TMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDIVHAQ---RTNDNDHNENVHGNSHH 520
             Q + KHFY E V+ D   D+P+   R R  F++I  +        +  N N   N  H
Sbjct: 124 RWQLMSKHFYPESVYGDEG-DKPQMRWRRRTTFTEIASSYDDVNATKSQRNPNGLPNPSH 183

Query: 521 DSSNRDSSAYQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHS 580
              +  S A ++    + +  GN+            + A  D L+ +FG     E I   
Sbjct: 184 RRISGGSDASKTKQTLQ-NSSGNS----------DGEMAEEDVLDIVFGCSEATESIPAP 243

Query: 581 SASNPSPKSHSR-SRRYNRRHRRDKKTMPTN 591
             S  + K+ +R  +R  RR R   +   TN
Sbjct: 244 VISKLASKTQTRKQKRAQRRQRLKNREASTN 262

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038878005.16.9e-9171.54uncharacterized protein LOC120070209 isoform X1 [Benincasa hispida][more]
XP_022956077.11.6e-8769.43uncharacterized protein LOC111457878 [Cucurbita moschata][more]
XP_023527180.12.7e-8769.06uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo][more]
KAG6582303.11.0e-8668.52hypothetical protein SDJN03_22305, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022980008.13.9e-8669.06uncharacterized protein LOC111479542 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1GVC27.7e-8869.43uncharacterized protein LOC111457878 OS=Cucurbita moschata OX=3662 GN=LOC1114578... [more]
A0A6J1IXZ41.9e-8669.06uncharacterized protein LOC111479542 OS=Cucurbita maxima OX=3661 GN=LOC111479542... [more]
A0A0A0L4T11.3e-8265.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G146600 PE=4 SV=1[more]
A0A1S3AWL27.0e-8167.83uncharacterized protein LOC103483703 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1C8I66.6e-7966.54uncharacterized protein LOC111009363 isoform X1 OS=Momordica charantia OX=3673 G... [more]
Match NameE-valueIdentityDescription
AT1G05430.13.3e-0627.68unknown protein; Has 31 Blast hits to 31 proteins in 9 species: Archae - 0; Bact... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 556..594
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 556..570
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 486..523
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 283..316
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 486..502
NoneNo IPR availablePANTHERPTHR35986EXPRESSED PROTEINcoord: 336..585

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC05G085890.1CmUC05G085890.1mRNA