CmaCh17G005990 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh17G005990
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: DRBM domain-containing protein
Location: Cma_Chr17: 4977413 .. 4999009 (+)
RNA-Seq Expression: CmaCh17G005990
Synteny: CmaCh17G005990
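The Location field encodes a sequence ID, a start, an end, and a strand. As a minimal sketch, the snippet below parses that string and computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function names `parse_location` and `span_length` are illustrative only.

```python
import re


def parse_location(text):
    """Split a string like 'Cma_Chr17: 4977413 .. 4999009 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr17: 4977413 .. 4999009 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 21,597 bases on the plus strand of Cma_Chr17.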
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATACTTGCATTATTGGATTACTTAGTTGAACCTATGCTTCCTTCAAAGTCATCTTCAATAGAAAATCCACCACTAGCTCTACTGCAATCAGTTGCAAAACAGGTACTTCTGAAATTGACTTTCGTGTGTTTAATATTTTAATTCACTTTCATTCTTGAAATATTCGAACAATTATAATTAGGTTAATAATTTTGGTTTAGGACTGGGTGATTGGAGTTGATTTACTGGAAACATAAAAGTTGTACAAGCAGCCGTCTGTGCTTTAATATCCATCATCTTTTGAAAGTACCAAAAGATTATGATAATACTATTACTTGTGCTTAAAAAAATTTAATTTCTACAGTGCTCTCTTCATTGACTAACCGAATGCTTTAATTTTTTCTAGTCTTTTGAATGTACCAGCGTCAAGCATAAAACTACCTATGTAAATAAATATATACTTGTCCAAAAAACTCTCTCACTATCTTACACACACACAAATGCACACGCACATTCACACAGACATGATAACTGAAAGAAGGCACATAGTGAGAGGAGGGATATTGGGGGCTATGTATCCAGGATTCAGCTATACTCCATAACAATAGAAATTACAAGGAATGTTCATTCATTAAATTTATTTGAGCAAATTGATTAACATATTTTCCAATATTTGAAACAACTGCAAATAAAAAGTAATTCACGGCACTTCTGCTGCTTTTGCTAGTCTATGATTTATTTGATTTTTCTCATTTACACGTTTACACGTTTTAGTTTACCACCATTCTTGTTGCTGATAAACTTTGGATTTTCCTATATTTAATAGCAGCCGAAATTGACTAGTACTGTTCTTATGATTAACACGGAATAGGTTTAAAGTAGGTTAGTTTTAACTATATGTGGCAGCTTATTTCTCACTACTCCTTCCCATAAGAAAAAAATTGTTAAGAGAGAATAAAAGTGTCATCTATATTTAAAATCGTTATTGTTAAAAAATAATGTTATCTAACTTAGTGTTTTTCCTTTGAATCTACTATTAGATGCATGCCGTTGTTTTGTTATACAACTACTACCACAGGAAACAACACCCACATCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAATTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGCTCTGATGATATTGAATTGGAAAATCCCGAGAAGCAGCTTTCTCCATCCGAAAAAGCAATTATGGATGCATGTGATTTAGCCACTTGTCTATATACATCAAAAGATGAAAATATAGAGGGCTGGCCCCTTTCCAAGGTTGCCGTTTTTTTGATCGACTCCAAGAAGGAGCATTGCCATTTGCTGTTTAGTTCCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAAATTTAGATACCTATGAATGCCAACCAAAAAGCGTGGAAGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGTTGAAACTAAGACTCAGCAGCTTGCATATTCAGCAGTCAAGGAAGCAACTGGTATGTATTCTACTTTGTTAACTCATCGATAGTCAATGGCTATATCTTGTTAAGTATACCAAGGAGAACTGCTTTTATTTCTCACCTTTTTATATTGAATGAGATGTCCCTCTTTGTTCGTGTTAGATTCAAAGATATGTTTTAGAATTCTAATGACCTATTGCAAACATGATTGTCTCATATATATTTTCTTTCTCAATTAAGGATCATGTATGGTTAAGTATTAGGAAGTGAATTGTAACAACCCGATTTTCGATACCTCGAATCACAGGTNTTTTTTTTTTGTAAAACGGGAGACACAAGGAATTTAAAATTTAAATATGCATTCAAAACCAACGTAAAAGTAGTGTAAAGGTAATATTTGCGGAGTTAAATCAAAACGACTTACAAAAAATGCATAAGTTTTGGGAAGTAGCATTAAA
ATAATAATAATATGACAAGAAGGAAGATGACTCGATCTATAGAGCACCCCCTGTAGTTGCACGGCTCTGCACGCTCCTGCCATCAACAATGATCTACATTCCCCCTAAAAAAAAAAAAATAGAATAACATGGGATGGGTATAAAAATACCCANTTTTTTTTTTTTTTTTTTTTTTGTAAAACGGGAGACACAAGGAATTTAAAATTTAAATATGCATTCAAAACCAACGTAAAAGTAGTGTAAAGGTAATATTTGCGGAGTTAAATCAAAACGACTTACAAAAAATGCATAAGTTTTGGGAAGTAGCATTAAAATAATAATAATATGACAAGAAGGAAGATGACTCGATCTATAGAGCACCCCCTGTAGTTGCACGGCTCTGCACGCTCCTGCCATCAACAATGATCTACATTCCCCCTAAAAAAAAAAAATAGAATAACATGGGATGGGTATAAAAATACCCAGTAAGCAACCTACTTGTAGGCTCTTGTCGAACTTCATCCTATACATGGAAACTTAGGCTATGTGCTCATAATTGTCCCTTGCGATAGCCAAACTGTGGCTTAAACGTCCATACCAAACTTTAGCATACCTCTTTTAAGGAGTATTCAATCCCTCTCAGGTTCACAACCTTGGAGTTTTCTGTTTTCTCGTCTCTAGGTTTCACGTATGTCTAGGTCCTTAGTCAATCTTGCACAAAGTGCGCTTCAAGTCGTTGGGCCCATGGAAAGGCCCTAGGGTAATGTTCGCAAGTCTGGAGGAGCCCTTTTTGCTCTAATTGGTGACTGTACTCCATACATATCTACCTATTGATCTCTAATTAGTGGATTCTCCCTAGTGGTGTCGGGCAACATCTACTTGTTCCTAAGATCAGTAAACTACCTGTCACTTTGCATTTCCAGGCTAGAACAATAAATTACTTGCTCACCCTCATTAGGTTAGAACAATTAAACTACTCATCTTAGGAATAGTAACTACATATTATGTTGCCTTTTCAGGTTAGTACAGTAAAATTATATGTCACTATGCCACTCAGAGGTTGGAATAGTAAAAAGGCCATGACTCTCTCCACTAGGCACCATTGATAGCACTTCAGAAGTATTCCCTCTATGAGTCCCGAAAGCATTTATGGATGACAATTAAGTCATGTGTAACAAGTAGACCCTAACCCCTGTTGGTTAGTTCACGAATAGGGGTTGCGCCCTATTCGTCCCTACAAGGTACCTGGTTAACCCAAGGAAACTATGGAACTCATGACTAGGTACTAGGACCAAAATATGAATAGGGATTGCACTCTATCCGTCCCTACAAGGTACCTGGTTAACCCAAGGAAACTATGGAACTCATGACTAGCTACTAGAACCAAAATACGTGAAATTGGAACTCATAGTCTATGGATACCACAACTAGCACTCTATGACATATGACGATAGTCCAACTCATAACTGGGTAATAACGGGCTATCCAAACCCTAAACCATTCATATAAACGTATAAAGACATACAACATGCTTGTCATTGCTCTTTAAGTGATAATAAATCAACCTCAGCATACTGGAAAGCATAGAAGCCTAATGATCTATCAAAGTTACACAATTATGCGTGTGGAAGTTATATAACGGTAAATAGAATTAAGTGGTAAGTGTTCACATTATCTAAGCACAATTATCATGAGACTAGGTCAAACTAGGCTCCTAAACATAATAATCAATCATTGCTCTCGATCCTAATTAACTCCTACAAGTATTTCTTAGAAACTTGCTCCATATGGTTACTTACTTGGTTGTAGACTTCCCGTGCTTGTTTCGTATCTAAGAGGATATTCCAAATTTCTCCAAAATTCCTCAATATGCTTGTTGACTGTACGTTCATCTTCTCCAACTTGCGTGCGTGCCATTCTGCTTTCCTTCTTTTTTTTTTTTTGAAAAAGATCTTTTATTTATTTTCTTGAAAAAAATCTTTTATTTATTTTCTTAAAATTCCAGGTGTTACATGAATATA
TTGATTGTCCTTTTATTTTGTTTTTTCTGAATTGTATCAATATCTAATTATGGTATTGGTGAGATGGTCTGATGGTGCCTCATCATATTGGTTGCATTTTGTGTTCTTTCTATTCTCTTCTGTACTTACCTCAGAAAAAAGGGAAGAATTTGGATTGCTTAAGATATGTAGTTGTTTTGGAACAAGAAAAAACTCCTCATTGATGATGAAAAAGAGTAAATAATGTTCAAGAAATCCAAAATCCTTAAGGGAGTGAAAAAAAAATACAAACAACAAATACAACCATATAGTGATTAGCCATGAAAATAAATAAACATTTTAAGAACAGAGAAAATTTTCCAATATCTTTGCCTGAAAGAATGATATTGTTTGGAGTGTATGTACAAAAATAAACTAAACTTTCTAAGAGAAATATATTATTTAGAACTTGGATCCATCTATCATGACGTTGTTAGGCCAATCTTGAAAACATTGTTTGGAGTAAAGGGCTTCTTTACTTGTTATTACTGTTTTAGGAATTTCGTCTCATAACGTCTTGGAAGAATAAACGTAGTTATCAAGGAACATAATATCTCATATTGCCTTGCTTGTAAATTTAGTCAACAAGAACTTCCTTGCTTGTAATTACTACTCTAGGAATTAGTCTTATAATCTTGCTACCCCCCTCCACCCACACACACACTTACCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCGGCCTTATTCAGACATTGTATTCTTTTCTTTTGAGGCCTTATTGATGCAAAGTTGATAAGATTGCCTAACGGATATGGAACGTTCTATGGGGATGTGGGGATGCTTGTTTGCAGGCATGATTTGGGACAATATCAGGAGGCTCTTGAAGCAAATGGCATGTAGCTTGTCATGCTGGTTTATGATTGAAGAGGTGTAGTCTTGTTGATGTAGAACTAAGGGAAATTCAGAATTGTATTCAATTGAAGGTAATGATACAAACTGGAAGCTGACCATTTATAGGTCACAAGTCAGCAAACAAATCAAAACTAACCCAAGGCTTCCTAACTAATTGACCAATTCGGTTGAGGGGGTAAATAAGAGAAAACTTGGGTAAACTAAAGAAACTAAAGAGTACAAGAATGCCAGATCTAGTGAGAATTAAAAAAAGATGCAAGAACTATGAAGAAACTCAAATAAAATATCTTAAAAAGTACTTCCGAAGGGATTATGATTAATGGAACTTAAATCTTCAAT
ATTTTACAGAGAAACCTCCCTTCACACGATTTATCATCCTCAATTTGTCATTTTTGTTTCTTGGATGTGGATTGTCTTCTGCATATTTTCTTTGGTTGTTCCTAATTGCAGTTATGTTGGTTTAAGTTGTTTCCGAACATTTCTCATTAGCCCAAAATTGGCTCCGAAATCTATGCTTTTGTGGGGTAATACAGTAAAGCCTTTTTTTTTTCCTTGTCAAAATTATGTTTTGAAAGAAACCGAAGGCTGTTTCACAATAAGCATTTCCTTTGGCTAGATCATTTTGAGTCAGCTCATCTCAAAGCTTCATCATGGTGTTCTCTTTCCAAATATTTTGTTAATTTATATTTTCAGGATATCAATCTTAATTTTGGAATGCTTGTATATATCCTTTGTAGATTTCTTTATTTGAGGCTTACTATTTCTTATTGCTTTTATTCCTTCCTTGGAGTTTGCATCCATTAAGCATTAGGCTCTTTTCATTACATCCTTTTCGTTTCTCTAGATTTGGGAGGGTACAAATTATATTGCTATATGTTAAAAACTGTGCAACTTAGGTCTTCTTGTATAGAAGAGAGATCTATTACATACATGGAAAGAATAATGCAATTAAGACTAAATATTGACCACTAATGTTTACACTTAAATAGTAGAATTCCAAGACTTCCCTTCAAGTTGGTTTAAAGATACCTTCCATAGTTAGCTTGCCAATTCATTTGTCACATTACTTTTGGGAGTCCTTTAATTGATACATCAACAATTTGCTGTGACGTTGGAAGGTAAGGGATACATATTACTTCTGCATCAATCTTCTTTTTAATAAAAAGCTTATCCACCTTAACATTTTTGTTCTATCATGTAAACTGGATTATGAGCCATCAAAATGATAACTTTATTGTCACAACAAATCCGTATAGGCGTCTTTTGAAGGAACTTCAATTCTCCAAGTACCCTTTTTTGTCCGTATACCCCCACAAATTCCACGTGCTAAAGCTCTTGTATGCTTCAGTACTGCTTCTAACTACCACATTTTGTTTTTTTGCTCTGCCACATAATTAAGTTCCTTCCAACAATAGAACGATAACCTGAGGTAGATCTTCTATCAGTAGTACTTTTTACCCAATCTGCATTAGTATAAGAGGGTGCTTATTGAATAAGATACCTTTCCCATGAGTTTCTGTAAGTCAAAATTTCAACTTCAAAATGAACTGGTTCATGCATACAACATGTGTAATATCAGGATGGGTATGGGATAGATAGACTAACCCTTCAACAAGTTTTTGGTATCGCTCCTTGTCCTTTATTTCTTCTGCTTTTGCAACTTATAATTTTAGATGGAGTTTAATGGGGGCTTTCTGCAACCTTACGTTCAAGTAGTCCGGTTTCTTAAAGTAAGTCAAAGACATTTTCTTTGATTATAAAGATACCCTTTTTTGATGCTGCAAATTCCATTTCTAAGAAATATTTTAATGTTCTCAAATCCTTGATCAGAATAATGTCAACTAAGTATCGTAAGTAAATTATTCCTGATTGTTCAATCATTTGATAACCAGAGTCATTCTTTGATAAATGATCACTATGCGTCAGACTCAATAGAATTTGATCCATTTTTTTTTTTTTTTTGTTGTTAAGGTGGAGAACTGAACCAAGAATTATCTTTCTTTGAGACTAGGATTCTATTTTATCATAAATCCAATCAATGTTCACATTTTTTTCTTATCAATGAATATATCTCTCTGACTTAGATGTCTCGTATTTCTAAAAAAAATGATTCAATTGATGGGATTTGGTACGATACTTGTGATATTGATGAGATTGATATTCTGATCTTTCTTCTTCGAACATATTGATTTGATCCCATAAGAACGACCGCCATCCCATAACATGTTGCCACCAAAAGGAGTACCCATATTTCTTCTAGAGAATCTCCTAATTGTTCCAAGCAAACTAGAAAGAGATTCTTTAACCAGAAAGAATTTAGTTTAGATATGGGATACCTA
TCTAGAAGTCTTTGCAACTCAATCATTTATGATGGAATCATCAAAGACTTGGCCTTTTCGAACTCTATTTGTAACTCACTATAGGCTTGAGAAACAAGGAGAAGATGTGTACGATTTCTCTAACAAATTTTTTCTTGGGTTCAACTATACTTGTTTCATCATTACCTATAAGAATAATATTATCCACTTGGAGGATCAATATTGAGATTTTATTATTCTTCTAATGTGATGAGGAAGGGGTTGGAGAAGAAGCTAAGTAATGAAATAGGGAAATTTAGAAGGATCCCTACCGTTTGAGAGACCCCAATCATCAGTCTCTACAATCAGGAGAGAGAGCAACCGATTGAAGTAAAGACAAGAGAGCATCTTAAGGTCACTTAAACGGATGATAAGGAATGAAAGAAACTCCAGGATTTCCCCAGACAAAGAAAGATCTCCCCAAGAAGTCCTTGACTCGTTCCCCCATATGGCTTTTTATTGGTGTAAAAATGTTCATCCTCATAGCAACTATAGTTTGAATCCTTTCATATGCAATTGTTGAAGTCTTTTGTAATCACTTTGGATGGATCTTACCTTTTGTATCATTTCATTTAATCAGTGAAAGTATTTCCTATACAAAATAGTAAAGGCTCAGAAGCTACTCCTGAATTGAGGTCAAGAAGTTTTGGGGACCTGTATAAAGGATTCGACACTGAAAAGCAATAGGGGGCATAAGAAGCCAAAGATACTCCTAGAATCAAACGCTTGTATCATCTCCCACAATAGATGGAAGAACGAAAGGAAGCTTGGAAAGCAGAAGGGGATTTCCTTCCACAAGTTTCTAAAAGTGCCTATAGAAGCTAAATTGGAAACCTAGCTGTTGAGATGACACCTGTATTTGCTCACAATTTCCGAATGGCATAAAGTTTAAGGTTTTGAAGGGACCTGTCATGTCCATTATGAGGCAAATGCTTTTGTTGCCCTACTTTCCTATCCCCAAACTACCTCCTTATTAGGTTTCAACACCACACTCCAATTCACTGAATGATTTCCTATCCTCAAACCACTCCCTTGTTAGGTTGCCACAACATACTGCAATTCACTAAATGTGACCCACCTCCCTCCTCAACCCCTCCCTGCAGAAATTCCCATGTGACTTTTCCTTGATAGATCCTTGTATGTTGAAAAGAAAGAAAATGCACGGGATTTCAGGGTAGTCTTCCCCTTCAGAGAAGAAATCGTTTTTTTCCTTGAAGAAGCTCTTTTAAAAATTTCAATTACCTTGAATGAACTTTAAATCTTTCACCTCTGAAGGTTAATTTTCTTTTGTAAATTTCATGCATCAATGAAATTTGTTTCTTATTAAAGAAATGTTTAAACTTAGTTCTTAAACGCAGATGCTAGTTCAATAAAGTTGTTCATTCAGTACCAATATATCCTTATTTTTTAAAAATAAAATTCAATTAATCTAAAGTCCCCATTGAACATATTATTGGACTTGGTGTAGCCTGATGCAGCAATGGAAGCAAAATTCATGGAGTCTCCTTTTTGTTGCTATTATCACCCTTGGGGTCCTACATTATGGATATGTCTCTTATCTTGGTACTTACAGAGAGTTGCTAATCCTGTTTATTTCATGCAAAACAGGCATTAATCAACACGATCTCAAAATTTTAGAAAGTCACGTTGCCTACTCTCTAAGTAAAGAAAAATCAGCAGTCTACTTCTACATGATGCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCAATAAAAGATGCCTTCGACAGGTTTCATTGTTTTCCTTTCACGCCTGCCCCCTTCAATTCTGTGTCTTGGTTTGAATGCATGTTCCAGGAGCTCATGTGAAATGAGCATGGATGAAGGTCAGTCCTCAAATGCATAGATACCCTTTTTTGCTCTTGATTTTTCTCTTTTCCATGGTATTAGTTTGCAGGATTCGTTGTTTAAAAAAAATGGTAGGAGATGGAGCGTTACTTCAAAAGTTGAGTA
CTACCACATTCTTCCTTATGTGAAGATGGTGCTAACCTGGTTTCGTAGGTATACTTTTCATTGCTATGGATATTATTTTTGATGTTGGTATTTATATATCGGTCTGCATAACTTGATACTTAAAGCCTGATTCTTGATTATTGTGACCTTGTTTATTTCTGCTTCCTGGTGCCGCTGTGTCCAGCCTCTTGGCTCTTGCTAGTTGCCAGTAATGTGAGGTTTAACTATTATTGGATATTAGTTGGAAAATAGTAGAAAGAAGACTGCCAACCCCATAAGCCTCTCGTTCAGTAAGGGGTTGAGGAGATCTAAGAAAGGAAGAATTATCAGAAGATGGCAAGAAAAAGCTACTGATATCCTCTTATCTAAAAAATGATACAGATGAGGAAACAAGGCGCATGAGGTTTTTCCCCACTCAAAAATCCTCCACCAGAGAGTATCAGGCTCCTTGTGGAAACGCTACAACAGTTTTGCCAAAAGAGTCTCATTACATAATTACAGAACATCAGACATTAGCAGAAGTGGTTGGCAGAGAAGTCTCCCAGAGGTGGAGATAGAGATTGAACGAAGGTGGGGAGTGAGAATTGGGGGAGGGGAAGGAGTTTTCTAGAGTTAGTGTTTTTTTTTTTCATTACAAACTTTTTTATAAATTTTAATTTGTGGGCTTTTAATGGACACTTTGTGGCATGTTTCTTTACTGGGTTTAAATGGGCTCATATAGCTTATTGGGTTTTATGTTTTTGTAAATTTTAAGCTAAATTCAAGGACCAATTAAATCTTCATTCACATTTTTTTAAAAAAAATTAATTATTTTTAATATTTAGTAAAAAACTGAAGTATTTTGGGTTTGATCTTTTAATATATTTATGAACTTGTCCCCAACGTGTCCATATCCTATGTTTTAGAAAATTGGTGTGTCGTAATGTCATGTCACGTCTATGCTTCTCAGGCGAAAGCTTGTCCATGCAGTTGGGTTTTGGAAAAAGTCTTTCCTAAGTTGGTTAAGGGGAAGAAATTCTTGTAAGGAAAGGGTGGAAGTGCATTTTCTTCTTCTTATTCTTCGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAATAATTAAATTAAATTTATATAATTAATTAATGAAATTTTATTTTTTTCTGGACGAGAAGCTTAGCTTTTATTATCAGAAGAGAGTGAAAAAATGGAGGGAGACTTTGTTTTAAGTGGTTCTTGCGAACTTCTCCAACTAAAAACCGTCTTTCCTTACAGTCTATCTTGCTTGAGGATGGGTAGTGTAGAAAATTGACCTGTAAATTTGAGATCTTGGCTGCATATCTTTGATTTCCAGCCGTTGCTTAAACGTCGTTTATCCTAAATCTCCTGGTTTAGTAGTTTGCAGGTGTGGAGAATGCGCTCCTTTGTATGTAGTGTTTTCTTATTATCTCGAAGTTCTATATTAGGGAGTAATTACATATTTTAGAGCAATGGAAGTTCTTTTTTGCCTCATAATGGGAAGCATTGTTATTCATTTCTGTCTCAATACGGAGAAGTGATTGCATTCTAATATCTCTATCATCTATTTCACGTAGGGAAACTTTAACAGATAATTTGGGAGTCGTAGGTGGAGAAAAGATTGATGAAAACCTGAACAAGCCTAAGAGAAAAGATGTAACCAGGAAGCTTGGAACTCAAAACAATC
AAGACGATGCTACTACAAACAATATGAATAAAGGCACTAGCATTTATGATGCAGTATTGGAGAGATTGCCTGATAAAACGAACTGTATGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTCCTAGTGTGGATGACTTGGTTCCCTCCAATCCAATGGAGAAGAGGAAGGGTGTACTGACTCCCACCCAAGTTATCATGTCATATGTAAAGAAAATACATGGTAGTCCAGTTTACAATCACTATGAAGCAACTATCCCATGTTCGGTGACTGGTAGGCAAGTTTACAATCACTATGAAGCAACTATCCCATGTACAGTGAATGAATCGAAGGCTTCAGAGAGTGGTATCAAAGTCGAGGTAAGATGTTGTGCGTCTCTTCACATACATGGTCTTGATTTTCTATTTATCTTTAGGATCTTAAGTTACATATTTTGCCTGAATTTTTATTAAACATTAGGACTTCTGATGCTTATGGGGACATTGACTCAGATTTTATTCTCTCCTATAAATTTTGGCTTAAAGCATGCCTATATTTTAAGAATTATTGTGGGTTCATCCCTTGAAGGATCTGGACATGATAATTTTTGGAGCAGTTTTTGTATGCTTTATTCACTCAAGATGAGATTCTCCGTTGGGACCTATGGTCCAAGGGACAGTCTTGAGGAGGGGTCAACGTGGTTACTCTATGTTTTTGACACCACAAGAGATACTATTTGGCTTTAGGTATCATTTTTTTAGTGGATTAATTGAAGAATTTGCACATTTAAATCCCTGACCTTTTTTGATAATGTAAAGAACTATGTCAAGTGTAGCAAGGTAAATAATTAAAAAAATTGGAGTTTTTAAAGTTGTTAGTCCTTAAAAAATGATTCGTTGCTTCTCAATTAATATAGGTACACAACTTTGAAAATACTTGAAGGAAACTGGAAAAAGAAAACATCTAGAGTACAAGATAACCCCTGTAAGCATAGGACATGACCAGATACTCTTTATACGTCCCAGTAAATTACAAGATAACTAAGTAATCCTCTATTTTCAACTGCATAAACTTTGATATAATTAAACAGAAAAACTCTGTTATAGAACTAGTGGAAAAATTAAACCATTGTTAATGCATTGAAATGAAGGATGGTGAAGACGTGATAGTGCAATCTAGCGCAAAAGAGTTGGGAACTTTAACTCTCACACAAGATCCTTGTTATTATTCATTTAAGTTCATTGACAAATCTATTATAAATGAAAACCAATTAATTCCAACTATTATGTAAGTTTTCGGATTCAGTATTCTTTTAACATAGAACCAAAGAGTTGTTGAATTCAATTTTTTGCGTGCCACATCTTTACTTCAAAATTAATTATCTGTATGATGAGCTTGTTAATCCAAGGAAAATTTCAATCTGTACTTGACAGAATGCTAGTTATTAATGTTGTAAATTAATAATTGTCATAATAACTCAGATTTTTGAGATAGTACAATCTGTAGCTTTTAATGAATTAATTTTTGCTTTATTTTATTATATTTATAATTCAGCTGATAGAATAATTTACTTTCCAGGATGGAATACTAGCAACAAACCCGTGTATTGCTGAAGGCAGTGGTGAAAAGGTTGCTTCTGGCAATCTCTCTGACAATATTTCAGATCAAAATAGGAATGACGATCATGCTCTCATCACCTGTCAATCGAACGCAAAGCATCTTTCCAAGATGCAGGCAATTATTTCGAAAGAAACAGCATTGTCACAAGCTGCCATTAAAGCTCTAATTAGAAAGAGGGATAAACTGGTACAAACACACATCTTATATTGTGAAGTTTTGGGCTTATCCTTGTCTCATGAATTTATTTTGAACCAGAAACTTTTAAAGATAGCGTGTTGCAAAAAAATGAGAACGCGTAATATATAATAAATAGCTGAGAGCTTCCACTTCCACCACAATTGTAAAATAGAAATGGGCGAAGTGATTTAAATCTCACGTCTCCTGCATCTTTGTTGATG
AGATGGTAGGAAATTAATTAAGCTTGCTCTGGTCTCTTTTTGTATGTCTCACCTTACAGTGGTATTCAGCTGGCCCTTTCTCTCTGAACAAGGGTTTGTCATAAATTGAGCCCTTTTGAATCATCTTACATATCGTTATTTCTGACAGTGTAATCCAATCATTGTCTGCAGTCTCATCAACAGCGCATAATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTATGTTCTATTTACTTTTTTCTTTTTTTGTTCCTCCTGTTCTTGTTTTGTTTCTACAGTTTATTCTATTCAATATGCTTCATGTAAGAAAGCCTCTAGACATTTAGATGATTTCCTTCTTCAAATACTAGCAGTTCTCAGTCCTCTTTTTTGCTTCATTTTTACTTGATGAACTCTTACCCCAGCTCTCTACGATTTATGGTCTTATAAGCCTGTGGTGCATGCTATATAAATGATTTTTCAGTATTGGCTTGAGCCTATAACTCGAAATGTCGTTATAATTGGTAGTCTAAGGATTTTGTAGATCCTGAGCTTCAAATGGAAGTAATTCTAAGATTTTGTATTTGGTATTTGTCATTGCTTTAATCATTTGATGTTTATACTGGAGCTAGATTTTCAATTTTATATGTAAACTGATGAATTGTGTTATATTAGAACAGGTAATATTAGAAACTCATTATTGAATTAAAATATGCATTGAAAGAAGTTTGAAATTGTTAGTGGTGCATCCTAGTATTTTAGATAAAAGAAGAATATTTGATAGTGCTGTTATATATAAAAAAAAAAATGTGGGATGCTAAGTACTCATGTTACTGTTTTTTTTTTTTTTTTTTTTTAAATTAATTAGCACCAATGTACAAGTAGCTAACACCATTGGAGATGCTTTACAACGTTAAGGAGATGCTTTAAATTTTAACTTGAGTTTTTACTTTATTCCTAGTTTTGCTTATTTGATTCCATAATTATTTTTAGATTTTATTTTGATATTTTTTCATCAGATCAAGTTAATCTATGTCATCATAACTAACCAAAGAGAATGTCTGAATTACATTCACATTTTGGTTGCTAAAGGTGATGAAGATGATTTTGTTGTAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGTCTAAGAAGCGCAGCTGAAGACAAACCTTATCAATACTCTGAAGAAAACTGCTCATCTCAACTAGTCACAAGGAAGAGATTGTCAGAAGAAATTCTCTGCATACGGAATCCATGTCAGGTGGGTTAACCATTGAAGATAATAATATTCAAAAATTTTATATTTGAGGGTATTGTGGTTGTATTCGTATATTTATGTCTTAATGCTGCCCAAGTTATATTTTTTAAATTAATAGGATCTCATCATTCCGTACAATAGATTTAAGTAAAATAGATCAGAGGGCCTTGAAAGGAACTCTTCAAAAGAATTGCCATGCTAAAATTATACAGCACAAATAACACAGAAGACATTATTTCGAACAACTGGTTTGGAAGAATAGTGATGTTAAATAGCTTTATTATGGCTTAGTATACGAGTTGGAAATATGTGGACTTAAAACATCGAGATTATGATGTCACTTTTCAAAAATTTGAGATACACATAAGAACTTATCATTAAAAGTTGAAAGTAGGACTTGTATAGACAACTGGAAAGTTTTTTCTTCTTTCATTTTTTTTTTTGGGTGGGGAGGGAGTGAATTTTTGAGTTCTTATTTTTGTCTACTTTATACATATGACTAATCTATGCAAGGAAAATGCTGCCTCAATATTCTTTATGCTACTGGTCAGCACCTATTTTTCGAGATCCTAGTTTTGTCTGCATGTTTATATTTATGTTTTAAACTTCTTTATGGTCCCTCTCATACGAAATAGATTTCCAACTTTTCTCTTTTTGACTTTTTCTTTCTGTAGATTTTCCTCTCAATATATTTTAGAACTGGGAAAGGACACCTACTATGTCTTCATTTA
AAAAAAAAATCTTTATCATGGATGGAGTGTGGTCCTCAAAGAAGAAAAATTGATAACTTAGTCTATGAGCTTAGTTACAAATACAAATTTTTCTTTCTATGGGTAATGTTTGGCCTCCAATAATAAGAGTGTACTGGAGGAGGGGTAAAAAGTGAAACATCCTCTGGAAGTTTAATAGGATGAATTGCCCTTGTATTTTAGGGGAGTGATGTAAGTGTGAGGGAGAAGATGAATCTTGTGAAGTTTGTATAGGGAGGGAAGCTGGCCCTCAAGTTTTGTAAGGTGCTTGGTTCACCTCTTTCTTTGTGAATAGTGATAATATTAGTGTTCATACCTACTTTCCTTAAAATATACTTTCTTGTATAAACAATAATTGTCAAGGATATTTGTGTAATTCCATAAAATATGGGTTAGCGATTATTATTTAGATTTACTGTTACGTTGTGTTTAGGGATTTATTTAAATTTAATAGAGTATTTAGATTATTTAGTGATTTAGATTTAGTTAGTCTCTCCCTCCCCTTGTAAAAGAGACATGTAATTCTCTGAAAATAAATAAGAAAAAACAGCAAGCTAATTTAGACTCTCAACATGGTATGAGAGCATCGTATTAGATCAGAAAATCGAAGGGAATTTTCATGGTGTGTTAAGTAAGGATATGAAGACGTTGAAGTTCCGGATCCCTTTGTACAAAAAGTTGTCCTTGGCTGGAAGCACCATCCATTTGGTTGTCGGGAAGTGAACTGGCGAGGAAGAATGAGTTCGAGTTGCCAAATTTGTATCGCCAAAGTGTTTTCTATATTTTGCTGCATTTTTTGAAAATGTAGTGTTAACGAAGCAGTTACTTACACATTGGTGGATTGGAGAAGGCCGTCTGGACCCTTCAGGTAGTAGGGATCAAACACCTGAAGTTAAAGAAATTTACAATGCCCCCTATTTTGCATTCTGCTGCAATTATACTGGCCGAAGAAGAGATGTTTTTTTAATTATGATTCCGGAGGACAACTCATTTGGTAAATCTTCTGGCTGCTCATGATATTACAATCTATAGCGACCTATCTGCACTTGCAGCTTGGACCAACAGAGTTAAAGACTTTAGTTTTGGATAATGAGAGTTTTCAGTTCTATTTGGATTCAGCTATCGATTATCCACTTGACCAAATTTATTCGGAGCCTTCATAGCTACCGGTCTTCCACTTGACCTAATTTATCCAGAGCTCTCATAGCTACCAATTTTTCCCCTCGACCAAATTTATCAGCTATCATTGACAAATTTATTCGGAGCTTTAGCTAGTTGACAAATGTATTTGGAGCCTTCATAGCTACTGTTGAAGTCATGTCTTGATGTTTAAGTCATGTCATCGAAGTCATGTCTTGATGTTTAAGTCATGTCATCGAAGTCATGCAATCAATATTGCACATTAACCTAACCAGTCCAAGCTAAGAATGAATGATATTTATTACTCAGCTTGAGAAGGTGTCGAAGGAATGATATCTATTACTCAGCTTGACAAGGGGTGTCAAAATTTATGTGTAATTCATAAAATAAGGGAAATAGTTGACCGTTGGATTTATTTAAATTTATTTAAATTTAGTTATGGGGATTAACTGTTACCTACTATTTCTTTACATTTGTTTTTGGATTATTTAGATTATCTAGTGATTTAGATTTAGTTAGTTGTTCAAGAGGATAGGTAAGTCTCTGAAAATAAATAAACAGCAGTCTAATTTAGACTTCAACAATAACGGTAGATTTCAACTTGCTCTCTTGCCTGTAATTCAAACGCTAACTTTTGAAATTGGTTTAACAGATCCTTGTCTTTTGGGTTCTTTAATTTAAACTGTAAAATCCATGTGGATTCAGTGGATTTTCGTTTCGTTAAGAGAGAGTTTCCTTCCTAGGACAAGAGACACGTATTAAATTCAAGTTGCAATGTCCTGAGTAACAAATCTTCATATTTTTCCCAGAATGCAGCAAATGGATCATAAAAACTTGTAACT
TGTAATACCATTGATTAGATACGCGGTTGAGGATGCTGCTAAGATGGTATTTGTTATGTATGGCAAACTTTGGCTAATAAAATGTGTAGTGTATTGATATATCTAATGCGTTAACGATATCATGTACGTCAAACTTATTGGGACTGGTTATCAATTTGGACTTGCCTAATGCACAGGGAAGATTGAACTTCAAAACTTTTGAAAAGCAAGTAGCATAAGCTTTCAGATACTGTTTCAAATAGTGTTGGGATGTAAATTTGATGTTAAATAATCTAAGAAATGGTAGTCTGCTATACATTTTTATTTCATTATTTCCACATTTGCTAACATTTGCAGCTGTAAATGTAATGATGCTCTGTTGGTTGGCAGGAACTGGACGATATATGTCATAAGAATAATTGGATATTGCCAGTCTATGGAGTTTCGTCATCAGATGGTAAGATCACATGTTTTTTTCTCGCCTGCAAGTTATTCTTCTTCTTTTTTTTTTTTTTAATAAATAAACTGCATTGCAAATCATTTGGATTTATTTGGTGGATTCTCCCTGTTGCACCTCGAAATATTCCACAAATATGAAGTATGTCTCTTTCCTTCTTTCCCCTCTCCTTCCCCGTAAGGAAATTTCAATCGTGTTTCATCCTATTCGCTTCCATTCATCCGTGGGTATGTTGCCGCTTCTCCCTTTGTGGAGACGGCCACCGTCGTCGAGTGTGATCAGGCTGCCACCTCAGTTCTCGGTTTAAAGCCTTGCCATCTCAGGTTGAACATGAGTTTGACCTTCATAGGTTGGAGAAGGGGTATCCTTTGATTATTGGGAGAATTAGTATTCTTCATGATAGAGGTTGAGAGGCTCATTTTGATGGGGGATGTTCCACTTCGCTGTGTGGTTGATGCCATTTTGGGTGCCTTGGGGCTTCTTGATATTGGAGATATCTTTCCAGATTCAAATCTGGGTACTCCGTCATTATTTTATTTATTTATTTTTTTGTTGAAGAACCTGTCTCAATTGAGCCCTCAATAGCCGACGGACTTACTGTCGAAGGAATTATTGGAAAATCGATGGACAATTCTTGAGGAGGGCGAGAGCAATTGGAGGAGTCGTGTGTCACAATCGCACTTTTCTTGGCACGGACTTTGTGATTGTGTGGCACTCACTCCTAACACCAAGTGAGTCGAGCCAATTTGAATTCGCGTAGCCTATGGCCGAGCAACGTATGCTCAAGCCTCTGACCCTGTTTGAAACGAAGGTTTTAGAAAACGGAGGCAAAAAGATTTTTGAAAGCGAGTTTGAAAGTAAGAATTAGAGAAGATTTAAAAGCAATGTTATATAACTATAAGCAAAAGGTAACACAATGGCTTAAATAACATATAACTTTCTAGGTAGGGAGAATTTAACAACTATTCGCATTCCCTTTTAGTACATTTTGAGGCGATTCAGGTTTGCCAGGCCTAGATGTGATTTTGCCGAACGATGGAAGGAGTTGACAACTAATCACACCTCCTATAGTACATTTGTGGTGATTTAGGTTTGGTAGGCCTAGACATGATTTCATCTAGTAGTTGTACAAATAGAAAATACAATAGTAAGGCAATACAACATGAGATTAAATAAAGGGTTCAACATAGCATGGACATGAACATGCGGCTATGGGCAACACGCCTTGGACGAGCATGGCCATGTTACCGTGCTGCCGCAATTGCCCAATCCATTCATCTAATCAAGAAGCGTTTAGAGTGAAAGAAGCTGATAATGAAATTGGAGTTGAATAAAACTGATTTGTGGGATGAGGCATGGTTCTATTATGGGTGAGATTGAAAAGAACAAGGCATCAGAGAGAGAATTGCTCGAGTATATTGACAGGATAAGGCTGGCTAGAGAAGAGAATGATGTAGAGTTGGAATCGTAGTCGGTAAATACATTAGTCAGCATGAGAAGCAATTCAAAAGAGAAAGAGCTTGAAGCTTTGTTAGCGGGGCAGAACGATTCTTGCTCTTATTAT
ATTGAAGTGGAATTGAATTTCAAGGAACATATTTAAGCTATAACTATACGGCTTATACGGCTTAGGCAAATCCTGTAAGTGACTTTGCGACTTTGCTGTTGAGTGAACTGTGACCCTTGCACTTGTGTGTTATGAAGGTTTGACATGAGTCATGCTATGTGTATGCTTTGATATGTTATGCCTATGATATATGGTTATGTTATGGAAACTATTATGTCATGTCATAATTTATATCATGATATGACATGAGATGACTATGCTAGGCAGGCTATGGGATTGCATGATATTGATATGAACTTCCATATGTTATCCTACATATACTAAGATAACATGCTCACCGTTATGCTACATATACTAAGATAACATGTTCACCCATAGGCTTATGAGATACTGTTCTAGTATAACATGTTCACCCATAGGCTTATGAGATACTGTTCATGAGAGTTAGGTTTGTGAAATGAAACGGAGGCATGACTTGCATCACAAGAGCATACCATTGTTTTAAAGCTATGGGGTCTCATGCATTTTATATTGCATGTGCATTAGGATATTTCATCGTTATGATTCATATGAATGGCATCCGGCAGCGTGAGTGTACGCTAGCTCGACAAGCCAAACCTAAAGGGTATGTTACGAGCTCACCTAGTGGGTCTATGTGCGTGTGCGTGAGCTGTGTGTAGGGAAGTACTACACATCCAACCCTGTCCGACTTGAATTGGAAAGCAAAGCTCAAAGCCATGCAATGGTTTTATGCAAAGATAGGTCCCATCATAATTGCTTGTGTTTGCATTTCCTATCCTAACCCAATAGAAGGGCTACTCACTAAGTTTTTCCTAACACATTGCAGTAAAAACCATGGAACCAAAGCTATGATAATTGCATTTGTGTCTAAATTGTAGTCCTCAGGTTAGAGTAGGGCATGAACTTTTTGTTGTAGAATTGATTTTACAAATGCCCAAGTTCGTGGCATCAATTATGAAATGTTTTAAATCAATTGTTTTGTGCACATTAGACATGTCATACATGTATTACATTTTGTGTACTTACCATTTGAAATCATTTAACAGTGTCTAATAACAGTATGGAATAGTTGTGCTTAAATGTGTTGTGTTTATGATAGTATCTATTGTAGAATAGTTGTGCTAAAATGTATTGCGTGTCTTGAAGGTGGATTCCAAGCTAATGTAATATTAAAAGGGATGGATTTTGAGTATTCAAGCAACGGCGAAGTGTGTCACAATCCTCGTGAGGCCAGGGAATCAGCTGCAATGAAGATGTTAGGTCAACTATGGAGGATGGCGGCAAGTCAGCCCTAGTTCTTGGAGCCCCTTTCATGTAAGATATGTTTTAGTATGCTCTCCAAGATTGTGCAGTCTGAGGTAGTTTCTTTAGTGGGAGTACCAATCAAGATAACTGTTCACAATTGTTCATGATGCTTTTCGAATTTAGGATTTGCCTCATTACTATTTCCTTTTTAGTTTTCTGTTCTGTTCATATTTTTGCTTGCAATTTCTTTAGCTTGATCTATTTTCTGGAAAATATAAAAACATATTTCAAGTTAGTAGCCAAATTGGAAAGGTATAGGTTTGATTAGAAC

mRNA sequence

ATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATACTTGCATTATTGGATTACTTAGTTGAACCTATGCTTCCTTCAAAGTCATCTTCAATAGAAAATCCACCACTAGCTCTACTGCAATCAGTTGCAAAACAGATGCATGCCGTTGTTTTGTTATACAACTACTACCACAGGAAACAACACCCACATCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAATTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGCTCTGATGATATTGAATTGGAAAATCCCGAGAAGCAGCTTTCTCCATCCGAAAAAGCAATTATGGATGCATGTGATTTAGCCACTTGTCTATATACATCAAAAGATGAAAATATAGAGGGCTGGCCCCTTTCCAAGGTTGCCGTTTTTTTGATCGACTCCAAGAAGGAGCATTGCCATTTGCTGTTTAGTTCCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAAATTTAGATACCTATGAATGCCAACCAAAAAGCGTGGAAGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGTTGAAACTAAGACTCAGCAGCTTGCATATTCAGCAGTCAAGGAAGCAACTGGCATTAATCAACACGATCTCAAAATTTTAGAAAGTCACGTTGCCTACTCTCTAAGTAAAGAAAAATCAGCAGTCTACTTCTACATGATGCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCAATAAAAGATGCCTTCGACAGTTTGCAGGATTCGTTGTTTAAAAAAAATGGTAGGAGATGGAGCGTTACTTCAAAAGTTGAGTACTACCACATTCTTCCTTATGTGAAGATGGTGCTAACCTGGTTTCGTAGGGAAACTTTAACAGATAATTTGGGAGTCGTAGGTGGAGAAAAGATTGATGAAAACCTGAACAAGCCTAAGAGAAAAGATGTAACCAGGAAGCTTGGAACTCAAAACAATCAAGACGATGCTACTACAAACAATATGAATAAAGGCACTAGCATTTATGATGCAGTATTGGAGAGATTGCCTGATAAAACGAACTGTATGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTCCTAGTGTGGATGACTTGGTTCCCTCCAATCCAATGGAGAAGAGGAAGGGTGTACTGACTCCCACCCAAGTTATCATGTCATATGTAAAGAAAATACATGGTAGTCCAGTTTACAATCACTATGAAGCAACTATCCCATGTTCGGTGACTGGTAGGCAAGTTTACAATCACTATGAAGCAACTATCCCATGTACAGTGAATGAATCGAAGGCTTCAGAGAGTGGTATCAAAGTCGAGGATGGAATACTAGCAACAAACCCGTGTATTGCTGAAGGCAGTGGTGAAAAGGTTGCTTCTGGCAATCTCTCTGACAATATTTCAGATCAAAATAGGAATGACGATCATGCTCTCATCACCTGTCAATCGAACGCAAAGCATCTTTCCAAGATGCAGGCAATTATTTCGAAAGAAACAGCATTGTCACAAGCTGCCATTAAAGCTCTAATTAGAAAGAGGGATAAACTGTCTCATCAACAGCGCATAATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTGATGAAGATGATTTTGTTGTAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGTCTAAGAAGCGCAGCTGAAGACAAACCTTATCAATACTCTGAAGAAAACTGCTCATCTCAACTAGTCACAAGGAAGAGATTGTCAGAAGAAATTCTCTGCATACGGAATCCATGTCAGGAACTGGACGATATATGTCATAAGAATAATTGGATATTGCCAGTCTATGGAGTTTCGTCATCAGATGGTGGATTCCAAGCTAATGTAATATTAAAAGGGATGGATTTTGAGTATTCAAGCAACGGCGA
AGTGTGTCACAATCCTCGTGAGGCCAGGGAATCAGCTGCAATGAAGATGTTAGGTCAACTATGGAGGATGGCGGCAAGTCAGCCCTAGTTCTTGGAGCCCCTTTCATGTAAGATATGTTTTAGTATGCTCTCCAAGATTGTGCAGTCTGAGGTAGTTTCTTTAGTGGGAGTACCAATCAAGATAACTGTTCACAATTGTTCATGATGCTTTTCGAATTTAGGATTTGCCTCATTACTATTTCCTTTTTAGTTTTCTGTTCTGTTCATATTTTTGCTTGCAATTTCTTTAGCTTGATCTATTTTCTGGAAAATATAAAAACATATTTCAAGTTAGTAGCCAAATTGGAAAGGTATAGGTTTGATTAGAAC

Coding sequence (CDS)

ATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATACTTGCATTATTGGATTACTTAGTTGAACCTATGCTTCCTTCAAAGTCATCTTCAATAGAAAATCCACCACTAGCTCTACTGCAATCAGTTGCAAAACAGATGCATGCCGTTGTTTTGTTATACAACTACTACCACAGGAAACAACACCCACATCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAATTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGCTCTGATGATATTGAATTGGAAAATCCCGAGAAGCAGCTTTCTCCATCCGAAAAAGCAATTATGGATGCATGTGATTTAGCCACTTGTCTATATACATCAAAAGATGAAAATATAGAGGGCTGGCCCCTTTCCAAGGTTGCCGTTTTTTTGATCGACTCCAAGAAGGAGCATTGCCATTTGCTGTTTAGTTCCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAAATTTAGATACCTATGAATGCCAACCAAAAAGCGTGGAAGAGGAAAAACATGTAAACAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGTTGAAACTAAGACTCAGCAGCTTGCATATTCAGCAGTCAAGGAAGCAACTGGCATTAATCAACACGATCTCAAAATTTTAGAAAGTCACGTTGCCTACTCTCTAAGTAAAGAAAAATCAGCAGTCTACTTCTACATGATGCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCAATAAAAGATGCCTTCGACAGTTTGCAGGATTCGTTGTTTAAAAAAAATGGTAGGAGATGGAGCGTTACTTCAAAAGTTGAGTACTACCACATTCTTCCTTATGTGAAGATGGTGCTAACCTGGTTTCGTAGGGAAACTTTAACAGATAATTTGGGAGTCGTAGGTGGAGAAAAGATTGATGAAAACCTGAACAAGCCTAAGAGAAAAGATGTAACCAGGAAGCTTGGAACTCAAAACAATCAAGACGATGCTACTACAAACAATATGAATAAAGGCACTAGCATTTATGATGCAGTATTGGAGAGATTGCCTGATAAAACGAACTGTATGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTCCTAGTGTGGATGACTTGGTTCCCTCCAATCCAATGGAGAAGAGGAAGGGTGTACTGACTCCCACCCAAGTTATCATGTCATATGTAAAGAAAATACATGGTAGTCCAGTTTACAATCACTATGAAGCAACTATCCCATGTTCGGTGACTGGTAGGCAAGTTTACAATCACTATGAAGCAACTATCCCATGTACAGTGAATGAATCGAAGGCTTCAGAGAGTGGTATCAAAGTCGAGGATGGAATACTAGCAACAAACCCGTGTATTGCTGAAGGCAGTGGTGAAAAGGTTGCTTCTGGCAATCTCTCTGACAATATTTCAGATCAAAATAGGAATGACGATCATGCTCTCATCACCTGTCAATCGAACGCAAAGCATCTTTCCAAGATGCAGGCAATTATTTCGAAAGAAACAGCATTGTCACAAGCTGCCATTAAAGCTCTAATTAGAAAGAGGGATAAACTGTCTCATCAACAGCGCATAATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTGATGAAGATGATTTTGTTGTAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGTCTAAGAAGCGCAGCTGAAGACAAACCTTATCAATACTCTGAAGAAAACTGCTCATCTCAACTAGTCACAAGGAAGAGATTGTCAGAAGAAATTCTCTGCATACGGAATCCATGTCAGGAACTGGACGATATATGTCATAAGAATAATTGGATATTGCCAGTCTATGGAGTTTCGTCATCAGATGGTGGATTCCAAGCTAATGTAATATTAAAAGGGATGGATTTTGAGTATTCAAGCAACGGCGA
AGTGTGTCACAATCCTCGTGAGGCCAGGGAATCAGCTGCAATGAAGATGTTAGGTCAACTATGGAGGATGGCGGCAAGTCAGCCCTAG

Protein sequence

MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRKQHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLATCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSVEEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIYDAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDFEYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
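
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below shows that relationship, assuming the standard nuclear genetic code. The `translate` helper and the codon table layout are illustrative and are not part of the database's own tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into one-letter amino acids, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons containing N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 27 bases of the CDS listed in this record.
print(translate("ATGAGTGCAACAGGTGTATGCCCAACC"))
```

Applied to the opening codons of the CDS, this yields `MSATGVCPT`, which matches the start of the protein sequence. The final codons `GCAAGTCAGCCCTAG` translate to `ASQP` followed by a stop.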
Homology
BLAST of CmaCh17G005990 vs. ExPASy TrEMBL
Match: A0A6J1JUZ0 (uncharacterized protein LOC111488583 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488583 PE=4 SV=1)

HSP 1 Score: 1380.5 bits (3572), Expect = 0.0e+00
Identity = 695/695 (100.00%), Positives = 695/695 (100.00%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG
Sbjct: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695
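
The percentages reported on each HSP's Identity line are the match counts divided by the alignment length. As a small sketch (the `percent` helper is a hypothetical name), the figures in this record can be reproduced as follows:

```python
def percent(matches, aligned_length):
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / aligned_length:.2f}%"


# Counts taken from the HSP summaries in this record.
print(percent(695, 695))  # identity for the C. maxima hit
print(percent(683, 695))  # identity for the first C. moschata hit
print(percent(687, 695))  # positives for the first C. moschata hit
```

The positives count includes conservative substitutions as well as exact matches, so it is never lower than the identity count.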

BLAST of CmaCh17G005990 vs. ExPASy TrEMBL
Match: A0A6J1EQ29 (uncharacterized protein LOC111436360 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)

HSP 1 Score: 1351.7 bits (3497), Expect = 0.0e+00
Identity = 683/695 (98.27%), Positives = 687/695 (98.85%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAE SG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSN K+LSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKG+DF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGLDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695

BLAST of CmaCh17G005990 vs. ExPASy TrEMBL
Match: A0A6J1EPE2 (uncharacterized protein LOC111436360 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)

HSP 1 Score: 1349.7 bits (3492), Expect = 0.0e+00
Identity = 682/695 (98.13%), Positives = 686/695 (98.71%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAE SG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSN K+LSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKG+DF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGLDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695

BLAST of CmaCh17G005990 vs. ExPASy TrEMBL
Match: A0A6J1JP19 (uncharacterized protein LOC111488583 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488583 PE=4 SV=1)

HSP 1 Score: 1315.1 bits (3402), Expect = 0.0e+00
Identity = 670/695 (96.40%), Positives = 670/695 (96.40%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG
Sbjct: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILR                         AEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILR-------------------------AEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 670

BLAST of CmaCh17G005990 vs. ExPASy TrEMBL
Match: A0A6J1ENX1 (uncharacterized protein LOC111436360 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)

HSP 1 Score: 1284.2 bits (3322), Expect = 0.0e+00
Identity = 657/695 (94.53%), Positives = 661/695 (95.11%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAE SG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSN K+LSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILR                         AEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILR-------------------------AEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKG+DF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGLDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 670

BLAST of CmaCh17G005990 vs. NCBI nr
Match: XP_022992174.1 (uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima] >XP_022992175.1 uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima] >XP_022992176.1 uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1380.5 bits (3572), Expect = 0.0e+00
Identity = 695/695 (100.00%), Positives = 695/695 (100.00%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG
Sbjct: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695

BLAST of CmaCh17G005990 vs. NCBI nr
Match: XP_023548856.1 (uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023548857.1 uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023548858.1 uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1356.3 bits (3509), Expect = 0.0e+00
Identity = 684/695 (98.42%), Positives = 686/695 (98.71%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDH LITCQSN KHLSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLW+MAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWKMAASQP 695

BLAST of CmaCh17G005990 vs. NCBI nr
Match: KAG6575356.1 (hypothetical protein SDJN03_25995, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1352.4 bits (3499), Expect = 0.0e+00
Identity = 682/695 (98.13%), Positives = 686/695 (98.71%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDACDLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDL+ILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLRILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKK+GRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKSGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNP EKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPTEKRKGVSTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPC VTGRQVYNHY ATIPCTVNESKASESGIKVEDGILATNPCIAEGSG
Sbjct: 421 SPVYNHYEATIPCLVTGRQVYNHYGATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSN KHLSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNTKHLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKG+DF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGLDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695

BLAST of CmaCh17G005990 vs. NCBI nr
Match: XP_022929889.1 (uncharacterized protein LOC111436360 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1351.7 bits (3497), Expect = 0.0e+00
Identity = 683/695 (98.27%), Positives = 687/695 (98.85%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAE SG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSN K+LSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKG+DF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGLDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695

BLAST of CmaCh17G005990 vs. NCBI nr
Match: XP_022929885.1 (uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_022929886.1 uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_022929887.1 uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_022929888.1 uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1349.7 bits (3492), Expect = 0.0e+00
Identity = 682/695 (98.13%), Positives = 686/695 (98.71%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240

Query: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
           EKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWSVTSKVEYYHILPYVK
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300

Query: 301 MVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
           MVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY
Sbjct: 301 MVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360

Query: 361 DAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVKKIHG 420
           DA LERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV TPTQVIMSYVKKIHG
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420

Query: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSG 480
           SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAE SG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480

Query: 481 EKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIRKRDK 540
           EKVASGNLSDNISDQNRNDDHALITCQSN K+LSKMQAIISKETALSQAAIKALIRKRDK
Sbjct: 481 EKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETALSQAAIKALIRKRDK 540

Query: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600
           LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE
Sbjct: 541 LSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEE 600

Query: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGMDF 660
           NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKG+DF
Sbjct: 601 NCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGLDF 660

Query: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 696
           EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP
Sbjct: 661 EYSSNGEVCHNPREARESAAMKMLGQLWRMAASQP 695

BLAST of CmaCh17G005990 vs. TAIR 10
Match: AT1G05950.1 (unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 335.1 bits (858), Expect = 1.3e-91
Identity = 247/691 (35.75%), Positives = 379/691 (54.85%), Query Frame = 0

Query: 4   TGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRKQHP 63
           T  CPTEDAI ALL+ LV+P+LPSK +  + P  ++ +SVAKQ+HAVVLLYNYYHRK +P
Sbjct: 14  TDSCPTEDAIRALLESLVDPLLPSKPTD-DLPSTSIRESVAKQVHAVVLLYNYYHRKDNP 73

Query: 64  HLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLATCL 123
           HLE LSFE+F  LA V+KPALL H+K        E      Q    EK I+DAC L+  L
Sbjct: 74  HLECLSFESFRSLATVMKPALLQHLK--------EDGGVSGQTVLLEKVIVDACSLSMSL 133

Query: 124 YTSKDENI-EGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSVEE 183
             S D  I    P+ +VAV L+DS+K+ C+L  SSITQGVWS++E          K +E+
Sbjct: 134 DASSDLFILNKCPIRRVAVLLVDSEKKSCYLQHSSITQGVWSLLE----------KPIEK 193

Query: 184 EKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATGINQHDLKILESHVAYSLSKEK 243
           EK   + ++      +EG+       Q++A++ VKEATG+N  D+ ILE H+  SLS+EK
Sbjct: 194 EKAARENQK------EEGVF------QKVAFAVVKEATGVNHKDIVILERHLVCSLSEEK 253

Query: 244 SAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWSVTSKVEYYHILPYVKMV 303
           +AV FY+M+CT S  +   + P+++    +Q  LF+K+   W++ S VEY+H+LPY  ++
Sbjct: 254 TAVRFYIMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMNSIVEYFHVLPYATLI 313

Query: 304 LTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIYD- 363
             WF R   T+ +     E + +++              ++N+ DAT    ++ + I++ 
Sbjct: 314 EDWFSRRGDTEFVIEKEPEAVCDDI--------------ESNKVDATKE--SEVSDIFER 373

Query: 364 ----AVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVLTPTQVIMSYVK- 423
               A+  R   K   +++L                 S+P  + K     T++   Y+K 
Sbjct: 374 REKAALKRRYEIKAKKVAAL----------------LSHPGARGKAT---TRLQNRYLKG 433

Query: 424 KIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIA 483
            + G+   N +  T+  ++  + V N      PC  N S   + G +V     A++P   
Sbjct: 434 SMSGAKEPNVHSETV-VALKAKNVGNEMS---PCKDNYSNGEKGGFEV-----ASDPKEL 493

Query: 484 EGSGEKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETALSQAAIKALIR 543
           +  G +     + D ++  ++ +        SN        +++SK T+LS+ A+K L+ 
Sbjct: 494 KERGLQRKKA-VPDRLNSIHKLNSTPASAHNSNPNLEELQTSLLSKATSLSETALKVLLC 553

Query: 544 KRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQ 603
           KRDKL+ QQR IEDEIA+CDK +Q I    + D+ ++L++V+ECCN+   R     +  Q
Sbjct: 554 KRDKLTRQQRNIEDEIAKCDKCIQNI----KGDWELQLETVLECCNETYPR-----RNLQ 613

Query: 604 YSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILK 663
            S +  + Q   R +LSE +   ++ CQ LDDIC  NNW+LP Y V+ SDGG++A V + 
Sbjct: 614 ESLDKSACQSNKRLKLSETLPSTKSLCQRLDDICLMNNWVLPNYRVAPSDGGYEAEVRIT 618

Query: 664 GMDFEYSSNGEVCHNPREARESAAMKMLGQL 688
           G     + +GE   +  EARESAA  +L +L
Sbjct: 674 GNHVACTIHGEEKSDAEEARESAAACLLTKL 618

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
A0A6J1JUZ0      0.0e+00  100.00        uncharacterized protein LOC111488583 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1EQ29      0.0e+00  98.27         uncharacterized protein LOC111436360 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1EPE2      0.0e+00  98.13         uncharacterized protein LOC111436360 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JP19      0.0e+00  96.40         uncharacterized protein LOC111488583 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1ENX1      0.0e+00  94.53         uncharacterized protein LOC111436360 isoform X3 OS=Cucurbita moschata OX=3662 GN...

Match Name      E-value  Identity (%)  Description
XP_022992174.1  0.0e+00  100.00        uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima] >XP_022992175...
XP_023548856.1  0.0e+00  98.42         uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
KAG6575356.1    0.0e+00  98.13         hypothetical protein SDJN03_25995, partial [Cucurbita argyrosperma subsp. sorori...
XP_022929889.1  0.0e+00  98.27         uncharacterized protein LOC111436360 isoform X2 [Cucurbita moschata]
XP_022929885.1  0.0e+00  98.13         uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_0229298...

Match Name      E-value  Identity (%)  Description
AT1G05950.1     1.3e-91  35.75         unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bac...

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description                    Alignment
None      No IPR available  PFAM         PF14709      DND1_DSRM                             coord: 624..687; e-value: 3.2E-6; score: 27.3
None      No IPR available  GENE3D       3.30.160.20  -                                     coord: 592..688; e-value: 1.4E-7; score: 33.1
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                   coord: 333..354
None      No IPR available  PANTHER      PTHR33913    ALEURONE LAYER MORPHOGENESIS PROTEIN  coord: 1..694
None      No IPR available  SUPERFAMILY  54768        dsRNA-binding domain-like             coord: 617..687

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh17G005990.1  CmaCh17G005990.1  mRNA