CmoCh19G003250 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh19G003250
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Plant protein of unknown function (DUF639)
Location: Cmo_Chr19: 2556726 .. 2579378 (+)
RNA-Seq Expression: CmoCh19G003250
Synteny: CmoCh19G003250
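
The Location field can be turned into a genomic span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3-style annotation); the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'Cmo_Chr19: 2556726 .. 2579378 (+)'.

    Returns (seqid, start, end, strand). The field layout is assumed
    from the Location entry in the overview above.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cmo_Chr19: 2556726 .. 2579378 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # Cmo_Chr19 + 22653
```

Under that assumption the gene, introns included, spans 22,653 bp on the plus strand of Cmo_Chr19.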
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCAATCATAGAATACGCATACGGCAGCTCTTCTATTCCGCCATCTGTAAGGCCGATCGGCTTTTGTTATATCGGGGTGGAGGATTTTCTCTCTCTTTTCGGAGCAAAAAGTATCAAGCTGAGGTCATTTTGCTTTCAATTTTTCCAATACAACCTCCACATGGAGGTTAAGATTCTTCTACTTCAGCATCTCCCATAAACATGGATTCCCATTTCGAAGGGTCCTTGAATTGAAACAGAAATGCTCTGTAAACTCCCTTCAACGTGTTTAAAATCATCTTCCGCAGGATTGGATCCTTCGATTTCTCACCATGGTTATAATCGCAAGTTTGGGTGTTCCACTAGAAAGAATGTTCCAGAGCTCAAGTATCGGTTTAAGATTGTGGGTCTTTCTAAGGGAGATAAATGGCCTCTCAATGACATTGATGCAAGTGAGTTTGTCAATACAATTCTATGGATTTGAGTTTATTTTGTGTATCTGTTTCTGTGTTTTCCTGGTATAGCTGTTTGATTTCTACTTATGTAGCTTTATGGTGAAGAGAAATTGTAGTTTTTGCAGACGATTATTACTCTTGGCTGTGATTGAACACTAGATTATTTCATGATGATTTCACTTATGGAGCTAGGAAATAATATTTTAGATTGAATGCTGATAGTCAATAGACATGGAAACTCTAGGCGTTCTTCTTTGTTATGGTGTGTGTATATAATCTGTGGCACCTTTGATATTGTGAAAGTAGGAAATTTTTAAACAAGGAGGAAGTTCTTAGGGTGTCATCCTTTGTGTTCCATGGCGAAGTGTAGATGAATGGAGGGAAGCGTTGAGGTCGCCAACTTCTACGGAATTTGCTTAGTCAGTGATGACAATTGTTGGTCATGTAGTGTCATTTGAAGTGGTCGTTGGAAGTTTGATGCGTGGTGTCATAGTTATGCTTGTCCGTGGCGTGTTGCCTGTGGCCGCATGTTGGATGTCCCGATCATGCTTGTCTAGGGTGTGTTGTCCATGAATGCATGCCCGTTCCAAACCCCTTTTACTTACTTTCATGTGTAGTTAGGTCATGATTATTCTTGTCATGTCAATATGGGGGTGCATGACCGTTCACCAAAATCATGTCTACATCGCTAGGGCTAAAATGCCCTAAAATGTAGCATAGGGGGATGTGACGAATTGGATGTGACTAATTGTCAATTCTTCCTACTTGGGTTATACCTATCAAAGCCATAGTTTTTGCTTATTCACTTTTAGCTTATAACATTGTTTTCTTTCAAATTGTTTTCAACTCATTTCCTCACTCATGTTTTCTAAAATATTTCTTAAACGTCACTTGAGGCTCGATCATACGTCAAGATTGTCCGCAAAGTTTGATTCTCACACAGCTGACTAGTTCGTTGGGTTCGAATAATTGCCACATAGTCGTTGTCTCGCTACCACAAGAGTGAAATTGTGACACGTGACACTATGGGTCATAGTCACTAATATCGTAAATGTGAACCATCAACAAGATAAATCTAAGCTAACCAAAGAACTTAAAAGATGAGAGATTTCAAGAAAATGGCATGAGAAAACTAACAAAAAGAGACGACAAACTAAACTGACATTTAATGGTTTAAACTGACAACACTAACATAATTAATTAAAATGATAGTGTTCGGTTATATTGTTTTGCGCTATCATCTTAACACTCCCCATCTAACCAAACGATAAGGAAATGATGTGGTTTTGTGACGCAGCTCAATGAACTTGGCCTCTGATAAAACTTTGGTTAGTCCTTTTGCAAGCGATCAGTGCGGGGATCATGCTGCACTTGCAAAGATTTGTCAAGGACTTTATTGTGAATGAAGTGTATGTCAAGTTCCACATGTTTGAAGCAAGGGAAAATGCGCTTATGTTGTCACACCAGATACTTGGTGTAGCAGCAACTGATTTCATTGAGAAGAGACTGAAACCAGCTAATCTCAGTTACTGTGCGGGCCAAAGAATGGTACTCTGATTCTGTGTTG
GACCTTGAAACATGTTGCTTCTTGGATGACCAAGCAATTAAAGAATGCCCAAGGAATACTAAGTGAAAAGTACTCCACGAAAAATGATACCCTGTAGGGATACAAGCTTTGCGCCCAAACCACTTGACTCTCCCTAAACATAAATACACTCTTTCTAAAGACATACACTTCTATAAACACCAACAGACACATAAATCATATCACTCAAATACGCCCTACGTTCACACTTACACAAACTACACGAAAATGACTCTCTAGTGACCACGTGATAGCTATACCTATCATCATACTCACAAAAGATACATCACCATTCTCAATAATAATCACCCCTAACTAGAACCTATTTATACCTACTCCTTAACCATGTCCATGTATAGTGCTTGGACAACCTCCTTTCCGGTTAGGCACCCTCCACTGTGCGCACAAGCCGAAGGCGTAGTGGCAATTCCTTCGGTTTCTGCGGTGATAAGGCCCGCAGCGCGGCGTAGAGATACTTAGGTTTTACGTTAGACAGTGAATAGGCTACTACCTTGCTCCTATCGCTGGTTGGGATTGGATACTGTTCTGATTGATGGTTTTCGTAGTTGTTAACATAAATCAGAACGGATCAACCAGTAGGGCTCCATCCACTGAGTGCCAATGCCGAAGGCGTAGTGGCATTTCCTTCGGTTACTGCATTATTGCAGGGTGAGTGAGGAAAAACCTTATTAAGTCTCGCTTTCTCGTGTTTTTTGACCATATAATATGCTCCAAAGAAAGCTCAATACTCCTTATGTCTGACACTGCACGTACCCCACCTTGATATTAATGGTTTTTTGCATTTTCACTTAATATCATTGGCCTCTCAGTTAAGAAAGTAAGCACAAGCTTAACAGAGATACGAACAGTTTAGTAGGGCACCTTCCACTGTGCGCCAAGACAGAAGGCTTAGTGGCATTTCCTTCCGTAACTGCTTTATTGCAGGGCGAGTAAGGCTTCTAAAAGCACTATTATGTGGAAGAAAAACACACAATTGGACCCCGACACATCTGGACTCTAACACTCAATATCAGGCTGCTGGACACTTGTTCAATCGGGATATCACAAGCTCTATCCCCTTTGAAAAGGATCCTCTATTGCTATTGAGTGTTTTCATATATTTTTTATAGCACCAATGCTATATTTTTGCAAGGGCGAATTATGCTAATTTCTGTGTAATATACATTGCATATTACCCACATAACTTATTGAAAGAAAGAGGGGGATTTTTTCGTCATAAAGCATATTACCCACAAAACTTATTGAAAGAAAGAGGCTTTCCACGTATCTTAATGTTAGTTAATATATATTGTTGCAAGAGTGGAGAAGTCAGCAGATTTGGGAATTTGTAAGCCATGTGATACCTTTGATGTTATTGATGGCTTGCAAATGGAGGGAAGTAGGATTCAGAAGAAATTGACTTATTTTGATGATGATAAATCAAATTTTAATTCTGGTATGTACAAGGTATCGAAGAAATCAATGGCACTTCGATACAATGTAGGATTTTCCATTAGTTATCTAGCATTGGCTGTTAGGTATTTCTAGTGATGGCTGGTGTAGAGCATGGCTTGATGCGAGCTAGGACCAATCTTTCGAGAAGATTCGATGCATACTTGGTCTGGTTGGGATGGATTCCATTATTGTCACGATGAACTGAAATACCAAGGAAGTAGGATAGATTTCCCAAGTCCTCCAAAGAAAAAGTTTTGTGAAGTTCAACAACAAAATAGTTGATGATAGCAACGTCAGTCCCTGTCACTAACATGTCATCAACATAGCTTAGAATAAAGATCTTAGTTGATGCAGTAACAAGCAAAAATTAAGAATTATCAGATCTTGCATTTTTAAAACTCCATTGCAGTGGCACTTGTCCAAGCCTCTCAAACCAAGCCATCAGGCTTGTATTAATCCATACAAAGAGCTTTTTGCAGTAATTGCAAGCAGTCAAGAAGCCACCACAAGCCCTAATTTCATCCATT
CCACCACAAGCCCAGATTTCCTCTCCTAGCTAATCACTTGCAAGATGACGACATTATTGAGCATTCATTCGACGAGGCTCTTCATTCCTTATTCAAAAAGGAAAATCTAATTAACGCTTCAGCAAAGAAAAAAACTTCTGATTTTAGGATTTCTAAAGCCCTATTAACAGTAAGTTTTTTCGTTTTCTCCTTCCATACTGGACAAAGAAATTCCCTTCATTTCTTGAAGTCTGTGGTCTTCAAGTTCGTGAAGTTATTCCAACATCAGAATTAAAGGTCAGTGTTTGGCCTTTGGGTTCCCTCAAGCCTTTCATGGATCATCTTTTTTGAATTCTGTTAAAAGGATAAAGATGTTTTGTTTTGTTTTGTCTCTTCTGTTGGTGTGCTTGGCAGCAGCATTTCTTGGGGTTTTGTTGTCTCGAGGAAGTGGTTTTTAGCGTCTTTGGCTTCATAGATTCTTTGTAAGCAAGCCTCTTCTTTGGCTAGATCACTTTGATTCCGCTAGTGTAGAGACTTCCTCTTGACTTTATCTATTCTAGCTTTTTGTTGGGTTTTCTTTTCAGTTGAATTTTTCAATATGGGATGATTTTATTTTTTTAATTTCTAGTTTTTGCTAATTCCTTGTGAAGTTTGTATCTTTGGAGCATTAGTCTCGTCATTTATCAATGAAAGTTTTGTTCCTTGTTTAAAAGAAGGTGTACTTCATTTTCACTTTTGACCAATAGGAAGTGCTTTATGTAGATTCCTTGGCAGGGGGCTGTTTAACTTCTTATTTTATGTAATTTCATCTTATTAAAAAAATGAATATTTCTCATAAAATAAAAAATGTTGCTATGTCATATGAAGGTCAATGGTTAATTCTTATTGCCTTTTATCTTAGTATTGCTTTTTTTCCCCTCCTCTCTCCTAACGCAGTCTTCTCCGTGCTCTATTCTTATCCTGACAGACGCAGTGCAACAAAACTTAAACAAATGGCTGCTGAAGACGCAAAACTTCTTAAATGAAGTTACATCTCCCCGTGGAAAAATCAGTAAGAACAAAGACCATATTCCTACAGGAGCATCCACCGACATAGAGGATCTAGTTATGGCGGAATATACTGTTAATATCAGGACACCAAATGGCCTCCTCTCTTCTACTGCTGTTGTATCCATTGAGCAATTTAGCAGGTCAGATTATTATAAAATTTGGTCATTTAATCTAAAGATAATTGACATATTGTTACCTTCTTGACTTATTTTTCATTTTAATGGCCTCTCCTATTAGACATCATCCATTACAGTGTTTAATTATTTAGCAGATTTGCCCAAAATTTGAGATCGTATGAGCAATCAAAGCATATGATTGTCATCATGAACATCGTAATAAGATGGAAATGATGAAATTGTGCCTCTGGAAATCATTATTCAACAATGTGTACATTGAGTTGCAATAACGAGGCTTGTTATGTGTGAAGACTTTGTATATAAATCATTTATGCCCCATGCATATTTGTTTGAGGATGAAAACCTTGTTATGGCGTGGCAACAGTTTCACTTAAAATTGTTGAATCATCAAGTTTTATTGATGAAAAGAAGCAGATATACAGATCCAAGTAATTCCTACCAGGTAATAGAAACTCAGAGAAAGCCAGGTAGCTAAGTTTCTCTGGAGTTCCTAAGACTGCAAATGAAAGGTCCAGTTGTTCATTAACACATTTTTTTTCTGCATGCATGGCTAGTTATGGCCTGCATACGGACAAAGAAAGAAACAGAAATCCAGAGATCCCTCGATGCAATTTATAGGAATCTGAAAAAGGAATAGTAAGCTTCTAGAAGGAAACAGTTTAGTCCTGTTATTACAGTATTCACTGATTTATTGTCAACTAGCTGGAAGAAAACAGGTTTTTGATTATCTGTCACGTGAGATGGCATTACACTCTCGATTCTAACTGATTCAAAGATATTCTATCGGTCCTGTCTTGCTGATTATTTCCTGACAAGACCTTTGGCAGTGGTTTA
GTTTGCTCATGTGCAAGTTGATTTCTAATGGATGCAAAAGGAGTGCAAATAAATCTTGCTTCCAACTTCTCCTCATAAAAAAAAAGACCCTCTAGGATATTGTGGAAGTAGAACCAACATCACAGACATATCCCACTGGGTTGTGTTCATTACTGATGACTACTTTGTTATCATATTCAAATTTAATCGGAGGTTTACTGTATATTTCAAGTTTTTCCATTTAGATTAACACTTAGACATTTTTTTTCTTTAGAAACTAGACTTTCATTATCAAAGAGGCAAGGGAAACCAATTGTTTAGTAAGCTTCAGTTTTCTCGAAAGGGGGTTGTCCCAGTTTCTTGAGTTTCTTAGAGTATTTTTTGTTTTTTATTCCTTTTCTGCTGGTTATAGATACAATTTCTATTTCATTTCTAGGGATTCTATAAAGCTGGTACTAGGAAGAGATCATGGGAATGTATTTTGTATCTAGGAAAATGTTGTTGGGATTAGGAAATATTCTGGATAAGATGAAGAATGAAATGGCAGCCATGGTTCAAAAAATATGGAGGAGATGAAATCAAGACTCAAGACTATGAAAAAGATGGAGGTTAGTTAGATCGAGGGGTGAAACCCCGAGGGATTCCACATTAGAGTGAAGAAATAAGGAAAAGAGACAATTGTTCGATCCAAAAAATACAAACCTATACAAGTTTTGAAAAGATCTTAGGGACTCATGGTAGGGGATTTAGACTTGCAACAACAACAATTAGCAGCAAGGAGTGAAGCTATCCATGTCATTGTATTGAGCTGAACACAACATTTTTTAACAAGTATTCTCGAACCATTGGAAAATGTTCTTTCGTACATGGCATTTTCAACTTCATTATCTTTTAAGCTCGATGGTTTATTAATTTTAAGAGGGAGGGATTTGATAGGGTATGATCTAGTAATTTAAATTTGAGTTTGTTAATTGGTTTTAGTTCTTGGATTTCTAATTAGTTAGTTGATTTAGTTACTGGACTAGCATCCTTTAGCAAGATTTAATCTCTTGTATGTGTAAGCCTTTCTAAATAATGACAAAGACTCTCATCAAGTATTTTTGAAGATGAAGGAATGTTCTCAATGAATATTTGTGTAATCCCATTAAATAAGGGAAGTAGACCTCTGTATTGTTACCTAGATTTATCTAGATTTTCTAGTATCTAGGGATTAGTATTAGATTTACTATTACTTAATGATACTGGGGAGTATGATTTAGCTAGATTTATTGTTACCTAGATTTCGTTGGCAGAAAAACAAAAACAGTAAGATATTTTAAACTTGCAACAATTTTCTTATATGTTAGGTTGCATCACCTTACATAGTTAAGTAGTTTTGAAGATTATTTTCTATTTCCACTTCGAGGAACGAGACCTTTTCGTTTGCTCTCGTTTTAACCTATCTACTTTTTCTATTTCGCAATAAAAAAAGTGAAACTTCAGTGGTTCTTGTTAGCCTTAATAAATTTTGATTCTCATTCCAGATTTGTTTATTTTTCGTTTCCCGTCTTGTCTAGGTTTTTCTTTCATTTAGTTGTGAAGAAGTATGGTATATACAATTTATTTTACTCAGCCATGTCATGACATGCAGGATGAATGGCTTGACTGGGCAGAAAATGCAGAGGATATTTAAAGCCCTTGTGCCTGAATCTGTTTACAATGATGCTCGCAGTCTGGTAGAGTATTGCTGTTTTAGATTCTTGTCAAGGGACAGCTCAAATCTTCATCCTTCCCTCAGTGTAAGTACTTCATTCACCCTGAAGTTTCAGACCTTGTTTAATGATTCAATGACGATTAGATATGGAGTTCTAATTTATTTTAGGGAAAAATTAAATTTGTTGCCTCTGTACTTGGAAAGGGTAATCAATTTTTTACCTAAACGGTTTTATCTTAAGCTTAAAAGAGGAAGATCTTAAAATTTCCTTTTTATTTTTGGGTTTTTAATTTCAGTACGAAGAACAAAATTTCTCATCTTCTTGGAG
GCTGTGGACTAAAAAGGAAAACAAAAATTCTTTGGATTAATACTTCTTAAATTAAGGTCATTCCTCGACATAAACCACTAAACTTTTTGTGTTAATAGGGATACAACCTTCCAATGAAGGATGTCATCAAGAAACTTGACGAAAGACGAAAATGGAAAACCCCTTTTGATATTTTCAAATGAGTATATGTTGATGACTAGCAAAACAATACTTTCTTTTTTGATGAATAAATTCTGTTTATTTATGCAGGAACCCACATTCCAGAGATTGATATTTATAACAATGCTTGCTTGGGAAAATCCATATCACGAGCATACTAATGCTTCAGAGGAAATTGCTTTTCAGGTTTCAACCTTATTAAAATCTTGCCATATTTTGTATTTCATTGTTGTGAGAGAAGAACTAGGGATATCTTTGTCTTATCTCCTCCTCCATTGTGACTTCAGTAGTAGAGTTTGGATTTCTTTTTGGAAATTTGTTCGGGCGTACAATGGTGTAAACCTCTTTGCTTGGACGATTAGCTGGTCGAATTCCTCAATGGTTGGGAACAGAGGAAAGCAAAGGTGTTATCAAATGTGGCTAAAGCTATTCTGTGGTTAATTTGGAAGGAAAGGATTCACAGAGTTTAAAGATAAGTTCAACTGTTTTGTTTATTTTTGTGAGTTTGTACTGCTCACGGGACTTTAAGCATGACGTCAAGACAATTAAAACAATTGACTGTAAATGGTAAATCGGAATCAAGAAAACCTTGGGGGTAGGAAAAGTATGAGGTTTGAAGCAAATCAAAGAGCAAGAATGTGTTACAGGGGAAAACACCACTTTCTGGATAATGCACGAAACAGAGCATATTGTTCAAGAATTCAAGAACAACATTGGAAAGTCAAGATTCTGATTTTTGATGAAAGACACTCTGCTAGTGATAACAACTAGAAAATAGGGCCATAGAAATAATAAATACCAAAAACATGCAACACTCGGGTTGAAAATCGACTTACCATCCTTCTACGACAAAATTTATCTTGAAGCTTTTTTTTTTTATTGGATTTAAAATGTGGAGAATTTTTTTGCGATGACACCAATACCCCGGAGTGCGACAAGGTTAAACTAGTGGCCTTGAAATTAAGGAGGAGCTTTTGCTTTGTGGGGTTAGCTTGAAACCAATAGAAGGTGAAGTGGAAGATTCTAGTTCAAAAATGGTCGGGAATGCTACATCAAATGGAAGATGGATTCCTACCAACAAACTATGAGCAACTACTCTATCAACGATACCAACGATCCAAACAAGAGAGAGAATTTAGTCGAGTATATCGAGGAATTCCATTGCCCAAGTTCAAGAACAAACTTGGCTAATGAAAACTTCCAAATCTGTTTAGAGAATCTGGTTGAGTATATCGAGGAATTCCATTGCCCAAGTTCAAGAACAAACTTGTCTCATAATGAAAACTTCCAAATCACAAGATTTATTGGTGAACTCAAAAGAAGAGATCCAAGATCAAATGGATGCTCAACCTATAGCCTTTATGTTTGATGTCATTACCATAGGTACCTGAATTGAGAAGAAGTTTGGAAAGAAACAAGGTAAGTGAAGGATGACCTTGGTTCAACTGTGCAGTGTCATGATCTTGACACACTCACAACCAGTCTTGAGGGTAATCGAAGGCCTATATATACATTTTTCGCATAACTTTGTGGCTTTGTCGAACTTTAGGCGAGGCTTGTTCAACGTAACTTGTGCAGAATTCAAGCTTGTTAGACTATACACGCTGCCTGCGTGTGCATGTTCTATCATTTGCGACATAACTGACTTGCCTCTATACTAGGCCAAGTCATTAGGCTTGACCTCGCCTCTACTCGGGGCCAAGGTCTCTCAACGAAACATCACATCTCATCATGTCATCATCTAGGTTCCTCATTTAACATGGCCCACATGTCTTAGCGCCATCCCAAGTCATGGGGTGTTGTCGGCCGTCACAGCTCACTCTGTCATGCCATTGAGAACG
GGCCCACGTGTCGTTCATAATTTTGGGACATCTTGAACATCCATTCATCCAAGTTCTATCGAGGCATTGGACCCTTTGTGTCCATGCCATGTCGTTTACAGACTCTGTACTGGATATCGGGTTTATGATCTAACCTCCGACACATGGGCTGGGGCACAACACCAGCACACCCGTGCCGCCCGGTACTGTCCTTGAAAAAGCTCATTTTAAAGTAAGTCGCTCGAGGCACTTGTCTCGCAATGCTCGCTCTCACTGTGACACACGTATGTGCCAGGAATGGTGGAGAGTTAACATCATGCCTCCCCCTATAGTACATTTTGAGGCATTTCCAATCTAACGGGTGTAGACATGATTTTGACAAATGGACATACGGTGTCATGGAGTGTTTTGGCGAGCTTTCATATTTCACATGGAGGAACTTGACTCTAGAACCGTTTGTTTAAATTCCCTTTGGGACCCCGACTTTCGAGGTTAAGGCTCGGGGCCCTATCTGACCCTTCATCAAGCCTATCTGTGACACTCAAGTCTACTCATTGTTCCTGTCTTGCTTATGTCTTGTCTCCTCTTCAATGCACACATTATACGGTGTGTTGGATGATGGATTGTCTTCCTCGGTCCCATCATACCACTATTCTATATTAGCCCTCGCATGGTCGGATGGGCACCTCTTGAGGATCGACTATTTGCTTGTTTTGATGTGAGGGTCACAACTTACTCTATTCATAGTATAGTTGTGACAATGGGCTGCTACTATGAAAAATAGAAAAATCGATTGGAAGTAGTGAGCTTGGAGATTTATTCCTTGGAGTTGCGGTAAAATTTAGAAAACTCTTGGACTTGTAGTAAACTTCAGAAAAATCTTGGTTGGAAAGCGGTGGTTGCTGGGATACACTATTGGACTTGTGGTAATGCACAGAGAGGTTGGAAACTATGAAGTCTCTGAATGAAGCTTGATTACGTTGGCTGATTGACAGTCATGAAAGCTTCACCGATTGAGTTAGAGTTTGGTTCTCATTAAATTTTCCAGAATAAAATGCTCTGATACTATGTTACGAGTATAAATTATCTTGCTCTCAATACTCAATATATTTTTAGAGAAATACATGTACCTTTTATGATACAAAAAGACTAAATAAATCCCTAGATACTAGGTAAAATTAAATTTTTGGCTAAGTCCTTACATACCAGGTAACAATATAGATAAATAACAAATATCTCTTAACTTAAGGATAAATAAATATTCCTTAAATCATGGACATTTCTTCAACATTTCACATTTACGTTTAAAGCTTAAACTGATTCTAATGTATGACAATAGTTTTTTTAATACCATTAATAACTTGGAAGGAATATGATTTTATAAATGCTATGAAGTTTTGGTAACTCGAAACGATGGACAAAATCTCTGATGTTATTGTGTGTGTGCCTTTTGTCTCATTTCTCCCTGTATGGGGGATGACTTTTTTTGTTTGATAATTGTTAAAAAATTTAGTAAATCATTATTTGATGAATAGATCAACAAGCTTTATATCAAAATTGGTTCTTTTTGGCATTGCAGAAGATGTTAGTTGGAGAAGAGGCTTTTACACGTATTGCACCAGCTATTTCTGGTGTTGCAGATCGATCCACAGTACATCATCTATTCAAGGCACTTGCGGGTGATGAACAGAGCATCTCTTTCAGTTTGTGGCTCAAATATGTTGATGAACTGCTCAAGTGAGCATCATAATTCTTTCTTTGTCTTGCATTTGGCCATGGTACACTTATATAAGTCTATAATCTATAAAGTGAAGTGAAATGAAAATAATATGTTGTATAGGTTTCTTTTTCTTATATCAAATTTCTTATTATTTCTCCCTTTGAATTTGCTTGTCACCAGTTGGAATATGGAGCTTGATGACTATTTATAGTAAAAGATTTGGTAGAGATATATACGGTAGATATATCGTAAAGATCTAGTAGAGACATATATATGGTAGATATATAGTAAAATCTGTAGA
GATATATATGGTAGATTATATTTTGTATGTGGCTAAATCTGCTAAAGCTATATTTAGTAAACTGAAATCATATATATACCCCCATTAGGCAGTCTCTTGGTGTACATACTTGAAGTCATCAATAAAACCTTTAGCCAATATTCCTCCTTGAATTTTCTCTAGTCTCTTGGTGTACATACCTGAAGTCATCAATAAAACCTTTAGCCAATATTCCTCATTGAATTTTTTCTCTCCTGTTCTTTGTGGTCATCTTGTTCGAGTGTGTGCCCTTGTTGTGCGATCCTAACACCATACCCCATTGACCCTCAATTGTTGCCCTTCTCCCTCAATCTCCCTCCATCGTTGGCCACTCCCTCTTCGATCCCTATCATCAGCATCCTCCCATGGCCTCTTTCCATTGTTGCTGCTTTCCTTCTTCCTTCTCTCCTTTCCTCCCTTCCCTTCTCTCATCTTCTCCAACCCCCTGTCTTCCACGCCTTCTCCTTGGACTATGTGCCTCGACGGGGAAGTAACAAGAATACATGGAGAGGAAGATTTACGAGATCAATAAGAAAATGGAAGGCATGGACCATCAAGCCTGAGCAACTGTCTTGCATACAATATAGAACTTGGATTGATTTTTTTTTTTTTTTTTAATGGTCTAATTCGTGTCAACATTGTTGCTATGTCAGTCCAAGCAAACATTAAATTGTACTGAATCTTAGGTTTAATTTGTTGTTCTAACTTAATTTCTATTGGTTTGGCTGTAGGGTCCATGAAGGACGAAAATTATATCGAGTTCGAGATAACAGACAGTTCTTTGGTGAGAATATCCTATGTATTGGTTCCAGCAAGAAGAGACCCGTCTTAAAATGGGAGAATAATATTGCATGGCCAGGAAAACTTACTCTTACCGACAAAGCTGTTTATTTCGAGGTATAATTCCCTTTCAGTTGACCTAATCTCTGACTAAATAACTTCACAGCATATAGAAAATAAATGCAGTTCTAGATGCATAACTAAGGACAAAAAGCTTACAATAAACTGAGATCTTGAAAAATTTAGACAAGAGGAAGAGTATGACCAAATGCCTTATTCAGCACACTGGGCCAATTGATAGTGGACCTTGCTAATGCATCTATTTGGTCCATGCATTTAAGTAGGGATCATCCTAGGGCAAAGGTGTTGTAAAGGTGTTGCAAGTATAAATTATCTTATTTACGTTTCTTCTTAATTTTTTTCGAAAGCTACACGTACCTAGGAAAGACTAACTAAATAACTAGATATTACCAATACAGGAAAATAAATCTAACTAAATACCTAGTTACTAGATACCCAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCAAAAAAAAAAAAAAAAAAAAAATCCTTATATATTAGGTAACAATACAAAAAAATAAAACTAACTAAATTCCTCGATATTAGGTAATAATACATACATAATTAAGGTTATGGTTACATTTATATATAATTGAGGTCATTCCTTTAACATTCCCCTTTAAGTTGCGGAATAGATGTCATCTCTTCCTAGTTGGGTGACTAACTCATGAAACATTGAGTTGTGAAGTTCTTTTGTTAGTGATCGGCCATTTGATTGTGAAGGCTCATAACTAATGCAAATTATCTCTTTCTCCAATTTCTCCTTGATAAATTGTCGATCAATTAGAATATACTTTGT
CATTTCATGCTGGATTGGATTACGTGCTATATTGATCCACACGACTTCATTATCAAGGCATGACTACAACAGTACGACTCCAACGGTAGTTATGAAGGTTCTGAATAAATTTGATCAAGGTGAAAGATCGGTAGCTATTAAGGCTCCTAATAAATTTGGTTAAGGTTGAAGATCAGTAGATATGATGGCTCAATATTTGTCGGAATGGATCATACTTACTTTGTCTGGAAGAACCATCTCCAATCCTTCGGTCTAGGAATTATTGTATGATCCATAGTTGCTTTGTCTGGAAGGACCATCTCCCATCCTTCATTTTGCAGTTTCTTCTAATAATATCCTGCACTCAAAATTGCCTTGGTTGCAAGAACTCCAATCTGTAAGGGATTCCCAGCTTCAAGCGGCTTCAACTTAGCTTCCTTGAGCTCTCTAACAACACAATCAGCCATGTGGAGGAACCCTTTTTGGATTCTGGAAATTGTTCTCTTTCATCAGTGTGCAAAAGAGTTGGTGGTGGTACAACCTGAACAGGATCAACCTCCATCAAAACTGAATTCTACTAGTATTTCTGTAGGCAGATATGAAAGATTGACTACTTATAAGCTGGAAAGACATGTCAATCCTCAAAATCATCTGGGATTCAACATTGAGAAACTGCGTTATCAATTTTCCATCACAAGAACAGAAAATTCATTATTCAAAATAGGGGGAAACCCTTCTCTGGCATTCAAAATAGAAGGGGATTGAATTTCAGTTCCTGAAACAAAACCCCATCTGTAATAGACAAGCCCACCGCTAGCCGATATTGTCCTCTTTGGGCTTTCCCTCTAGGTTTTACTCTTTGCCTTCAGAGACTCGTATTCCTTTCTGATGGTAGACATTCTGCCTAAGCAGAACTACTTTGAACTCAGCTTAAACCAAAAGTCAGCACCTTGAACCACGAAGAGGCAACTAAAGGACAGGACAATAGATTAACAAATAATTAACATCTTTCGGGCTGTTCTTGCGTAATAAGACTCAACTTGGTCTGAGGACCATGAAGGTTGTGTGTCGAAACATTTCCTAGGTGTTTAGGCCTCCAAGTTTCCCTAATGAAAACCCTGTGCTTAGGCATCTTAAAGTTCCACTGAAGGAATGACCTTAATTATATACTTGTAAACTCTAGCCTTAATTTAAGGAATAACCTTGTGTAAACCATATCCTTAATTTAAGGAATATTTGTGTCATCCAATTAAAAATGGGAATTAGTTAACCTTCCTATTGTTACCCAGTATGTCGGGATCTTTTTTTCTTTATTGTTTTTCTTAAGTGTTACCTAACAGGATTTAGTTAGATTTATTTTTCTATATTGTTACCTAATATCTAAGGATTTAGTTAGATATATTTTTATGTATGGTTAACTAGTATTTAAGGATTTAGTTAGATTTATGTTTTCATATTGTTACTTGTATCTGGATGTTTAGTTAGTAATCATGCCTTATAAAAGACACATGTATCTCTTTAGAAATTAATGAGAATGGTTTGCAAGATAGTTTATACTTAAAACATGGTATCAAGGCTATTCTATTTTGGGAGGTCTGATTAATCGGCGAAGCTTTCTAGCTGTCGATCTTCCCTTCCACCTTGACCAAATTCATTCGGTCTTTATAGCTACTGATCTTCCACCTTGACCAAATTTATTTGGAGCCTCTATAACTACCAATCTTCCACATTGACCAATTTTTTTTTTTTTAATAAGAGACAATTTCATTGATGAGTGATATTTACAAGAGGGATGTATAATTTATGATGTTTACAAAAGACCTTCCCAATTTGCAATGAGGGAGGTATAACTGTAGACAGTTTACACCAAGATATAGCTTGGTAAACAACATTGTTGAAAAGTTTTGTGTAGGTCTGTGTCGTTTTTGCAAATATTCTTTGATTTCTTTCTTTCCACAGATTCCAAAAGAAAGCCATGATGAGGTTTTTCCATAATAGGGCTTTTGCGTTCTTGAAAGGGT
GGTACGTTTTAAGGCCATAGGAAATGTGAGATGTCATCTGAATATATTAAGAATTGCTGTCCAAAAATTTTGAGTGTATGTGCATTGTATAAACAAGTGACTTTGTGATTTGTTTTCTTTCTTGCATAGTGGACACCAGTTTGGAGAGAGAGTGATGTAAGACATTCTTTTTTTTGAAGATTTTCACTTGTGCTGATGGCTTTATGCGCTATTTCCCAAAGGAAGAACTTCACCTTTTTAGGTTGATGTCCTTTCCATATTGTCTTTGCTAGCATGGGATTTATTGCTTCTACTTTTTCCCCATGTCCATCATCAAGGATTTGGTAGAAAAGACCCTGTCAGCGCTGGGGAGCCAAGTTAGTGATTCTTCTTTCTTTGACAATACCACAGGGGCAAGGTCAAGGCCAAGGCTTTTGGCCCACTCTGTTCCTTCATTATCCTTTAGATTCCTACCAAGCTTCAGGTCCCAAAATTTGTTGACAACATTCCACGTTTATTTAATCGTGGCTTTTTTGCTATGAGAGAGCCTATACAAAAGTAGATACTTCAAAGCTAGCGTGGTGTTTCCAATCCATGGGTTGGACCAAGATGATGTGCTTCCCCCATTCCCCACCTTATGGCGAGTTCGGTTGGTGTTGAGATTTTGATGTTTCTTTATGTACTTCCAAGGCCCTTTTGTAGAAGATGGAGGGAGGTGATCTTTGCTTGATGTAGGAGTATATTTAGCCTTTATAAGATTTCGCCATAAGGCCTTTTCTTCATGATGATATCTCCATATCCATTTGGCAAGGAAAGCTGTATTCTTCTTCCTTATAGGGAGAAGACTGAGGCTTCCCTTTTCTGTTGGGAGGTTTATAATATTCCAACGAAAAAGGTGTGCGCCATCTTTCCACAAGTAATTTCTGAAAAGTCTTTCTATATCTGCGGCCACTTTTTGTGGCATTTCAAATAGAGACATGTCGTAAGTAGGGAGGTTGGATAATGTGGCTTGTATTAGGGTAAGTCTTCTGCCTTTTGAAGTATAACGTGAAGACCATGTTGACAACCTTCTTTCTATTTTCTCAATAATAATCTTCCCGAAGGAAAAGGATTTGTGATTTCCTTTTAGAGGTAGTCCTAGGTACATATTCGACCATTCTCCCTTTTTACAACCATATATGCTAGCGAAGTCTGCAATGATATCTGTGCTAATATTGATGCCCAAGATTGATATTTTGTCCAGAGAATCTTTCGAAGGTTTTTACCGTCTCAATCATGTTTTTTATGGAAGAGGCATTCACCAATCCGAAAAGAGGATTATGCATCTGCGAATTGAAGGTGGTTTATGCTCAAGCCTTCATTACCAATCTGAAAACCTTTTATCTGCCTTTCGTATCCTGCTTTTGTTAGCAGGCAACTAAGACAGTCCATTACCATGATGAAGAGAAAAGGGGATAGTGGATCTCCTTGTCTCAGGCCCCGTGTGGCATAAATTTTCCCTCTCAGTCTTCCATTGATGATGATGGAGAAATTTGTGGATGAGATGCATTCTTTAATCCACCTTCGCCATTTTGGGCCAAAGCCTTTGGCCACAAGAATGTTTTCGAGGAAAGTCCAATCAACTTTGTCGAAGGCCTTTTCCATGTCAAGTTTAATGACTACACCTTTTTGTTTCTTTCTTTCCCATTCATCAATGAGTTCATTGGCCATAAGGGATGCATCAAGAATTTGTCGCCCCTCAACGAAAGCTGTCTGTTGCCCAGTTATTGTGAAGGCGAGGACCTTTTAAAGGCGTTCTTATAATACCCTTGCTATGATTTTATATAGACCGGTCGTGAGGCTTATGGGACGAAAGTCTGCAACAGTACAAGCATCCAGTTTCTTTGAGATCAAATATATGTATGTTTCATTCAGGTTGCCATTAATAACTCCGTTTTGAAAAAAAAAATCATGGAATACTCTCATGATATCAACTCTCATAAAGTTCCAGCATTTCTGAAAGAATTATGAGGTAAAACCAT
CCGGTCCTGGAGTTTTGTCGGAGCCCAAGTGTTGAATCGCCTTCCCCACCTCTTCTTCAGTGAAAACGACCTCGAGGGAGGCAGCTTGCTGTTGATCAATAGGACTCCAGTCAAGATTGTGAGGAAATTCGCAGAGGGCATTGTCCTTTGTGTATAAGGATGTGTAAAAGGACACAAATTCTGATACAATTTCTTCTTCGTTTACTGAACTTATGCCTTGTGTGGATAGAATTTCCATGATGGTTCTCTTTCATTTCTTTGTAGCCATGATACGATGAAAAAAATTTGGAGTTTATGTCACCTTCCTCTAACCATCGTTTTTTGCATTTTTGTCTCCAAGATTGCTCTTCATTTACTGCCAACGAAATAAGTTTTGCTTTCATGACCTTTTTCTGTGTGTGTTGAACTAGTGTGATGGAGCTAATTTCCTCCATCTGATAGAGGGCAATTTCGGTTAGCAATTGGTTCCTTTTTGTCGAAATACAACCAAAAACCTCCTTATTCCATTTCTTTAGGACTCCTTTTAGTCTTTTAAGTTTGTTGATGAAACCATGCCCTGGCCATCCATGGAGGGGCGTATTCTTCCACCAGTATTCTACCATTGGCAAAAATTCAGAATGGTTCAGCCACATATTCTCAAATTGGAAGGGTGAAGGGCCCCATTTACAGCAACCCATGGATAGTAGAATGGGATAATGATAAGATGTAGGTCTATTCAGTCTTTGAACCCAACGTTACTGAATTTTCCAAGGAGGCCTTCTGTGGCAAGAAATCTATGGATGAGTGTCAAGGTGGGTGGAGATCTGTTATTTGACCATGTATAGAGTCCATTACTGAGGGGAAAGTCAATGAGTTCGGCTTCTCTTATGAAATTGTTGAAGTGTACATTGACCAAATTTATTAAGAGACTTCACAGCTACAAATCTTCCACATTGGCCAAATTTATTCGGAGCCTTCATAGCTACAAATCTTCCACATTGACCAAATTTATTCGGAGCCTTCATAGCTATTTCTGAAGTCGTACTTTGACATTGAAGTCATGTCGTCGAAGCTGTGCCTCGGTGGGCATGAGAGAGCGCATCTTTATATCAAAGACATGTTGAAATCGAAATCATTGGTGTTGCAATTGTTGAGTTGAAATCGCATAGTTTGAGAGGTCACTTGAACCCTGTTTATATTTCTACAGAAGAAAAATCAGAGTGAGAAGGGTTTTCTTTGTTTCATAGATTACTTGAAAAGGAGGCCCATTTTTCTGTGACAGTATGTTAGGAGAACAAAAAATTTTCTCCCGTTGAGGAACTTCCTACTGACCAGTGACGGGAAAGGTGATTTTATTTAAACGAAACAATTCCCTTACATTAGGAGCATACATAGACGCTAACCATGTTGGATCAGTAATGGACAATAGATCTATGATGGGGTATTGTACTTTTCTCGGAGGTAATTTAGTGACATAGAGGTGTAAAAAACAAAATGTGATGGCTAGATCATCTGTTGAATTGTAGTTCTAAACTTTTGTGCAAGGGGTATGTGAGCCCTTATGGTTAAAGATGATCCTTGATGACTTAAAAATTAAGGGAGAATGTTCAATAAAGTTGTACTGCGACAAGAAATCAGTAATTAGTATTGCACATAATTCTATCCAGCATGATATGACAAATTATATTAAAATTGATCGACATTTTATCAAGGAGAAATTGGAGAAATGGATTGTTTGCATTTGTTATGGACCCTCACAGTTCAATTGGAATATGTACTGACAAAAGGACATTACACCTTAGTGTTTCATGAGTTGGTATCCAAGATGAGAATGAATGATATCTATCATCTATTCCTCTCTTTAGGGAAGGAATGATCTTAATTATATCAATGTAAACCATATCCTTTAAGAAATATTTGTGCAATCCGATTAAATAATAGAATTCGTTAACCTCCCTATATTATTACCTAATATCTAAGGATTTAGTTATATTTATTTTTCTGTATTGTTACTTAATATT
GGATTTAGTTATATACGTATATTTTTATATTGTTACCTAGTATCTGGGTATTTAGATAGCCTTTCTACTTCATAAAAGGCACATGTATCTCTCTAAAAACTAATGATAATAGCGGATAAGATAGTTTATACTTGTAACACTTACCAAATTGGTTCCGCTAATTGTATCAATTTGGTTCTTATGACATAGAAGTAGTTAAAGGAAGACATAAGTAATACTGATTTGGTTGCTGGTTGTTTTTCTAATCTAAAAAAATATTATGATTTTTAATCTAAAAAATCGTAGAGTGGGAGTAGAGGTTCGAAACATGGTTTTCATCGTATTTACCACAAAATTGGCTCTTGGTTCTCTCAACCAATACGACCCTATAAATTTTTTTAGGGTGTTTCTGTTTATAAGAGTTTGGAGAAGATTATTAGAGACCTTTTAGAGCGAGTCGATGAGGGCGGGGCTCTCATTTGATTAGGTAGAGGAGTTTGTAGTCTCCTTTGAATTTTGTTCACTTGTACCATATCTTCTCTTTGAGGTTAGTCAGCATATGCTATTGTGTAAAGTTTTTTTCCTTCAAACATTTTTGTATCTTTTCATAAATGAGAAGTTCATATATTATCCAAAGAAAAGTTAAATCAATCAAACTCACTTGTCATCAAAAGGTTGTCTAAATAAATACCTTAAAATATAGTGTTAGCATTTCTAGTGTTTAAGAAATATTCATTCAGTTCGATTCTTTTTGTGAATCGGTATAGGCAGTTGGAATCTTCGGGCAGAAAGATATCGTGAGATTGGATCTTACAAAAGACGGTGTACAGGTGGACAAGGCGAAAGTGGGACCTTTTGGCTCGATTCTTTTTGACTCTGCTATTTCAGTAGCATCCAGCTCAGAGTAGGACCCTTGACTTTCCTTTTATATTTTCCAAACTGCATTTGTATATTTCTAGTATGCTAAGTCTAAGTAATCATCTGGGAGGTACATATGCTTTTATATTAGAGACAGGCTAACTAAATACCTAGATACGAGGTAACAATAAAAAAACACCTAGATACTTGTAACAATACAGATATAATTATGGTTGGAGTTTAGACTGATATAATTAAAGTCATTTCTTCACTCCTCCTCAAGCAGCAGAATGGATATCATCTATTCTCAACTTGGATACTAACCCATGAAATATTGAGCTGTGAAGTTCTTTTGTTGCCACGTCTTCCTAGTTGAACTACGAGGGCATGTAACTAACGCAAACTATCCTTTTCTCCAATTTCTTCTTGATAAAATGTTGATCAATTTCAATATGATTTGTCATATTACTTTGGATAGGATTATGTGCAATATTGATCGCATGACTTTGACACATGACTTCAATCTCAAGGCATGACTTTGACACATGACTTCAATCTCAAGGCACGACTTTGACACATGACTTCAATCTCAAGGCATGACTTTAACAACTATGAATAAATTTGGTTAAGGTGGACTTCAACCTCAAGGCTTTAAATAGATTTTGTTAAGGTGGAAGATTGGTAGTTATGAAGGCTCTAAATAAATTTGGTCAAGGCGGAAGCTCGATAGCTATGAAGACTTTGAATAAATTTGGTAAAGGTGGAAGATCGGTAGTTATGAAGGTTCTTAATAAATTTGGTCAAGTTGGAAGATCGGTAGCTTGAAAGCTCCTAATAAATTTGTTCAAGATGGAAGATCGACAACTATATAGGTTCGAATAAATATGGTTAAAATGGAGGATTGGTAGCTATGAAAGCTATGCCGATTAATCAGGCCTCCCAGAAAAGAATGGTTCTGATACTATGTTACAGGTCTAAATAACCTTGCTCATAGTTCTTAATTTTCAGAGAGGTACATGTGTCTAAATCCCTTAATACTAGGTAACAATACGAAAAATAAATCTAACCAAATCTCTAGATATTAGGTAATAATATCGAAGATTATATCTAACTAAATCCATAGATGTTAGATAACAATAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCTAACTAAATTCGCAGTTCCTTGGGTTTTAATGTTTTATGTATAAAAATCACCATAAATTCGCAGTTCCTATGTTTGCATTTGGAAATCCGATAACCTTCTTAGTGTCATTATTGCTTTCCATGTATCGTTTTATCTCTTGTTATTCGTTTTTTTGAAACTGAACAAATCTTTTCAATTATGTTACAGGATGGAAACATGGGTTCTAGAGTTTGTTGA

mRNA sequence

CCAATCATAGAATACGCATACGGCAGCTCTTCTATTCCGCCATCTGTAAGGCCGATCGGCTTTTGTTATATCGGGGTGGAGGATTTTCTCTCTCTTTTCGGAGCAAAAAGTATCAAGCTGAGGTCATTTTGCTTTCAATTTTTCCAATACAACCTCCACATGGAGGTTAAGATTCTTCTACTTCAGCATCTCCCATAAACATGGATTCCCATTTCGAAGGGTCCTTGAATTGAAACAGAAATGCTCTGTAAACTCCCTTCAACGTGTTTAAAATCATCTTCCGCAGGATTGGATCCTTCGATTTCTCACCATGGTTATAATCGCAAGTTTGGGTGTTCCACTAGAAAGAATGTTCCAGAGCTCAAGTATCGGTTTAAGATTGTGGGTCTTTCTAAGGGAGATAAATGGCCTCTCAATGACATTGATGCAAACGCAGTGCAACAAAACTTAAACAAATGGCTGCTGAAGACGCAAAACTTCTTAAATGAAGTTACATCTCCCCGTGGAAAAATCAGTAAGAACAAAGACCATATTCCTACAGGAGCATCCACCGACATAGAGGATCTAGTTATGGCGGAATATACTGTTAATATCAGGACACCAAATGGCCTCCTCTCTTCTACTGCTGTTGTATCCATTGAGCAATTTAGCAGGATGAATGGCTTGACTGGGCAGAAAATGCAGAGGATATTTAAAGCCCTTGTGCCTGAATCTGTTTACAATGATGCTCGCAGTCTGGTAGAGTATTGCTGTTTTAGATTCTTGTCAAGGGACAGCTCAAATCTTCATCCTTCCCTCAGTGAACCCACATTCCAGAGATTGATATTTATAACAATGCTTGCTTGGGAAAATCCATATCACGAGCATACTAATGCTTCAGAGGAAATTGCTTTTCAGAAGATGTTAGTTGGAGAAGAGGCTTTTACACGTATTGCACCAGCTATTTCTGGTGTTGCAGATCGATCCACAGTACATCATCTATTCAAGGCACTTGCGGGTGATGAACAGAGCATCTCTTTCAGTTTGTGGCTCAAATATGTTGATGAACTGCTCAAGGTCCATGAAGGACGAAAATTATATCGAGTTCGAGATAACAGACAGTTCTTTGGTGAGAATATCCTATGTATTGGTTCCAGCAAGAAGAGACCCGTCTTAAAATGGGAGAATAATATTGCATGGCCAGGAAAACTTACTCTTACCGACAAAGCTGTTTATTTCGAGGATGGAAACATGGGTTCTAGAGTTTGTTGA

Coding sequence (CDS)

ATGCTCTGTAAACTCCCTTCAACGTGTTTAAAATCATCTTCCGCAGGATTGGATCCTTCGATTTCTCACCATGGTTATAATCGCAAGTTTGGGTGTTCCACTAGAAAGAATGTTCCAGAGCTCAAGTATCGGTTTAAGATTGTGGGTCTTTCTAAGGGAGATAAATGGCCTCTCAATGACATTGATGCAAACGCAGTGCAACAAAACTTAAACAAATGGCTGCTGAAGACGCAAAACTTCTTAAATGAAGTTACATCTCCCCGTGGAAAAATCAGTAAGAACAAAGACCATATTCCTACAGGAGCATCCACCGACATAGAGGATCTAGTTATGGCGGAATATACTGTTAATATCAGGACACCAAATGGCCTCCTCTCTTCTACTGCTGTTGTATCCATTGAGCAATTTAGCAGGATGAATGGCTTGACTGGGCAGAAAATGCAGAGGATATTTAAAGCCCTTGTGCCTGAATCTGTTTACAATGATGCTCGCAGTCTGGTAGAGTATTGCTGTTTTAGATTCTTGTCAAGGGACAGCTCAAATCTTCATCCTTCCCTCAGTGAACCCACATTCCAGAGATTGATATTTATAACAATGCTTGCTTGGGAAAATCCATATCACGAGCATACTAATGCTTCAGAGGAAATTGCTTTTCAGAAGATGTTAGTTGGAGAAGAGGCTTTTACACGTATTGCACCAGCTATTTCTGGTGTTGCAGATCGATCCACAGTACATCATCTATTCAAGGCACTTGCGGGTGATGAACAGAGCATCTCTTTCAGTTTGTGGCTCAAATATGTTGATGAACTGCTCAAGGTCCATGAAGGACGAAAATTATATCGAGTTCGAGATAACAGACAGTTCTTTGGTGAGAATATCCTATGTATTGGTTCCAGCAAGAAGAGACCCGTCTTAAAATGGGAGAATAATATTGCATGGCCAGGAAAACTTACTCTTACCGACAAAGCTGTTTATTTCGAGGATGGAAACATGGGTTCTAGAGTTTGTTGA

Protein sequence

MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIEDLVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSSNLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVADRSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFEDGNMGSRVC
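
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The following is a sketch using the standard genetic code only; the function name `translate` is illustrative, and only the first 24 and last 18 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS (length divisible by 3) up to the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with N etc.
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last six codons of the CDS listed above.
print(translate("ATGCTCTGTAAACTCCCTTCAACG"))  # MLCKLPST
print(translate("GGTTCTAGAGTTTGTTGA"))        # GSRVC
```

Both fragments agree with the start (MLCKLPST) and end (GSRVC) of the protein sequence; passing the full CDS string to `translate` would be the complete check.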
Homology
BLAST of CmoCh19G003250 vs. ExPASy TrEMBL
Match: A0A6J1HG64 (uncharacterized protein LOC111463987 OS=Cucurbita moschata OX=3662 GN=LOC111463987 PE=4 SV=1)

HSP 1 Score: 671.0 bits (1730), Expect = 2.6e-189
Identity = 327/327 (100.00%), Positives = 327/327 (100.00%), Query Frame = 0

Query: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND 60
           MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND
Sbjct: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND 60

Query: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIEDLVMAEYTVNIRT 120
           IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIEDLVMAEYTVNIRT
Sbjct: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIEDLVMAEYTVNIRT 120

Query: 121 PNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSS 180
           PNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSS
Sbjct: 121 PNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSS 180

Query: 181 NLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVAD 240
           NLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVAD
Sbjct: 181 NLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVAD 240

Query: 241 RSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSK 300
           RSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSK
Sbjct: 241 RSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSK 300

Query: 301 KRPVLKWENNIAWPGKLTLTDKAVYFE 328
           KRPVLKWENNIAWPGKLTLTDKAVYFE
Sbjct: 301 KRPVLKWENNIAWPGKLTLTDKAVYFE 327
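
The percentages on each HSP summary line are the match counts divided by the alignment length (gap columns included). A small sketch for recomputing them from the report text, assuming the line layout used on this page; the name `hsp_percentages` is hypothetical, and the pattern tolerates the "Postives" spelling that some report versions emit.

```python
import re

# Matches e.g. "Identity = 292/329 (88.75%), Positives = 304/329 (92.40%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 292/329 (88.75%), Positives = 304/329 (92.40%), Query Frame = 0"
identity, positives = hsp_percentages(line)
print(f"{identity:.2f} {positives:.2f}")  # 88.75 92.40
```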

BLAST of CmoCh19G003250 vs. ExPASy TrEMBL
Match: A0A6J1HX34 (uncharacterized protein LOC111467032 OS=Cucurbita maxima OX=3661 GN=LOC111467032 PE=4 SV=1)

HSP 1 Score: 665.6 bits (1716), Expect = 1.1e-187
Identity = 325/327 (99.39%), Positives = 325/327 (99.39%), Query Frame = 0

Query: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND 60
           MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPE KYRFKIVGLSKGDKWPLND
Sbjct: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPEPKYRFKIVGLSKGDKWPLND 60

Query: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIEDLVMAEYTVNIRT 120
           IDANAVQQNLNKWLLKTQNFLNEVTSPRGK SKNKDHIPTGASTDIEDLVMAEYTVNIRT
Sbjct: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKNSKNKDHIPTGASTDIEDLVMAEYTVNIRT 120

Query: 121 PNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSS 180
           PNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSS
Sbjct: 121 PNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSS 180

Query: 181 NLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVAD 240
           NLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVAD
Sbjct: 181 NLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGVAD 240

Query: 241 RSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSK 300
           RSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSK
Sbjct: 241 RSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGSSK 300

Query: 301 KRPVLKWENNIAWPGKLTLTDKAVYFE 328
           KRPVLKWENNIAWPGKLTLTDKAVYFE
Sbjct: 301 KRPVLKWENNIAWPGKLTLTDKAVYFE 327

BLAST of CmoCh19G003250 vs. ExPASy TrEMBL
Match: A0A6J1DTB7 (uncharacterized protein LOC111023786 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111023786 PE=4 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 4.6e-162
Identity = 292/329 (88.75%), Positives = 304/329 (92.40%), Query Frame = 0

Query: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND 60
           MLCKLPST LK+S AGL+P ISHHG  RKFGCSTR N+PE K+RFK+VGLS GDKW L D
Sbjct: 1   MLCKLPSTFLKASPAGLEPPISHHGDKRKFGCSTR-NIPEPKFRFKLVGLSMGDKWHLKD 60

Query: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGA--STDIEDLVMAEYTVNI 120
           IDANAVQQNLNKWLLKTQNFLNEVTSP GK SKNKDHIP GA  S +IE++VMAE+TVNI
Sbjct: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPLGKTSKNKDHIPAGAFDSIEIENIVMAEHTVNI 120

Query: 121 RTPNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRD 180
            TPNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKAL PESVYNDARSLVEYCCFRFLSRD
Sbjct: 121 STPNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALAPESVYNDARSLVEYCCFRFLSRD 180

Query: 181 SSNLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGV 240
           SSN+HPSLSEPTFQRLIFITMLAWENPYHE   ASEEI+FQKMLV EEAFTRIAPAISGV
Sbjct: 181 SSNIHPSLSEPTFQRLIFITMLAWENPYHE--PASEEISFQKMLVREEAFTRIAPAISGV 240

Query: 241 ADRSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGS 300
           ADRSTVH+LFKALAGDEQSIS SLWLKYVDELLKVHEGRKLYRVRDNRQF GENIL IGS
Sbjct: 241 ADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKVHEGRKLYRVRDNRQFSGENILSIGS 300

Query: 301 SKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
           SKKRPVLKWENNIAWPGKLTLTDKAVYFE
Sbjct: 301 SKKRPVLKWENNIAWPGKLTLTDKAVYFE 326

BLAST of CmoCh19G003250 vs. ExPASy TrEMBL
Match: A0A1S3B8Z7 (uncharacterized protein LOC103487477 OS=Cucumis melo OX=3656 GN=LOC103487477 PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 3.1e-158
Identity = 284/329 (86.32%), Positives = 299/329 (90.88%), Query Frame = 0

Query: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND 60
           ML KLPST LK S+AGLDPSIS       FGC TR NVPE KYRFK+VGLS GDKWPLND
Sbjct: 1   MLFKLPSTYLKPSTAGLDPSISLRADKLIFGCFTR-NVPERKYRFKLVGLSMGDKWPLND 60

Query: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGA--STDIEDLVMAEYTVNI 120
           IDANAVQQNLNKWLLKTQNFLNEVTSPR K SKNK+HIP GA  +T+ ED+V  E TVNI
Sbjct: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPREKTSKNKNHIPAGAYGTTEKEDIVKVECTVNI 120

Query: 121 RTPNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRD 180
           RTPNGLLSS AVVSIEQFSRMNGLTGQKMQRIFKALV ESVYNDARSLVEYCCFRFLSRD
Sbjct: 121 RTPNGLLSSAAVVSIEQFSRMNGLTGQKMQRIFKALVHESVYNDARSLVEYCCFRFLSRD 180

Query: 181 SSNLHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEIAFQKMLVGEEAFTRIAPAISGV 240
           SSN+HPSLSEPTFQRLIFITMLAWENPYH+H + SEEI+FQKMLV EEAFTRIAPAISGV
Sbjct: 181 SSNIHPSLSEPTFQRLIFITMLAWENPYHDHASVSEEISFQKMLVREEAFTRIAPAISGV 240

Query: 241 ADRSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIGS 300
           ADRSTVH+LFKALAGD++SIS SLWLKYVDEL++VHEGRKLYRVRDN QFFGENILCIGS
Sbjct: 241 ADRSTVHNLFKALAGDKESISLSLWLKYVDELIRVHEGRKLYRVRDNTQFFGENILCIGS 300

Query: 301 SKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
           SKKRPVLKWENNIAWPGKLTLTDKAVYFE
Sbjct: 301 SKKRPVLKWENNIAWPGKLTLTDKAVYFE 328

BLAST of CmoCh19G003250 vs. ExPASy TrEMBL
Match: A0A251RJ64 (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G566800 PE=4 SV=1)

HSP 1 Score: 449.1 bits (1154), Expect = 1.6e-122
Identity = 225/330 (68.18%), Positives = 258/330 (78.18%), Query Frame = 0

Query: 1   MLCKLPSTCLKSSSAGLDPSISHHGYNRKFGCSTRKNVPELKYRFKIVGLSKGDKWPLND 60
           ML K+  T LK+S +      S HG  R+FG   R +    K RFKIVG S GD+W LN+
Sbjct: 1   MLSKISVTHLKASPSASTSGFSWHGNQRRFGYCARNSSQFNKPRFKIVGKSLGDRWKLNE 60

Query: 61  IDANAVQQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGA--STDIEDLVMAEYTVNI 120
           IDANAVQ+ LN WLLKTQNFLNEVTSP  + S+ +  +   A  + D+ED+ MAE T+N 
Sbjct: 61  IDANAVQEKLNSWLLKTQNFLNEVTSPLVRTSQTRKPVTRDAFETQDMEDIFMAEQTINN 120

Query: 121 RTPNGLLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRD 180
           RTPNG+LS  A+VSIEQFSRMNGLTGQKMQRIFKALV ES YNDAR+LVEYCCFRFLSRD
Sbjct: 121 RTPNGVLSLAAIVSIEQFSRMNGLTGQKMQRIFKALVSESTYNDARNLVEYCCFRFLSRD 180

Query: 181 SSNLHPSLSEPTFQRLIFITMLAWENPYHEH-TNASEEIAFQKMLVGEEAFTRIAPAISG 240
           +S++HPSL EP FQRLIFITMLAWENPY E   N SE+ +FQ  LV EEAF R+APAISG
Sbjct: 181 NSDIHPSLKEPAFQRLIFITMLAWENPYQEDLANGSEKASFQSKLVREEAFVRVAPAISG 240

Query: 241 VADRSTVHHLFKALAGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIG 300
           VADRST H+LFKALAGDEQ IS SLWL YVDEL+KVHEGRK Y+ R +     E ILCIG
Sbjct: 241 VADRSTAHNLFKALAGDEQGISLSLWLTYVDELIKVHEGRKSYQTRQSPDLSEERILCIG 300

Query: 301 SSKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
           SS+KRPVLKWENN+AWPGK+TLTDKA+YFE
Sbjct: 301 SSRKRPVLKWENNMAWPGKVTLTDKAIYFE 330

BLAST of CmoCh19G003250 vs. TAIR 10
Match: AT1G71240.2 (Plant protein of unknown function (DUF639))

HSP 1 Score: 330.9 bits (847), Expect = 1.2e-90
Identity = 178/330 (53.94%), Positives = 220/330 (66.67%), Query Frame = 0

Query: 10  LKSSSAGLDPSISHH--GYNRKFGCSTRKN-VPELKYRFKIVGLSKGDKWPLNDIDANAV 69
           LK SSA  D  I     G      CS  +N     K R +IV      KW LNDID N V
Sbjct: 11  LKLSSATPDFPIGFRCDGVRVHRRCSFSRNCASNRKPRLRIVAQK---KWKLNDIDTNVV 70

Query: 70  QQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIED---LVMAEYTVNIRTPNG 129
           Q+  ++W+ K+Q  L++VTSP  K S++   I      D ED   L+  E TV   TP G
Sbjct: 71  QERFSQWVSKSQKILSDVTSPLKKKSQSLKKIDLEDQQDFEDLEELLTVEQTVRSDTPKG 130

Query: 130 LLSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSSNLH 189
            LS  A++SIEQFSRMNG+TG+KMQ IF+ +V  ++  DAR LVEYCCFRFLSRDSS  H
Sbjct: 131 FLSFDAIISIEQFSRMNGITGKKMQDIFETIVSPALSTDARYLVEYCCFRFLSRDSSEFH 190

Query: 190 PSLSEPTFQRLIFITMLAWENPYHEHTN----ASEEIAFQKMLVGEEAFTRIAPAISGVA 249
           P L EP FQRLIFITMLAW NPY +  N    AS + +FQ   +GEEAF RIAPAISG+A
Sbjct: 191 PCLKEPAFQRLIFITMLAWANPYCKERNARNDASGKPSFQGRFIGEEAFIRIAPAISGLA 250

Query: 250 DRSTVHHLFKAL--AGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCIG 309
           DR+TVH+LFKAL  A D++ IS  +WL Y+ EL+K+HEGRK ++  D  Q   E +LC+ 
Sbjct: 251 DRATVHNLFKALATATDQKGISLEIWLAYIQELVKIHEGRKSHQTTDFPQLSSERLLCMA 310

Query: 310 SSKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
           +++K PVLKWENN+AWPGKLTLTDKA+YFE
Sbjct: 311 ANRKGPVLKWENNVAWPGKLTLTDKALYFE 337

BLAST of CmoCh19G003250 vs. TAIR 10
Match: AT1G71240.1 (Plant protein of unknown function (DUF639))

HSP 1 Score: 326.2 bits (835), Expect = 3.0e-89
Identity = 178/331 (53.78%), Positives = 220/331 (66.47%), Query Frame = 0

Query: 10  LKSSSAGLDPSISHH--GYNRKFGCSTRKN-VPELKYRFKIVGLSKGDKWPLNDIDANAV 69
           LK SSA  D  I     G      CS  +N     K R +IV      KW LNDID N V
Sbjct: 11  LKLSSATPDFPIGFRCDGVRVHRRCSFSRNCASNRKPRLRIVAQK---KWKLNDIDTNVV 70

Query: 70  QQNLNKWLLKTQNFLNEVTSPRGKISKNKDHIPTGASTDIED---LVMAEYTVNIRTPNG 129
           Q+  ++W+ K+Q  L++VTSP  K S++   I      D ED   L+  E TV   TP G
Sbjct: 71  QERFSQWVSKSQKILSDVTSPLKKKSQSLKKIDLEDQQDFEDLEELLTVEQTVRSDTPKG 130

Query: 130 LLSSTAVVSIEQF-SRMNGLTGQKMQRIFKALVPESVYNDARSLVEYCCFRFLSRDSSNL 189
            LS  A++SIEQF SRMNG+TG+KMQ IF+ +V  ++  DAR LVEYCCFRFLSRDSS  
Sbjct: 131 FLSFDAIISIEQFSSRMNGITGKKMQDIFETIVSPALSTDARYLVEYCCFRFLSRDSSEF 190

Query: 190 HPSLSEPTFQRLIFITMLAWENPYHEHTN----ASEEIAFQKMLVGEEAFTRIAPAISGV 249
           HP L EP FQRLIFITMLAW NPY +  N    AS + +FQ   +GEEAF RIAPAISG+
Sbjct: 191 HPCLKEPAFQRLIFITMLAWANPYCKERNARNDASGKPSFQGRFIGEEAFIRIAPAISGL 250

Query: 250 ADRSTVHHLFKAL--AGDEQSISFSLWLKYVDELLKVHEGRKLYRVRDNRQFFGENILCI 309
           ADR+TVH+LFKAL  A D++ IS  +WL Y+ EL+K+HEGRK ++  D  Q   E +LC+
Sbjct: 251 ADRATVHNLFKALATATDQKGISLEIWLAYIQELVKIHEGRKSHQTTDFPQLSSERLLCM 310

Query: 310 GSSKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
            +++K PVLKWENN+AWPGKLTLTDKA+YFE
Sbjct: 311 AANRKGPVLKWENNVAWPGKLTLTDKALYFE 338

BLAST of CmoCh19G003250 vs. TAIR 10
Match: AT1G48840.1 (Plant protein of unknown function (DUF639))

HSP 1 Score: 96.7 bits (239), Expect = 3.9e-20
Identity = 73/214 (34.11%), Positives = 113/214 (52.80%), Query Frame = 0

Query: 125 LSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDA---RSLVEYCCFRFLSRDSSN 184
           LS  A V I + S++ G+   ++Q  FK    ESV   +   R+ +EYCCFR L+  S  
Sbjct: 49  LSPVANVVIRRCSKILGVAVSELQDSFKQEASESVKQPSMFPRNFLEYCCFRALAL-SVG 108

Query: 185 LHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEI--AFQKMLVGEEAFTRIAPAISGVA 244
           +   LS+ +F+RL F  M+AWE P    + AS+ +    +   VG EAF+RIAPA+  +A
Sbjct: 109 VTGHLSDKSFRRLTFDMMVAWEVP----SAASQTLLSVDEDPTVGLEAFSRIAPAVPIIA 168

Query: 245 DRSTVHHLFKALAGDEQSI--SFSLWLKY---VDELLKVHEGRKLYRVRDNRQFFGENIL 304
           D     +LF  L     S+   F ++ KY   ++  +K  + +    +    +  GE IL
Sbjct: 169 DVIICENLFGILTSVSNSVRLQFYVYDKYLYGLERAIKKMKSQSESSLLSGVRSKGEKIL 228

Query: 305 CI-GSSKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
            + G+   +PVL+      WPG+L LTD ++YFE
Sbjct: 229 ELDGTVTTQPVLEHIGISTWPGRLILTDHSLYFE 257

BLAST of CmoCh19G003250 vs. TAIR 10
Match: AT5G23390.1 (Plant protein of unknown function (DUF639))

HSP 1 Score: 96.7 bits (239), Expect = 3.9e-20
Identity = 74/246 (30.08%), Positives = 115/246 (46.75%), Query Frame = 0

Query: 125 LSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESV---YNDARSLVEYCCFRFLSRDSSN 184
           LS  A   + + S++  +  + +Q  F   +PESV      AR+ +E+C F+ L +    
Sbjct: 57  LSLLANSVVSRCSKILNIQTEDLQHHFDVELPESVKQLLTYARNFLEFCSFQALHQVMKK 116

Query: 185 LHPSLSEPTFQRLIFITMLAWENP----------------------------YHEHTNAS 244
               LS+  F++L+F  MLAWE P                            Y   TN +
Sbjct: 117 -PDYLSDQEFRQLMFDMMLAWETPSVTSEQENKDAASPSKQDSEDEDGWSLFYSSPTNMA 176

Query: 245 EEIAFQKMLVGEEAFTRIAPAISGVADRSTVHHLFKALAGDE----QSISFSLWLKYVDE 304
            ++  +K  VG+EAF RIAP    +AD  TVH+LF AL          I +  +L+ +D+
Sbjct: 177 MQVD-EKKSVGQEAFARIAPVCPAIADAITVHNLFDALTSSSGHRLHYIVYDKYLRTLDK 236

Query: 305 LLKVHEGRKLYRVRDNRQFFGENILCI-GSSKKRPVLKWENNIAWPGKLTLTDKAVYFED 335
           + K  +        + +   GE +L + G++   PVLK     AWPGKLTLT+ A+YF+ 
Sbjct: 237 IFKAAKSTLGPSAANLQLAKGEIVLDMDGANPVLPVLKHVGISAWPGKLTLTNCALYFDS 296

BLAST of CmoCh19G003250 vs. TAIR 10
Match: AT3G18350.1 (Plant protein of unknown function (DUF639))

HSP 1 Score: 90.1 bits (222), Expect = 3.6e-18
Identity = 73/217 (33.64%), Positives = 110/217 (50.69%), Query Frame = 0

Query: 125 LSSTAVVSIEQFSRMNGLTGQKMQRIFKALVPESVYNDA---RSLVEYCCFRFLSRDSSN 184
           LS  A V + + S++ G++  +++  FK    ES+   +   R+ +EYCCFR LS  S  
Sbjct: 49  LSPIANVVVRRCSKILGVSANELRDSFKQEAFESLKQPSLFPRNFLEYCCFRALSL-SVG 108

Query: 185 LHPSLSEPTFQRLIFITMLAWENPYHEHTNASEEI--AFQKMLVGEEAFTRIAPAISGVA 244
           +   L++  F+RL F  M+ WE P      AS+ +    +   V  EAF+RIAPA+  +A
Sbjct: 109 VTGHLADKKFRRLTFDMMVVWEVP----AVASQALLSVEEDATVSLEAFSRIAPAVPIIA 168

Query: 245 DRSTVHHLFKALAGDEQS-ISFSLWLKYVDELLKV-------HEGRKLYRVRDNRQFFGE 304
           D     +LF+ L       + FS++ KY+  L +         E   L  VR  R    E
Sbjct: 169 DVIICDNLFQMLTSSTGGRLQFSVYDKYLHGLERAIKKMRTQSESSLLSGVRSKR----E 228

Query: 305 NILCI-GSSKKRPVLKWENNIAWPGKLTLTDKAVYFE 328
            IL I G+   +PVL+      WPG+L LTD ++YFE
Sbjct: 229 KILEIDGTVTTQPVLEHVGISTWPGRLILTDHSLYFE 256

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
A0A6J1HG64    2.6e-189   100.00    uncharacterized protein LOC111463987 OS=Cucurbita moschata OX=3662 GN=LOC1114639...
A0A6J1HX34    1.1e-187   99.39     uncharacterized protein LOC111467032 OS=Cucurbita maxima OX=3661 GN=LOC111467032...
A0A6J1DTB7    4.6e-162   88.75     uncharacterized protein LOC111023786 isoform X1 OS=Momordica charantia OX=3673 G...
A0A1S3B8Z7    3.1e-158   86.32     uncharacterized protein LOC103487477 OS=Cucumis melo OX=3656 GN=LOC103487477 PE=...
A0A251RJ64    1.6e-122   68.18     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G566800 PE=4 SV=1

Match Name    E-value    Identity  Description
AT1G71240.2   1.2e-90    53.94     Plant protein of unknown function (DUF639)
AT1G71240.1   3.0e-89    53.78     Plant protein of unknown function (DUF639)
AT1G48840.1   3.9e-20    34.11     Plant protein of unknown function (DUF639)
AT5G23390.1   3.9e-20    30.08     Plant protein of unknown function (DUF639)
AT3G18350.1   3.6e-18    33.64     Plant protein of unknown function (DUF639)

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term    Source Description                                       Alignment
None      No IPR available  PANTHER  PTHR31860      HEAT-INDUCIBLE TRANSCRIPTION REPRESSOR (DUF639)-RELATED  coord: 33..329
None      No IPR available  PANTHER  PTHR31860:SF3  PROTEIN, PUTATIVE (DUF639)-RELATED                       coord: 33..329

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh19G003250.1  CmoCh19G003250.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane