Chy4G074980 (gene) Cucumber (hystrix) v1

Overview
NameChy4G074980
Typegene
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionSerine/arginine repetitive matrix protein 2, putative isoform 2
LocationchrH04: 9973912 .. 9995558 (+)
RNA-Seq ExpressionChy4G074980
SyntenyChy4G074980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATATATATGACAATATCCTTCCGTACCACAACCGTTCCTTTTAGGATAAAATAAAATTAGATTATAGCAATTCCCATTTCCAATCCTCAACCACAGTTCTTCTGGGCTCTCTATCAGCTTCTCCATTTTTCCATTTATCTCCAAATTCCTTATGTCTCTCTCGCCTCTATCCAAACCCAAATCCCTAACCCTAACCCTAAATCTCCTCCCTCCTCACCTCTTCCGCCGTCCTTTCTCCTCCCCTTCCGATCACGATCTCACCGACCTCCCTGAATCTCCTTCTTCTTCTGATCCCCTTCTTCGCAACCTTGAGGACGCTATTCAACGCATCCTCGTTCACCGATCTGCCCCGATTGGCTTTCCTTCGTTCTCGGTGCTTCCTATCGGGTTCCTCTGCCTTCTAATTCCCATTTACCTCTCATTGCCAACGTTCTTCGTAACTTAGCTAACCCTCTCTCTCCTGAATAATCCTTGTCTACCACTACCATCTGTGGCTGGCTCTCTTCTCATTACTTCATTCAAGGTATTCCTCTTCTTTCGTATTTCGTTTTCATTTTATCTCTACTCTCTTTCTCTCTTTAAATTATTTATTTAGGAAATAAAATACCCTTTCATCGTAGGTACGCATCTTCCTTCTCTTGATCCGGTGGTTGACACCACGTTGACTGAATGTGATGCTTCCCTTGACCACGAGGAAGGATGAGAATCCGATTTGGGTGTGTTCCTATTAGTTCTCATACTAAACTAGTTTCTGCCGCTAAAGTGAAGTCTGAATTATATTTTCTTTTGCCTTGCAATTTATTAAATTTGTATATTGAGAGACTTTTAATAAAAAAAATTAACTTGATAGCTCCTTTATTGTGGTTAATTTCTACAAATTAACACTCTTCAGATTGTCAAATTCTCTACTTTCCTCTACTCTCACTTCATATTGTAACGAGAAGCACACTTCGGTTTGTAGCTTGCTTGGTGTTGTTTGAAATCTTGTATAATTTTTGGAAGATCATTCCTTTTTAGCTCTGTTAGGTTCATCTACTCGCCTCGAGGCTGTTACCTTTCTCTTTTGATATTAACCTTGTTTCTTAAAAACAAGACTGTAGATTTTGGGCGTCTATTGTAAAGAGCGTTCTTTATGTTGGAAGAATTGAGGGTGGGGGATTAGAAATGCCATGAAAACCCGTAACTGGTAATGTTCTCGAGTGGGCGAGTTAGGGATATTTGCATCCTTGAACTGGTGCAAAGTGTCATGATTATGAAAGTAAATTGCCATTGAATCTGTTGGGCATCCCATGAGTTTGATTAGTAGTGAGTGTGGATGGATTTTAATCTTTCGAGTTATTATTTGTTGATCATTTGAGAACTAAATAACATTTTCTTGTAAAAAATACAAGATCCTAGAGATAATGAAAGTAATACAAATGATGGCATTTTCTTCTACTACTCTTTGCAGCTCGCCTTGCCTTGTAAACCTCATGTATAGAAGAAGACAACTGAATAACAAGCTAAGCAGAATGGCCGAGTAAGGGGTAGAAGCTTGTACAGATGTGAACTGGTCGGTGAGTGCTATTTATAAAATGGCCCTTCGTCGATCATTTAGAAGCTTCTGCTCTTAAACGTTGTTAAATGTTTTTCCTTTTTAGTAACTTATTTCTTTCAACTTTTTGCATATGTATTTTGTTCTCTTGTTCTATTTTAAATACTTCTTTCTTTCAACTTTAAGCATTCTGAAACATGATTCTCCTCCCTCCGACCCCATGCCTCATAGGAGGGCACTATTCTCGTTTGAGAAGCTGTTGACTTGTGAATAGGACATTAAACCACATATGTTGCAGTTCAAGCAAGTTAGCCTATTTAGATCCAAATATAATAATTTGTCTCGCCTAGTTCGTTTTGGATGGCTACATGATATCTTCATCTGTATTGCAAATTAAAGTTCAATCTGTGATCTAAACCGTCTCTTGATTATTCTTTTCAAAAGCAAGTATCGAGGTGATAGCTTAGAATGGAAGAAAAGTTTGAAGATATACATTTTATAATTTGGAAGAGATGCAATTATTTGTATGTGAATGAAGCGAAGGTAAAGTCTTGTAAAGACTATTAGGGAGAATCTTGTGAGTATAGACGAGGAAGGTGCAACTGGGGAACTTTTCTTATTTTGACATTTGGTAATGTGTCTTGGAGATTCTTCTAGGGGTACGTTGTCCTATAGCTTAGATGTTGCATTATTTATGCTTCCCGAAGGTAGAGACAGATTGTCAGGAGATGTTATGAAAAAGATGGAGTGACTAAATGACCGTAAAATGGCTGAATTCTAATTCTAAACACAATTATATAGGATTATTTTGGTAAGAGGTTTTTTAAAAATAGAACAAAGCTTGAAAATATTTACATTGTATAGAATAATTTTAAAAAAAGAAAAAAACTCACGAGTCCACAAATAAGTCATACACTTACCAAATCTAAACAATCTTGTAACAATGACTACTTAAATCTAAACAATCTTGTACCTTTGTATTCAACCTATATGTAATCTATACAATCTTGTATCCTTTTACCTACGATTTTGTATCAAATCAAAATGATCTATTAGTGATAGTGGTTAGTGTTCTATCTGGGATAGACAACTAATCATTTAGATTCTGCACCAACATCATTTAAATATTGTTCTAAGAAGAAAAAAAAGAAGATGAGAAATGAAGAAGTAAGAAATTTGGAAGATGAAATGAAGAAATTGATAAATATTTACATAAATAACCATAATATTGTAAGATGAAGAAGTACTAATATGAATAAATTAATAATACGAAGAAGAGAATAATAAATCGCAAAAAAGAAAGAAGAAAAAGAAGTGAAAAATCGTCAGCATTATTCAAGAACAAATATGAAATTTATGAAAAAATCATGGACTTTTTATTTTGTTACACGAATCGAAAATATTTTAATGTTTTGTTATATTTATGAAAATTAACCTTTTGGTAATTCGTTTGTTTTTAAAAACTTAATATTTAAATAATAAGCATATTTGGTGACTAGTCTACTTTTTTTTATTTTTTAATCAGAAACTTGCTTTTAAAATTATAAAGGAAAATTGAAAATGAAAAAAAAATGGAGCTTTAAAAATGTGTATTTTGTTTTTAAAAATATAAAACAATATTGATTTTCGTTTTTATTGAACAAAAGATTAAAAAACAAGAAAAAACAAAAACAAAAAATTTGAAAGAGTTATAAGATGGAGTCTACTGTAGGCATCAACCATGAGGTGGGAAGTGTCATGGTATTCCTCTTTATTATATTATTTATTATTTGTTAATAGTATTTATTATTTGTCATTCATTTTTTTTTTCGTATTCTTCCTCTATTGTTATCATCGTGTTTACTTTCCCAATCAAAATAAAATCAGCTTGTATTTTAAACACAAATTATAGTAACTATTATAATTCATTTATTATAATAATCAATTTATTATCTCAAACCTAAAAGAATAAGGATGTATTTAGATTATCTTTGAAAGTATTTAAATTTGAAAAAAAAAATATTTCAAAAAAATTTGAGATGTTTGGTAACTTTTCAAAATGTATTTTTAAATAATCAGATTGAAAAATAATCCAATTTAAAGAAAACACTCATCACCTTGAGAAAATAGGTTTACAACCTCATGCAGATGATTTGAATATATTCTTAAAGATGATATGTAGAGAACTTGGACATTTTAGTTGTTTTCATTAATTGTTTATTTTGCTATTAATGACAATGAGAGTGTTGTTTATTATATTTCGTTTGTTTTTTGTTTTAGTTTAATTTTTATTCTGTCAAGTTTATATTTTGTGTGAATTTATTGTATTTGAATTTATTATTTAATAAAATTTGTTTTTTTTTCTCTCTCTCTCTTTCTCCCTTACTCTTTAGTCTTCAAAGTTTTCAAACTCATCAAAGGAAAACATAGCTTGCCCTACTCCAATTCGCCATCCTTACAAATATGAGGCATAAAAGTCGAATCATGGAAGAACATGATAAAAACATGGATAAAATGAGACAAGATATTAATAATTTTGGGAAACAAGTTTAAAAAATACTAGAATTGCTTTTAGCAGGAAAAGGAAAAGTCGTCGTAGAAACAGCACAATCACGCAATCCAGTTCAAGACACCAATGATCCCATTTATCCTCCAGGATTTACGCCATGCCACATGAATGTTCCATAGTCTCAAACCACTCAATACTATGTTGCTATGAATCCGTTTTTTGTTGTTCCACCTCTTGTCCCAGATATAGAACAATTGGAAGCTCAGGCTAAAATTCAAGACATGGGACAGAATGAGAACACTCCGGGTAAGCAAAAATTAGATGTTTTGGAAGAAATGTTGCGAGCAATTGAAGGGACTGACATTTATGAAAATATAGATGCAACACAATTGTGTTTAGTGCCACACTTGATCATTCCAACAAAGTTTAAAGTTCCCGAATTTGACAAATATGATAGATTATTGTGTCCAATGAGTCATCTTATAATGTATTGTAGAAAGATGGCAGCGCACATTGGTAATGATAAATTGTTGATCCACTGCTTTCAGGATAGTTTGACTGGTCCAGCTACTCGATGGTATATTTAATTAGATAATGTACACATTCATGTTTGAAAGGATTTAGTTGATGCATTTCTAAAGCAATACAAACATAATATTGATATGGCTCCAGATCGTTTAGACCTACAGTGAATAGAGAAGAAGAGTTCAAAAAGCTTTAAAGAATATGCTCAACGATGGAGAGATATGGCCGCAGAGGTTTAACCACTGTTAATAGACAAAGAAATGGCATATATGTTTATGAATACTTTGCAGGCTCTGTTCTATGATCGAATGATTCGTAATGTTATAACCAATTTTTCTGACATTATTGTTATTGGTGAAAGAATTGAATATGGGAAAGCACGGGAGGTTAGAAGAGGCTTCGATTGAGTATGGGGGATTAAAGAAAGAAACAACATCTAAGAAGAAATAAGGTGAGGTTCATGCAATTGGTTTTCCTAATTCAGGGAACCACAAATCGATTTTGGGCCAAAGAAAACAACTAAAATTTTTCTTCATATATAAGCAATATTTCTCATATCCCTTGTAACAACTATGTACCAGCTCACTCTTTCTCTGGAGCCCCAAAACCTGTTAACTCAAACTCTTCTCGACCATTTGTACAAGGTTAGGGTAGCAAGACCAACTCAGATACATGGCGATTTGATTCAATTCTCATACAGAGCTTTTACCCCAGTTAATTCAAAATCGACATTAGCTCTTATCCCAATGAATCCTATATAACCTCCTTATCCAAAATGGTATGATTCAAATGTTCGATGTGATTATCATGTTGAAGGAGCGAGACATTCAACTGAAAATTGTTTAGCTTTGAAAAGGAAGGTGCAATCTTTAATTAACTCTGGATGGTTAAGCTTCAAAAAGCTGGTGAGAAGCCGAATGTCAACAACAATCCACTTCCTAATCATGAAAATTCAAAAGTGAACGTTAGAGATTGCCTTATTGAAAAGTATAAAAATGAAGTTCATGAGATAGTGATGCCAATGGAAAAACTTTTTGAAGCAGTAGTCTGGAATATTTAGACCCCAACATAAGATACGAAGGGTACGATGAAAGCAGACATTGTATATTCCATCAAGGAGTTGCAGTTCACGTTATCCAACAGTGTTGTAAATTTAAATCTAAAGTACAACAACTTATAGATTTAAAGATACTCACGATATATAGAGGACAAGAAAAAGATGAGATGAAAGACAATAAAATATGTGCATTAACGAATGAAGTTGCAAAAAAGGAAAATTCCTTTTTATCAAGGCCTTTGACAGCCTTTTATCAAGAAAGTAGTAATGAGTCAACTTCCTGCAATCCTAAACAACTCACTATCCAAGTACCGAGTCCTTTCAAATTTAAGGATATGAAAGCAGTGCCCTAACGCTAGTCATTATAGGTCCTTCAATTGATAATATTACAGGAATCAGTGGGATAACTCGAAGTGGAAAATGTTACAAACCAGATAATTTAACACCTCCTTTAGGTAGTTTAATACTGGGGTAAGGGAGGAAAAATGAGAAAAGAAATGTGAATGAACATGACAAAGAGCAGGATGTAGAGATGTCGATCATAGCAAAAGATGTAGAATACAAGAAGCCTGTTACAGATGAGGAAGCAAACGAACTCTTAAAAATAGTAAAACAAAGCGAGTATAAGAGTTCGTTTGAATTTTTTTAAGTAAAACAAAGCAAACGAACTCTTAAAAATATTAAAACAAATATGGTTTGGGCTATATGTCATACGTATATGATAAGATTAGGCTTCAAGAAGAAAATAAGAAAAAACGTTTGGCAAAGCTAGAGATGAGAGAGTTTGATCCAAGTCTAAAATTTATACCAGCATTAAATGATACATTCAAGAGTGCTGGTATAAGTTACTCATCACATGACTCTGATTCAAATGATTGTTTGTTAACGAAGATGGAGAGTTTATCAATTTCAGTTGTGGCACAATAAGCATCATTTGAAGGCAAAACAGTTTATGCATGTCCACCTAATTTTGAGCTTAACAATTGGGATGTTGTAGATCTACCCAAATGTCACAATATTTTGTTTGCTATACAATATGAATGTACTTTTTTTTATTATTTAATGAAATCTTCACATTTTCGTTTCATTCTTTCTAATTTCTTTTTCTCTCTGCTTTCTTTCTTGCGTTTTGCAAATTAAATACAAAAGTTCAAACCTTTTCATAGACAACATTCAAAAAAATGAAGATGTTAGTAATTTCAGTTCCACCCTTGATACCTTAATTTACACTATGGAATCTGACAAAGAAAGTGACAATGAAAATGTCAGAGGAATATCCTCATAACTGTTAAGAATGGTAGAAGTGAAGGGATAAAAGCCATATGGCAGCAGAAGAATTTAAAGAATCATTACTCAAATTAATTTGAGAATATTAAAATATCGGTACATTGGCAAAATTTTAGAATCAAAATTCTAAATAAATTCTAAGCATGCTTTTAAAGATAAGATACGAGAACTTACTTTCGATGATGTCTCTGAATCTACAAATCACCGAATTGGGGACACCACCCGTCGGAAACCTTCCTATCCTCTGGGATGAGATTGGTTTGTGGGAACCAATTGGGATGGGAAAAGATTTAGAGATATTTGAGAGAAACTCTTTTGCAGAGAGAAAAAACCTTAGCTTTTTTTTTCTTAATCAACTTGGCTTGAAGAAAACTGTTTTTTTCTAATATTTATAACTCTCTTTCCTTCTTGGAACAGGAAAAGGAGTATGAGAGAATTACGAGATTCTATTATCAAATCTACTAATTAATTTACAATTAATCAATTAGTTAATAATTAATTAAATCATATTTAATTAATATTCCATTCAAATCGTATTTGAATAAATATCTCTCCAATATCTTATAGGTTTATATAATTAATTAAATCATATTTAATTAATATTTCATTCAAATCTTATTTGAATGAATATTATTTTATACATTTAATTAAATCAAATTTAATTAAATAAAACTAAATTATTAATAATTCTCCAATTAATTAATAGCTAAGTTAAATATCTTATATTTAACTTAAATTAAATTTGAATCATATTCAAATATAAGCTTCTATCATAATTAATAATTTTAATATGAATCCAATTCACATTAAATTAATATTTGAACTCATTCAAATATTTTATCTCTCATAAATTAATTTCTAATCATATCCAAAATTAAATTTATATAATAAAATTTCACAAACTTTATATTATAATGTATCACTATACATTAAACTAATTCCCAATGTAAATTTGAACATTTCAAATTACAACCAATATAAATAAATCTCATTGCCCTTTACGAGCTAGGAAGGGGACCTAATGGACCTACAGATCAGACGCTACACCGATATGAGATTAATTGGCTAAACTCATTAACCACATTAATCAATATTCGTTAACTGTGTGTACACTCCACTAAAGACTCACAGCTGAACTCTTCTCACTGTAGATATATTTCTGTGTCCACGGATATAGACCAATAACAGTAAGTTAGTCATTCACGAGTGTTCGTAACACCAGCTGGGTCAATTTACCGTTTTACCCCTGTGTTACCTCTAATCCTTAAATACCAGTGCTTCTCCAATGAACAATCTGTTTTTGGTCCTACCAATAAACAGAAACCCCTCTCGTGCCATAGAGAGGGTAGGGCCCTTTGTTCAAGACCTGGAGACACCATTTAAGGGAACACTCATCAACTTACCCTAAAGTAGGGAAGGAGTGAATTCCATCTTGTGTAATTATATTTCCAGCTCCCCACTCGATCTTGTCCCCAAAACGATAAGCTTATTGAGTCAGCGATCTGGCCACTCTCACCCGTACAAATCAAAGGAAAATCCCTCGTGAACAAGAGTTCATAACACACTCAGGATTAAGACTAAGTTACCTAGGTCATCCTAGTGAAATAGAAACCTAACTAGTTAACGGAGTTACATCTAGTGGTTACTATTTCGCGGTCCGGTCTTATGCAAACTCATTGCATAGGACACCCTCACTCGCATGTCAACTACACGAACGCGTTGGATCATTGCGTTTGTATCAAATACAAAGTGAGCCGTATCAATAGTGTTACCAAGATAAGGTACCCAGCCTTATCCCTATATTATAGACCCTTTAAGCTGATCTTGAACATTGATCCATGTATGTCTCTACATACTGTTCAAGACTCATTAAACAGCTTAGGATGTTAGTTTATTGGATTTGGGTTATTAAGACAAAACTAATAATTTATCAATAACAATTATTACAATAATAACACTTTATTAATAACGGTCAATGGATTATATTTACAATCTATGAGTTTTAGGACATAAATCCCAACAAGAAGAAGCAAGAGCTAGTTGAAGTAATCAATTTGGGTTCTCAAGAGGAATTAAAAGAGGTAAAAATTGGCACATTAATGACCAGTGAAACTCGAAAAAAAAATGATCAATTTGTTATGTGAATACTCAGATATTTTTGTGTTGAGTTATCAGATTATGTCGGGATTAAATACAAATATTGTAGTACATCGTGTTCCTTTGAAGCCAGAATGCAACCCATGAAGACAAAAGCTACGTAAAATGAAGCCTGATGTATTTGATTAAAATAAAAGAGGAAGTACTGAAACAAATTGAAGCAGGGTTCCTCATAGTCTCCAAATATCCTGAATCGGTAGCAAACATTGTTCCTAAACCCAAAAAAGGTGGAAAAGTGAGAATGTGTGTAGATTATAGAGATTTAAATCGAGCCAGCCCAAAAAACAACTTCCCTCTACCTCATATCGACATGTTGGTAGATAACATTGCAGGATACTCAACTTTCTCCTTCATGGATGGTTTTTCATAATATAATCAAATCAAAATGGCCGAAGAAGATAGAGAAAAAATGACATTCATTATTGTTTGGGGAACATTCTGCTACAAAGTAATGTTTTTTGGTTTGAAAAATGTTGGGGCAACATATCAGTGAGCAATGGTTACACTTTTTCATAATATGATGCATAAGGAAATAGATGTGTATGTTGATTATATGATTGCAAAATCCAAGACAAATGAAGATCATACAACCATTTCCAAAAATTATTTGATCGGTTGAGAAAGTATCAATTGAAATTAAATCCATCCAAATGTACATTTGGGGCAACCTCGAAAAAGTTGTTGGGATTCATTGTTAGGGAAGAAGGGATCAAAGTAGACCCAAATAAAGTTAAAGCTATCATGAAAATGTCATCCCTTGAACAGAAAAAGAAGTTTGAGGTTTTCTTGGAAGATTGAACTACATATCCAGATTTATCTCGCATTTGACACCAACATGCGAACCAATCTTCAAGCTTTTACGTAAAAGTGATCGTTAATCCTATACTTGACAGTATTAGAAGGTTCTATGAACGGTGTTCTAGGACAACATGACTTATCAGGGAAGCAAGAACATGCTATTTATTATTTGAGCAAAAATTTCACTGATTATGAGTCTAAATACTTAATGTTAGAGCGAACTTGTTGTGCTTTGGTAGGGACTGCTCATCGTTTGAGGCAATATATGTTATATCATACGACATGGCTTATTTAAAAAATGGATCCAATAAAATACACTTTTTGGAAGCCATCATTATCTGGAGGATTGCAAAATGCCAAGTTTTGTTGTCAGAGTATGATATTGTTTATGTTACTAAAAAAACAATAAAGGAAAGCGCATTTGCTGATCATTAGTTGCCCAACCAGTAGCAGATTACAAGCCTATAAGGGTTGATTTCCTGGATAAAACACATTTCTAGTTGAAAAGGATGCTACAAATCATGAGACATGGATTATGCTTTTCGATGGTGCCTCAAATGAGTTGGGACATGGGATTGGAGTTATACTAATTTTCCCAAAAGGAAAGGCAGTTCCCTTAACTGCCAAGTTATGTTTTGAATGCACTCACAATATCGTTGAATATGAAGCATGTATTATGAGACTTCAAGCAGCATGCGACATGAGTATTAAAAAATTGAAAGTTTTGGGTGGCTCAATGCTAGTAATACATCAAGTCAAGGGAGAATGGGAAACAAGAGATACCAAATTGGTTCCTTAAAGCCAATAAGTTGCAAAATTATCTCAAAATTTTTTAGAAAATTTCATTTGACCCTGTTCATAGGAAAGACAATCGAATGGCAGATGCATTAGCCACCCTGCAATGATGTTTGATTTTAACCTTGAGTGTGAATTTTATCCAATCCAAATTATAAAGCGAGATCCGCCGACATATTGCATGAAGAAAATGATAATAAGCCATGGTATTTTGACATCAAGCAGTACATATAAAGTGTAGAGAATATCCCTATGGAGCTTTAGAGAATGGCAAGACAATAAGAAGGTTGGCCATGAACTTTTTCTTAAGTGGTGAAGTTCTTTTTAAAAGAAACCATGATATGGTGCTCCTTCGATGTGTTGATGTGGAAGAAGCAAAACAAATTATGACAGAAATTCATGAAGGAATGTGTGGAACACATGCTAATAGACATATGATGGCTAGACAAATATTTAGATATGGTTATTATTAGACAACGATGGAATCATACTGCATAAAATATGCAAGGGAATGTAAAAAGTGTCAAGTTTATATGGATAAGATCCACGCAGCAGCATCTCCATTGCATGTCTTGGCAGCACCTTGGTTGTTTTCTTTGTGGAGAATGAATGTTATTGGACCCATTGATCCTAAAGCTTCAAATGGTTATCGTTTTATTCTTGTGGCCATTGATTACTTCACTAAATGGATAGAGGCAGCATCTTACTACAACGTTACAAGAGGAGTGGTGCTCAAGTTTATCAAGAAAGAGTTGGTCTGTCGTTATGGTCTCCCAAAGGGTATTATTACATATAATGCCAAGAACCTTAATAACAAAATGATGGACGAAATTTGTGAGCTATTCAAGATCAATCACAGAAATTTGACTCCATATCGCCCCAAGATGAATGGGGCAATTGAAGCAACCAACAAAAATATTAAAAGAATCATTGAGAAGATGAAAACAACGTATAGAGATTTACATGAAATTTTACCGTTTGCACTCTTCAACAGGGGCAACACCATTTTCTTTGGTTTATGGTATGGAAGTTGTCCTACCTTTAGAAGTTGAGATACCTTCGTTGAGAGTTCTCATGGAAGCTAATGGATACGAGGTCGTTATGAGTAGTTGAATTTCGTTGAAGAAAAACGATTGGCAACATTAAGTCATGGACAACTTTACCAAAGAAGACTAATGTGAGCATACAATAAGAAAATGCACCCTCGAAGTTTTTGAAAAGGAGATTTTGTGTTAAAAATGATACTTCCATTTCAAAAGGATCATCGAGGAAAATGGACTCCTAATTATGAAGGTCCGTTCGGAGTGAAAAAGGCTTTTCAAGAGGAGTTTTGCTTTTAACCAATATGGATGGCGTTGAGTTAAAGAATCCTATGAATTCAGATTATGTTCGAAGATATTATGCGTGAGGCCTTTTCTTAGAAAGTTTCGTAATGTTGAAAGAAGTCAGGGGTATTTCTAAGAAACTTACATTTAGTTCTTAGAATGTTATTTCTATATGTGTAAACTTCCCCTTACATCCTCTCTTTGTACTCCTATTCCAAGAATCGTTGTTATCTTTTCATTTCTTATTATATTTATCAAACATTTATTTATTTGTCAATCCTTTCTTTCATCCTTTTTAAATGTCATCAAATTATAAGTAAACATCCATTATCCATATGTCTATTATTCTATCTATCGTTAATTAACAAAAATTCAGTACAATTTATTGTATTTGCAAACCAAGTAAAAATTATGATTTTCTCTTGTTAAGACTTAAGGAAATATAGGTGTTTACAAGTCCAGAAAAATTTCTCTATGATTGTGAGTTACTGCAATAGGAAGTCTACATCCTAAAGGATTCATGTTTTGTGAAGTTCAAGACTAAAAAAACATTAGGTCTTTACAACATAGTGCAGTCTTTCACAGTTTGGATTCGTTTGTTTGATATGAAGAGTGGCGAGTTTGACAGCTTAAAAAGTTCTTGTGATACATATGCCTTATTATATCCTATGACCATGTTGATCAATTGTTAATGGTATTGGTAAAATGGAGTAGTGAGCTATTTATTTCTTTGAAAAGCACAATATTTTCAAATTAAATTTCAACTCGATTTAAAAGAAGGATGGGTCCACTTTATCAAACCTAGTCAACCTAGTCAACCTAGTCCTAGAGGTAAGGTTGATGTCTTTTCAATTTGGTTAAGTCGAGCCCAGATATGAAGCTAAGGTCGAAACTTGTCTAACAGGTTCTTTATTTATCAAACATAGTCCTAGAGCTAAGGTTGATGTTTTTACAAATTTGGTTAAATCGAGCCCAGATCTGAAGTTGAGTTCGAAACCTGTCTAACAGGTCCTTTATTTATCAAACCTAACCCTAGAGCTAAGGTTGATGTCTTTTCAATTTTGGTTAAATCGAGCCCAAATATGAAGTTGAGGTCAAAAGCATGTCTAATAGGTCCTTTATTTATCAAACCTAGCCATAAAGTTAAGGTTGATGTTTTTCAAATTTGGTTAAGTTGAGCCAAGATCTGAAGCTGAGGCCGAAACCTGTTTATGAGGTCCATTATTTATCAAAATTAGCTCTAGGGATAAGGTTGATATCCTTACTCATTTGATAGTTTTTTCTCTCTAAAATTTTTTTTAAAAAAATCAGCCCAAATACTTTATCTATTTCTTTTGTTCACCTTTAAATATTTTCAGGATGGACAAAGGGGGCAAGCTGTAGATACCTTATTGTGTCCTTCATTTTTTATTTTTTATATTCATTTATTATTTTTTATTATTTATGTGTATTTTATTTTTATAAAAGAATTTGAATTTGAATTTAAATTTAATAAAAGAAAAAACAAAATTAAAATTTAAAAATAAATAAAATAGAATAAAAAGCGTTTTCCCTACTTTTTCTCAACCATTCCCACTATTTTTGTTTTTTCCCTTTCCCTCATCCACGCTCCTACTCTCTCATTCTCCCACTCTCCCACTCTCCCACTCTCCCACTCTCCCACTCTCCCACTCTCCCACTTTTCCATTTTCTTTTCGATACTCTACCTCCATTTCACTCTACTGATCTAAAAATAAAACGATCAATTTTGTCCTATAAAAAGTGAAATGAGAGATTCTTGAGGATAATTAAAAGAGGTTGAGATTTTTTTTAAGGAAAGGGTACAGTTTTTATTAAAAAAAAAAAAATTCATCCACTTGTCGTTTGGAAAGATTTTATTGGATTTAAGCGAGGTCAGTTTGCTATGTAGGTGTTTTTTATTTAGGCCAATTTAATTATTTCGTGTTTTTAATTTCATGCTTCATGGTTCACTGTGTTGAATTATTTGGCACTCATTTGACATGCTGCTTTATTTTATTTGACATTTTGCAAACATTTTTTATTTGGCTGGCATTCTGCTTTATTTATTTGGTCTCATGCAAATAACATACTGCTTTATTTATTTGACATAGCATGAGTTATTTATTATTTGACATCATTTATTATTTATTTGGCATGTTATTATTATTTTTGCATCGTGCTTTATTTTGTATGCTTATATTATTTCTGTATTTATTTTATCTCTATATTTATTTTATATCACTGTATTATTTTATCTCAATCTCTTTGTATTATTTTATGTCACTGTATTATTTTATCCCAATCTCTTTGTATTATTTTATGTCACTGTATTATTTTATCCCAATCTCTTTGTATTATTTTATGTCACTGTATTATTTTATCTCAATCTCTTTGTATTATTTTATGTCATTCTATTATTTTATCTCATCATATTACTTTTTTTATTTCAAAGTCTCCTTACAACTATTTCACAACCTATATAACTATTTCACAACCAGTTTATTTATCAAATTTTACACTATTATGTCATTATTATTATCATTTATTTTCTCATTTGAATATAAAAAATTTAAATAAAAAAATATGTATTGTTTTTATGGGTTTTATAATTTAAATTTCATGAATACATGAAATTTCTCAATTTGAATTTTTATTAATAAATTATCTTTAAATTTATAAACAATTTCATATTTTAAATGAAACTCAATAGTGCTTTCTATGAATTTATTAATTCCTAAACACGTGAAGTTTCTCATTAAACAACTATGATAGAGATTAAAATATCAATATTTTGTATTTTAATATTCTTGAAGTAAATAATAATTATAAAAAAATCTTTTGAAATAAAAAATAAATTTCATTTTTCTAATGACTCTTATGTCATTCATAATTATCAATTCATGCTTAAGTTTTATTTCATTCGATGTGAATTGAGAACCATGTAACATTTAAAAATAAAAAATAAAAAACTATTTTGTGAACTTCTTTAATTTAGTTTAAAAAATAAAATTAGGTAACCATTTTATGAGTGTCGTAAAGTGCTAATACCTGGTTTTGTGAACTCTCCAAACCTATTTATTTATTTCTTTTAAAATAAAATTAATGAAGACGTCCTTGCGAGAGCAGCCACAACAAGTAGTTTGGCTTATTAATCACTTTGAGAAATTTTTCAACGTTTATTATCAGTATTATTATTTGAAAATTTGCTAACACTATTGGTCATTTTGCCAAGAGCTCTAACACATTGACTATATGTACTTCAAGACAATAGGTCTTAGATTCAAATCCCCCATTCTTAATTTTTACTAAAAAGAAAACGCCAACCATTTTATTATTAATGATAATTAATAATAAATTACACTTGAATATCAAAAACCTAATTTAAATTTGAAAAAGTATTTTAAATGATAGAACTTTTTGAAATTATTTACAAATATAACAATATTTTGTCGGTGAAAACCACGATAGACCAAGATAACTATTTATCACACAATTAGTAGTCAATCATATAGAGAGTAATATTTGCTATATTTTGTAAATATTTTGGTTCATTTTGTTATAAGTGAAAATAACCTCTTTAAAAAAAACTAACAAAATATGCAAATGGAATTGGACCTATAGTTCAAGAGACCAAAAACAACTTACCTAGATAATTCAATAACTAAAATATAATTTACAGTTTCAGAATATCTAAAAATTATATAAATGTAAAATTTAATCTACTAATTTAAACAATCCACATTTTATCAACAAAAGAAAATAAAAATTCAAAATGATATATGAAATGAGAATAAAACTTCAGGGGTAAGCGGGAGGGGAGGAAATAATTTTCCCATAATGTAAAATGTCCATTTATACAACAAAATCAATAAAATCAAAGCAACTCCCAGCAAGAATTCTTCTCTCAAATATTATGCCTTTCCACTGCATTTATTTGTTCAAACAGATCGCTTGGATAAACAATGAGAATGAAACAAACACGTTGTTCGATATGAATTCAGGGGATTTCGAAAGTACTCTTCTTGCTGGCAAATTGGCAACATTTTCAGTAATAACTTTGCTTTGTTTGACCAGACTGAAAGTACATCCGCAAGATCAATACTAACCTGAGAAGAAAAGAAAAGACTATGCGAAGTAATTGTGATCCGATTTTACCTATAGTATAGTGCTAATTCTCCACTTATGATACATAAACATGAAAAGGAATACGAGATAGAGTCTCCTTACTGAAAGTGAAGTGCACAATGGTTTCAACGAAAGATTTCAGATTAATACAGGCCCAACATCCTTAGCTCCATAACATGACTGGGGTCTAATTCAGGAACGTTCTTGTTATGTTCTCGATCTTCAACTATATATTCCTGAAACAAATAAAGCTATGTGAATTGAGCAAGTAGGGAAAGATTTGGGAAGACTAAAGGAAACCAACAGTTGCGTTACAATGGAGAAATGAGATGAGGGAAAATATAATTAAATTTGCAATATTTTTCTCTTTAAATTGTTACTCGACCTTTGTCAGTTCTTGCTTAGCGAGAACATGGTAGTGGCGTTCATTTATCCTCTGCCCGCATATAATGGCTATGAAGAAACCATAGAGTAAGCCAACCAAGATGATAGCCAAGACTGCAGAAAAGAAATCGTAGAACGAAGAATAAAAAAGACAGATAGTACAGCAACTACGGTTTTTTTCTAATGCCGATACTTAATTTCATTCAAGGATTCATTCAACTGCATATCTTAATATCTTAAATTCTTCTAATCAAATGAAACCAACAAATTGTGAGAATATTCACTTACACTAATAAATCCAATACAGATATAAAGAATAAAAAAAAGGAGTTAAGTCGGAAGCCTCAGTATACCAGCCATGGCATAGAACCCGTATGGGTGTTCTTCATAGCCAAACATTTCCCTTAGTTCCTCTCCGTAAAATTTATACACCAACACCCCCAAGAATGCCACAATCTGTTCAAAGCACATGGATAATAATGAACATTCAGCCTAACTTGTATTAAACACCTTGACACAGGTGAATGAGCCAATTGAACACATAACTAAGCTAACGTACCAGTTGAACAATGATGAAAATGAACGCATGGTCCCTAGCAACAAGAAATTGAAACTTCAGTCTCAACCACCAGCGATCAGGAGGGACATTTGCACGTAACACAAACATAGCTCTGCACTCTGTACAATGAGCAAAAGCAAAGCCCTCCTGACAGAATTACTGAAATCGATTAGATGTCAAAACCTTGAAAACTTGATCATTCTTGAATTAGACAAGTTCGTAAAGGCATATGAAAGCAAATGTATTAGATTGATAACGGTAAGTAAAAGTTTCACGTTTGAATAGCAGTTGATACCACAGTCCATTACCCAGGAAAAGTGTGAATTAAGAATAGAGAAGCAAACACGAAGTGAGGGGAAACTGACCTTTGTGGACCTCCAATTATCAAGACAAGATCTATGGACATACTTTTGAGTGCCTTTGCAATGGCATGGAGCAATTAAATCTTCTCCTGAAACCAAAATCATATTCATACTAGAAAAGGTAAACCAAATCTTTGGAAGAAGAAAAACAAGATATGCGTGTGTACTGAGTCTTAACCCACAAGTGAGCCTAAGGATAGCATTCTTATTTGTTAAATAAATACATGAGAAATGTGTCTTAACTGATGTTGATGAAATAGGATTACACATTGAACAGAAAGGAATGTTAAAAACCTCCAGTGTCAAGGCATATACGACATTGTGGTTGGTCATTCACCAGATGACAAGTTTCATCAATGTTAAGATTGAGTAAATCATCATCACTAACAGAGCTCTCTCTCCCTGTAGCTATAATCTCAGCAATCTCAGCAGAGGATGACGACCCCACAGACTGTGGCAGAAAATCCAATTGACTTAAAATGGGCGCACGTTCCAGAATATCATCTTTATTCTCACCATGATTTGGTACCAGTTGCATTGTGTTAAGCCTCGCTCTTGTGTCAAACTTGTAGACTCATCGCCTGAGCTATTGTTCAAGTTGACCATTTACATGGAAACTAACGAGCCTGCATAATAGGGGAAAGTAAAACAATCAAGCTGGCACAGAATCACATACACAGAATATCAAAATGTACATATTTAACCAACAGATAATAATTCATTCTGTTCTCACAACTTCCACATCTCAATGCTATAGACCAATATTAGAAGACGCAATAACAATTTGTATATATTCCAAATCTAGGTGGAGGTGACCTTATATGCAACCTTAACCTTTTATTTTAACAGTAAATAATATACATTATTGTTTTCGGATAAAAGTTTATCGCTGCAAACCCTATACTGTTAATGTTAATAGTTGAGTCTGTTCTCATCAAGTGCTTGAGATTCCTTGGGGCGCCATGGCCACAACAATTCACAGGAATAGAAGTGAACTCCTGGTACCTAGCCTACCAACGAGAAATTGGTTTGGGCTGGTTAGTATTTTAGACCCCCTATTATTCACACAGACAAACGTAGGGGCAGAAAAGGTAATAACCAGGGGGTAGCTTAACCTATTAGGTTTATGGAACATTCTGATGTGGGACCCACGGGAAGGTATGGGAGTATATAAGGGAATGTTTTTGGGATTGCTGAAGGTATCATCATTGAGAAAAGAATAGCAGCAGCTTTTGGAGGAGAAGGCCAGCATTCTCGAATTAGCTGGAGTGTTATCCTTTTCTTTACTATTTTCTTCTTGTTGATATTGTCTAGTGTGCTTTTTTCGTTGTTAGAGGATAGTATATGAACCTTGTCATCTATTATAACTATATCTATATATAAATACAGAGTACTGTCTCTGTTTCTCTATTGTATTGTTGGGATTTTACTGAGTGACAGGTGTTGGGTCCTTACATAGTACTTCCTGCCTTCTGGTGTTGAAATCTTGCCAATTCATCAATCCTTCAGAGCTGTGGCAAGCTTGGTTGACAAGAGTGTAAGAAGTGTCTTCCTGCCAATTGCCACAACCTCTATAACTTCCTTTTTTCATGCAATTCGGAACTCCATCAACTTTCTAAAATCCAAACTTATTCTTTTAGGAGTGATATAACCAACTACCAAGCTTACCGATCAGCCAATCTTATCCATCAAAGTAGAAAATGCTCCAAAATTCTATTCTCCAGTCAGGAGTAGTTCTTAGATCCTTAAATTTCCATAACGCAATGAGAATTAGGTCCATGTAAAAATGTAAATGATAGACCTCAAAAACAATTAGCTGCAGCAACATTAGGCTCTTGATAGTAATATTGAATCTTACATTATTAGCCTCAGTGAGGTGGTTTAGTTTCTTATGGCAGTTAATCTACAAGTACAAATAATGTTTGTCAGAGAAACTTGACAGAACAGGCTGTAATTTTTTTTTTTTTTTTTTGAACCCATTAAGAAAATGGAATGAAACTAAAGCTCAAAGTACAAAGAAAACAAAAAGCATAAACGGAGTACAAAAGAAAGCAATCCGAATAAAAAAGACATAAAGGGTTTAAACAAAGAACTGGCTTGGCCATCAAAAACTAAGAATAACAGCAAGACTATGAAGACCATGAAGAGAATCTAAAGGAAAAGCTTCATATGGTCATTTGTAGCTGCTGCTGCCGCCAAGGAGGAACAAAAACAGCAGAAAGAATTCTGATTCATGGAGAGAAGAATTCTGATCCATGTATAAGAATGACAATCTACAACATCAAAATTACAATCTCAAAATGTTCTATAACATTAAACCACGGCACTCGATCAACTAGAAATATGTTTGCACATCTCTGCCATCATAGTTTGGAATCTATCCCCAAAACGGTTTTCTAATTCGTTAATTCTATCCTGGAGTGTTTGTACCTCAGTTTTTCTTTGTCGTTTAGATTCGTGAAGATGTTTTTCAGACTCATTAAGGCGACGCTCAAGATAGCATACCTTAGATGATGATCCAAACAACTCAGAAGGGGTAACACCTACCCCATAACCACGAACACGTCCATGTTTCTCCGGTCCAACATTCCAAGGTACAGAATGATAACTTTCTGCACCACCCCAAAGGGTATGGAAAGATTAAGTGAAACTTAAAGGAAATAAACACAAAAATACTTTTTATTCCAAAATACTGCAGTTTAAACATGACCAGAGATTATGCAAACTTCCTAGTACATTGATTAAGGAAAATAATGAGGCATGATTCAGTTAGTCAAAGATGCACTATATTAGCAGCAGTTCTTATCAAATACATATTTTTTGAGATATGATTTCTAAGTTAGACGGAAAAGAACCATTGAGTAATTTTCATGACTGAATAATTTTCTTCAGTAAGATTCATAACATGTGGAATGGAATAGCAATTACGAGATTTTGATTGATATATTGAAATGGAATAGCGATTAGGAGACAACAAGACAACAAAAACCCAACCCAAACATGCCAGAAAATGGATACAAAAGCGACTAACCCGAATCTGAACTGGAAACTAAAATAGAAAATTGAAAATGAAACCTAACAAGCAGACAACCACTAACAAATACAACCACAGGAATAGCAATCGAAACACAAAACAATACATTAAAACAAAATTTCATCTCCAAAAACTGCTGAAGGAATGATCGAACAAGGCCAGAGAGGATTCATGATCTAAACTTCAAGGATCCAAATGTCTTGCCGAAAAATGAACAGCTTCAAAGCAAAATCAAATCACAATCTTTAACAAGGGATCTTAAATGAACTGGAAATAAATGTATTGCATAAACTATTATGAAGCGAGCCTACGTATAAAAAAACCACCATCGCATTGAAGATATGATTCAATAAGATATATAAAGTGAACTCGTACAGATCTGAATAATTAATAATATTTAATCTTGCATTCAAACTATGATTTGTAAAGAACAGAGCATACATTATTCTGAGTGAGTTGTAATGAACAGCAGTCTTCTATTGGAGGATCGAGATAGCTCTATATTCATATAAAGAGTTTGATGCATAAACACCCCCAATAGCTTTTGATTCCAGAGACACCGTTCCGCACGAAGCAAAGTGAATTAAAAAAGCAGTAAACATCAATCCTCAAACATACCTGGAATGTATCATATAGTACGTACTTCTAAACACAAGAAACTACCACTCGGAGCGGTAGTAAGGGCGGTGGAATGCAGAATCAAATATGGAATCTACACAAGCGAGTAGGGGAGTTGCGACGAGCAAGGAGGGAGGCGAGTTGCGGCGAGCCCGGCGAGAGCTTCAGATGCGTGGAGAAAGGGAAATCGAGAGAGCTTGAAAATTGGGGGAAGAAAAGGGAAAATTGGAAGCATAAACCGACCCGATTATTTGGTTAACCCGAGAATCCAATAGTATCACCTCACCAGACAAAAAAAGAAAAAAAAAATCATAAGCAAAAAAGACCTACCTCATCTCAACATTGCTATTTATTATATTTGGAAAATGGTATAGTTGGAAAAATAGGAATTCCATTTTTTAAGAAAAGGTTAAAACTAACCAAACCATTTTATTTTGATTTTCTTCTTAAATAACATTTTTTTATATATAAAATATTTTAATAAAAAAGTGAAAAAACTTTATTAATAAAAAAGTACAACTTTTTTTTTGGTTAAGGTTGTTAATATTGTTCGATAAAAAGTATAAGTTATTAATATCCTTCGATAAAAAGTATAGTTT

mRNA sequence

ATATATATGACAATATCCTTCCGTACCACAACCGTTCCTTTTAGGATAAAATAAAATTAGATTATAGCAATTCCCATTTCCAATCCTCAACCACAGTTCTTCTGGGCTCTCTATCAGCTTCTCCATTTTTCCATTTATCTCCAAATTCCTTATGTCTCTCTCGCCTCTATCCAAACCCAAATCCCTAACCCTAACCCTAAATCTCCTCCCTCCTCACCTCTTCCGCCGTCCTTTCTCCTCCCCTTCCGATCACGATCTCACCGACCTCCCTGAATCTCCTTCTTCTTCTGATCCCCTTCTTCGCAACCTTGAGGACGCTATTCAACGCATCCTCGTTCACCGATCTGCCCCGATTGGCTTTCCTTCGTTCTCGGTGCTTCCTATCGGGTTCCTCTGCCTTCTAATTCCCATTTACCTCTCATTGCCAACGTTCTTCGTAACTTAGCTAACCCTCTCTCTCCTGAATAATCCTTGTCTACCACTACCATCTGTGGCTGGCTCTCTTCTCATTACTTCATTCAAGGTACGCATCTTCCTTCTCTTGATCCGGTGGTTGACACCACGTTGACTGAATGTGATGCTTCCCTTGACCACGAGGAAGGATGAGAATCCGATTTGGGTATCCTAGAGATAATGAAAGTAATACAAATGATGGCATTTTCTTCTACTACTCTTTGCAGCTCGCCTTGCCTTGTAAACCTCATGTATAGAAGAAGACAACTGAATAACAAGCTAAGCAGAATGGCCGAGTAAGGGGTAGAAGCTTGTACAGATGTGAACTGGTCGGTGAGTGCTATTTATAAAATGGCCCTTCGTCGATCATTTAGAAGCTTCTGCTCTTAAACGTTGTTAAATGTTTTTCCTTTTTAGTAACTTATTTCTTTCAACTTTTTGCATATGTATTTTGTTCTCTTGTTGATACCAAGATAACTATTTATCACACAATTAGTAGGTTT

Coding sequence (CDS)

ATATATATGACAATATCCTTCCGTACCACAACCGTTCCTTTTAGGATAAAATAAAATTAGATTATAGCAATTCCCATTTCCAATCCTCAACCACAGTTCTTCTGGGCTCTCTATCAGCTTCTCCATTTTTCCATTTATCTCCAAATTCCTTATGTCTCTCTCGCCTCTATCCAAACCCAAATCCCTAACCCTAACCCTAAATCTCCTCCCTCCTCACCTCTTCCGCCGTCCTTTCTCCTCCCCTTCCGATCACGATCTCACCGACCTCCCTGAATCTCCTTCTTCTTCTGATCCCCTTCTTCGCAACCTTGAGGACGCTATTCAACGCATCCTCGTTCACCGATCTGCCCCGATTGGCTTTCCTTCGTTCTCGGTGCTTCCTATCGGGTTCCTCTGCCTTCTAATTCCCATTTACCTCTCATTGCCAACGTTCTTCGTAACTTAGCTAACCCTCTCTCTCCTGAATAATCCTTGTCTACCACTACCATCTGTGGCTGGCTCTCTTCTCATTACTTCATTCAAGGTACGCATCTTCCTTCTCTTGATCCGGTGGTTGACACCACGTTGACTGAATGTGATGCTTCCCTTGACCACGAGGAAGGATGAGAATCCGATTTGGGTATCCTAGAGATAATGAAAGTAATACAAATGATGGCATTTTCTTCTACTACTCTTTGCAGCTCGCCTTGCCTTGTAAACCTCATGTATAGAAGAAGACAACTGAATAACAAGCTAAGCAGAATGGCCGAGTAAGGGGTAGAAGCTTGTACAGATGTGAACTGGTCGGTGAGTGCTATTTATAAAATGGCCCTTCGTCGATCATTTAGAAGCTTCTGCTCTTAAACGTTGTTAAATGTTTTTCCTTTTTAGTAACTTATTTCTTTCAACTTTTTGCATATGTATTTTGTTCTCTTGTTGATACCAAGATAACTATTTATCACACAATTAGTAGGTTT

Protein sequence

IYMTISFRTTTVPFRIK*N*IIAIPISNPQPQFFWALYQLLHFSIYLQIPYVSLASIQTQIPNPNPKSPPSSPLPPSFLLPFRSRSHRPP*ISFFF*SPSSQP*GRYSTHPRSPICPDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGTHLPSLDPVVDTTLTECDASLDHEEG*ESDLGILEIMKVIQMMAFSSTTLCSSPCLVNLMYRRRQLNNKLSRMAE*GVEACTDVNWSVSAIYKMALRRSFRSFCS*TLLNVFPF**LISFNFLHMYFVLLLIPR*LFITQLVGX
Homology
BLAST of Chy4G074980 vs. ExPASy TrEMBL
Match: A0A0A0K9L9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G419630 PE=4 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 2.0e-34
Identity = 75/85 (88.24%), Postives = 76/85 (89.41%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPSNSHLP IANVLRNLANPLSPE SLSTTT+ GW SSHYFIQGT
Sbjct: 67  PDWLPFVPGASYWVPLPSNSHLPPIANVLRNLANPLSPEQSLSTTTVRGWPSSHYFIQGT 126

Query: 177 HLPSLDPVVDTTLTECDASLDHEEG 202
           HLPSLDP VDTT TECDASLDHEEG
Sbjct: 127 HLPSLDPEVDTTSTECDASLDHEEG 151

BLAST of Chy4G074980 vs. ExPASy TrEMBL
Match: A0A1S3CC43 (uncharacterized protein LOC103499150 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499150 PE=4 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 7.2e-32
Identity = 72/85 (84.71%), Postives = 74/85 (87.06%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPSNSHLP IA+VLRNLANPLS E SLSTTT+ GW SSHYFIQGT
Sbjct: 67  PDWLPFVPGASYWVPLPSNSHLPPIASVLRNLANPLSHEQSLSTTTVRGWPSSHYFIQGT 126

Query: 177 HLPSLDPVVDTTLTECDASLDHEEG 202
           HLPSLDP VDTT TECDASLD EEG
Sbjct: 127 HLPSLDPEVDTTSTECDASLDREEG 151

BLAST of Chy4G074980 vs. ExPASy TrEMBL
Match: A0A5A7VD84 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold606G00450 PE=4 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 7.2e-32
Identity = 72/85 (84.71%), Postives = 74/85 (87.06%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPSNSHLP IA+VLRNLANPLS E SLSTTT+ GW SSHYFIQGT
Sbjct: 67  PDWLPFVPGASYWVPLPSNSHLPPIASVLRNLANPLSHEQSLSTTTVRGWPSSHYFIQGT 126

Query: 177 HLPSLDPVVDTTLTECDASLDHEEG 202
           HLPSLDP VDTT TECDASLD EEG
Sbjct: 127 HLPSLDPEVDTTSTECDASLDREEG 151

BLAST of Chy4G074980 vs. ExPASy TrEMBL
Match: A0A6J1GVG9 (uncharacterized protein LOC111457911 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111457911 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 1.0e-25
Identity = 62/86 (72.09%), Postives = 68/86 (79.07%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPS SHLP IAN+L N ANPLSPE SLSTTT+ GW SSH+FIQG 
Sbjct: 73  PDWLPFVPGASYWVPLPSTSHLPPIANLLHNFANPLSPEQSLSTTTVRGWPSSHFFIQGA 132

Query: 177 HLPSLDPVVDTTLTECDAS--LDHEE 201
           HLPSL+P V+T   EC+AS   DHEE
Sbjct: 133 HLPSLEPEVETNSAECEASQHSDHEE 158

BLAST of Chy4G074980 vs. ExPASy TrEMBL
Match: A0A6J1J031 (uncharacterized protein LOC111480015 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111480015 PE=4 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 2.9e-25
Identity = 61/85 (71.76%), Postives = 68/85 (80.00%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPS SHLP IAN+L N ANPLSPE SLSTTT+ GW SSH+FIQG+
Sbjct: 77  PDWLPFVPGASYWVPLPSTSHLPPIANLLHNFANPLSPEQSLSTTTVRGWPSSHFFIQGS 136

Query: 177 HLPSLDPVVDTTLTECDAS--LDHE 200
           HLPSL+P V+T   EC+AS   DHE
Sbjct: 137 HLPSLEPEVETNSAECEASQHSDHE 161

BLAST of Chy4G074980 vs. NCBI nr
Match: XP_004135962.1 (uncharacterized protein LOC101214133 [Cucumis sativus])

HSP 1 Score: 154 bits (388), Expect = 2.39e-42
Identity = 75/85 (88.24%), Postives = 76/85 (89.41%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPSNSHLP IANVLRNLANPLSPE SLSTTT+ GW SSHYFIQGT
Sbjct: 67  PDWLPFVPGASYWVPLPSNSHLPPIANVLRNLANPLSPEQSLSTTTVRGWPSSHYFIQGT 126

Query: 177 HLPSLDPVVDTTLTECDASLDHEEG 201
           HLPSLDP VDTT TECDASLDHEEG
Sbjct: 127 HLPSLDPEVDTTSTECDASLDHEEG 151

BLAST of Chy4G074980 vs. NCBI nr
Match: KAE8646378.1 (hypothetical protein Csa_015587 [Cucumis sativus])

HSP 1 Score: 154 bits (388), Expect = 5.63e-42
Identity = 75/85 (88.24%), Postives = 76/85 (89.41%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPSNSHLP IANVLRNLANPLSPE SLSTTT+ GW SSHYFIQGT
Sbjct: 67  PDWLPFVPGASYWVPLPSNSHLPPIANVLRNLANPLSPEQSLSTTTVRGWPSSHYFIQGT 126

Query: 177 HLPSLDPVVDTTLTECDASLDHEEG 201
           HLPSLDP VDTT TECDASLDHEEG
Sbjct: 127 HLPSLDPEVDTTSTECDASLDHEEG 151

BLAST of Chy4G074980 vs. NCBI nr
Match: XP_008460271.1 (PREDICTED: uncharacterized protein LOC103499150 isoform X1 [Cucumis melo] >XP_008460272.1 PREDICTED: uncharacterized protein LOC103499150 isoform X1 [Cucumis melo] >XP_008460273.1 PREDICTED: uncharacterized protein LOC103499150 isoform X1 [Cucumis melo] >KAA0065598.1 uncharacterized protein E6C27_scaffold90G00420 [Cucumis melo var. makuwa] >TYK07168.1 uncharacterized protein E5676_scaffold606G00450 [Cucumis melo var. makuwa])

HSP 1 Score: 145 bits (366), Expect = 4.61e-39
Identity = 72/85 (84.71%), Postives = 74/85 (87.06%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQGT 176
           PDWL FV GASY VPLPSNSHLP IA+VLRNLANPLS E SLSTTT+ GW SSHYFIQGT
Sbjct: 67  PDWLPFVPGASYWVPLPSNSHLPPIASVLRNLANPLSHEQSLSTTTVRGWPSSHYFIQGT 126

Query: 177 HLPSLDPVVDTTLTECDASLDHEEG 201
           HLPSLDP VDTT TECDASLD EEG
Sbjct: 127 HLPSLDPEVDTTSTECDASLDREEG 151

BLAST of Chy4G074980 vs. NCBI nr
Match: KAG7018211.1 (hypothetical protein SDJN02_20079, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 125 bits (314), Expect = 3.09e-31
Identity = 69/110 (62.73%), Postives = 75/110 (68.18%), Query Frame = 0

Query: 98  SPSSQP*GRYSTHPRSPI-----CPDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPL 157
           SPSS P  R        I      PDWL FV GASY VPLPS SHLP IAN+L N ANPL
Sbjct: 49  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLPSTSHLPPIANLLHNFANPL 108

Query: 158 SPE*SLSTTTICGWLSSHYFIQGTHLPSLDPVVDTTLTECDASL--DHEE 200
           SPE SLSTTT+ GW SSH+FIQG HLPSL+P V+T   EC+AS   DHEE
Sbjct: 109 SPEQSLSTTTVRGWPSSHFFIQGAHLPSLEPEVETNSAECEASQHSDHEE 158

BLAST of Chy4G074980 vs. NCBI nr
Match: XP_022956127.1 (uncharacterized protein LOC111457911 isoform X2 [Cucurbita moschata])

HSP 1 Score: 125 bits (314), Expect = 3.09e-31
Identity = 69/110 (62.73%), Postives = 75/110 (68.18%), Query Frame = 0

Query: 98  SPSSQP*GRYSTHPRSPI-----CPDWLSFVLGASYRVPLPSNSHLPLIANVLRNLANPL 157
           SPSS P  R        I      PDWL FV GASY VPLPS SHLP IAN+L N ANPL
Sbjct: 49  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLPSTSHLPPIANLLHNFANPL 108

Query: 158 SPE*SLSTTTICGWLSSHYFIQGTHLPSLDPVVDTTLTECDASL--DHEE 200
           SPE SLSTTT+ GW SSH+FIQG HLPSL+P V+T   EC+AS   DHEE
Sbjct: 109 SPEQSLSTTTVRGWPSSHFFIQGAHLPSLEPEVETNSAECEASQHSDHEE 158

BLAST of Chy4G074980 vs. TAIR 10
Match: AT1G16840.4 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78890.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 5.3e-11
Identity = 39/84 (46.43%), Postives = 48/84 (57.14%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLP-SNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQG 176
           PDWL FV GASY VP P S S    IA ++  LANPL+ E SLST +  GW SS YF++G
Sbjct: 77  PDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESLSTNSSHGWPSSDYFLKG 136

Query: 177 THLPSLDPVVDTTLTECDASLDHE 200
                ++   +TT      S D E
Sbjct: 137 VQPQLMETKTETTSNSESHSEDEE 160

BLAST of Chy4G074980 vs. TAIR 10
Match: AT1G16840.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78890.1); Has 71 Blast hits to 71 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 5.3e-11
Identity = 39/84 (46.43%), Postives = 48/84 (57.14%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLP-SNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQG 176
           PDWL FV GASY VP P S S    IA ++  LANPL+ E SLST +  GW SS YF++G
Sbjct: 77  PDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESLSTNSSHGWPSSDYFLKG 136

Query: 177 THLPSLDPVVDTTLTECDASLDHE 200
                ++   +TT      S D E
Sbjct: 137 VQPQLMETKTETTSNSESHSEDEE 160

BLAST of Chy4G074980 vs. TAIR 10
Match: AT1G16840.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78890.1); Has 71 Blast hits to 71 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 5.3e-11
Identity = 39/84 (46.43%), Postives = 48/84 (57.14%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLP-SNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQG 176
           PDWL FV GASY VP P S S    IA ++  LANPL+ E SLST +  GW SS YF++G
Sbjct: 77  PDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESLSTNSSHGWPSSDYFLKG 136

Query: 177 THLPSLDPVVDTTLTECDASLDHE 200
                ++   +TT      S D E
Sbjct: 137 VQPQLMETKTETTSNSESHSEDEE 160

BLAST of Chy4G074980 vs. TAIR 10
Match: AT1G16840.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78890.1); Has 71 Blast hits to 71 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 62.4 bits (150), Expect = 7.7e-10
Identity = 34/62 (54.84%), Postives = 42/62 (67.74%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLP-SNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQG 176
           PDWL FV GASY VP P S S    IA ++  LANPL+ E SLST +  GW SS YF++G
Sbjct: 77  PDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESLSTNSSHGWPSSDYFLKG 136

Query: 177 TH 178
           ++
Sbjct: 137 SY 138

BLAST of Chy4G074980 vs. TAIR 10
Match: AT1G78890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16840.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 61.2 bits (147), Expect = 1.7e-09
Identity = 38/85 (44.71%), Postives = 50/85 (58.82%), Query Frame = 0

Query: 117 PDWLSFVLGASYRVPLP-SNSHLPLIANVLRNLANPLSPE*SLSTTTICGWLSSHYFIQG 176
           PDWL FV GAS+ VP P S SH   IA ++  LANP+S E S+S +++ GW  S YFI+G
Sbjct: 76  PDWLPFVPGASFWVPPPRSQSH--GIAKLVEKLANPISDEESISISSVRGWPCSDYFIKG 135

Query: 177 THLPSLDPVVDTTLTECDASLDHEE 201
               S    V+T +T   A    +E
Sbjct: 136 VKPQS----VETEMTSNTAYHSEDE 154

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0K9L92.0e-3488.24Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G419630 PE=4 SV=1[more]
A0A1S3CC437.2e-3284.71uncharacterized protein LOC103499150 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5A7VD847.2e-3284.71Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1GVG91.0e-2572.09uncharacterized protein LOC111457911 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1J0312.9e-2571.76uncharacterized protein LOC111480015 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
XP_004135962.12.39e-4288.24uncharacterized protein LOC101214133 [Cucumis sativus][more]
KAE8646378.15.63e-4288.24hypothetical protein Csa_015587 [Cucumis sativus][more]
XP_008460271.14.61e-3984.71PREDICTED: uncharacterized protein LOC103499150 isoform X1 [Cucumis melo] >XP_00... [more]
KAG7018211.13.09e-3162.73hypothetical protein SDJN02_20079, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022956127.13.09e-3162.73uncharacterized protein LOC111457911 isoform X2 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT1G16840.45.3e-1146.43unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G16840.35.3e-1146.43unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G16840.15.3e-1146.43unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G16840.27.7e-1054.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G78890.11.7e-0944.71unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (hystrix) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33972:SF16REPETITIVE MATRIX PROTEIN 2, PUTATIVE ISOFORM 1-RELATEDcoord: 111..192
NoneNo IPR availablePANTHERPTHR33972EXPRESSED PROTEINcoord: 111..192

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Chy4G074980.1Chy4G074980.1mRNA