Sgr016824 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr016824
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: CMP/dCMP-type deaminase domain-containing protein
Location: tig00153010: 1423814 .. 1444822 (-)
RNA-Seq Expression: Sgr016824
Synteny: Sgr016824
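
The Location field above packs the contig, the coordinates and the strand into one string. As a rough illustration, the sketch below parses a string of that form and reports the span length. The names `Location` and `parse_location` and the regular expression are assumptions made for this example, not part of any database API, and the coordinates are assumed to be 1-based and inclusive.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the Location field."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Matches strings such as "tig00153010: 1423814 .. 1444822 (-)".
_PATTERN = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse 'contig: start .. end (strand)' into a Location."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153010: 1423814 .. 1444822 (-)")
print(loc.contig, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the interval covers 21009 positions on the minus strand of the contig.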
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTATCTGGATTTACAAGCATATGCTATGTGGAAAATCATCCACATCCCAGATAAGCCGCCTATTCTGCCTGACCAACAACTCACTGGTGACCTAATTTTCTGAAAAATCACAATACTCTCTCAAACATTGTCTCTTGTTTGTTAAATCATTGAATACTTCTATACCATACAGTTAATGTCATTGCTGCTGTTGTTGAGCCCAAGTTAGTCAACAATCTTATAAGGTTTGCTTTTACGTTTCTTACCCAGTTTCCTGATTTAATCTGTTCTCCCTGATGGTTTTCATTTGTTCTTCAATGAAATAAGATGAAGCATTCTCAATTTGAACTCAGATTTATTCCTTTTCCTTGAAATTCAAGTTATGCAAGTTTTATTTCTTTAAGGAAAGGTATAAAATTTAAAATAGGTCCATTTACATGATATTTTCAGAGGAAGCTGTATATAATTTATAGTTGATACCTTATTTCTGAAAGACCTAAAATTGTCTTGTTAGGCGCTTAAATCAGATAGCACCATTGGAAAATCTCCGCCATGTGAAGAGAGTTCAGAAGAAGCATCTTGAAGATGGTATGCATCTGGAGTTATAGCAATTCTTTTGTCCCCTACCTCAGCGAATAATTCCTATGAAGCTATCTCTAAATATAAGTAATTTTCCACTGTTTTGTTTTTTTTTTTTTTAATAAATGTATTAGCTATGAGGATCTGAGTCATTTATTATGGTTATGTCTCTAGCCGTTTTGAGTCTAACCTATGTATTGAGGTTGAGTATTCATGAGACTTTTGGAGGAGGTTATTCTGGACCCTCCTTTCTAGGAGATGGGTTGGGTTCTTTGGCAAATTGGCTACCTTGCAGTTTTGTGGTATAATTCGTTGGAAAGAAACAGAAAAGTGTTCCTCAGAAGAAATAGTTTTTTGGAGGCTGTATGAAATTGGACATTTTTAATGCATCTCTTTGGGCTCTTGATTCAAATTTATTTTGTAATTATCCGTTGTTTTTGGTTGCTAAATTTTATCCAGCAATATCAGTTCGCCAAACTAGCCGTTAGAGTGGACTGTGGATTAGTGACTTAAGTTTGAACTATGTTGTAGAAATAGGGTTTTTTTCCTCCTCTAAATAAACTTATAAAAGTACAGATAATTTTGAAATTTCATTTCTTGGCCCTGGATCTGATATTTTATTAGGACCTTGTTTCTGTTTGGCCTCTCTTTGATTTGGAGTTGGGAGGTGATATCCATTTGGAATATTCATATATTTTAGGTGAAATGGTCAAGAGGCTATTATATCTTTTATGGTTTCGTTTAGCTTGCTGAGTTAAAGCTTTCCAGGTCAAATTTTTTTTTCTACCTTTTCAGGTTTTAATTTCATCATGCAATCATTTCTAGCCTGGTTTGAAGCTGTTTTTATCATGTATTCTATAAGGCTAGTAGAATAGGTCTTAGTCTCTGCCTCAAATGAGGTTGTTGATCATTAGATTTCTACAGGAAAAACTCAAATAGCATTGATCCTATGTCTTGCAAGTGAAAACAACTGCAATTTGGATGCCATGCCTCCAGATGTACAAGAACTCGTGAGCTTGTACCAGTTGAGTGCATTTATCACAAAGGTGCATTCTCCTATCGATTGAAATGTTTATTAATTAAATTCTGTTTCATAATAATTAAAGTACATCAATCAAACAAATTCAGATCATCCGAAAACTAGCATTCAATTTCTTGATTTATCATTTCTGATCGTGTTGAATTTATTTTTTCTAATTATTTCTTTATTTTAATCAATTTAGTAAGTATGATGAACAACTTTTTCCAGGACCAGGTGATTGATAGTAGGGTGGTCCATTTAATTAGATGGCATGCGTTTGTTGACTCTAAATTCATTGGTGGTTTAGACATTGCTCAATTTGATTATTTAGAATATTGTTGTCCATGAAATGTGGTTTTGGAATTTCCTAAGGAGAGAAATTATTTTTTCCATTATTGTGAATCAATATGGGTTAAACTCT
GTAAAATTATATAATATGGGCTGAATTTTAATTGGTCGATTCCCTTAGTCAGTAAAGGTACTCATCATAGCCCTTGGCAGTTCATTTCATATTACAACGTTTTACTTCTTTGTAATTCTTTCTCTGGAGAATAGTGTGAACATTCTTTTTTGGGAGGACATTTTGGAGAGTTGAATTACCTTTTCTGCTACTCTCTGAATTATGTAGGCTTGCCACCCTAATTTGTTATTTGTTCCGTCTATCGTTTTTGCTTTCAAATATCGTTCTTTATTTGTTGAGATAGACTTGTTGCGCCTCCTTATAACTCCCTCTTTTAAGCTCTTATATAAGTTAAAATGCCTGCCTTATACAGTGCCTTTGATGCTTGGTTGGTGGTTGCACGATTAGTTCAAATGAGCTGTATTGAAAGAAAATGGTACCCTTTTCTTTAACATCAGGGGTGCATCCTGTGTAAAATTGTGGGCCATAAGCATAATGCAAGAGTGGGTCATGAACACTCTGGTTGTAAAAAAAAAACTTGCTGGACTCTGATCAGTGATCACCAGGGATTTGATCTTTGTGCGTATAAGGCTGGAAAAAACACTATGGGGTTGTAAGAAAAATCTTGCTGTCAAGAAATCAAACAGTTTGCTGAGGTGGTTGAGGAGCTCATTGTTATTATTATCTTAAATTTTATTTATTCGTAGCTTGTGTGCTTTTGATTCATTGTGTATGTTTATGTGAGAATTGTTTAATGTTTGACTTTGATGGATTTGGAATAAAATGGTGGTGTTGTCTATCCCCAAAGTAGCAGGATAAGGTTCATTCATGTTTGTTGTTCAGTTGACTGCTGTTTGTTCTGGTCCCTAAAAATTTGACCTCTTTGGTACTTAATTCTAGTTTGTTGAGGACTTGCCATCTTTTGGAACATCCCATTTAACTTTCTTGATATTATCAGGTAGAGATATATTTTACCTTTAGTTTGTGACACAGGTTTGTAAGGAGGCAGCGTCAACAAAAGAAGAATGGGAAGAGCAGTGCAAGTTTTGGCCAACATCATATCATCCACCAACATAGTAAATCTTGTAACATTGTTAGCATTTTTCTTTTAAATGTTAGGGGTAGTCTTAATGCAGTTTCAACAATCGTGCTTGTGCAATATCTTGGTGGTGGTTGACCTAGTTATGTTGACAAAACTTGGTTTAAAATCCTTTTCTTCTAGAAAATATAATTTTCTTATATGTATCTTAAAAGTTTAAATTCTACCAGCTAGAACTTACTTCCTTTATTATCTCTTTTTCAATGTCAGCAATATTGATGGTATCACTGGATTTAATGAAGAGGACACACAGTCAGTTTTTGGCTTCATGAGACTTGCAATAGAATTAGCACAGTCGTCTAACAAATCGGTAAGTTAATGTACTTTGGTTTTCCCTCAACCGTGACCAATCCTAGATCAATCTGGATTTATGCTGTATTAGTTATTAGTTATTAGAATTAGCTTCAGCTTTCTCGTTTAGCTTTTCTAGTTTTATTATCAACTACAATTTTTTAGTTACTTTTGATTTCCCAGGTTCTTTTAGAAGCGTTAGCTATGGCCTTTAGAATTTAGATACTGAGAGGCAACATTTTATATGAAAATTTTGATCTCATGTTTCAACTCTTTTCAGAATATTTATGGTTAGGTTAGCTAGTACTCTATGTTTGCTTAACCTGTGAGTATTTAGTTTGTTCTGGAATTTTGGAACTTGTGCCTTTATATATATATATATATATATATATATATATATATATACGGGTTTTAAGCTATTAATTTTCCACCTGGGAGCTTGAACCCAGGACTTGGAGGGTATCTCCAAACATCCTAATTCCTTACCACCTGGGCCACCCTTGGTTGCTTTTATAGTTGGTTTGAAAGGCACTCAAGCATCTGGTTATTTGTGTGTTTGAGGAGAAAAAGCAATGTTGTTTTTTTAGCTAGTCTATTTTATCATCTAGCAGACTTTATTTAAAGAACGATA
TCACCTTTATAATAAATAGGCTTCTTACTGTTTAGTATTTTTCTGGCTTCTAGATAAACAAGTAGTTATCTCTCTTGAATATTACCATGTAGGCTTAAAGTAAGGGATAGCTATGCTATTTGGCCCATGAATCCCTGGTGTTGTCCATGTCTTTCTGGGCACGTAATTATTGTTTATATACACAATTCAAATTCTGCAGGTACTAAATGCTGCAGTGATAGTAGATCCTTCAGTTAAGCAAGTAATTGCAAGTGCCTACGACCATCATATTCACTCCGAGAATGCTTCCAAAAGCAATGCCAATGGGGAAACAGGCTTCGAAAAGCGGCTGGAATCTATTGGTTCTCATCTTGATTCCAATGGAGCCACTATCCATGGCACTTCGCTGTCGATTGGCTCTTTGAAAGCACTGAAACAATCATGTTCCGATGTTTTGTGTTTGTACCCGTGGCGATGGGTCGAGCAACAGTCAGAAAATTCATCAAATTCTTGTCAGCACCCTCTAAGGCATGCTGCTATTGCAGCTATTGAATCTTCTGCTGCTCGGGACAGGCGCCTATACCCCAGTTCTGGAAGCGTTGGTGATAAATCAATTGAAATGGAGCTTATGGGGCCTCCGACCAACCTAGCAAAAAGACAGAAGACTGATCTTGAAAATGTAAGTTACCCATGTTAATGCAAGACCTTTTGAGTTAATCATTAAAAAAAGAAGAGATTAGTATCTAACATCCTTACATGAGACACGAAGGAAAGAGAACAGGGGGTCATACCATGAACAGTCATAATAAAAAAGGAGAAGACTGTGAATTTGGCTAGTTAAAGCTCCATTAATAAGTAGCCTACCTTTTGAATTGACAGGCAATCCAACATACTTGTACATTTCTTTGCCCTTTTATTCTTTCTTCGTTATTTATGGTTTAAATCTTATGTTAGATCTCTAAACTTTCAATTTATTGTATTTTGGTTCTGCCTAAATTTTACATAAAAAATTATTTGTCATTGTTAATTTTTTTTGGGGACAGAAACTTGGCATAGGTACATATTTTGTTTGTTTACCTAATAAATGGATGCCTATATCTACAAGTATTTGAAGTCAAATGCAGTAAATTATGCATTAGGTTTTCATATAGATTTAGACTCTTGTTAAAAGATTAATTAAAAAATTAATGATAAGGACTAAAAACAGTTTATTATGTAAAATCTAGGAATTAAAACAGAACATTTGAAGATTGAGAGACAATTAAAATATTTGAAAGCTTAGGGACTATAATAAAATAAGCAATCAGGGCTTAGAGATCACAATAGTATTTTTTTATTTTTGTGTTTAAATATATATGTAAAGATTAATATGTTGTGGTGTACAGGTCAAAAGTGAGAAGGTAGGCACTGATATTGGAGGCATAAATTCTAAATCAGCCAGACCATACCTGTGCACTGGTTATGACATTTATCTTGTGTGGGAGCCATGTACCATGTAAGTTCACAAAATGGATCCTTACTTTCCCCTTTTCCCCATTGTTCTTTCTTGGATATCCATTTTATATCATCTTTCAGGCATTTAGCTTTTATGAGTGTTTTTATGCTTAAATTTAATTGAAGCTGGTATATTGTGGATGCTAAATTATACAAGCAATGTTTCTAGCATGTGTAGTTGATTTCAATTCTATCTCCAATGTTTTAATTTGCTTAAGGAGGACCATTTGTGAGCAATTTCTCTAAAAATGCTTAACGGAAATTAAATGATTAGGAGTAGAATTGAAACTTTTAAAACATTAGGAGAAAAATTGTAACCAATTACATGTTTTAAAAGTTTCAATTCTACCTCTATATCTTTTTAACTTTGTTAGGAATTTAATAAATATTGCTAACACAGGTTTTGCAATAAACAAATTAAAAAATTGATAGTAGAATTGAAAATTTTGAAATGTTAGGGATAAAATTGAAACCTGCTAACATGGGATGTTTTCTATAATTTAGCCTATTCTTGATTAAAATGTTAG
GGCCGAACTCTTATGGATTAAGTATTTTTCTATTGAAAAATCTTACCGTAAATGTTTTTACCGTAGTCTTGCGTTAGTCATATCTCTAAACCTTTATTTTTTGTGATCTTGCTTTTCTTCGTTAAAGGTGTGCAATGGCACTCATTCACCAACGAGTTAGGCGCGTGTTTTATGCGTTCCCTAATCCTAACGATGGGGCACTGGGCAGTGTTCACAGACTACAGGGTGAAAAGGGCTTAAATCATCACTATGCTGTTTTTAGAGTTTTGTTAACTGAAGATGTACTTTGATGGTTGCTATATCTCGAATAGAGGGAATGCCTCAATTTAATTTGTAGATGGAGTTGGGGATATGATAATTCAGATTTGGTTGTATTTTAATGTTAATCTAATCTATTTATTGCTCATGAACCACCATGCTTTAAGACTCCTCAATGATGGTCAATTTGACATTTCTTGAAATCAAATTGACACTATTGTTTATTAAATTCTTTGTATTATGTGACTCTACAATTGAGTTTTGGTTGGGGAAGTAAAGACCATGTCCGAAATGAATAGTTTTTGTTTGTTTTTGTTTTTTTATTTGGGGAAAACAACAAAAATAACCATAATAAATAGGGTGTTTTGTTAAATGATCTTGAGGTTTTGATATTGCAATTGTAGCCATACATGCATATATCTAGTGTATTTGATGTTTTCTAAAATCTAATCCTCTTAACAAACTAAACGTGGAAATATTTAAATTTGAAAACAAATATTTCAAATGTTTGTTAACTAACAGAATAAAGAAAAATATTGTAAAAAAATATGGGAAAATATGAAAAGATATTTAAATATTTTAAAATTTTAAACGATATAATATTTAATTACGTAAAATAATGAGCATGAAAAATAAAAATCTAATATATAAGCTTCAATAACATTTATGGTGTATATAAAATTATTTATGCTGAATAGTATAATAGACGGTCAAAATAAATAAAGATGAATGACACTTTTTTTTAATAAAAAAATGACGCTTGTTTAAAGTTTTAAAAGTGCAATGATGTATAATATATGGACTAAGTATTAATCGAGCAAAGTAGAAATGTAGTGTAGCTATTTATAATTGTTTATCATGTATTAAAAAAAGACTATTTATGATAAATATTAAAACAGACATGGTCAATATCAATAAGAATATATAACACTTGTTAAGTGTTAGACTTGTAAATATGAATATATACTTAGTTAATATTAATCATAATGAAATATATTTATTCATTATTTCAGTAGTATTTATCATATATCAAAAACTATTTTCACTAAATATTAATAAAAATATATATTAAAGATTATAAATATGTCTTGACACATGACCAATACAAAATATGACTAAAGTAGAACTTATGAAAGAAAAAAAAAAAAAGATGAAAACAAAAGCAAGATGTAAACTACAATGCAAGGAAAATGATTAAAGTGGATTTATCTGATAAATATAGACATTCACACATAAAAAATAGCAAACTAACAATAAAAATAAAATCTAAATCTTGTAGTAGAAAATCTAAGCATTAACTAGGGAAGCTTTGACGGAACAATGACTATATTTTCGTCCCCAAAATTCTAACCCCATTCACTTGCTTCTTTCCAATTTGAATATAGAAAAAAAATATATAAACAAAAGGATCTAGACAACCTGCAAGAATTTTAACAAGAACATATATGTTAGTACTTGCTTTGTTCCAATTTTAATGTCTGGTATATAATAAAAAATACGAAATTTAAAAAATTGAAAGACTAAAAGAAAAATGTGTTAATTGGAAAACTTAGAAAATCAATGAGAGATAAAAACTTGAAAGAAGGTCAACGTCAATGGAAGATCAAAGACAGTTTTGCGTTGATGGAAAATATGAGGAAGACGAGAAGATGTGGAAACGTAAATTAAATTTAAACAATCACAAAATTTGAAAACAAAATTTGAATTTCAAATAATAAGGGACCAAATCATATCTAAGATTT
GATATAAAATTTGAAAGTCATTATTAGAAAACAAAATCTACAAACCGTATAACAAATTTACCAAGGGCAAAGTACACAATACGTGAAGGAGAATTAGGAGTTGAAATTCTGGATGACCATTTGTGCAAATCTAAAACTTTTGGCCATTTTTCTATTTGAGGGTGTAAATATGGTAATCCACACTAATTTTTCAAAAATAAAAGAGTACAAACAACAGTCCAACAAAATAGTCATCATAACTTGGCTAAATATACAAAAAAAAAAAAAGAAAAAAAGAAAAAAAAAGAACTTTAATTGTGCTCCAATCTAAAAGAATAAAAGTTTAAAGAATAATTATAAAATTCTTTTGAAATAGATGCCCAAATAGAGATATTAAACTTGGCAAAAGACTAAACTAGCTCTTTCCAAGATATAACACTTCAGAAAATTTTTCCATTCCTCTCAAGTTAAATGCTCCATAAAACAGTAGAAAAGTTGCATGCTAAAGAACTCAACCCTTATCATGAAACGACAAGTGACAACATCTCTTCCATTATAAAACAACAACCTCTCTTACAAGCCAAACAAACGCTAAAATACTCCAACCAACAATCCCAAATTGAGTGAGCATATTGGCATCTCAATATCTCTTAAGGATATGATCCAAATTCTCAACTGCTCCCCTATAAAGAATGCACAACCCCAACAAATAGGAAGAATTCCTCTAAATACGGTGCATAGTCTTAACTTTCCCATGCAAGACTTGCCAAGCAAAGAATTTGACCTGTCTGAAAATTTTAACCTTCCATAATAAGGAGAATAAAGGGTATCGAGAAGAATAGATTGACAAAAAAGAAGGAAGAAAGAGCTACAAAAAAACCCCTTAGAAGGATTAAGGGACTAAAAATGAATATCCCTACTTTACAAATGAAAATGGACACCCAACAAAAAAAAATTAAAGAAACATTCACCATCTCTCTAATAGAGGGTAGCAAAAACCAAAATTAAATGAAGAGGAGTTACCTAAGGAAGGAATAATTGTAACTACAGACACAACTTCAAAGAGGATAGATAGTATAAGTAAGGGAACAAAATACAGAAGGGACGATCACTAACCAACAATCCTCCCAAAAGAAAGTATTCAAGCCATCCCCAAAGAACATTTCACAAACTTAGAAAAAAGGGGAAACCTTAGCATTTCAAAGGTTCTAATACACCTTTAGACCCACAAACTGAAGCCTACTCAAAAGTGTGAGGATTGTAATTACTCATGTGACCTTAAGCCACAAGGTGTTCAGCTTCCAGGAGAAATGCCATAACTATTTTGTTGCAACACAAAGAGAACAAAATAAAGAAAAAATGAACTGAATGCTTTTATTTTTTTTGAGTAGAAAAGGATTATATCGTTGAATTTATAGAGTTGAAAATAACAAACTCTCCAAAAACTTCTAATCAGAATTCTACCAGCAAGGAATAAAAAATTTTGGTTACAACTAAAATTGACTTGGTATATCTTGAAGAATAGGAACTAAAAAACTAACATATTGGCAGTAAACAAAATGAAGTAAGATGCACTAATTTGTAAGATTCCTCCTTAGTGCATACTTATTACGGTCTATAATTGCAAGGATAGCTCTAAAATTCTCAAATTTTGGCTTGTCTAAAGCCTCGTAAATATATCTGCAACTTGATTAGTAGTAGGAATTCCTTTTAGCTCAATTTCTTGCTCATAACTTTCTCTCGAATGAAGTGGTAGCAAACATCAATATGTTTTGTTCTTCTATGCAAGACCAGATTTGAAGCAAGTTTGATGGCACTTTTATTGTCACAATAAATTAACACAGCACAATCAACTTCAGAAAAAATATCTCCCATTAAACACTTTAACCTATCACATTCCTGTGAAGCCATTGTTTTCGTCATGTACTCAACTTCAGTGGTAGATAAACCAGTAGTAGATTACTTTTTGCTACCCCAAGAAATGATAGTTGAACCCATGCTAAAACGGTGTCTTGAAGTTG
AATGACGATCAATTGCATCACTTGCCTAGTCTACATCTGTAAATCTTTTTAGCAAAAAATTATCACACTTTCTTATATAAGTCCATAAGCAAGGGAACCCTTTTATGTATCGTAGAATTCTTTTAGTAGCAATTAGATGAGGAGTTCTTGGGCTTTTTCTAAACTAAGAAATAACACGAATAGAATAAGCAATTTCTGGTCTTGTGATAGTTAAATAAATTAAACTACCAACAAGTTGTTGAAATGTTGTAGCATCTTGTAGAAACCCCCTCCTTCCTTCTCCAACTTGAGATGTGTCTCCATAGGAGTGGCCACTTCTTTTGACTCCCCCATGTTGAAACGATCTAAGAGATCTTTTGCATACCTTCTTTGAGATACAAAATAACCTTGATCTAATCTTTCAACTTCTAAACCAAGGAAGCATCCAATTTCTCTTAAGTTTTTCATTTCAAACCGATCTGATTAATCATTTCTCAAGTGTGAAATTTCAGCTTCATCATCTCTGATAATTATCATGTCGTCCACATACAATAAAATCAACAACCACATCTTTGATTCTAAATTTATAAACAAACTAGAATTTGAATATGAAACTTCAAATCTGAAAAAGATAAAATAATAAGTAACTTTACCATACCAAGCATGTGGTGCTTGTTTGAGGCCATACAACGCCTTCTTTAATCGACAAACATGATTGAAAAATTATTGGGAGATGAATTCTTGTAGTTGTTCCATAAGTACTTCACGATCCAACTCTTCGTAAAGAAATGCATTCCTTACATCATGCTGCCATAGCCTCCAACTTTATAGGCAAGTAATAGAAAATATCGTCCTCACCGTAACCATCTTTGCCACTGGACAGAAGATGTCCTCGTAGTCCAATCCATAACTTTGAGAAAAATCACAAGCTACAAGACATGCTTTGAATCTATCAACTATTCCATCTGATTTCTTCTTCAAGTGATAGACCCATTTGAAACAGCTTAGAATTTTGTGGTTTTGGTACAAGCTCCCATGTCTAATTTGTTTTAAGGCATCAATCTCTCCTTGCATTGCTGCTTCCCATTCTGGACAACCTTTAGGTTCTTCATAACATTTTGGCCCTTCATTTTCTATAGTAAAAAAACATGAGGCCACGATGCAATAATTGAGCTGGACTTCATAATCCTTGAAATAATTTGGTTCTCGTTTGTCTTGCTAATCTTCCAACAATTTGCTCACCATCTTCTAAATTTGGAGAATTTAATGTTAGGGATACAGATTCTTCGCTTCAAGGATTATCATTATAAGATGCTTCAATATCTATTTGAGAAAACATCTCCAACTTATCTGGATCACAACAAACTCAATAATTTTGACATGTTCATTTGGAACTGAATAAGATGATACTTCATCAATCACAACATCTCTTAACGTAATAAATTTCTTTATCTTAGTATCCATGTATCTCCACCCTTTCCTGCATGAATCATAGCCAACAAAAATAATTTTTTTTGCTTTCAGATCTGGTATGGTTCTATTATTTTTAGGAACATGAACATAACAAGGAGAACCAAATACCCAAAAATAATCTATATTTGGTTTCATGTTATATAAGATCTCAAAAGGAGCCTTTTCTTTGCCTGGCTAAGAAGATAACTGATTTATCACATGACACGCACATTGAATTGCTTTTGCCCAAAGTTCTTGTGGAAGGTTCATGTCATGTATCCATGAAAGTCTCATTGAAGAGAGATATTCCAACTTTCTTTTAACGACCCCATTTTGCTATCGTTATTAGGACATGTCATTTTTCGTTGAATATCATTGTTGTCATAGTATTGAAAGAAATCATCTGACATAAATTCTCCACTATTATCACTCCTCAAGCATTTTATTTTTTTCCCAAACTACTTTTCTACAAGTATGTTGAATTTAACAAACTTGAAAATGCTTTACTCTTTTCTTTAAAAAAATGCACCCTAGTAGGGGTGGTAAAAAAACTGCCAAACCAACTAATC
GAATCGATTGGTTTGGTTCAATTATTTATAAAATATTGGACATATCATTTCCATAATTCAAAAAACCCAAAACTTTCGGTTATGTCCAGTTTTATATTAAAAAATAAAATCAAACTGAGTTACATATATATAACAAAAAACTATAATTTTCTGACCCACTTCCCTTTGCTTCTCAAGTTCACAATCACGTGCACACACCCTAGTTCTTCTACCTTCACCATTCGTCCTTCATCATCACTCCCATCGTCGCCATTTGTCCTTCACCCTCACGCCCTCCGTCACCGTTCTTCCCTCACTGTCACTGTTCGTCCGTTCCTCGGTCGCCTTTTGTCCCACCCTTCCGTCGTCATTCTGTTCATCCTTTTCCTTCCCTTCCATCACTGCTTCCTCATCCTTCACTCATCGCTGCTGCCGCTTCCTTCCCCTCCGTTGCAGCTACCTCTTACCGAGAAACAGACCTTGTCGACTTCCTTTTCGCCGATCTACCTTCGGTAGAGACATAGGTCGTGACTTCGTGAGCCTCTCCCTCCCATCAGACATAGATCCGACGGGCCCTTTTCAATATCAACAGCATAGCTTCGGCAGCTACTCCTTCCCAAGCCCATATCTCATCTTATCAACATCAGATAATGTTTTAATTTTATTTACTTCTGTCTCATGGTCATGATCATGGAGTACCAATTGTTGATTATTTCTCGTCTTGGATGTATAGAACTTGCAAACAAAAATGGAAATTGATCCTACAAAAGATGATAGCACTCAGGTTGGGGGAGAGGTCGAATTGCCATAGTACAATCACCGAGAAAGAATAAAAAAGATAGAACAACTAAAGCTTCAGGTGCGTGGGAGCACTTTACTAAAGTTGGTACTAAATCTAATCCTAGAGCATCATGTAAATATTGTGGTGTAACTTATGCATGTGACACTAAAAAAAATGGAACCACAAATATGTTAAAACACATAGTATTGAAGTGTAAAAAGTATCCTAGAAGAGTAAACAACAAAAAATTTTGTCATTTGTTCAACCTAAAGATGGAGAAGAGAGTTATGCTTTCTAGGTTGATTGTGAAGGAAGAGTTGCCATTCAAAATTGTGGAGGTGGAAGGGTTTCAAGAATATAGTAATACATTACAACCAAAATGATTAATAAAAAATGAATGATAAGGATTAAAATAGTTTATTGTGTAAAGTTTAGGAATTAAAACAGAACATTTGAAGATTGAGAAACAAATAAAATATTTGAAAGTTTAAGGACTAAAATAAAATAAGCATGAGAGCTTAAAGATCAAAATAGTACTTTTTTTTAAAAAATTTTGTGCTTAAATGTATATGTGAAGATTATTATGTTGTGGCATACAGGCCAAAAGTGAGGAGAAGGTAGGCACTGATATTGAAGGCATAAATTCTAAATCAGTCAGACCATACATGTGCACTGGTTATGACAATTATCTTGTGTGGGAGCCATGTACCATGTAACTTCACAAAATGGATCCTTACTTTCCCCTTTTCCCCATTGTTCTTTCTTGGATACTCATTTTATAGCATCTTTAAGATATTTAGCTTTTATGAGCTTTTTTATCCTTAGATTTAATTGAAGCTGGTATATTCTAGAGGCTAAATTATACAAAGAGGTTTTAGTGTGTGTAGTTGGTTTCAATTTTGTCTTTAATGTTTTAAAAGTTTCCATTTTGTCTCCAAATTTTTAATTTGCTTAAAGAAGACCCTTTGTGAGCAATTTCTCTAAAAATGCTTAATGGAAATTAAATGATTAGAATTAGAATTAAAACTTTTAAAACATTAAATCGAAACTAATTACATGTTTCAAAAGTTTCAATTCTACCTCTATATCTTTTAATTTCGTTAGGCATTTAACATAAATTGCTAACAAAGAGGTTTTGCAATACACAAATTAAAAAATTGACAGTAGAATTGAAATTTTCGAAATGTTAGGGACAAAATTGAAACCAACTAACATGGGATGTTTTTTATAATTTAGCCTA
TTATTGATTAAAATGTTAGGGCTAAACTTTTTCTGTTGAAAAATCTTACCCGAAATGTTTTTACTGTAATCTTTTTTTAGTAATATCCCTAAACCCTTATTTTTTGTGATCTTGCCTTTCTTTCTTCGTTAAAAGTGTGTAATGGCACTAGTTCACCAACGAGAGTTAGGCGTGTGTTTTATGCGTTCCCTAATCCTAACGATGGGGCATTGGGCAGTGTTCACAGACAACAGGGTGAAAAGAGCTTAAACCATCACTATGCTGTTTTTAGAGTTTTGTTACCTGAAGATGTACTTTAGTGGCTGCTCCATCTCGAAAGAGTGGAAATTCTTCAATTTAAATGTAGATGGAGTTAGGGATATGGCTAATTCAGATTTGGTTGTATTTTAATATTAATCTAATCTATTTATTTTTCATGAACCACCGGGCATTAAGACTCCAAAATGATGGTCAATTTCACATTTCTTCAAATCAATTTGATACTATTGTTTATTATATTCTGTGCATTATGTTACTCTGCAAGTGAGTTTTGGTTGGGGAAGTAAAAGACCATACCTAAAATGAATACTTTTATTTATTTATTTATTTTTCAATTCGAAAAATAGAGCTTTCATGGAGAAAAATGTAAAAATACAAAAAGGACTTACAAAATAGCAGCTCAACAAAAAAACCTATCAATAACTTGGCTAAATATACAAAATGGAGGCTTTATTCTTTCTCCAATCCAAAAGAATAAGTTTAAAGGATAATTATAAAATTCTTTGGAAGTAGATGCCCAAAGAGAGACATTAGACCTGACAAAGATTAAACCTCTACTCAAGATCTAACACTTTTAAAAATTCTCACATTCCTCTCAAGTCAAATGCTCCAAATAATAGTAAAAAAGTTATATGCCAAAAAACTCGACCCTCATCAGGAATGACGAGTGACAAAATCTCTTCCATTATAAAATAACAACCTCTGTTACAAGTCAACTCCAAAAAAAAAAAAAAAATCCCAAATTGAGTGAGCATATTGACATCTTAATATCTCTAAAGGATATGATCCAAATCCTTAGAAGCTCCCTTACAAAAAATGCACTACTATGACCAACAAATAGGAAAAATGTCTCTAAATATGGTCCATAGTGTTAACATTCCCATGCAAGACTTTCGAAGCAAAGAATTTGACCTTCTTGAGAATTTTAACCATCCATATGAGGAGAATTAAGGGCGTTGAGAAGAATAGAGTCCACACAAAAGGAGGAAGAAAGAGCTACAAGAAAACCCTTTAGAAGGATTGGGGGGGACCAAAAATGAAAATCTCTACTCTACGAATTAAAATGAATATCTCCCAACAAAGAAAGTAAAGTTGAAACATTCACCATCTCCCTATTTGATAGAGGGCAACAAAAACCAATATTAAATGAAGAAGAGTTACCAAAGGAAGGAATGGTTGTAGCCACATGCACAATTTTAAAGAGGATAGTGCAAGGAATGAACAATTCAATCCCCACAGTAGAAGATTAGGAATCGACATTGGCACTTAAAAAAGTTTCAAGAAGGAAATAACTTACTTGTTTCTTAGATGTGAATTTTTTTTTGTTGAATCACTAACAATAAACTCCAAAAGATCTTCTCTGCTATCCTCGAATACAAACAAACAAGCGTGTGGGCGCTTCTTGGAAGAACATAAATGGGAAATTGTGTGAAAAATCCCTTTATGAATTTTGGCCAATGTATTGGAGGAGATTTAGGTTATTTTCTCAACTCATGAGAGGTTCTTCAATTCTTTAATTAGAGTCCACCAAAGAGTTATATTTATAGATCATTTGGAGCTCCATTTTCCTAATGAAAAAACGTTTCTACTAATTAGAAAATGAATCAAATTCATTATTATAATATTGACTACTATAATCTTTGTATTTTACTCTTTGGTTTACTCTTTTTAATTTTGTCCAGTGCGCATTAATTTACAGTTTTGAGAATAGTTCAAGATTTAAGTGTGTATTAATATAAAAT
TGTGTCTTAAAGTATAACAATTGGTATCAGAGCATGCCCTAGACTTAGCAGTGTCAGTAGAATCCTCGAGTGCCGAACAAAGAAATGATGAGCTTCAATGACGTAGTCATGTGGAGATTCGTGGTTGAGCTCTATTGGATAGGCGGTGTGCTCCGAAAAAAAGGAGTTGGTCTGGATAAGAGCGGACACATGAATGACTATTTGAGGGGAGGCTCAAAGGTACTTTGTCCCATATTGGCTAGGAAAGGAGAAGATCATGGGTTTATAAGTGATGACAACATCTCCATTGGTTTGAGACCTTTTGGGTAAAACCAAAAGCAAAGCCATGAGAGTTTATGCCCAAAGTTGACAATATCATTCCATTGTAGAATGTGAAGAGGGTCGGTTGTCCTTAACAATTGGTATCAAAGCTCGGAATCTTGCTATCTCAGATCCGTTTTTGAAACTAATTTTTGTTTTTTAATTTTAGTTGATTATTTGGGAATTTTGGTTTGTGGAAAATCAAAATGAAGGCCATCTTGACTCAACAAAAAGTGAATCTTGCTATTGCACCCCTACCTCTGTTCCTAGTACCTAAAGGATATGGACAAGGTTGCCTATACCTTGATAATCTTGCATCTTATCGACAATGTGTTGCGAAAATTTAGTGGTGAGGATACCATCGGTAAGCTATGGGGTAAGCTTCAATTTCTTTATCAGACAATGTCATTATCAAATAAGATTTATTAGAAAGAATGTCTATTTGGCTTTAAAATGGATCCTAGTAAGAGTCTTAGTGTTAATTTAGATAAATTCAATAGAATTACTTTTGATATTGCTAATTGTGATGAGGACATCTCTAATGAAAATCAATTTATTATTTTATTGAATTCATTGCCTGATTCTTATAAGAATGTTAAAGCCACTATAAAATATGATAGAAATTCTTTGACTTGTGACATTGTTGTTAATGCTCTTAAAATGTGAGATTTAGAATTAAAGAGTGAAAAGGATGGTGAGTCTTTATTTGTTAGGGGTAGAACTGATAAAAGATATAGTGGTAAAAACAATCATGGTAAAAGTAGGTCCAAGTCTAAGTGGAAGAGTCATAGATCCTATCATTGTAAAAAAGAGGGTCATATTAAACGGTTTTGTCTTGAACTTAAAATAAGAAAGGGGGAGAGAATAAGGGGAAGGAAGTGGTGAGTGGGATAAACCTAGGAGAGGGTTGTGACACCGTTGAGGTCCTTATGGTTAGTGAGCACCAAGAGAACAAAGCCTGGATTTTAGATTCGGGTTGATTCTTCCATATGACTCCACATAAAGATTGGTTAGCAGACTTCCAAGAATCTAGTGGTGGTAAAGTCTTTTTAGGTAATAACCAATTTTGTGAAGTAGAGGGACAAGGAACCGTTAGATTTAGGATGAATGATAGGGCCATGAGACTCTTGACGGGTTAAGGTATGTACCTAAGCTTAGAAGAAATTTGATTTCTCTAGGAGTAATGGATAAGTTAGGTTATATCATAAAATTTGAAAATGGAAATTTAAAAATTATGATCAGTATAATGGTTAAGTTGAAGGGAAAACTTATTAATGGCTTGTATTTCTTAGAAGGTTCAACTATTATAGGGTCAAGCGCCTTGGTTAAAGAACATGATATTCCTTTGTAACCCTTTGGCATAAAAGATTAGGGCACATGAGTGAACTAGGTCTCCAAAAATTAGTTAAACAAAATTTGATTAATGATGTGAAAATGGTAGACTTAGACTTATGTGATCATTGTGTATATAGAAATCCAAAAAGGGTTAAATTTAATAAAGAAATTCATAGTTCTAAAAAGAAATTAGAATATGTACACTCTGATTTGTAGGTCCTTCTAATGTTTCTACTAATGGTGGAGTTAGATATTTTTTGTCAATAATTGGTGATTTTTCAAGAAAAGTTTGGGTTTATATTTTGAAAAATAAATCAAATGCTTTTCAAAAATTTGTTGATTGGATAACACTTGTTGAAAATCAA
TCTAGCTATAGAGTAAGGAAACTTAGATATGATAATGGACTTGAATTTTGTAATCAAGAGTTCAATAATTTCTATAGAAAAAGAGTTATTCAAAGGCACTTGACTATAGTAGGGACTCCTCAGCAAAATGGTGTTGCTGAGAGGATGAACCGTACTTTAATGAAAAAAGTTAGGTGCATGTTGTCCAATGCATAACTTTCCAAGAGTTTTTAGGGTGAAGCTATAGCGATGACATGCTATATAATTAATAGGAGTCCCTAAACTACTTTAAGTTGTAAAACCCCTAGGAAATGTGGTATGGTACTCCCCTTGATTTGTCAAACCTATGTGTTTTGGGGTGTCTAGCCTATGCTTATAGTAATATAGGAAAGCTAGAATTTAGAGCTAAGAAGTGTATCTTTTAGGGTATCCGAGAGCATTAAGGGGTACAAACTTTGGTGCCTAGAATCTAATAGTGCTAAAACTTTGATTAGCAGGGATGTGGTCTTTGATGAGTAAAAAATGGGGTATGTAGAGGATACCACTGAGTCAAAACTAGTTGAAAATCCTAAGGTTGTTGAAACTGTTCTTCCACCTCTTGACACCTCTTCTGAGATTATTTCTTAATCTACTACTGAACAGACACTACTTGAGGAAGTTGGTTGCTCAAGAGGAAGAGGAAGAACAGGATAATCAAGACCTTACTGATTATCAACTGTCTAGGGATAGGATTAGGAGAACTACTAGGGCACCCACTAGATATGGTTATGCTGACCTAGTAGCCTATGCACTAACAGTTGCTAGTGAGTTTGTTGACTTAGAACCTACTACTTTCAATGCAACCATGATTGATAAGAACAGGGAGAAATGGTTAACTGTAATGAAAGAGGAGATGGATTCTTTAAATTAAAATAAAACTTGGATTGTAGTCAACAAACCTAGTGAGATAAAAACAGTAGGTTGTAAATAAATTTTCAAGCTTAAAAAAAATATTGAAAATGAGAGCAAGCATAGGTTTAAGGCTAGGTTGGTTGCTAAGGGTTTCACCCAAAAGGAAGGTATTGACTATGACGAGGTATTCTCATCTGTTGTGAGACACACATCAATTAGGGTTTTTCTTGATTTTGTTGCTCACTTTGATTTAGAACTTGAGCAGATGGATGTCAAAATAACTTTCTTACATGGTGAACTTGAAAAGACGATCTATATGGATCAACCACTTGGGTTCGTTGCACCTGGACAAGAAAGCAAGGTTTGTTTTCTTAAGAAGTCTCTTTATGGTTTAAAGCAATCTCCTAAACAATGGTACAAAAGATTTGATTCATACATGCTATCTATTGACTTTTGTAAGGAGAAAGTATGATAGTTATGTTTACTATAAGAAAGTTTATGAATATGATTTTTTTACTTTACTTACTCTTGTATGTTGATGACATGTTGATTGCTTGTAAATCCATGCATGAAATAAATGTTTTAAAAAGACTTCTTAGTAAAGAATTTGAAATGAAAGACTTAGGGTAAGCAAAACGTATTATAGGCATGAATATTAAAAGAGTAAGAGAAAGAAATATTCTTTACCTATTTTAGAATGATTATATAAACAAGCTTGTGAATAAATTAGGAATGCATGATGTTAGGCATGTTAGTACAATGTTTGGAAAACATTTTAGGTTGTCTTCTTCTAATGCACCTAAAACTCCTATTGAGATTGATGAAATGCATGACATACCATACTTTAACACAGTAGGTAGTTTAATGTATGCCATGGTTTGTACTCGCCTGATTTAGCATATCCAATGAGTGTAGTAAGTAAATTTCTTACTAATCCAGGTAAACAACATTGGAATGTTGTAAAATAGATTTTGAGATATCTTGCATGTCATTCACATGAGTGTCTTGTTTATGGTGAAGATAGTGTAAATAAAGATAATGAAATCAAGATTGAGGGTTACGTCGACTCTGATTATGCTACTGACATATATAAAAGAAGGTCATTAACTGGTTTTACATTTAGATTGTTT
AACTATACTATTAGTTGGAAGTCAGTTTTACAATCTGTTGTTGCTTTATCTACTGTTGAATTTGAGTATATTGCATTGACTGAAGCTATAAAAGAATTCATATGGATAAAGAGATTAGTTTCAAAATTAAGTAGTAGTGATGTAAAAGTAATAATTTGGTGTGATAGTCAAAGTACCATTCATCTATCCAAAAATCATTCTCATCATGAGCGTACTAAACATATAGAATTAAGTATCATTTTATTAGAGATGTTATTTGTGATAAAATAATATATGTGAAAAAAATATCCAGTAATCTTAATGCAGTTGATATGTTTACTAAAGCTCTCCCAATTGATAAATTTGATATATGCATGAGAATTTTACATATTGATAAGTGATAGAATGTTTTGGGCACTTGTATGGTGGGAACCATCATGGTTTCAAGAATATTGAAATCATGGTGGAGTTTGTTATAATATGATTAAATATTCTTTTCTTTTTTTGGGTCCTTTTATTTTCAGTATTGTAGTTTTATTAAAATGGCCCAACTTATCTTTATTTCATATTACCACTTGGGCCATAATTCAAATGCCCAAGCCCACCATTTGGTTTGTTATTATTACCGATACAGCATGTTATGTAGATTATTTTAAGCTCACTAAGGCAACCGATTATCACACCTATATATTCATCTCTTTCTCCCCCATTTTTAAGCATCCCCATGTGAAATACAACTCAGAATAGAGAAAAGGGTTTGAGGGAAGGTTGGGGTTTGAGGGTGAAGGTTTCCACTACCGACGCTGTGATTGCCGTGTCGTGGAGTCGACGTCGTTGGGTTGCTGTCGTGGAGAAGCCATTGCCACCATCGAGCAAGGAGGCCGTTTGGGCACATGGAGTGGCGTTCCGTCTCTGTTTGGCTTCTGTGAGGCGTTCTTGGTCGGCGATTCCGATCTACACCGTGAACTCAGTTGTCTTCTTCCCTATCTTGGGTGGTTTGCTTTTCCTCTCGATTTATTGTTTGGAATAG

mRNA sequence

GTATCTGGATTTACAAGCATATGCTATGTGGAAAATCATCCACATCCCAGATAAGCCGCCTATTCTGCCTGACCAACAACTCACTGTTAATGTCATTGCTGCTGTTGTTGAGCCCAAGTTAGTCAACAATCTTATAAGGCGCTTAAATCAGATAGCACCATTGGAAAATCTCCGCCATGTGAAGAGAGTTCAGAAGAAGCATCTTGAAGATGGAAAAACTCAAATAGCATTGATCCTATGTCTTGCAAGTGAAAACAACTGCAATTTGGATGCCATGCCTCCAGATGTACAAGAACTCGTGAGCTTGTACCAGTTGAGTGCATTTATCACAAAGGTTTGTAAGGAGGCAGCGTCAACAAAAGAAGAATGGGAAGAGCAGTGCAAGTTTTGGCCAACATCATATCATCCACCAACATACAATATTGATGGTATCACTGGATTTAATGAAGAGGACACACAGTCAGTTTTTGGCTTCATGAGACTTGCAATAGAATTAGCACAGTCGTCTAACAAATCGGTACTAAATGCTGCAGTGATAGTAGATCCTTCAGTTAAGCAAGTAATTGCAAGTGCCTACGACCATCATATTCACTCCGAGAATGCTTCCAAAAGCAATGCCAATGGGGAAACAGGCTTCGAAAAGCGGCTGGAATCTATTGGTTCTCATCTTGATTCCAATGGAGCCACTATCCATGGCACTTCGCTGTCGATTGGCTCTTTGAAAGCACTGAAACAATCATGTTCCGATGTTTTGTGTTTGTACCCGTGGCGATGGGTCGAGCAACAGTCAGAAAATTCATCAAATTCTTGTCAGCACCCTCTAAGGCATGCTGCTATTGCAGCTATTGAATCTTCTGCTGCTCGGGACAGGCGCCTATACCCCAGTTCTGGAAGCGTTGGTGATAAATCAATTGAAATGGAGCTTATGGGGCCTCCGACCAACCTAGCAAAAAGACAGAAGACTGATCTTGAAAATGTCAAAAGTGAGAAGGTAGGCACTGATATTGGAGGCATAAATTCTAAATCAGCCAGACCATACCTGTGCACTGGTTATGACATTTATCTTGTGTGGGAGCCATGTACCATTTCACAATCACGTGCACACACCCTAGTTCTTCTACCTTCACCATTCGTCCTTCATCATCACTCCCATCGTCGCCATTTGTCCTTCACCCTCACGCCCTCCGTCACCGTTCTTCCCTCACTGTCACTGTTCGTCCGTTCCTCGGTCGCCTTTTGTCCCACCCTTCCGTCGTCATTCTGTTCATCCTTTTCCTTCCCTTCCATCACTGCTTCCTCATCCTTCACTCATCGCTGCTGCCGCTTCCTTCCCCTCCGTTGCAGCTACCTCTTACCGAGAAACAGACCTTGTCGACTTCCTTTTCGCCGATCTACCTTCGGTAGAGACATAGGTCGTGACTTCGTGAGCCTCTCCCTCCCATCAGACATAGATCCGACGGGCCCTTTTCAATATCAACAGCATAGCTTCGGCAGCTACTCCTTCCCAAGCCCATATCTCATCTTATCAACATCAGATAATAACTTGCAAACAAAAATGGAAATTGATCCTACAAAAGATGATAGCACTCAGGCCAAAAGTGAGGAGAAGGTAGGCACTGATATTGAAGGCATAAATTCTAAATCAGTCAGACCATACATGTGCACTGGTTATGACAATTATCTTAATAGAGAAAAGGGTTTGAGGGAAGGTTGGGGTTTGAGGGTGAAGGTTTCCACTACCGACGCTGTGATTGCCGTGTCGTGGAGTCGACGTCGTTGGGTTGCTGTCGTGGAGAAGCCATTGCCACCATCGAGCAAGGAGGCCGTTTGGGCACATGGAGTGGCGTTCCGTCTCTGTTTGGCTTCTGTGAGGCGTTCTTGGTCGGCGATTCCGATCTACACCGTGAACTCAGTTGTCTTCTTCCCTATCTTGGGTGGTTTGCTTTTCCTCTCGATTTATTGTTTGGAATAG

Coding sequence (CDS)

GTATCTGGATTTACAAGCATATGCTATGTGGAAAATCATCCACATCCCAGATAAGCCGCCTATTCTGCCTGACCAACAACTCACTGTTAATGTCATTGCTGCTGTTGTTGAGCCCAAGTTAGTCAACAATCTTATAAGGCGCTTAAATCAGATAGCACCATTGGAAAATCTCCGCCATGTGAAGAGAGTTCAGAAGAAGCATCTTGAAGATGGAAAAACTCAAATAGCATTGATCCTATGTCTTGCAAGTGAAAACAACTGCAATTTGGATGCCATGCCTCCAGATGTACAAGAACTCGTGAGCTTGTACCAGTTGAGTGCATTTATCACAAAGGTTTGTAAGGAGGCAGCGTCAACAAAAGAAGAATGGGAAGAGCAGTGCAAGTTTTGGCCAACATCATATCATCCACCAACATACAATATTGATGGTATCACTGGATTTAATGAAGAGGACACACAGTCAGTTTTTGGCTTCATGAGACTTGCAATAGAATTAGCACAGTCGTCTAACAAATCGGTACTAAATGCTGCAGTGATAGTAGATCCTTCAGTTAAGCAAGTAATTGCAAGTGCCTACGACCATCATATTCACTCCGAGAATGCTTCCAAAAGCAATGCCAATGGGGAAACAGGCTTCGAAAAGCGGCTGGAATCTATTGGTTCTCATCTTGATTCCAATGGAGCCACTATCCATGGCACTTCGCTGTCGATTGGCTCTTTGAAAGCACTGAAACAATCATGTTCCGATGTTTTGTGTTTGTACCCGTGGCGATGGGTCGAGCAACAGTCAGAAAATTCATCAAATTCTTGTCAGCACCCTCTAAGGCATGCTGCTATTGCAGCTATTGAATCTTCTGCTGCTCGGGACAGGCGCCTATACCCCAGTTCTGGAAGCGTTGGTGATAAATCAATTGAAATGGAGCTTATGGGGCCTCCGACCAACCTAGCAAAAAGACAGAAGACTGATCTTGAAAATGTCAAAAGTGAGAAGGTAGGCACTGATATTGGAGGCATAAATTCTAAATCAGCCAGACCATACCTGTGCACTGGTTATGACATTTATCTTGTGTGGGAGCCATGTACCATTTCACAATCACGTGCACACACCCTAGTTCTTCTACCTTCACCATTCGTCCTTCATCATCACTCCCATCGTCGCCATTTGTCCTTCACCCTCACGCCCTCCGTCACCGTTCTTCCCTCACTGTCACTGTTCGTCCGTTCCTCGGTCGCCTTTTGTCCCACCCTTCCGTCGTCATTCTGTTCATCCTTTTCCTTCCCTTCCATCACTGCTTCCTCATCCTTCACTCATCGCTGCTGCCGCTTCCTTCCCCTCCGTTGCAGCTACCTCTTACCGAGAAACAGACCTTGTCGACTTCCTTTTCGCCGATCTACCTTCGGTAGAGACATAGGTCGTGACTTCGTGAGCCTCTCCCTCCCATCAGACATAGATCCGACGGGCCCTTTTCAATATCAACAGCATAGCTTCGGCAGCTACTCCTTCCCAAGCCCATATCTCATCTTATCAACATCAGATAATAACTTGCAAACAAAAATGGAAATTGATCCTACAAAAGATGATAGCACTCAGGCCAAAAGTGAGGAGAAGGTAGGCACTGATATTGAAGGCATAAATTCTAAATCAGTCAGACCATACATGTGCACTGGTTATGACAATTATCTTAATAGAGAAAAGGGTTTGAGGGAAGGTTGGGGTTTGAGGGTGAAGGTTTCCACTACCGACGCTGTGATTGCCGTGTCGTGGAGTCGACGTCGTTGGGTTGCTGTCGTGGAGAAGCCATTGCCACCATCGAGCAAGGAGGCCGTTTGGGCACATGGAGTGGCGTTCCGTCTCTGTTTGGCTTCTGTGAGGCGTTCTTGGTCGGCGATTCCGATCTACACCGTGAACTCAGTTGTCTTCTTCCCTATCTTGGGTGGTTTGCTTTTCCTCTCGATTTATTGTTTGGAATAG

Protein sequence

YLDLQAYAMWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHLEDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCKFWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVIASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCSDVLCLYPWRWVEQQSENSSNSCQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEMELMGPPTNLAKRQKTDLENVKSEKVGTDIGGINSKSARPYLCTGYDIYLVWEPCTISQSRAHTLVLLPSPFVLHHHSHRRHLSFTLTPSVTVLPSLSLFVRSSVAFCPTLPSSFCSSFSFPSITASSSFTHRCCRFLPLRCSYLLPRNRPCRLPFRRSTFGRDIGRDFVSLSLPSDIDPTGPFQYQQHSFGSYSFPSPYLILSTSDNNLQTKMEIDPTKDDSTQAKSEEKVGTDIEGINSKSVRPYMCTGYDNYLNREKGLREGWGLRVKVSTTDAVIAVSWSRRRWVAVVEKPLPPSSKEAVWAHGVAFRLCLASVRRSWSAIPIYTVNSVVFFPILGGLLFLSIYCLE
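
The relationship between the CDS and the protein sequence above can be checked with a short translation sketch. It uses the standard genetic code and a hypothetical helper `translate`. The `offset` parameter is an assumption introduced here because the listed protein appears to start at the second nucleotide of the listed CDS (the first codon read is TAT, giving Y). Only the first 34 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str, offset: int = 0) -> str:
    """Translate a DNA string, starting `offset` bases in.

    Trailing bases that do not fill a codon are ignored; codons with
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = nucleotides.upper()[offset:]
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 34 nucleotides of the CDS listed above.
cds_prefix = "GTATCTGGATTTACAAGCATATGCTATGTGGAAA"
protein_prefix = translate(cds_prefix, offset=1)
print(protein_prefix)
```

With `offset=1` the output matches the first eleven residues of the listed protein (YLDLQAYAMWK). With `offset=0` the same nucleotides read as a different frame.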
Homology
BLAST of Sgr016824 vs. NCBI nr
Match: XP_022141843.1 (probable inactive tRNA-specific adenosine deaminase-like protein 3 [Momordica charantia])

HSP 1 Score: 582.0 bits (1499), Expect = 6.3e-162
Identity = 299/355 (84.23%), Positives = 320/355 (90.14%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDK PI P QQ TVNV AAVVEPKL+NNL+RRLNQIAPLENLRHVKRVQKKHL
Sbjct: 1   MCQIIYIPDKLPIPPGQQPTVNVFAAVVEPKLINNLVRRLNQIAPLENLRHVKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKTQIALILCLA ENNCNLD MP DVQELV+ YQLSAFITKVCKEAASTKEEWEEQCK
Sbjct: 61  EDGKTQIALILCLARENNCNLDDMPSDVQELVTSYQLSAFITKVCKEAASTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEE+TQSVFG MRLAIELAQSS+K+V+NAAVI+DPS KQVI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEETQSVFGLMRLAIELAQSSHKTVVNAAVIIDPSDKQVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           A+A DHHI SENA  SNA+GETGF+KRLESIGSH DSNGATIH T  S  SLK LKQSCS
Sbjct: 181 ATACDHHIFSENA-MSNASGETGFKKRLESIGSHADSNGATIHETLPSSASLKVLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNSCQ-HPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
           +V CLYPWRWVEQQ ++SSNSC  HPLRHAAIAAIESSAARDRRL+P+SG+VGDKSIEME
Sbjct: 241 NVSCLYPWRWVEQQLQHSSNSCSWHPLRHAAIAAIESSAARDRRLFPTSGTVGDKSIEME 300

Query: 309 LMGPPTNLAKRQKTDLENVKS--EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
            MGPPTNLAKRQKTDL+N KS  EK GTDI GI+SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 PMGPPTNLAKRQKTDLDNAKSEEEKRGTDIEGIDSKSGRPYLCTGYDIYLVWEPC 354
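
The percentages in the HSP summary follow directly from the counts: the number of identical (or positively scoring) positions divided by the alignment length. A minimal sketch, with the hypothetical helper `percent`, reproducing the figures reported for HSP 1 of the first match above:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches / alignment_length as a percentage, to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP summary line: 299/355 and 320/355.
identity = percent(299, 355)
positives = percent(320, 355)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```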

BLAST of Sgr016824 vs. NCBI nr
Match: XP_023523263.1 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 559.3 bits (1440), Expect = 4.4e-155
Identity = 288/354 (81.36%), Positives = 311/354 (87.85%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDKPPI PDQQ TVNV AAVVEPKL+NNL+R LN+IAPLENLRHVKRVQKKHL
Sbjct: 1   MSQIIYIPDKPPIPPDQQPTVNVFAAVVEPKLINNLVRCLNRIAPLENLRHVKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKT IALILCLASEN+ NLDAMP DVQELV+ YQLSAFITKVCKEAA+TKEEWEEQCK
Sbjct: 61  EDGKTLIALILCLASENDSNLDAMPLDVQELVTSYQLSAFITKVCKEAATTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLA+ELAQ SNK V+NAAVIVDPSVKQVI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAVELAQ-SNKLVVNAAVIVDPSVKQVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           ASA DHHI SENAS SNA GE  F  RLES+GSHLDSNGATIHG   S  SLK LKQSCS
Sbjct: 181 ASACDHHISSENASMSNAGGEIDFRNRLESVGSHLDSNGATIHGNLPSSASLKTLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
            V CLYPW+W +QQ ++ SNS C HPLRHAAIAAIESSAARDRRL+P+SG++GDKS+E +
Sbjct: 241 TVSCLYPWQWTDQQLQHLSNSCCWHPLRHAAIAAIESSAARDRRLFPTSGTIGDKSVETD 300

Query: 309 LMGPPTNLAKRQKTDLENVKS-EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
            +GP  NLAKRQKT+LEN KS EK GTDI GI SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 DIGPHCNLAKRQKTNLENAKSEEKEGTDIEGICSKSGRPYLCTGYDIYLVWEPC 353

BLAST of Sgr016824 vs. NCBI nr
Match: XP_022948266.1 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 [Cucurbita moschata])

HSP 1 Score: 554.7 bits (1428), Expect = 1.1e-153
Identity = 284/354 (80.23%), Positives = 310/354 (87.57%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDKPPI PDQQ TVNV AAVVEPKL+NNL+R LN+IAPLENLRH+KRVQKKHL
Sbjct: 1   MSQIIYIPDKPPIPPDQQPTVNVFAAVVEPKLINNLVRCLNRIAPLENLRHLKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKT IALILCLASEN+ N+DAMP DV+ELV+ YQLSAFITKVCKEAA+TKEEWEEQCK
Sbjct: 61  EDGKTLIALILCLASENDSNMDAMPQDVRELVTSYQLSAFITKVCKEAATTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLA+ELAQ SNK V+NAAVIVDPSVKQVI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAVELAQ-SNKLVVNAAVIVDPSVKQVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           ASA DHHI SENAS SNA GE  F  RLES+GSHLDSNGATIHG   S  SLK LKQSCS
Sbjct: 181 ASACDHHISSENASTSNAGGEIDFRNRLESVGSHLDSNGATIHGNLPSSASLKTLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
            V CLYPW+W +QQ ++ SNS C HPLRHAAIAAIESSAARDR L+P+SG++GDKS+E +
Sbjct: 241 TVSCLYPWQWTDQQLQHLSNSCCWHPLRHAAIAAIESSAARDRHLFPTSGTIGDKSVETD 300

Query: 309 LMGPPTNLAKRQKTDLENVKS-EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
            +GP  NLAKRQKT+LEN KS EK GTDI GI SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 DIGPHCNLAKRQKTNLENAKSEEKEGTDIEGICSKSGRPYLCTGYDIYLVWEPC 353

BLAST of Sgr016824 vs. NCBI nr
Match: KAG6607517.1 (tRNA-specific adenosine deaminase TAD3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 551.6 bits (1420), Expect = 9.2e-153
Identity = 284/354 (80.23%), Positives = 308/354 (87.01%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDKPPI PDQQ TVNV AAVVEPKL+NNL+R LN+IAPLENLRH+KRVQKKHL
Sbjct: 1   MSQIIYIPDKPPIPPDQQPTVNVFAAVVEPKLINNLVRCLNRIAPLENLRHLKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKT IALILCLASEN+ NLDAMP DVQELV+ YQLSAFITKVCKEAA+TKEEWEEQCK
Sbjct: 61  EDGKTLIALILCLASENDSNLDAMPQDVQELVTSYQLSAFITKVCKEAATTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFN EDTQSVFGFMRLA+ELAQ SNK V+NAAVIVDPS KQVI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNGEDTQSVFGFMRLAVELAQ-SNKLVVNAAVIVDPSAKQVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           ASA DHHI SENAS SNA GE  F  RLES+ SHLDSNGATIHG   S  SLK LKQSCS
Sbjct: 181 ASACDHHISSENASMSNAGGEIDFRNRLESVCSHLDSNGATIHGNLPSSASLKTLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
            V CLYPW+W +QQ ++ SNS C HPLRHAAIAAIESSAARDRRL+P+SG++GDKS+E +
Sbjct: 241 TVSCLYPWQWTDQQLQHLSNSCCWHPLRHAAIAAIESSAARDRRLFPTSGTIGDKSVETD 300

Query: 309 LMGPPTNLAKRQKTDLENVKS-EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
            +GP  NLAKRQKT+LEN KS EK GTDI GI SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 DLGPHCNLAKRQKTNLENAKSEEKEGTDIEGICSKSGRPYLCTGYDIYLVWEPC 353
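
The percentages on each HSP summary line are the count of matching columns divided by the alignment length. The following is a minimal sketch, not part of any BLAST tool, that recomputes them from the raw counts on such a line; the names HSP_STATS and hsp_percentages are hypothetical and chosen only for illustration.

```python
import re

# Matches the per-HSP summary line; "Posi?tives" tolerates a missing "i" in the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


# Summary line of the KAG6607517.1 hit shown above.
line = "Identity = 284/354 (80.23%), Positives = 308/354 (87.01%), Query Frame = 0"
print(hsp_percentages(line))  # (80.23, 87.01)
```

Recomputing from the counts rather than trusting the printed percentage is a simple consistency check when records like these are parsed in bulk.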

BLAST of Sgr016824 vs. NCBI nr
Match: XP_022973452.1 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 [Cucurbita maxima])

HSP 1 Score: 544.7 bits (1402), Expect = 1.1e-150
Identity = 279/354 (78.81%), Positives = 309/354 (87.29%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDKPPI PDQQ TVNV AAVVEPKL+NNL+R LN+IAPLENLRHVKRVQKKHL
Sbjct: 1   MSQIIYIPDKPPIPPDQQPTVNVFAAVVEPKLINNLVRCLNRIAPLENLRHVKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKT IALILCLASEN+ ++DAMP DVQE+V+ YQLSAFITKVCKEAA+TKEEWEEQCK
Sbjct: 61  EDGKTLIALILCLASENDSDMDAMPQDVQEIVTSYQLSAFITKVCKEAATTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLA+ELAQ SNK ++NAAVIVDPS K+VI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAVELAQ-SNKLIVNAAVIVDPSAKKVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           ASA DHH+ SENAS SNA G+  F   LES+GSHLDSNGATIHG   S  SLK LKQSCS
Sbjct: 181 ASACDHHVSSENASMSNAGGKIDFRNWLESVGSHLDSNGATIHGNLPSSASLKTLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
            V CLYPW+W +QQ ++ SNS C HPLRHAAIAAIESSAARDRRL+P+SG++GDKS+E +
Sbjct: 241 TVSCLYPWQWTDQQLQHLSNSCCWHPLRHAAIAAIESSAARDRRLFPTSGTIGDKSVETD 300

Query: 309 LMGPPTNLAKRQKTDLENVKS-EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
           + GP  NLAKRQKT+LEN KS EK GTDI GI SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 I-GPFCNLAKRQKTNLENAKSEEKEGTDIEGICSKSGRPYLCTGYDIYLVWEPC 352

BLAST of Sgr016824 vs. ExPASy Swiss-Prot
Match: F4KH86 (tRNA-specific adenosine deaminase TAD3 OS=Arabidopsis thaliana OX=3702 GN=TAD3 PE=1 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 1.4e-82
Identity = 178/364 (48.90%), Positives = 238/364 (65.38%), Query Frame = 0

Query: 10  WKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHLE 69
           W+IIHIP+KP + PD Q TV V A++++P+  N ++R L +IAPLE+LRHVKRV+KK L 
Sbjct: 6   WEIIHIPEKPSLSPDHQPTVKVYASLIKPRFANTIVRHLCKIAPLEDLRHVKRVKKKILP 65

Query: 70  D-GKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 129
           D G+TQ+ +ILCLA E+N  L  MPPDVQ LV  Y+LS FIT+VCK AA +KEEWEEQ K
Sbjct: 66  DCGETQLTVILCLAPEHNDQLSDMPPDVQRLVDPYELSPFITQVCKYAAVSKEEWEEQSK 125

Query: 130 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 189
            WPTS+HPPTYNIDGI GF+EE+TQS+  FMR+ I++A S +  ++NAAVIVDPSV+++I
Sbjct: 126 IWPTSFHPPTYNIDGIGGFSEEETQSICKFMRVVIDMAVSGHTPLVNAAVIVDPSVRRII 185

Query: 190 ASAYDHHIHSENASKSNANGET-GFEKRLE-SIGSHLDSNGATIHGTSLSIGSLKALKQS 249
           AS  D  +++ +A +   + ET  FE+  E  +   L+             GSL AL   
Sbjct: 186 ASETD-QVYASSAPRDMTSAETRPFEETGEICLNDTLEKQN----------GSLSALS-- 245

Query: 250 CSDVLCLYPWRWVEQQSENSSNSCQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEM 309
                CL PW+W  Q  +  + S  HPLRHA++ AIESS+ARDR L+P+   + D+    
Sbjct: 246 -----CLNPWQWSLQPHDTENCSQWHPLRHASMVAIESSSARDRNLFPNPSKIFDQD--- 305

Query: 310 ELMGPPTNL---AKRQKT-----DLENVKSEKVGTDIGGINSKSARPYLCTGYDIYLVWE 363
               PP+N    AK+QKT     D++N   E+   D         RPYLCTGYDI+L+ E
Sbjct: 306 --HVPPSNTDSPAKKQKTSSQSPDVQNDSREETVRD-----PSMERPYLCTGYDIFLLLE 341

BLAST of Sgr016824 vs. ExPASy Swiss-Prot
Match: Q8JFW4 (Probable inactive tRNA-specific adenosine deaminase-like protein 3 OS=Danio rerio OX=7955 GN=adat3 PE=2 SV=2)

HSP 1 Score: 74.3 bits (181), Expect = 5.6e-12
Identity = 54/189 (28.57%), Positives = 95/189 (50.26%), Query Frame = 0

Query: 20  PILPDQQ-----LTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHLEDGKTQ 79
           P+L D+Q     L     A ++E +  + L++ L+ I PL NL+H+KRV+    +D    
Sbjct: 22  PVLSDEQSQDPELLPAYAAPILERRETSRLVKELSLIHPLPNLQHIKRVRPCKHKDSPHP 81

Query: 80  IALILCLASENNCNLDAMPPDVQELVSLYQLSA------FITKVCKEAASTKEEWEEQCK 139
           + +I+CL S+  C  D     +  L+     ++      FI ++      T+ ++E+  K
Sbjct: 82  LEVIVCLVSDVQCT-DPKKVTLSHLLHTQCFNSNGLGDPFIVQIPANPPLTRPQFEKASK 141

Query: 140 FWPTSYHPPTYNIDGITG--FNEEDTQSVFGFMRLAIELAQSSNKSVLNA--AVIVDPSV 194
            WPTS+H        + G  F       +  +M +A++ A+S  +  ++A  AVIVDP  
Sbjct: 142 HWPTSFHEDKLVTFALKGQLFTAHQKTKMREYMCVAVKAAKSGRELGMDAVGAVIVDPKT 201

BLAST of Sgr016824 vs. ExPASy TrEMBL
Match: A0A6J1CLQ3 (probable inactive tRNA-specific adenosine deaminase-like protein 3 OS=Momordica charantia OX=3673 GN=LOC111012110 PE=4 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 3.1e-162
Identity = 299/355 (84.23%), Positives = 320/355 (90.14%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDK PI P QQ TVNV AAVVEPKL+NNL+RRLNQIAPLENLRHVKRVQKKHL
Sbjct: 1   MCQIIYIPDKLPIPPGQQPTVNVFAAVVEPKLINNLVRRLNQIAPLENLRHVKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKTQIALILCLA ENNCNLD MP DVQELV+ YQLSAFITKVCKEAASTKEEWEEQCK
Sbjct: 61  EDGKTQIALILCLARENNCNLDDMPSDVQELVTSYQLSAFITKVCKEAASTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEE+TQSVFG MRLAIELAQSS+K+V+NAAVI+DPS KQVI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEETQSVFGLMRLAIELAQSSHKTVVNAAVIIDPSDKQVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           A+A DHHI SENA  SNA+GETGF+KRLESIGSH DSNGATIH T  S  SLK LKQSCS
Sbjct: 181 ATACDHHIFSENA-MSNASGETGFKKRLESIGSHADSNGATIHETLPSSASLKVLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNSCQ-HPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
           +V CLYPWRWVEQQ ++SSNSC  HPLRHAAIAAIESSAARDRRL+P+SG+VGDKSIEME
Sbjct: 241 NVSCLYPWRWVEQQLQHSSNSCSWHPLRHAAIAAIESSAARDRRLFPTSGTVGDKSIEME 300

Query: 309 LMGPPTNLAKRQKTDLENVKS--EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
            MGPPTNLAKRQKTDL+N KS  EK GTDI GI+SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 PMGPPTNLAKRQKTDLDNAKSEEEKRGTDIEGIDSKSGRPYLCTGYDIYLVWEPC 354

BLAST of Sgr016824 vs. ExPASy TrEMBL
Match: A0A6J1G8T4 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451991 PE=4 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 5.3e-154
Identity = 284/354 (80.23%), Positives = 310/354 (87.57%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDKPPI PDQQ TVNV AAVVEPKL+NNL+R LN+IAPLENLRH+KRVQKKHL
Sbjct: 1   MSQIIYIPDKPPIPPDQQPTVNVFAAVVEPKLINNLVRCLNRIAPLENLRHLKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKT IALILCLASEN+ N+DAMP DV+ELV+ YQLSAFITKVCKEAA+TKEEWEEQCK
Sbjct: 61  EDGKTLIALILCLASENDSNMDAMPQDVRELVTSYQLSAFITKVCKEAATTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLA+ELAQ SNK V+NAAVIVDPSVKQVI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAVELAQ-SNKLVVNAAVIVDPSVKQVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           ASA DHHI SENAS SNA GE  F  RLES+GSHLDSNGATIHG   S  SLK LKQSCS
Sbjct: 181 ASACDHHISSENASTSNAGGEIDFRNRLESVGSHLDSNGATIHGNLPSSASLKTLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
            V CLYPW+W +QQ ++ SNS C HPLRHAAIAAIESSAARDR L+P+SG++GDKS+E +
Sbjct: 241 TVSCLYPWQWTDQQLQHLSNSCCWHPLRHAAIAAIESSAARDRHLFPTSGTIGDKSVETD 300

Query: 309 LMGPPTNLAKRQKTDLENVKS-EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
            +GP  NLAKRQKT+LEN KS EK GTDI GI SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 DIGPHCNLAKRQKTNLENAKSEEKEGTDIEGICSKSGRPYLCTGYDIYLVWEPC 353

BLAST of Sgr016824 vs. ExPASy TrEMBL
Match: A0A6J1I7K0 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111471994 PE=4 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 5.4e-151
Identity = 279/354 (78.81%), Positives = 309/354 (87.29%), Query Frame = 0

Query: 9   MWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHL 68
           M +II+IPDKPPI PDQQ TVNV AAVVEPKL+NNL+R LN+IAPLENLRHVKRVQKKHL
Sbjct: 1   MSQIIYIPDKPPIPPDQQPTVNVFAAVVEPKLINNLVRCLNRIAPLENLRHVKRVQKKHL 60

Query: 69  EDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 128
           EDGKT IALILCLASEN+ ++DAMP DVQE+V+ YQLSAFITKVCKEAA+TKEEWEEQCK
Sbjct: 61  EDGKTLIALILCLASENDSDMDAMPQDVQEIVTSYQLSAFITKVCKEAATTKEEWEEQCK 120

Query: 129 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 188
            WPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLA+ELAQ SNK ++NAAVIVDPS K+VI
Sbjct: 121 LWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAVELAQ-SNKLIVNAAVIVDPSAKKVI 180

Query: 189 ASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALKQSCS 248
           ASA DHH+ SENAS SNA G+  F   LES+GSHLDSNGATIHG   S  SLK LKQSCS
Sbjct: 181 ASACDHHVSSENASMSNAGGKIDFRNWLESVGSHLDSNGATIHGNLPSSASLKTLKQSCS 240

Query: 249 DVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEME 308
            V CLYPW+W +QQ ++ SNS C HPLRHAAIAAIESSAARDRRL+P+SG++GDKS+E +
Sbjct: 241 TVSCLYPWQWTDQQLQHLSNSCCWHPLRHAAIAAIESSAARDRRLFPTSGTIGDKSVETD 300

Query: 309 LMGPPTNLAKRQKTDLENVKS-EKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
           + GP  NLAKRQKT+LEN KS EK GTDI GI SKS RPYLCTGYDIYLVWEPC
Sbjct: 301 I-GPFCNLAKRQKTNLENAKSEEKEGTDIEGICSKSGRPYLCTGYDIYLVWEPC 352

BLAST of Sgr016824 vs. ExPASy TrEMBL
Match: A0A1S4E5W3 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103504115 PE=4 SV=1)

HSP 1 Score: 538.1 bits (1385), Expect = 5.1e-149
Identity = 277/362 (76.52%), Positives = 306/362 (84.53%), Query Frame = 0

Query: 5   QAYAMWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQ 64
           QAY+M KII+IPDK P  PDQQ TVNV AAVVEPK++NNL+RRLNQIAPLENLRHVKRVQ
Sbjct: 32  QAYSMSKIIYIPDKLPTPPDQQPTVNVFAAVVEPKIINNLVRRLNQIAPLENLRHVKRVQ 91

Query: 65  KKHLEDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWE 124
           KKHLEDGKTQIALILC+ASENNC+LD + P VQELV+ Y+LSAFITKVCKEAA+TKEEWE
Sbjct: 92  KKHLEDGKTQIALILCVASENNCDLDVLSPCVQELVTSYKLSAFITKVCKEAATTKEEWE 151

Query: 125 EQCKFWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSV 184
           EQCK WPTSYHPPTYNIDGITGFNE DTQS+FGFMRLAIELAQSS+KSV+NAAVIVDPSV
Sbjct: 152 EQCKLWPTSYHPPTYNIDGITGFNEVDTQSIFGFMRLAIELAQSSSKSVVNAAVIVDPSV 211

Query: 185 KQVIASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALK 244
            QVIASA DHHI  ENAS SN NGET FEK  +S+ SH  SNG+ IHGT  S  SL+ LK
Sbjct: 212 TQVIASACDHHISFENASTSNVNGETSFEKSPKSLCSHFGSNGSIIHGTFPSSSSLEKLK 271

Query: 245 QSCSDVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKS 304
           QSC+DV CLYP RWV+Q   +SSNS C HPLRHAAIAAIESSAARDRRL+P+  + GDKS
Sbjct: 272 QSCADVSCLYPLRWVDQPLPHSSNSCCWHPLRHAAIAAIESSAARDRRLFPTLETTGDKS 331

Query: 305 IEMELMGPPTNLAKRQKTDLENV--KSEKVGTDIGGINSKSARPYLCTGYDIYLVWEPCT 364
           +EME MGP T LAKRQK DL+N   K +KVGTD  G  SK  RPYLCT YDIYLVWEPC 
Sbjct: 332 VEMEHMGPLTKLAKRQKIDLDNANSKEKKVGTDGEGTYSKLGRPYLCTDYDIYLVWEPCI 391

BLAST of Sgr016824 vs. ExPASy TrEMBL
Match: A0A1S4E5Y1 (probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504115 PE=4 SV=1)

HSP 1 Score: 537.3 bits (1383), Expect = 8.7e-149
Identity = 277/359 (77.16%), Positives = 304/359 (84.68%), Query Frame = 0

Query: 5   QAYAMWKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQ 64
           QAY+M KII+IPDK P  PDQQ TVNV AAVVEPK++NNL+RRLNQIAPLENLRHVKRVQ
Sbjct: 32  QAYSMSKIIYIPDKLPTPPDQQPTVNVFAAVVEPKIINNLVRRLNQIAPLENLRHVKRVQ 91

Query: 65  KKHLEDGKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWE 124
           KKHLEDGKTQIALILC+ASENNC+LD + P VQELV+ Y+LSAFITKVCKEAA+TKEEWE
Sbjct: 92  KKHLEDGKTQIALILCVASENNCDLDVLSPCVQELVTSYKLSAFITKVCKEAATTKEEWE 151

Query: 125 EQCKFWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSV 184
           EQCK WPTSYHPPTYNIDGITGFNE DTQS+FGFMRLAIELAQSS+KSV+NAAVIVDPSV
Sbjct: 152 EQCKLWPTSYHPPTYNIDGITGFNEVDTQSIFGFMRLAIELAQSSSKSVVNAAVIVDPSV 211

Query: 185 KQVIASAYDHHIHSENASKSNANGETGFEKRLESIGSHLDSNGATIHGTSLSIGSLKALK 244
            QVIASA DHHI  ENAS SN NGET FEK  +S+ SH  SNG+ IHGT  S  SL+ LK
Sbjct: 212 TQVIASACDHHISFENASTSNVNGETSFEKSPKSLCSHFGSNGSIIHGTFPSSSSLEKLK 271

Query: 245 QSCSDVLCLYPWRWVEQQSENSSNS-CQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKS 304
           QSC+DV CLYP RWV+Q   +SSNS C HPLRHAAIAAIESSAARDRRL+P+  + GDKS
Sbjct: 272 QSCADVSCLYPLRWVDQPLPHSSNSCCWHPLRHAAIAAIESSAARDRRLFPTLETTGDKS 331

Query: 305 IEMELMGPPTNLAKRQKTDLENV--KSEKVGTDIGGINSKSARPYLCTGYDIYLVWEPC 361
           +EME MGP T LAKRQK DL+N   K +KVGTD  G  SK  RPYLCT YDIYLVWEPC
Sbjct: 332 VEMEHMGPLTKLAKRQKIDLDNANSKEKKVGTDGEGTYSKLGRPYLCTDYDIYLVWEPC 390

BLAST of Sgr016824 vs. TAIR 10
Match: AT5G24670.1 (Cytidine/deoxycytidylate deaminase family protein )

HSP 1 Score: 308.9 bits (790), Expect = 9.7e-84
Identity = 178/364 (48.90%), Positives = 238/364 (65.38%), Query Frame = 0

Query: 10  WKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHLE 69
           W+IIHIP+KP + PD Q TV V A++++P+  N ++R L +IAPLE+LRHVKRV+KK L 
Sbjct: 6   WEIIHIPEKPSLSPDHQPTVKVYASLIKPRFANTIVRHLCKIAPLEDLRHVKRVKKKILP 65

Query: 70  D-GKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWEEQCK 129
           D G+TQ+ +ILCLA E+N  L  MPPDVQ LV  Y+LS FIT+VCK AA +KEEWEEQ K
Sbjct: 66  DCGETQLTVILCLAPEHNDQLSDMPPDVQRLVDPYELSPFITQVCKYAAVSKEEWEEQSK 125

Query: 130 FWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSVKQVI 189
            WPTS+HPPTYNIDGI GF+EE+TQS+  FMR+ I++A S +  ++NAAVIVDPSV+++I
Sbjct: 126 IWPTSFHPPTYNIDGIGGFSEEETQSICKFMRVVIDMAVSGHTPLVNAAVIVDPSVRRII 185

Query: 190 ASAYDHHIHSENASKSNANGET-GFEKRLE-SIGSHLDSNGATIHGTSLSIGSLKALKQS 249
           AS  D  +++ +A +   + ET  FE+  E  +   L+             GSL AL   
Sbjct: 186 ASETD-QVYASSAPRDMTSAETRPFEETGEICLNDTLEKQN----------GSLSALS-- 245

Query: 250 CSDVLCLYPWRWVEQQSENSSNSCQHPLRHAAIAAIESSAARDRRLYPSSGSVGDKSIEM 309
                CL PW+W  Q  +  + S  HPLRHA++ AIESS+ARDR L+P+   + D+    
Sbjct: 246 -----CLNPWQWSLQPHDTENCSQWHPLRHASMVAIESSSARDRNLFPNPSKIFDQD--- 305

Query: 310 ELMGPPTNL---AKRQKT-----DLENVKSEKVGTDIGGINSKSARPYLCTGYDIYLVWE 363
               PP+N    AK+QKT     D++N   E+   D         RPYLCTGYDI+L+ E
Sbjct: 306 --HVPPSNTDSPAKKQKTSSQSPDVQNDSREETVRD-----PSMERPYLCTGYDIFLLLE 341

BLAST of Sgr016824 vs. TAIR 10
Match: AT5G24670.2 (Cytidine/deoxycytidylate deaminase family protein )

HSP 1 Score: 307.4 bits (786), Expect = 2.8e-83
Identity = 178/368 (48.37%), Positives = 238/368 (64.67%), Query Frame = 0

Query: 10  WKIIHIPDKPPILPDQQLTVNVIAAVVEPKLVNNLIRRLNQIAPLENLRHVKRVQKKHLE 69
           W+IIHIP+KP + PD Q TV V A++++P+  N ++R L +IAPLE+LRHVKRV+KK L 
Sbjct: 6   WEIIHIPEKPSLSPDHQPTVKVYASLIKPRFANTIVRHLCKIAPLEDLRHVKRVKKKILP 65

Query: 70  D-----GKTQIALILCLASENNCNLDAMPPDVQELVSLYQLSAFITKVCKEAASTKEEWE 129
           D     G+TQ+ +ILCLA E+N  L  MPPDVQ LV  Y+LS FIT+VCK AA +KEEWE
Sbjct: 66  DCAYWTGETQLTVILCLAPEHNDQLSDMPPDVQRLVDPYELSPFITQVCKYAAVSKEEWE 125

Query: 130 EQCKFWPTSYHPPTYNIDGITGFNEEDTQSVFGFMRLAIELAQSSNKSVLNAAVIVDPSV 189
           EQ K WPTS+HPPTYNIDGI GF+EE+TQS+  FMR+ I++A S +  ++NAAVIVDPSV
Sbjct: 126 EQSKIWPTSFHPPTYNIDGIGGFSEEETQSICKFMRVVIDMAVSGHTPLVNAAVIVDPSV 185

Query: 190 KQVIASAYDHHIHSENASKSNANGET-GFEKRLE-SIGSHLDSNGATIHGTSLSIGSLKA 249
           +++IAS  D  +++ +A +   + ET  FE+  E  +   L+             GSL A
Sbjct: 186 RRIIASETD-QVYASSAPRDMTSAETRPFEETGEICLNDTLEKQN----------GSLSA 245

Query: 250 LKQSCSDVLCLYPWRWVEQQSENSSNSCQHPLRHAAIAAIESSAARDRRLYPSSGSVGDK 309
           L        CL PW+W  Q  +  + S  HPLRHA++ AIESS+ARDR L+P+   + D+
Sbjct: 246 LS-------CLNPWQWSLQPHDTENCSQWHPLRHASMVAIESSSARDRNLFPNPSKIFDQ 305

Query: 310 SIEMELMGPPTNL---AKRQKT-----DLENVKSEKVGTDIGGINSKSARPYLCTGYDIY 363
                   PP+N    AK+QKT     D++N   E+   D         RPYLCTGYDI+
Sbjct: 306 D-----HVPPSNTDSPAKKQKTSSQSPDVQNDSREETVRD-----PSMERPYLCTGYDIF 345

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_022141843.1  6.3e-162  84.23     probable inactive tRNA-specific adenosine deaminase-like protein 3 [Momordica ch...
XP_023523263.1  4.4e-155  81.36     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 [C...
XP_022948266.1  1.1e-153  80.23     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 [C...
KAG6607517.1    9.2e-153  80.23     tRNA-specific adenosine deaminase TAD3, partial [Cucurbita argyrosperma subsp. s...
XP_022973452.1  1.1e-150  78.81     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 [C...

ExPASy Swiss-Prot
Match Name  E-value  Identity  Description
F4KH86      1.4e-82  48.90     tRNA-specific adenosine deaminase TAD3 OS=Arabidopsis thaliana OX=3702 GN=TAD3 P...
Q8JFW4      5.6e-12  28.57     Probable inactive tRNA-specific adenosine deaminase-like protein 3 OS=Danio reri...

ExPASy TrEMBL
Match Name  E-value   Identity  Description
A0A6J1CLQ3  3.1e-162  84.23     probable inactive tRNA-specific adenosine deaminase-like protein 3 OS=Momordica ...
A0A6J1G8T4  5.3e-154  80.23     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 OS...
A0A6J1I7K0  5.4e-151  78.81     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 OS...
A0A1S4E5W3  5.1e-149  76.52     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X4 OS...
A0A1S4E5Y1  8.7e-149  77.16     probable inactive tRNA-specific adenosine deaminase-like protein 3 isoform X1 OS...

TAIR 10
Match Name   E-value  Identity  Description
AT5G24670.1  9.7e-84  48.90     Cytidine/deoxycytidylate deaminase family protein
AT5G24670.2  2.8e-83  48.37     Cytidine/deoxycytidylate deaminase family protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description          Source       Source Term      Source Description                                                 Alignment
None       No IPR available         GENE3D       3.40.140.10      Cytidine Deaminase, domain 2                                       coord: 150..370; e-value: 8.5E-9; score: 37.1
None       No IPR available         PANTHER      PTHR11079        CYTOSINE DEAMINASE FAMILY MEMBER                                   coord: 28..361
None       No IPR available         PANTHER      PTHR11079:SF156  INACTIVE TRNA-SPECIFIC ADENOSINE DEAMINASE-LIKE PROTEIN 3-RELATED  coord: 28..361
IPR016193  Cytidine deaminase-like  SUPERFAMILY  53927            Cytidine deaminase-like                                            coord: 128..361

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr016824.1   Sgr016824.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0071704      organic substance metabolic process
molecular_function  GO:0052717      tRNA-specific adenosine-34 deaminase activity
molecular_function  GO:0003824      catalytic activity