CsaV3_4G027590 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_4G027590
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: RNA exonuclease 4
Location: chr4: 16842382 .. 16865265 (-)
RNA-Seq Expression: CsaV3_4G027590
Synteny: CsaV3_4G027590
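
The Location field above packs the chromosome, start, end, and strand into one string. A minimal Python sketch for reading it follows. It assumes the coordinates are 1-based and inclusive (as in GFF-style annotation); this record does not state the convention, and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for strings shaped like "chr4: 16842382 .. 16865265 (-)".
_LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr4: 16842382 .. 16865265 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 22884 bp on the minus strand of chr4.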
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCAGACTCGTGATATTTAATATTATGTAATTTTGGGTTTCACTTAAGTAATGTTATATTGTTTAGTGGTTGAATGTCCTTTATGTTTTCAATGTGTTATCTACTTTAAGTGAGAGGTAAGTTTAGGGTCGATAGGTGTCGTTGAGGTAAGTTGCGATATCGATTGACTAATTGCATCACCTCTTAGGTTAAGAGTTGGTGGCTTGGGAGGGGGCGTGACAAAATATCCTTAGATGGAGAAGGATATTTGTTTTGTTGGCTTCACGGTCTCTTTCAGCTAGGAGAAGGTGGTGTGGGAGGGGTGTGATAGAAATAGAACAAGAACTTTAACAAATATAAATAACATTCTAGCACTAGATCAATATCTTCAATATCTTTTGACGATATGTATATATATTTTATATATCCATTTTTGTCGATTCATCCTTAAAACGATATATCCATGATAATGATTATTTTTCGTGGAGAAAATCTTCACCCTCGTTGTTTCTATAGCATCTCATTTTGGCGATCGATATGTCGCTACTTTTCAATAGACTGCACCCTGTAGTGAAAGTTTCCCGCAAAATCACCCAGAAGTCGGTTCAGCGATACCTACACTAGGGGCTTCACTGCGGACGCATAAGAAAACAGATCCGACAAGCACGATTGTCGGCGCTTCGATAGTGACCCACCAGAGCCTTTTTTCTCTCCTTCTCGACGAGTTTTGGCATTGACTCTCTAGATTTGGACTGGTTCGAAGGCATCGATTAAGGTTTATGGTTTATTTTGGGTAGAATACCCCTCTCAACGCTCTCTCTCATGAACTGAGTTCTAGCCTCAAGCATTGTCTTAATTGTTTAGAATTCTAGGAAGTTGACATACTGCAAGTTTAGCAATTTCCACATCCTTTCTGAACGATAAATTTTGGCCTAATTATGTTTTGATTGTTTTAATTTAGAAAACGTTTGAAAGAATATTTTGATTGTTTGAAAATGTGTTTTGTATGATCTGAAGTTGATCGAAATTCTTTTCCGAAGTATGTTTATTGTTTTGGATAATTTGAAATAAAATGTTTTGATTGATTTTGAATTGGAGGTACGTTTGTTTTAAATTTTTTATACATGCAATTGTTGAAAAGATAGAATGGTTTTTTTTTTCTTTTTTATGAATCTTTGATTGAATTTGCTAATTTGGAAACAGATGTTTCGTTTATGAATGTTTGAAAGAAAATGAATTTAAAAGGATATGGGCATTGCTGATTGAGTATTTTTGTATTCATTTTTTGCTTTCTTCGATACTTTTTTTGTGATAAAAGATTAAGTGATGGTTGATACTTTTGAAAGAAGTAAAGTTTGACATTCTAGCATGAGGCATCAAAACTGTTCTTTTTTTTCCTTTTAATTAAAGATTTTTAGTTTATCTTAGTAATTGTTTGCAATAGCTTCTGCTTGTTAAAGTTTCTAATGGTTATGTTTTCTACATCATTGTATACTAGGTCTATTGAAAAGTACAGGCGGGTACCCTAAATTCCAGAGGAGAGGAACACACACTAACACTTCTGGATTGACATGGACTCTGATCATGACCCTCTCAAGACACCAAGCCTAAGGTCACATTCTCGTCCTTACTATGTTATTTTTAGGTTTCTTAATTGCCAATATTCTTTCCTTGTTTTTGTTTGTTTGGTTTTTACAGATCGACATGAAGTAAAAACAACATACCTGTTGTCCTCCTCTGTTTTTTGGTCTTTAATGTCCATAAATTGCTTCAGAAGTGTCATTCTTAGTTTGCTCTGTGCTCACCGCATGACTCCGTCAGAAGTGTCATTCTTAGTGGAACTCAAGAGAATCAATGTTGTTTATTTCATGTTTTCTCACATCCATGCTTGACATTACCTCTTTGTATTTGATGAAGACGATAATTTCGTCAACATCAACCATACATTTGGGCTTTCAAAGAGGTTTTTGATCTTGTGATTTATACCAATGCAGTGATCCTCTTCTTTTTTCTTTCATGTTG
CTTTGGCCTTCTGTATCAATTTTTTTTTTTTTTAACTATTCTCTTGGTAACATCTTAGTTAGTTGGAATCCCTTCCTTGATAAGGAGTGTTTTGGAGAGCTCTTTTGTAATCCCTTACCTTTTGGCTTTTTTTTGTTGTTAATTCTAGTGGTGTACTATCCAATAGGGGTAGATAAAGTTTTAGGACTACCAGTTGATAAATGGGACATGTAATCCTTCAAACATTAGGAAATGCCGGTGGCAGTTTGTTGAACTGGCCTAAAAATGCTTACAAGAAATAAGGATGGTATTATGAGAATATAAACCTAAAAGTACGAAATGACTGCTCAGGTTCTTTTCCAGCCATGATCAACCTCCCATCCTCTTCAGCCTCGAGAAGTATGGTGGACTGATCCTCTTTTCTAATGGTGATTATCATATTGGAGATATTGTCCGAATTAGTGGCAGTCTGGCCATTGTCTACCAAGGCTATATAAAAAATAAGATTGCTAACAGTTTGACAGTTGACTCGTATCATGGAGAAGCAACCCTGTCTGAAAAGCAAAGCTTGTTATCTCATACTTCTAACCTTTATCTAATGGAGACAAGGATCTTGGAGTAAATCTTGGAGAGATTTTTTATTAATTGTGTGTAATGAGAGAGATAAGATATCCCTTAAATAGATCCAAGAGATAAGGTTTGTTCACACCTAACTACCTAGTTAGCTAACTAACCTAGGCCCCCTTCAATAGACCGAAGGGGAAAAGTGCTGGAACACACTTAAACTAAAATTAAAACATGAATTTTTAAAGACAATTCAACAAAAGGTAGAATTCCCAAAATGCCCCTCTATCATATTTCCTCCCCTTACAAAGAATAACTCGTCCCGAGTTATGTTGTGAGCTTGAATTCATCTGGAGCATAGTAAGGGGTCAGATCAGCTACATTGAAAGTGGAACTGATGTTTAGGGTCTTTGGTAGGTAAAGCTGGTAGGCATTTTCTCCAATTTTGTTGATAACTCAAAATGGTCCATGTTTTTTTGGACAGTTTGTAGTGAACGCTTGCTGGAAGTCTTGTTTTTTTTAGAAACACCATGACCTAAGAAGGATGGACACTTTACTTTGGATAGCATGTCTGTGTCCTACACATGTCGGACACTTGGACACTCCGATACTTGGTGGACACCTATCGAACACTTGTTAGTGCAGTAAATGCATTGGACATACATAGAACACTTGTTAAGTAGACTAAAAAGACACATATATGACAATAATAATAACCTTTGAGTATAGAATACATCAAGCTTTTTGAGTATAGAATACATCAAGCTTTTTGAGTGTAGGATACATCAAGCTAAGTTTTTTAAGCATATAAATGCATCAATGCATTTACTATGATTTTTTTTTACTATAAAGATGATATATATATATTTTAAAAGTACTTTTTAATAAACGTGTCTTTGTCGTGTCGTGTCCTAGATTTTTAAAAAATGGGGTGTCATTGTGTCCATGTCTCATATCCGTGTTCGTGCTTCTTAAACCATGACTAGATCTTCTGGTTGGAAACTTCGAAATCTTTTATGTTCATCAGCTTTAGTTTTGTAGGATGCATTTACTTTTCTAGGTAGTCTATTACTTCTTGAGGTAGTTGTTGTATTCTGTCCATCATATGCTCCGCTTCTTCACTAAGATCAACTGTAGATGGAAGATCAGTAAAATCAAAAGTGAGATGGGGGAGTTTAGCATAGATTATCTAAAATGGACACTTCCTTGTTGATTTGTTTTATGTTATTGTAAACAAATTTGGCTTGAGCTAGGCAAGTGTCCCATTACCTTGGTTTGTCGCTTGAAAGGCACCTTGGTTTGTCGCTTGAAAGGCATCTAGGAAACCGCATCTAAGTAGGTTTCCCAGTGTTATGTTAGTGACCTCAGTTTGGGCATCCATTTGTGGGTGGCTAGTTGTGCTGAATTTCAAGCTTGTTCCAAACTTTTTCCCAAGACTTTTCCAAAAGAAGCTCATA
AATTTGAATCTCTGTTTGAGACTATAGATTTTAGAATTCCATGTATCTTGACAATTTCTCTAAAAAAAAGGATTAGCAATGTTTAGAGCATCTGAAGTTTTCTTGCAAGCAATGAAATATGCCATTTTACTGAATTTGTCCACCACGACAAAATGAATTAAAGCACATTTTGTAAGTGTGATGAAAGTTCAATTGGAGTTGGTTTTTTATTTCCTCGGTAGGTTTCAATATAGGCCAGATGACAGATTGTATTAATATTAGATGGACTTCAAAATCAAGCTGGGTTTGGAGATGATTTTTGGGTAAAGATGTGTGGTTCATGGGCTGAGAAGACCAAAATTTTGTTTGTAGATTGGGTCATCTGGGTTGGGTTAAAAGAAGTAAAATGGATGGATGAATTGTGAAAATAGTTATGGAACTGGGTCTGGTAATGTGCCTTTTGGTTGGGTCAGAAGGAAAAATTTGGGAACAACTTTTACAAAATCGTTCATCTCACACTGTTGCTTCATCTTCTTCAATCTTGTGAACGATTTTGAAGGGGATGTCAGCTGTCTTCCACTTATTTCCCTTGTCAGTCAAGTAATCTTCTTCATCTTCCTTCTTGGTCAAAATTTTCTGTTCGTTCTCTTTCGCTTTGCCAATGAAGGATCCAAAAGTGTTTGATTTGATATACGGCATCTCCCTTGTCGTTTGGTAGCCTACTGGAATTATGTTTTCACTTTTCTTCAAATAATTTTGTTTGTGGGTTTAGGGTTTAGGGTTTAGTCTCTCTCTTTTGATAGTTCTCTTGATAGATGTTTTTATTTGATTAAAAGTAGAGATGAAACACAATCTTATGTGGAAACCCTCGTAAAAGGAGAAAAACCACGGTGCTGATAATTTTATTTTCTAATATTCATAAAGGTATAAAGGGGAAATATTTTGTTAGGATTCTTAACCTGAAAAGCAGAAAAATCCCCAACAAGACAAACAAAGACAGCACTCTGATATTCTATATTGATAATAAAGATCAAAATTTACACAACCTGATATCAAAAGTCTAGAAAACAATAATAACAATAATGTCCAGCACCTTTGAGAAGGCTGGAAACTTCTCCTATGAAGGCTATGAAAGCTTTCACCAAAAAGACTATAAAATCTAAAAACAGCAATCCCGAAACATTCTCTATAAATACCCATTACCCACAGTGGGCCTCTATGATGTTTTCTTTTCTCTCTTTGCTCTCTGAGTTACCATCCGAGCTTTCAATTCCCTTTTCTGCCCCTTCTTTTAGAAGTATGCAGTATTGGGGGTCTAACATATTTATAAACAACACGAGTCTAATTAATAGAATAAAAAATAGGGTAAAGTAAACTACGTAACTACCCTTGGGCTTTCAACATGACCTAAGCCCATTATTTCTAACACTCCCCTCAATTTGGGACGCAAACGTCATAAAGACCCAACTTGCTAACACACAAATCAAAGCTTTGTTTGAGGAGCCCTTTTGTGAGGATATCAACAATCTGTTGGCTTAAGGGTATGTAAGGAATGCAGATGCTACCATCATCAAGCTTCTCTTTGATGAAGTGTTTGTCAATCTCCACTTGTTTAGTCCTATCATGTTGGACCGGGTTATTGGCAATGCTTAGGTGGCCTTTTTATTGTGGAATAGCTTTATAGGCATCTCATAGTCTTGACAAAGATCAGACAATTCCTTCTGGAGCCAGATCCCCAAATTCATAGTCTGTATTCAGCTTTAGCGCTGCTTCTAGCCACAACTCCTTGCTTCTGACTTCTCCAAGTAACAAGATTGCCCTACACAAAAGTACAGTATTCTGAGGTAGACTTTCTATCAACAATAGATCTTGTCTAGTTGGAATTAGTATAAGCTTTAACACATCTTCTGTCAGTCTTCTTGAATCTCAGACCTTTACCAAGAGTTGCTTTTTAATGCCTCAAAATCCTGTTAACCGCTTCCATGTGGGCCTTATAAGGTGCTTGCATGAATTGATTGACG
GTGCTTACAATGTCAGGCCTAGTGCGAGATAAGTAAATCAGCTCTGCCTCCAAATGTTGATATTTCACTTTATCAATAGGAACCCTGTCACCTGAATTTCCGAGTTTGGCATTGAACTCAATGGGTGTATCAGCAGGACGACATCCTAGCATACCCGTCTCAGCTAATAAATCAATGTTGTACTTTCTTTGGAACACTAAGATGCCTTCTCTGGATATAGCAATCTCCATCCTGAGAAAGTACTTCAGAATTCCCAAGTCTTTGATCTCAAAATCAAACCCCATCTTCTTCTTTAGTTGAATGATCTCATTAGTATCGTCCCTAGATAGCACAATGTCATCGATATAGACAATCAATACTGCAATTTTCCCTGTCTTGGAAACTTTGGTGAACAAGGTGTGATCGGAATGCCCCTGATTGTACCCCTGAGACCTGATGAGAGTGGAACATCTTTCAAACCGTGATCTTGGTGATTGTTTCAACCCGTATAAAGACTTCCAAAGCTTGCAAACCTAATTATTTAATTGAGCTTCAAATTTTGGGGGCAGTCACGTACACTTCCTCTTTTAAATCTCCATTCAAGAATGCATTCTTAACATCAAGTTGATATAGTGGCCAATCCTTATTATATGTAATAGACAACAAAACTCTAACAGTGTCTAAATTTGCAGCAGGAGAAATAATCACAGAATAGTCAATCCCATAAGTCTGAGTGAATCCTTTTGCAACTAGCATGGTTTTATGTTTGTCAAGGTACCATTTTCTTTGTACTTGAATGTAAACACCCATTGCATCTGATAGTCTTGTGTCCCATTGCATCTGGTAAACACCCATTCCAAAAGTGTTCGCTTTTGTTCTTTTCTCGGGCTCTCATTTCTTCCATGATAGCAGCTTTCCACTCCAGACACTCTAAGGCAAGGTGGATACTTTTGGATATTGTGGTAGAGTCAAAACTTGTAATAAAAGCTCTGAACTGTGGTGAGAGATTCTCATATGACACATAATTAGAGATGAAGTGTGCTTTCTACATGACCTAGTACCTTTCGTCAATGCAATAGGAAGATCAATAGATGGATTATACCTACTGATATTTTTTGAATGGCCTGAGTTAGTTTTGTTTTCAATATGTTCAACAACAACCTCATTCTCATCAATCATGTCCTCTCTGTATAATGACCCCATCAATACAACCCTGCTCATCCATATCTTCAGGAACAACTATCTCAGACTCGTCATTCTAACCTGCTTTACTGTCAGTATCTGAGTCAGTTGAATTTATCTCGAGTCTGTCATTCTCCCTCATTCTGTTATAACTATGTGAGTTAGTGGAATTAGTCATACCTTGTTCCCTTATTGGTTTAGAACCTTAGACTGGAGCCATCTGAATAGCAGGAGACTCAATTTCCTTTGTGAGATTCTTCCTATAGTAAGTTTTCCAGGGAACTTGATTTGTGGGTAGGACTGTATTGTGAGGACTAGGGGTCAAGTAAGGTAGCCATAGGTAAGGTAGCCACAGTAGGAAAAGTACAAGTAGACTCTAACGGAAACATATAGTTAGACTCTTCACTAACAGTCCCCCCTGAAGTAGGCTGATGGAAAGAAAAGACAATCCTTAATAAAGGTGATATCCATGTTTACAAAGCACTTACGTGAAGGTGCAGAGGATACCCGACAAACATGCAAGCCTGAGACTGAGGAGTGAATGTAGTTTGGTTAAGGCCATGGCTATGAGTATAGGCTGTGCATCCGAACACCCGAGGGGGAACATCAGGAACGAGATGGGTGGAGGGGTGAGATTCTTTGAGACAATCTAACGGTTTCTAGAGGTAAAGGACACGGGAAGACATGCGGTTGATGAGAGAGCACGTCATCACCCCAAAGATAGGAAGGAAGAGAAGTGGACAACATAAGAGAACGTTCAACTTCCAAAAGGTGATGGTTCTTTTGCTCGAGAACCTGAGTGTAAGCAAAGGAACTTTGGTTCTTAAGCCACCAATTCA
ACTAAAAAACTTAAGCTAGTGGTTGAAGGAAAATTTAATTATATATCACTAACACTTCCTCTCATATGTAAACTTGAAATATTTGAAAGGCCCAACAAGTGAAAATCAATTTTAATTGGAGAGGAAACAACAATGCAGGGGCTTGAACACAGGGGTTCCCTAGACCACTCGATTTGATACCATCTTGAATCACCAATTAACCAAAAAACTTAAGCTAGTGGTTGAAGGCCAATTTAATTATATATCACCTAAGAAGCACAGACACTTCATATTAGAGGTGGTATTAGTGTCAAACACTTGGGGGACACGGATTTGTCCAGACACGTTGGGGACACGTGTCCGATACGCCAAATTCCATGTTCTATTTTTTCTATTTTTTGTTTTTCGGACACGTCTGGACACTCCCAATTGCAATTTTTAGGTGAAGCCCAAACTGGCAAGCGCACTTACTTTAACGTTAATATAATAATAATATTTTATACTTTATTAATGTGTATATTAGCCCTAGCCCAAGCCCAGTTACTTTAATAAAGACACAAAATATAAAAATATTATACACATTAACATAACCCTCAAATCTCCTTGTTGTGCACTTTCCCTCCCTTCTTTCATCCCCATCCATGACATTTTATGCTTTTTTTAGTGAAGAACATTTAGTTTTTTTTTTTTTTTTTTTAGGAAATGAGGAACATTTAGTATTTTGTTACAAATTTACATATACTATAAAAAAATTAATTTTAAAAAATAGTGTATCCCCAACGTGTCTGTATCCTATTATTTTAGAAATTGGCGTATCGCTATGTCCTGTCCTGTCGTGTTCGTGTCCGTGCTTCTTAGTATATCACTAAAGGTGTGTTTGGGGGAAGGGTTGAGTTATGGAGGGTTAGAGTTATGATAAACCAGTGTTATGATAAATCTAAGATTATGATAAGATGTGTTTGGAGGAAGGGTTATGAAAAAAGTGTTACGATAAGATGTGTTTGGGAGAAAGATTATGTGGGTAGGGTTATAGTAGTGTTATAATATGATATGTTTGGGGGAAGGGTTATATGGGTAGTGTTATGATATTTTTTATTTTTTATTTTTTTAAATTTATATTATAGATCAAATACACAAATTTCTATACTTAATTTACTCAATCATCTTAGTTTTCGTAAAATAATTTGCCTAAATTTCATTAAATACGACGCTCATTTTCAACATTTTTTTAGTAATCCTTACTTGTGGATAAATTTTGTTATAAAAAAATTCATATTATAAATGTAATACATAAATTTTGTATATTTAATTTATCAATTTGTCATAGTTTTTGTAAAATAATTTGCCTTAATATCTTTAAATACAATGTTCATTTTCAACCTTTGTGCAATTTTTTTATAAAAAAAACCATATTCTAGATCAATTACAAGAATTTCGTATATTTAATTTACCAAATCGTCCTAGTTTTCGTAAAAAAAATTCCTTAATTTCTTTAAATACGACATTCATTTTCAATCTCTTCTAAGTAGTCGTTAGGTGTGAATAATTTTTTTTTTAATTCATATTCTAAATTCAATGACACAAATTTCATTTATTTAATTTACCAAATTGTCCTAGATTTTGTAAAATAAGTTGCCTTAATTTATTTAACTACAACGTTCATTTTCAACATTTTGTAAGTAACGTTACGTTTCAATAAATTGGTTTTTATGAATGTGACACGTGATCCATAAACATGTAAATAAGTGCAAATAACCATCGAAATTATGAAGTTCTGAAAACAGAAAATAACAATCAAGGCTTTTACCAACTCTAGTTTTGTTTTCTCGTGGGAAATTACAATGAATCTAGTTTTAGGTTTCTCTTCAATTTATTTTAGACTACTTTTAAAATATCAACTTTAATTTTTATAATTTTTAACAGTGGTTTATATACTTTCAAAATATTTATTTTAGTCATCATGCAATAATGATTAGGTAAAAAAACATGAATCTACCATCTCTATAACAAAAGATTTGTTA
TGTTAATTTAACAAAACTTAATTTAATAACCAAAATAACATTATGAAAGTGACGAAATATAGACAAACCACCATTTGTTGAAGATTTGTTGAAATTACAGTCCTCACTAATTTTTAATTTTATCAAATAGTCCTTATAGTTTGCTCATGGTTGTAATCAATTTATGGTAATTTGTTAAAATTGTACCATTGTTTTTTTTATTGAATTGATATTCCTTAGATATTTAAGCTAATTAACAAAATACAATAAAGTTGAGATTGATATCAAAATCTTAATTTCACAAATATATTGATGTGTTAGAACATCTAACATAGACTTGATTGAAGGTAATTAAATTAAAATTTTAATTGTGTAATTTACAATGAATTATCTAATTCAAGAATAAAGGAAGAAATTAAAATTTTAATAATGTATATAAAACTATAAATGAGAAAAAAATAGTATTTAAAATAAATGAATTAATTATAAATGAATAAGTGTTGAGATAGGGTTGGAGAAGGGTTGAGGAAGGATTGAGAAAGGGTTAAGAAAGGGTTGTGGAGAGGGGTTAAGGGTAAAACTAGGGTTATGATAACCCTTCCTCCAAACATGGGTTGGGTTACATAACCCAACCCCTCCCCCAAACATACCCTAACAATTTGAATATTTTTGTTTTCTTGAAAATGTTTCGACCTCCAAATTTGTGTTGGAATGATTCATGGGTGTTCTTCGATGTTGGTGATAATAATTTCTTTCACGCTCTCAATCATTTTATGGATCATCTAACTCCCAATCTTAAATCGATTTGTTGAGAAACCTGTTTGATTGGATTGCTCTTTGTTGACAATGTTGTCTTGTTATTATGTTCTAAAATGCATTAGAATTTCAAATGTGGATAATGATAGGTTCTTTGAGAAAATCAAGCACGAGTGAAATTATTATGTTGAAATTATTATGTTGAATTCTTCCTCTGAATCACTTGAACCAGAATTCCAATACTGTTTTTGGGAGTAAATTTGACTTTCTAGCCTATACCAACTACTGTACTTTTTCTTCGTCGTCTTCCTTGAGCAGTAATAGTCCATTCTTCCAATCTTTTTGGTGAATTTTTTTTTTGTTTTTTTGTAATTTCAAGGGTTCCCCCTTCTTAGATGTGTGTTGTGGCAATGAGCACAACTCACAAAGCTCCCTCTTGTGATCGGTGTCATGAAGAGTTTCTTGGTCTTCCACCGATGACAGTGAACATTGGCCTAGACGGCTGCTCTGATACCAAAATTGATGTATACCAAATTAGAGAATTTTATTAATGTTTTAAAAGTTGTGAATACAAGAAGGCAAATATTCTGTTTATAATTGAAAGAAAACTAATTCTAATCTTAATTAATAAAAGAAACTAATCCTAATCCTAATAGATTAAAGATTTGACCATAATACCCTATTCCTACTGCATCATTATCATTGTTGTAGGTACAAATTCTAGATAATATGATGTTTCTATGGCAAACAATGACGTTAAATCATGCTGTATTTTGCTAGTGGAAAAAGCCGAGGGGTATTTATAAGATTACTGTAAAGATTTAGTGGATGGTTCTACACAATATTTTTTCATATTGTTTGGGAGACTGAGTAGCAGATCATCAAATTTTGGATCAAGTTTTTATAATTAATAAGGCTATAGAAGATTATAGGAGGCGTAAACAGGAAGATTTCATTTTGAGTTTTATTTTCATGGTCTATGACTGTTTGGAACTTCTTGGATAAGATGTTGTCAAAGAAAGGTTTTGGGATTAAATGGAGGGTTGAATGTGGTGTTGTATGGGAAATGGTATATTCTTTATCTTGTCAATAAAAAACCTTGGTGGTAGGATCTTAGCATCAAGAGGCCTAGGACACGATAATTACTAATGTCTTTTCTCTTCCTTTTAGTTGTCAATATTCTGAGTACAATGGTGCTCTTAGTAATAATCAAGGACATTTTTTTACTAACTCTCTTCTTGCCGCCTAAAGGGAATGCCTTGGTA
TAAAGCAAATTAAAGAAAGTGTTTGGGAAAGGAAAGAACAAAATAAATATTGGGGTTCAAATCCTATTTTAGTCCCTAAACTTTGTAACTTGTTCTATTTTGGTCTTTAAACTTTGAAAAATATTTGTTTGTCCTTGAACTTTTAAGAAGAGCTTACTTTGGTTCTTGTTGTTCGAATTCTATTAACTCTTTAACAAAATGATGAGGTGGCTTCTAATCTTTGGATGACTAGACTCTAGATTCATCATATTAATTAAATTGATAAATAACCAAGGATTCTTGCTAATTTATTCTATAAATATTTTATTTCAACACAAAAACATAAACAACACACTTGATTCAAATAAACCAAAAAACTAACTCTCTTTTTTTCTCCAAAACTCAACTCTCTCACATCTCCTCCCCGTCTCCTGCATCCTTTAACTTTGAGTAGAATTAAAAAAATAAAATAGAAGGATCAGATAGGTTGTCATGTAAGATTAGCCGAGGTGCGCATAAGCTAGCTTGAATACCCACAAGATATAAAAATAGAAAAATAGAACTCGCATATTGGTGTTAACATCCATGGTAAGGTGAAATGAAACTGAAAACAACCCATATAGAAAATAAACTCTACGAAATTGCAAATAAACATTCCTACAATTAATCAATTTAATTACTATGACGAGTTTAGTGCCTGGTTATTCAAATGTAGAAGCCACTTCATCATCATTCATTCCGTTAAAGAGTTAACGAAATCCTAATGGCAAGGACCAAACTAAGCTCTTTTTAAAAGCTCAAGTGCTTAAGCATACGTTTTTAAAAAAGTTTAGGGACCAAAATAGAACAAGGTACAAAGTTTCGAGACCAAAATAGGATTTAAACTTGTATTGGCGTAATCTTATCGCCAACTTTATTAAGAAATGTTTTTGACAATGTTGATCGTCTCATGCAAAGTGATAGGTGACCATGATTATAAAAGGGGGATGATATATCTCAAGAAGACTGAGTTACTAAATAGATTATTGATTGACTATTATAGGATCATAATTTTTATATTGACCTTATCCACACAAGGCTAGGATGGGGTGTGCTCTTAATTGCTAAAGTTCCGTCAATTCATCAAGAATTTAAAGAGGCAAAAAAGGTGGGAAATGCTATTTGCTTTACATTCTTGCAACAAAAAGGAAAGTATCTAATCCTATTCCGAATGTTCTATACATCTTTTCTCTTATCTGCAAAATTTCTTTTGTTTTCCAACCAAATAGCTGAAGATGGTTATAACTCGGAAATGACACATTTTCAAAACACTTCTTTTTTTCTTTTTATTAATTCCACTAGCAAAAATATCCTGACCCAATTAAGTATACACGTCTTTGGAGAGAATTTCATTTTGTGCATCTTTTAATAAGTATTCTTTAATCACCTACGGTGTCTAATGTTGTTGACTGGCCGTACTTTATTTGACATTTTACTGTTCTAATATTTTAGCTAAATTTAATTTTTGAATTCCCTTATTAATGAAGAAGAATGCTATCATGGAAACTACCATCACGACCATGGAATGTTTAGGGTTGTCATCTTTTCATTTAACCCTCTGCTATGGTTTCTAATTTTTTTTTCATTTTGAGTACTTGGCTGTGAACGTGAGCATTATATTGTACATTTTGTTTACAGACACAAATGCTCAGCATGCTACAAGCAATATAAGAAGAAAGAGCATCTTATCGAGCATATGAGAGTTTCATATCACTCTGTTCATCAGCCTAGATGTGGAGTCTGCCTAAAGCACTGCAAATCATTTGAATCATTGAGAGAACATCTAATGGGTGAGATAGTTTTGTGTATCTTTATGGTAATATATCATAACACCATCTCATTATTTCCAGTCTTTTAGGTCCACTTTCAAAATCAAATTGTTCAAAAATTTTCTCAGAGCAAGGTTGTGGGCTTTGTTTGAGAGTACTTGATGGTCCGGAATCTCTCAGTGATCACCAAGACATTTGCTGCATAACAGCACC
TGTTCACCAAGTAAGTCGCTCTCTATGCATCATATATCATATATGATAAATAATAAGGTTACTGGTAGTTCTCAAAAAAAGAAAAAGGTTTCGGGTAGTTTATATTTTTATATATCAAGTAATTGAGCCATTTATTATGATATCATACATGCATTTCAGAATTTGATTGGGGCTCTGTTTTAGGGAACAAGTCTACCACCAACTGATTTATCCGATTGTTACGAAGAAGATCGTTCTGATCGAGGCCTTGGAGCAATAGCTATTGATTGTGTAATGGCTGGTGGAGGAAGTGATGGTGCACTGGACATTTGTGTTTGGATATGCCTTGTTGATGAAGATGAGAAATTGATTTTCAATACTTTCGTACAACCACAAATACCGATCACGAACTATAGGTACTTGTCTGTGGCATTATCTTGTTCTCTAAAGAAGGTTTTAGCTTTTAAAATCTAGCCTATAGATGCTCATTTCCCAGTCATGTTGTAGGCATGAAGTAACTGGGCTAAAGGAAGAACATATGAGGTATGCCATGCCACTGAAAAATGTCCAGGAAAAAGTATTGAAACTCTTGTTAAATGGAGAATCCATTGGGAGATTGAGATTGAATGGTGGTAAAGCTAAGCTTCTTGTCGGCCATGACTTGGAGCATGATTTAGATTGCCTGAGATTGAATTATCCCGATCATATGTTGAGGTGATTTTTACTAACTTGGTTTGTAATTTTGTGAGGTTTAGTTCACTTTTATTGAAAGTAAAATCCATTTGAACTTTGAAGTGATCTACTGAATATCGATGCACCAAAAAAAGAAAGAAAAACACTATCAACTACTACTTTTCAACCTTGCACATTATTTTCTCTGGAAATAATTCCCACCATTGAGAGAAGCCAACGTATTTTTGCTCTTTTTTCTTTAATCATTTTTCCTTTTTGTTTCGTTTTTTTATTGTTAAATGCACATTATTCATACACCTTTTGCCTGATATTTTTTTGCAGGGACACGGCTAGATATCATCCATTGATGAAAACAAATTTGGTTAGTCATTCTCTCAAGTACCTTACTCGAGCATATTTGGGGTAAGATACATTAACTATGATTCATACTTCTTTAAAGTATGATAAGCAATTTTTTATATTCTTTTTCGAAACGGAGACAAGAACTTCTTTATTAATATGAACTCAAAGTACAAGAGAGTTATACGAAGAGAGCCATAAAGAAGTAGTAATCAATGGAGTCCTAGAGGGATCAGAAGGTGCGGACATCTCAACTTGGTTGACACCCCCTTAGCGCCAAACATCATATCCTGAGGACAAGCAAGCAAAACAATAAAGAAACAATAATAAAAGCATCAGCTTAGTACAAAGGTCCAAAAAACAGTATGGGACCGGAAGAAAACAACAAGAAACTCGGGAGTGCAGAAAACAAAATAAAACAGGGGGCAATCCTTTGAGGCTTCAAAAGCAGTAGAGGGCATCGGTCTGTATAATATGCATTGAAACTGAGGCAGCTTAGGATTAGGCAGGTTGTGAAAGAAAAGCAGCCCAATTAAGGTAAATGTCTTCGATGGAATAATTAACGAACTCCTTTTTAAGGGAACACCAAGCTGCAGCTTTAAGTTCTGCCCCTGTTTTATTTTGTTTTTTTTATATTCTTTCTTTCAATAGAAGCAATTATATTAATAAAATTATTAAAAGTAGAGACACAGAACACACAAATTTACTTGAAAATCTGAGTACTGGGAGAAAAACTATAATATTTTCCTTCTTACTATTTTTTGATAATAATAGTGGTACAAAGGGGGAAATATTTATAGGCAACATGACCTAAGCCCATTATTTCTAATATTCTCCCGTTGGGACAAAAATGTCATAAAGACCCAACTTGTTAACACACAAATCAAATTTTTTTCTAAGGAGTCCCTTTGTGAGGATATCAGCAGTCTGTTGACTTGAGGGTATGTAAGGAATGCAGGTGCTACCATCATCAAGCTTCTCTTTGATAA
AGAGTCTGTCAATCTCCACATGTTTAGTCCTGTCATGTTGGACCGAGTTATTTGCAATGTTAATGGTTGCCTTGTTATCGAAGAATAGCTTCATAGGGACCTCATAGTATTGACAAAAATCTGACAACACCTTTTGGAGCCAGATCTCACAAATCCCCAAACTCTTAGCCCTATATTCGACTTCAACTTTACTTCTAGCCACAACTCCTTGCTTCTTACTTCTCCAAGTAACAAAATTGTCCGACACAAAAGTACAGTATCATGAGGTAGACTTTCTATCAACAACAGATCATGTCCAGTCATAATCAATATAAGCTTTAATACATCTTTTGTCGGTCTTCCTAAATCTCAGACTTTTACTAGGAGTTGCTTTTAAATACCTCATAATCCTATTAATCGCTTCCATATGATCCTCATAAGGTGCTTGCATGATTTGACTGACGATGCTCCCAGCATAGGAGATGTTAGGCCTAGTGTGAGATAAGTAAATCAACTTTCTGATAGACCCCCTATCATACTGCAATATAGTAGAAGGAAAAAGAAAGAGACGAAGATTTCGCTCATGCTCTTCTTTGAGAGAGAGGATAGCAGAGAGAGAGTTAGATTGTCGACCAAATTGAGATCTCCTAAATTCGATTGTTCTTTTCGTGTGATTTTTTCCTAGTAATTCATCTTTCGAATTGAAGCAATAAATTTTCTATAACTAAATTCCAAAAGGAGTTCCATCAAATTGGTATCAAAGCCAACTTTTCTGGGCAAGAGGAATACAATGGTTCAAACATGCATGGAGGAGAAAATGGAAGCGCATGATCAGGAGATTGATAGACCCCAATACTATATCAATATAGTAGGAGAGGAAAGAAGGGTAGTGGCACAACATGTGTGTAGAGTAACACGTGAGTTAGGTTAGTATAAATAAAGAATAAGTGTGGGCGGGAAAGGCAGTTAGAAATTGAGAAGGAAAAGGGTTTCTGTTCTTCCTTGAGAGATAGGATAACAGAAGAGAGTTTCATTTGTAGTATTTGATCATATTAGTGTGTGTAATCTAGATCAGATTGAATCTAATAACAATAATAATAGAAGTTCTATTCTATTAGTTACTGGATTCCATCAGAGATGCAATCGATGAGAAAAGAAGTGTGCAAAATTCTAGCGATGGAAGAAAAATTATCATCGATCAGCGTACAGACGAAAAGACTCACCAAATGCTGATGATGTTTATGGAATCGATCACCAAGGAACGTACCGCTATGAGTGAGAAGATGGTCGTGTCTAGTGTGCAAGAGACAGTATCGACGATAGTGAATAGGAGGGATGGCTCGACAAGAAAGCGCCATGAAAATGAAACAAAGGACGGGAAGGTTGAAGGAGAAGATGGAATGAATGAGAGGAATAAGTTCAAGAAGGTCGAGATGTTGGTATTCAACGGTGAAGATCCTGATTCATGGCTTTTCCGTGCAAATAGGTATTTTTCAGATACACAAATTGACTAATGCTGAGAAAGTGTTGGTTGCGACCATTAGTTTTGAGGGCCCACCGTAGAACTGGTATAGGGCGCAGGAAGAATTCGAAAAGTTTACCAATTGGTCGAATCTCAAGGATATGGTGGAGGAATATCGTAACCAATTCGATAAATTGATGGCACCTTTATCCGATCTACAGGACAAAGTGGTGGAAGAAACATTTATGAACGACTTATTTCCTTGGATTAAAGCGGAAGTTGATTTTTGCCTTTCGGTCAGCCTAACTAAAATGATGCAAGCGGCTCAACTCGTGGAAAATCGGAAGATCATTCGGAATGAGCCTAACTTTACGGGGTATGCTAGAGGTAAGTATCCTTCCCAAAATTCTTTCAACAATAAGAATAATGCGACAGTGAACATTAGTGATAGCAAGGAAAACATGTTTCCGATGAGAACGATTACTCTGAGGACTACATCTGGAGAAGTTAAGAAAGAAGGGCAGTCGAAATGATTGTCGGATGCGGAATTTCAGGCGAG
GAAGGAACATGGACTTTGTTTTCGTTGTAATGAGAAATATTCTCATGATCATAGGTGTAGGAGAAGAGAACAAAGGGAGTTGAGAATGTATGTGGTCAAGGCTAATGATGAAGAATTTGAGATCGTAGAAGTGGAAGATAATGATAAAGAGTTGAAATGTGTGGAAGTCATAGAGAAAGACGATACTGTTATTGAACAATCAATTAATTCGGTGGTGGGATTAACCAATCCGGGAACGATGAAGGTGAGGGGGAAGATCAGCAAGTGAGAAGTTATCTAATTGATTGTGGAGCGACGCATAACTTCATCTTCGAAAAGATAGTGAAGGAGCCACGAAGTCAACATCACATTATGGTGTAATTTTGGGTTTGGGTGGGGCGGTGAAAGGTAAAGGAATACGTGAAGAGGTGGGGATCAAATTGAACGGATGGAAAGTGGTAGCAAATTTTCTACTCTTAGAATTGGGAGGGGTGGATGTAGTGTTGGGAATGCTCTCTGGGTAGGACAGAAGTAGATTGGAGGAACCTAACAATGACATTTATGCATCTAGGAGAAAAGATAGTGATCAAAGGAGACCCAAGTTTAACCAAGTCCAGAGTGGGTCTCAGGAACATGATTAAAACGTGGAATGATTCCGATCAAGGATTCCGAATCGAATGCCAAGCAATGGAAAGAGTTTATGAACCAACAGAAGCGGATGGGATTGAAGTATTAACGGTGCAAGAAGCAGTATCTGTGGTGTTGAAGAAATTTGGAGATGTTTTTACTTAACCGGAGGCACTACCCCCTCAAAGATGTATTGAACATCATATTCACTTGAAACAAGGCACTAATCCAGTGAATGTACGACCTTATAGATATGCTTATCAACAGAAAGCAGAGATGGAAAAACTCGTTGATGAGATGTTGACCTCGGGGATAATTCGTCCCAGCCATAGCCCATATTCTAGTCTAGTTCTGTTGGTAAGAAAAAAAGATGGAAGTTGGCATTTTTGTGTAGACTACCGGGCGTTGAACAGTGTAACCATACTAGACAAATTCCCTATTCTTGTGATTGAGGAGCTTTTTGATGAATTAAATGGAGCGAAATGGTTTTCAAAGATAGATTTGAAGGCGAGATACCATCACCTCAGAATGTGTGGAGAAGACATAGAGAAAACAACATTTCATACACATGATGGACACTATGAATTTATGGTTATGCTGTTTGGATTGACTAATGCTCTGTCCACTTTTCAATCCTTGATGAACTTCGTATTTAAACTGTTCTTGCAAAAGTTGGTATTGGTGTTCTTCGACGATATATTAATCTACATTGCAGATTTGGAAAATCACTTGAAGCACCTTGGATTGGCACTGGAGATTTTGAGAAAAATGAGCTATATGTGAATCAAAAGAAGTGCAACTTTGCAAGAGAACGTATAGATTACTTGGGCAATATCATTTCGGGTTGAGGAGTAGAAGTGGATCCTGAGAAAATTAGAGCAATCAAAGAATGGCCTACACCAACTAATGTAAGAGAGATACGGGGATTCTTAGGCCTAACCAGTTATTACAGAAAATTCGTCCAACACTATGGTTCCATTGCTGCACCATTGACTCAACTGATAAAGAAAGGGGGGTTCAAGTGGACTGAGGAGTCTAAGGAGGCTTTCCAACAACTACAAAATGCCATGACAACATTGCCTGTTCTGGCGTTGCCTAATTTTAGCGCCACATTTGAGATGGAAACAGATGCATCTGGATGTGGAATAACAGATGCATCTGGATGTGGAATAGGAGCAGTCCTCATCCAATCCAAGCACCCCATTGCATATTTCAACCTTACGTTGGCATTAAGAGATAGAGTCCAGCCTGTATATGAAAGTGAATTAATGGCAGTAGTCTTGGCTGTACAACGATGGAGACTGTACCTACTGGGGAGGAAATTTATTGTGAAAACATATCAGAAATCCTTGAAGTACTTGTTAGAACAGAGGGTGATACAACCCCAATATCAG
AATTGGGTAGCTAAATTATTGGGTCACTCATTTGAGGTGATCTACAAACCTGGATTGGAGAAGAAAGCAGCAGACGCCTTGTCACGAATCTACAGTTCATCTTGGTAGCATAACAGCTCCAACGTTGGTAGACATTTTGGTGGTTAAGAAGGAAGTCGAAGAGGATGAAAAACTGAGCAAAATAATGGAAGAGCTACAAACGATGGAAGGAAGTAAAGAGGGTAAATTCTCTATACAACAAGATATGTTAAGATATAAGGATAGGTTGGTGCTGTCTAAAACCTCTACATTGATACTCACAATCCTACACACTTACCACGATTCGGTACTTGGAGGCCACTCTGGATTTTGGCGCACCTATAAGAGATTGACCGGAAATTGTATTGGGAGAGAATGAGGGCGAATGTTAAGAAGTATTGTGGAGAATGTGTAATATGTCATAAGAATAAAAATTAGGGTAAAGTAAATCTTAGCTACCCATGGGATCAACATGATCTAAGCTCATTATTTCTAGCAAAAATAATCGAAAATAAAAATTTATATTAATGTCTACAAAAGTGGTGTGAATATAACTAAGATGTCACCTGGCCAAAACCTAAACTTTAGGACTATGCAAAAAACGGTCACTGTCAATATTTTTAATAACAAAAAAATAAAAAGGGAAAATTATGTAAAACTCTATACAAATCACTATAATTGGGGATTTTGTACGCCTTCCAATTGTTGGCTTGTTTGAACTATAAGGGTTTGCTATTATGGAGCTTTTTCTGTTGCATCTCAGTGTTATCTTCGTCTGGTTTACTTTGTTCGTCTCCGTCTCTTGTTTACTTTGTTTGTCTCCATCTCTTGAGAGTTGTTGTATCTTTGAGCATTAGTCCCTTTTCATTTTTCAGTAAAAAAAATTTCTTTTATTTTTTTTTTAGTGTCAATCAGCCCTAGACTGGTCTTTGAAGTTTTTGTTGTTCCTCTTAAGAGTAGGAACTAAGGTAGTATGAGATGCCTTTGTCCCACATTGATTTTGTTCAAATGGGATGATCATTATGGTACTTATGTGGCTTGGCTGCCCCACTCCAATACCTGGTTTTTGAGGTGTGGTTCTCCAAGATGCTTAAGTAGCTAACAATGATATCAAATCTTTTTCAAGCTTAATCCTATAGTCAAGTTAAAAGCTATTTCAAGCTTGAATTCATCCCAGAAATTTTTGTTACATTCAAAAGCCACCAACTAGAGAGGATCTAAGTGAGCAGCAAGAGTGTCTTGTGAGCAATGAAATTATGGATTTGCGTGTGTTATTTTTTATTAAATCTTTCATGTTGGGTCTTTAATGCTTATCGTTATGCTCCAATACCTCTCACGCTGTCCTTATTTTTCGTGTATTTGGTTGGCCCAATGAAATAACTGTAGATTTTAAAATGTTTTGGAATACATGAGTTTTAGTGTTGGTGGTCATTTGCATAGGTGAGACATACGGGTCTATTTGTTCCTAATTTCTAATGTTTATAGAGCTAGTCCTATATGTTTAAGGCTGCTTAGATATCCACAAGAATGAAAAAAATATCCAACACATCATCATCGAATGTTTACAGTCAAATATAGATTCTTTATTATCTTATAATCAGCTATGTGAGGGTTGAGGAAATGAATTTGAAATTAGAATTCTCAAGGGAAATGTAAATAACCAACGTTCTACATCTTCCGCATTGAAGAATCAAATTCATTTACCCTTGAAGAGTGGTTTTTTTCACATATTATTTAAAAATTTGACGAAGATGAAACATTGATCATGTGGATAATGGGAGAATGCTGTCTTCATAGGTGGGTTATATATTATATTCTCTTTATCATTAAGGATGATCCCTGAGAGCTGTGGTTTTGCAGGTATGATATTCGGCAAGATGGGCACGACCCTTACGAAAACTGTGTTTCAGTAATGCGATTGTATAAAAGAATGCGTAGTCTTGATCACCATAGACAAGTTATGACCCTGTCAATTACTCCTT
CCTGTATCCAGTATGTTGCTCCCAATCTTGATTCCCATAGTGCAAAAGATCTTGAAAAGATGACTCCAGATGAACTTTATGAGATGTCGAGATCAAACTTTAAGTGTTGGTGTCATGACTCCAGACGAGTTATGCAGACTTAGGAGCCAATCATTTTTGTTTTTGCTGCTAAGAAAAGGGCATTGATCATTGTGGTGTCTGCAGTTTGTATAGAGAATTCTCATTTTGTTGAGAAGGGAGGGAGGTTCTTGGATGAAACGCTCCCATCGGACTTGTTTGTTAAAAAATGGGAATGACCTTATCTTTTGTATCTATTTAGCCTTTCAAGACTTAATACGGTAAAGGTGCAGTTCCCACATCGGAAGTGTCTTACCAGTTTATGGTAGTGTTTTAGGTCTCTTATGATGTTTGTTTTCTGATATAAACAATGAACTTTGAAAACGTAAGGACGCGCTTTGTTGAAGGTGTTTTTGAAGGGTGAATCAGTTTGGGCATCGATAGCCCATGTTTTTCTTTGAAGAGTTTGCTTATTTACCAAGATTAATGCTTTTCTTTGACATCTACACATGAAACGGGTTTCGTCTAGCCGTCGGAGATTTTTGAAGCTCTCTCTTCTCTTCTTTGACTTCTATACTTTCCCGCTTCCTCCCCATCTCTGTTTCATTTAGTTTGTTGGGGATTTTTATGGGTTTTATTTTAAAATTTTATGAAACTATTAACTTTCATGGGTTTATTTTAGTTAATTGTGTTTTTGGTGTATGCAACCAAACTAATATTTGTTGGATAAAAAGAAAAAGAATTTGTTTTTTTGTTATTTTTCATGGGTTTTATTTGGGGGAATTTACATAAATGGAAGAAAATACAAAGTATTTACACCATATAGC

mRNA sequence

ATGGACTCTGATCATGACCCTCTCAAGACACCAAGCCTAAGACACAAATGCTCAGCATGCTACAAGCAATATAAGAAGAAAGAGCATCTTATCGAGCATATGAGAGTTTCATATCACTCTGTTCATCAGCCTAGATGTGGAGTCTGCCTAAAGCACTGCAAATCATTTGAATCATTGAGAGAACATCTAATGGAGCAAGGTTGTGGGCTTTGTTTGAGAGTACTTGATGGTCCGGAATCTCTCAGTGATCACCAAGACATTTGCTGCATAACAGCACCTGTTCACCAAGGAACAAGTCTACCACCAACTGATTTATCCGATTGTTACGAAGAAGATCGTTCTGATCGAGGCCTTGGAGCAATAGCTATTGATTGTGTAATGGCTGGTGGAGGAAGTGATGGTGCACTGGACATTTGTGTTTGGATATGCCTTGTTGATGAAGATGAGAAATTGATTTTCAATACTTTCGTACAACCACAAATACCGATCACGAACTATAGGCATGAAGTAACTGGGCTAAAGGAAGAACATATGAGGTATGCCATGCCACTGAAAAATGTCCAGGAAAAAGTATTGAAACTCTTGTTAAATGGAGAATCCATTGGGAGATTGAGATTGAATGGTGGTAAAGCTAAGCTTCTTGTCGGCCATGACTTGGAGCATGATTTAGATTGCCTGAGATTGAATTATCCCGATCATATGTTGAGGGACACGGCTAGATATCATCCATTGATGAAAACAAATTTGGTTAGTCATTCTCTCAAGTACCTTACTCGAGCATATTTGGGGTATGATATTCGGCAAGATGGGCACGACCCTTACGAAAACTGTGTTTCAGTAATGCGATTGTATAAAAGAATGCGTAGTCTTGATCACCATAGACAAGTTATGACCCTGTCAATTACTCCTTCCTGTATCCAGTATGTTGCTCCCAATCTTGATTCCCATAGTGCAAAAGATCTTGAAAAGATGACTCCAGATGAACTTTATGAGATGTCGAGATCAAACTTTAAGTGTTGGTGTCATGACTCCAGACGAGTTATGCAGACTTAG

Coding sequence (CDS)

ATGGACTCTGATCATGACCCTCTCAAGACACCAAGCCTAAGACACAAATGCTCAGCATGCTACAAGCAATATAAGAAGAAAGAGCATCTTATCGAGCATATGAGAGTTTCATATCACTCTGTTCATCAGCCTAGATGTGGAGTCTGCCTAAAGCACTGCAAATCATTTGAATCATTGAGAGAACATCTAATGGAGCAAGGTTGTGGGCTTTGTTTGAGAGTACTTGATGGTCCGGAATCTCTCAGTGATCACCAAGACATTTGCTGCATAACAGCACCTGTTCACCAAGGAACAAGTCTACCACCAACTGATTTATCCGATTGTTACGAAGAAGATCGTTCTGATCGAGGCCTTGGAGCAATAGCTATTGATTGTGTAATGGCTGGTGGAGGAAGTGATGGTGCACTGGACATTTGTGTTTGGATATGCCTTGTTGATGAAGATGAGAAATTGATTTTCAATACTTTCGTACAACCACAAATACCGATCACGAACTATAGGCATGAAGTAACTGGGCTAAAGGAAGAACATATGAGGTATGCCATGCCACTGAAAAATGTCCAGGAAAAAGTATTGAAACTCTTGTTAAATGGAGAATCCATTGGGAGATTGAGATTGAATGGTGGTAAAGCTAAGCTTCTTGTCGGCCATGACTTGGAGCATGATTTAGATTGCCTGAGATTGAATTATCCCGATCATATGTTGAGGGACACGGCTAGATATCATCCATTGATGAAAACAAATTTGGTTAGTCATTCTCTCAAGTACCTTACTCGAGCATATTTGGGGTATGATATTCGGCAAGATGGGCACGACCCTTACGAAAACTGTGTTTCAGTAATGCGATTGTATAAAAGAATGCGTAGTCTTGATCACCATAGACAAGTTATGACCCTGTCAATTACTCCTTCCTGTATCCAGTATGTTGCTCCCAATCTTGATTCCCATAGTGCAAAAGATCTTGAAAAGATGACTCCAGATGAACTTTATGAGATGTCGAGATCAAACTTTAAGTGTTGGTGTCATGACTCCAGACGAGTTATGCAGACTTAG

Protein sequence

MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLREHLMEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT*
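
The protein sequence above appears to be the direct translation of the CDS, with the trailing `*` marking the stop codon. A minimal Python sketch of that translation follows. It assumes the standard nuclear genetic code and an in-frame CDS containing only A, C, G, and T; `translate` is a hypothetical helper, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # A codon with an ambiguous base (e.g. N) raises KeyError here.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons of the CDS listed above.
print(translate("ATGGACTCTGATCATGACCCTCTC"))
```

Applied to the first eight codons, this gives `MDSDHDPL`, which matches the start of the protein sequence. The final codons `CAGACTTAG` translate to `QT*`.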
Homology
BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_004142658.1 (apoptosis-enhancing nuclease isoform X2 [Cucumis sativus] >KGN54753.2 hypothetical protein Csa_012744 [Cucumis sativus])

HSP 1 Score: 741.5 bits (1913), Expect = 3.3e-210
Identity = 350/350 (100.00%), Positives = 350/350 (100.00%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR
Sbjct: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60

Query: 61  EHLMEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGA 120
           EHLMEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGA
Sbjct: 61  EHLMEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGA 120

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY
Sbjct: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR
Sbjct: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240

Query: 241 YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLS 300
           YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLS
Sbjct: 241 YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLS 300

Query: 301 ITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT 351
           ITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT
Sbjct: 301 ITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT 350

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_011653750.1 (apoptosis-enhancing nuclease isoform X1 [Cucumis sativus] >XP_011653752.1 apoptosis-enhancing nuclease isoform X1 [Cucumis sativus] >XP_031739807.1 apoptosis-enhancing nuclease isoform X1 [Cucumis sativus] >XP_031739808.1 apoptosis-enhancing nuclease isoform X1 [Cucumis sativus])

HSP 1 Score: 732.3 bits (1889), Expect = 2.0e-207
Identity = 350/363 (96.42%), Positives = 350/363 (96.42%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR
Sbjct: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60

Query: 61  EHLM-------------EQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSD 120
           EHLM             EQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSD
Sbjct: 61  EHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSD 120

Query: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180
           CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR
Sbjct: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180

Query: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240
           HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR
Sbjct: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240

Query: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300
           LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM
Sbjct: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300

Query: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 351
           RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV
Sbjct: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 360
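
The Identity figure in each HSP header is the number of identical aligned columns divided by the alignment length, gap columns included. That is why the 13-residue gap in this hit lowers the value to 350/363 although every aligned residue matches. A minimal Python sketch of the arithmetic follows; the helper names `count_identities` and `percent` are hypothetical, and the short strings in the example are illustrative only.

```python
def count_identities(query, subject):
    """Count columns where both aligned rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


def percent(matches, alignment_length):
    """Percentage as printed in the HSP header, rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Illustrative fragment: 6 identical columns out of 9, with a 3-column gap.
print(count_identities("EHLM---EQ", "EHLMGPLEQ"))

# Counts taken from the HSP headers in this record.
print(percent(350, 350), percent(350, 363), percent(308, 330))
```

The three percentages reproduce the 100.00%, 96.42%, and 93.33% identities reported in the headers.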

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: KAA0045030.1 (apoptosis-enhancing nuclease isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 649.0 bits (1673), Expect = 2.3e-182
Identity = 308/330 (93.33%), Positives = 315/330 (95.45%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLM-------------EQGCGLCLRVLDGPES 93
           MRVSYHSVHQPRCGVCLKHCKSFESLREHLM             EQGCGLCLRVLDGPE+
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLREHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPET 60

Query: 94  LSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV 153
           LS+HQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICV
Sbjct: 61  LSEHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICV 120

Query: 154 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 213
           WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES
Sbjct: 121 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 180

Query: 214 IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 273
           IG+LR NGGKAKLLVGHDLEHDLDCLR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA
Sbjct: 181 IGKLRSNGGKAKLLVGHDLEHDLDCLRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 240

Query: 274 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKD 333
           YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDH RQVMTLS+TPSCIQYVAPNLDSHSAKD
Sbjct: 241 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHRRQVMTLSVTPSCIQYVAPNLDSHSAKD 300

Query: 334 LEKMTPDELYEMSRSNFKCWCHDSRRVMQT 351
           LEKMTPDELYEMSR+NFKCWCHDSRRVMQT
Sbjct: 301 LEKMTPDELYEMSRTNFKCWCHDSRRVMQT 330

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: TYJ96296.1 (apoptosis-enhancing nuclease isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 649.0 bits (1673), Expect = 2.3e-182
Identity = 308/330 (93.33%), Positives = 315/330 (95.45%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLM-------------EQGCGLCLRVLDGPES 93
           MRVSYHSVHQPRCGVCLKHCKSFESLREHLM             EQGCGLCLRVLDGPE+
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLREHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPET 60

Query: 94  LSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV 153
           LS+HQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICV
Sbjct: 61  LSEHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICV 120

Query: 154 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 213
           WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEH+RYAMPLKNVQEKVLKLLLNGES
Sbjct: 121 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHLRYAMPLKNVQEKVLKLLLNGES 180

Query: 214 IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 273
           IGRLR NGGKAKLLVGHDLEHDLDCLR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA
Sbjct: 181 IGRLRSNGGKAKLLVGHDLEHDLDCLRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 240

Query: 274 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKD 333
           YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDH RQVMTLS+TPSCIQYVAPNLDSHSAKD
Sbjct: 241 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHRRQVMTLSVTPSCIQYVAPNLDSHSAKD 300

Query: 334 LEKMTPDELYEMSRSNFKCWCHDSRRVMQT 351
           LEKMTPDELYEMSR+NFKCWCHDSRRVMQT
Sbjct: 301 LEKMTPDELYEMSRTNFKCWCHDSRRVMQT 330

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_038877619.1 (uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida] >XP_038877620.1 uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida] >XP_038877621.1 uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida] >XP_038877622.1 uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida] >XP_038877623.1 uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida] >XP_038877625.1 uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida])

HSP 1 Score: 640.2 bits (1650), Expect = 1.1e-179
Identity = 313/363 (86.23%), Positives = 327/363 (90.08%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSDHDPLKTPSLRHKCSACYKQYKKK+HLIEHMRVSYHSVHQPRCGVCLKH KSFESLR
Sbjct: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKDHLIEHMRVSYHSVHQPRCGVCLKHFKSFESLR 60

Query: 61  EHL-------------MEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSD 120
           EHL              E+GCGLCLRVLDGP SLS+HQ+ICC+TAPVHQG SL PTDLSD
Sbjct: 61  EHLKGPLPKSNCSKIFSERGCGLCLRVLDGPGSLSEHQNICCLTAPVHQGNSLLPTDLSD 120

Query: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180
           CY EDRSDRG GAIA+DCVMAGGGSDGALDICVWICLVDEDEKLIF+TFVQPQIPITNYR
Sbjct: 121 CYGEDRSDRGHGAIAMDCVMAGGGSDGALDICVWICLVDEDEKLIFSTFVQPQIPITNYR 180

Query: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240
           HEVTGLKEEHMR+A+PLKNVQEKVLK+LLNGESI RLR NGGKA+LLVGHDLEHDLDCLR
Sbjct: 181 HEVTGLKEEHMRHAIPLKNVQEKVLKILLNGESIERLRFNGGKARLLVGHDLEHDLDCLR 240

Query: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300
           +NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQ GH+PYENCVSVMRLYKRM
Sbjct: 241 MNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQGGHNPYENCVSVMRLYKRM 300

Query: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 351
           RSLDH  QV   SI P  IQYVAP+LDS S KDLEKMTP ELYEMSRSNFKCWC DSRRV
Sbjct: 301 RSLDHRGQVP--SIAPH-IQYVAPSLDSVSTKDLEKMTPHELYEMSRSNFKCWCLDSRRV 360

BLAST of CsaV3_4G027590 vs. ExPASy Swiss-Prot
Match: Q4IEV5 (RNA exonuclease 4 OS=Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) OX=229533 GN=REX4 PE=3 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 7.4e-19
Identity = 58/176 (32.95%), Positives = 97/176 (55.11%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           IAIDC M G G  G       + +VD     I++++V+P+  +TN+R  V+G+ ++ MR+
Sbjct: 134 IAIDCEMVGVGPGGHESALARVSIVDFHGVQIYDSYVKPKEKVTNWRTAVSGISQKSMRF 193

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A   + VQ ++ KLL              + ++LVGHDL+HDL+ L L++P   +RDTA+
Sbjct: 194 ARDFEEVQAEIDKLL--------------RGRILVGHDLKHDLEALILSHPGKDIRDTAK 253

Query: 241 YHPLMK-TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRS---LDH 293
           +    K  N    SL+ L +  LG +I+   H   E+  + M L+++ +S   +DH
Sbjct: 254 FSGFKKYANGRKPSLRVLAQQLLGVEIQGGEHSSIEDARATMLLFRKHKSAFDVDH 295

BLAST of CsaV3_4G027590 vs. ExPASy Swiss-Prot
Match: Q08237 (RNA exonuclease 4 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=REX4 PE=1 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 2.8e-18
Identity = 56/171 (32.75%), Positives = 91/171 (53.22%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           IA+DC   G G +G       I +V+    ++ + FV+P+  +  +R  V+G+K EHM+ 
Sbjct: 122 IAMDCEFVGVGPEGKESALARISIVNYFGHVVLDEFVKPREKVVEWRTWVSGIKPEHMKN 181

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A+  K  Q+K   +L              + ++LVGH L+HDL+ L L++P  +LRDT+R
Sbjct: 182 AITFKEAQKKTADIL--------------EGRILVGHALKHDLEALMLSHPKSLLRDTSR 241

Query: 241 YHPLMK--TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRS 290
           + P  K      + SLK LTR  L   I++  H   E+  + M LYK+ ++
Sbjct: 242 HLPFRKLYAKGKTPSLKKLTREVLKISIQEGEHSSVEDARATMLLYKKEKT 278

BLAST of CsaV3_4G027590 vs. ExPASy Swiss-Prot
Match: Q9GZR2 (RNA exonuclease 4 OS=Homo sapiens OX=9606 GN=REXO4 PE=1 SV=2)

HSP 1 Score: 92.8 bits (229), Expect = 8.2e-18
Identity = 53/167 (31.74%), Positives = 94/167 (56.29%), Query Frame = 0

Query: 120 AIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMR 179
           A+A+DC M G G  G   +   + +V++  K +++ +V+P  P+T+YR  V+G++ E+++
Sbjct: 243 ALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVTDYRTAVSGIRPENLK 302

Query: 180 YAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTA 239
               L+ VQ++V ++L              K ++LVGH L +DL  L L++P   +RDT 
Sbjct: 303 QGEELEVVQKEVAEML--------------KGRILVGHALHNDLKVLFLDHPKKKIRDTQ 362

Query: 240 RYHPLMKTNLVS--HSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLY 285
           +Y P  K+ + S   SL+ L+   LG  ++Q  H   ++  + MRLY
Sbjct: 363 KYKP-FKSQVKSGRPSLRLLSEKILGLQVQQAEHCSIQDAQAAMRLY 394

BLAST of CsaV3_4G027590 vs. ExPASy Swiss-Prot
Match: Q6CMT3 (RNA exonuclease 4 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) OX=284590 GN=REX4 PE=3 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 1.4e-17
Identity = 53/167 (31.74%), Positives = 90/167 (53.89%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           +++DC   G G DG       + +V+    ++ + FV+P+ P+T++R  V+G+K  HM  
Sbjct: 120 VSMDCEFVGVGPDGKDSALARVSIVNYYGNVVLDLFVRPKEPVTDWRTWVSGIKPHHMAN 179

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A+  ++ Q++V  +L              K ++LVGH + HDL  L L++P  M+RDT+R
Sbjct: 180 AVTQEDCQKQVSNVL--------------KGRILVGHSVHHDLTALMLSHPRRMIRDTSR 239

Query: 241 YHPLMK--TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 286
           + P  +  +   + SLK LT+  L  DI+   H   E+  + M LYK
Sbjct: 240 HMPFRQKYSEGKTPSLKKLTKEILQLDIQDGEHSSIEDARATMLLYK 272

BLAST of CsaV3_4G027590 vs. ExPASy Swiss-Prot
Match: Q7S9B7 (RNA exonuclease 4 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) OX=367110 GN=rex-4 PE=3 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 1.4e-17
Identity = 57/185 (30.81%), Positives = 94/185 (50.81%), Query Frame = 0

Query: 103 TDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIP 162
           TDL+   +  +S+     ++IDC M G G  GA  +     +VD     I++++V+P   
Sbjct: 199 TDLALILQATKSNTLGKYLSIDCEMVGTGPSGATSVLARCSIVDFHGHQIYDSYVRPTAF 258

Query: 163 ITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHD 222
           +T++R  V+G+ + HM  A   ++VQ  V  LL              K ++LVGHD++HD
Sbjct: 259 VTDWRTHVSGISKRHMASARSFESVQATVAALL--------------KGRILVGHDVKHD 318

Query: 223 LDCLRLNYPDHMLRDTARYHPLMK-TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVM 282
           L+ L   +P   +RDTA+Y    K  +    SL+ L +  LG +I Q  H   E+    M
Sbjct: 319 LEVLGFEHPHRDIRDTAKYSGFRKYGHGPKPSLRVLAKEVLGIEIHQGQHSSVEDARVAM 369

Query: 283 RLYKR 287
            L+++
Sbjct: 379 LLFRK 369

BLAST of CsaV3_4G027590 vs. ExPASy TrEMBL
Match: A0A5D3BBM9 (RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold78209G00940 PE=3 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 1.1e-182
Identity = 308/330 (93.33%), Positives = 315/330 (95.45%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLM-------------EQGCGLCLRVLDGPES 93
           MRVSYHSVHQPRCGVCLKHCKSFESLREHLM             EQGCGLCLRVLDGPE+
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLREHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPET 60

Query: 94  LSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV 153
           LS+HQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICV
Sbjct: 61  LSEHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICV 120

Query: 154 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 213
           WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEH+RYAMPLKNVQEKVLKLLLNGES
Sbjct: 121 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHLRYAMPLKNVQEKVLKLLLNGES 180

Query: 214 IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 273
           IGRLR NGGKAKLLVGHDLEHDLDCLR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA
Sbjct: 181 IGRLRSNGGKAKLLVGHDLEHDLDCLRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 240

Query: 274 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKD 333
           YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDH RQVMTLS+TPSCIQYVAPNLDSHSAKD
Sbjct: 241 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHRRQVMTLSVTPSCIQYVAPNLDSHSAKD 300

Query: 334 LEKMTPDELYEMSRSNFKCWCHDSRRVMQT 351
           LEKMTPDELYEMSR+NFKCWCHDSRRVMQT
Sbjct: 301 LEKMTPDELYEMSRTNFKCWCHDSRRVMQT 330

BLAST of CsaV3_4G027590 vs. ExPASy TrEMBL
Match: A0A5A7TPB9 (RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold30G00150 PE=3 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 1.1e-182
Identity = 308/330 (93.33%), Positives = 315/330 (95.45%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLM-------------EQGCGLCLRVLDGPES 93
           MRVSYHSVHQPRCGVCLKHCKSFESLREHLM             EQGCGLCLRVLDGPE+
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLREHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPET 60

Query: 94  LSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV 153
           LS+HQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICV
Sbjct: 61  LSEHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICV 120

Query: 154 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 213
           WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES
Sbjct: 121 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 180

Query: 214 IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 273
           IG+LR NGGKAKLLVGHDLEHDLDCLR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA
Sbjct: 181 IGKLRSNGGKAKLLVGHDLEHDLDCLRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 240

Query: 274 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKD 333
           YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDH RQVMTLS+TPSCIQYVAPNLDSHSAKD
Sbjct: 241 YLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHRRQVMTLSVTPSCIQYVAPNLDSHSAKD 300

Query: 334 LEKMTPDELYEMSRSNFKCWCHDSRRVMQT 351
           LEKMTPDELYEMSR+NFKCWCHDSRRVMQT
Sbjct: 301 LEKMTPDELYEMSRTNFKCWCHDSRRVMQT 330

BLAST of CsaV3_4G027590 vs. ExPASy TrEMBL
Match: A0A5A7TYQ2 (RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold3741G00010 PE=3 SV=1)

HSP 1 Score: 637.9 bits (1644), Expect = 2.5e-179
Identity = 303/324 (93.52%), Positives = 311/324 (95.99%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLMEQ-------GCGLCLRVLDGPESLSDHQD 93
           MRVSYHSVHQPRCGVCLKHCKSFESLR HLM +       GCGLCLRVLDGPE+LS+HQD
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLRVHLMGEIVLCIFDGCGLCLRVLDGPETLSEHQD 60

Query: 94  ICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVD 153
           ICCITAPVHQGTSLPPTDLSD YEEDRSDRGLGAIA+DCVMAGGGSDGALDICVWICLVD
Sbjct: 61  ICCITAPVHQGTSLPPTDLSDGYEEDRSDRGLGAIAMDCVMAGGGSDGALDICVWICLVD 120

Query: 154 EDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRL 213
           EDEKLIFNTFVQPQIPITNYRHEVTGLKEEH+RYAMPLKNVQEKVLKLLLNGESIGRLR 
Sbjct: 121 EDEKLIFNTFVQPQIPITNYRHEVTGLKEEHLRYAMPLKNVQEKVLKLLLNGESIGRLRS 180

Query: 214 NGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDI 273
           NGGKAKLLVGHDLEHDLDCLR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDI
Sbjct: 181 NGGKAKLLVGHDLEHDLDCLRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDI 240

Query: 274 RQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTP 333
           RQDGHDPYENCVSVMRLYKRMRSLDH RQVMT S+TPSCIQYVAPNLDSHSAKDLEKMTP
Sbjct: 241 RQDGHDPYENCVSVMRLYKRMRSLDHRRQVMTRSVTPSCIQYVAPNLDSHSAKDLEKMTP 300

Query: 334 DELYEMSRSNFKCWCHDSRRVMQT 351
           DELYEMSR+NFKCWCHDSRRVMQT
Sbjct: 301 DELYEMSRTNFKCWCHDSRRVMQT 324

BLAST of CsaV3_4G027590 vs. ExPASy TrEMBL
Match: A0A6J1DNJ6 (RNA exonuclease 4 OS=Momordica charantia OX=3673 GN=LOC111022742 PE=3 SV=1)

HSP 1 Score: 630.6 bits (1625), Expect = 4.0e-177
Identity = 299/363 (82.37%), Positives = 324/363 (89.26%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSD DPL  P+ RHKCSACYKQYKKKEHLIEHM+VSYHS+HQPRCGVC KHCKSFESLR
Sbjct: 1   MDSDRDPLNPPTTRHKCSACYKQYKKKEHLIEHMKVSYHSIHQPRCGVCGKHCKSFESLR 60

Query: 61  EHL-------------MEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSD 120
           EHL             +EQGCGLCLRVLDG  SL++H+DICC+TAPVHQGTSL PTDLSD
Sbjct: 61  EHLQGPLSKSNCSKIFIEQGCGLCLRVLDGRRSLNEHRDICCLTAPVHQGTSLLPTDLSD 120

Query: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180
           CY+EDRSDR LGAIA+DCVMAGGGSDG LD+CV +CLVDE+EKLIFNTFV+PQIPITNYR
Sbjct: 121 CYDEDRSDRVLGAIAMDCVMAGGGSDGTLDLCVGVCLVDEEEKLIFNTFVRPQIPITNYR 180

Query: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240
           HEVTGL EEHMRYAMPLK VQEKVL++L+NGESIGRLRLNGG+A+LLVGHDLEHDLDCLR
Sbjct: 181 HEVTGLTEEHMRYAMPLKEVQEKVLRILINGESIGRLRLNGGRARLLVGHDLEHDLDCLR 240

Query: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300
           +NYPDHMLRDTARYHPLMKTNLVSHSLKYLTR YLGYDI+Q  HDPYENCVSVMRLYKRM
Sbjct: 241 MNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRTYLGYDIQQGVHDPYENCVSVMRLYKRM 300

Query: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 351
           R+LDHH QVM  ++ P   QYVA NLDSHS KDLEKMTPD+LYEMSRSNFKCWC DSRRV
Sbjct: 301 RNLDHHGQVMIPTVAPHA-QYVAHNLDSHSVKDLEKMTPDKLYEMSRSNFKCWCLDSRRV 360

BLAST of CsaV3_4G027590 vs. ExPASy TrEMBL
Match: A0A5D3BMN0 (RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1923G00030 PE=3 SV=1)

HSP 1 Score: 619.4 bits (1596), Expect = 9.3e-174
Identity = 294/317 (92.74%), Positives = 305/317 (96.21%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLMEQGCGLCLRVLDGPESLSDHQDICCITAP 93
           MRVSYHSVHQPRCGVCLKHCKSFESLRE+LMEQGCGLCLRVLD P++LS+H +ICCITAP
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLREYLMEQGCGLCLRVLDSPKTLSEHLNICCITAP 60

Query: 94  VHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIF 153
           VHQGTSL PTDLSDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICVWICLVDE EKLIF
Sbjct: 61  VHQGTSLLPTDLSDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICVWICLVDEHEKLIF 120

Query: 154 NTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKL 213
           NTFVQPQIPITNYRHEVTGLKEEH+RYAMPL+NVQEKVLKLLLNGESIGRLR NGGKAKL
Sbjct: 121 NTFVQPQIPITNYRHEVTGLKEEHLRYAMPLRNVQEKVLKLLLNGESIGRLRSNGGKAKL 180

Query: 214 LVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDP 273
           LVGHDLEHDLD LR+NYPDHMLRDTARYHPL+KTNLVSHSLKYLTRAYLGYDIRQDGHDP
Sbjct: 181 LVGHDLEHDLDYLRMNYPDHMLRDTARYHPLIKTNLVSHSLKYLTRAYLGYDIRQDGHDP 240

Query: 274 YENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMS 333
           YENCVSVMRLYKRMRSLDH RQVMT S T   IQYVAPNLDS+SAKDLEKMTPDELYEMS
Sbjct: 241 YENCVSVMRLYKRMRSLDHRRQVMTPSATSPYIQYVAPNLDSYSAKDLEKMTPDELYEMS 300

Query: 334 RSNFKCWCHDSRRVMQT 351
           RSNFKCWCHDSRRVMQT
Sbjct: 301 RSNFKCWCHDSRRVMQT 317

BLAST of CsaV3_4G027590 vs. TAIR 10
Match: AT2G48100.1 (Exonuclease family protein)

HSP 1 Score: 350.5 bits (898), Expect = 1.5e-96
Identity = 177/361 (49.03%), Positives = 237/361 (65.65%), Query Frame = 0

Query: 1   MDSDHDPLKTP--SLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFES 60
           MDS  +P K    S+RH+C ACYK + ++EHL+EHM++SYHS+HQPRCGVCLKHCKSFES
Sbjct: 1   MDSQLNPSKRRKISVRHRCVACYKMFNRREHLVEHMKISYHSLHQPRCGVCLKHCKSFES 60

Query: 61  LREHL---------------MEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPT 120
           +REHL                ++GC LCL++ +   +L++H++ C ++ P   GTS    
Sbjct: 61  VREHLNVPDHLSKGNCKAIFTKRGCTLCLQIFEEAFALAEHKNKCHLSPPRPLGTSTQRN 120

Query: 121 DLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPI 180
             S       +   L A+A+DC M GGG+DG +D C  +CLVD+DE +IF+T VQP +P+
Sbjct: 121 PSSSL-----AGSRLKAMALDCEMVGGGADGTIDQCASVCLVDDDENVIFSTHVQPLLPV 180

Query: 181 TNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDL 240
           T+YRHE+TGL +E ++  MPL++V+E+V   L  G++ G  RL      LLVGHDL HD+
Sbjct: 181 TDYRHEITGLTKEDLKDGMPLEHVRERVFSFLCGGQNDGAGRL------LLVGHDLRHDM 240

Query: 241 DCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRL 300
            CL+L YP H+LRDTA+Y PLMKTNLVS SLKYLT++YLGY I+   H+ YE+CVS MRL
Sbjct: 241 SCLKLEYPSHLLRDTAKYVPLMKTNLVSQSLKYLTKSYLGYKIQCGKHEVYEDCVSAMRL 300

Query: 301 YKRMRSLDHHRQVMTLSITPSCIQYVAPN-LDSHSAKDLEKMTPDELYEMSRSNFKCWCH 344
           YKRMR  +H            C      N L+S    DLEKM  +ELY+ S S ++CWC 
Sbjct: 301 YKRMRDQEH-----------VCSGKAEGNGLNSRKQSDLEKMNAEELYQKSTSEYRCWCL 339

BLAST of CsaV3_4G027590 vs. TAIR 10
Match: AT2G48100.2 (Exonuclease family protein)

HSP 1 Score: 350.5 bits (898), Expect = 1.5e-96
Identity = 177/361 (49.03%), Positives = 237/361 (65.65%), Query Frame = 0

Query: 1   MDSDHDPLKTP--SLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFES 60
           MDS  +P K    S+RH+C ACYK + ++EHL+EHM++SYHS+HQPRCGVCLKHCKSFES
Sbjct: 1   MDSQLNPSKRRKISVRHRCVACYKMFNRREHLVEHMKISYHSLHQPRCGVCLKHCKSFES 60

Query: 61  LREHL---------------MEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPT 120
           +REHL                ++GC LCL++ +   +L++H++ C ++ P   GTS    
Sbjct: 61  VREHLNVPDHLSKGNCKAIFTKRGCTLCLQIFEEAFALAEHKNKCHLSPPRPLGTSTQRN 120

Query: 121 DLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPI 180
             S       +   L A+A+DC M GGG+DG +D C  +CLVD+DE +IF+T VQP +P+
Sbjct: 121 PSSSL-----AGSRLKAMALDCEMVGGGADGTIDQCASVCLVDDDENVIFSTHVQPLLPV 180

Query: 181 TNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDL 240
           T+YRHE+TGL +E ++  MPL++V+E+V   L  G++ G  RL      LLVGHDL HD+
Sbjct: 181 TDYRHEITGLTKEDLKDGMPLEHVRERVFSFLCGGQNDGAGRL------LLVGHDLRHDM 240

Query: 241 DCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRL 300
            CL+L YP H+LRDTA+Y PLMKTNLVS SLKYLT++YLGY I+   H+ YE+CVS MRL
Sbjct: 241 SCLKLEYPSHLLRDTAKYVPLMKTNLVSQSLKYLTKSYLGYKIQCGKHEVYEDCVSAMRL 300

Query: 301 YKRMRSLDHHRQVMTLSITPSCIQYVAPN-LDSHSAKDLEKMTPDELYEMSRSNFKCWCH 344
           YKRMR  +H            C      N L+S    DLEKM  +ELY+ S S ++CWC 
Sbjct: 301 YKRMRDQEH-----------VCSGKAEGNGLNSRKQSDLEKMNAEELYQKSTSEYRCWCL 339

BLAST of CsaV3_4G027590 vs. TAIR 10
Match: AT2G48100.3 (Exonuclease family protein)

HSP 1 Score: 350.5 bits (898), Expect = 1.5e-96
Identity = 177/361 (49.03%), Positives = 237/361 (65.65%), Query Frame = 0

Query: 1   MDSDHDPLKTP--SLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFES 60
           MDS  +P K    S+RH+C ACYK + ++EHL+EHM++SYHS+HQPRCGVCLKHCKSFES
Sbjct: 1   MDSQLNPSKRRKISVRHRCVACYKMFNRREHLVEHMKISYHSLHQPRCGVCLKHCKSFES 60

Query: 61  LREHL---------------MEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPT 120
           +REHL                ++GC LCL++ +   +L++H++ C ++ P   GTS    
Sbjct: 61  VREHLNVPDHLSKGNCKAIFTKRGCTLCLQIFEEAFALAEHKNKCHLSPPRPLGTSTQRN 120

Query: 121 DLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPI 180
             S       +   L A+A+DC M GGG+DG +D C  +CLVD+DE +IF+T VQP +P+
Sbjct: 121 PSSSL-----AGSRLKAMALDCEMVGGGADGTIDQCASVCLVDDDENVIFSTHVQPLLPV 180

Query: 181 TNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDL 240
           T+YRHE+TGL +E ++  MPL++V+E+V   L  G++ G  RL      LLVGHDL HD+
Sbjct: 181 TDYRHEITGLTKEDLKDGMPLEHVRERVFSFLCGGQNDGAGRL------LLVGHDLRHDM 240

Query: 241 DCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRL 300
            CL+L YP H+LRDTA+Y PLMKTNLVS SLKYLT++YLGY I+   H+ YE+CVS MRL
Sbjct: 241 SCLKLEYPSHLLRDTAKYVPLMKTNLVSQSLKYLTKSYLGYKIQCGKHEVYEDCVSAMRL 300

Query: 301 YKRMRSLDHHRQVMTLSITPSCIQYVAPN-LDSHSAKDLEKMTPDELYEMSRSNFKCWCH 344
           YKRMR  +H            C      N L+S    DLEKM  +ELY+ S S ++CWC 
Sbjct: 301 YKRMRDQEH-----------VCSGKAEGNGLNSRKQSDLEKMNAEELYQKSTSEYRCWCL 339

BLAST of CsaV3_4G027590 vs. TAIR 10
Match: AT3G27970.1 (Exonuclease family protein)

HSP 1 Score: 336.3 bits (861), Expect = 3.0e-92
Identity = 169/363 (46.56%), Positives = 237/363 (65.29%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MD       + +LR+KC+ACY+Q+ K EHL+EHM++SYHS H+P CGVC KHC+SFESLR
Sbjct: 1   MDYRSSMESSETLRNKCAACYRQFNKLEHLVEHMKISYHSGHEPTCGVCKKHCRSFESLR 60

Query: 61  EHLME-------------QGCGLCLRVLDGPESLSDHQDICCITAPVHQG--TSLPPTDL 120
           EHL+              +GC  C+ +L+ P S   HQ+ C  ++ V+ G  T +    L
Sbjct: 61  EHLIGPLPKQECKNIFSLRGCRFCMTILESPNSRRIHQERCQFSS-VNSGLTTRMAALGL 120

Query: 121 SDCYEED-RSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPIT 180
            D    D  S R    +A+ C M GGGSDG+LD+C  +C+ DE + +IF+T+V+P + +T
Sbjct: 121 RDKAMIDYTSSRSPRVVALSCKMVGGGSDGSLDLCARVCITDESDNVIFHTYVKPSMAVT 180

Query: 181 NYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLD 240
           +YR+E TG++ E++R AMPLK VQ K+ + L NGE + ++R  GGKA++LVGH L+HDLD
Sbjct: 181 SYRYETTGIRPENLRDAMPLKQVQRKIQEFLCNGEPMWKIRPRGGKARILVGHGLDHDLD 240

Query: 241 CLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLY 300
            L+L YP  M+RDTA+Y PLMKT+ +S+SLKYLT+AYLGYD+     DPYE+CV+ MRLY
Sbjct: 241 RLQLEYPSSMIRDTAKYPPLMKTSKLSNSLKYLTQAYLGYDVHFGIQDPYEDCVATMRLY 300

Query: 301 KRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSA---KDLEKMTPDELYEMSRSNFKCWC 345
            RMR   H  +   L+         A N  +  A    + E+M+PDE+  +SRS++ CWC
Sbjct: 301 TRMRYQKHKIEAYPLAAD-------AQNRSNQVAWRQSEAERMSPDEMLSISRSDYYCWC 355

BLAST of CsaV3_4G027590 vs. TAIR 10
Match: AT5G40310.1 (Exonuclease family protein)

HSP 1 Score: 327.8 bits (839), Expect = 1.1e-89
Identity = 160/347 (46.11%), Positives = 228/347 (65.71%), Query Frame = 0

Query: 14  RHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLREHLME-------- 73
           R+KC  CY+Q+ KKEHL+EHMR+SYHSVH+P CG+C KHC+SF+SLREHL+         
Sbjct: 5   RNKCGGCYRQFNKKEHLVEHMRISYHSVHEPTCGICNKHCRSFDSLREHLIGPLPKQECK 64

Query: 74  -----QGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDL---SDCYEEDRSDRG 133
                +GC  CL +L+ P +   HQ+ C + + V  G  +    L   ++   +  S R 
Sbjct: 65  NIFSIRGCRFCLTILESPNARRIHQERCQL-SNVTSGLMIRMAALGLRNNSTIDYTSSRS 124

Query: 134 LGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEH 193
              +A+ C M GGGSDG+LD+C  +C+ DE E ++F+T+V+P IP+TNYR+E+TG++ E+
Sbjct: 125 PRVVALSCKMVGGGSDGSLDLCARVCITDESENVVFHTYVKPTIPVTNYRYEMTGIRPEN 184

Query: 194 MRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRD 253
           +R AM LK+ Q KV + L NGE + ++R   GKA++LVGH L++ LD L+L Y   M+RD
Sbjct: 185 LRDAMRLKHAQRKVQEFLCNGEPMWKIRPRNGKARILVGHGLDNHLDSLQLEYSSSMIRD 244

Query: 254 TARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVM 313
           TA Y PLMK++ +S+SLKYLT+AYLGYDI     DPYE+CV+ MRLY RMR   H  +  
Sbjct: 245 TAEYPPLMKSSKLSNSLKYLTQAYLGYDIHVGIQDPYEDCVATMRLYTRMRYQKHRAEAY 304

Query: 314 TLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDS 345
            L+           N  +    +LE+M+P+EL ++SRS++ CWC DS
Sbjct: 305 PLASDTQNHN----NFAAWRQNELERMSPEELLDLSRSDYYCWCLDS 346

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004142658.1 | 3.3e-210 | 100.00 | apoptosis-enhancing nuclease isoform X2 [Cucumis sativus] >KGN54753.2 hypothetic...
XP_011653750.1 | 2.0e-207 | 96.42 | apoptosis-enhancing nuclease isoform X1 [Cucumis sativus] >XP_011653752.1 apopto...
KAA0045030.1 | 2.3e-182 | 93.33 | apoptosis-enhancing nuclease isoform X1 [Cucumis melo var. makuwa]
TYJ96296.1 | 2.3e-182 | 93.33 | apoptosis-enhancing nuclease isoform X1 [Cucumis melo var. makuwa]
XP_038877619.1 | 1.1e-179 | 86.23 | uncharacterized protein LOC120069877 isoform X1 [Benincasa hispida] >XP_03887762...

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q4IEV5 | 7.4e-19 | 32.95 | RNA exonuclease 4 OS=Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9...
Q08237 | 2.8e-18 | 32.75 | RNA exonuclease 4 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=55...
Q9GZR2 | 8.2e-18 | 31.74 | RNA exonuclease 4 OS=Homo sapiens OX=9606 GN=REXO4 PE=1 SV=2
Q6CMT3 | 1.4e-17 | 31.74 | RNA exonuclease 4 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 707...
Q7S9B7 | 1.4e-17 | 30.81 | RNA exonuclease 4 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3BBM9 | 1.1e-182 | 93.33 | RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold78209G...
A0A5A7TPB9 | 1.1e-182 | 93.33 | RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold30G001...
A0A5A7TYQ2 | 2.5e-179 | 93.52 | RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold3741G0...
A0A6J1DNJ6 | 4.0e-177 | 82.37 | RNA exonuclease 4 OS=Momordica charantia OX=3673 GN=LOC111022742 PE=3 SV=1
A0A5D3BMN0 | 9.3e-174 | 92.74 | RNA exonuclease 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1923G0...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G48100.1 | 1.5e-96 | 49.03 | Exonuclease family protein
AT2G48100.2 | 1.5e-96 | 49.03 | Exonuclease family protein
AT2G48100.3 | 1.5e-96 | 49.03 | Exonuclease family protein
AT3G27970.1 | 3.0e-92 | 46.56 | Exonuclease family protein
AT5G40310.1 | 1.1e-89 | 46.11 | Exonuclease family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR013520 | Exonuclease, RNase T/DNA polymerase III | SMART | SM00479 | exoiiiendus | coord: 119..293; e-value: 1.8E-24; score: 97.3
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 119..304; e-value: 1.4E-41; score: 144.0
None | No IPR available | PANTHER | PTHR12801:SF123 | EXONUCLEASE FAMILY PROTEIN | coord: 12..347
None | No IPR available | PANTHER | PTHR12801 | RNA EXONUCLEASE REXO1 / RECO3 FAMILY MEMBER-RELATED | coord: 12..347
IPR013087 | Zinc finger C2H2-type | PROSITE | PS00028 | ZINC_FINGER_C2H2_1 | coord: 17..39
IPR013087 | Zinc finger C2H2-type | PROSITE | PS50157 | ZINC_FINGER_C2H2_2 | coord: 15..44; score: 11.593128
IPR037431 | RNA exonuclease 4, DEDDh 3'-5' exonuclease domain | CDD | cd06144 | REX4_like | coord: 121..285; e-value: 7.67999E-68; score: 208.139
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 121..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsaV3_4G027590.1 | CsaV3_4G027590.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006364 | rRNA processing
molecular_function | GO:0008408 | 3'-5' exonuclease activity
molecular_function | GO:0003676 | nucleic acid binding