CsaV3_4G027590 (gene) Cucumber (Chinese Long) v3

NameCsaV3_4G027590
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionRNA exonuclease 4-like
Locationchr4 : 16842382 .. 16865265 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TCAGACTCGTGATATTTAATATTATGTAATTTTGGGTTTCACTTAAGTAATGTTATATTGTTTAGTGGTTGAATGTCCTTTATGTTTTCAATGTGTTATCTACTTTAAGTGAGAGGTAAGTTTAGGGTCGATAGGTGTCGTTGAGGTAAGTTGCGATATCGATTGACTAATTGCATCACCTCTTAGGTTAAGAGTTGGTGGCTTGGGAGGGGGCGTGACAAAATATCCTTAGATGGAGAAGGATATTTGTTTTGTTGGCTTCACGGTCTCTTTCAGCTAGGAGAAGGTGGTGTGGGAGGGGTGTGATAGAAATAGAACAAGAACTTTAACAAATATAAATAACATTCTAGCACTAGATCAATATCTTCAATATCTTTTGACGATATGTATATATATTTTATATATCCATTTTTGTCGATTCATCCTTAAAACGATATATCCATGATAATGATTATTTTTCGTGGAGAAAATCTTCACCCTCGTTGTTTCTATAGCATCTCATTTTGGCGATCGATATGTCGCTACTTTTCAATAGACTGCACCCTGTAGTGAAAGTTTCCCGCAAAATCACCCAGAAGTCGGTTCAGCGATACCTACACTAGGGGCTTCACTGCGGACGCATAAGAAAACAGATCCGACAAGCACGATTGTCGGCGCTTCGATAGTGACCCACCAGAGCCTTTTTTCTCTCCTTCTCGACGAGTTTTGGCATTGACTCTCTAGATTTGGACTGGTTCGAAGGCATCGATTAAGGTTTATGGTTTATTTTGGGTAGAATACCCCTCTCAACGCTCTCTCTCATGAACTGAGTTCTAGCCTCAAGCATTGTCTTAATTGTTTAGAATTCTAGGAAGTTGACATACTGCAAGTTTAGCAATTTCCACATCCTTTCTGAACGATAAATTTTGGCCTAATTATGTTTTGATTGTTTTAATTTAGAAAACGTTTGAAAGAATATTTTGATTGTTTGAAAATGTGTTTTGTATGATCTGAAGTTGATCGAAATTCTTTTCCGAAGTATGTTTATTGTTTTGGATAATTTGAAATAAAATGTTTTGATTGATTTTGAATTGGAGGTACGTTTGTTTTAAATTTTTTATACATGCAATTGTTGAAAAGATAGAATGGTTTTTTTTTTCTTTTTTATGAATCTTTGATTGAATTTGCTAATTTGGAAACAGATGTTTCGTTTATGAATGTTTGAAAGAAAATGAATTTAAAAGGATATGGGCATTGCTGATTGAGTATTTTTGTATTCATTTTTTGCTTTCTTCGATACTTTTTTTGTGATAAAAGATTAAGTGATGGTTGATACTTTTGAAAGAAGTAAAGTTTGACATTCTAGCATGAGGCATCAAAACTGTTCTTTTTTTTCCTTTTAATTAAAGATTTTTAGTTTATCTTAGTAATTGTTTGCAATAGCTTCTGCTTGTTAAAGTTTCTAATGGTTATGTTTTCTACATCATTGTATACTAGGTCTATTGAAAAGTACAGGCGGGTACCCTAAATTCCAGAGGAGAGGAACACACACTAACACTTCTGGATTGACATGGACTCTGATCATGACCCTCTCAAGACACCAAGCCTAAGGTCACATTCTCGTCCTTACTATGTTATTTTTAGGTTTCTTAATTGCCAATATTCTTTCCTTGTTTTTGTTTGTTTGGTTTTTACAGATCGACATGAAGTAAAAACAACATACCTGTTGTCCTCCTCTGTTTTTTGGTCTTTAATGTCCATAAATTGCTTCAGAAGTGTCATTCTTAGTTTGCTCTGTGCTCACCGCATGACTCCGTCAGAAGTGTCATTCTTAGTGGAACTCAAGAGAATCAATGTTGTTTATTTCATGTTTTCTCACATCCATGCTTGACATTACCTCTTTGTATTTGATGAAGACGATAATTTCGTCAACATCAACCATACATTTGGGCTTTCAAAGAGGTTTTTGATCTTGTGATTTATACCAATGCAGTGATCCTCTTCTTTTTTCTTTCATGTTGCTTTGGCCTTCTGTATCAATTTTTTTTTTTTTTAACTATTCTCTTGGTAACATCTTAGTTAGTTGGAATCCCTTCCTTGATAAGGAGTGTTTTGGAGAGCTCTTTTGTAATCCCTTACCTTTTGGCTTTTTTTTGTTGTTAATTCTAGTGGTGTACTATCCAATAGGGGTAGATAAAGTTTTAGGACTACCAGTTGATAAATGGGACATGTAATCCTTCAAACATTAGGAAATGCCGGTGGCAGTTTGTTGAACTGGCCTAAAAATGCTTACAAGAAATAAGGATGGTATTATGAGAATATAAACCTAAAAGTACGAAATGACTGCTCAGGTTCTTTTCCAGCCATGATCAACCTCCCATCCTCTTCAGCCTCGAGAAGTATGGTGGACTGATCCTCTTTTCTAATGGTGATTATCATATTGGAGATATTGTCCGAATTAGTGGCAGTCTGGCCATTGTCTACCAAGGCTATATAAAAAATAAGATTGCTAACAGTTTGACAGTTGACTCGTATCATGGAGAAGCAACCCTGTCTGAAAAGCAAAGCTTGTTATCTCATACTTCTAACCTTTATCTAATGGAGACAAGGATCTTGGAGTAAATCTTGGAGAGATTTTTTATTAATTGTGTGTAATGAGAGAGATAAGATATCCCTTAAATAGATCCAAGAGATAAGGTTTGTTCACACCTAACTACCTAGTTAGCTAACTAACCTAGGCCCCCTTCAATAGACCGAAGGGGAAAAGTGCTGGAACACACTTAAACTAAAATTAAAACATGAATTTTTAAAGACAATTCAACAAAAGGTAGAATTCCCAAAATGCCCCTCTATCATATTTCCTCCCCTTACAAAGAATAACTCGTCCCGAGTTATGTTGTGAGCTTGAATTCATCTGGAGCATAGTAAGGGGTCAGATCAGCTACATTGAAAGTGGAACTGATGTTTAGGGTCTTTGGTAGGTAAAGCTGGTAGGCATTTTCTCCAATTTTGTTGATAACTCAAAATGGTCCATGTTTTTTTGGACAGTTTGTAGTGAACGCTTGCTGGAAGTCTTGTTTTTTTTAGAAACACCATGACCTAAGAAGGATGGACACTTTACTTTGGATAGCATGTCTGTGTCCTACACATGTCGGACACTTGGACACTCCGATACTTGGTGGACACCTATCGAACACTTGTTAGTGCAGTAAATGCATTGGACATACATAGAACACTTGTTAAGTAGACTAAAAAGACACATATATGACAATAATAATAACCTTTGAGTATAGAATACATCAAGCTTTTTGAGTATAGAATACATCAAGCTTTTTGAGTGTAGGATACATCAAGCTAAGTTTTTTAAGCATATAAATGCATCAATGCATTTACTATGATTTTTTTTTACTATAAAGATGATATATATATATTTTAAAAGTACTTTTTAATAAACGTGTCTTTGTCGTGTCGTGTCCTAGATTTTTAAAAAATGGGGTGTCATTGTGTCCATGTCTCATATCCGTGTTCGTGCTTCTTAAACCATGACTAGATCTTCTGGTTGGAAACTTCGAAATCTTTTATGTTCATCAGCTTTAGTTTTGTAGGATGCATTTACTTTTCTAGGTAGTCTATTACTTCTTGAGGTAGTTGTTGTATTCTGTCCATCATATGCTCCGCTTCTTCACTAAGATCAACTGTAGATGGAAGATCAGTAAAATCAAAAGTGAGATGGGGGAGTTTAGCATAGATTATCTAAAATGGACACTTCCTTGTTGATTTGTTTTATGTTATTGTAAACAAATTTGGCTTGAGCTAGGCAAGTGTCCCATTACCTTGGTTTGTCGCTTGAAAGGCACCTTGGTTTGTCGCTTGAAAGGCATCTAGGAAACCGCATCTAAGTAGGTTTCCCAGTGTTATGTTAGTGACCTCAGTTTGGGCATCCATTTGTGGGTGGCTAGTTGTGCTGAATTTCAAGCTTGTTCCAAACTTTTTCCCAAGACTTTTCCAAAAGAAGCTCATAAATTTGAATCTCTGTTTGAGACTATAGATTTTAGAATTCCATGTATCTTGACAATTTCTCTAAAAAAAAGGATTAGCAATGTTTAGAGCATCTGAAGTTTTCTTGCAAGCAATGAAATATGCCATTTTACTGAATTTGTCCACCACGACAAAATGAATTAAAGCACATTTTGTAAGTGTGATGAAAGTTCAATTGGAGTTGGTTTTTTATTTCCTCGGTAGGTTTCAATATAGGCCAGATGACAGATTGTATTAATATTAGATGGACTTCAAAATCAAGCTGGGTTTGGAGATGATTTTTGGGTAAAGATGTGTGGTTCATGGGCTGAGAAGACCAAAATTTTGTTTGTAGATTGGGTCATCTGGGTTGGGTTAAAAGAAGTAAAATGGATGGATGAATTGTGAAAATAGTTATGGAACTGGGTCTGGTAATGTGCCTTTTGGTTGGGTCAGAAGGAAAAATTTGGGAACAACTTTTACAAAATCGTTCATCTCACACTGTTGCTTCATCTTCTTCAATCTTGTGAACGATTTTGAAGGGGATGTCAGCTGTCTTCCACTTATTTCCCTTGTCAGTCAAGTAATCTTCTTCATCTTCCTTCTTGGTCAAAATTTTCTGTTCGTTCTCTTTCGCTTTGCCAATGAAGGATCCAAAAGTGTTTGATTTGATATACGGCATCTCCCTTGTCGTTTGGTAGCCTACTGGAATTATGTTTTCACTTTTCTTCAAATAATTTTGTTTGTGGGTTTAGGGTTTAGGGTTTAGTCTCTCTCTTTTGATAGTTCTCTTGATAGATGTTTTTATTTGATTAAAAGTAGAGATGAAACACAATCTTATGTGGAAACCCTCGTAAAAGGAGAAAAACCACGGTGCTGATAATTTTATTTTCTAATATTCATAAAGGTATAAAGGGGAAATATTTTGTTAGGATTCTTAACCTGAAAAGCAGAAAAATCCCCAACAAGACAAACAAAGACAGCACTCTGATATTCTATATTGATAATAAAGATCAAAATTTACACAACCTGATATCAAAAGTCTAGAAAACAATAATAACAATAATGTCCAGCACCTTTGAGAAGGCTGGAAACTTCTCCTATGAAGGCTATGAAAGCTTTCACCAAAAAGACTATAAAATCTAAAAACAGCAATCCCGAAACATTCTCTATAAATACCCATTACCCACAGTGGGCCTCTATGATGTTTTCTTTTCTCTCTTTGCTCTCTGAGTTACCATCCGAGCTTTCAATTCCCTTTTCTGCCCCTTCTTTTAGAAGTATGCAGTATTGGGGGTCTAACATATTTATAAACAACACGAGTCTAATTAATAGAATAAAAAATAGGGTAAAGTAAACTACGTAACTACCCTTGGGCTTTCAACATGACCTAAGCCCATTATTTCTAACACTCCCCTCAATTTGGGACGCAAACGTCATAAAGACCCAACTTGCTAACACACAAATCAAAGCTTTGTTTGAGGAGCCCTTTTGTGAGGATATCAACAATCTGTTGGCTTAAGGGTATGTAAGGAATGCAGATGCTACCATCATCAAGCTTCTCTTTGATGAAGTGTTTGTCAATCTCCACTTGTTTAGTCCTATCATGTTGGACCGGGTTATTGGCAATGCTTAGGTGGCCTTTTTATTGTGGAATAGCTTTATAGGCATCTCATAGTCTTGACAAAGATCAGACAATTCCTTCTGGAGCCAGATCCCCAAATTCATAGTCTGTATTCAGCTTTAGCGCTGCTTCTAGCCACAACTCCTTGCTTCTGACTTCTCCAAGTAACAAGATTGCCCTACACAAAAGTACAGTATTCTGAGGTAGACTTTCTATCAACAATAGATCTTGTCTAGTTGGAATTAGTATAAGCTTTAACACATCTTCTGTCAGTCTTCTTGAATCTCAGACCTTTACCAAGAGTTGCTTTTTAATGCCTCAAAATCCTGTTAACCGCTTCCATGTGGGCCTTATAAGGTGCTTGCATGAATTGATTGACGGTGCTTACAATGTCAGGCCTAGTGCGAGATAAGTAAATCAGCTCTGCCTCCAAATGTTGATATTTCACTTTATCAATAGGAACCCTGTCACCTGAATTTCCGAGTTTGGCATTGAACTCAATGGGTGTATCAGCAGGACGACATCCTAGCATACCCGTCTCAGCTAATAAATCAATGTTGTACTTTCTTTGGAACACTAAGATGCCTTCTCTGGATATAGCAATCTCCATCCTGAGAAAGTACTTCAGAATTCCCAAGTCTTTGATCTCAAAATCAAACCCCATCTTCTTCTTTAGTTGAATGATCTCATTAGTATCGTCCCTAGATAGCACAATGTCATCGATATAGACAATCAATACTGCAATTTTCCCTGTCTTGGAAACTTTGGTGAACAAGGTGTGATCGGAATGCCCCTGATTGTACCCCTGAGACCTGATGAGAGTGGAACATCTTTCAAACCGTGATCTTGGTGATTGTTTCAACCCGTATAAAGACTTCCAAAGCTTGCAAACCTAATTATTTAATTGAGCTTCAAATTTTGGGGGCAGTCACGTACACTTCCTCTTTTAAATCTCCATTCAAGAATGCATTCTTAACATCAAGTTGATATAGTGGCCAATCCTTATTATATGTAATAGACAACAAAACTCTAACAGTGTCTAAATTTGCAGCAGGAGAAATAATCACAGAATAGTCAATCCCATAAGTCTGAGTGAATCCTTTTGCAACTAGCATGGTTTTATGTTTGTCAAGGTACCATTTTCTTTGTACTTGAATGTAAACACCCATTGCATCTGATAGTCTTGTGTCCCATTGCATCTGGTAAACACCCATTCCAAAAGTGTTCGCTTTTGTTCTTTTCTCGGGCTCTCATTTCTTCCATGATAGCAGCTTTCCACTCCAGACACTCTAAGGCAAGGTGGATACTTTTGGATATTGTGGTAGAGTCAAAACTTGTAATAAAAGCTCTGAACTGTGGTGAGAGATTCTCATATGACACATAATTAGAGATGAAGTGTGCTTTCTACATGACCTAGTACCTTTCGTCAATGCAATAGGAAGATCAATAGATGGATTATACCTACTGATATTTTTTGAATGGCCTGAGTTAGTTTTGTTTTCAATATGTTCAACAACAACCTCATTCTCATCAATCATGTCCTCTCTGTATAATGACCCCATCAATACAACCCTGCTCATCCATATCTTCAGGAACAACTATCTCAGACTCGTCATTCTAACCTGCTTTACTGTCAGTATCTGAGTCAGTTGAATTTATCTCGAGTCTGTCATTCTCCCTCATTCTGTTATAACTATGTGAGTTAGTGGAATTAGTCATACCTTGTTCCCTTATTGGTTTAGAACCTTAGACTGGAGCCATCTGAATAGCAGGAGACTCAATTTCCTTTGTGAGATTCTTCCTATAGTAAGTTTTCCAGGGAACTTGATTTGTGGGTAGGACTGTATTGTGAGGACTAGGGGTCAAGTAAGGTAGCCATAGGTAAGGTAGCCACAGTAGGAAAAGTACAAGTAGACTCTAACGGAAACATATAGTTAGACTCTTCACTAACAGTCCCCCCTGAAGTAGGCTGATGGAAAGAAAAGACAATCCTTAATAAAGGTGATATCCATGTTTACAAAGCACTTACGTGAAGGTGCAGAGGATACCCGACAAACATGCAAGCCTGAGACTGAGGAGTGAATGTAGTTTGGTTAAGGCCATGGCTATGAGTATAGGCTGTGCATCCGAACACCCGAGGGGGAACATCAGGAACGAGATGGGTGGAGGGGTGAGATTCTTTGAGACAATCTAACGGTTTCTAGAGGTAAAGGACACGGGAAGACATGCGGTTGATGAGAGAGCACGTCATCACCCCAAAGATAGGAAGGAAGAGAAGTGGACAACATAAGAGAACGTTCAACTTCCAAAAGGTGATGGTTCTTTTGCTCGAGAACCTGAGTGTAAGCAAAGGAACTTTGGTTCTTAAGCCACCAATTCAACTAAAAAACTTAAGCTAGTGGTTGAAGGAAAATTTAATTATATATCACTAACACTTCCTCTCATATGTAAACTTGAAATATTTGAAAGGCCCAACAAGTGAAAATCAATTTTAATTGGAGAGGAAACAACAATGCAGGGGCTTGAACACAGGGGTTCCCTAGACCACTCGATTTGATACCATCTTGAATCACCAATTAACCAAAAAACTTAAGCTAGTGGTTGAAGGCCAATTTAATTATATATCACCTAAGAAGCACAGACACTTCATATTAGAGGTGGTATTAGTGTCAAACACTTGGGGGACACGGATTTGTCCAGACACGTTGGGGACACGTGTCCGATACGCCAAATTCCATGTTCTATTTTTTCTATTTTTTGTTTTTCGGACACGTCTGGACACTCCCAATTGCAATTTTTAGGTGAAGCCCAAACTGGCAAGCGCACTTACTTTAACGTTAATATAATAATAATATTTTATACTTTATTAATGTGTATATTAGCCCTAGCCCAAGCCCAGTTACTTTAATAAAGACACAAAATATAAAAATATTATACACATTAACATAACCCTCAAATCTCCTTGTTGTGCACTTTCCCTCCCTTCTTTCATCCCCATCCATGACATTTTATGCTTTTTTTAGTGAAGAACATTTAGTTTTTTTTTTTTTTTTTTTAGGAAATGAGGAACATTTAGTATTTTGTTACAAATTTACATATACTATAAAAAAATTAATTTTAAAAAATAGTGTATCCCCAACGTGTCTGTATCCTATTATTTTAGAAATTGGCGTATCGCTATGTCCTGTCCTGTCGTGTTCGTGTCCGTGCTTCTTAGTATATCACTAAAGGTGTGTTTGGGGGAAGGGTTGAGTTATGGAGGGTTAGAGTTATGATAAACCAGTGTTATGATAAATCTAAGATTATGATAAGATGTGTTTGGAGGAAGGGTTATGAAAAAAGTGTTACGATAAGATGTGTTTGGGAGAAAGATTATGTGGGTAGGGTTATAGTAGTGTTATAATATGATATGTTTGGGGGAAGGGTTATATGGGTAGTGTTATGATATTTTTTATTTTTTATTTTTTTAAATTTATATTATAGATCAAATACACAAATTTCTATACTTAATTTACTCAATCATCTTAGTTTTCGTAAAATAATTTGCCTAAATTTCATTAAATACGACGCTCATTTTCAACATTTTTTTAGTAATCCTTACTTGTGGATAAATTTTGTTATAAAAAAATTCATATTATAAATGTAATACATAAATTTTGTATATTTAATTTATCAATTTGTCATAGTTTTTGTAAAATAATTTGCCTTAATATCTTTAAATACAATGTTCATTTTCAACCTTTGTGCAATTTTTTTATAAAAAAAACCATATTCTAGATCAATTACAAGAATTTCGTATATTTAATTTACCAAATCGTCCTAGTTTTCGTAAAAAAAATTCCTTAATTTCTTTAAATACGACATTCATTTTCAATCTCTTCTAAGTAGTCGTTAGGTGTGAATAATTTTTTTTTTAATTCATATTCTAAATTCAATGACACAAATTTCATTTATTTAATTTACCAAATTGTCCTAGATTTTGTAAAATAAGTTGCCTTAATTTATTTAACTACAACGTTCATTTTCAACATTTTGTAAGTAACGTTACGTTTCAATAAATTGGTTTTTATGAATGTGACACGTGATCCATAAACATGTAAATAAGTGCAAATAACCATCGAAATTATGAAGTTCTGAAAACAGAAAATAACAATCAAGGCTTTTACCAACTCTAGTTTTGTTTTCTCGTGGGAAATTACAATGAATCTAGTTTTAGGTTTCTCTTCAATTTATTTTAGACTACTTTTAAAATATCAACTTTAATTTTTATAATTTTTAACAGTGGTTTATATACTTTCAAAATATTTATTTTAGTCATCATGCAATAATGATTAGGTAAAAAAACATGAATCTACCATCTCTATAACAAAAGATTTGTTATGTTAATTTAACAAAACTTAATTTAATAACCAAAATAACATTATGAAAGTGACGAAATATAGACAAACCACCATTTGTTGAAGATTTGTTGAAATTACAGTCCTCACTAATTTTTAATTTTATCAAATAGTCCTTATAGTTTGCTCATGGTTGTAATCAATTTATGGTAATTTGTTAAAATTGTACCATTGTTTTTTTTATTGAATTGATATTCCTTAGATATTTAAGCTAATTAACAAAATACAATAAAGTTGAGATTGATATCAAAATCTTAATTTCACAAATATATTGATGTGTTAGAACATCTAACATAGACTTGATTGAAGGTAATTAAATTAAAATTTTAATTGTGTAATTTACAATGAATTATCTAATTCAAGAATAAAGGAAGAAATTAAAATTTTAATAATGTATATAAAACTATAAATGAGAAAAAAATAGTATTTAAAATAAATGAATTAATTATAAATGAATAAGTGTTGAGATAGGGTTGGAGAAGGGTTGAGGAAGGATTGAGAAAGGGTTAAGAAAGGGTTGTGGAGAGGGGTTAAGGGTAAAACTAGGGTTATGATAACCCTTCCTCCAAACATGGGTTGGGTTACATAACCCAACCCCTCCCCCAAACATACCCTAACAATTTGAATATTTTTGTTTTCTTGAAAATGTTTCGACCTCCAAATTTGTGTTGGAATGATTCATGGGTGTTCTTCGATGTTGGTGATAATAATTTCTTTCACGCTCTCAATCATTTTATGGATCATCTAACTCCCAATCTTAAATCGATTTGTTGAGAAACCTGTTTGATTGGATTGCTCTTTGTTGACAATGTTGTCTTGTTATTATGTTCTAAAATGCATTAGAATTTCAAATGTGGATAATGATAGGTTCTTTGAGAAAATCAAGCACGAGTGAAATTATTATGTTGAAATTATTATGTTGAATTCTTCCTCTGAATCACTTGAACCAGAATTCCAATACTGTTTTTGGGAGTAAATTTGACTTTCTAGCCTATACCAACTACTGTACTTTTTCTTCGTCGTCTTCCTTGAGCAGTAATAGTCCATTCTTCCAATCTTTTTGGTGAATTTTTTTTTTGTTTTTTTGTAATTTCAAGGGTTCCCCCTTCTTAGATGTGTGTTGTGGCAATGAGCACAACTCACAAAGCTCCCTCTTGTGATCGGTGTCATGAAGAGTTTCTTGGTCTTCCACCGATGACAGTGAACATTGGCCTAGACGGCTGCTCTGATACCAAAATTGATGTATACCAAATTAGAGAATTTTATTAATGTTTTAAAAGTTGTGAATACAAGAAGGCAAATATTCTGTTTATAATTGAAAGAAAACTAATTCTAATCTTAATTAATAAAAGAAACTAATCCTAATCCTAATAGATTAAAGATTTGACCATAATACCCTATTCCTACTGCATCATTATCATTGTTGTAGGTACAAATTCTAGATAATATGATGTTTCTATGGCAAACAATGACGTTAAATCATGCTGTATTTTGCTAGTGGAAAAAGCCGAGGGGTATTTATAAGATTACTGTAAAGATTTAGTGGATGGTTCTACACAATATTTTTTCATATTGTTTGGGAGACTGAGTAGCAGATCATCAAATTTTGGATCAAGTTTTTATAATTAATAAGGCTATAGAAGATTATAGGAGGCGTAAACAGGAAGATTTCATTTTGAGTTTTATTTTCATGGTCTATGACTGTTTGGAACTTCTTGGATAAGATGTTGTCAAAGAAAGGTTTTGGGATTAAATGGAGGGTTGAATGTGGTGTTGTATGGGAAATGGTATATTCTTTATCTTGTCAATAAAAAACCTTGGTGGTAGGATCTTAGCATCAAGAGGCCTAGGACACGATAATTACTAATGTCTTTTCTCTTCCTTTTAGTTGTCAATATTCTGAGTACAATGGTGCTCTTAGTAATAATCAAGGACATTTTTTTACTAACTCTCTTCTTGCCGCCTAAAGGGAATGCCTTGGTATAAAGCAAATTAAAGAAAGTGTTTGGGAAAGGAAAGAACAAAATAAATATTGGGGTTCAAATCCTATTTTAGTCCCTAAACTTTGTAACTTGTTCTATTTTGGTCTTTAAACTTTGAAAAATATTTGTTTGTCCTTGAACTTTTAAGAAGAGCTTACTTTGGTTCTTGTTGTTCGAATTCTATTAACTCTTTAACAAAATGATGAGGTGGCTTCTAATCTTTGGATGACTAGACTCTAGATTCATCATATTAATTAAATTGATAAATAACCAAGGATTCTTGCTAATTTATTCTATAAATATTTTATTTCAACACAAAAACATAAACAACACACTTGATTCAAATAAACCAAAAAACTAACTCTCTTTTTTTCTCCAAAACTCAACTCTCTCACATCTCCTCCCCGTCTCCTGCATCCTTTAACTTTGAGTAGAATTAAAAAAATAAAATAGAAGGATCAGATAGGTTGTCATGTAAGATTAGCCGAGGTGCGCATAAGCTAGCTTGAATACCCACAAGATATAAAAATAGAAAAATAGAACTCGCATATTGGTGTTAACATCCATGGTAAGGTGAAATGAAACTGAAAACAACCCATATAGAAAATAAACTCTACGAAATTGCAAATAAACATTCCTACAATTAATCAATTTAATTACTATGACGAGTTTAGTGCCTGGTTATTCAAATGTAGAAGCCACTTCATCATCATTCATTCCGTTAAAGAGTTAACGAAATCCTAATGGCAAGGACCAAACTAAGCTCTTTTTAAAAGCTCAAGTGCTTAAGCATACGTTTTTAAAAAAGTTTAGGGACCAAAATAGAACAAGGTACAAAGTTTCGAGACCAAAATAGGATTTAAACTTGTATTGGCGTAATCTTATCGCCAACTTTATTAAGAAATGTTTTTGACAATGTTGATCGTCTCATGCAAAGTGATAGGTGACCATGATTATAAAAGGGGGATGATATATCTCAAGAAGACTGAGTTACTAAATAGATTATTGATTGACTATTATAGGATCATAATTTTTATATTGACCTTATCCACACAAGGCTAGGATGGGGTGTGCTCTTAATTGCTAAAGTTCCGTCAATTCATCAAGAATTTAAAGAGGCAAAAAAGGTGGGAAATGCTATTTGCTTTACATTCTTGCAACAAAAAGGAAAGTATCTAATCCTATTCCGAATGTTCTATACATCTTTTCTCTTATCTGCAAAATTTCTTTTGTTTTCCAACCAAATAGCTGAAGATGGTTATAACTCGGAAATGACACATTTTCAAAACACTTCTTTTTTTCTTTTTATTAATTCCACTAGCAAAAATATCCTGACCCAATTAAGTATACACGTCTTTGGAGAGAATTTCATTTTGTGCATCTTTTAATAAGTATTCTTTAATCACCTACGGTGTCTAATGTTGTTGACTGGCCGTACTTTATTTGACATTTTACTGTTCTAATATTTTAGCTAAATTTAATTTTTGAATTCCCTTATTAATGAAGAAGAATGCTATCATGGAAACTACCATCACGACCATGGAATGTTTAGGGTTGTCATCTTTTCATTTAACCCTCTGCTATGGTTTCTAATTTTTTTTTCATTTTGAGTACTTGGCTGTGAACGTGAGCATTATATTGTACATTTTGTTTACAGACACAAATGCTCAGCATGCTACAAGCAATATAAGAAGAAAGAGCATCTTATCGAGCATATGAGAGTTTCATATCACTCTGTTCATCAGCCTAGATGTGGAGTCTGCCTAAAGCACTGCAAATCATTTGAATCATTGAGAGAACATCTAATGGGTGAGATAGTTTTGTGTATCTTTATGGTAATATATCATAACACCATCTCATTATTTCCAGTCTTTTAGGTCCACTTTCAAAATCAAATTGTTCAAAAATTTTCTCAGAGCAAGGTTGTGGGCTTTGTTTGAGAGTACTTGATGGTCCGGAATCTCTCAGTGATCACCAAGACATTTGCTGCATAACAGCACCTGTTCACCAAGTAAGTCGCTCTCTATGCATCATATATCATATATGATAAATAATAAGGTTACTGGTAGTTCTCAAAAAAAGAAAAAGGTTTCGGGTAGTTTATATTTTTATATATCAAGTAATTGAGCCATTTATTATGATATCATACATGCATTTCAGAATTTGATTGGGGCTCTGTTTTAGGGAACAAGTCTACCACCAACTGATTTATCCGATTGTTACGAAGAAGATCGTTCTGATCGAGGCCTTGGAGCAATAGCTATTGATTGTGTAATGGCTGGTGGAGGAAGTGATGGTGCACTGGACATTTGTGTTTGGATATGCCTTGTTGATGAAGATGAGAAATTGATTTTCAATACTTTCGTACAACCACAAATACCGATCACGAACTATAGGTACTTGTCTGTGGCATTATCTTGTTCTCTAAAGAAGGTTTTAGCTTTTAAAATCTAGCCTATAGATGCTCATTTCCCAGTCATGTTGTAGGCATGAAGTAACTGGGCTAAAGGAAGAACATATGAGGTATGCCATGCCACTGAAAAATGTCCAGGAAAAAGTATTGAAACTCTTGTTAAATGGAGAATCCATTGGGAGATTGAGATTGAATGGTGGTAAAGCTAAGCTTCTTGTCGGCCATGACTTGGAGCATGATTTAGATTGCCTGAGATTGAATTATCCCGATCATATGTTGAGGTGATTTTTACTAACTTGGTTTGTAATTTTGTGAGGTTTAGTTCACTTTTATTGAAAGTAAAATCCATTTGAACTTTGAAGTGATCTACTGAATATCGATGCACCAAAAAAAGAAAGAAAAACACTATCAACTACTACTTTTCAACCTTGCACATTATTTTCTCTGGAAATAATTCCCACCATTGAGAGAAGCCAACGTATTTTTGCTCTTTTTTCTTTAATCATTTTTCCTTTTTGTTTCGTTTTTTTATTGTTAAATGCACATTATTCATACACCTTTTGCCTGATATTTTTTTGCAGGGACACGGCTAGATATCATCCATTGATGAAAACAAATTTGGTTAGTCATTCTCTCAAGTACCTTACTCGAGCATATTTGGGGTAAGATACATTAACTATGATTCATACTTCTTTAAAGTATGATAAGCAATTTTTTATATTCTTTTTCGAAACGGAGACAAGAACTTCTTTATTAATATGAACTCAAAGTACAAGAGAGTTATACGAAGAGAGCCATAAAGAAGTAGTAATCAATGGAGTCCTAGAGGGATCAGAAGGTGCGGACATCTCAACTTGGTTGACACCCCCTTAGCGCCAAACATCATATCCTGAGGACAAGCAAGCAAAACAATAAAGAAACAATAATAAAAGCATCAGCTTAGTACAAAGGTCCAAAAAACAGTATGGGACCGGAAGAAAACAACAAGAAACTCGGGAGTGCAGAAAACAAAATAAAACAGGGGGCAATCCTTTGAGGCTTCAAAAGCAGTAGAGGGCATCGGTCTGTATAATATGCATTGAAACTGAGGCAGCTTAGGATTAGGCAGGTTGTGAAAGAAAAGCAGCCCAATTAAGGTAAATGTCTTCGATGGAATAATTAACGAACTCCTTTTTAAGGGAACACCAAGCTGCAGCTTTAAGTTCTGCCCCTGTTTTATTTTGTTTTTTTTATATTCTTTCTTTCAATAGAAGCAATTATATTAATAAAATTATTAAAAGTAGAGACACAGAACACACAAATTTACTTGAAAATCTGAGTACTGGGAGAAAAACTATAATATTTTCCTTCTTACTATTTTTTGATAATAATAGTGGTACAAAGGGGGAAATATTTATAGGCAACATGACCTAAGCCCATTATTTCTAATATTCTCCCGTTGGGACAAAAATGTCATAAAGACCCAACTTGTTAACACACAAATCAAATTTTTTTCTAAGGAGTCCCTTTGTGAGGATATCAGCAGTCTGTTGACTTGAGGGTATGTAAGGAATGCAGGTGCTACCATCATCAAGCTTCTCTTTGATAAAGAGTCTGTCAATCTCCACATGTTTAGTCCTGTCATGTTGGACCGAGTTATTTGCAATGTTAATGGTTGCCTTGTTATCGAAGAATAGCTTCATAGGGACCTCATAGTATTGACAAAAATCTGACAACACCTTTTGGAGCCAGATCTCACAAATCCCCAAACTCTTAGCCCTATATTCGACTTCAACTTTACTTCTAGCCACAACTCCTTGCTTCTTACTTCTCCAAGTAACAAAATTGTCCGACACAAAAGTACAGTATCATGAGGTAGACTTTCTATCAACAACAGATCATGTCCAGTCATAATCAATATAAGCTTTAATACATCTTTTGTCGGTCTTCCTAAATCTCAGACTTTTACTAGGAGTTGCTTTTAAATACCTCATAATCCTATTAATCGCTTCCATATGATCCTCATAAGGTGCTTGCATGATTTGACTGACGATGCTCCCAGCATAGGAGATGTTAGGCCTAGTGTGAGATAAGTAAATCAACTTTCTGATAGACCCCCTATCATACTGCAATATAGTAGAAGGAAAAAGAAAGAGACGAAGATTTCGCTCATGCTCTTCTTTGAGAGAGAGGATAGCAGAGAGAGAGTTAGATTGTCGACCAAATTGAGATCTCCTAAATTCGATTGTTCTTTTCGTGTGATTTTTTCCTAGTAATTCATCTTTCGAATTGAAGCAATAAATTTTCTATAACTAAATTCCAAAAGGAGTTCCATCAAATTGGTATCAAAGCCAACTTTTCTGGGCAAGAGGAATACAATGGTTCAAACATGCATGGAGGAGAAAATGGAAGCGCATGATCAGGAGATTGATAGACCCCAATACTATATCAATATAGTAGGAGAGGAAAGAAGGGTAGTGGCACAACATGTGTGTAGAGTAACACGTGAGTTAGGTTAGTATAAATAAAGAATAAGTGTGGGCGGGAAAGGCAGTTAGAAATTGAGAAGGAAAAGGGTTTCTGTTCTTCCTTGAGAGATAGGATAACAGAAGAGAGTTTCATTTGTAGTATTTGATCATATTAGTGTGTGTAATCTAGATCAGATTGAATCTAATAACAATAATAATAGAAGTTCTATTCTATTAGTTACTGGATTCCATCAGAGATGCAATCGATGAGAAAAGAAGTGTGCAAAATTCTAGCGATGGAAGAAAAATTATCATCGATCAGCGTACAGACGAAAAGACTCACCAAATGCTGATGATGTTTATGGAATCGATCACCAAGGAACGTACCGCTATGAGTGAGAAGATGGTCGTGTCTAGTGTGCAAGAGACAGTATCGACGATAGTGAATAGGAGGGATGGCTCGACAAGAAAGCGCCATGAAAATGAAACAAAGGACGGGAAGGTTGAAGGAGAAGATGGAATGAATGAGAGGAATAAGTTCAAGAAGGTCGAGATGTTGGTATTCAACGGTGAAGATCCTGATTCATGGCTTTTCCGTGCAAATAGGTATTTTTCAGATACACAAATTGACTAATGCTGAGAAAGTGTTGGTTGCGACCATTAGTTTTGAGGGCCCACCGTAGAACTGGTATAGGGCGCAGGAAGAATTCGAAAAGTTTACCAATTGGTCGAATCTCAAGGATATGGTGGAGGAATATCGTAACCAATTCGATAAATTGATGGCACCTTTATCCGATCTACAGGACAAAGTGGTGGAAGAAACATTTATGAACGACTTATTTCCTTGGATTAAAGCGGAAGTTGATTTTTGCCTTTCGGTCAGCCTAACTAAAATGATGCAAGCGGCTCAACTCGTGGAAAATCGGAAGATCATTCGGAATGAGCCTAACTTTACGGGGTATGCTAGAGGTAAGTATCCTTCCCAAAATTCTTTCAACAATAAGAATAATGCGACAGTGAACATTAGTGATAGCAAGGAAAACATGTTTCCGATGAGAACGATTACTCTGAGGACTACATCTGGAGAAGTTAAGAAAGAAGGGCAGTCGAAATGATTGTCGGATGCGGAATTTCAGGCGAGGAAGGAACATGGACTTTGTTTTCGTTGTAATGAGAAATATTCTCATGATCATAGGTGTAGGAGAAGAGAACAAAGGGAGTTGAGAATGTATGTGGTCAAGGCTAATGATGAAGAATTTGAGATCGTAGAAGTGGAAGATAATGATAAAGAGTTGAAATGTGTGGAAGTCATAGAGAAAGACGATACTGTTATTGAACAATCAATTAATTCGGTGGTGGGATTAACCAATCCGGGAACGATGAAGGTGAGGGGGAAGATCAGCAAGTGAGAAGTTATCTAATTGATTGTGGAGCGACGCATAACTTCATCTTCGAAAAGATAGTGAAGGAGCCACGAAGTCAACATCACATTATGGTGTAATTTTGGGTTTGGGTGGGGCGGTGAAAGGTAAAGGAATACGTGAAGAGGTGGGGATCAAATTGAACGGATGGAAAGTGGTAGCAAATTTTCTACTCTTAGAATTGGGAGGGGTGGATGTAGTGTTGGGAATGCTCTCTGGGTAGGACAGAAGTAGATTGGAGGAACCTAACAATGACATTTATGCATCTAGGAGAAAAGATAGTGATCAAAGGAGACCCAAGTTTAACCAAGTCCAGAGTGGGTCTCAGGAACATGATTAAAACGTGGAATGATTCCGATCAAGGATTCCGAATCGAATGCCAAGCAATGGAAAGAGTTTATGAACCAACAGAAGCGGATGGGATTGAAGTATTAACGGTGCAAGAAGCAGTATCTGTGGTGTTGAAGAAATTTGGAGATGTTTTTACTTAACCGGAGGCACTACCCCCTCAAAGATGTATTGAACATCATATTCACTTGAAACAAGGCACTAATCCAGTGAATGTACGACCTTATAGATATGCTTATCAACAGAAAGCAGAGATGGAAAAACTCGTTGATGAGATGTTGACCTCGGGGATAATTCGTCCCAGCCATAGCCCATATTCTAGTCTAGTTCTGTTGGTAAGAAAAAAAGATGGAAGTTGGCATTTTTGTGTAGACTACCGGGCGTTGAACAGTGTAACCATACTAGACAAATTCCCTATTCTTGTGATTGAGGAGCTTTTTGATGAATTAAATGGAGCGAAATGGTTTTCAAAGATAGATTTGAAGGCGAGATACCATCACCTCAGAATGTGTGGAGAAGACATAGAGAAAACAACATTTCATACACATGATGGACACTATGAATTTATGGTTATGCTGTTTGGATTGACTAATGCTCTGTCCACTTTTCAATCCTTGATGAACTTCGTATTTAAACTGTTCTTGCAAAAGTTGGTATTGGTGTTCTTCGACGATATATTAATCTACATTGCAGATTTGGAAAATCACTTGAAGCACCTTGGATTGGCACTGGAGATTTTGAGAAAAATGAGCTATATGTGAATCAAAAGAAGTGCAACTTTGCAAGAGAACGTATAGATTACTTGGGCAATATCATTTCGGGTTGAGGAGTAGAAGTGGATCCTGAGAAAATTAGAGCAATCAAAGAATGGCCTACACCAACTAATGTAAGAGAGATACGGGGATTCTTAGGCCTAACCAGTTATTACAGAAAATTCGTCCAACACTATGGTTCCATTGCTGCACCATTGACTCAACTGATAAAGAAAGGGGGGTTCAAGTGGACTGAGGAGTCTAAGGAGGCTTTCCAACAACTACAAAATGCCATGACAACATTGCCTGTTCTGGCGTTGCCTAATTTTAGCGCCACATTTGAGATGGAAACAGATGCATCTGGATGTGGAATAACAGATGCATCTGGATGTGGAATAGGAGCAGTCCTCATCCAATCCAAGCACCCCATTGCATATTTCAACCTTACGTTGGCATTAAGAGATAGAGTCCAGCCTGTATATGAAAGTGAATTAATGGCAGTAGTCTTGGCTGTACAACGATGGAGACTGTACCTACTGGGGAGGAAATTTATTGTGAAAACATATCAGAAATCCTTGAAGTACTTGTTAGAACAGAGGGTGATACAACCCCAATATCAGAATTGGGTAGCTAAATTATTGGGTCACTCATTTGAGGTGATCTACAAACCTGGATTGGAGAAGAAAGCAGCAGACGCCTTGTCACGAATCTACAGTTCATCTTGGTAGCATAACAGCTCCAACGTTGGTAGACATTTTGGTGGTTAAGAAGGAAGTCGAAGAGGATGAAAAACTGAGCAAAATAATGGAAGAGCTACAAACGATGGAAGGAAGTAAAGAGGGTAAATTCTCTATACAACAAGATATGTTAAGATATAAGGATAGGTTGGTGCTGTCTAAAACCTCTACATTGATACTCACAATCCTACACACTTACCACGATTCGGTACTTGGAGGCCACTCTGGATTTTGGCGCACCTATAAGAGATTGACCGGAAATTGTATTGGGAGAGAATGAGGGCGAATGTTAAGAAGTATTGTGGAGAATGTGTAATATGTCATAAGAATAAAAATTAGGGTAAAGTAAATCTTAGCTACCCATGGGATCAACATGATCTAAGCTCATTATTTCTAGCAAAAATAATCGAAAATAAAAATTTATATTAATGTCTACAAAAGTGGTGTGAATATAACTAAGATGTCACCTGGCCAAAACCTAAACTTTAGGACTATGCAAAAAACGGTCACTGTCAATATTTTTAATAACAAAAAAATAAAAAGGGAAAATTATGTAAAACTCTATACAAATCACTATAATTGGGGATTTTGTACGCCTTCCAATTGTTGGCTTGTTTGAACTATAAGGGTTTGCTATTATGGAGCTTTTTCTGTTGCATCTCAGTGTTATCTTCGTCTGGTTTACTTTGTTCGTCTCCGTCTCTTGTTTACTTTGTTTGTCTCCATCTCTTGAGAGTTGTTGTATCTTTGAGCATTAGTCCCTTTTCATTTTTCAGTAAAAAAAATTTCTTTTATTTTTTTTTTAGTGTCAATCAGCCCTAGACTGGTCTTTGAAGTTTTTGTTGTTCCTCTTAAGAGTAGGAACTAAGGTAGTATGAGATGCCTTTGTCCCACATTGATTTTGTTCAAATGGGATGATCATTATGGTACTTATGTGGCTTGGCTGCCCCACTCCAATACCTGGTTTTTGAGGTGTGGTTCTCCAAGATGCTTAAGTAGCTAACAATGATATCAAATCTTTTTCAAGCTTAATCCTATAGTCAAGTTAAAAGCTATTTCAAGCTTGAATTCATCCCAGAAATTTTTGTTACATTCAAAAGCCACCAACTAGAGAGGATCTAAGTGAGCAGCAAGAGTGTCTTGTGAGCAATGAAATTATGGATTTGCGTGTGTTATTTTTTATTAAATCTTTCATGTTGGGTCTTTAATGCTTATCGTTATGCTCCAATACCTCTCACGCTGTCCTTATTTTTCGTGTATTTGGTTGGCCCAATGAAATAACTGTAGATTTTAAAATGTTTTGGAATACATGAGTTTTAGTGTTGGTGGTCATTTGCATAGGTGAGACATACGGGTCTATTTGTTCCTAATTTCTAATGTTTATAGAGCTAGTCCTATATGTTTAAGGCTGCTTAGATATCCACAAGAATGAAAAAAATATCCAACACATCATCATCGAATGTTTACAGTCAAATATAGATTCTTTATTATCTTATAATCAGCTATGTGAGGGTTGAGGAAATGAATTTGAAATTAGAATTCTCAAGGGAAATGTAAATAACCAACGTTCTACATCTTCCGCATTGAAGAATCAAATTCATTTACCCTTGAAGAGTGGTTTTTTTCACATATTATTTAAAAATTTGACGAAGATGAAACATTGATCATGTGGATAATGGGAGAATGCTGTCTTCATAGGTGGGTTATATATTATATTCTCTTTATCATTAAGGATGATCCCTGAGAGCTGTGGTTTTGCAGGTATGATATTCGGCAAGATGGGCACGACCCTTACGAAAACTGTGTTTCAGTAATGCGATTGTATAAAAGAATGCGTAGTCTTGATCACCATAGACAAGTTATGACCCTGTCAATTACTCCTTCCTGTATCCAGTATGTTGCTCCCAATCTTGATTCCCATAGTGCAAAAGATCTTGAAAAGATGACTCCAGATGAACTTTATGAGATGTCGAGATCAAACTTTAAGTGTTGGTGTCATGACTCCAGACGAGTTATGCAGACTTAGGAGCCAATCATTTTTGTTTTTGCTGCTAAGAAAAGGGCATTGATCATTGTGGTGTCTGCAGTTTGTATAGAGAATTCTCATTTTGTTGAGAAGGGAGGGAGGTTCTTGGATGAAACGCTCCCATCGGACTTGTTTGTTAAAAAATGGGAATGACCTTATCTTTTGTATCTATTTAGCCTTTCAAGACTTAATACGGTAAAGGTGCAGTTCCCACATCGGAAGTGTCTTACCAGTTTATGGTAGTGTTTTAGGTCTCTTATGATGTTTGTTTTCTGATATAAACAATGAACTTTGAAAACGTAAGGACGCGCTTTGTTGAAGGTGTTTTTGAAGGGTGAATCAGTTTGGGCATCGATAGCCCATGTTTTTCTTTGAAGAGTTTGCTTATTTACCAAGATTAATGCTTTTCTTTGACATCTACACATGAAACGGGTTTCGTCTAGCCGTCGGAGATTTTTGAAGCTCTCTCTTCTCTTCTTTGACTTCTATACTTTCCCGCTTCCTCCCCATCTCTGTTTCATTTAGTTTGTTGGGGATTTTTATGGGTTTTATTTTAAAATTTTATGAAACTATTAACTTTCATGGGTTTATTTTAGTTAATTGTGTTTTTGGTGTATGCAACCAAACTAATATTTGTTGGATAAAAAGAAAAAGAATTTGTTTTTTTGTTATTTTTCATGGGTTTTATTTGGGGGAATTTACATAAATGGAAGAAAATACAAAGTATTTACACCATATAGC

mRNA sequence

ATGGACTCTGATCATGACCCTCTCAAGACACCAAGCCTAAGACACAAATGCTCAGCATGCTACAAGCAATATAAGAAGAAAGAGCATCTTATCGAGCATATGAGAGTTTCATATCACTCTGTTCATCAGCCTAGATGTGGAGTCTGCCTAAAGCACTGCAAATCATTTGAATCATTGAGAGAACATCTAATGGAGCAAGGTTGTGGGCTTTGTTTGAGAGTACTTGATGGTCCGGAATCTCTCAGTGATCACCAAGACATTTGCTGCATAACAGCACCTGTTCACCAAGGAACAAGTCTACCACCAACTGATTTATCCGATTGTTACGAAGAAGATCGTTCTGATCGAGGCCTTGGAGCAATAGCTATTGATTGTGTAATGGCTGGTGGAGGAAGTGATGGTGCACTGGACATTTGTGTTTGGATATGCCTTGTTGATGAAGATGAGAAATTGATTTTCAATACTTTCGTACAACCACAAATACCGATCACGAACTATAGGCATGAAGTAACTGGGCTAAAGGAAGAACATATGAGGTATGCCATGCCACTGAAAAATGTCCAGGAAAAAGTATTGAAACTCTTGTTAAATGGAGAATCCATTGGGAGATTGAGATTGAATGGTGGTAAAGCTAAGCTTCTTGTCGGCCATGACTTGGAGCATGATTTAGATTGCCTGAGATTGAATTATCCCGATCATATGTTGAGGGACACGGCTAGATATCATCCATTGATGAAAACAAATTTGGTTAGTCATTCTCTCAAGTACCTTACTCGAGCATATTTGGGGTATGATATTCGGCAAGATGGGCACGACCCTTACGAAAACTGTGTTTCAGTAATGCGATTGTATAAAAGAATGCGTAGTCTTGATCACCATAGACAAGTTATGACCCTGTCAATTACTCCTTCCTGTATCCAGTATGTTGCTCCCAATCTTGATTCCCATAGTGCAAAAGATCTTGAAAAGATGACTCCAGATGAACTTTATGAGATGTCGAGATCAAACTTTAAGTGTTGGTGTCATGACTCCAGACGAGTTATGCAGACTTAG

Coding sequence (CDS)

ATGGACTCTGATCATGACCCTCTCAAGACACCAAGCCTAAGACACAAATGCTCAGCATGCTACAAGCAATATAAGAAGAAAGAGCATCTTATCGAGCATATGAGAGTTTCATATCACTCTGTTCATCAGCCTAGATGTGGAGTCTGCCTAAAGCACTGCAAATCATTTGAATCATTGAGAGAACATCTAATGGAGCAAGGTTGTGGGCTTTGTTTGAGAGTACTTGATGGTCCGGAATCTCTCAGTGATCACCAAGACATTTGCTGCATAACAGCACCTGTTCACCAAGGAACAAGTCTACCACCAACTGATTTATCCGATTGTTACGAAGAAGATCGTTCTGATCGAGGCCTTGGAGCAATAGCTATTGATTGTGTAATGGCTGGTGGAGGAAGTGATGGTGCACTGGACATTTGTGTTTGGATATGCCTTGTTGATGAAGATGAGAAATTGATTTTCAATACTTTCGTACAACCACAAATACCGATCACGAACTATAGGCATGAAGTAACTGGGCTAAAGGAAGAACATATGAGGTATGCCATGCCACTGAAAAATGTCCAGGAAAAAGTATTGAAACTCTTGTTAAATGGAGAATCCATTGGGAGATTGAGATTGAATGGTGGTAAAGCTAAGCTTCTTGTCGGCCATGACTTGGAGCATGATTTAGATTGCCTGAGATTGAATTATCCCGATCATATGTTGAGGGACACGGCTAGATATCATCCATTGATGAAAACAAATTTGGTTAGTCATTCTCTCAAGTACCTTACTCGAGCATATTTGGGGTATGATATTCGGCAAGATGGGCACGACCCTTACGAAAACTGTGTTTCAGTAATGCGATTGTATAAAAGAATGCGTAGTCTTGATCACCATAGACAAGTTATGACCCTGTCAATTACTCCTTCCTGTATCCAGTATGTTGCTCCCAATCTTGATTCCCATAGTGCAAAAGATCTTGAAAAGATGACTCCAGATGAACTTTATGAGATGTCGAGATCAAACTTTAAGTGTTGGTGTCATGACTCCAGACGAGTTATGCAGACTTAG

Protein sequence

MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLREHLMEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT
BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_004142658.1 (PREDICTED: apoptosis-enhancing nuclease isoform X2 [Cucumis sativus])

HSP 1 Score: 701.0 bits (1808), Expect = 2.0e-198
Identity = 350/350 (100.00%), Postives = 350/350 (100.00%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR
Sbjct: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60

Query: 61  EHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGA 120
           EHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGA
Sbjct: 61  EHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGA 120

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY
Sbjct: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR
Sbjct: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240

Query: 241 YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLS 300
           YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLS
Sbjct: 241 YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVMTLS 300

Query: 301 ITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT 351
           ITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT
Sbjct: 301 ITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRVMQT 350

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_011653750.1 (PREDICTED: apoptosis-enhancing nuclease isoform X1 [Cucumis sativus] >XP_011653752.1 PREDICTED: apoptosis-enhancing nuclease isoform X1 [Cucumis sativus])

HSP 1 Score: 667.2 bits (1720), Expect = 3.2e-188
Identity = 325/363 (89.53%), Postives = 325/363 (89.53%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGV         SLR
Sbjct: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVXXXXXXXXXSLR 60

Query: 61  EHLM-------------EQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDLSD 120
           EHLM             EQG                HQDICCITAPVHQGTSLPPTDLSD
Sbjct: 61  EHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPESLSDHQDICCITAPVHQGTSLPPTDLSD 120

Query: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180
           CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR
Sbjct: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180

Query: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240
           HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR
Sbjct: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240

Query: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300
           LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM
Sbjct: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300

Query: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 351
           RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV
Sbjct: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 360

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_022155663.1 (apoptosis-enhancing nuclease isoform X1 [Momordica charantia])

HSP 1 Score: 598.2 bits (1541), Expect = 1.8e-167
Identity = 287/363 (79.06%), Postives = 310/363 (85.40%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MDSD DPL  P+ RHKCSACYKQYKKKEHLIEHM+VSYHS+HQPRCGVC KHCKSFESLR
Sbjct: 1   MDSDRDPLNPPTTRHKCSACYKQYKKKEHLIEHMKVSYHSIHQPRCGVCGKHCKSFESLR 60

Query: 61  EHL-------------MEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDLSD 120
           EHL             +EQG                H+DICC+TAPVHQGTSL PTDLSD
Sbjct: 61  EHLQGPLSKSNCSKIFIEQGCGLCLRVLDGRRSLNEHRDICCLTAPVHQGTSLLPTDLSD 120

Query: 121 CYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYR 180
           CY+EDRSDR LGAIA+DCVMAGGGSDG LD+CV +CLVDE+EKLIFNTFV+PQIPITNYR
Sbjct: 121 CYDEDRSDRVLGAIAMDCVMAGGGSDGTLDLCVGVCLVDEEEKLIFNTFVRPQIPITNYR 180

Query: 181 HEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLR 240
           HEVTGL EEHMRYAMPLK VQEKVL++L+NGESIGRLRLNGG+A+LLVGHDLEHDLDCLR
Sbjct: 181 HEVTGLTEEHMRYAMPLKEVQEKVLRILINGESIGRLRLNGGRARLLVGHDLEHDLDCLR 240

Query: 241 LNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRM 300
           +NYPDHMLRDTARYHPLMKTNLVSHSLKYLTR YLGYDI+Q  HDPYENCVSVMRLYKRM
Sbjct: 241 MNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRTYLGYDIQQGVHDPYENCVSVMRLYKRM 300

Query: 301 RSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSRRV 351
           R+LDHH QVM  ++ P   QYVA NLDSHS KDLEKMTPD+LYEMSRSNFKCWC DSRRV
Sbjct: 301 RNLDHHGQVMIPTVAPHA-QYVAHNLDSHSVKDLEKMTPDKLYEMSRSNFKCWCLDSRRV 360

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_008449328.1 (PREDICTED: RNA exonuclease 4-like [Cucumis melo])

HSP 1 Score: 554.7 bits (1428), Expect = 2.3e-154
Identity = 267/305 (87.54%), Postives = 275/305 (90.16%), Query Frame = 0

Query: 46  CGVCLKHCKSFESLREHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDL 105
           CG+CL+     E+L E                      HQDICCITAPVHQGTSLPPTDL
Sbjct: 18  CGLCLRVLDGPETLSE----------------------HQDICCITAPVHQGTSLPPTDL 77

Query: 106 SDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 165
           SDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN
Sbjct: 78  SDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 137

Query: 166 YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDC 225
           YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIG+LR NGGKAKLLVGHDLEHDLDC
Sbjct: 138 YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGKLRSNGGKAKLLVGHDLEHDLDC 197

Query: 226 LRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 285
           LR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK
Sbjct: 198 LRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 257

Query: 286 RMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSR 345
           RMRSLDH RQVMTLS+TPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSR+NFKCWCHDSR
Sbjct: 258 RMRSLDHRRQVMTLSVTPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRTNFKCWCHDSR 300

Query: 346 RVMQT 351
           RVMQT
Sbjct: 318 RVMQT 300

BLAST of CsaV3_4G027590 vs. NCBI nr
Match: XP_008465475.1 (PREDICTED: RNA exonuclease 4-like isoform X1 [Cucumis melo])

HSP 1 Score: 547.7 bits (1410), Expect = 2.8e-152
Identity = 265/305 (86.89%), Postives = 273/305 (89.51%), Query Frame = 0

Query: 46  CGVCLKHCKSFESLREHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDL 105
           CG+CL+     E+L E                      HQDICCITAPVHQGTSLPPTDL
Sbjct: 18  CGLCLRVLDGPETLSE----------------------HQDICCITAPVHQGTSLPPTDL 77

Query: 106 SDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 165
           SD YEEDRSDRGLGAIA+DCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN
Sbjct: 78  SDGYEEDRSDRGLGAIAMDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 137

Query: 166 YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDC 225
           YRHEVTGLKEEH+RYAMPLKNVQEKVLKLLLNGESIGRLR NGGKAKLLVGHDLEHDLDC
Sbjct: 138 YRHEVTGLKEEHLRYAMPLKNVQEKVLKLLLNGESIGRLRSNGGKAKLLVGHDLEHDLDC 197

Query: 226 LRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 285
           LR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK
Sbjct: 198 LRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 257

Query: 286 RMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSR 345
           RMRSLDH RQVMT S+TPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSR+NFKCWCHDSR
Sbjct: 258 RMRSLDHRRQVMTRSVTPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRTNFKCWCHDSR 300

Query: 346 RVMQT 351
           RVMQT
Sbjct: 318 RVMQT 300

BLAST of CsaV3_4G027590 vs. TAIR10
Match: AT2G48100.1 (Exonuclease family protein)

HSP 1 Score: 330.9 bits (847), Expect = 9.7e-91
Identity = 172/361 (47.65%), Postives = 226/361 (62.60%), Query Frame = 0

Query: 1   MDSDHDPLKTP--SLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFES 60
           MDS  +P K    S+RH+C ACYK + ++EHL+EHM++SYHS+HQPRCGVCLKHCKSFES
Sbjct: 1   MDSQLNPSKRRKISVRHRCVACYKMFNRREHLVEHMKISYHSLHQPRCGVCLKHCKSFES 60

Query: 61  LREHL---------------MEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPT 120
           +REHL                ++G                H++ C ++ P   GTS    
Sbjct: 61  VREHLNVPDHLSKGNCKAIFTKRGCTLCLQIFEEAFALAEHKNKCHLSPPRPLGTSTQRN 120

Query: 121 DLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPI 180
             S       +   L A+A+DC M GGG+DG +D C  +CLVD+DE +IF+T VQP +P+
Sbjct: 121 PSSSL-----AGSRLKAMALDCEMVGGGADGTIDQCASVCLVDDDENVIFSTHVQPLLPV 180

Query: 181 TNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDL 240
           T+YRHE+TGL +E ++  MPL++V+E+V   L  G++ G  RL      LLVGHDL HD+
Sbjct: 181 TDYRHEITGLTKEDLKDGMPLEHVRERVFSFLCGGQNDGAGRL------LLVGHDLRHDM 240

Query: 241 DCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRL 300
            CL+L YP H+LRDTA+Y PLMKTNLVS SLKYLT++YLGY I+   H+ YE+CVS MRL
Sbjct: 241 SCLKLEYPSHLLRDTAKYVPLMKTNLVSQSLKYLTKSYLGYKIQCGKHEVYEDCVSAMRL 300

Query: 301 YKRMRSLDHHRQVMTLSITPSCIQYVAPN-LDSHSAKDLEKMTPDELYEMSRSNFKCWCH 344
           YKRMR  +H            C      N L+S    DLEKM  +ELY+ S S ++CWC 
Sbjct: 301 YKRMRDQEH-----------VCSGKAEGNGLNSRKQSDLEKMNAEELYQKSTSEYRCWCL 339

BLAST of CsaV3_4G027590 vs. TAIR10
Match: AT3G27970.1 (Exonuclease family protein)

HSP 1 Score: 317.4 bits (812), Expect = 1.1e-86
Identity = 164/363 (45.18%), Postives = 229/363 (63.09%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MD       + +LR+KC+ACY+Q+ K EHL+EHM++SYHS H+P CGVC KHC+SFESLR
Sbjct: 1   MDYRSSMESSETLRNKCAACYRQFNKLEHLVEHMKISYHSGHEPTCGVCKKHCRSFESLR 60

Query: 61  EHLME-------------QGXXXXXXXXXXXXXXXXHQDICCITAPVHQG--TSLPPTDL 120
           EHL+              +G                HQ+ C  ++ V+ G  T +    L
Sbjct: 61  EHLIGPLPKQECKNIFSLRGCRFCMTILESPNSRRIHQERCQFSS-VNSGLTTRMAALGL 120

Query: 121 SDCYEED-RSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPIT 180
            D    D  S R    +A+ C M GGGSDG+LD+C  +C+ DE + +IF+T+V+P + +T
Sbjct: 121 RDKAMIDYTSSRSPRVVALSCKMVGGGSDGSLDLCARVCITDESDNVIFHTYVKPSMAVT 180

Query: 181 NYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLD 240
           +YR+E TG++ E++R AMPLK VQ K+ + L NGE + ++R  GGKA++LVGH L+HDLD
Sbjct: 181 SYRYETTGIRPENLRDAMPLKQVQRKIQEFLCNGEPMWKIRPRGGKARILVGHGLDHDLD 240

Query: 241 CLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLY 300
            L+L YP  M+RDTA+Y PLMKT+ +S+SLKYLT+AYLGYD+     DPYE+CV+ MRLY
Sbjct: 241 RLQLEYPSSMIRDTAKYPPLMKTSKLSNSLKYLTQAYLGYDVHFGIQDPYEDCVATMRLY 300

Query: 301 KRMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSA---KDLEKMTPDELYEMSRSNFKCWC 345
            RMR   H  +   L+         A N  +  A    + E+M+PDE+  +SRS++ CWC
Sbjct: 301 TRMRYQKHKIEAYPLAAD-------AQNRSNQVAWRQSEAERMSPDEMLSISRSDYYCWC 355

BLAST of CsaV3_4G027590 vs. TAIR10
Match: AT5G40310.1 (Exonuclease family protein)

HSP 1 Score: 309.7 bits (792), Expect = 2.3e-84
Identity = 155/347 (44.67%), Postives = 220/347 (63.40%), Query Frame = 0

Query: 14  RHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLREHLME-------- 73
           R+KC  CY+Q+ KKEHL+EHMR+SYHSVH+P CG+C KHC+SF+SLREHL+         
Sbjct: 5   RNKCGGCYRQFNKKEHLVEHMRISYHSVHEPTCGICNKHCRSFDSLREHLIGPLPKQECK 64

Query: 74  -----QGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDL---SDCYEEDRSDRG 133
                +G                HQ+ C + + V  G  +    L   ++   +  S R 
Sbjct: 65  NIFSIRGCRFCLTILESPNARRIHQERCQL-SNVTSGLMIRMAALGLRNNSTIDYTSSRS 124

Query: 134 LGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEH 193
              +A+ C M GGGSDG+LD+C  +C+ DE E ++F+T+V+P IP+TNYR+E+TG++ E+
Sbjct: 125 PRVVALSCKMVGGGSDGSLDLCARVCITDESENVVFHTYVKPTIPVTNYRYEMTGIRPEN 184

Query: 194 MRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRD 253
           +R AM LK+ Q KV + L NGE + ++R   GKA++LVGH L++ LD L+L Y   M+RD
Sbjct: 185 LRDAMRLKHAQRKVQEFLCNGEPMWKIRPRNGKARILVGHGLDNHLDSLQLEYSSSMIRD 244

Query: 254 TARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRSLDHHRQVM 313
           TA Y PLMK++ +S+SLKYLT+AYLGYDI     DPYE+CV+ MRLY RMR   H  +  
Sbjct: 245 TAEYPPLMKSSKLSNSLKYLTQAYLGYDIHVGIQDPYEDCVATMRLYTRMRYQKHRAEAY 304

Query: 314 TLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDS 345
            L+           N  +    +LE+M+P+EL ++SRS++ CWC DS
Sbjct: 305 PLASDTQNHN----NFAAWRQNELERMSPEELLDLSRSDYYCWCLDS 346

BLAST of CsaV3_4G027590 vs. TAIR10
Match: AT3G15080.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 84.3 bits (207), Expect = 1.6e-16
Identity = 52/168 (30.95%), Postives = 85/168 (50.60%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           +A+DC M  G S G       + LV++   ++++ FV+P   + ++R  ++G++   +R 
Sbjct: 84  VAMDCEMV-GVSQGTKSALGRVTLVNKWGNVLYDEFVRPVEHVVDFRTSISGIRPRDLRK 143

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A   +  Q KV +L+              K K+LVGH L +DL  L L +P   +RDT  
Sbjct: 144 AKDFRVAQTKVAELI--------------KGKILVGHALHNDLKALLLTHPKKDIRDTGE 203

Query: 241 YHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMR 289
           Y P +K      SLK+L    LG DI+   H P ++  + M LY++ R
Sbjct: 204 YQPFLK-GKTRKSLKHLASEILGADIQNGEHCPIDDARAAMMLYQKNR 235

BLAST of CsaV3_4G027590 vs. TAIR10
Match: AT3G50100.1 (small RNA degrading nuclease 1)

HSP 1 Score: 71.6 bits (174), Expect = 1.1e-12
Identity = 52/169 (30.77%), Postives = 88/169 (52.07%), Query Frame = 0

Query: 121 IAIDC--VMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHM 180
           +A+DC  V+   G++G + + V    VD D K+I + FV+P  P+ +YR ++TG+  E +
Sbjct: 141 VAVDCEMVLCEDGTEGLVRVGV----VDRDLKVILDEFVKPNKPVVDYRTDITGITAEDI 200

Query: 181 RYA-MPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRD 240
             A + + ++QE +   L  G              +LVGH L  DL+ L++++P   + D
Sbjct: 201 ENASLSVVDIQETLQPFLSTG-------------TILVGHSLNRDLEVLKIDHP--KVID 260

Query: 241 TARYHPLMKT-NLVSHSLKYLTRAYLGYDIRQDG--HDPYENCVSVMRL 284
           TA       T  L   SL  L ++ LGY++R+ G  HD   +  + M+L
Sbjct: 261 TALVFKYPNTRKLRRPSLNNLCKSILGYEVRKTGVPHDCVHDASAAMKL 290

BLAST of CsaV3_4G027590 vs. Swiss-Prot
Match: sp|Q4IEV5|REXO4_GIBZE (RNA exonuclease 4 OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC 9075 / NRRL 31084) OX=229533 GN=REX4 PE=3 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 7.3e-19
Identity = 58/176 (32.95%), Postives = 97/176 (55.11%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           IAIDC M G G  G       + +VD     I++++V+P+  +TN+R  V+G+ ++ MR+
Sbjct: 134 IAIDCEMVGVGPGGHESALARVSIVDFHGVQIYDSYVKPKEKVTNWRTAVSGISQKSMRF 193

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A   + VQ ++ KLL              + ++LVGHDL+HDL+ L L++P   +RDTA+
Sbjct: 194 ARDFEEVQAEIDKLL--------------RGRILVGHDLKHDLEALILSHPGKDIRDTAK 253

Query: 241 YHPLMK-TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRS---LDH 293
           +    K  N    SL+ L +  LG +I+   H   E+  + M L+++ +S   +DH
Sbjct: 254 FSGFKKYANGRKPSLRVLAQQLLGVEIQGGEHSSIEDARATMLLFRKHKSAFDVDH 295

BLAST of CsaV3_4G027590 vs. Swiss-Prot
Match: sp|Q08237|REXO4_YEAST (RNA exonuclease 4 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=REX4 PE=1 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 2.8e-18
Identity = 56/171 (32.75%), Postives = 91/171 (53.22%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           IA+DC   G G +G       I +V+    ++ + FV+P+  +  +R  V+G+K EHM+ 
Sbjct: 122 IAMDCEFVGVGPEGKESALARISIVNYFGHVVLDEFVKPREKVVEWRTWVSGIKPEHMKN 181

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A+  K  Q+K   +L              + ++LVGH L+HDL+ L L++P  +LRDT+R
Sbjct: 182 AITFKEAQKKTADIL--------------EGRILVGHALKHDLEALMLSHPKSLLRDTSR 241

Query: 241 YHPLMK--TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKRMRS 290
           + P  K      + SLK LTR  L   I++  H   E+  + M LYK+ ++
Sbjct: 242 HLPFRKLYAKGKTPSLKKLTREVLKISIQEGEHSSVEDARATMLLYKKEKT 278

BLAST of CsaV3_4G027590 vs. Swiss-Prot
Match: sp|Q9GZR2|REXO4_HUMAN (RNA exonuclease 4 OS=Homo sapiens OX=9606 GN=REXO4 PE=1 SV=2)

HSP 1 Score: 92.8 bits (229), Expect = 8.0e-18
Identity = 53/167 (31.74%), Postives = 94/167 (56.29%), Query Frame = 0

Query: 120 AIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMR 179
           A+A+DC M G G  G   +   + +V++  K +++ +V+P  P+T+YR  V+G++ E+++
Sbjct: 243 ALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVTDYRTAVSGIRPENLK 302

Query: 180 YAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTA 239
               L+ VQ++V ++L              K ++LVGH L +DL  L L++P   +RDT 
Sbjct: 303 QGEELEVVQKEVAEML--------------KGRILVGHALHNDLKVLFLDHPKKKIRDTQ 362

Query: 240 RYHPLMKTNLVS--HSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLY 285
           +Y P  K+ + S   SL+ L+   LG  ++Q  H   ++  + MRLY
Sbjct: 363 KYKP-FKSQVKSGRPSLRLLSEKILGLQVQQAEHCSIQDAQAAMRLY 394

BLAST of CsaV3_4G027590 vs. Swiss-Prot
Match: sp|Q6CMT3|REXO4_KLULA (RNA exonuclease 4 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) OX=284590 GN=REX4 PE=3 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 1.4e-17
Identity = 53/167 (31.74%), Postives = 90/167 (53.89%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           +++DC   G G DG       + +V+    ++ + FV+P+ P+T++R  V+G+K  HM  
Sbjct: 120 VSMDCEFVGVGPDGKDSALARVSIVNYYGNVVLDLFVRPKEPVTDWRTWVSGIKPHHMAN 179

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A+  ++ Q++V  +L              K ++LVGH + HDL  L L++P  M+RDT+R
Sbjct: 180 AVTQEDCQKQVSNVL--------------KGRILVGHSVHHDLTALMLSHPRRMIRDTSR 239

Query: 241 YHPLMK--TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 286
           + P  +  +   + SLK LT+  L  DI+   H   E+  + M LYK
Sbjct: 240 HMPFRQKYSEGKTPSLKKLTKEILQLDIQDGEHSSIEDARATMLLYK 272

BLAST of CsaV3_4G027590 vs. Swiss-Prot
Match: sp|Q7S9B7|REXO4_NEUCR (RNA exonuclease 4 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) OX=367110 GN=rex-4 PE=3 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 4.0e-17
Identity = 53/167 (31.74%), Postives = 86/167 (51.50%), Query Frame = 0

Query: 121 IAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRY 180
           ++IDC M G G  GA  +     +VD     I++++V+P   +T++R  V+G+ + HM  
Sbjct: 217 LSIDCEMVGTGPSGATSVLARCSIVDFHGHQIYDSYVRPTAFVTDWRTHVSGISKRHMAS 276

Query: 181 AMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTAR 240
           A   ++VQ  V  LL              K ++LVGHD++HDL+ L   +P   +RDTA+
Sbjct: 277 ARSFESVQATVAALL--------------KGRILVGHDVKHDLEVLGFEHPHRDIRDTAK 336

Query: 241 YHPLMK-TNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYKR 287
           Y    K  +    SL+ L +  LG +I Q  H   E+    M L+++
Sbjct: 337 YSGFRKYGHGPKPSLRVLAKEVLGIEIHQGQHSSVEDARVAMLLFRK 369

BLAST of CsaV3_4G027590 vs. TrEMBL
Match: tr|A0A1S3BLT2|A0A1S3BLT2_CUCME (RNA exonuclease 4-like OS=Cucumis melo OX=3656 GN=LOC103491238 PE=4 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 1.5e-154
Identity = 267/305 (87.54%), Postives = 275/305 (90.16%), Query Frame = 0

Query: 46  CGVCLKHCKSFESLREHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDL 105
           CG+CL+     E+L E                      HQDICCITAPVHQGTSLPPTDL
Sbjct: 18  CGLCLRVLDGPETLSE----------------------HQDICCITAPVHQGTSLPPTDL 77

Query: 106 SDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 165
           SDCYEEDRSDRGLGAIA+DCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN
Sbjct: 78  SDCYEEDRSDRGLGAIAMDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 137

Query: 166 YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDC 225
           YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIG+LR NGGKAKLLVGHDLEHDLDC
Sbjct: 138 YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGKLRSNGGKAKLLVGHDLEHDLDC 197

Query: 226 LRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 285
           LR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK
Sbjct: 198 LRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 257

Query: 286 RMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSR 345
           RMRSLDH RQVMTLS+TPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSR+NFKCWCHDSR
Sbjct: 258 RMRSLDHRRQVMTLSVTPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRTNFKCWCHDSR 300

Query: 346 RVMQT 351
           RVMQT
Sbjct: 318 RVMQT 300

BLAST of CsaV3_4G027590 vs. TrEMBL
Match: tr|A0A1S3CNZ6|A0A1S3CNZ6_CUCME (RNA exonuclease 4-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503082 PE=4 SV=1)

HSP 1 Score: 547.7 bits (1410), Expect = 1.8e-152
Identity = 265/305 (86.89%), Postives = 273/305 (89.51%), Query Frame = 0

Query: 46  CGVCLKHCKSFESLREHLMEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPPTDL 105
           CG+CL+     E+L E                      HQDICCITAPVHQGTSLPPTDL
Sbjct: 18  CGLCLRVLDGPETLSE----------------------HQDICCITAPVHQGTSLPPTDL 77

Query: 106 SDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 165
           SD YEEDRSDRGLGAIA+DCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN
Sbjct: 78  SDGYEEDRSDRGLGAIAMDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQIPITN 137

Query: 166 YRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEHDLDC 225
           YRHEVTGLKEEH+RYAMPLKNVQEKVLKLLLNGESIGRLR NGGKAKLLVGHDLEHDLDC
Sbjct: 138 YRHEVTGLKEEHLRYAMPLKNVQEKVLKLLLNGESIGRLRSNGGKAKLLVGHDLEHDLDC 197

Query: 226 LRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 285
           LR+NYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK
Sbjct: 198 LRMNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVMRLYK 257

Query: 286 RMRSLDHHRQVMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFKCWCHDSR 345
           RMRSLDH RQVMT S+TPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSR+NFKCWCHDSR
Sbjct: 258 RMRSLDHRRQVMTRSVTPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRTNFKCWCHDSR 300

Query: 346 RVMQT 351
           RVMQT
Sbjct: 318 RVMQT 300

BLAST of CsaV3_4G027590 vs. TrEMBL
Match: tr|I1MCY4|I1MCY4_SOYBN (Uncharacterized protein OS=Glycine max OX=3847 GN=100805841 PE=4 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 2.7e-119
Identity = 217/365 (59.45%), Postives = 266/365 (72.88%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MD++ DP + P  RHKC ACYKQYKKKEHLIEHM+ SYHSVHQPRCGVC KHCKSFESLR
Sbjct: 1   MDAEADPPQNPITRHKCLACYKQYKKKEHLIEHMKTSYHSVHQPRCGVCQKHCKSFESLR 60

Query: 61  EHL-------------MEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPP----- 120
           EHL              +QG                H+ IC I+AP   GTS  P     
Sbjct: 61  EHLTGPLPRGICSKIFSQQGCQLCLALFDSPGSLIDHRKICRISAPTCPGTSALPYIDSQ 120

Query: 121 TDLSDCYEEDRSDRGL-GAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQI 180
            D  D  +E+ +  G  GA+A+DC M GGGSDG+L++C  +CLVDEDE+LIF+T+VQP+I
Sbjct: 121 FDCQDFSDENHAGEGPGGAVAMDCEMVGGGSDGSLELCARVCLVDEDERLIFHTYVQPEI 180

Query: 181 PITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEH 240
           P+TNYR+++TGL EEH+R AMPLK V+EK+L++L NGESIG++RL+GGKA+LLVGHDL H
Sbjct: 181 PVTNYRYDITGLTEEHLRNAMPLKEVREKLLQILHNGESIGKVRLDGGKARLLVGHDLAH 240

Query: 241 DLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVM 300
           DLDCL++NYPDHMLRDTA+Y PLMKTNLVSHSLKYLTR YLGYDI+   HDPYE+C+SVM
Sbjct: 241 DLDCLKMNYPDHMLRDTAKYRPLMKTNLVSHSLKYLTRTYLGYDIQSGTHDPYEDCISVM 300

Query: 301 RLYKRMRSLDHHRQ---VMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFK 344
           RLYKR+RS  H  +    MTLS        +    DS  +++L+ +TPDELY MSRS++K
Sbjct: 301 RLYKRIRSQLHPEEDHGTMTLS------NNIVGMPDSWISRELDNLTPDELYAMSRSDYK 359

BLAST of CsaV3_4G027590 vs. TrEMBL
Match: tr|C6THA2|C6THA2_SOYBN (Uncharacterized protein OS=Glycine max OX=3847 GN=100788682 PE=2 SV=1)

HSP 1 Score: 435.3 bits (1118), Expect = 1.3e-118
Identity = 214/365 (58.63%), Postives = 266/365 (72.88%), Query Frame = 0

Query: 1   MDSDHDPLKTPSLRHKCSACYKQYKKKEHLIEHMRVSYHSVHQPRCGVCLKHCKSFESLR 60
           MD++ DP + P  RHKC ACYKQYKKKEHLIEHM+ SYHSVHQPRCGVC KHCKSFESLR
Sbjct: 1   MDAEADPPQNPITRHKCLACYKQYKKKEHLIEHMKTSYHSVHQPRCGVCQKHCKSFESLR 60

Query: 61  EHL-------------MEQGXXXXXXXXXXXXXXXXHQDICCITAPVHQGTSLPP----- 120
           EHL              +QG                H++ C ++AP   GTS  P     
Sbjct: 61  EHLTGPLPRGICSKIFSQQGCQLCLALFDSPGSLIGHRETCRLSAPTCPGTSALPYIDSQ 120

Query: 121 TDLSDCYEEDRSDRGL-GAIAIDCVMAGGGSDGALDICVWICLVDEDEKLIFNTFVQPQI 180
            D  D  +E+ +  G  GA+AIDC M GGGSDG+L++C  +CLVDEDE+LIF+T+VQP+I
Sbjct: 121 FDCQDSSDENHAGEGPGGAVAIDCEMVGGGSDGSLELCARVCLVDEDERLIFHTYVQPEI 180

Query: 181 PITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGESIGRLRLNGGKAKLLVGHDLEH 240
           P+TNYR+++TGL EEH++ A+PLK V+EK+L++L NGESIG++RL+GGKA+LLVGHDL H
Sbjct: 181 PVTNYRYDITGLTEEHLKNAIPLKKVREKLLQILQNGESIGKVRLDGGKARLLVGHDLAH 240

Query: 241 DLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRAYLGYDIRQDGHDPYENCVSVM 300
           DLDCL++NYPDHMLRDTA+Y PLMKTNLVSHSLKYLTR YLGYDI+   HDPYE+C+SVM
Sbjct: 241 DLDCLKMNYPDHMLRDTAKYRPLMKTNLVSHSLKYLTRTYLGYDIQSGTHDPYEDCISVM 300

Query: 301 RLYKRMRSLDHHRQ---VMTLSITPSCIQYVAPNLDSHSAKDLEKMTPDELYEMSRSNFK 344
           RLYKR+RS  H  +    MTLS        +    DS  +++L+ +TPDELY MSRS++K
Sbjct: 301 RLYKRIRSQLHPEEDHGTMTLS------NNIVGMPDSWISRELDNLTPDELYAMSRSDYK 359

BLAST of CsaV3_4G027590 vs. TrEMBL
Match: tr|A0A0A0KYE9|A0A0A0KYE9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G338960 PE=4 SV=1)

HSP 1 Score: 435.3 bits (1118), Expect = 1.3e-118
Identity = 214/243 (88.07%), Postives = 214/243 (88.07%), Query Frame = 0

Query: 34  MRVSYHSVHQPRCGVCLKHCKSFESLREHLM-------------EQGXXXXXXXXXXXXX 93
           MRVSYHSVHQPRCGVCLKHCKSFESLREHLM             EQG             
Sbjct: 1   MRVSYHSVHQPRCGVCLKHCKSFESLREHLMGPLSKSNCSKIFSEQGCGLCLRVLDGPES 60

Query: 94  XXXHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV 153
              HQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV
Sbjct: 61  LSDHQDICCITAPVHQGTSLPPTDLSDCYEEDRSDRGLGAIAIDCVMAGGGSDGALDICV 120

Query: 154 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 213
           WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES
Sbjct: 121 WICLVDEDEKLIFNTFVQPQIPITNYRHEVTGLKEEHMRYAMPLKNVQEKVLKLLLNGES 180

Query: 214 IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 264
           IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA
Sbjct: 181 IGRLRLNGGKAKLLVGHDLEHDLDCLRLNYPDHMLRDTARYHPLMKTNLVSHSLKYLTRA 240

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004142658.12.0e-198100.00PREDICTED: apoptosis-enhancing nuclease isoform X2 [Cucumis sativus][more]
XP_011653750.13.2e-18889.53PREDICTED: apoptosis-enhancing nuclease isoform X1 [Cucumis sativus] >XP_0116537... [more]
XP_022155663.11.8e-16779.06apoptosis-enhancing nuclease isoform X1 [Momordica charantia][more]
XP_008449328.12.3e-15487.54PREDICTED: RNA exonuclease 4-like [Cucumis melo][more]
XP_008465475.12.8e-15286.89PREDICTED: RNA exonuclease 4-like isoform X1 [Cucumis melo][more]
Match NameE-valueIdentityDescription
AT2G48100.19.7e-9147.65Exonuclease family protein[more]
AT3G27970.11.1e-8645.18Exonuclease family protein[more]
AT5G40310.12.3e-8444.67Exonuclease family protein[more]
AT3G15080.11.6e-1630.95Polynucleotidyl transferase, ribonuclease H-like superfamily protein[more]
AT3G50100.11.1e-1230.77small RNA degrading nuclease 1[more]
Match NameE-valueIdentityDescription
sp|Q4IEV5|REXO4_GIBZE7.3e-1932.95RNA exonuclease 4 OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC 9075 / ... [more]
sp|Q08237|REXO4_YEAST2.8e-1832.75RNA exonuclease 4 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=55... [more]
sp|Q9GZR2|REXO4_HUMAN8.0e-1831.74RNA exonuclease 4 OS=Homo sapiens OX=9606 GN=REXO4 PE=1 SV=2[more]
sp|Q6CMT3|REXO4_KLULA1.4e-1731.74RNA exonuclease 4 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 707... [more]
sp|Q7S9B7|REXO4_NEUCR4.0e-1731.74RNA exonuclease 4 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3BLT2|A0A1S3BLT2_CUCME1.5e-15487.54RNA exonuclease 4-like OS=Cucumis melo OX=3656 GN=LOC103491238 PE=4 SV=1[more]
tr|A0A1S3CNZ6|A0A1S3CNZ6_CUCME1.8e-15286.89RNA exonuclease 4-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503082 PE=4 S... [more]
tr|I1MCY4|I1MCY4_SOYBN2.7e-11959.45Uncharacterized protein OS=Glycine max OX=3847 GN=100805841 PE=4 SV=1[more]
tr|C6THA2|C6THA2_SOYBN1.3e-11858.63Uncharacterized protein OS=Glycine max OX=3847 GN=100788682 PE=2 SV=1[more]
tr|A0A0A0KYE9|A0A0A0KYE9_CUCSA1.3e-11888.07Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G338960 PE=4 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006364rRNA processing
Vocabulary: Molecular Function
TermDefinition
GO:00084083'-5' exonuclease activity
GO:0003676nucleic acid binding
Vocabulary: INTERPRO
TermDefinition
IPR012337RNaseH-like_sf
IPR037431REX4_DEDDh_dom
IPR013087Znf_C2H2_type
IPR036397RNaseH_sf
IPR013520Exonuclease_RNaseT/DNA_pol3
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0006364 rRNA processing
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0005488 binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0008408 3'-5' exonuclease activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_4G027590.1CsaV3_4G027590.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013520Exonuclease, RNase T/DNA polymerase IIISMARTSM00479exoiiienduscoord: 119..293
e-value: 1.8E-24
score: 97.3
IPR036397Ribonuclease H superfamilyGENE3DG3DSA:3.30.420.10coord: 119..303
e-value: 6.5E-42
score: 145.0
NoneNo IPR availablePANTHERPTHR12801:SF66EXONUCLEASE-LIKE PROTEINcoord: 1..346
NoneNo IPR availablePANTHERPTHR12801EXONUCLEASEcoord: 1..346
IPR013087Zinc finger C2H2-typePROSITEPS00028ZINC_FINGER_C2H2_1coord: 17..39
IPR013087Zinc finger C2H2-typePROSITEPS50157ZINC_FINGER_C2H2_2coord: 15..44
score: 11.593
IPR037431RNA exonuclease 4, DEDDh 3'-5' exonuclease domainCDDcd06144REX4_likecoord: 121..285
e-value: 1.53323E-66
score: 208.139
IPR012337Ribonuclease H-like superfamilySUPERFAMILYSSF53098Ribonuclease H-likecoord: 121..290

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_4G027590CSPI04G17050Wild cucumber (PI 183967)cpicucB212
CsaV3_4G027590Cucsa.017960Cucumber (Gy14) v1cgycucB405
CsaV3_4G027590CsGy4G016310Cucumber (Gy14) v2cgybcucB171
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CsaV3_4G027590Silver-seed gourdcarcucB0511
CsaV3_4G027590Silver-seed gourdcarcucB0562
CsaV3_4G027590Cucurbita maxima (Rimu)cmacucB0803
CsaV3_4G027590Cucurbita maxima (Rimu)cmacucB1027
CsaV3_4G027590Cucurbita moschata (Rifu)cmocucB0788
CsaV3_4G027590Cucurbita moschata (Rifu)cmocucB1011
CsaV3_4G027590Cucurbita pepo (Zucchini)cpecucB0082
CsaV3_4G027590Cucurbita pepo (Zucchini)cpecucB0606
CsaV3_4G027590Watermelon (97103) v1cucwmB361