CsaV3_5G015310 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_5G015310
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pol polyprotein
Location: chr5 : 12564427 .. 12580083 (-)
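
The location string can be split into chromosome, start, end and strand. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name and is not part of the database's own tooling.

```python
import re

# Hypothetical helper for illustration only.
def parse_location(text):
    """Parse a location string such as 'chr5 : 12564427 .. 12580083 (-)'."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("chr5 : 12564427 .. 12580083 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 15657 bp on the minus strand of chr5.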
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGAAGGCAGTGGCTCAAATGATCACATTGATCACCCAAAAGCATTCTTTATTTTTCATAATATTCGACATGTTCTTCCTGATTATTGGGTCTCTACATTTCAATTCTTTTTCTTTTTCTTTCTTTCAAAGATTATTGTAAGTTAATTATGATTTAATGTGTTAATTATTTCGACTTAACTATATTCATTTAATTTTCTTCCTTTGTTATTCAGGCGATTCCTCCCATTAATGAAAACCCCATAGAGCATTTGGATTCTACTTATCAACATGTGTGTGAATAACCTAATTTGTTAGTATTTAAGTAATGGAAGTCCATTTTAGTTTAGGGGTTTTCTATAATTAAATCAAAATTTTGTGCTGTTGATATATGCAGGTGGTGGATGTGATTTTTACAAGCCTTTTGAATTTCTTGGGGAGATTAGGTGGCCTTTATATCAATCTAAGGGGTCAAGGTTATCTTCAAATTCATTCTCCTCGAGCGTTTTGGAATGTTGATTCATTTCGAGAAATGTTGCGTAAGTTCCCTCTATTTCATTATCCTAAAATTATTTCATGAATTGAATTGTTTTTTTCTTCCAAGAACAAATATCAACTGCCAAGATTGCAATAACACTATGAGAAAATTCAAGAAATGTGACAAAGAAAACCAATTTTAGGAACCAAAATCTAACTACTAACAAATCCTAGTTTTGATCCATTAGAACTAATTATTTGAAATTATTATTACCTTGGAAACCCTAATGTTTACAGGTGAGGTCCTTAGTGATCAACATAGGGTTCCACAGGAGAATGATGCAATCGTTTGGCATAGGACAAATTTGCAAGCTTTGTTTAATCATTTTCCTCGCGAGCGTCGAATAATAGGTACATTATTGACTCGAAACTTTATGTTGTGATGCAAATTAAGTTCATATAACATATTTTTTTCAACATACAAATATTATACTAAATTACAAGCTTAGTCTTAATACTTTTAGATTTATATATATGTGTATGTACATGTGTGCGTGTGTATGTGTGCATGGTTGAAAATTTACAAATTTAGTATTTTCCTAGAAAGAAAATCTAATTATGCTAAACAATTCAATTGAAAATATTAATTAATTAGGGACGAAAATGAAATTCTCGGACTTTTTTGGATGTTTTTAGAGTATAAAAATCATTTAGATACAAATGTTCAAATAGGTGAATGTAGGGTTAATTTAACTTATATATATATATAAACTAAATGGTATTATAAATATGAAATTTGAAAGATTGATCACGTTTACTTTTTTTTTTTAAAAAAAAGAAGATAGATTAAGGTTTTTCAATATACTTTTGCATGTCTGTTTAGATATATAGTTTTAAATTTTCATTTCACTAATTAAATTAACTTATTAAAAGAAATTGAAACTAAATTAGTTCTTAATTCCTAATGTCTACAGGATGTTTGCTTGATGACCATGATGCAAATAGACAAGAAATTAACATTCATGACTTTTTGAGATTTTTGGTTCATCCTAATGAAAATTTAATATTTGTGGTATGTATGTATATATGTGTGTGAAACTTGAAAGAAACACATTACATTTTCTTCCACTAATGTTTATGATTAAATCTTGTTTTCTATTTATTGGTTTAAGGTTGGAAACTTCAGAAATTATAGAGTGCTTGAGGAAGAAATTGATCAATACGTCAAAGGTACCAAATTAATTTTACTTCTTTCTCTATCCATTTACAGTGAATTAATCTTGTATAACAATAAATGACAAATATAGCAATTAAATTTGGTATGTTTTTAGAGTTCAAACTATATTTTAATTTGGATTGTACTAGTTTATATTTCTCTATTATTTAGTCTAATTAAAAAGTAGTATATAGAGAGGAGAGGATAAATGAATAGAAAATAAACAACAACATCCAAAAATAAATATAAAAAAAGAAAAAGAAAAAGAAAAAATAATAACATAAGTTGTTGGAACACCCTCTTGAACTAAACCCTAAACCATAGG
AACACTTCCAATCATGGAAAGAAGCCATCATCGTGTTCGTGGGTGGGTCACACCATCGAGGTTGGAGCAATAGTTGAAGCGGTCGTCATCGAGGTTTGTGGCCTAAAGGAAGCTCACCATCGTTGGGTCTACCCTAAAATAAGAGAGGGGATGTGAGAGAGAAAGAGATAGTGACGTGAGAGAGGAAGAAAGCCAAAGGAAGTACATTATGGTGACCGCTAGGTGTAGGCTAATGAAAGTTATGAAACCCTAATTTTAATTCTTTAATTTTATTGGTTTGGTTTGATTTTTTACACCCCTAATAAATTGTAAACATATAACAAATAAGGTTTTTAAATAGTAATAGTCTAGTTATTAAGTGTATGCTGTAATAGTTAAAAACACCAACTATTGTAACTGAGATAGTTGCAAACAAAGTAGCATGTCCAAAATGTTATCAGATTCAGCACATTGCAAAAAAAATTGTAAATTTAGTAAAATTGAGATCCAACTCCCGAAGTCTATTGCCGATGGACTATTACCGAAAAAGAGTCTTCTTTCATTGATAAAATTTTCTATAGTTGTTTTTTGTAAATGTTGTTACGTACACATATCCTTATTAGTCGTGAAAATGTTCGAGGCTCTAAACCTGAGGTGAATTAAACAAACTATAAATGTTGTTAGCTTTTAAAATGTTTAAGTAATCCTATATAATATCATTTTTTTTTTTTCCTTTTCATTTTGTAGTTTCGGATATTGGAACACCAACAAATTTGTGTGTGAAGTTCATTTGTCAAGAATGTCAACGAAGGTGGAACCTTATTTAAAATTCAATATATCTTTTTGATGGCTATCACCTCCTTTTGTTTCAATAAACATCTAATAAGTCCAATATTCGTTTTGGATGACCATATCTAGTGTGTTAATCATGTGGTTCTCAAATTTAACTTCTTTACTTGAATAAAATGTTTAGTTAATTCACATTTTTTCATTATACTTATATTTTCAAATCTATTAATTATGTTTTACTACCATTTGGAAGAAGCTTTTTTTTCTCTCTCTAGTTTGTAGTTCTTAAATGATATATTTATTAGTATAAATAATTAATGTGTATTTCATATTTAGACTCGTGTAACTTAATCTAACAAATTAAGATCCAAGATTATTTAATATACCTTGTACAATGTGTGTAGACATACAACTAAATCATGTTCAAGTGATAATCTAAAAGATCTATAGTCACTACAACAAAACCTACTTACTATGACAGTTGTATATATTCAAAATTGGAAAGAAGAAAGAAAAAAAATGTCGCATTTTTTGCGAGAAAAGACTTTTTTCAGATTCGAAAAATTTATGACAGATTTTAACCGTCATAAATCTATGACAATTTTTAAATGTCATTAAATATAAATAATAAAATTATATTTTATTAAAAATTATTAATATATATTTCCCTCCTTAATTTCCCTCCAACTTTTTATTAAAAATTATTAAAATATAAACTTACTTTTATTTCTTCTTTCTCCTTATCATCTTCACGATAGAACCCAAATAATCCAACCTCAACCCATTCTTCTCCATTTCCATGAACTTTCTTCTCAACGACGGATACCCACGGCGACATCAAATCTAAACTTTTGGAAGACAACAACCCACACTCTTTTTAAAGATTTTGAAGACAGGTTAGTGTTTAATACTTTTCACTTTTGTGTCTTTCCCTTTTCTTATTTTAATTAGTAGGGTGTAAGAATGGAACAAAGCCATTTGTGAATAGTTTCTCACATTTTCATTCACCCCCTCCCTCATATATATGTGACCTATTTCTCTGATTGTTTCCTCATAAAATCTGAATAACCATATTGTTTACGGTAGATCAAGGAATCTTTGAAAGTTAGACAATGAGCGTTTAGTGGGGAAAAATTTTGAGTACAGAAGGGGTTTGTGAAAATGTCTAAGAGAGAAAAATAGTGTTTTTTAGGATGTGGATTGTGGCACTATTTTATACAATTTATTTTCCA
AGCATATTTTGACTATGGAACTAGAAATGGACTTTTGGAATCCAACCCATTTACATTGCTAGTGGAAGAAAATAATGTGCAAACTCATGAAGTATAACTAATGGTTTGTTAATCTTTACATTTGTCTAGTGATGTTTTGACTAAGTTGTTTTAATTCACCTTATAGGGATGAAATTCATGGAGGAATCCATAGACTTAGTGTCTTTTTTTATCTTGTGAAACATGGTTATTTTTAAAATATTGTGCATAATACATTCTCAATAAATGTGCGTTCAATTGAAAATGAAAGTGGAACGGATTTGTGATTATCATTGTTGTTTTTAGATGACAGAGTTATATGCTCTTTAGAATCTAAAATTTGAATTTAATTGTCTCGTGCAAAATATAAGTTGAATTATTGTTTATATGGTTTTTTTTTCCTCTAAAAAAAGAAATGGTGAACTATTCTCTCACATTATTTTGGTAAAAATGAAGAGGATTTTGAAAGGTCTGTTTTAGGGAAATAAATTGGGAGTTTTTTCTCAACTAATATTAGTTCATTGATTGGGAAGAAAACTATGAGTAAAGGGTTTGTATTGTGATTCATATAAAATAAAGAAGAATAAGAGTGATCATTATGGTTATTATTGTTCTTCATTCCAAATTCATCATCTTGAAAAGTTTGAATATTATCTCTAATTGTGGTCACCATTTTTATGATGTGATTTTGTTCTGTACTTGATGTTGCTTCAATCAAATCATTCATCTCAATTTGTTTCCTCTTCTCAACAATTTCGCTCTTGGAAGATTCAATTCTTCCTTCCACAAACTACAAAATGTGAAAGCTAATTTATGTGAGATTTTTTTTTGCCAAGAAAAATTGATATTGATGCACCTTTTGACAGCCGTTATATTTTTTAAATTTGAGCTTAATTTTTACATTTTTTTAACAAAAAACTTAGAGTAGGTCTGCTAAGAGGGACTGGTAGGTATACTTAGACATCTCAATTAGATTAACACATCTATCATTACCCCTTTCGTACCTAAACCCAATAGACTATATATCATTGGCACAAAAGATTAATTACCAAAGCGTTCATTGGCACAAGTAGTGAGAAATAAAACCAACAAAGGCGAAAACATTGTGGTATCTGATTATGAACCTTGTAAATTTTGATCTTTTGCAAATATTCATTTATTTTTCCTTAGGAAAAGGTCTGAAAATATTGAACCCATAGTTGGAAAATGGGTTTCATACCTTAATAGGGAGAAGAAGTTGGATTCACTTGTAGGTTGATGTCCTTTTGCTCCACATGTTCCTAGAGGACTGTACATATATGACAATGTTGGAAGTGGTAAGTTTTGACTATGATAAAATTTTATGTAACAAGCTTGAATATTTGTTATCAATTCATAGGGGATTTATGTTTATAATCTGACTATCGCTCTCATTTATGCTTATAATCTTAGAATCAATTTAATTTTGATTTACTGAATTAATTGCTAATAATCTAGGGAAGACAATGCTTATGGACATGGTTTTCAATGCTATTGAAGAAATTGTTAAACATAGAAAAATGTACCAATTTCATGAGGTGAGACATTATTCTAGTTTAGAAGATTAGAAACACAGGAACATACAGGAAAAACTAGAAAAGAGTTTTTCTAGTTACACCATAGCAAGGGAGTAGCGAAGGTATGAAGGGGATGCAAAAAAACTTCCTAATAATTCTCAAACCAATGATTAAATTTCTCTTTTTTATTTAATGATATTGGGCGGAGCTATGTAAAATAATTACTCCTATTATGATTTTCCAATAACTTTTCCTTTCTAGGAAACACTAGAAAAGCGTTGAAGAACCAGATCTATTGAAAAACAATATCACAGGTGAGATTTTTGTATTTAAAACAAATTAAATATGTACTTACATAAAAAAGTATTATCTCTCTCCCTTTTTGACGTGTTTGTATGACTACAATTATCCACAGTAACTCATCATGCTTTCCTAAATATTGTTTGT
TGTTTTGTTGATTCATTTAAGGTTAAATTTTATACTTTATAGGGTTATTCTCTTGTTTGCTGATTAGACACTACTACTATTTGCAACAAAAAATTATACATTAAACTTTGGAGTATGTTAGGGATTGTATCTGACCATGGTAACACAAATATCCTTGCATTTGAATGACTTTGGAGTATTGTTATTATTGACTTATCTTATGAGTTATGCTTTAATTGTGCATAAATGGATTCTACTTTGGAAAATGAACTGGAAAAAGAAGTTGATTACTAAGAGATTCTTATTTTGATTGTGAACTATTAGAATGTTTGGTAATTTTAGGTTTACGTGAATTGCAGAGCAGAAATAAACAATAGCTTTCGTAAATCAAAGTTAGTTTGGATGAGGCTGAAGTTTTGGATTGTTTAAAAAGCATATTGTTTTGTCTTGATGTATATATATTTTGTTTATTTTATTGTTCTTTAAAAAAAACCTAAAAACTATTAGTTTTTTCCCTAAAAAATGGTTATTAAAAATTGATTATCAATCCATCACATTGAATAATTTGTTTGTGTAGTGTAAAAATAAGGTTGAAATTGAATTTTTTTGCGGTTTACTAACTATAGGAAGGAATGATTTAATTACTAACTTTTAGAAAGAAAGCTAGTGTCAATTACTGTTAAGTTAAAGTCCCTTCCTTTGTCTATGTTTATATAACAAAAAATAAGTATTAACTCATTAGGAAAAAGTACTTATCAAATTATTTTTGTAGACCTTTATATAACAAAAAAATTAGGTTGAAATCGTAGAATGTTGGTTCTCTCAATAGACATTGGCTTGAAATGTGAAAACTTGAATGTTGGTTCTCTCTATAAAGATTGCTAGTTCTCGCTCTATTTTCAATTGGACAGGAAAACTTAAAACAAACTATGTTGATGCTTGTTGTATTTTTGTTTTTTCTGCACCTTCAAATTTTATATGTTCTAAGGCATGCTTCTGTATTTAGGTTGATGAAAACTTTAAATCAATTTCTCCAATGACGTCTTATTTATTTATTTTGGTTTTGAAATTGCAATATGTTACTACACTAATAGCATGAATCTAAAATGTTAGTAGACTATTGTATATGATATTATGGCCGCTTTTAGTATCAAACTTTCTCCAAATTTCAAGACCAGGTAAGTGATATATAGTAGGCAATTTTTCTTGTAGTGGTATATCATTTCTTTACCTTGTCCATTTTAACTGTTTTTGTTTACTTGTGAATTGATTTGCTTAGATAATATTCTTTAATCATACTCTAAATGAGATTTTGTATTCTTACTAGGTTTAGAAGAATCTATTGAATCAGTGAAGCCAACGTGAAGTAACGGATTGATATAGGACTTTTTGTATGGTTGGTAAAAAAATATTGGTTTCAGTGTACTGATTTTAGGCTTGTTGATACCTAAAGATGTTAATGTTTATTCTTTTTGTATGTAAACTTCCTTTTAACAAAAAATGGTTATTAATCAATGTTTGTACGAATATTATAAAGTGTATTATAACGTATATATATTAAAGTGTTGACTCTTTGTTTTCATATGGGTGTGAAGTATTAGAACATTATTGTAATGGGAAAGCGACAAAAATAACAGGGAAAAATGTAGTAATTGAGGAATATACGACATGGAAAAATGTCATAACATACATATTCGTGACATTTTTTAACCGTCGTCAATAGCGAAACAATGACATTTAATTTTCTAAAAAAACTGTCATGATTGACATATATTTGACAGAGAAAATTGTCATAAAATACGAATTCGGGACATTTTTAAACTGTCGTCAATTTTGAAAAGATGACATTTATTTTTTAAAAAACTGTCACGATTGACATATCCTTGACATTTAAAAAATGTCATAATAAACGAATTTGCGATATTCTTTTAACTGTCGTTAATACGCAAATGATGATAGTTATTTGTTGTCATAATATAAAATATGACAGTTATTTGTTGTCATAAAATAGTAATTAATGA
CATTTATTTTCTGTCATGAAAAATCTTATTGCGACAGTTTCTGAAAACTGTCATGAGATTTTATTCGTGACAGAAAATAACTGTCGTCGTAGACCATTTTTATTGTAGTGAGTATACAAATAAGGTTGGGTACCTTATCTTGGATCACTATGAATATGGGTGTAACTTCAACCCAAATAGCTCAACTACCCAACCCATATATATAGGTTGGGTTGAAAATTTTGGTTTTTTCGAGTTTGATTGGGTCTTGGATTGGAGGTTGAAAATTTTGATTACAATCCATCCTAAACTCGAATTAGTAAAATATATAGAGAAATTTCCATAAAAAAAAAAACACCTACTATTTTTTAAAGATATTCCAGGTTTGTTTTCCTTTCCACCTTCCTACTTTTCGACAAATTTTCTTTTCCTTTTCCTTTTCCTCTTTTGCAATTTTTCTTTTATTTTTTTTTTCAAATTATTATTTTGTTCAAGATCATGTACCAAATATAAAAAATCTTGGTGCACGATCTTGAACAAAAATGGTTGGGATATTGGTACAAAATTATTGGAAAAATACCAAAAATCTAACAGAACTAATTCTGAAAGATCGTGTAGCCAAATCTAAAAGATCCAAATCTGAAAGATCGTGTAGCCAAATCTAAAAGCTTTCGGCTATGGTTGTTCGACAGGTGTAGAGCCAAATTCTTTGATTGAATGTGTCTCAATTTTGCAATTACCTATTTCAATTGAATTGCATCTTTTGGACGTTGAAATAGATTTTAAATTTTGCAGGCATGAGGTGGAGCTTCCCATTAATGGCCAATTTTCTAATCTTTATGTTTTTGCGAGAAATTTTTTATGTAGCATATGTTATTTCAACTACAATGTACTTGAGGTTCAGGTAAATAATTGTACTTTTTATATTGGATAGGGGCATTTGAGGAATTGTTTTATTTATTTATTTTTCTAATTTTTATGTGTTGCTATTATAACAATTTAAAATTATTTTTTGCTGTTTTCTCAAAGTAAAACTCATAATTTGCTATTCTTCTTGAGAGCCCTAAAAAAAAGACTATACAACCCAACCCAACTTGTTTACACCCCTAACTATGAATACGACCCACTTTGTAAATGTTACAAACAATTGGATCCAAATTGTTCATGAAGAGACATGTGAGTGGGATATGCAATACTGAGAATTTTTATAAGATTGGATCACAAAATATATAATATCTCTTTGTAACACCATTGATTGAAGAGATTAGTATTCCAAAATGATAACCATATATAACTTGATTTCAATCTTGAGAGTGACTCAACTCGGAGACTCAATATGTCTATCATTTTAAGGACAAAACCAAATAGAGGAGTTGGGGACATAGCTATCGTCACACCCCTCCCAGATTATCTTTTAAACATAGGAAGAGGTGTGAAGACAACAAATATCATATCTCTTATGACACTTATCGTCCAAACTAATCCTGCATTTCGTGAAAATTTTCTACATTTTCTCAAAATATTACCATACAAGCACCATTTAAAACTAAGTACAAATGTTTTTCATTCCTAATTCAATTCATATTTATCTCAAAGTGTGTTTAATCTAGTGGGCCAATCATGTAGACACTTACATAATTTTATCAACGTTCAACTTCAAAGATGTGACAAATGTCACTTTAGTAGCACCTGAATCCCTAATGAGGAAGAACATTTTGAGTTGAATAAAGGAAAAAATGGACATGCGCAGTGGAATACGAATCAATTTTACTAATTGTTTATAATCCTATTAGAAATTAACATGCTTCTCCCTAACTAAACTAAATTAGGGTTTAATAGTAATAGAGATTATGTCTTCGTAGCTCTCGATTGCTCATAATCTTTGCATGAACACAAATAAAACCACCACTAGTGTTGACCCGCTATTCTTCAAATTTAGAACTAAGTTGTGGGACCCAAAGAGTGAAGGAGTTGAAAGAGAGAGATGGAAAAAAAAAGGAGGAATGTCACAATTTTTGGAC
AAAAAATTCTTTGATGAATTAATGTAGAAAGTAAAATAATCAAAGGCCAACAAATTTTCAACATTTGAAATTGCCCTCTTTAAATAAGGGTTGCATGTAATAACTCACCAAAGTCCAACACCTTACAATCCATTATCTCTTAGTGGGCTTTAGTGGGCTAAGTGTACATCATATCTAACCACTTATCCCACTAAGTATAAACCCCAATTCCCTTAGATATCAAGTGATAAAGAATAAATGAAGATTGAGTTCAAAGATAAAATGGAGGGAAGGAAAAGGAAAGAAGGAAAGAAGTTCAAATTTTGGCTTTTAGTCTTATAAGACTAGGCGTATTAGGTGTCAAGAACCTCCAAAGGACATGTCCTAGGCAACCAAAGAGTGGTGTGAGGTGTTAGGCTTTGAGCATAGTTTGGGTGACCAACCATGTCAAGAAAAATTCTCTTCGCTTTGAAGTTCCTTGAGTTCGACCCTGCAATTACCAGGAAACTTCTCTTCGCTTATACTTGGACGAAGAGGGGGGAGCTTGCATGACAATATTTTTATCTTGAATATTAATGCATGATTAGAAATGCAAATTTTATCGCATAATCCAAATCCCCCACTCAACAGTTGGTACTTTTATAGCTAATAATGATAATCTCCAAAAGAGTACTTGAGAAAATCTAAGCATGAAATAATACGAAGCAAGTTTTTGGCGTTGTTGCCGAGGAATTTCTAAAGTTTTCTCACTTAACTAATTCATGTGCTAAATCATGCAAAAGTTCAATTTTAGTTGAATCATGTACTTTAGTCGAAAAGTCTACGAACAAGTTTATGAGTGCAGGCGAACAACTTGAATTCCAACTTGACCTTGAGATTGAGCAAACATTTTAGAAGTATCATAGACAACGACAAAGGCAGAATCAAACCAACATGGAGAATCCAAATAACAACCCTGTCAACCAACAACAAGCCCCATATCATAACCCTACTTATCCCGCCCATGATCTAGATCGCTCCATCAAATCATATGCATAACCAAGCCTTTACGATTTCAACTCAGGCATAACATCTCTCATGTTTGGGGAAAATTCTCAATTTGAGATCAAACCAGTGATGCTTCAAATGATTTAGAATGCGGGACAATTTGAAGGACACACATGTGAGGACCCTCATGAGCATATACAAAATTTATATTTCATATGTGCGTCGTTTAATATGCAGAAATATCACGCTATGAACTACATTTAGCTTTATTCCCGCTTACCTTATGCAATGAAGCCAAATAATGGGCAAATTCTTTAGAAGAAGGGGAAGTTACAACTTGGGATAGTCTGATTGAAAAGTTTATGAAAAATTTCTTCCAACCCATTGAAAATGCGAAAAGAAGGTAGGACCTGATGACCTTCAAGTAGAGGGATAGCGAGAATCTAATTGACATGTGGAGAAGGTTCAAGCGAATAAACAAGGGATGTCCTCATCATAGCATTCTGGGATGTTTCTTGATAGAACAATTCTATTTTAGGTTAAGTAGAGTTACACAACAATCTATAGACGTTGTGTTTACACGTGGGATCATTGAATTATCCTATAACCAAATTAAAAAACAGTTGGATGCCACGACCAACAACAGTCAAGAATGGAGGGATGACGACTTTGACTCGTAAAATGAAAACAAAGGAAATAGAAGAGAGTGTGGAAGAATAGAAGAAGGATTCGATAGAAATGCTATGGTAGCATTGCAAAGTCAAGTTACTGAAGTGAACAAACTCTTACAGTCCATGGCTCTATTGGAAGTCAATGCTGCTGGGAGTTCTCTTCAAATAGTACATCAGACAGCTGAGTTGGGTTTTGTAGAATGTGGTGGACCGTATAACACAGACACATGCCCCATATGTAAGAAGACAGTTTCTTATGTGAAACATGACCCATATTCCAAAAATTACAATGAGGGTTGGAGAGACCATCCAAACTTTAGCTTGAGGGGTCAGAAGCAAAATGCACCACAATGATAAGGCGAT
CGATTAAACTATAGGGGCGAGGTGTCTGGCTACTGCCCAAGGAAATATCATGAATGGCCACAACAACCAATTCAACACCTGAACCATCACATACTTAACCAATCACATTCTTCATCATCTTCATTGCCATCAATGGAATCTCTATTTCAAGAATATATGCAGAGGAATGATGCCCTTTTGCAAAGCCAAGCTGCATCTATTAAAAATCTAGAGTTACAAATGGGACAAATAGCCAACGATATATCTAGGCAACCGAAAGGAACCCTCCCTAGCAACACAGAAATACCAATTCAAGGAGGGAGCTCAGGAAAAGAAAAGTGTCAAGCAGTGACACTATAAAGCGAAAGGAACTTATCCATCCGCGAACCCAAGTCTGAACGTACCTATGCTGCTGAGATTGATAGCTTAAGTAATAATTCTCATCCTTAAACTTGTCTTTAAATAATGATAATTCTTCCTTGCAGAATAAAAATGTGTCACACGAGGAAGTGGAGCATTTGAGGTGAGAAGAGCAACGCCGCAAAGCGTCTAACGAGGCAACGAGCTCCAATCCTTTACCACCCCCTCCATTTTCTGGTCGCCTAAGAAGAAAGACAACGAACAAAAATTCGATAAGTTCTTGAACATGTTGAAGCAGTTGCACATCAACATCCCATTTATTGATGCATTAGAGCAGATGCCAACTTATGTCCAGTTTCTAAAGGACATCTTGGCAAGGAAGCAAAAAATTAATGATCTAGAAACAGTAGCTCTATCACAAACAACAAGCGATATCTTCAAAGAAGGGGTGCCAGCAAAGATGACAGATCTTGCAAGTATTAATTTAATGCCTTTGTCTATCTTCAAGAAATTAGAGATAGGGGACGTACAACCAACGCTAATGAGGCTCGAGTTCGCGAATAGATCCATTGCCAATCCAGAGGGTAAAATCGAAGATGTTTTGACAAAAGTGGACAAATTATTGTTTCTACAAACTTCGTCATTTTGGACTACAAAGTTGATTGAGAGGTACCATTATTTTAAAGCGGCCATTCCTTGCAACAGGCCATGCACTCATAGATGTCCACCAAGGGGAAATGACCATGCGCATGAACAAAGAGGAAATAAAATTTAATATTATCAACGCGATAGAATTTCCCATTAATGTTGAGAACTGTAGTGCAACAAAGGTCCTTGGTTCAGACTATTGCAAGGAAGAGGTATATCAAAAGTTGTTTAGCATTGAAGAATTCTTTAAGGATGAGCCAAGTTTGCTTGAAGAAGTGAATGTCATAACTGACAAAAAGAAGTTTGAACCTTTGGTCGTGCAAAACAAAGGTGAAAATGAAACAAAGCCATCAATTGAAGAACCACCAGAACTAGAGCTCAAGTCGTTGTCGCACCACTTGCAATATGCATTTCTAGGAGAAAATGACATTGTACCTATCATTATCTCCACATAGCTAAGTGGCCTCGAAGAAAGAACCTTATTGAACAGCTTAAACGCCACAAGAAGGCGAGAGGATGGACCATTTATGATATCCAAGGGATAAGCCCATCTTATTGCATGCATAAGATTAGGCTGGAAGAGGGACAAACAAGTACCATTCAATTTTAGAGAAGGTTAAATCTTGCAATGAAGGAGGTTATAAAATAGAAAATGATTAAATGGCTTGACGTATGACGTAGGAGTCATTTATCCTATGTAGATAGTGAATGAGTCAGCTTAGTCCAATGCGTCCCAAAGAAAGGTGGGATGACTGTTGCGAAGAATCATAAGAACGAATTAATACCAACACGAACAATCACGGGGTGGAGGATATGCATGAATTACTGCAAGTTGAATGCGATCACCAAGAAGGACCACTTTCCATTACCTTTCATTGACCAAATGTTGGATCGGCTTGCAGAAAAAGAATATTATTATTTTCTAGATGGTTATTCTGGTTATAACCAAATTACAATAGCACCTGAGGATCAACACAAGACAACATTTACATGTCCTTATGATACATTTTGAT
TTTGCAGGATGCCATTTGCCCTATGTCGCCAGAAACATTTAAGAGGTGTATGATGGCTATTTTCTCGAGCTTATCTTGAAGAGATCAATTGAGATATTCATGGATGATTACTTGATATTTGGCAAACCTGTCAAAGAATGTCTCAACAGTTTGGAGGAAGTACTTGAGAAGTGTGACGAGACACAACTCGTTTTAAACTGGAAGAAGTGTCACTTCATGGTAAAAGAAGGAATTGCGCTTGGACACAAAATCTCGAACGCAGGACCAAAAGTAGATCTCTCAAAGTTTGATGTAGTTAGCAAGCTGCTACGGCCTTGTAATGTTAAGCCATTGAGAAACTTCCTAGGCCACACTAGGTTTTATAAAAGATTCATTTGTGGGTTTTCCCAAACTGCCAAGCCATTGAGTAATCTGTTATGTGCATATCAACCCTTTATTTTTTATGACAAGTGCAACCAAGCGTTTCAGGCTATCAAAGACGCATTGACCTCAACGTCTACCTCATCGTGCCAGATTGGTGTCAACCATTTGAACTCATGTGTAATGCGAGTGATGTAGCTGTAGGGCTATGCTGGGTCAAAAGAAGAATAAAGTGATCCATCCAATTTACTACGCGAGAAAAACTCTCAATGAAGCATAGGAAATATATACCACCATCGAGAGGAATTACTAGCAGTAGTTTTTAGTTGAGAAGTTCAGAAGTTATATCATTGGCTCCAAAGTTATGGTGGAATCTAATCATTCTGCAATCAGATATCTCATGGTGAAAAGGGATGACAAACCACAGTTAATAAGATGGATTTTATTACTCCAAGAGTTTGATGTGGAGATCATAATCCGCAAGGGCACAGAGAACCAAATGTCAGATCACTTATCTTGTCTCAAGAATGCAGAATTTCAATGCGAAAAGAAAGATATTGAAGAAAGATTCCCAGATGAGCAACTCTTCCATGTTGAGATAAAAGAGCCTTGATATGCCACTTTTATAGGTGGGGTGAACCTTTTCTTTACAAATTAGGTCCAAACCATCTATTAAGGCGGTGTTTCCCAGAATATGAAACAATTGACATATTAGCTAAATGTCAAGAAGCACCATGGTGGACAAAGGATTGGTGCAAAAGTCTTGTAGAGTGGGTATTTTTGGCCTACTCTTTTTTAGAATGGGAGGAACTTTGTAGTCAAATGCGATAAGTGTCAAAGGATTGGAAATCTATCTCGTCATGATGAATTGCCACAAAAATCCATCTTAGAGCTCGAACACTTCGATGTGTGGGGAATAAACTTAATGGGACACTTTCCTCAGTCAGGCACCCATTTGTATATTTTACTGGCTATAGACTATGTCTCGAAGTGGGTTGAAGCAATCTCCTGTGTTATGAATGATACGATTATAGTGAGTAAATTCTTGAAAAAGAACATCTTTACGCGTTTTGGAATCCTTAGAGTAATTATAAGCAATGAAAGGTCCCACTTTGTTAACCACATCATCACGAAGCTACTTGCAAAGTACAACATCACACATAAGAAAGCCACCACCTATCACCCACAAACAAATGGTCAAGCAGAAGTATCCAACTGGGAAATTAAAAAAATTCTAGAAAAAGTGGTAAATCATTCCTACAAGGATTGGGCAGATCACCTGGATTCTACACTGTAG

mRNA sequence

ATGGAAGAAGGCAGTGGCTCAAATGATCACATTGATCACCCAAAAGCATTCTTTATTTTTCATAATATTCGACATGTTCTTCCTGATTATTGGGTTGGAAACTTCAGAAATTATAGAGTGCTTGAGGAAGAAATTGATCAATACGTCAAAGGGAAGACAATGCTTATGGACATGGTTTTCAATGCTATTGAAGAAATTGTTAAACATAGAAAAATGTACCAATTTCATGAGAATGGGAGGAACTTTGTAGTCAAATGCGATAAGTGTCAAAGGATTGGAAATCTATCTCGTCATGATGAATTGCCACAAAAATCCATCTTAGAGCTCGAACACTTCGATGTGTGGGGAATAAACTTAATGGGACACTTTCCTCAGTCAGGCACCCATTTGTATATTTTACTGGCTATAGACTATGTCTCGAAGTGGGTTGAAGCAATCTCCTGTGTTATGAATGATACGATTATAGTGAGTAAATTCTTGAAAAAGAACATCTTTACGCGTTTTGGAATCCTTAGAGTAATTATAAGCAATGAAAGGTCCCACTTTGTTAACCACATCATCACGAAGCTACTTGCAAAGTACAACATCACACATAAGAAAGCCACCACCTATCACCCACAAACAAATGGTCAAGCAGAAGTATCCAACTGGGAAATTAAAAAAATTCTAGAAAAAGTGGTAAATCATTCCTACAAGGATTGGGCAGATCACCTGGATTCTACACTGTAG

Coding sequence (CDS)

ATGGAAGAAGGCAGTGGCTCAAATGATCACATTGATCACCCAAAAGCATTCTTTATTTTTCATAATATTCGACATGTTCTTCCTGATTATTGGGTTGGAAACTTCAGAAATTATAGAGTGCTTGAGGAAGAAATTGATCAATACGTCAAAGGGAAGACAATGCTTATGGACATGGTTTTCAATGCTATTGAAGAAATTGTTAAACATAGAAAAATGTACCAATTTCATGAGAATGGGAGGAACTTTGTAGTCAAATGCGATAAGTGTCAAAGGATTGGAAATCTATCTCGTCATGATGAATTGCCACAAAAATCCATCTTAGAGCTCGAACACTTCGATGTGTGGGGAATAAACTTAATGGGACACTTTCCTCAGTCAGGCACCCATTTGTATATTTTACTGGCTATAGACTATGTCTCGAAGTGGGTTGAAGCAATCTCCTGTGTTATGAATGATACGATTATAGTGAGTAAATTCTTGAAAAAGAACATCTTTACGCGTTTTGGAATCCTTAGAGTAATTATAAGCAATGAAAGGTCCCACTTTGTTAACCACATCATCACGAAGCTACTTGCAAAGTACAACATCACACATAAGAAAGCCACCACCTATCACCCACAAACAAATGGTCAAGCAGAAGTATCCAACTGGGAAATTAAAAAAATTCTAGAAAAAGTGGTAAATCATTCCTACAAGGATTGGGCAGATCACCTGGATTCTACACTGTAG

Protein sequence

MEEGSGSNDHIDHPKAFFIFHNIRHVLPDYWVGNFRNYRVLEEEIDQYVKGKTMLMDMVFNAIEEIVKHRKMYQFHENGRNFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKKATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL
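
The protein sequence corresponds to a translation of the coding sequence. The following is a minimal Python sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper name, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical helper for illustration only.
def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGAAGAAGGCAGTGGCTCAAATGATCAC"))
print(translate("GATTCTACACTGTAG"))
```

The two fragments translate to `MEEGSGSNDH` and `DSTL`, which match the start and end of the protein sequence shown above.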
BLAST of CsaV3_5G015310 vs. NCBI nr
Match: XP_023521407.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 209.5 bits (532), Expect = 1.2e-50
Identity = 100/166 (60.24%), Positives = 125/166 (75.30%), Query Frame = 0

Query: 77  ENGRNFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAI 136
           +N + F   CD+CQR GN+S+ +ELP  SILE+E FDVWGI+ MG FP S  +LYIL+A+
Sbjct: 206 DNAKEFCKGCDQCQRTGNISKKNELPLNSILEVELFDVWGIDFMGPFPPSYGNLYILVAV 265

Query: 137 DYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNI 196
           DYVSKWVEAI+C  ND   V KFLK+NIFTRFG+ R +IS+E +HFVN ++  LL +YN+
Sbjct: 266 DYVSKWVEAIACPSNDGKTVLKFLKRNIFTRFGVPRALISDEGTHFVNRLMNSLLERYNV 325

Query: 197 THKKATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
            H+ AT YHPQTNGQAEV N EIK ILEKVV  + KDW+  LD  +
Sbjct: 326 KHRVATPYHPQTNGQAEVYNIEIKSILEKVVQPNRKDWSVKLDDAI 371
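
The percentages on each HSP statistics line are the match counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from the line itself; `hsp_percentages` is a hypothetical helper name, and the pattern accepts either spelling of the positives label.

```python
import re

# Hypothetical helper for illustration only.
def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from an HSP statistics line."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)", stats_line)
    if m is None:
        raise ValueError(f"unrecognised HSP line: {stats_line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 100/166 (60.24%), Positives = 125/166 (75.30%), Query Frame = 0"
identity, positives = hsp_percentages(line)
print(f"{identity:.2f} {positives:.2f}")
```

For the first hit this reproduces the reported 60.24% identity and 75.30% positives over 166 aligned positions.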

BLAST of CsaV3_5G015310 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 206.1 bits (523), Expect = 1.4e-49
Identity = 101/160 (63.12%), Positives = 118/160 (73.75%), Query Frame = 0

Query: 83   VVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVSKW 142
            V  CD+CQR+GN+SR  ELP K+ILE+E FDVWGI+ MG FP S   +YILLA+DYVSKW
Sbjct: 1430 VKTCDRCQRMGNISRRQELPLKNILEVELFDVWGIDFMGPFPPSFGFVYILLAVDYVSKW 1489

Query: 143  VEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKKAT 202
            VEAI+   ND  +V KFL KNIFTRFG  R IIS+E +HF N +   LL+KY + HK A 
Sbjct: 1490 VEAIATTTNDAKVVLKFLHKNIFTRFGTPRAIISDEGTHFCNKLFDNLLSKYGVKHKIAL 1549

Query: 203  TYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
             YHPQTNGQAE+SN EIK ILEK VN + KDWA  LD  L
Sbjct: 1550 AYHPQTNGQAEISNREIKNILEKTVNTNRKDWAKKLDDAL 1589

BLAST of CsaV3_5G015310 vs. NCBI nr
Match: XP_020424472.1 (uncharacterized protein LOC109950324 [Prunus persica])

HSP 1 Score: 204.5 bits (519), Expect = 4.0e-49
Identity = 99/162 (61.11%), Positives = 122/162 (75.31%), Query Frame = 0

Query: 81   NFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVS 140
            NF VKCD+CQR+GN+SR +ELP K+IL +E FDVWGI+ MG FP S  + YIL+A+DYVS
Sbjct: 1438 NFCVKCDRCQRMGNISRRNELPLKNILFVELFDVWGIDFMGPFPSSFGYTYILVAVDYVS 1497

Query: 141  KWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKK 200
            KWVEAI+   ND  +V KFL+ NIFTRFG  R +IS+  SHF N +   L+ KYNITH+ 
Sbjct: 1498 KWVEAIATKTNDHKVVLKFLRDNIFTRFGTPRAVISDGGSHFCNKLFEALMKKYNITHRV 1557

Query: 201  ATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
            +T YHPQT+GQ E+SN EIK ILEKVVN + KDWA  L+  L
Sbjct: 1558 STPYHPQTSGQVEISNREIKHILEKVVNSTRKDWAAKLNDAL 1599

BLAST of CsaV3_5G015310 vs. NCBI nr
Match: XP_024634412.1 (uncharacterized protein LOC112420042 [Medicago truncatula])

HSP 1 Score: 203.0 bits (515), Expect = 1.2e-48
Identity = 95/163 (58.28%), Positives = 120/163 (73.62%), Query Frame = 0

Query: 80   RNFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYV 139
            + FV  CDKCQR GN+SR +E+P K ILE+E FD WGI+ MG FP S ++LYIL+ +DYV
Sbjct: 1169 QEFVRHCDKCQRTGNISRRNEMPLKGILEIEPFDCWGIDFMGPFPSSYSNLYILVCVDYV 1228

Query: 140  SKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHK 199
            +KWVEA++C+ ND+  V  FLKKNIF RFG  RV+IS+   HF N  +  +LAKYN+ HK
Sbjct: 1229 TKWVEAVACIANDSHTVVNFLKKNIFPRFGTPRVLISDGGKHFCNKYLASVLAKYNVKHK 1288

Query: 200  KATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
             AT YHPQT+GQ EVSN ++K+ILEK V  S KDW+  LD  L
Sbjct: 1289 VATPYHPQTSGQVEVSNRQLKQILEKTVASSRKDWSKKLDDAL 1331

BLAST of CsaV3_5G015310 vs. NCBI nr
Match: XP_016648946.1 (PREDICTED: uncharacterized protein LOC103328625 [Prunus mume])

HSP 1 Score: 202.2 bits (513), Expect = 2.0e-48
Identity = 98/162 (60.49%), Positives = 122/162 (75.31%), Query Frame = 0

Query: 81   NFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVS 140
            NF VKCD+CQR+GN+SR +E+P K+IL +E FDVWGI+ MG FP S  + YIL+A+DYVS
Sbjct: 1052 NFCVKCDRCQRMGNISRMNEMPLKNILFVELFDVWGIDFMGPFPSSFGYTYILVAVDYVS 1111

Query: 141  KWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKK 200
            KWVEAI+   ND  +V KFL+ NIFTRFG  R +IS+  SHF N     L+ KYNITH+ 
Sbjct: 1112 KWVEAIATKTNDHKVVLKFLRNNIFTRFGTPRAVISDGGSHFCNKPFEALMKKYNITHRV 1171

Query: 201  ATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
            +T YHPQT+GQ E+SN EIK+ILEKVVN + KDWA  L+  L
Sbjct: 1172 STPYHPQTSGQVEISNREIKQILEKVVNSTRKDWAAKLNDAL 1213

BLAST of CsaV3_5G015310 vs. TAIR10
Match: ATMG00750.1 (GAG/POL/ENV polyprotein)

HSP 1 Score: 55.1 bits (131), Expect = 7.1e-08
Identity = 27/83 (32.53%), Positives = 45/83 (54.22%), Query Frame = 0

Query: 50  KGKTMLMDMVFNAIEEIVKHRKMY------------QFHENGRNFVVKCDKCQRIGNLSR 109
           +G ++ +  +  ++++I +H K +               ++   FV  CD CQR GN ++
Sbjct: 8   EGHSLPLPCMHRSMQDISQHLKQWPRFVLQAGFYWPTTFKDAHGFVSSCDACQRKGNFTK 67

Query: 110 HDELPQKSILELEHFDVWGINLM 121
            +E+PQ  ILE+E FDVWGI  M
Sbjct: 68  RNEMPQHFILEVEVFDVWGIYFM 90

BLAST of CsaV3_5G015310 vs. Swiss-Prot
Match: sp|P10272|POL_BAEVM (Gag-Pol polyprotein OS=Baboon endogenous virus (strain M7) OX=11764 GN=pol PE=3 SV=2)

HSP 1 Score: 62.8 bits (151), Expect = 6.2e-09
Identity = 41/121 (33.88%), Positives = 59/121 (48.76%), Query Frame = 0

Query: 115  WGINLMGHFPQSGTHLYILLAIDYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVI 174
            W I+     P    + Y+L+ +D  S WVEA         IV+K + + IF RFG+ +VI
Sbjct: 1446 WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVI 1505

Query: 175  ISNERSHFVNHIITKLLAKYNITHKKATTYHPQTNGQAEVSNWEIKKILEKV-VNHSYKD 234
             S+    FV+ +   L     I  K    Y PQ++GQ E  N  IK+ L K+ +    KD
Sbjct: 1506 GSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKD 1565

BLAST of CsaV3_5G015310 vs. Swiss-Prot
Match: sp|P31792|POL_FENV1 (Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 OX=11766 GN=pol PE=3 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 1.1e-08
Identity = 40/121 (33.06%), Positives = 59/121 (48.76%), Query Frame = 0

Query: 115 WGINLMGHFPQSGTHLYILLAIDYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVI 174
           W I+     P    + Y+L+ +D  S WVEA         +V+K + + IF RFG+ +VI
Sbjct: 765 WEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVI 824

Query: 175 ISNERSHFVNHIITKLLAKYNITHKKATTYHPQTNGQAEVSNWEIKKILEKV-VNHSYKD 234
            S+    FV+ +   L     I  K    Y PQ++GQ E  N  IK+ L K+ +    KD
Sbjct: 825 GSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKD 884

BLAST of CsaV3_5G015310 vs. Swiss-Prot
Match: sp|P21414|POL_GALV (Gag-Pol polyprotein OS=Gibbon ape leukemia virus OX=11840 GN=pol PE=3 SV=2)

HSP 1 Score: 60.8 bits (146), Expect = 2.3e-08
Identity = 42/129 (32.56%), Positives = 60/129 (46.51%), Query Frame = 0

Query: 115  WGINLMGHFPQSGTHLYILLAIDYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVI 174
            W ++     P    + Y+L+ ID  S WVEA        +IV K + + I  RFGI +V+
Sbjct: 1401 WEVDFTEIKPGRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPKVL 1460

Query: 175  ISNERSHFVNHIITKLLAKYNITHKKATTYHPQTNGQAEVSNWEIKKILEKV-VNHSYKD 234
             S+    FV  +   L  +  I  K    Y PQ++GQ E  N  IK+ L K+ +    KD
Sbjct: 1461 GSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKD 1520

Query: 235  WADHLDSTL 243
            W   L   L
Sbjct: 1521 WVTLLPLAL 1529

BLAST of CsaV3_5G015310 vs. Swiss-Prot
Match: sp|O93209|POL_FFV (Pro-Pol polyprotein OS=Feline foamy virus OX=53182 GN=pol PE=3 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 3.1e-08
Identity = 40/160 (25.00%), Positives = 78/160 (48.75%), Query Frame = 0

Query: 81   NFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVS 140
            +F+  C+ C+ +  L+     PQ  +   + FD + ++ +G  P S  ++++L+ +D  +
Sbjct: 843  SFLSTCNVCKMVNPLNLKPISPQAIVHPTKPFDKFYMDYIGPLPPSEGYVHVLVVVDAAT 902

Query: 141  KWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKK 200
             +          +    K L  N  T   I +V+ S++ S F +    +   + NI  + 
Sbjct: 903  GFTWLYPTKAQTSKATIKVL--NHLTGLAIPKVLHSDQGSAFTSEEFAQWAKERNIQLEF 962

Query: 201  ATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDS 241
            +T YHPQ++G+ E  N EIKK+L K++      W + + S
Sbjct: 963  STPYHPQSSGKVERKNSEIKKLLTKLLVGRPLKWYNLISS 1000

BLAST of CsaV3_5G015310 vs. Swiss-Prot
Match: sp|Q9TTC1|POL_KORV (Gag-Pol polyprotein OS=Koala retrovirus OX=394239 GN=pro-pol PE=3 SV=2)

HSP 1 Score: 58.5 bits (140), Expect = 1.2e-07
Identity = 41/129 (31.78%), Positives = 59/129 (45.74%), Query Frame = 0

Query: 115  WGINLMGHFPQSGTHLYILLAIDYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVI 174
            W ++     P    + Y+L+ ID  S WVEA        + V K + + I  RFGI +V+
Sbjct: 1402 WEVDFTEVKPGRYGNRYLLVFIDTFSGWVEAFPTKTETALTVCKKILEEILPRFGIPKVL 1461

Query: 175  ISNERSHFVNHIITKLLAKYNITHKKATTYHPQTNGQAEVSNWEIKKILEKV-VNHSYKD 234
             S+    FV  +   L  +  I  K    Y PQ++GQ E  N  IK+ L K+ +    KD
Sbjct: 1462 GSDNGPAFVAQVSQGLATQLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKD 1521

Query: 235  WADHLDSTL 243
            W   L   L
Sbjct: 1522 WVTLLPLAL 1530

BLAST of CsaV3_5G015310 vs. TrEMBL
Match: tr|A0A2G3C7C9|A0A2G3C7C9_CAPCH (RNA-dependent RNA polymerase OS=Capsicum chinense OX=80379 GN=BC332_15865 PE=3 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 1.7e-48
Identity = 92/161 (57.14%), Positives = 119/161 (73.91%), Query Frame = 0

Query: 82  FVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVSK 141
           F+  CD+CQ +G +SRH E+P  +ILE+E FD+WGI  MG FP S  +LYIL+A+DYV K
Sbjct: 701 FINNCDQCQSLGTISRHHEMPLNNILEVEVFDIWGIYFMGPFPPSNGNLYILVAVDYVFK 760

Query: 142 WVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKKA 201
           WVE  +C  ND  +V KF+KK+IF+RFG  R IIS+E +HF+N     LL+KY++ HK A
Sbjct: 761 WVETTTCQTNDARVVLKFVKKHIFSRFGTPRAIISDEGTHFINTWFKNLLSKYDVRHKVA 820

Query: 202 TTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
           T YHPQ +GQ EVSNW IK+IL+K VN  +KDWA+ LD TL
Sbjct: 821 TAYHPQMSGQIEVSNWGIKQILQKTVNGQHKDWAEKLDDTL 861

BLAST of CsaV3_5G015310 vs. TrEMBL
Match: tr|A0A2K3NGQ1|A0A2K3NGQ1_TRIPR (Uncharacterized protein OS=Trifolium pratense OX=57577 GN=L195_g025512 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 2.2e-48
Identity = 92/166 (55.42%), Positives = 124/166 (74.70%), Query Frame = 0

Query: 77  ENGRNFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAI 136
           E+   +V  C++CQR G +S+ DE+PQ+ + E+E FDVWGI+ MG FP S ++L+IL+ +
Sbjct: 151 EDCNTYVKACNECQRSGGISKRDEMPQQVMAEVEPFDVWGIDFMGPFPSSHSNLHILVCV 210

Query: 137 DYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNI 196
           DYV+KWVEA++C  ND   V KFLKKN+FTRFG+ RV+IS+   HF+NH +  LL KYN+
Sbjct: 211 DYVTKWVEAMACQANDAATVVKFLKKNVFTRFGVPRVLISDGGKHFINHHLANLLKKYNV 270

Query: 197 THKKATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
            HK AT YHPQT+GQ EVSN ++K+ILEK V+ S KDW+  LD  L
Sbjct: 271 KHKVATPYHPQTSGQVEVSNRQLKQILEKTVSSSRKDWSLKLDDAL 316

BLAST of CsaV3_5G015310 vs. TrEMBL
Match: tr|A0A2K3NJZ5|A0A2K3NJZ5_TRIPR (Uncharacterized protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g026684 PE=4 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 2.9e-48
Identity = 94/161 (58.39%), Positives = 122/161 (75.78%), Query Frame = 0

Query: 82  FVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVSK 141
           +V +CD+CQR GN+S+ +E+PQ  +LE+E FDVWGI+ MG FP S +  YIL+A+DYVSK
Sbjct: 171 YVKRCDRCQRTGNISKRNEMPQNPVLEVEIFDVWGIDFMGPFPSSYSKTYILVAVDYVSK 230

Query: 142 WVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKKA 201
           WVEAI+   ND  +V  FLKKNIF+RFG+ R +IS+E +HF+N  +  LL KYN+ H+ A
Sbjct: 231 WVEAIATQTNDAQVVVSFLKKNIFSRFGVPRALISDEGTHFLNRKMEALLRKYNVHHRIA 290

Query: 202 TTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
           T YHPQT+GQ EVSN +IK+ILEK VN S KDW+  LD  L
Sbjct: 291 TPYHPQTSGQVEVSNRQIKQILEKTVNSSRKDWSLKLDDAL 331

BLAST of CsaV3_5G015310 vs. TrEMBL
Match: tr|A0A2G9GQZ7|A0A2G9GQZ7_9LAMI (DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_19712 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 3.8e-48
Identity = 96/166 (57.83%), Positives = 123/166 (74.10%), Query Frame = 0

Query: 77   ENGRNFVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAI 136
            ++  +FV  CDKCQRIGN+SR  E+P  +ILE+E FDVWGI+ MG F  S  ++YIL+A+
Sbjct: 1289 KDAHSFVANCDKCQRIGNISRRHEMPLNTILEVELFDVWGIDFMGPFVLSFGNMYILVAV 1348

Query: 137  DYVSKWVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNI 196
            DYVSKWVEA++   ND+ +V  F+KKNIFTRFG  RVIIS+E +HF N     LL+KY++
Sbjct: 1349 DYVSKWVEAVAVSNNDSKVVVNFIKKNIFTRFGTPRVIISDEETHFCNRSFEALLSKYDV 1408

Query: 197  THKKATTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
             HK  T YHPQT+GQ EVSN EIK+ILEK V+ + KDW+  LD  L
Sbjct: 1409 QHKIFTPYHPQTSGQVEVSNREIKRILEKTVSFTRKDWSKRLDEAL 1454

BLAST of CsaV3_5G015310 vs. TrEMBL
Match: tr|A0A2K3LHD8|A0A2K3LHD8_TRIPR (Uncharacterized protein OS=Trifolium pratense OX=57577 GN=L195_g033907 PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 6.5e-48
Identity = 94/161 (58.39%), Positives = 122/161 (75.78%), Query Frame = 0

Query: 82  FVVKCDKCQRIGNLSRHDELPQKSILELEHFDVWGINLMGHFPQSGTHLYILLAIDYVSK 141
           +V +CD+CQR GN+S+ +E+PQ  ILE+E FDVWGI+ MG FP S +  YIL+A+DYVSK
Sbjct: 268 YVKRCDRCQRTGNISKRNEMPQNPILEVEIFDVWGIDFMGPFPSSYSKTYILVAVDYVSK 327

Query: 142 WVEAISCVMNDTIIVSKFLKKNIFTRFGILRVIISNERSHFVNHIITKLLAKYNITHKKA 201
           WVEAI+   ND  +V  FLK+NIF+RFG+ R +IS+E +HF+N  +  LL KYN+ H+ A
Sbjct: 328 WVEAIATHTNDAQVVVAFLKRNIFSRFGVPRALISDEGTHFLNRKMEALLKKYNVHHRIA 387

Query: 202 TTYHPQTNGQAEVSNWEIKKILEKVVNHSYKDWADHLDSTL 243
           T YHPQT+GQ EVSN +IK+ILEK VN S KDW+  LD  L
Sbjct: 388 TPYHPQTSGQVEVSNRQIKQILEKTVNSSRKDWSVKLDDAL 428

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_023521407.1  1.2e-50  60.24     LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp....
XP_023874613.1  1.4e-49  63.13     uncharacterized protein LOC111987139 [Quercus suber]
XP_020424472.1  4.0e-49  61.11     uncharacterized protein LOC109950324 [Prunus persica]
XP_024634412.1  1.2e-48  58.28     uncharacterized protein LOC112420042 [Medicago truncatula]
XP_016648946.1  2.0e-48  60.49     PREDICTED: uncharacterized protein LOC103328625 [Prunus mume]

Match Name   E-value  Identity  Description
ATMG00750.1  7.1e-08  32.53     GAG/POL/ENV polyprotein

Match Name           E-value  Identity  Description
sp|P10272|POL_BAEVM  6.2e-09  33.88     Gag-Pol polyprotein OS=Baboon endogenous virus (strain M7) OX=11764 GN=pol PE=3 ...
sp|P31792|POL_FENV1  1.1e-08  33.06     Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 OX=11766 GN=pol PE=3 ...
sp|P21414|POL_GALV   2.3e-08  32.56     Gag-Pol polyprotein OS=Gibbon ape leukemia virus OX=11840 GN=pol PE=3 SV=2
sp|O93209|POL_FFV    3.1e-08  25.00     Pro-Pol polyprotein OS=Feline foamy virus OX=53182 GN=pol PE=3 SV=1
sp|Q9TTC1|POL_KORV   1.2e-07  31.78     Gag-Pol polyprotein OS=Koala retrovirus OX=394239 GN=pro-pol PE=3 SV=2

Match Name                      E-value  Identity  Description
tr|A0A2G3C7C9|A0A2G3C7C9_CAPCH  1.7e-48  57.14     RNA-dependent RNA polymerase OS=Capsicum chinense OX=80379 GN=BC332_15865 PE=3 S...
tr|A0A2K3NGQ1|A0A2K3NGQ1_TRIPR  2.2e-48  55.42     Uncharacterized protein OS=Trifolium pratense OX=57577 GN=L195_g025512 PE=4 SV=1
tr|A0A2K3NJZ5|A0A2K3NJZ5_TRIPR  2.9e-48  58.39     Uncharacterized protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g02668...
tr|A0A2G9GQZ7|A0A2G9GQZ7_9LAMI  3.8e-48  57.83     DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_197...
tr|A0A2K3LHD8|A0A2K3LHD8_TRIPR  6.5e-48  58.39     Uncharacterized protein OS=Trifolium pratense OX=57577 GN=L195_g033907 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: INTERPRO
Term       Definition
IPR012337  RNaseH-like_sf
IPR036397  RNaseH_sf
IPR001584  Integrase_cat-core
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_5G015310.1  CsaV3_5G015310.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                  Source       Source Term        Source Description   Alignment
IPR001584  Integrase, catalytic core        PFAM         PF00665            rve                  coord: 115..220; e-value: 9.0E-13; score: 48.4
IPR001584  Integrase, catalytic core        PROSITE      PS50994            INTEGRASE            coord: 98..242; score: 15.18
IPR036397  Ribonuclease H superfamily       GENE3D       G3DSA:3.30.420.10  -                    coord: 103..242; e-value: 3.5E-40; score: 139.3
None       No IPR available                 PANTHER      PTHR24559          FAMILY NOT NAMED     coord: 80..239
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  SSF53098           Ribonuclease H-like  coord: 111..239

The following gene(s) are orthologous to this gene:
Gene            Orthologue    Organism            Block
CsaV3_5G015310  CsGy5G011510  Cucumber (Gy14) v2  cgybcucB232
The following gene(s) are paralogous to this gene:

None