Cucsat.G5989 (gene) Cucumber (B10) v3

Overview
NameCucsat.G5989
Typegene
OrganismCucumis sativus L. var. sativus cv B10 (Cucumber (B10) v3)
DescriptionProtein EFR3-like protein B isoform X1
Locationctg1425: 145187 .. 151654 (-)
RNA-Seq ExpressionCucsat.G5989
SyntenyCucsat.G5989
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AATACAAGAGAGTTCTATAATTTAAAATCCTTTTTCTTTATAAAAAAGTTAAAATAAAAAAGGAGAAAAAGAAAAGCATGTTCATGTAGGCGTGAGGAGAGAATTTTGATGGATGTATTTAGAAATCACTCTCTGTAAAAAGAAAGCGAGTGCGATCCCTTAATTTCCGGGACCGTTGGGACTTTGTCATTTTCATATTAATTTTTCCTTTGGAATTTCATTTCATTTTTTTTTTCCCCCTTTTGTTTCCTTTTTTTCTTTTTTCTTTTTTTAAAATCTCGCCATTATTGTTTGGCAAGAGGGGAAAAAAATTGATAGATTATAGTAGTACACACGCCCCCTGGTGGGGGGAGGATTTTGCATTTCTATCTCTTCTCCCCATTGCGCCATTTTCATGTAATTTTTCCACTCCTAATTCAATTTTGGTTTTGTTTTTTGTTTTTTCTTTTTAATTTCTACGATTTTTCCCATTTTCTGAATTGACCGAATTTTGGGTTTTGAATTTCTCTCTGAAATTTGAAAGTTTTTTCTTCAGGGTGCTATAGATCACGGCCGCAGTAGCTTGATCGGAATTCCCCAATTGAAGAGTTTGATTTCTTATTGGTTGATACCCCTTTACTTGTTTGTGCTTCTATTTCAGTTGGGTAAGATCTCTCTCTCTTTGCTTCTATAATTCATCACATTTCGTTGTGGGGTTTTGAATCTGGGTTGAATTTGACTTGGAAGATTTGAGGGTGCTTAGAAACTTGTGATAATGTCTCATAGATTGGTTATATCATACTGACAGTATGCTGTGACTTATGAACTTATTGTTCTTTGGCTCTTTTGAATTACATTTGTTCCTGAGTTCCTTTTCTATATTGGGTTTGAGTTTGTACGATTCTGTTTGTGGACTTTCAGAGTAGGTTTGAATTTCATAAGAAAACATGGGGGTTATGTCTCGGCGGGTTGTTCCTGCCTGTGGTAACCTCTGTTTCTTCTGTCCTTCTATGAGGGCGAGATCAAGACAGCCTGTGAAACGATACAAGAAGTTCCTTGCTGACATATTTCCTCGTAATCAGGTGAGTTTCCATAATGTGGCTCCATGGGAGTTAGTCTTTGCGTGTTAGCTTGAATTTTTTTTTTGGGTCAATCTTGCCAGTCTTTGAGTTCGAACAATTTGAATCGAGCTGAGAGTTCTAGAAATGTGAGTTAATGACCATGGCTTTAGGAAGTCTGGCGACTTTAATTCATCAGTTTTCTTTCTCTTTTGGTAACAACAAACAAAGGACATTGACTTTGGCTGTTACAACTGTTTTCCAAAATTTTGTCATCGTAAAAAAATATATACAAGTGATTGTGTCCTATAGTTTTTTTACTCTTGTTCACTATAGATTTACATTTGAACTTTTATATGCCTACTTATTTTCTAATTACATTTGCTAGGATGCTGAACCTAATGATAGAAAAATTTGTAAGCTCTGTGACTATGCTTCAAAAAACCCGTTGCGTATTCCCAAGGTATGTAACTTTTTTCTCGCCTTTTCTTTTTCCCTCTTAACATACTTGAGATGTTCTGTCTTTAAGCAAAGGCTATCACTACTTAAGATGAATCAGTTGCAAATTCAACTTTCTGAATCTTTGGGTTTGTTTTCTGGTCATGAAATATTTGTATGCAAGCTATACATATATGCAGTAGGCTAGCAGCATACCAAAACCCCTTTTTTGTTTGCTTTACATGATGATTTTGACCGTGTCACATGAGGTTTTTTGACAAATCACTTTTGTTTTTTTGGAAAAGGAAATGAATCTCTTCATTGGTGTAATGAAAGGAAACTAATGTTAAGGTACAATATATATACAATGCAAAAGGAAACCTAAGAATCAGGAGGTGCACTTAGGAATCTCCTCTTCATTGATGTAATGAAAGGAAACTAATGCTTAAGGTACAATATATATACAATGCAAAAGGAAATGAATGCTTAAGTTACAATATATATACAATGCGAAACCTAAGAATTAAGAGGTGAACTTAGGAATCTAATTAGATTGACACTCACTTAGCATCTATATCATATCCAATATAAGCTGACATAACATTCCAAATGGAATTATAGCAACAATCTCACAGAACAAACGAGGAATGCATCACAAAACAACAAAGAGATGAAGCATGATTCAAAATTAATCCAAACAAATACAAAGTGAACCCCTGAAAAGAAAGATGACCAATAGGGTAGATGTGTGAATGCCCTCCAATTTAGGCCTATATCCTTAATCGAGAAGTCTGTGAAGGATTTGGACAAAGCTCCCCAAGGGGGGTTGTGATATGTCAATTTTAACATTGGTTTTTGGAACAAGAAGTCAACGTTGTAGTGTGAAAATAAATTAAGAACAATGATCTGAAAGATAGGTCATGGGTATTTATATGAAGTTCATCAATTAAGGTACAATCTTTTTTGTAGCCTTGTGTCTTCACCCAACAGTCTACAACATTCTTTATTCTCTTCTAAGCTCCGGGAGGTTAGAACCTCAAAGAAAGTTCAAATATTCTCTCGGCCTGCTGCTTTTGGGGAGAATTATTAACAATAAGGCTGTTAGATAGTTATTTTTCTGTTACACAGTTTATTATTTAGTTTGTTAGGAGACAGTTATCACCAAGCTCCTATATATATCAAGCTTGGGCTATGTAAATATTCATACAAGAAATAAAGAGACTTACCTTTGTACAAAAATTCTTCCTTTCGAAGCTCTCTTCTCCCCAATATGGTATTAGAGAAATTTCCGCAAAGACAGTCTTAGAAGAAAAAACCTGAACGTCCCTATACCCTTGTGCAGTCTGCAGCAATGGCGACACCTGAGAATTCGACATCCAGCAGCTCCAGCAGCTCCAACACTTCAGCACCAATTACCTCATATGATCTTGATATCCAATTAAACCCGTTCATGCTACACCATTCAATCATTCCAACCACCAACCTTGTTTCTACACCACTGGTAGGATCAAATAATTACTCATCATGGAGTGGAGCAATGATGCTTGTGTTTACAATGGAAACGGAAAGGAAGTGTGGGATGAATTGAAGGAAAGATATAGGCAGTCCAATGGACCTCACATATACCAGTTGCGAAAAGACTTGGTAACTACTACACAGGGGAATCTCTCGGTTAAAGTTTACTATGCAAAAATCACCACTATATGGCAAGAACTAGTTGAATACCGTCTTGTGGATGAATGCACTTGTGAAGGATCAAAGAAGATGATTGATTTTTTGAACTCAGAATTTGTAATGACCTTTCTCATGACACTAAATGAATCCTATTCCCATATTAGAGCTCAAATCCTCTTGATTGATCCCCTGCCACCTATAAACAAAGTTTTCTCCCTCATTATCCAAGAAGATAGACAAAGATCCATTGGATCTTCACCTTCCCTTGAGAGTATAACACTATTGGCTAATTTTGAGAGAAGATTTCCTTCTGAGAAGTCCAAGAAAAAGGACACGCGACCTATATGCTCTAATTGTGGCTATAGAGGACATGTTGTTGACAAGTGTGACAAGTTGCATGGTTATCCTCCCGACCATAGACTTGCAAGCAACAACTCTGTTCACCAACAGAGACAAAGCAATACAATTCAAGCTGGAAATGAGAAAATGACAGAAGCTTCTAACTAATCTGCCTTCTTTGCCAAGTCTCAACGATGACCAATATTCATAGCTTATGAGTATGCTTCAAACTCATCTTATCTTAACACATCCCAAAATGGTGAGAATCCCAAAATAGAGACCACTCACATAGCAGGTACTTGTCTATCTACTTCTCTCACTAATTCATCAATCTGGATCATCGATTGTGGTGCTTCCTCACATATTTGCTATGACAAGTCTGCTTTTAAAAATCTCCACAGTATCCAGAATATGTCAGTAATCTTACCTACCAAAACTCACCTAAAGGTTGAGTATATAGGAGATATTTCCATAGCAAAAGAAGTGATCCTAAGGGATGTACTTTATATTCCTGACTTCAAATACAACCTACTGTCAGTGAGTGCTCTTCTCAAAGATGAAAGATTTGCTGTGTCATTTTCTATTTCTAATTGTCTTATTCAGGACAAGTTGCTTTCGAAAACGATTGGGAAGGTTGAGGTAACTAATGGCCTCTACTTGCTCAGAGTGAGGAAATACAAAGATAACTGCATTCAACATACTACACTGATGTGTAAAGCTTCCATTTCTACATGGCACAAACGAATGGGATAAAGGAATTAACAAAGAAGAGAGAAATTTTTTATTCCCCAAACTGTAAAAATATTTGTCACATTTGTCCCCTGGCTAAACAAAGATGTCTTTCATTTCCTACATTCAATAATATTGCTGATAATGCATTTGATCTTGTGCATTGTGATATATGGGGTCCTTTTAAAACCTTAACACATGCTGGTCACTCTTATTTTGCCACCGTTGTTGATGATAAGTCTAGATACACACGTGTATACCTTTTGGAAAATAAGAATGATATTCTACAAGTCATTCATCGGTTTTTCAAGATGATTGTCTTAAAATCCCACCTAACAAGGATTACCTCAAGAAGAAAAGATGAACTCAAGAACACTTAAAAAACATCAAAATATGAAATATATATTAGAAGATCATAATAACAAGTAACAAGTAACCCTAGCCTTTTGAGAGGGCTAGACTCTCCCAAGGTTCCCTTACAAGGAAATCTTTCTTCAAAATTCTTCACACCTCTCCCCAACCCCTCTCCCACTATTTATAACAAAAAGACCTAACCAACTTACCATCTATTTACTAATATGACCTTACTAACAACCATTTTAATTTTCTCCTAATAATCCACTATTTTTCTATCTAGGGTTCTTATAGATTGAATCTCATTTCTCAAAAGTCATTAATGCTTTTTGTTCTGAAATGCTCCAGAATTGAATTTCAGGGAATTTTTTGCTAAAACTAGAACTGAATTTCAGGGAATTTTTTGCTAAAACTAGAACTGCTCACCAGTTTTCATGTGTCTACACCCCTAAGCAGAACTCAGTGGTGGAGAGAAAGCATCGACAACTCCCCAACGTGGCAAGAGCATTGATGTTCCAATCAAAATCCCCTCTTACTTTTTGGGGAGAATGTATTTTGAGTGTTGTCTATCTAATATACTGGACACCCATGGTCCTACTCTCTAATAGCACTTCATTTGCCACTCTGTTTGAGAAAGAAGCTGATTACAGCATTATCAGAACCTTTGGATGTCTTGCCTATGCCTCTACTCTCTCAGCAACCAGATCCAAGTTCGATCCTAGAGCACAACCTTGTGTTTTTTTGGGATACAAACTATATGACTTAGCTAGGAGAAAGTTCTTTGTTTCTAGGGATGTCCTATTCTTTGAAGAATTATTTCCCATTCATTCTATCAAAGAAAAAGGTACATCTATCTCACATGACTTCCTTGAGCAATTCGTCATACCATGCCCTTTAATTGATTGCCTAGAAAAAGAGACCATCATTGATCTGACTACCGCTGAAAGACCTATATCAAAAAATACCCTTGAAGATAACCACGGTGTTGATGATCATGATCCTTGTACTGAAGATTCAGAAGAGACTAATAGCCCTGTCCAAATACCTATTACCATAGCACCCAGAAAATCCTCTAGACAATACCATCCACCTTCTTACCTAAAAGATTTTCATTGTAACCTCACCTCCCAAAGATCAACTCCCTTTCACCTTACTAAATACCTCTCGTATAAAGCCTACTCCCAACACCATAAAAACTATTTGTTCAATATTGCTTCCATATATGAACCGTCCTATTATCACCAAGTTGTAAAACACCAGACTTGGAGAAAAGCTATGGTTGAAGAAATAGAAGCTATGAAGAGGACGAATACATGGACCATTGTTTCTCTTCCTAAAAATCATCACACCGTTGGTAATAAATGGGTATACAAAGTGAAGTGTAAACTGGATAGCACCATTGATAGATACAAGGCAAGACTCGTAGAAAAGAACTATAATCAACAAGAAGGATAAATTTTTCAGATACCTTCTCACCGGTAGCTAAAATAGTTACTGTCAAGATATTCTTAGCCCTTGCTACATCCTATAATTGGTCTCTTACCAAAATGGACATAAATAACGCCTTCTTAAATGGATACTTGTTTGAAGAAGTGCATATGTCCTTAACATTGGGCTACCAAACTTCTCAAGTACCAAGAAAAGGAGAAAGATTGGCTTGCAAACTTAATAAGATCATATATGTCCTCAAGCAAGCATCAAGGCAATGGTTCATAAAATTTGCAGCATCAATATCCTCACATGGCTTCATTCAATCCAAATTCGACTACTCATCATTCACTCGAGGCAATGGAAGCAACTTTGTAGCCTTGTTAGTATATGTAGATGACATATTACTAACCGGACCATCTTCCTCAATTATCAACTCAGTCAAGGACAGCCTAAAGACACACTTTAAACTAAAGGACTTGGGGCAAGCAAAGTATTTCTTGGGTCTAGTGTTATCACGGTCTCAACAAGGACTCATGCTCTCCCAAAGAAAATACTGCCTTCAAATCCTAGAAGATACTGGTTTTCTTGATTCTAAACCGACTGCAGCACCTATGGATCCTAATTTGAAGCTATCTAACACGGAGGGAAGGCAGGTAGCTGAGGAAGACACTACCTACTATAGAAGACTGATTGGCAGATTGATATACCTACAAATATCTAGACTAGATATTTGTTTTGCTGTCCACCGTCTTAGCTAGTTCTTGCAAAAACCTACGAAAGATCACCTAATACTGCTCATCATCTACTGAAATACCTAAAAGGTACCCCAGGACAGAGTGTTTTAATAAAACCCATTAATTCATTCCACCTAAAGGCTTTTGTTGATGCTAATTGGGGATTGTGCCTTGATACTAGAAGATCAGTCACAGGATTTTGCATTTTCCTAGGAGATTCTATCTTCTCTTGGAAATCTAAAAAACAGGCAATGGTCTCCCGGTCCTCTGCAGAAGCTGAATATAGGGCCTTGGCATCAGTCACCAGTGAGTTAGTATGGATCTCTCAACTCCTCATTGACCTCAATAAGACTTTAATGCTACCTACTGTGTTTTGTGACAATCAGGCAGCAATTGCAATAGCTTCTAATCCGCCATTCCATGAAAGGACAAAACACATAGAGATTGATTGTCATTTTGTCCGAGACAAGATAGTTGAAGGCTTCTTAAAAGTTCTATGTATCAAGTCTAGCCTACAACTAGCTGATATGTTTACCAAAGCACTACCATCGTCTACCTTAACTAGGTTGTTATCCAAGTTGGGCATCATAGACATTCATCGTCCAACTTGAGGGGGAGTATTAACAATAAGACTGTTAGATAGTTATTTTTCTGTTACACAGTTTATTAGTTAGTTTGTTAGAAGATAATTATCACAAAGCTCCTATATGTATCAAGCTTGGGATATGTAAATATTCACACAAGAAATAAAGAGGCTTAGCTATGTACAGAAATTCTTTCTTTCGAAGCTCTCTTCTCCCTAATAAGAATCAACACCATTAACTTTATCTAGAAAAACTCTTTGATGCCCAGGCCTTAGTAGTGTGTCCTTAATGAATTTAGGATTTTGATCATATATTTTGGAGGCGTTCATTTATAAACACAATAGGGTAGGAATGTTGCACTGCTTTGTTAGAAGTTATTCTTGTTTCCACCCTTCTGCAATTAGAGGTTTGCATAACAAGTATTGTCTTGGCTATCTACTATATTTGAAATAGAAGAATTTTTGTGGGTACAGTGATGTTCTTAGAGGAAGTTTAGAATTTGGTATGGTTAATGCTTCACTTAGTGACTTAGTCTTTTTTTTTGAAACGGAGACAAAGACTTCTTTATTAATATGAACTCTCGAGTTATACAAAGAGAGCCATAAAGAAGTAGTAATCAAGGGAGAGGGATCAGGAGGCGCACCAGACATGTCAACTAGGTTAACATCCCCTAGTGCCAAACATCATATCCCGAGCATAAGCAAACAAAACAATAAAGAAACAATAATAAAAGCATCCAGCTTAGTATAAAGGTCCAGAAAACAGTATGAGACAGGAAGAAAACAACAAGAAACTCGGGGGGGGGGGGGGGGGGGGGGGTTGGTCCTTCAAGGCTTCAAAAGCAGTACAGGGCTTTGGTCTGTATAATCTGTAAACATTGAAACTGAGGCAGCATAGGATTAGGCAGGTTGAGAAAGAAAAGCAGCCCAGTTAAGGTAAATGTCTTGGATGGAATAATTAACAAACTCCTTTTTAAGGGAACACCAAGCTGCAGCTTTAAGTTCTGCTGCATGCATAATTTCTGCCCTTGGTCTTGCTTTATCACGGAAGATACGTTGATTTCGTTCGAACCATAAATCTGAAAGTAAAGCTTTTGACAAATTTTCCCATATGACACTTGATTTTTTGGACAAACTTGGACCCGACAGAAGTTGAACCGCACTAGCACTTCACTCAGTGACTTAGTCCTGAGCTTGTGTATTATAATTTTTAGTCGTGGTCTCTTTTTCCTAACTCTTAACAGCCAAAGTTCCTTATTGTTATTTCCTTTCAAGGGCTCCTTTGGCTGCTTTTTTGGCCTGTGTTTTAGTTCTTCATAATGGACTTTCTGATGGAAGTTTGGTTTTCACTGATTCACTAAACACACTAAATTTGTTTATTGAATGATCCAGCTAAGTTTCTTTCCAATGCAGTCTTCATTTTTCTTATTCTTTTGAGTCACCTGTTCATTTATTTTAGATAAGATTTCTTATCTTGAGGTAGTGGAGTATCTTCTCTCTCTTGTTCTTTGATTTTGTCTTTGTAGATTACCGAACTCCTGGAGCAACGATGCTACAAAGATCTGCGGAATGAGAATTTTGGATCTGTGAAAGTTGTAATATGTATATACAGAAAACTTCTATTAATGTGCAAAGATCAGATGTAAGTTTGATTTACTTTAGCCATAGTCTAATTTAACTGGAATCTTTCATTTTGGGATGGGGGAGTAGTTTGAGTACCACATTATTTCTGAATCACTATGTTAGCATCCAATAAAATTTTACGGGTCATCAAAGATTGGAAAAGCATGTTCTTTGATATTTCATGAAACCAGCGTTTCTGCATGTCATTTACCTAAGCACTTCAACTCTTGCAGGCCACTTTTTGCTAGTAGCTTAATTGGGATTTCTCGAACTCTTTTAGAACAAACACGGCATGATGATATGCAGATTCTTGGTTGCAATATTCTTGTTGAGTTCATAAGTAGCCAGGTACTTGAAGAATAAACATTAATAAGTTCCATTACCAATGTGATCTTGTTAGTATATTGCATGTGTAACTGTTATTACATGCTGTACCTGCTAATGGATAATACATTTGTAACTTTTTTGCATCTTATGTAGAACCTGTGCAAATATGGTATTAACTTGAACCTAATGCAGTTGATTGGTTTCTTCATATAAACAAATGTCTGTTCTCAATACTTCTAGATAAGGATACATTCTCTTTTAATCTTTTGCCCTATCATTTAGTGATTATTGGACACTAACTATCCAAATAAAGTAGACAGATAGTACATACATGTTCAACTTGGAGGGCATCATTCCAAAACTTTGCCAATTGGCTCTAGAAGGCGAGAGTAATGATGAGGCACCACATTTGCGGTCAGCTGGACTTCAAACTCTAGCTTCTATGGTATTACTCTCTTCCTCTCAAGGTCACGATGCTAATGTCGTAGATATGTAAATGTATTTATGCACAACTTTCTTTTCATCATTTTGATTTTGAAATCTCTCTCCCAAGGTCTTGATGCTAATATCGTAGATATGTAAATGTATTTATGCACAACTTTCTTTTCATAATTTTGATTTGCCATCTTCATCCGTGGCTATCAAATTTGAAATGGTCAAAATAGATTAAGATTATTGCCAAGATCTGTCTTGTATCTGAACTCAAAATCCCTCTATCCATGGTTATCAAAAATTTTTGGATGGATGTTGTATGATCTAATGGCAACACTTGAGGGGTGAATAGGATTGTCTACCAATTTTCAGTTTTGATGAATTCTAGTTTTTCAAAAAGCAATGACTGATTCTCTGGTTTTCGGCGTCCTCCCCTTATTTCATTCATGAATGAAATGTTTCTTTTACCAAAAAGCACCGACCTATTTAGCACCAGAAAAAAAAAAAATCAAGATAAATCTATATCACTCAATGTAAGGCATTGAAATAATTAATAAATAATAAAGAAAAATGGAATGCGAAAATTTTGAGGAGCAAGGGAAAGAAGACACCACGATTTATATTAGTTCGGTAAATTGCCTATTCCACCCTCAAGACAGCAAAATGTCCTCTTGCGGATGAGAACAAAACTGACAACGAACCTCTCTCTCTCTTTTTTATTTATCTATTTTCACAGACACCGATATCGTTTTCACTAGGTGCGGATCTAGCCAATTATTTTCTTTACAACTTGAAGTATACCCAGATTTTAATACAAAAACCCATGAAAGTAGACTAACAAACTTTACCCCTTGATAGATAAAATGCACAAAGTTTAGCACAAAAGAATGCTCCGAAGTATCCTTACAAAATGCAATAACAAGGTCTTGGAATGTGAGAAATTTTTAGAGGTGAACACTTTTCAGAGAGAAAAGTGATAGATGGAGCACTGGAATTTTCTGCAAGAAGGGTTAGGAGAAAGACGAGAGTGGCTGAATTTTTAATTAGAGAAATATAAAGTGGTCATCAGATGTAGTGGAGGTTCAATATTTGGTTTGATTTGCTGGAATTTTCTGCAAGAAGGGTTAGGAGAAAGACAAAAGCGGCTGAATTTTTAATTAGAGGAATATAAAGTGGTCATCAGATGTAGGGTGTGAGGTTCAATATTTGGTTTGGTTGAAACTCTAAAGCTAACATGTGGTTCACCTTTTTAAAATTATACCCTCTTCGAATCCGCTTCTCATATGAAATGTGATGATTGAGATTTCTGATACATCCAGTCTTATATTCACTTATTCGATACGGCCATTTCTCTTGGAAGATGTGATAATATAGATTGCTGGTATGTCACATTTGTCTTGCTGTTCAGCTTAGTAGAACTATTCCCTTCTTGAATTCCGCTTCTCTAGTGAGACCCGATGATTGAAATTGTTGGTTATCATTAGTCTTTGATTCTTTTTCTCTTTCTCTTGTTTCCGAGTTAATTCATTAGATACAACTATTTACTTCGTTAAGATAATTTTCCAATGCATTTCTTTAAAAATTATTTTTGTAGATACTGTTCATGGGCGAGCAATCTCACATCTCGATGGACTTTGATAAAGTGAGCTCTCTTAATCTTGTTGCCGTTTTTCCTTTTTCCTTGTAATACTAAGGGTGTCAGCTTATTTTGGGTAAATTCACAGGGTTTGTGTGTGTGTGTGTGTTTAAGATATAATTGAGATCCATGCTAATATGAAACTTTCTAACTTGTCTTGAAATAGATTATATCTGCGGTCTTGGAAAACTATGTAGTAGATGGACAATTTTCTCACTCAGAAGCTCAGTACATTGAAGGACAACATAAAGTAGAAAACCATAGCTCTTCCATGTTAGATGTCGATAAAAAGTTCTCTTCGTTTAACCATTTTAATAATTCGGCAACTGAAGTGTAAGTTCATGATTCTATTTTTTTTAATCGTTTCAACTGTTATTTGTTTGGAAATGCTCTCTGATCCCCTTTCTTACATCATCCGGCATTTTCACAGGGATGTTTCCAAGAACCCTTCTTATTGGTCTAGAGTTTGCTTGTGTAATATGGCTAGATTGGCAAAGGAAGCTACAACTGTCAGGCGTATGTTTGAACCTCTATTTCATCATTTTGATACTGAAAATCAATGGTCCTTAGTTAAAGGACTTGCCTACTCGGTGTTGTCATTTATGCAATCGCTTTTGGATGAATCAGGTTATATTTGAAATTCTATTGCCGACTTATTTTTTCTAGCAGTTACATTTGCTATGTTGTCCATTATTTGAACTTTTTCCCCTTTTTGGTGAAACTTTTTCTTGATACCAGGGGACAACTCATATCTTTTGTTTTCGATTCTTGTCAAGCACTTGGATCATAAAAGTGTTGTAAAAAAGCCTCAAGTTCAAGTGGATATTATCAATGTAACCACACAACTTTCTCAAAATGCAAAGACACAAGCCTCAGTTACTATTATTGGGGCTATCAATGATTTGATAAAACATCTACGGAAGTGCATTCTATGTTCATCTGAAGCATCCAGCAATGGACATGACACAGATAAATGGAATACTGATCTTCAGTTGGCACTCGAAAAGTGCATTTCTCAGCTTTCAAAGAAGGTTTGTTCTTTTCTTTTTTCCTTTTAGCATTTATTCTGATAATTGGACGTGATTGAGTTTGGTAGACTGATGAAATAAACCTCAAAACTTGTTCATATATGGTATATTACTTGCAGAGCCTTCTGAAATATCCAGCTGAATGTTAGGTCTTTCATATAAATATCTTCAGCCATGTATGATTATATGGTTTTTCATTTCTTGTATTACTGATATACTGTGCATGCTTGTTAAATGGAACATTTTTTTTTCATATCTTTGGTTCGTAATACCTTTTATAAGAATAGATAAAAACAACAATGGATTTTTTTTAGACTTGTATACATATCCATCACTGTTTTTTTACCTTGAAAATAATGGGTATTTTTCATAAATCCAACATGTATTCTGATTTTGGCATTAACTGATATTTGGAGTGTTATTGCTGCGTGGTCCTTGCTTAACCCCCTTTGTCTCAGAACTTGTGGCAGTGATGTGGCTTCCTAGTACAGTGTTTTTATATCTACTTCTACAAAATCTATTGAAAGTTTCTTTGTGGCTATAGATCATAAAACAATTTGGCTTAAGTAGTCCCTTGAATGTTGGTTTCTTTTGCTTTCATTTTCTCATTTGAATGGTGTCTGATTACCGATTTATCTATTTTTCAATCTTTTCACGTATTTCTTTTCAATCAATCATCCCTCCACATGTTGGCTCTTTGTTTCACTTTTCAATTATTTGATTTGTTGGGCACCTACCGAGTGTTTTTTAATCTTTTCCTCTTTTTCATTTGTTTTTACTTGCTCTGAAGAAATTGTTAAACCATCTTCCTTCTAATAAATATAAATAAATGAATGTATATTCTTCTGTCATCGTTGATCTAGATGTTCCTAAACATTGTCTCGTAAAAATGTAAAATTCAAGTTTATTCATTGCAAAATCAAGTAAAATCTCAATGCCTGTACTTGTATTTAAAAAAAATCAACAAATTCTGGATATCAGCAATAAAATACTGTAGGAAATAGTCTGTGCCAATTATTTTCTGCTCAAGGGACTGTGGTTCTCAATGCATTGATTTTCACTAAAACCATTTGAAACTATCAGGTAGGTGATGCAGGACTCATACTTGATATGCTAGCTGTTGTCCTCGAGAATATTTCAAATAATAATATTTCAGCTCGGGCAACAGTCTCCGCTGTTTATCAGACTGCAATGACTGTATCTTCTATTCCTAATGTTTCATATTACAAGAAGGCAAGTAATCATACTGCTGATTTTCTTTTCCCTTTTAATTGTTTTTGGGATGGGAGAAGGAAGTTGGGTTCGAACACATGGCCATTTTAGTTTGACTAATGTATCTTATCAGACATTTTGGCACTCCAACACTTGTTAGCACAGTAGACACTAGCTGTACAGAGTTAACATGGGTCCTAACATTTGTTATACACACATGGAACTCTTGTTAAGCATACTAATAGACGCAATAAATAAAACTTCTTGAAATTAAATTTTCTTTATATGTATTTAAAAAATATATATTTGGATAAATGTGTCATTGTGTCAGTGTATTTGTGTCAATTCATTCGTGCCATTTCATATTATTGTCTCATATCTGTATTTGTACTTCTTAGCTGCTGAGCTTAGGGGACATGCACGTGCTCGTGGATATCAAGATCTTTTAAAACCTAACTTTTAAATCTGTCATGTTTTTCTTCAGGCTTTTCCTGATGCTCTATTTCATCAGTTGCTTTTAGCAATGGCTCACCCTGATCATGAGACTCGAATTGGGGCACACGACATTTTCTCTATAGTGCTTATGCCATCCATTAAGTGTCCTATGATGGAACAGAAGACGATTTCCTCAGACACTGTTTCATGGTTACCATTTAGCAGTCCCACACAGAAGTTGACTAGTGGAGGTTTCTCCTTTAAAGACGATGACAATCATGTATCAGAATCTATAAATGGGGTAAGAATGGAAGAAAGTCAAGCAGCACACCTTGTTTCTGAAAATTATACAACACATCCATCTAGGCATGAATCCTCCAGCTTCAACCATAGTTCAAACGAGTCAAAAACTGTATAAAGTTCTTAATGCTTATCCTTTCCTATACCAATCCATTGCTCTATGGAAGTGTTTGATAATGAAAATTAAAGTCATGTTTTAATTTTCCAGAAGTTGAATTCCCTCCGGTTAAGCAGTCACCAAGTTAGACTCCTGCTCTCCTCAATCTGGGTGCAAGCTACATCTGCGGATAATACACCTGCAAATTTTGAGGCTATGGCTCAAACTTATAGCATTGCTTTGCTATTTACCCGGTCTAAGGTGAAATTTTGGATAAATTTAATGTGCTTAATAATTTAGTTTGTCTGGAGTCTCAGGAGATCACTATTAAATGCTGATTTGAAAAAGAAAAAGAATGACTAAGTGGTACCGATCTTAACAGACTTCGAGTCACATGGCTCTAGTACGATGTTTTCAGCTGGCATTTTCCCTTCGTAGCATTGCTGTGGATCAAGAAGGTAAACTGAAATCATTTTGTAACTGTTAGGAATCTGTTCATTATGTGCCTGTGTGCTTGTGAATTTTGGGGTTCAGTTTCTATATTCTTTTTAAACAAGATATGACCTCACTGAAATGATGAAAAGATACAAAAATATTTAAAGGAAACAAACTCTTAAAAAGGGAGTGAAAGAGAACTACAAATGTAAAATTAAAAAGGAATCTTATTAGGTAAAAATGAAATGAATGCATCCGAATTCAGTATAAATATCTTTAGGGAATAACGAGCAAACAAATTAGAAAGAGAACACCAATGAGAGGGTTTGAATATGTTGATGAGAATCTTTCCATTCTAATCATTTTGGAGTAGTTTCTATTTTTGGCATTCTTTTACTACAAGGTTTCCAAATTTGAACATTAAGACTTTATCACTTATTTCGTCAATAACTTAAAATGATACAATAACTTGTTACTCTGCTTCTCTTGAATCTTAAGACCACAAGAGATGCTGTCTTTTCACTCTGTATATTCAGCCACATTTGAATCTTATGTTCAGCGACATTTGAGTCTCCACCAATCATTTCAATAATACATATCTGGTTTGACCTTTGGACGGAGATGCAAGTATTTAGGATAAAAAGAATTCTTTCTACCTTTTAAGTATCTTGGGTTATATCTGATTATCTTCCACTATTTATACATTTAATTGTTAAATTAACTATATGGATTCTTCTGATAGCAAGTACTTTATTTCTTCAGAGTTGTTCGTTATTTTAAAAGACTAGCAGTGTTTTCTTGTCAAGAAACTACTAACGGGCAGATCTCTCTGTGCTCCTTTTGAAGGTGGTTTACTACCCTCTCGCAGAAGATCAATCTTCACCTTGGCCTCATTTATGCTTCTGTTTTCAGCCAGGGTGGGAGATCTCCCAGATTTGACTACCGTCATTAAAGCATCATTAGATAATAAAATGGTTAATCTTCGAACCTATCCCATAGGATCAATACATTCCTTTTGTACTTTGTTTGATTATTTTGTGTAGTACATGTGCTAAAACTTTCAACTGGACTTTTGGATCTGACAGGTTGATCCTCACCTTCAGTTGGTTAATGATATCAGGCTGCTGGCTGTTCGTGTCAAGTCTGAAAAGGACAGTGTACCATTTGGGTCAGAAGAAGACGAAGTTGCTGCATTGAAGTTTCTTTCAATTCTTGAACTAGATGAACAACAGTTGAAGGAAACTGTGGTCTCACACTTCACGATTAAATATGCCAATCTCTCAGAGGTTTTGATTCTCATAACACTTCATGATAACTTTTATACATGGCATTTTTTCCCACGAGAAACCATTTTCATTGTGAAATATCAAAGTTTAACAAGAGACACTGGAGAACAGCCAGAAAAAACTTCAAGGAATTAAAAAACTCCATGGAAAATAACTAAGAATGTGAGGCTTCATGTAAACATGCGGAAGTCCAAGTCTATGATGCATTGGAATTTCTTTTTCAATTTAAAGTAGTGAAGATCTTATGACTATGAAATTGGATGAAATCTGCAGAATGTTTTGCGAGTTTTCAAGATTTTAGTTATAGTGCTTGGTTGGAAGTAGTTTCATTTTTAAATTATTACTGAAAAGCCAGGCTTTCATGGAAAATGATAAAAAAAAAAAAGCCCAACAAACTAGTTAAAAATGGTCACGGAAACTCAAATAAAAACATTTGAATTGGTAAAAAGTAGGATGAATTGAAAAATATGAGGGAATATGTGCAGCATTCCAACCAGGCCAGAGCTCAAATAAAAACATTTGAGTTGGTATTAATTAAAAATGGAGATACGTTACAAAAAACTGCCGCAATGAAACTCCATGAGAAATCAATTAATCTTTCTCCTAGGTGCCAGTGAGGAGGACTTCAGGACTCTAATTACTTGTTATTGTTGATTTTTAATCAGTAGTTTTCTAAAAATTGTGTTAATGAACTATTTTCTCTTTTAGGCCGAGCTATCAAGTATTAGAGAGCAGCTCTTACATGGGTTCTTACCTGATGAGGCATACCCATTAGGAGCTCCATTATTTATGGAGACACCACGTCCATGCTCTCCACTTGCAAAGCTGGCATTTCCAGATTATGATGAGGTGAGACATGCTTAGGTTGTGTGTCAGTTGGCATTATCCTTTAAATGAGTAACAGTAAAACCAGAATGTGTTTCTTTCAGGGTATGCCTCCAGCTGCTTTGACAGATGATGAAGCCTTCCTTGAGCCTAGTGGAAGCCAGTCTGATCGCAAAACATCACTTTCCATCAGTAACCTTGACATTCTAAACGTTAATCAGCTTTTGGAATCAGTAAGACAAACATTTTTTGGAGCTCCAGTCTTCAAGTTCTTGTTTAATCATCCTATCTACATGAATCCTTCTATTTTGCACTTATGGAATATTTCCAGAGTATTGGATCATACTGAACCATTTCTTATTTTTCTACAATTCTGAATTTATGCTTGCTATTTTGTTCTGTAGTGTTATTGCAATGAAAATTAGTGTGTACTTATTGTTGAAATGTAGGATATTTTCTTTGTCAATAAATTATGTGTATAGGATATTTTTGTAATTGTATTGAAATAAGATGCTTGTTATAAAAGTTGAGGTAGAGGAGAATAAATGGGTGGAGTCGAGATCCTTCATACAATCATTTGGGCTTATTGAAGATCTAGTGCCAATTCAATACATAATTATCGCAATGATATTCCAATCTGAAAATAAGAACTTGACGAGCAAGTGCCAAAAAGACAGTGACAAAATCAGAATCTTGGGGACTGGTGTAAAAGTTCTTGTTATTTCCTCGTTTTACTCCCATTAGAATCCTAGATTTTATTATCCTCTTGAGTCTTATGTTTCCTAGAATCAACATGAAAGAGAAAACAAACACATACACACAGACTGCATGAATGAAGCATGTATAATGTAATGTACGAAACGGAAACTGATGAGACATTTTTCTAAGTCAAGAGAGGCTCCAATTGGTTTTGGCATGGTATGATGAATGATGGATGAAGAAACGAACCAATTTTTCTTTTGATGAAGAGAGGATACTAGGACCATGAAGGAGTGGCTAGGTTATGAAGTCCTTCAGGTTGTGGTTTGTAGGTACGCTCCAAATTAATCTCTAAGCAGTCAAAAGGTATACTAGCTAGGGGATAAGACAATGAGTGGTAGAGATTACCTTCACTGTTCGTCTGCGCTAGCTGCACAAGATTAGCAAACAGAGCCCTGAGGTTTGACGAGTATAGGAATACTTGGTCTTTTTATGAACTGATGGTCTTGTAAAGTACTCTCCTTTGGTTGGTGTGGGGTGTTAAATTTTCAAATAGCCTCTATGTGTCATAGATTTTTGGATAGCAATAATTTCTAAAGAAATGAATAAATTTTCCTTGCAATAATTACATGTATACATTGGATCCGTGGAGTGATGTCTGTATTATTTACAGGTGCTCGAAACAGCCAGACAAGTTGCAAGCTTTCCAGTCTCTTCTGCGCCTGTTCCATATGATCAAATGAAAAGCCAATGTGAGGCCCTTGTTAGTTGTAAACAGCAGAAGATGTCAGTGCTTCATAGTTTCAAGCACAAAAAAGAAGAGAAGGCGATTGTCCTCTCCAGTGAAATTGAAACTTTATATCCTCCGTTACCTCTCAATGTAAGTTATCATTTCATTTCAAAAGTTGTGGCGGGCTATCATTAACTTCATTATTCTTATTGATTTTTTCCGAACTTGAAAGGAGAATTAGCTCTTCCTTTTATGCTCTCCTGAAAAAGTTAGTCAAGGAGCTTGACACGCTGTGTGGCTAGTCGTTTTATACCCATTGATATAGAGAAGATAAATGAAAATTTATGGTTTCATCTTACATTGGCTGATCACATTGAATTATCCTACTTCTATTAGCTATACAATTGAGACAGAGCTATACAATTGGTACAGTTGCAAATATGACAACAAATCCAAAGTATTAGCTGATATAGTACAATGCTAAAGAATTTGCAAAAATAGCAAAATTTAGATCCAACTCTCCAAGTCTATTATTGATGGATCTACTGTGCTATATTTGAAATTCTTTAAAAGTATTGTTATACATAATTGTTAACCCTAAAATTGCTAGTCATTGCAATTACCCTATAAAATTTAGTTGTAGTAGTCCTATTCTTTATATCATTTCATTGATCCATGAAAAACTATTGCAAATATTCTCTTGTCGTTCTCGTTTCAGACAATGGAAATCGTTCAAGGGGATCTCAAGTTTTATAACAATGAGACAAACAGAGGACAGGATCAGCCGCTTCTTTGTTCACATGAATATGGTCGTCACTCTTTAAGATTGCCACCATCAAGTCCATATGACAAATTCTTGAAAGCTGCTGGATGCTAGAACATAGCTGTAATTTACATGTTAAAAGGCAATAGTTTGTATTCTCAAATACACTGCTTCAATATCCCACTTTCTATTTATTCATTCTTCATATTAAATTGTCGATCAAGTTAATTTGCTATACATATGATGCTTCTGAAATACAAGCAACAAGCATTGGAGTGTCTTTGGCAAGAATAGTTGGTTTTGCTAATGCAGTAGTCCAGATACACTAAGAAAGGTCTTGGGAAGATTTGTCTATTTCTAAGATATCCGATACCTCCTTGTTTTTGTACTATAGTTTCTTATTTGGATTCCTCATTTAGAGGGAGTTGTGTATTTGAGATTGATAATCATGTTTATTTGTACATACCGATGTATTTTTCTATTTTATTATTCGAAGATGTGATTCATATTTTTAAGGACTGGTATTGGTTTGGGAGTGTGCTTTTACTACTCAAAGAGAGGATTCAAACCTTCAACTCGTGTAATAATTCATGATTCACTATGCACCAATATTTTTCATCACTTATCATCGTACTATTGAAAATTATGTGGCTGAATACTATTGATGGTATACATGAAAATCCATCAATCAA

Coding sequence (CDS)

ATGGGGGTTATGTCTCGGCGGGTTGTTCCTGCCTGTGGTAACCTCTGTTTCTTCTGTCCTTCTATGAGGGCGAGATCAAGACAGCCTGTGAAACGATACAAGAAGTTCCTTGCTGACATATTTCCTCGTAATCAGGATGCTGAACCTAATGATAGAAAAATTTGTAAGCTCTGTGACTATGCTTCAAAAAACCCGTTGCGTATTCCCAAGATTACCGAACTCCTGGAGCAACGATGCTACAAAGATCTGCGGAATGAGAATTTTGGATCTGTGAAAGTTGTAATATGTATATACAGAAAACTTCTATTAATGTGCAAAGATCAGATGCCACTTTTTGCTAGTAGCTTAATTGGGATTTCTCGAACTCTTTTAGAACAAACACGGCATGATGATATGCAGATTCTTGGTTGCAATATTCTTGTTGAGTTCATAAGTAGCCAGACAGATAGTACATACATGTTCAACTTGGAGGGCATCATTCCAAAACTTTGCCAATTGGCTCTAGAAGGCGAGAGTAATGATGAGGCACCACATTTGCGGTCAGCTGGACTTCAAACTCTAGCTTCTATGATACTGTTCATGGGCGAGCAATCTCACATCTCGATGGACTTTGATAAAATTATATCTGCGGTCTTGGAAAACTATGTAGTAGATGGACAATTTTCTCACTCAGAAGCTCAGTACATTGAAGGACAACATAAAGTAGAAAACCATAGCTCTTCCATGTTAGATGTCGATAAAAAGTTCTCTTCGTTTAACCATTTTAATAATTCGGCAACTGAAGTGGATGTTTCCAAGAACCCTTCTTATTGGTCTAGAGTTTGCTTGTGTAATATGGCTAGATTGGCAAAGGAAGCTACAACTGTCAGGCGTATGTTTGAACCTCTATTTCATCATTTTGATACTGAAAATCAATGGTCCTTAGTTAAAGGACTTGCCTACTCGGTGTTGTCATTTATGCAATCGCTTTTGGATGAATCAGGGGACAACTCATATCTTTTGTTTTCGATTCTTGTCAAGCACTTGGATCATAAAAGTGTTGTAAAAAAGCCTCAAGTTCAAGTGGATATTATCAATGTAACCACACAACTTTCTCAAAATGCAAAGACACAAGCCTCAGTTACTATTATTGGGGCTATCAATGATTTGATAAAACATCTACGGAAGTGCATTCTATGTTCATCTGAAGCATCCAGCAATGGACATGACACAGATAAATGGAATACTGATCTTCAGTTGGCACTCGAAAAGTGCATTTCTCAGCTTTCAAAGAAGGTAGGTGATGCAGGACTCATACTTGATATGCTAGCTGTTGTCCTCGAGAATATTTCAAATAATAATATTTCAGCTCGGGCAACAGTCTCCGCTGTTTATCAGACTGCAATGACTGTATCTTCTATTCCTAATGTTTCATATTACAAGAAGGCTTTTCCTGATGCTCTATTTCATCAGTTGCTTTTAGCAATGGCTCACCCTGATCATGAGACTCGAATTGGGGCACACGACATTTTCTCTATAGTGCTTATGCCATCCATTAAGTGTCCTATGATGGAACAGAAGACGATTTCCTCAGACACTGTTTCATGGTTACCATTTAGCAGTCCCACACAGAAGTTGACTAGTGGAGGTTTCTCCTTTAAAGACGATGACAATCATGTATCAGAATCTATAAATGGGGTAAGAATGGAAGAAAGTCAAGCAGCACACCTTGTTTCTGAAAATTATACAACACATCCATCTAGGCATGAATCCTCCAGCTTCAACCATAGTTCAAACGAGTCAAAAACTAAGTTGAATTCCCTCCGGTTAAGCAGTCACCAAGTTAGACTCCTGCTCTCCTCAATCTGGGTGCAAGCTACATCTGCGGATAATACACCTGCAAATTTTGAGGCTATGGCTCAAACTTATAGCATTGCTTTGCTATTTACCCGGTCTAAGACTTCGAGTCACATGGCTCTAGTACGATGTTTTCAGCTGGCATTTTCCCTTCGTAGCATTGCTGTGGATCAAGAAGGTGGTTTACTACCCTCTCGCAGAAGATCAATCTTCACCTTGGCCTCATTTATGCTTCTGTTTTCAGCCAGGGTGGGAGATCTCCCAGATTTGACTACCGTCATTAAAGCATCATTAGATAATAAAATGGTTGATCCTCACCTTCAGTTGGTTAATGATATCAGGCTGCTGGCTGTTCGTGTCAAGTCTGAAAAGGACAGTGTACCATTTGGGTCAGAAGAAGACGAAGTTGCTGCATTGAAGTTTCTTTCAATTCTTGAACTAGATGAACAACAGTTGAAGGAAACTGTGGTCTCACACTTCACGATTAAATATGCCAATCTCTCAGAGGCCGAGCTATCAAGTATTAGAGAGCAGCTCTTACATGGGTTCTTACCTGATGAGGCATACCCATTAGGAGCTCCATTATTTATGGAGACACCACGTCCATGCTCTCCACTTGCAAAGCTGGCATTTCCAGATTATGATGAGGGTATGCCTCCAGCTGCTTTGACAGATGATGAAGCCTTCCTTGAGCCTAGTGGAAGCCAGTCTGATCGCAAAACATCACTTTCCATCAGTAACCTTGACATTCTAAACGTTAATCAGCTTTTGGAATCAGTGCTCGAAACAGCCAGACAAGTTGCAAGCTTTCCAGTCTCTTCTGCGCCTGTTCCATATGATCAAATGAAAAGCCAATGTGAGGCCCTTGTTAGTTGTAAACAGCAGAAGATGTCAGTGCTTCATAGTTTCAAGCACAAAAAAGAAGAGAAGGCGATTGTCCTCTCCAGTGAAATTGAAACTTTATATCCTCCGTTACCTCTCAATACAATGGAAATCGTTCAAGGGGATCTCAAGTTTTATAACAATGAGACAAACAGAGGACAGGATCAGCCGCTTCTTTGTTCACATGAATATGGTCGTCACTCTTTAAGATTGCCACCATCAAGTCCATATGACAAATTCTTGAAAGCTGCTGGATGCTAG

Protein sequence

MGVMSRRVVPACGNLCFFCPSMRARSRQPVKRYKKFLADIFPRNQDAEPNDRKICKLCDYASKNPLRIPKITELLEQRCYKDLRNENFGSVKVVICIYRKLLLMCKDQMPLFASSLIGISRTLLEQTRHDDMQILGCNILVEFISSQTDSTYMFNLEGIIPKLCQLALEGESNDEAPHLRSAGLQTLASMILFMGEQSHISMDFDKIISAVLENYVVDGQFSHSEAQYIEGQHKVENHSSSMLDVDKKFSSFNHFNNSATEVDVSKNPSYWSRVCLCNMARLAKEATTVRRMFEPLFHHFDTENQWSLVKGLAYSVLSFMQSLLDESGDNSYLLFSILVKHLDHKSVVKKPQVQVDIINVTTQLSQNAKTQASVTIIGAINDLIKHLRKCILCSSEASSNGHDTDKWNTDLQLALEKCISQLSKKVGDAGLILDMLAVVLENISNNNISARATVSAVYQTAMTVSSIPNVSYYKKAFPDALFHQLLLAMAHPDHETRIGAHDIFSIVLMPSIKCPMMEQKTISSDTVSWLPFSSPTQKLTSGGFSFKDDDNHVSESINGVRMEESQAAHLVSENYTTHPSRHESSSFNHSSNESKTKLNSLRLSSHQVRLLLSSIWVQATSADNTPANFEAMAQTYSIALLFTRSKTSSHMALVRCFQLAFSLRSIAVDQEGGLLPSRRRSIFTLASFMLLFSARVGDLPDLTTVIKASLDNKMVDPHLQLVNDIRLLAVRVKSEKDSVPFGSEEDEVAALKFLSILELDEQQLKETVVSHFTIKYANLSEAELSSIREQLLHGFLPDEAYPLGAPLFMETPRPCSPLAKLAFPDYDEGMPPAALTDDEAFLEPSGSQSDRKTSLSISNLDILNVNQLLESVLETARQVASFPVSSAPVPYDQMKSQCEALVSCKQQKMSVLHSFKHKKEEKAIVLSSEIETLYPPLPLNTMEIVQGDLKFYNNETNRGQDQPLLCSHEYGRHSLRLPPSSPYDKFLKAAGC
Homology
BLAST of Cucsat.G5989 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 6.4e-29
Identity = 102/366 (27.87%), Postives = 165/366 (45.08%), Query Frame = 0

Query: 19  EFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYLI 78
           E+ +     H+ +   TP+ N V ER +R +    R+++  +K P +FWGE + +  YLI
Sbjct: 562 EYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLI 621

Query: 79  YWTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFL---- 138
             +P V L+          KE  YS ++ FGC A+A      R+K D ++ PC+F+    
Sbjct: 622 NRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGD 681

Query: 139 ---GYKLYDLARRKFFVSRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCLEK 198
              GY+L+D  ++K   SRDV+F E    + +  +    + +  +  FV    +      
Sbjct: 682 EEFGYRLWDPVKKKVIRSRDVVFRES--EVRTAADMSEKVKNGIIPNFV---TIPSTSNN 741

Query: 199 ETIIDLTTAERPISKNTLEDNHGVDDHDPCTEDSEETNSPVQ-------IPITIAPRKSS 258
            T  + TT E  +S+   +    ++  +   E  EE   P Q       +  +  PR  S
Sbjct: 742 PTSAESTTDE--VSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVES 801

Query: 259 RQYHPPSYLKDFHCNLTSQRSTPFHLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVK 318
           R+Y    Y+      L S    P  L + LS+       KN L                 
Sbjct: 802 RRYPSTEYV------LISDDREPESLKEVLSHP-----EKNQLM---------------- 861

Query: 319 HQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTIDRYKARLVEKN 371
                KAM EE+E++++  T+ +V LPK    +  KWV+K+K   D  + RYKARLV K 
Sbjct: 862 -----KAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKG 888

BLAST of Cucsat.G5989 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 2.9e-21
Identity = 108/439 (24.60%), Postives = 168/439 (38.27%), Query Frame = 0

Query: 18   REFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYL 77
            R++ ++   +H  S  +TP+ N + ERKHR +  +   L+  +  P T+W       VYL
Sbjct: 580  RDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYL 639

Query: 78   IYWTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLGYK 137
            I   P  LL   + F  LF +  +Y  ++ FGC  Y       R K + +++ C F+GY 
Sbjct: 640  INRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYS 699

Query: 138  LYDLA-------RRKFFVSRDVLFFEELFPIH--------SIKEKGTSI----SHDFLEQ 197
            L   A         + + SR V F E  FP          S +++  S     SH  L  
Sbjct: 700  LTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPT 759

Query: 198  FVIPCPLIDCLEKETIIDLTTAERP-----------ISKNTL----------------ED 257
              +  P   CL       L T+ RP           +S + L                  
Sbjct: 760  TPLVLPAPPCLGPH----LDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSH 819

Query: 258  NHGVDDHDPCTEDSEETNSPV----------------QIPITIAPRKSSRQYHPPSYLKD 317
            N       P    +  +NSP+                  P+  +P  S     P + + +
Sbjct: 820  NGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISE 879

Query: 318  FHCNLTSQRSTP-----------FHLTKYLSYKAYSQHHK------------NYLFNIAS 371
             +   +S  STP             +        +S   +            +Y  ++A+
Sbjct: 880  PNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAA 939

BLAST of Cucsat.G5989 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 95.1 bits (235), Expect = 1.7e-18
Identity = 99/407 (24.32%), Postives = 168/407 (41.28%), Query Frame = 0

Query: 18  REFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYL 77
           R+F  K   ++  +  +TP+ N V ER  R +   AR ++  +K   +FWGE +L+  YL
Sbjct: 561 RQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYL 620

Query: 78  IYWTPMVLL--SNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLG 137
           I   P   L  S+ T +     K+     +R FG   Y   +   + KFD ++   +F+G
Sbjct: 621 INRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVH-IKNKQGKFDDKSFKSIFVG 680

Query: 138 Y-----KLYDLARRKFFVSRDVLF------------FEELFPIHSIKEKGTSISHDF--L 197
           Y     KL+D    KF V+RDV+             FE +F   S + +  +  +D   +
Sbjct: 681 YEPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKI 740

Query: 198 EQFVIPCPLIDCLEKETIIDLTTAE--------RPISKNTLEDNHGVDDHDPCTEDSEET 257
            Q   P    +C   + + D   +E        R I +    +     D+    +DS+E+
Sbjct: 741 IQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKES 800

Query: 258 NSPVQIPITIAPR-------------KSSRQYHPPSYLKDFHCN---------LTSQRST 317
           N           R               SR+     +LK+   +         + ++RS 
Sbjct: 801 NKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSE 860

Query: 318 PFHLTKYLSYKAYSQHHKNYLFNIASIYE--PSYYHQVV---KHQTWRKAMVEEIEAMKR 369
                  +SY          + N  +I+   P+ + ++       +W +A+  E+ A K 
Sbjct: 861 RLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKI 920

BLAST of Cucsat.G5989 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 70.5 bits (171), Expect = 4.6e-11
Identity = 34/76 (44.74%), Postives = 47/76 (61.84%), Query Frame = 0

Query: 295 EPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTID 354
           EP      +K   W +AM EE++A+ R  TW +V  P N + +G KWV+K K   D T+D
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 355 RYKARLVEKNYNQQEG 371
           R KARLV K ++Q+EG
Sbjct: 87  RLKARLVAKGFHQEEG 102

BLAST of Cucsat.G5989 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 2.1e-08
Identity = 48/152 (31.58%), Postives = 71/152 (46.71%), Query Frame = 0

Query: 220  PCTEDSEETNSPVQIPITIAPRKSSRQYHPPSYLKDFHCNLTSQRSTPFHLTKYLSYKAY 279
            P T  S  + SP    I I P        PP   +  + N  +  +T    T+  +    
Sbjct: 888  PTTSASSSSTSPTPPSILIHP--------PPPLAQIVNNNNQAPLNTHSMGTRAKAGIIK 947

Query: 280  SQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHT-VG 339
                 +   ++A+  EP    Q +K + WR AM  EI A    +TW +V  P +H T VG
Sbjct: 948  PNPKYSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVG 1007

Query: 340  NKWVYKVKCKLDSTIDRYKARLVEKNYNQQEG 371
             +W++  K   D +++RYKARLV K YNQ+ G
Sbjct: 1008 CRWIFTKKYNSDGSLNRYKARLVAKGYNQRPG 1031

BLAST of Cucsat.G5989 vs. NCBI nr
Match: KAA0038341.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYJ97054.1 Reverse transcriptase, RNA-dependent DNA polymerase [Cucumis melo var. makuwa])

HSP 1 Score: 350 bits (897), Expect = 4.06e-114
Identity = 171/236 (72.46%), Postives = 192/236 (81.36%), Query Frame = 0

Query: 135 GYKLYDLARRKFFVSRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCLEKETI 194
           GYKLYD+ARRKFF+SRDVLFFEELFP HSIKEK   ISH+FLEQFVI  PL DCLEKE I
Sbjct: 9   GYKLYDIARRKFFISRDVLFFEELFPFHSIKEKDIPISHNFLEQFVILSPLFDCLEKEVI 68

Query: 195 IDLTTAERPISKNTLEDNHGVDDHDPCTEDSEETNSPVQIPITIAPRKSSRQYHPPSYLK 254
            +  T  R ++++TLED+HG +D +P T +S+ETN+  Q PI    RKSS  +HPPSYLK
Sbjct: 69  TNPFTDARSMTEDTLEDSHGANDQNPYTSNSKETNNTNQAPILTMTRKSSWPHHPPSYLK 128

Query: 255 DFHCNLTSQRSTPFHLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVE 314
           DF  NLTSQ STPF L +Y SY AYSQHH+NYLFN+ SIYEP+YYHQ VKHQTWRKAM  
Sbjct: 129 DFRYNLTSQNSTPFPLNQYRSYNAYSQHHRNYLFNVTSIYEPTYYHQAVKHQTWRKAMAL 188

Query: 315 EIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTIDRYKARLVEKNYNQQEG 370
           EIEA KR NTWTIVSLPK+HHTVG+KWVYKVKCK D TIDRYKARLV K YNQQEG
Sbjct: 189 EIEATKRINTWTIVSLPKDHHTVGSKWVYKVKCKPDGTIDRYKARLVAKGYNQQEG 244

BLAST of Cucsat.G5989 vs. NCBI nr
Match: KAA0043630.1 (Reverse transcriptase, RNA-dependent DNA polymerase [Cucumis melo var. makuwa] >TYK09733.1 Reverse transcriptase, RNA-dependent DNA polymerase [Cucumis melo var. makuwa])

HSP 1 Score: 278 bits (711), Expect = 1.07e-85
Identity = 138/222 (62.16%), Postives = 161/222 (72.52%), Query Frame = 0

Query: 149 SRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCLEKETIIDLTTAERPISKNT 208
           SRDVLFFEE FP  SIKE    ISHDF++QFVIP PL D LE +              + 
Sbjct: 146 SRDVLFFEERFPFQSIKEDNKHISHDFIDQFVIPSPLFDHLENKGF------------HP 205

Query: 209 LEDNHGVDDHDPCTEDSEETNSPVQIPITIAPRKSSRQYHPPSYLKDFHCNLTSQRSTPF 268
            +DNHG+ D  P  E S+ETN  +Q  I  A R+SSR + PPSYLKDFHCNLTS R +PF
Sbjct: 206 FKDNHGIVDPQPHIETSKETNHLIQTTIPTASRRSSRPHCPPSYLKDFHCNLTSHRKSPF 265

Query: 269 HLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIV 328
            L KYLSY +Y+Q+HK Y  N+ SIYEP+YYHQ + HQ W+KAM EEIEAM+RTNTWTIV
Sbjct: 266 PLEKYLSYNSYNQNHKKYTLNVTSIYEPTYYHQAMNHQNWKKAMAEEIEAMERTNTWTIV 325

Query: 329 SLPKNHHTVGNKWVYKVKCKLDSTIDRYKARLVEKNYNQQEG 370
           SLPKN+HTVG+KWVYKVK K   T+DRYKARL+ K YNQQEG
Sbjct: 326 SLPKNYHTVGSKWVYKVKYKQHGTVDRYKARLIAKGYNQQEG 355

BLAST of Cucsat.G5989 vs. NCBI nr
Match: TYK16758.1 (Copia protein [Cucumis melo var. makuwa])

HSP 1 Score: 278 bits (711), Expect = 1.07e-81
Identity = 165/386 (42.75%), Postives = 228/386 (59.07%), Query Frame = 0

Query: 20  FFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYLIY 79
           FF +    HQ+SCV  P+QNSVVERKH+ + N ARAL FQS+ PL FWG+CIL+ +YLI 
Sbjct: 254 FFEQKGVIHQYSCVQCPQQNSVVERKHQHILNTARALYFQSQVPLNFWGDCILTAIYLIN 313

Query: 80  WTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLGY--- 139
            TP  LL   +SF  L     DY+ ++ FG L YAS+L   RSKF  RA P VF+GY   
Sbjct: 314 RTPSKLLQWKSSFQKLNNTIPDYNSLKVFGSLCYASSLPYNRSKFQIRAIPSVFIGYPQG 373

Query: 140 ----KLYDLARRKFFVSRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCL--E 199
               KLYD+  +K F+SRDV+F E  FP H+I +   SI  D L  F +P P  +C    
Sbjct: 374 MKAYKLYDIEHKKVFISRDVIFHETTFPFHNIPKNQISI--DPLPGFSLPKPFHECNLHS 433

Query: 200 KETIIDLTTAERPISKNTLED--NHGVDDHD-----PCTEDSEETNSPVQIPITIAP--- 259
             T+I   T +   +    ED  N  ++ +D     P  +++   NS +    TI+    
Sbjct: 434 HTTLIPPPTLQTNTTPTRDEDPSNSNIESNDFENQQPMNDENMARNSDINE-TTISQQDR 493

Query: 260 ----------RKSSRQYHPPSYLKDFHCNL------TSQRSTPFHLTKYLSYKAYSQHHK 319
                     RKS+R   PPSYL+ +HC+L      T+Q+ST + + +YLSY+A S  +K
Sbjct: 494 QPNANEETTIRKSTRITKPPSYLQAYHCSLLTTQSPTTQKSTKYPINQYLSYQALSPTYK 553

Query: 320 NYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYK 370
             +  +++  E S+YH+ V  Q WR+AM  E+EAM+   TW+IV LPK  +++G +WVYK
Sbjct: 554 YSILQVSTKKELSFYHEAVISQEWREAMKAELEAMETNQTWSIVPLPKGKNSIGCRWVYK 613

BLAST of Cucsat.G5989 vs. NCBI nr
Match: XP_022154919.1 (uncharacterized protein LOC111022065 [Momordica charantia])

HSP 1 Score: 273 bits (698), Expect = 7.26e-80
Identity = 164/410 (40.00%), Postives = 221/410 (53.90%), Query Frame = 0

Query: 19  EFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYLI 78
           EFF      HQFSCV  P+QNSVVERKH+ L NVAR+L FQS+ P  FWGEC+L+  YLI
Sbjct: 395 EFFHSKGVLHQFSCVGCPEQNSVVERKHQHLLNVARSLYFQSRVPTAFWGECVLTAAYLI 454

Query: 79  YWTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLGY-- 138
             TP  +L  +T +A L+   ADYS ++ FGCL + ST    RSKF PRA   VF+GY  
Sbjct: 455 NRTPTPVLDWNTPYARLYGHSADYSSLKVFGCLCFVSTSPVNRSKFHPRALTSVFVGYPP 514

Query: 139 -----KLYDLARRKFFVSRDVLFFEELFPIHSIKEK------------------------ 198
                KLYD+  ++FFVSRDV+F E +FP H++                           
Sbjct: 515 GMKGYKLYDIENKRFFVSRDVIFHESIFPFHTVSVNSPIVDPFPGVVIPKSYDLVDTSSG 574

Query: 199 ----------GTSISH---DFLEQFVIP--CPLIDCLEKETIID--LTTAERPISKNTLE 258
                     G+++S    D     VIP   P++   E +      L   E  I  N +E
Sbjct: 575 APDDHHNYATGSAVSPTSADISPTVVIPDGSPIMVANESDANGSPILVANESNIMPNIVE 634

Query: 259 DNHGVDDHDPCTEDSEETNSPVQIPIT-----IAPRKSSRQYHPPSYLKDFHCNLT---- 318
           ++      +P   D    +S V +P +     +  R+SSR    PSYL+D+HC L     
Sbjct: 635 NSL----INPLDTDVANVDSAVVVPCSDPSDSVTLRRSSRVAQRPSYLRDYHCGLIQATD 694

Query: 319 -SQRSTPFHLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMK 370
            S  S  + L KYL Y   S  +K ++ +++  YEP +YHQ V    WR+AM  E+ AM+
Sbjct: 695 HSASSVFYPLQKYLDYNNLSASYKEFVLSVSCDYEPQFYHQAVPFSHWREAMRAELHAME 754

BLAST of Cucsat.G5989 vs. NCBI nr
Match: RVX06074.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 259 bits (661), Expect = 3.00e-73
Identity = 149/370 (40.27%), Postives = 202/370 (54.59%), Query Frame = 0

Query: 15  VFPREFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSV 74
           +F   F+      H  SCV TP+QNSVVERKH+ + NVARAL+FQS  P+ +W +CIL+ 
Sbjct: 614 LFLSNFYHSLGVIHYRSCVETPQQNSVVERKHQHILNVARALLFQSSLPVCYWSDCILTA 673

Query: 75  VYLIYWTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFL 134
           VYLI  TP   L+N T F  L +K  DYS +R FGCL Y STL A R+KF PRA+  VFL
Sbjct: 674 VYLINRTPSPFLNNKTPFEILHDKLLDYSHLRVFGCLCYVSTLKANRTKFSPRAKAVVFL 733

Query: 135 GY-------KLYDLARRKFFVSRDVLFFEELFPIHSIKE-KGTSISHDFLEQFVIPCPLI 194
           GY       KL D+  R   +SR+V+F EE+FP           IS D     V+PC   
Sbjct: 734 GYPFGFKGYKLLDIETRSISISRNVIFHEEIFPFSKTNPCSSLDISSDLFHDRVLPCIAA 793

Query: 195 DCLEKETIIDLTTAERPISKNTLEDNHGVDDHDPCTEDSEETNSPVQIPITIAPRKSSRQ 254
           D  +  +++    ++ P+                            Q+  +  P + S+Q
Sbjct: 794 DNDQSSSVLPRVVSQPPL----------------------------QVAPSSRPTRVSKQ 853

Query: 255 YHPPSYLKDFHCNLTSQ------RSTPFHLTKYLSYKAYSQHHKNYLFNIASIYEPSYYH 314
              PSYLKD+HC+L +        ST   +  +LSY   S  +K +  +++ I EPS + 
Sbjct: 854 ---PSYLKDYHCSLINSVAHVETHSTSHPIQHFLSYDKLSPSYKLFSLSVSIISEPSSFA 913

Query: 315 QVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTIDRYKARL 370
           +  +   WR AM  E+EA++   TW+IVSLP   H VG KWVYK+K K + TI+RYKARL
Sbjct: 914 KAAEIPKWRAAMDCELEALEENKTWSIVSLPVGKHPVGCKWVYKIKHKANGTIERYKARL 952

BLAST of Cucsat.G5989 vs. ExPASy TrEMBL
Match: A0A5D3BDH6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold506G001210 PE=4 SV=1)

HSP 1 Score: 350 bits (897), Expect = 1.97e-114
Identity = 171/236 (72.46%), Postives = 192/236 (81.36%), Query Frame = 0

Query: 135 GYKLYDLARRKFFVSRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCLEKETI 194
           GYKLYD+ARRKFF+SRDVLFFEELFP HSIKEK   ISH+FLEQFVI  PL DCLEKE I
Sbjct: 9   GYKLYDIARRKFFISRDVLFFEELFPFHSIKEKDIPISHNFLEQFVILSPLFDCLEKEVI 68

Query: 195 IDLTTAERPISKNTLEDNHGVDDHDPCTEDSEETNSPVQIPITIAPRKSSRQYHPPSYLK 254
            +  T  R ++++TLED+HG +D +P T +S+ETN+  Q PI    RKSS  +HPPSYLK
Sbjct: 69  TNPFTDARSMTEDTLEDSHGANDQNPYTSNSKETNNTNQAPILTMTRKSSWPHHPPSYLK 128

Query: 255 DFHCNLTSQRSTPFHLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVE 314
           DF  NLTSQ STPF L +Y SY AYSQHH+NYLFN+ SIYEP+YYHQ VKHQTWRKAM  
Sbjct: 129 DFRYNLTSQNSTPFPLNQYRSYNAYSQHHRNYLFNVTSIYEPTYYHQAVKHQTWRKAMAL 188

Query: 315 EIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTIDRYKARLVEKNYNQQEG 370
           EIEA KR NTWTIVSLPK+HHTVG+KWVYKVKCK D TIDRYKARLV K YNQQEG
Sbjct: 189 EIEATKRINTWTIVSLPKDHHTVGSKWVYKVKCKPDGTIDRYKARLVAKGYNQQEG 244

BLAST of Cucsat.G5989 vs. ExPASy TrEMBL
Match: A0A5A7TK43 (Reverse transcriptase, RNA-dependent DNA polymerase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold447G001520 PE=4 SV=1)

HSP 1 Score: 278 bits (711), Expect = 5.16e-86
Identity = 138/222 (62.16%), Postives = 161/222 (72.52%), Query Frame = 0

Query: 149 SRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCLEKETIIDLTTAERPISKNT 208
           SRDVLFFEE FP  SIKE    ISHDF++QFVIP PL D LE +              + 
Sbjct: 146 SRDVLFFEERFPFQSIKEDNKHISHDFIDQFVIPSPLFDHLENKGF------------HP 205

Query: 209 LEDNHGVDDHDPCTEDSEETNSPVQIPITIAPRKSSRQYHPPSYLKDFHCNLTSQRSTPF 268
            +DNHG+ D  P  E S+ETN  +Q  I  A R+SSR + PPSYLKDFHCNLTS R +PF
Sbjct: 206 FKDNHGIVDPQPHIETSKETNHLIQTTIPTASRRSSRPHCPPSYLKDFHCNLTSHRKSPF 265

Query: 269 HLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIV 328
            L KYLSY +Y+Q+HK Y  N+ SIYEP+YYHQ + HQ W+KAM EEIEAM+RTNTWTIV
Sbjct: 266 PLEKYLSYNSYNQNHKKYTLNVTSIYEPTYYHQAMNHQNWKKAMAEEIEAMERTNTWTIV 325

Query: 329 SLPKNHHTVGNKWVYKVKCKLDSTIDRYKARLVEKNYNQQEG 370
           SLPKN+HTVG+KWVYKVK K   T+DRYKARL+ K YNQQEG
Sbjct: 326 SLPKNYHTVGSKWVYKVKYKQHGTVDRYKARLIAKGYNQQEG 355

BLAST of Cucsat.G5989 vs. ExPASy TrEMBL
Match: A0A2N9F376 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS9454 PE=4 SV=1)

HSP 1 Score: 282 bits (721), Expect = 1.81e-84
Identity = 168/377 (44.56%), Postives = 215/377 (57.03%), Query Frame = 0

Query: 19  EFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYLI 78
           +FF+     HQ SCV TP+QNSVVERKH+ L NVARA+ FQS  PLTFWG+C+L   YLI
Sbjct: 25  DFFSSKGVIHQTSCVKTPQQNSVVERKHQHLLNVARAVRFQSNLPLTFWGDCVLHAAYLI 84

Query: 79  YWTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLGY-- 138
              P  +L N T F  L  K   YS ++ FGCLAYAS LS  R+KFD RA PCVF+GY  
Sbjct: 85  NRPPTHVLKNKTPFEILMHKAPTYSHLKVFGCLAYASNLSIHRTKFDTRALPCVFIGYPF 144

Query: 139 -----KLYDLARRKFFVSRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCLEK 198
                KL+DL+ ++FFVSRDV+F E +FP HS     TS+ +  L               
Sbjct: 145 GMKGYKLFDLSTQQFFVSRDVVFHEHIFPFHS----STSLVNPSLS-------------- 204

Query: 199 ETIIDLTTAERPISKNTLEDNHGVDDHDPCT---EDSEETNSPVQIPI-----TIAP--R 258
            T  D      P+   T+ D+  +    P T   + S ET+SP+  PI     T+ P  R
Sbjct: 205 -TSFDSAPVSLPV---TMSDSPAIPTPPPHTIPPDSSTETSSPIASPIASSSPTLHPPVR 264

Query: 259 KSSRQYHPPSYLKDFHCNLT-------SQRSTPFH-LTKYLSYKAYSQHHKNYLFNIASI 318
           KSSR   PPSYL+D+H NL        S  S   H +   LSY   S  HK +   I++ 
Sbjct: 265 KSSRLIKPPSYLQDYHYNLIFSSSPSLSPSSDVVHPIQNTLSYSHLSDSHKAFTLTISTP 324

Query: 319 YEPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTI 370
            EP +YH+ +K   W  AM +E+ A++  +TW I SLP   H +G KWVYK+K K D +I
Sbjct: 325 VEPHFYHEAIKSPQWCDAMSKELAALEANHTWVITSLPSGKHPIGCKWVYKLKFKSDGSI 379

BLAST of Cucsat.G5989 vs. ExPASy TrEMBL
Match: A0A5D3CZP1 (Copia protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold96G00100 PE=4 SV=1)

HSP 1 Score: 278 bits (711), Expect = 5.19e-82
Identity = 165/386 (42.75%), Postives = 228/386 (59.07%), Query Frame = 0

Query: 20  FFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYLIY 79
           FF +    HQ+SCV  P+QNSVVERKH+ + N ARAL FQS+ PL FWG+CIL+ +YLI 
Sbjct: 254 FFEQKGVIHQYSCVQCPQQNSVVERKHQHILNTARALYFQSQVPLNFWGDCILTAIYLIN 313

Query: 80  WTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLGY--- 139
            TP  LL   +SF  L     DY+ ++ FG L YAS+L   RSKF  RA P VF+GY   
Sbjct: 314 RTPSKLLQWKSSFQKLNNTIPDYNSLKVFGSLCYASSLPYNRSKFQIRAIPSVFIGYPQG 373

Query: 140 ----KLYDLARRKFFVSRDVLFFEELFPIHSIKEKGTSISHDFLEQFVIPCPLIDCL--E 199
               KLYD+  +K F+SRDV+F E  FP H+I +   SI  D L  F +P P  +C    
Sbjct: 374 MKAYKLYDIEHKKVFISRDVIFHETTFPFHNIPKNQISI--DPLPGFSLPKPFHECNLHS 433

Query: 200 KETIIDLTTAERPISKNTLED--NHGVDDHD-----PCTEDSEETNSPVQIPITIAP--- 259
             T+I   T +   +    ED  N  ++ +D     P  +++   NS +    TI+    
Sbjct: 434 HTTLIPPPTLQTNTTPTRDEDPSNSNIESNDFENQQPMNDENMARNSDINE-TTISQQDR 493

Query: 260 ----------RKSSRQYHPPSYLKDFHCNL------TSQRSTPFHLTKYLSYKAYSQHHK 319
                     RKS+R   PPSYL+ +HC+L      T+Q+ST + + +YLSY+A S  +K
Sbjct: 494 QPNANEETTIRKSTRITKPPSYLQAYHCSLLTTQSPTTQKSTKYPINQYLSYQALSPTYK 553

Query: 320 NYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYK 370
             +  +++  E S+YH+ V  Q WR+AM  E+EAM+   TW+IV LPK  +++G +WVYK
Sbjct: 554 YSILQVSTKKELSFYHEAVISQEWREAMKAELEAMETNQTWSIVPLPKGKNSIGCRWVYK 613

BLAST of Cucsat.G5989 vs. ExPASy TrEMBL
Match: A0A6J1DNP7 (uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022065 PE=4 SV=1)

HSP 1 Score: 273 bits (698), Expect = 3.52e-80
Identity = 164/410 (40.00%), Postives = 221/410 (53.90%), Query Frame = 0

Query: 19  EFFAKTRTAHQFSCVYTPKQNSVVERKHRQLPNVARALMFQSKSPLTFWGECILSVVYLI 78
           EFF      HQFSCV  P+QNSVVERKH+ L NVAR+L FQS+ P  FWGEC+L+  YLI
Sbjct: 395 EFFHSKGVLHQFSCVGCPEQNSVVERKHQHLLNVARSLYFQSRVPTAFWGECVLTAAYLI 454

Query: 79  YWTPMVLLSNSTSFATLFEKEADYSIIRTFGCLAYASTLSATRSKFDPRAQPCVFLGY-- 138
             TP  +L  +T +A L+   ADYS ++ FGCL + ST    RSKF PRA   VF+GY  
Sbjct: 455 NRTPTPVLDWNTPYARLYGHSADYSSLKVFGCLCFVSTSPVNRSKFHPRALTSVFVGYPP 514

Query: 139 -----KLYDLARRKFFVSRDVLFFEELFPIHSIKEK------------------------ 198
                KLYD+  ++FFVSRDV+F E +FP H++                           
Sbjct: 515 GMKGYKLYDIENKRFFVSRDVIFHESIFPFHTVSVNSPIVDPFPGVVIPKSYDLVDTSSG 574

Query: 199 ----------GTSISH---DFLEQFVIP--CPLIDCLEKETIID--LTTAERPISKNTLE 258
                     G+++S    D     VIP   P++   E +      L   E  I  N +E
Sbjct: 575 APDDHHNYATGSAVSPTSADISPTVVIPDGSPIMVANESDANGSPILVANESNIMPNIVE 634

Query: 259 DNHGVDDHDPCTEDSEETNSPVQIPIT-----IAPRKSSRQYHPPSYLKDFHCNLT---- 318
           ++      +P   D    +S V +P +     +  R+SSR    PSYL+D+HC L     
Sbjct: 635 NSL----INPLDTDVANVDSAVVVPCSDPSDSVTLRRSSRVAQRPSYLRDYHCGLIQATD 694

Query: 319 -SQRSTPFHLTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQVVKHQTWRKAMVEEIEAMK 370
            S  S  + L KYL Y   S  +K ++ +++  YEP +YHQ V    WR+AM  E+ AM+
Sbjct: 695 HSASSVFYPLQKYLDYNNLSASYKEFVLSVSCDYEPQFYHQAVPFSHWREAMRAELHAME 754

BLAST of Cucsat.G5989 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 104.0 bits (258), Expect = 2.7e-22
Identity = 58/129 (44.96%), Postives = 82/129 (63.57%), Query Frame = 0

Query: 243 SSRQYHPPSYLKDFHCNLTSQRSTPFH-LTKYLSYKAYSQHHKNYLFNIASIYEPSYYHQ 302
           S R+   P+YL+D++C+  S  S   H ++++LSY+  S  + ++L  IA   EPS Y++
Sbjct: 34  SHRRTRKPAYLQDYYCH--SVASLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNE 93

Query: 303 VVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTIDRYKARLV 362
             +   W  AM +EI AM+ T+TW I +LP N   +G KWVYK+K   D TI+RYKARLV
Sbjct: 94  AKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLV 153

Query: 363 EKNYNQQEG 371
            K Y QQEG
Sbjct: 154 AKGYTQQEG 160

BLAST of Cucsat.G5989 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 70.5 bits (171), Expect = 3.3e-12
Identity = 34/76 (44.74%), Postives = 47/76 (61.84%), Query Frame = 0

Query: 295 EPSYYHQVVKHQTWRKAMVEEIEAMKRTNTWTIVSLPKNHHTVGNKWVYKVKCKLDSTID 354
           EP      +K   W +AM EE++A+ R  TW +V  P N + +G KWV+K K   D T+D
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 355 RYKARLVEKNYNQQEG 371
           R KARLV K ++Q+EG
Sbjct: 87  RLKARLVAKGFHQEEG 102

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109786.4e-2927.87Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q9ZT942.9e-2124.60Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P041461.7e-1824.32Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925204.6e-1144.74Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ... [more]
Q94HW22.1e-0831.58Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Match NameE-valueIdentityDescription
KAA0038341.14.06e-11472.46Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KAA0043630.11.07e-8562.16Reverse transcriptase, RNA-dependent DNA polymerase [Cucumis melo var. makuwa] >... [more]
TYK16758.11.07e-8142.75Copia protein [Cucumis melo var. makuwa][more]
XP_022154919.17.26e-8040.00uncharacterized protein LOC111022065 [Momordica charantia][more]
RVX06074.13.00e-7340.27Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
A0A5D3BDH61.97e-11472.46Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5A7TK435.16e-8662.16Reverse transcriptase, RNA-dependent DNA polymerase OS=Cucumis melo var. makuwa ... [more]
A0A2N9F3761.81e-8444.56Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A5D3CZP15.19e-8242.75Copia protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold96G00100 P... [more]
A0A6J1DNP73.52e-8040.00uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
Match NameE-valueIdentityDescription
AT4G23160.12.7e-2244.96cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.13.3e-1244.74Reverse transcriptase (RNA-dependent DNA polymerase) [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (B10) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR46087PUTATIVE, EXPRESSED-RELATEDcoord: 1..715
NoneNo IPR availablePANTHERPTHR46087:SF1ARM REPEAT SUPERFAMILY PROTEINcoord: 1..715
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 54..509

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsat.G5989.T1Cucsat.G5989.T1mRNA
Cucsat.G5989.T2Cucsat.G5989.T2mRNA
Cucsat.G5989.T18Cucsat.G5989.T18mRNA
Cucsat.G5989.T17Cucsat.G5989.T17mRNA
Cucsat.G5989.T15Cucsat.G5989.T15mRNA
Cucsat.G5989.T16Cucsat.G5989.T16mRNA
Cucsat.G5989.T7Cucsat.G5989.T7mRNA
Cucsat.G5989.T13Cucsat.G5989.T13mRNA
Cucsat.G5989.T5Cucsat.G5989.T5mRNA
Cucsat.G5989.T12Cucsat.G5989.T12mRNA
Cucsat.G5989.T4Cucsat.G5989.T4mRNA
Cucsat.G5989.T6Cucsat.G5989.T6mRNA
Cucsat.G5989.T8Cucsat.G5989.T8mRNA
Cucsat.G5989.T9Cucsat.G5989.T9mRNA
Cucsat.G5989.T3Cucsat.G5989.T3mRNA
Cucsat.G5989.T14Cucsat.G5989.T14mRNA
Cucsat.G5989.T10Cucsat.G5989.T10mRNA
Cucsat.G5989.T11Cucsat.G5989.T11mRNA
Cucsat.G5989.T20Cucsat.G5989.T20mRNA
Cucsat.G5989.T19Cucsat.G5989.T19mRNA
Cucsat.G5989.T22Cucsat.G5989.T22mRNA
Cucsat.G5989.T24Cucsat.G5989.T24mRNA
Cucsat.G5989.T21Cucsat.G5989.T21mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0016310 phosphorylation
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
molecular_function GO:0016301 kinase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity