CsaV3_4G027580 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_4G027580
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Descriptiontarget of Myb protein 1 isoform X2
Locationchr4: 16812660 .. 16833155 (-)
RNA-Seq ExpressionCsaV3_4G027580
SyntenyCsaV3_4G027580
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGAAACAAAGATTATAGCGAAGAGAGAGGAGAGTGATAAAATAAAGGGGTTTTGGACAGCGTAGATCCATGGCGGTGCCGTTGCTTTCCTCTATAAGCTTTTTTGCTGAAACGCAACAACGCCCATTCTCTCACCAAAGGACAGAGCGACACACGCATACACTTTTAGATAGAGAGAGAGAGATTCTCATTTCTTTCTCTTCTGTCGTTCAACTCCAAATCTTCATCCCCATTATTTGCTTCTTTATTATCCTTTCAATTCCTCCTCTTAATCATTCATCATTACGCCTACCCACAGAGTTTCCAGCCATTTTTCAACCCTTTTTGCCCCAACCCCATTTCTTCTTTTTTCCTCAAATGCTCATCCTCTTCCTCTTCTAGGGTTTTTCACTTCTCTTCTTTTGTCTATTCTGGTAACGTTAACTCTACTTTCTTGCTTTTCACCTGTTCTTCAGTATTGAGTTTTTATTTTGAGATCTGTGTTGCTACTCTTTTCATTTTGATTGAAACTCCTCTTTTTTTCACACATGGGTTTGCACTTTACTGGACTTGAAGTTGAGTATCAGCTGTATCGTGTTTCTTTTACTTTCATTTCATGTGTTTTTGAACTTTTCTTTTTTCTTTTCCCCAGGCACTAAATTCTCTATTATTGTTGTTAATTTGTTTGGTTTTTAAGCTTGGACTTCCTTTTTCCCCCATTACAGAGTGGCTTATGGTTCCCACATCGGAAGGCTTTCTTGATTTCTATCTCTGCTTCTATTGGTGCAGTATGCCTATAAAGTCCAAGATATTGTAGAGCATTTTGGGGTTTTGTTTACCTTGACGATTTGTTACTCCCTTGACAATTCAAATATATAGGAGGTTGCTGATCTCGGTTAGTTATTCTTTTACTCTCTACTTTGTCATTTTAGTGTTTGAGATTTTTCATTTTCTGTATAAAAAGTTATGGTAATGTGTAGTTGTTATTATAAGCATTAGTAAGTGAAAATTTCATTATTTTTACTAAATACGTATTGCACAGCAGAACATAAAGATAAGTCTGCATTTAATTCTTTCTTTAAAAAACTACAAGGCCTTTTACCATTATGCTCTGTTATTATAGATTGTTCACTTTAGTGAGCGATGTTGTTTTTCCATTGAGTTTTTCTTTTGTCATCTGTAATGAATGATAATTAGCTCTGGATTATCTTTTTTCTCTCCATTACCTTGTCAGTTGAGTAGGAACATTGCCTATATTTGGAGTAAGGATGCCAAGACATGTGTAGTATAGTTTCTTCTCTTGCAAGTTTCTGTAGTTATTTACCAATGTTACAAAATATTATAGCATATCTGTGGTAGACCATCTGTGTCACAATTGACACAAATAGTAGTCTATTGCGGTTTATCACATATAGATTGTGATATTTTGTTATATTTGTAAATATTTTGGTTCATTTTACTACATTTCTTTCAATGTTGAAGGTTTCTGTAAGAAGTTATCACTTGACTGCATTACATTTTTTAGTCAATATTTTCTGACTATTGAAATATTTTAGTTGCCCTTTCACTTCTGAATCTGCCTCTTCTATTCTTACGTTTTTTTATCGACATTTTATTTTTTAAAAAAACCAAAATTAGGGGTACTATAAAGATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGTGTGTCTTGTATATTCTCAAAGTATATTTTCCCACTTTTTTTGGTTTGATAAGGAAGAACTCATTTGGATTCTGCCTACAAAATATGTGAGGATAGATTTGGTTTTAAGTATGAGTTTATGACTCCCAAGCCAGACTGCTTATATCAGATGTATGGATGACAATATCTCGTTATGCCTTTATAATATCTCGTTATGCCTTTATAATATCACATTGCCACAAGTTCTAGTCTTCTTTGAGGAGATGGTCTTATTACTCTTAGTTGTAAGATTTGTGCCGCAGTTTCTTCTTCTATAAAAAATGCATTTTATGAAAATGCCATTTTCCTCAGTATTGCTTTCTTTGTTCTATTTGTGCAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGGTAAGATATTGATTCTAGTTACGCCAATTCTATTATATATTTAATATAAAAGCATGTGCCTCCTTTCATGTTTTATGTACCAAATTCATTCTTTTAGGAAAATGCATGCAATGAATATTAGGCCTTTTATCCCGCATGCTTCCAATTTTATCGGAGGTCCTTGAAGAGATGCCAGTTATCCTGCATCACAGGTGTGAAAGAGCACCAAAGAGCATGTGGATGGTGAGGTGTAATTGGCACTATTGGGAATATACCAATCTTAACATTAAGAATGGAGGAAGAGTGAGATTAGAGGAGAAAGTAAGTTTTTTCTTCTTTCTCCCCTCTCTCCCTTCTCCTACCTTCTTCTTCCCTTTGCATCATCTCATCCCCTCCAATCATCTTTCTCAGTTCTCCAGCGAGTGTCTAGCCTTTTATTTCGATCGCTCATTCAGAACTAGATATGGAAATGGAAAGTTGCAATATAGTTGACTCTCTTCACTGTATTTGGTTATGGACTATTCCAAGATGAAGACGTGGACGGCAACTAAATTCTGCCTTCATCAAATTTCGAACCGAGTTGGTTGCAAGAAGTTCTCTCAGAACTAAGCCAAAAACTAGAAAATCAATTCTTTCTCAAGAAAGAGAAGACCGATAATGGAATAACAGAAGTATCTAAATTCAGAGCCACAAAAGGTTGGATATTGAGATGCACTCTTTGGTCATGTATTGGCGGTAGGTCCTTGGTCATGTATTGGTGGTAGATCCTTCATTCAAGTTCTTTTAGGTGAGGAAAAACAGGGTTGGAAAACTTTTATTCAATTGCTAGGAGGCTTCAAATCGAAACTTGAATACTCGACATGGATTTCATCCCATTCAATGTCATCAAATATCCTTAAGAAGGACGTGAGCGCACGAAAGTCAACATGGGAAAGATATGAAGTCAAGGTGCATCCGAAAAAAAGTCACACATTCCTCTGTTTTTGACCTGGAAAAATAGAGCATGAGTTGTTACTGTATTATAAAGAATCCAAAAGTTCTCAAGACTAATTTCAACAATCTTTGGATTGTAACACGACTTTTTGAGTTTGATAAATGCTGAGCCATAGCTAGCTCACTAAAATTATACTTTGATACTGATGTCATTCTTAAAACTTTGTCTGCGGAATGTGCCTTAATCCTGTTGGATTAGGGTGAGTTGGAAGCTTTTGCTGAGTTCCCATGAACATGGCAAGAATGTGGTCAGTTTCATTTAAATAAATTGAAAAGTGGAACAAGTATCTACATGGTCGTCTGAATGTAATGAAAAGGTTCAATGGGTGGATCTCGATTAAAGATTTACCATTGGGCTGTTGGAGCTGAAAACCTTTTAAGTTACTGGGTCTTACTTGGGCGGATTAGTATCTATAGCAAATGGAACGTTGAATCTTATTAATGTTGTTGAAGCAAAAAATCAAGTTAAAAGAAACTTATGTGGTTTTATGCAATCTACCATAGAAATTTCAGATGAAAGCAAAGGTAGTATTTTTTTGAATTTTGGAGATGGCCCCCCCCTCAAAGTTTAAGGACTAAGGTGCTTTGTTTATTAAAGATTGTTCCAACCTAATTTATTTGACTTGACTAAAGCAAGTTATGATTGATGAAGATTTAGATTCTTCTGTTTTGAATCTAGAATGGACATTTCAAGCTGCCCCGTAGTCCAAAAAATTTTCATGAAATTATTTTGCATCGAAGTAGCTTCAAGGGGAGAACCACACACAAAACCCACTAGAAATTGACAGTAGAGACTCTCCGACATGAGTCTTCCTGCTGACGATGGAAGAATCAAGGAGAAGTGAACAAACTCACGCTTTTCGAATGAGACGGACTACGACAACGCTTCGAACTGAAAGCAGTCCAAGGGATGCCTTTATTCTTGGTGCGAAAAGAAAGTTTCAAAATTTGCCAGTTGTTAGCCCTTTAAAGGACGCCCATAAGTCTTAAGAAGATAAAGACTCAACAGTCATTAATGATTCCTGTTTCCCAATTAATGAACCCTTGCCCAAAGTCTCCTTTGTCCCATTAGCAACCTCTCTTGAAAATAACAACTCTAAGTCTCTCTCATGGGTAGGTGAATGAGGTTGGACTGTGCAAGCCTATTTTGTCTTCCACACCTCACCTTTCACCTTCATCCCTGTTGGATAAAGGGTTATATAAAGTTATATGGAAAACTAGTAGGAAAGTTAATATACTCACCTTTGGGTTGCCGAACTTATGCAAAGAAAACTCCCAAATAGTTGCCTCTTACCTTTGGTTTGCCCTCTGTGCATGAAAGAAGAGGAAGATTTACCACATCTGTCTTTTACATGTTCATATTCAACCAGTTGTTGGGGAACCTGTTCTCTTTATTCAGTGTTGCTTGGGTTTTTGGACATTTGTTTAGCTCAAAAGTTCAATAGGTTCTTTTGGGTCTTTTCTTAAAGAAAGAAAGGGCCGAGACTAATATGGGGAAACATGACTAAAGCCTTGCTTGTGGAATTATGGTTTGAATGAAATCAAAGGGGCTTCTTCAACAACAAAATCTATAATCATCCTTCAGACTTCTTCTAAAGGCCAAAGACCAAGAGGACGTATCTTGGTCCCAATGTGCTGCAACTGATCCAGTGGGCAAAAGAGCAATTCTAAAGAGTCTAGAATATTGCTCTTTAAAGGGAGTGTTACCTACCTAAATATCTGTCCAAAAGCCAATTCTTTGGCCGTTACCAAGATTAAAAGAGGCAAATGAATCAACCGACCTCCAAACTCTAGCAATGCTAACCCAAGGACTCCTTAGGCTATTACCAGACTTCCCTTTAGTGAACCAATCGGAGGGCTTCTTACCATGCTAATTCCTTTCAATTCTTAAGCTTCCAATTACTTTGAAAGACTTGGAGTTTCTTCCAAGATTCGGGTTTACATGAAGCCTGTCTTCTGCTGGGCCGTACCTGACCATGAAAGCAATACTGATATCTCTCTTGTCGAAGATCTCTAGTCACTTCCTTTGCTCCCTTCTGGTATGAAGATTCTCGTTGGATAATACGGCAAACCTACCTCTTGCATTTTCCAACACTTTCAAGATATTGGAAGGAGAGATAACGAGCACACACTAATTTACGTGGAAACCCGAGTACCGGGAGAAAAACCACGATTGTTTGTTGATATTATTTTCTAATGAATAATACAATAGGTACAAGGGAGAATAAATAGAGAATAACAAGAGAATAAAAAAGGAAAAGATATAGGAAATAAGGAAAATATTCCCATAATCTTTCCAAAAACATTCTAAGATTCTAACAAGGAAAATATTGAGAAAGTAAAGGAAGGATTCCAACAAAAATAATGGCAAACTCTAACCTTCTTTGCCAAAAGACCATAGAATTCTTCATGTCTGCAGCGCCCACTAAACAATCCCTAACACAAGCTGAGGTGCTGAAATTCAGCTCCAGTGTACCAAATATCCTTCCATTTCTTTCTTCTATGCAAATCTTCCTAGCCTTCCCCCACAAACCCAACCAAGAATTTCTTCCCATCGATCTTGAGCTTTCCCATCTCTTAGATTTTCTCAAATTTTGAGAGAAAAGTTATCAGAAGTTCTTCGGTTGAAGCTTCCTCTGAATTCAGTGTGTCTCTCAGTGGGTGCATTATTTTTGGGTGTTAGTTCACCTTTTATTTTGTTTCTTAATTTATAGTGGATTTTTGTTAGCTTGATTAAGGTCAATGGTCTCAGTTAGGTTGGGCAGTTTAGATGATTTTTCTCAGTTTTTTTATTCGCATGCATTGATAGATTTCTGTGAGGAGTTTCGCTTTGGTGAATTCATCGTGTTCTTCAAAATATTGCTTGCTTAATTTTTTAGTTTGAGTTCACCCACAGTTGTCTTTTCTTATTTTCTTTAGGCTGTTTCAATTTTTTCTTTAGATCTAATACTTTATCGCAAAAAGATTTGTCTATCTTATTTTTTTATTTTGTGTCTCAAGCATTAGTCTCTTTTTATCATTTCATTGAAAAATTGTTTCGTAGATTTCAATTTTACGAATATATTCATAAAATACTGTTGTCAACGGTTGTTTTTCTAAAGTTAAAAACTAATAAACCATCATGGATTGACCTAAGGGTAAAAAGGGAGATGTAGTCTCAAATAGTTCATGGTTCAATCTACGGTGATCACCTACCATAGGCTTAGGTGGGTTGCCCCTTTTAGCAAAAAGAAAAAAAAAAGGTAGAAAAGTAATAAATTATATTAAAGTTATTCTTACTTCCAAACTAAGTTAAATATTTTGTTATTATTATTCATATTCATATCATGTTTTTCTTTTTATTATTGAAAGATATTGGAGATATCTATTAATTCTTATTATCGATGTTGTGGCCTTTAATTTGTAGATATATTGATATATTGATGAATATTTCAATCTTTAACCTAGTCGGCACTTTATTTTATTCATAATTCAAGATTTCTTGGTTAGGAAAAAAGAGAAGCAAATAAGGTCAAGTACCATTATATATTAAATTTTGATGAAGTTTCTATCTACTAAATTTGTCCCCAGTACAAAGCATTCTTCACATACTAGTCTGCTGCTCTGTTGTTGGACACCACACCATCAATATCTTAATCTTTTGTGCAGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGGGTTCTCCCTATTCTTGTGAAGATAGTGAAGAAAAAGGTTTGAATTTGTTCTATTTATTTACGACAAATAGTTGACTATTAATCTTTCTTTAGAAGTATGGCGCAAACATGGTTATGAATATGGCGGCATGTCATATTATAGATAATAAGAACATGAATACCTCTATTAAAAGTGTTTTTTTTATTAAATTACAAATCTACCTAAAAGCTTGAGCCAATGGGTGACGACAAATTTAATATAATATCTAACCGTCTCCTCTATTTGTGGGTTTGAAATATGGAGAAAGTTCAATAAGTAGAAATAAATTTTAAATGGGGAGGGAATTCATTGCTGGGGTTTGATTCGTTTGAACACAGAACCTCCTTGACTACCTCCTTGATCACCAGCTCTGATTATATCTTAAATCACCCATCTACCCAAAAGCTTATGATGTGTATATGTGTGTTCTTGGAAAGGAAGATACAAGAGACTGGGTTCTTGAAAAATGACTGACTTTTTTATTAATTCTGGAAATATGACAAACTAACAGTATTTATACTAAAAACCTAATAACAGTCAAAGTCAAACCTAATTATAACAGCCAGTTAAAATATAACAAACTAATTACATCAGCTTAAGCTGATAGGTGAAGGGAATTTTAATATAATATGTAACACTTTTAAATGTATTTGATTAATTTTTTATATAACAACAAATTCACACTAGACCTTGGTTTTATTTTATAATTTATGAAGTAATATAAAAAGGGAGGTAATTTCTAACATTTCAAGAAGTTGACGAAACCAATAATATATATATCCAATGATTCAAGTACAGTAAACAAAGATTTTCAAATCCAAATAAAAAAATTGTGAATTGTTTTCCGCTCTTGAGAGTATTCGTAACTTTTAGTATTAGTTTTTTTTGAAATGGAGACAAGTCTCCCAAGAATCACAAAAGAAGATGGCTCAAGAAACGTTTCCAACCGGTTGAACAATTATTGTTTTTCAGCATTTTTGTGGTTTTGATTCTGCGAGGCTGGTTGTTGTCTGTGTTCCCTCTTCAATCATCTCGAAGCTTATTGTGGGCCAGTTGTCAGTTTTCTTTGGGTTCCTTTTGACAATCTGTTTTAATTACTTCATACCTAGTTTACATTATAGATTCCATTTGTTGTTTTGGGTTAGAGGATTTTGATTAGCTCGCTGTTATTGCTGAGTTTGATTTGGTTATTTTTTATATCTGTCTTAGTTTGTTTGCTCTCTGTATAATTTTAATATTAGTCTCTTTTCATTATGTGAATTTTCTATATTATTGAATAAATTAGAGTATTTGAGAGAACATAATGACCTTTTACATAGAGGAATAAATAGACCTCAAAGAAACTACAAAAGGAAAATACATAAATGGAAAATACCAAAAAGGATAATAATAATAAGCCAAATCCTAATTTCCATTTAACACATAATATCAGTGAAGAGCTTTGTTTCCTTTTTTCTAAAAAAAGGAATAGAAAATTTTTTGTTAAACAATTCAACATCTTGAAAAAGTTGAAGTTTGAATTTGATTATGTTCTTACGATTCTATGCTTAGAAAATAAGTGAATTTTTTGCGTAAAGTTATTATCTCTCTTGCATTTTGATTTTGGAATAAGATACAAATTTGAACTTTTTCAAGATGTAAGAGAGAGGAGCGACATTTTCTTAAGAGAATCTTATGTTTTTCGTGCAACTTTTTCAACAAGATTCTATGATTCAACAAATTTCAACTTTTTCAAGATGTCCGGAAAATAAATTTCAAATCTTATGTATTTTACAATTTTTTTTTGCAATTTCCTGATTTTCTATTTTTGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGGTAGGACATTGCCCCCATATTTGCTATATATTTCTTTGATTAGAAGTTGTATATTGAAGCAAATGGATATAATCATGTTAAGAGAATTAAATCCTTGTTGTTCAACACTATTTATAGTTTGAAGGAGTGGAAGAATAAAATCATCAGTAAAGTAGATCATAATAATATGTTTTCATGGTTCTAATGAGTTGAATGGCGTTGACTTTGAAGAAATGAGTTCAAGGCATGATGGGCACCATGCATAGGATATAGTAGTATATGAATTACCTGGCAACCAAACGTAATAAGGCTAGATGGCTGCCTTGTGAAAATAGTCAAGGTGTGGACAAGCTGATTAGGACACCCACAGATATCAAAAAGATAAAAGAAAAGTTTGATGTACATTGTTAGTTTCATAAAAAATTTATCATTGATATTGGTTGATGCTTGCTAAATAATAATAGTTCTTATAGGTTCAGAATGAAAGATATTTTGATTGTGTCTAATCTTCAACTTTGTAGCGTTGGGATTATATTTTTTGGGTTCAGTGAAAATGAAGAAAGAGAGGGGAAAGTATTAGATTTGATTCCCACACATTATTGTTTATGTGATTCCTTTTGTTATGTATATTCCCATTCTTCGTTTCTTGTCAGGAAAAGAAAAGAGAGAAGAAACATTGCTGCTTAGAAATTCCCCGAAGAGTTCCATTGAGAAAACCTTGGTACTTCATAAAAGAAAAGATTAGGAACTCCGGACATTGTTTTCTGTTTGTTGTTGTTTACTTGGTTTTGGTTTTTGGATTTTGGATTTTGGAGCACTAATCTCTCTTCATTTCCCTAATGAAACGTTTGGTCATTTTCGAAGAAAGGAACTTGAACATATGTTCCATCTTTTATGATTACATTCAGTAACTGTAGAATCTTTGAAGTCGGAATACTCCAAATGAATTTTTCATTTGTTTCTCAGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGGTATATCTTGTTGAGCTTATTCTTCTGCACTTTTCCGTCTTGTTATCCACTTGTTTTTCCCTTTTTATTTTTGACATACAGAACTCTAAGTTGTTAAAAATCCTAGAAATTTGTCATGAACTATTTATGAAACGAAAAACACATTACTCCCATTATAAGTGTTGATGTTTGTGCAGCTAATGGTTTGCTAAAAGGAACTATTTATGGAATTCTAGTATTATTGATTTATTCCATCATATTTGATTTTTTTTGGTTTTATTTGTTATTATTTTCTAGGATTCTTTCCTTTTATGTTTATATTTTTCTTATTTCATTGAGGCTGTATTCTATATCTAATTTAGGATTTTGTTAATAAGAATAAGAAAGTGAGAAATATTCACATGGTATCAAAGCAACAAACAGAAAAACCCTAACCTTAATTATTGTGCCGCCACTGACCTCTTCAACTCTTGCCGCTGCCGTCATTGATCTACGCTAGTGCTTGCCGTTGCTAATTTCGGATCTGGCCGCCGAAGCCTTTTATTTCTCAGATCTGGTCGTTAAAAGTTTTTTCAAGATCTGGTAAAAAATCTGGCGCTCAACCTTTTTTTTTCTGATCTGGTAGTTGCTGGTTTCCAATTTTTTTTTGAAATGGAGACAAGCCTCTTTAGTAATATTAATAATAATAAGAGAGACTAACGCTCAAAGTACAAGAGAGTTATATAGAGAACAAAAGGGCTAAAACAGATACAAACAATAGCTAAATCAAACTCAACAATAATAACGAGCTAAACAAAACCCTCTATCTGGAAGCACAATTGAAAACAGAAAAGTAAACTAAGTACAAAGTAATTTTTTTTGAAAAGGAGACAAGCTTCTTTATTATTAATAAATTCAAAGTACAAGAGAGCTATACAATGAGAATAATAGGAAAGCCAAGAAATGGGGAGAGAGAGAGAGAGAGAGGATCAGTAGGTGCACTCGGACATCTCAATTAGGTTGACACTCCTATAGCACCCTCATCATATCCAAAATACAAAGAACAAGAACAATACTAAGGTCATGAAAAGACCAAAGTAACAATCAAAACAATAATAGAAACTACCTACGGTAGACAAAAACAAGGCTGAAAATAAAAACATAACGGCAGAAATACACGTTAAAGCCCATCCAAACTACAGAAGCACTAATTCTGAACTGGCAATCGGGGGAAACTACATTGAATCTCATTAGATGAAGGCGGTTCGACTGAGGCAAATGTCCTGTATGGAGTGAACTTTGAATTCTGCATTTGAAGAACACCAAGCTGCTGCGTTTCTTTTAGATATATCTACGATGTCAACCTAATTCCTTTTTTTTTTTATCAAGGAAGATGTGTTGGTTACATTCGAACCAAATCTCAACCAGAAGCGCTTTTGTCAAATTAACCCAAATTAGAAAATGTTTTTTTTAATAAATAAGGCCCCCTCAATAATTGGAGCACATTGGCGCTAAACGAACCATCAAAGACCCAAACTGATTTGAGTATATAAAATATGCTAAACCAACAAGTAGATGAGAAGGGACAGAAAATGAATAAGTGTATTAAGAGTTCACTGTTTTCCAAGCAAAGGAGGCACACTGAAGGCAGCAAACAGCAAACAGCTTCCTTTGCAACTTCTAGGAACAGTTTAGAGGACCAACTGCCATAATCCAAACCAAGACATTTATTCTCCTTGGACTGCTGGTTTTCCAAATTGCTTAAAGTAATTTTTTTCCAAAGGTGAAGAGGAAGAAAGGGGCTTTGAAAGGGACTTAACTGAAAAATGGCCTAAGGATTCTAATGACCAAACTCTCCTATCATCCAAGTCCGTTAGATCTTTCTGAGATATGAGTGTTAAAAGACCTTGGAAATCTTGAATTTCTTCATCTTTGAAATACTAATTGTAATCTAGAATTCAGAGAAGTATATTTCTTATTCATCTTGAGTCAAAGGAATGTTACATATTTATATAGAGAATAAACTAAACCCTAGAGACTATGTACAATTACAATAAAGGGCATATGATATAAATATAAATATATATATATATCATAACACCCCGCCTCAAGCTGAAGCAAATATGTCGATCATGCCCAGCTTGTTGCACAGATAGCTTATCCTTGCTCCATTTAAAGCTTTAGTTAGAATATAGTTCTACTAGTTGTATATGTTTATGATATTGTTATTACTGGAACTATTTCTTTTAGGAGTTTGATTTCTTGCTTGCTAGCAGAAAGATGTTCGTTGGATGATTGGAGAGTTCTTCCATCCCTTTCAAAGAGAAAGGGCACCTTCTTTGCTTTGCAAGGGTTTGTGCTTTATTGTGGGATTTGTGGGGCGGAAGAAATAGTTGAAGTGTTTTAGGGTATGGAAAGGGACCTTAATGGGGTTTGGTCTTTGGTGAAGTTTCATGTTTCTTTGTGGGCTTTGGTTTCAAAGGCTTTTTGTAACTACACTCATGAGGCTGTTTTTTACCTAATTGGAAACTCTTTGTTTAGTTGGGTTTTTATGGGCTTGTATTTTTGTATGTCGTATTTTCTTTTCTTTCCAATGAAAGCAATTGGTTCTCTTAAAAATAGAAAAAGAAAAAAATGAAAAAAACCTTCGAATAAATCCACTTATTGCTCATCACATTTGCCCTTTTTCATGTTATTGTGCTTTCGTAGCCTCTTTTCTCACAAGTTAAATTGTTCTCATTTTAATGTTTTTTATGTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGTATATTCTCTTGCCAGTTTTGACTCTCTCCCTCTCTCTTTATATATACCTATATTTCTCTCTTTATAAATATATGTTGGCTGTAAATATGTTAATTAGATGTAATCACCAATGGCTTATTTCATATTCTATTTCTTTCTTTTTGCATATCAATTCTTTTTTTCTTGGAAGAAAGTTGTGCAACTTACTGCTGAACTACTGTAATGAAAATATCCCTTTCATTAAAATTAAGATGGCACTTGATCATGTAAGGTTTAACTCTATACAGCCCTAAAAAATTGTTTTCACCACTTCTAGGGAAACCCTAATCACCAAGTTGTGGAAATTTTAGTTGCTTATCTTAACTAGTGCAATCTACTTCTAATCAAACTCACAAGTGAAGGTGTTCGTATTCAAGTTGTCATTGGTTGGAATGTGTTCACAAGAGAAGAGACATTTGAATGTCTTCATGATCGTTGTTGCATAAGCCTTTGCCTTCGCCTCCATTTTTGTGTGTTGCTTCTGCCGCTACACCATCACAACTTTTGGTGGTTGATGTGTTTGTGTTACCATCTGTCTGTCGTCGTGTGTTGTTCCCATTTGAGATAATTTGGGTTGATGCTATCATTTATGGGCTTCATTTTCGTGCTCAATTGTCGACCATCGTGACCGTCCAATGAAGGGTTTTTGGTTGTTGACCATAAGTGATGCTCAAACTTCATGCCTCCATTAGGTTTAAATTTTTTTTCATTTGATTTGGTTGAATTTGGTTAAGTTTAGTTCATAGTATAGTTGGTTTCTTTCTATTTTGTTCTCGTCTGTCTGCTATGTCCGGTTTGATTTGGGTTTTTGTCACAATTTGTGTGTGTGTATATATTTATCGAAACAAGCACTTTCACTGAGAAAAAGAATGAAAGAATACAACCAAGGCATGCAAAAGGCCAAACCCACAAAAGATAGAAGCCCTTGTACTAGAATGGATTCTGACTATGGAAAAGACTGCCTACGAAGTAGTTAAAATGAATTTTGAAATTGAAACTCACAAAGAAACAAGATAACGAAAAACAGACAATCTCTCACTAAGGTCCCTTTCCCTCCCTCTAAGAGTCCTATTATTTCGCTCACCCCACAAAGTCCATAAGATCACACACACCTGTAAAGCCATAAAAAGCGACTTTTCTCCCTCTGAGGTGGATTGAGGAGGAACTTCTCAATCATACTCATAGTAACCCTCTAACAAGCCACATAGAAGCCGAGAACTGCTCTTAGTAGAATTGGTATACTCACACTGCCAAAGGATGTGATCTAAATCTTCCCCGGTCTTCCAACAAAGAAACAATAGAACAACCTAACAAATGAAGGTGACTTCCTCACCAACTAATCCATTGTACTAGCACGACCATGAAAAACTTGCCCAGTAAAGAACCTCGCTTTTTTAGAAATCTTAATGCTCTAGAGAACCAAGAAGACTGAGACACCTGTGGGAGAATGATCAACCAGACATTGAAAAAAAAATTGCACGAGAACTCCTCTGAAGGCTTGGTTTTCCAGACTCTCACATGCCTTCTCCCAAACCTAAAAAAGTGACTTTCGAGTAAGACAAGAAGAAAGGCCACATCCAACATTTCACTATTGAAAAGTGAAGGATGAAGATCAAACATAAAGAAGCACGAGCTTCTAGACCCAACAATATTATAGTTTATAAATTATAATATTGGGATGAGGTCTTTCCCCTACCAAATGACCTTTTAAAAAATATGTGCCCTTTCCCTCCCTCACCACGCATTGGTCTAAGTGGACTATTGAAGGGAGCTTTTTTGAAATATCCTTCCACAAGTCCCGTGCCTACCTCTTATTCTCTTAAGACAATTGGTATGAAAAATATTTACTTTAATCTCAAAAGAGTAAATTCTTCAAGTCTTAAAATTGCTCTTAACCCATTTACCATCTAATAGTTTTTGTTTTCCCTATGCACAAGTAGGTCATTTTCCTGGAGCATTGTTGAGTTGCAATTCTTCTTTGTGAATTATCAATTCCAAAGAGCAGTTGGCTTATTTCCACCCACTAACCCCCCTCCCCCCCCAACTATCATTATATATTCTAGGAGAGGGATAAGGATGTTGGGAAGGGGACCACGAGCAAGTTAGTTGGACATGAGAACGTTGTTAGTTCATTTGTGGGAATTATAATTGCCAGTTGTAGCATGTCTCTTTTCTCATGCTTGTATATTGAATAAAATATTAGCTCTGTGCCGGAGAGTACTCCATGCTGTTGAAGATGAAGATTCCCTGAAAGGTTAAGTTTTTTATATTCTAGTCACGTGCAAAAGAGTACACCTTGGATTGGGTCTTTTATTGTATTCTTTGTAGGAAGTTGAGGGAGGATCTTGACAACCAATGTGTTGATTTTATGGAATTAATCTCTATTTTCATTGATTATTTATTTGAGGTACAAGCCTATATATAACCATAGAGAAATACACTTAAGGAAATAATATCTCCAGAATAATATCACCTAAGAATAATCTAATTGTATTAATACCCTCCCTCAAACTCAAGGTTGAAATCACAAACTTGAGTTTGCTAAGAACTAAGAAAAGAACCAATAGATCTAGAATAGAATAAAAAACCTGAAGAACAAGCACACAAAGAGCCTTTAGGCTAAAACAAACCCACAAAGAGCGAAACGAAGAACTTCTGGGATAGAAACATAGATCCTCGTATAGAGATCTATAACAGAACCAAGGAAGGACGAAACAAGAACTCTGGGGGCAAAACGAAAGAGCAAAACTGAAGCAAAGTAGAATCAAACCAGGACGAAAACGGGGTCTGAATTGGGGAAGAGCGTTTCTTCGGATCAGAATGGAACTGAACGAACTAAACTAGTGTAAGTCAGAATCAGACTAAACAGGAAATCTGGGGTGAACTGAACGAGGCGTTGAGGCTTAGCGGATCTGAACAAAAACTGAATCGGGGCGTCTTTGGATCTGAGCTTCGATCTAAACTTCGAAGGCGACACGAAGGATGGGTCTGAAGAAATCCGACAGATTTGGAACAGATTGGAGGCGGACACAAGAATCCTGAGGGCATGGGGTTTCGTGGGTTTTGTCGAGATAGCGTATGGAGATGAAGCGACGGCAACGAAACGCAGCAGCGAGCGGAGCTTCTCGAACGACGAACAAAACTGAAACTCTTCGGTTGGGCATTCGGCTTCAACTGACAGAGGTTCAATCGGCTGCTTGGCTGATGGAAACAAGACGTCAGATGGAGAACTATGGAGGAGATGGTCGGGAAGCAAGGGCGGTGGACGCTAGTGAAAACGATTGGGGTTTCTGGGGTTTGGATGGTGGCAGATTTGAGCAGCAATGAGCGACAATGAGCAGCAAAGGGAAGGTGCAATGACGGTGGCGGATTTGCAGATCTAAGGGTGGCGGCGGCTAATGGGGGATGATTAGGTAGATCGGACGGGATCTGGAGGCTAGAATTCAATGAGGCGAAAACCCTAGAGCTCTGATACTATGTTGATTTTATGGAATTAATTTTTATTTTCATTGATTATTTAGCAGAGGTACAAGCCTATATATACCCTAGAGAAATACACTCAAGGAAATAATATCAAATAATATCTCCAGAATATGATCACTTAAGAATTAATCTAATTATATTAATACAATGCACTGGCCACCTTTCAATCACTTATGAACCAGGTATTTTGCCTATTCCTAAGAGGCTTTGTACTTGTACTTTTTTATGATAGACTGGTCTATAGTTGTGATGTGAACGAGCACGAAAAACATTTGGGGATGGTGTTTGTTGTACTTAGAGATAATCATCTATTTGCAAACTGAAAAAATGTGTTATAGCCTACTCTAGAATCCAATATTTAGGCCACTAGATCTCAAGTAAGGGGGTGGAAGCTGATAAGGATAAAATACACTTAATAATGAATTGGCCTCAACCGAAAGATGTTATCGGATTGATGGGGTTTTTGGGATTGACAGGATACTATAGAAGGTTTGTGAGAGGATATGGTGAAATCTCTACCCCTTTGACAAAACTTCTACAGATTGGTCCAGTCTTTTTATGACTGGGTATTTTAAATGGAATGAGGAAGCTACATAGGCTTTTGAAAAGTTGAAAATAGCCATGACAACCATCCTTGTTTTAGCTTTACCAGATTGGTCCCTTCTTTTTATGACTGGAACAGATGCTTCTGGAATAGGGTTAGGGGCAGTTCTATCACAAAATGGACATCCCATTGCGTTCTTCAGTCAGAAACTAGCTCCAAGAGCCCAAGTCAAATCCATCTTTGAGAGAGAAGTGATGGATGTTGTGCTTTCAGTCCAAAAACGGAGATACTGTCTCTTGGGGAGGAAATTCACTATAATTTCATACCAGAAAGCTCTTAAATGTCTCTTAAATCAAAGAGAAGTTCAACCCCCAATTTCAAAAGTGGCTTACAAAACTGTTGGGGTATGATTTTGAGATCTTGTATCAGCTGAGACTGCAAAATAAGGCTGCTCATGCCCACTCTAGAATAGAACCACCTCTCGAACTGAATGTTATGACAACCCCGGGAATTGTTGACTTAGAACTGAATGTTACCCAAATCCATAATCACAGACTGGGATAAAATATTCCTCGGTAATTTCTAGAAGGAATTGTTCTCTACCATGGGCACGTTACTTAGGAGAAGCACAACTTTCTAATCGCAAAAAGATGAGCAAACCGAAAGAGTAAACAGATGCCTAGAGACCTATTTGAGGTGTTTTTGCAATGAGCAATCAAGCAGATGGCATAAATTCCTCCCATGGGCTGAGCTGTGGTATAACACAACCTTTCATGCATCCACGAAGGCTACCCCTTTCCAACTTGTGTTTGGTAGACCCCACCACCCTTAATATCATATGGGGATAAGAAATCCTCCAACAATGAAGTGGAAGTGATGATGAAGGAAAGAGACTTAGCCATAAATGCTCTCAAGGGGAATTTTATCGTGGCTCAATACCAGATGAAGAAAATGGCTAGATTTAAAGAGAAGGGAGCTGAAGTTTAAAGTAGGAGAAGAAGTTTACCTTAAACTGAGACCCTATAGACAGCGGTCACTAGCTAGAAAAAGATGTGAAAAACTTGCTCCTAAGTTTTATGGACCTTATAAATTAATTGAGGAAATTGGGGACATGGCATACCGATTACAACTACCACCAGAAGGAGCAATCCACAATGTCTTCCATGTGTCTCAATTGAAGCTCAAATTGGGAAAGCAACAAGTGCAGCACCAGCCCAAGACTGTTATGGGAATTCGTTGGAGTAAAGAGTTAGGAGCAAATGAACGGCTAATTAAATGGAAGGTTTTACATGGGAATCAATATATCAGATGAATCAGCAGTTCCCTACATTTCACCTTGAGGACAAGGTGATTTGAAACACAAGGGTGTTGTAAGGCCACCTATAATCCATACGTATAAAAGAAAGGGCAGAAAGGGAATTATCCAGCAAGCCAATGATGAGGGAATGATTGCAGAAAAGATTGCGAGTAATGGGCCCACCGGTAGGAGAGAGAGTGTCCTATAAAAAAATCTTTATGGGCAATGTGTAGGGAGGGTTTCTTTTTGTGTAAAAGCTGCAGAAAGCTTTAGGAGACGAATTTCCCAGCCTCCTTGTAAAGTTGCTGGGTAAGCTATTGTAATTTTCCTTTATTTTCGTTGTTGAACTCTGTGTGATCTTATCTTCGGCTGAAAGTAATATAAATAACAGAGTGCGGCCTCTGTTTTAGACATTTTGTTAGGATTGATTGGTATTTTTGAATTAGAATCCTAACAGTGGCAGTTTAGTTGTTTGCAAGGGTCGTCAAATAAGAGAAGCATGTCATTTTAACTGCTACATTGCTATTTTCCAGGGGGCAAGAGATGAGTTTACTCTTGATCTTGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGCTGTCTTCTCGGTATGGTTCCATCTTCGCCTGTTTCTTCCACAATGCATCAGTAATTGATTCCATTCAATGAATGGATGTTTTTCCTACTCAATGCTTTGTGAAACGATAACGTACTGTGTTAAGTAAATGAAAAGGAGACATGAAGCTACAAAATATTGTTGATATCATAATGGACAAATTTTGTAAATGAAAAAATGTGCAGTAATGTTTTTGTTTTCGTAACAACTTTGCGTGTAACTGAAAGAATCTCTTATATTGCTCACCCTGTTTTAACAATTGTAGATAGCTTCAAGGATGCTGAATTTGACACATCTATAGATACCTCATAAATTAGTACATTATGAGAAATAAGCTTAATACAACGTTGCTTCCTTTTTCCAATTTACTGACCTTGAACTCTGCTATGTGACTGTTGTTTCGATTTTTCTTTTCTATTTCTATACTGTGGTTGTAACTCTTGTTAGCCTACCATAGATATCCTTTTTTAATATTGCATGAGATAGTCATTGTGTATTCTTTTTCACTTCTTAGTAAAACACGGTTATCATTATAATTTATTTAATTGAAACATAATTGCTGTCACAAGAAAATTTAAAAATTGAAACATACTTACAATAACTTGTAATTCTATTGGTCTATCATTTTGATTTCCAGTTGCTTTCACTGTGTTTTAGGGACGAGAAGATTGTCTGCGGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCTGGTCAGTTTATGTCGACTCAAAATCAGTTCAACGGCGAAGAAGTTGGTATGTCCAGATTGCCTGCTAATCATTATAACCATGACGAAGGAGAAGATGAAGAAGAGGCCGATCAACTTTTCCGAAGGTAATACACTACATTCGATGTTGATTTCAGTATGGAGTGGTGAAAATATAAGCAAACTAAGTAGTGAGTGATATGGTCAATAACAGGTTGCGAAAAGGAAAGGCTTGTGTAAGGCCTGAAGACGAAGAGGATTCTTCAGAGGAGCGGCCGTCGTTGGGTTTGCTAGGATTGTCAATTCCAGTTGAACGAGCAAACCGTCCAATCATTCGACCTATTGACGAGAAGGTGTCAACGACATTGGAAATACAGCATGGTCAGGGTGTTTCAATACCACCACCACCAGTAAAGCATGCAGAAAGGGAGAAGTTCTTCAAGGATAAAAAAATAGATGTTGGAGTTGGACATATGAGAGGGCTTTCTTTACACAGTCGTAATGCTAGCAGCTCTCGCAGTGGAAGCATAGATTTCAACGAGTCATGAAGGAGTGGTGTGAAGAAGTGTTTGGGAGTGCAATTTTATTTATTTTCACTTCAACTCATTATTTGTAAAGTTGAGTCATTTTTCTTTCTTTCTTTTCAATGAGTTTGATATCTTTAAATTAAGGTGTAAGGGCCAGGGACTTACTGAATATATCCTTTATTCCATTCGTGTATAATTTTGTAATTTAAAGGTGGTTAACTATTTCAAGTTTGCTGAAGGTGAAAAAAGCCAAAGAAGGGCAATCGAAAGACCTTTATTTTTTTCTTTTTTTTATTATAATGCCCGCAAAGACATTTTTTGGTTTGATATTGTTATTTATTTATTTATTTGGACCAGCTGTTATTCTTTCTGCTGCTCCTTTTTTACCATTTTGATGAGAAAAATGGTTCTTACCCTTCTTTTATATATGTTTTTCAAAGGATACCTGTTCCATTCTACCCCATGTTTGTTTCTTTAATACTATGATTTATATATTGAACATTAATAATATTTCCCATCTATTGAGTACTTACTCTATGCTTCAAAAGCGGATTTTGTAATCTATATCAATT

mRNA sequence

ATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGGGGCAAGAGATGAGTTTACTCTTGATCTTGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGCTGTCTTCTCGGGACGAGAAGATTGTCTGCGGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCTGGTCAGTTTATGTCGACTCAAAATCAGTTCAACGGCGAAGAAGTTGGTATGTCCAGATTGCCTGCTAATCATTATAACCATGACGAAGGAGAAGATGAAGAAGAGGCCGATCAACTTTTCCGAAGGTTGCGAAAAGGAAAGGCTTGTGTAAGGCCTGAAGACGAAGAGGATTCTTCAGAGGAGCGGCCGTCGTTGGGTTTGCTAGGATTGTCAATTCCAGTTGAACGAGCAAACCGTCCAATCATTCGACCTATTGACGAGAAGGTGTCAACGACATTGGAAATACAGCATGGTCAGGGTGTTTCAATACCACCACCACCAGTAAAGCATGCAGAAAGGGAGAAGTTCTTCAAGGATAAAAAAATAGATGTTGGAGTTGGACATATGAGAGGGCTTTCTTTACACAGTCGTAATGCTAGCAGCTCTCGCAGTGGAAGCATAGATTTCAACGAGTCATGA

Coding sequence (CDS)

ATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGGGGCAAGAGATGAGTTTACTCTTGATCTTGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGCTGTCTTCTCGGGACGAGAAGATTGTCTGCGGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCTGGTCAGTTTATGTCGACTCAAAATCAGTTCAACGGCGAAGAAGTTGGTATGTCCAGATTGCCTGCTAATCATTATAACCATGACGAAGGAGAAGATGAAGAAGAGGCCGATCAACTTTTCCGAAGGTTGCGAAAAGGAAAGGCTTGTGTAAGGCCTGAAGACGAAGAGGATTCTTCAGAGGAGCGGCCGTCGTTGGGTTTGCTAGGATTGTCAATTCCAGTTGAACGAGCAAACCGTCCAATCATTCGACCTATTGACGAGAAGGTGTCAACGACATTGGAAATACAGCATGGTCAGGGTGTTTCAATACCACCACCACCAGTAAAGCATGCAGAAAGGGAGAAGTTCTTCAAGGATAAAAAAATAGATGTTGGAGTTGGACATATGAGAGGGCTTTCTTTACACAGTCGTAATGCTAGCAGCTCTCGCAGTGGAAGCATAGATTTCAACGAGTCATGA

Protein sequence

MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES*
Homology
BLAST of CsaV3_4G027580 vs. NCBI nr
Match: XP_011653749.1 (TOM1-like protein 5 isoform X2 [Cucumis sativus] >KAE8649605.1 hypothetical protein Csa_012667 [Cucumis sativus])

HSP 1 Score: 773.1 bits (1995), Expect = 1.2e-219
Identity = 399/399 (100.00%), Postives = 399/399 (100.00%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120
           LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF
Sbjct: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120

Query: 121 PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE 180
           PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE
Sbjct: 121 PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE 180

Query: 181 VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240
           VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA
Sbjct: 181 VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240

Query: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE 300
           RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE
Sbjct: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE 300

Query: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER 360
           DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER
Sbjct: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER 360

Query: 361 EKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 400
           EKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES
Sbjct: 361 EKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 399

BLAST of CsaV3_4G027580 vs. NCBI nr
Match: XP_004142659.1 (TOM1-like protein 5 isoform X1 [Cucumis sativus])

HSP 1 Score: 762.3 bits (1967), Expect = 2.1e-216
Identity = 399/416 (95.91%), Postives = 399/416 (95.91%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 SDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA
Sbjct: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ
Sbjct: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 400
           HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES
Sbjct: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 416

BLAST of CsaV3_4G027580 vs. NCBI nr
Match: XP_038882891.1 (TOM1-like protein 5 [Benincasa hispida] >XP_038882892.1 TOM1-like protein 5 [Benincasa hispida] >XP_038882893.1 TOM1-like protein 5 [Benincasa hispida])

HSP 1 Score: 696.8 bits (1797), Expect = 1.1e-196
Identity = 367/416 (88.22%), Postives = 379/416 (91.11%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AAELVNSATSEKL ETDWMKNIQ+CELVAHDQRQAKEVIKAIKKRLG+KNAN QLYAVLL
Sbjct: 3   AAELVNSATSEKLTETDWMKNIQVCELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 62

Query: 62  LEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKFP 121
           LEMLMNNIGEAIHKQ                 S+LPVRERIFLLLDATQTALGGASGKFP
Sbjct: 63  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 122

Query: 122 QYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSE 181
           QYYSAYYDLVSAGVQFPQRP AV  N+PT QQ NNTSQNGVIRLSE+E+VAR+EPQIL E
Sbjct: 123 QYYSAYYDLVSAGVQFPQRPSAVPLNNPTSQQQNNTSQNGVIRLSEKEDVARLEPQILPE 182

Query: 182 SSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKI 241
           SSI EKA NALEVLKEVLDAVDPR PEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKI
Sbjct: 183 SSITEKASNALEVLKEVLDAVDPRRPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKI 242

Query: 242 VCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEAD 301
           VC AIELNEKLQKVLARHDALLSGQFMSTQNQ NGEEVGMSRLPANHYN DEGEDEEEA+
Sbjct: 243 VCRAIELNEKLQKVLARHDALLSGQFMSTQNQLNGEEVGMSRLPANHYNQDEGEDEEEAE 302

Query: 302 QLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQH 361
           QLFRRLRKGKAC RPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPI EK+ST  ++QH
Sbjct: 303 QLFRRLRKGKACARPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPITEKLSTASDVQH 362

Query: 362 GQGVSIPPPPVKHAEREKFFKDKKIDVGV-GHMRGLSLHSRNASSSRSGSIDFNES 400
           GQGVSIPPPPVKHAEREKFFKDKK+DVGV GHMR LSLHSRNASSSRSGSIDFNES
Sbjct: 363 GQGVSIPPPPVKHAEREKFFKDKKMDVGVGGHMRSLSLHSRNASSSRSGSIDFNES 418

BLAST of CsaV3_4G027580 vs. NCBI nr
Match: XP_016900689.1 (PREDICTED: target of Myb protein 1 isoform X2 [Cucumis melo])

HSP 1 Score: 695.3 bits (1793), Expect = 3.1e-196
Identity = 356/368 (96.74%), Postives = 362/368 (98.37%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLGNKNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120
           LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF
Sbjct: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120

Query: 121 PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE 180
           PQRPPAVSSNSPTQ Q NNTSQNG+IRLSEQENVARVEPQIL ESSIIEKAGNALEVLKE
Sbjct: 121 PQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARVEPQILPESSIIEKAGNALEVLKE 180

Query: 181 VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240
           VLDAVDP+HPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA
Sbjct: 181 VLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240

Query: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE 300
           RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA+QL+RRLRKGKACV PE
Sbjct: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEAEQLYRRLRKGKACVMPE 300

Query: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER 360
           DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGV IPPPPVKHAER
Sbjct: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVGIPPPPVKHAER 360

Query: 361 EKFFKDKK 369
           EKFFK+KK
Sbjct: 361 EKFFKEKK 368

BLAST of CsaV3_4G027580 vs. NCBI nr
Match: XP_008449359.2 (PREDICTED: target of Myb protein 1 isoform X1 [Cucumis melo] >XP_016900684.1 PREDICTED: target of Myb protein 1 isoform X1 [Cucumis melo])

HSP 1 Score: 684.5 bits (1765), Expect = 5.5e-193
Identity = 356/385 (92.47%), Postives = 362/385 (94.03%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLGNKNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 SDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQ Q NNTSQNG+IRLSEQENVARVEPQIL 
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARVEPQILP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKAGNALEVLKEVLDAVDP+HPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA
Sbjct: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           +QL+RRLRKGKACV PEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ
Sbjct: 301 EQLYRRLRKGKACVMPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKK 369
           HGQGV IPPPPVKHAEREKFFK+KK
Sbjct: 361 HGQGVGIPPPPVKHAEREKFFKEKK 385

BLAST of CsaV3_4G027580 vs. ExPASy Swiss-Prot
Match: Q9FFQ0 (TOM1-like protein 5 OS=Arabidopsis thaliana OX=3702 GN=TOL5 PE=1 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 2.7e-118
Identity = 255/451 (56.54%), Postives = 310/451 (68.74%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELV+SATSEKLA+ DW KNI+ICEL A D+RQAK+VIKAIKKRLG+KN N QLYAV 
Sbjct: 1   MAAELVSSATSEKLADVDWAKNIEICELAARDERQAKDVIKAIKKRLGSKNPNTQLYAVQ 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGVLPTLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARV---EPQ 180
           PQYY+AYY+LV+AGV+F QRP A +    T Q +   + N  +  +  E  A     E Q
Sbjct: 121 PQYYTAYYELVNAGVKFTQRPNA-TPVVVTAQAVPRNTLNEQLASARNEGPATTQQRESQ 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
            +S SSI++KA  ALE+LKEVLDAVD ++PEGA+DEFTLDLVEQCSFQK+++MHLV++SR
Sbjct: 181 SVSPSSILQKASTALEILKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKERVMHLVMTSR 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQF-----MSTQNQF-----------NGEE--- 300
           DEK V  AIELNE+LQ++L RH+ LLSG+       +T N +           NG++   
Sbjct: 241 DEKAVSKAIELNEQLQRILNRHEDLLSGRITVPSRSTTSNGYHSNLEPVRPISNGDQKRE 300

Query: 301 -------VGMSRLPAN--HYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSL 360
                     S   +N  H   +E ++EEE +QLFRRLRKGKA  RPEDEE+ S   P  
Sbjct: 301 LKASNANTESSSFISNRAHLKLEEEDEEEEPEQLFRRLRKGKARARPEDEEEPS---PPQ 360

Query: 361 GLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQG--VSIPPPPVKHAEREKFFKDKKID 399
           GL G +I  ER NRP+IRP+  + ++     H Q   V IPPPP KH EREKFFK+ K D
Sbjct: 361 GLPGSAIHNERLNRPLIRPLPSEEASRGGDSHSQSPPVVIPPPPAKHVEREKFFKENKGD 420

BLAST of CsaV3_4G027580 vs. ExPASy Swiss-Prot
Match: Q9LPL6 (TOM1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=TOL3 PE=1 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 4.4e-44
Identity = 113/322 (35.09%), Postives = 174/322 (54.04%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+++  +  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDIINMEPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE+++                 K+ DL VRE+I  LLD  Q A GG+ G+FP
Sbjct: 65  LETLSKNCGESVYQLIVDRDILPDMVKIVKKKPDLTVREKILSLLDTWQEAFGGSGGRFP 124

Query: 122 QYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENVA 181
           QYY+AY +L SAG++FP R         PP      P   Q   + ++  I+ S Q +  
Sbjct: 125 QYYNAYNELRSAGIEFPPRTESSVPFFTPP---QTQPIVAQATASDEDAAIQASLQSD-- 184

Query: 182 RVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHL 241
             +   LS    I+ A  +++VL ++L A+DP HPEG ++E  +DLVEQC   ++++M L
Sbjct: 185 --DASALSMEE-IQSAQGSVDVLTDMLGALDPSHPEGLKEELIVDLVEQCRTYQRRVMAL 244

Query: 242 VLSSRDEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHD 296
           V ++ DE+++C  + LN+ LQ+VL  HD    G  +             + +P    NHD
Sbjct: 245 VNTTSDEELMCQGLALNDNLQRVLQHHDDKAKGNSVPA--------TAPTPIPLVSINHD 304

BLAST of CsaV3_4G027580 vs. ExPASy Swiss-Prot
Match: Q6NQK0 (TOM1-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=TOL4 PE=1 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 7.6e-44
Identity = 135/422 (31.99%), Postives = 207/422 (49.05%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+L+  D  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE ++                 K+ +L VRE+I  LLD  Q A GG  G++P
Sbjct: 65  LETLSKNCGENVYQLIIDRGLLNDMVKIVKKKPELNVREKILTLLDTWQEAFGGRGGRYP 124

Query: 122 QYYSAYYDLVSAGVQFPQR-PPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 181
           QYY+AY DL SAG++FP R   ++S  +P Q Q     ++  I+ S Q + A       S
Sbjct: 125 QYYNAYNDLRSAGIEFPPRTESSLSFFTPPQTQ---PDEDAAIQASLQGDDA-------S 184

Query: 182 ESSI--IEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRD 241
             S+  I+ A  +++VL ++L A DP +PE  ++E  +DLVEQC   ++++M LV ++ D
Sbjct: 185 SLSLEEIQSAEGSVDVLMDMLGAHDPGNPESLKEEVIVDLVEQCRTYQRRVMTLVNTTTD 244

Query: 242 EKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEE 301
           E+++C  + LN+ LQ VL RHD + +   + +  +       +  +  NH + D+  D+E
Sbjct: 245 EELLCQGLALNDNLQHVLQRHDDIANVGSVPSNGRNTRAPPPVQIVDINHDDEDDESDDE 304

Query: 302 EADQLFRRL--RKGKACVRPEDEEDS-------------SEERPSLGLLGLSIPVERANR 361
                F RL  R      RP    DS                  S G+     P    + 
Sbjct: 305 -----FARLAHRSSTPTRRPVHGSDSGMVDILSGDVYKPQGNSSSQGVKKPPPPPPHTSS 364

Query: 362 PIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVG-----VGHMRGLSL 384
               P+ +  S           ++PPPP +H +R++FF+      G      G  R LSL
Sbjct: 365 SSSSPVFDDASPQQSKSSEVIRNLPPPPSRHNQRQQFFEHHHSSSGSDSSYEGQTRNLSL 411

BLAST of CsaV3_4G027580 vs. ExPASy Swiss-Prot
Match: Q9C9Y1 (TOM1-like protein 8 OS=Arabidopsis thaliana OX=3702 GN=TOL8 PE=2 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 9.9e-44
Identity = 122/396 (30.81%), Postives = 200/396 (50.51%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   LV+ ATS+ L   DW  N++IC+++ H+  Q +EV+  IKKRL ++ +  QL A+ 
Sbjct: 1   MVHPLVDRATSDMLIGPDWAMNLEICDMLNHEPGQTREVVSGIKKRLTSRTSKVQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N GE IH Q                  ++ V+E+I +L+D  Q +  G  G+ 
Sbjct: 61  LLETIITNCGELIHMQVAEKDILHKMVKMAKRKPNIQVKEKILILIDTWQESFSGPQGRH 120

Query: 121 PQYYSAYYDLVSAGVQFPQRP---PAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQ 180
           PQYY+AY +L+ AG+ FPQRP   P+   N P+ +   N S+N      +    +     
Sbjct: 121 PQYYAAYQELLRAGIVFPQRPQITPSSGQNGPSTRYPQN-SRNARQEAIDTSTESEFPTL 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
            L+E   I+ A   ++VL E+++A+D  + EG + E  +DLV QC   KQ+++HLV S+ 
Sbjct: 181 SLTE---IQNARGIMDVLAEMMNAIDGNNKEGLKQEVVVDLVSQCRTYKQRVVHLVNSTS 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDE 300
           DE ++C  + LN+ LQ++LA+H+A+ SG  M  + + + +EV            D G  E
Sbjct: 241 DESMLCQGLALNDDLQRLLAKHEAIASGNSMIKKEEKSKKEVPKDTTQI----IDVGSSE 300

Query: 301 EEADQLFRRLRKG-KACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTT 360
            +   +      G K  +   D+ ++     SL L+ L  P  + + P+ +P D  +   
Sbjct: 301 TKNGSVVAYTTNGPKIDLLSGDDFETPNADNSLALVPLGPP--QPSSPVAKP-DNSIVLI 360

Query: 361 LEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGH 376
             +      S  P    HA  +K  ++     G GH
Sbjct: 361 DMLSDNNCESSTPTSNPHANHQKVQQNYSNGFGPGH 385

BLAST of CsaV3_4G027580 vs. ExPASy Swiss-Prot
Match: Q8L860 (TOM1-like protein 9 OS=Arabidopsis thaliana OX=3702 GN=TOL9 PE=1 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 1.7e-40
Identity = 108/311 (34.73%), Postives = 165/311 (53.05%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   +V  ATSE L   DW  N++IC+++  D  QAK+V+K IKKR+G++N  AQL A+ 
Sbjct: 1   MVNAMVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALT 60

Query: 61  LLEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N G+ +H                 K+ D  V+E+I +L+D  Q A GG   ++
Sbjct: 61  LLETIVKNCGDMVHMHVAEKGVIHEMVRIVKKKPDFHVKEKILVLIDTWQEAFGGPRARY 120

Query: 121 PQYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENV 180
           PQYY+ Y +L+ AG  FPQR         PP     +     + N      +     E  
Sbjct: 121 PQYYAGYQELLRAGAVFPQRSERSAPVFTPPQTQPLTSYPPNLRNAGPGNDV----PEPS 180

Query: 181 ARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMH 240
           A  E   LS S  I+ A   ++VL E+L A++P + E  + E  +DLVEQC   KQ+++H
Sbjct: 181 AEPEFPTLSLSE-IQNAKGIMDVLAEMLSALEPGNKEDLKQEVMVDLVEQCRTYKQRVVH 240

Query: 241 LVLSSRDEKIVCGAIELNEKLQKVLARHDALLSG-QFMSTQNQFNGEEVGMSRLPANHYN 285
           LV S+ DE ++C  + LN+ LQ+VL  ++A+ SG    S+Q +    E G S +  +   
Sbjct: 241 LVNSTSDESLLCQGLALNDDLQRVLTNYEAIASGLPGTSSQIEKPKSETGKSLVDVDGPL 300

BLAST of CsaV3_4G027580 vs. ExPASy TrEMBL
Match: A0A1S4DXH7 (target of Myb protein 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 SV=1)

HSP 1 Score: 695.3 bits (1793), Expect = 1.5e-196
Identity = 356/368 (96.74%), Postives = 362/368 (98.37%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLGNKNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120
           LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF
Sbjct: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120

Query: 121 PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE 180
           PQRPPAVSSNSPTQ Q NNTSQNG+IRLSEQENVARVEPQIL ESSIIEKAGNALEVLKE
Sbjct: 121 PQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARVEPQILPESSIIEKAGNALEVLKE 180

Query: 181 VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240
           VLDAVDP+HPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA
Sbjct: 181 VLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240

Query: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE 300
           RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA+QL+RRLRKGKACV PE
Sbjct: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEAEQLYRRLRKGKACVMPE 300

Query: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER 360
           DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGV IPPPPVKHAER
Sbjct: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVGIPPPPVKHAER 360

Query: 361 EKFFKDKK 369
           EKFFK+KK
Sbjct: 361 EKFFKEKK 368

BLAST of CsaV3_4G027580 vs. ExPASy TrEMBL
Match: A0A1S3BMS2 (target of Myb protein 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 SV=1)

HSP 1 Score: 684.5 bits (1765), Expect = 2.7e-193
Identity = 356/385 (92.47%), Postives = 362/385 (94.03%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLGNKNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 SDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQ Q NNTSQNG+IRLSEQENVARVEPQIL 
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARVEPQILP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKAGNALEVLKEVLDAVDP+HPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA
Sbjct: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           +QL+RRLRKGKACV PEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ
Sbjct: 301 EQLYRRLRKGKACVMPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKK 369
           HGQGV IPPPPVKHAEREKFFK+KK
Sbjct: 361 HGQGVGIPPPPVKHAEREKFFKEKK 385

BLAST of CsaV3_4G027580 vs. ExPASy TrEMBL
Match: A0A6J1D6L6 (TOM1-like protein 5 OS=Momordica charantia OX=3673 GN=LOC111017804 PE=3 SV=1)

HSP 1 Score: 681.0 bits (1756), Expect = 3.0e-192
Identity = 357/416 (85.82%), Postives = 377/416 (90.62%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+V+KAIKKR+G+K AN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVVKAIKKRIGSKTANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 S+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEPIHKQVIDSGVLPILVKTVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPP+V  N+PTQQ  NNT QNGVIRLSEQE  ARVEPQIL 
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPSVPPNNPTQQH-NNTLQNGVIRLSEQEGAARVEPQILP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKAGNALEVLKEVLDAV+PR+PEGARDEFTLDLVEQCSFQKQ+LMHLVLSSRDEK
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVNPRNPEGARDEFTLDLVEQCSFQKQRLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVC AIELNEKLQKVLARHDALLSGQF+STQN F+ E+ G SRLPANH NHDEGEDEEEA
Sbjct: 241 IVCRAIELNEKLQKVLARHDALLSGQFISTQNNFSDEDTGRSRLPANHCNHDEGEDEEEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           +QLFRRLRKGKACVRPEDEEDSSEERPSLG LGLSIPVERANRPIIRPI+EK STT ++Q
Sbjct: 301 EQLFRRLRKGKACVRPEDEEDSSEERPSLGSLGLSIPVERANRPIIRPINEKASTTSDMQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 400
           HGQGVSIPPPPVKHAEREKFFKDKK+DVG GHMRGLSLHSRNASSSRSGSIDF+ES
Sbjct: 361 HGQGVSIPPPPVKHAEREKFFKDKKMDVGSGHMRGLSLHSRNASSSRSGSIDFSES 415

BLAST of CsaV3_4G027580 vs. ExPASy TrEMBL
Match: A0A6J1EH71 (TOM1-like protein 5 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432500 PE=3 SV=1)

HSP 1 Score: 634.0 bits (1634), Expect = 4.2e-178
Identity = 344/417 (82.49%), Postives = 361/417 (86.57%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MA+ELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+VIKAIKKR+G+KN N QLYAVL
Sbjct: 1   MASELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVIKAIKKRIGSKNTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNN+GE IHKQ                 S+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNVGETIHKQVIDSGVLPSLVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYY AYYDLVSAGVQFPQRPPA  S++ TQQ  NN  QNGVIRLSEQE+ A VEPQ L 
Sbjct: 121 PQYYQAYYDLVSAGVQFPQRPPATPSDNATQQHTNNL-QNGVIRLSEQEDAASVEPQTLP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKA NALE+LKEVLDAVDP+ PEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK
Sbjct: 181 ESSIIEKASNALEILKEVLDAVDPQRPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVC AIELNEKLQKVL RHDALLSGQFMST NQFNGEEVG  RL ANHYN DEGED EEA
Sbjct: 241 IVCRAIELNEKLQKVLERHDALLSGQFMSTHNQFNGEEVG--RLRANHYNQDEGED-EEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           +QLFRRLRKGKAC+RPEDE  SS ERPSLG LGLSIPVERANRPIIRPIDEKVSTT ++Q
Sbjct: 301 EQLFRRLRKGKACIRPEDENGSS-ERPSLGSLGLSIPVERANRPIIRPIDEKVSTTSDMQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVG-HMRGLSLHSRNASSSRSGSIDFNES 400
            GQGV IPPPPVKHAEREKFFKDKK   GV  HMRGLSLHSRNASSSRSGSID +ES
Sbjct: 361 QGQGVVIPPPPVKHAEREKFFKDKKTGGGVNEHMRGLSLHSRNASSSRSGSIDLSES 412

BLAST of CsaV3_4G027580 vs. ExPASy TrEMBL
Match: A0A6J1HRT0 (TOM1-like protein 5 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466022 PE=3 SV=1)

HSP 1 Score: 624.0 bits (1608), Expect = 4.3e-175
Identity = 341/417 (81.77%), Postives = 357/417 (85.61%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MA+ELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+VIKAIKKR+G+KN N QLYAVL
Sbjct: 1   MASELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVIKAIKKRIGSKNTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNN+GE IHKQ                 S+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNVGEPIHKQVIESGVLPVLVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYY AYYDLVSAGVQFPQRPPA  S++  QQ  NN  QNGV RLSEQE+ A VEPQ L 
Sbjct: 121 PQYYQAYYDLVSAGVQFPQRPPATPSDNAIQQHTNNL-QNGVKRLSEQEDAASVEPQALP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKA NALE+LKEVLDAVDP+ PEGARDEFTLDLVEQCSFQKQKLMHLVLS RDEK
Sbjct: 181 ESSIIEKASNALEILKEVLDAVDPQRPEGARDEFTLDLVEQCSFQKQKLMHLVLSFRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVC AIELNEKLQKVL RHDALLSGQFMST NQFNGEEVG SR  ANHYN DEGED EEA
Sbjct: 241 IVCRAIELNEKLQKVLERHDALLSGQFMSTHNQFNGEEVGRSR--ANHYNQDEGED-EEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           +QLFRRLRKGKACVRPEDE  SS ERPSLG LGLSIPVERANRPIIRPIDEKVSTT ++Q
Sbjct: 301 EQLFRRLRKGKACVRPEDENGSS-ERPSLGSLGLSIPVERANRPIIRPIDEKVSTTSDMQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVG-HMRGLSLHSRNASSSRSGSIDFNES 400
            GQGV IPPPPVKHAEREKFFKDKK   GV  HMRGLSLHSRNASSS SGSID +ES
Sbjct: 361 QGQGVVIPPPPVKHAEREKFFKDKKTGGGVSEHMRGLSLHSRNASSSGSGSIDLSES 412

BLAST of CsaV3_4G027580 vs. TAIR 10
Match: AT5G63640.1 (ENTH/VHS/GAT family protein )

HSP 1 Score: 426.8 bits (1096), Expect = 1.9e-119
Identity = 255/451 (56.54%), Postives = 310/451 (68.74%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELV+SATSEKLA+ DW KNI+ICEL A D+RQAK+VIKAIKKRLG+KN N QLYAV 
Sbjct: 1   MAAELVSSATSEKLADVDWAKNIEICELAARDERQAKDVIKAIKKRLGSKNPNTQLYAVQ 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGVLPTLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARV---EPQ 180
           PQYY+AYY+LV+AGV+F QRP A +    T Q +   + N  +  +  E  A     E Q
Sbjct: 121 PQYYTAYYELVNAGVKFTQRPNA-TPVVVTAQAVPRNTLNEQLASARNEGPATTQQRESQ 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
            +S SSI++KA  ALE+LKEVLDAVD ++PEGA+DEFTLDLVEQCSFQK+++MHLV++SR
Sbjct: 181 SVSPSSILQKASTALEILKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKERVMHLVMTSR 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQF-----MSTQNQF-----------NGEE--- 300
           DEK V  AIELNE+LQ++L RH+ LLSG+       +T N +           NG++   
Sbjct: 241 DEKAVSKAIELNEQLQRILNRHEDLLSGRITVPSRSTTSNGYHSNLEPVRPISNGDQKRE 300

Query: 301 -------VGMSRLPAN--HYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSL 360
                     S   +N  H   +E ++EEE +QLFRRLRKGKA  RPEDEE+ S   P  
Sbjct: 301 LKASNANTESSSFISNRAHLKLEEEDEEEEPEQLFRRLRKGKARARPEDEEEPS---PPQ 360

Query: 361 GLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQG--VSIPPPPVKHAEREKFFKDKKID 399
           GL G +I  ER NRP+IRP+  + ++     H Q   V IPPPP KH EREKFFK+ K D
Sbjct: 361 GLPGSAIHNERLNRPLIRPLPSEEASRGGDSHSQSPPVVIPPPPAKHVEREKFFKENKGD 420

BLAST of CsaV3_4G027580 vs. TAIR 10
Match: AT1G21380.1 (Target of Myb protein 1 )

HSP 1 Score: 180.3 bits (456), Expect = 3.2e-45
Identity = 113/322 (35.09%), Postives = 174/322 (54.04%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+++  +  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDIINMEPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE+++                 K+ DL VRE+I  LLD  Q A GG+ G+FP
Sbjct: 65  LETLSKNCGESVYQLIVDRDILPDMVKIVKKKPDLTVREKILSLLDTWQEAFGGSGGRFP 124

Query: 122 QYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENVA 181
           QYY+AY +L SAG++FP R         PP      P   Q   + ++  I+ S Q +  
Sbjct: 125 QYYNAYNELRSAGIEFPPRTESSVPFFTPP---QTQPIVAQATASDEDAAIQASLQSD-- 184

Query: 182 RVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHL 241
             +   LS    I+ A  +++VL ++L A+DP HPEG ++E  +DLVEQC   ++++M L
Sbjct: 185 --DASALSMEE-IQSAQGSVDVLTDMLGALDPSHPEGLKEELIVDLVEQCRTYQRRVMAL 244

Query: 242 VLSSRDEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHD 296
           V ++ DE+++C  + LN+ LQ+VL  HD    G  +             + +P    NHD
Sbjct: 245 VNTTSDEELMCQGLALNDNLQRVLQHHDDKAKGNSVPA--------TAPTPIPLVSINHD 304

BLAST of CsaV3_4G027580 vs. TAIR 10
Match: AT1G76970.1 (Target of Myb protein 1 )

HSP 1 Score: 179.5 bits (454), Expect = 5.4e-45
Identity = 135/422 (31.99%), Postives = 207/422 (49.05%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+L+  D  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE ++                 K+ +L VRE+I  LLD  Q A GG  G++P
Sbjct: 65  LETLSKNCGENVYQLIIDRGLLNDMVKIVKKKPELNVREKILTLLDTWQEAFGGRGGRYP 124

Query: 122 QYYSAYYDLVSAGVQFPQR-PPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 181
           QYY+AY DL SAG++FP R   ++S  +P Q Q     ++  I+ S Q + A       S
Sbjct: 125 QYYNAYNDLRSAGIEFPPRTESSLSFFTPPQTQ---PDEDAAIQASLQGDDA-------S 184

Query: 182 ESSI--IEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRD 241
             S+  I+ A  +++VL ++L A DP +PE  ++E  +DLVEQC   ++++M LV ++ D
Sbjct: 185 SLSLEEIQSAEGSVDVLMDMLGAHDPGNPESLKEEVIVDLVEQCRTYQRRVMTLVNTTTD 244

Query: 242 EKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEE 301
           E+++C  + LN+ LQ VL RHD + +   + +  +       +  +  NH + D+  D+E
Sbjct: 245 EELLCQGLALNDNLQHVLQRHDDIANVGSVPSNGRNTRAPPPVQIVDINHDDEDDESDDE 304

Query: 302 EADQLFRRL--RKGKACVRPEDEEDS-------------SEERPSLGLLGLSIPVERANR 361
                F RL  R      RP    DS                  S G+     P    + 
Sbjct: 305 -----FARLAHRSSTPTRRPVHGSDSGMVDILSGDVYKPQGNSSSQGVKKPPPPPPHTSS 364

Query: 362 PIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVG-----VGHMRGLSL 384
               P+ +  S           ++PPPP +H +R++FF+      G      G  R LSL
Sbjct: 365 SSSSPVFDDASPQQSKSSEVIRNLPPPPSRHNQRQQFFEHHHSSSGSDSSYEGQTRNLSL 411

BLAST of CsaV3_4G027580 vs. TAIR 10
Match: AT3G08790.1 (ENTH/VHS/GAT family protein )

HSP 1 Score: 179.1 bits (453), Expect = 7.0e-45
Identity = 122/396 (30.81%), Postives = 200/396 (50.51%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   LV+ ATS+ L   DW  N++IC+++ H+  Q +EV+  IKKRL ++ +  QL A+ 
Sbjct: 1   MVHPLVDRATSDMLIGPDWAMNLEICDMLNHEPGQTREVVSGIKKRLTSRTSKVQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N GE IH Q                  ++ V+E+I +L+D  Q +  G  G+ 
Sbjct: 61  LLETIITNCGELIHMQVAEKDILHKMVKMAKRKPNIQVKEKILILIDTWQESFSGPQGRH 120

Query: 121 PQYYSAYYDLVSAGVQFPQRP---PAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQ 180
           PQYY+AY +L+ AG+ FPQRP   P+   N P+ +   N S+N      +    +     
Sbjct: 121 PQYYAAYQELLRAGIVFPQRPQITPSSGQNGPSTRYPQN-SRNARQEAIDTSTESEFPTL 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
            L+E   I+ A   ++VL E+++A+D  + EG + E  +DLV QC   KQ+++HLV S+ 
Sbjct: 181 SLTE---IQNARGIMDVLAEMMNAIDGNNKEGLKQEVVVDLVSQCRTYKQRVVHLVNSTS 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDE 300
           DE ++C  + LN+ LQ++LA+H+A+ SG  M  + + + +EV            D G  E
Sbjct: 241 DESMLCQGLALNDDLQRLLAKHEAIASGNSMIKKEEKSKKEVPKDTTQI----IDVGSSE 300

Query: 301 EEADQLFRRLRKG-KACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTT 360
            +   +      G K  +   D+ ++     SL L+ L  P  + + P+ +P D  +   
Sbjct: 301 TKNGSVVAYTTNGPKIDLLSGDDFETPNADNSLALVPLGPP--QPSSPVAKP-DNSIVLI 360

Query: 361 LEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGH 376
             +      S  P    HA  +K  ++     G GH
Sbjct: 361 DMLSDNNCESSTPTSNPHANHQKVQQNYSNGFGPGH 385

BLAST of CsaV3_4G027580 vs. TAIR 10
Match: AT4G32760.1 (ENTH/VHS/GAT family protein )

HSP 1 Score: 168.3 bits (425), Expect = 1.2e-41
Identity = 108/311 (34.73%), Postives = 165/311 (53.05%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   +V  ATSE L   DW  N++IC+++  D  QAK+V+K IKKR+G++N  AQL A+ 
Sbjct: 1   MVNAMVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALT 60

Query: 61  LLEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N G+ +H                 K+ D  V+E+I +L+D  Q A GG   ++
Sbjct: 61  LLETIVKNCGDMVHMHVAEKGVIHEMVRIVKKKPDFHVKEKILVLIDTWQEAFGGPRARY 120

Query: 121 PQYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENV 180
           PQYY+ Y +L+ AG  FPQR         PP     +     + N      +     E  
Sbjct: 121 PQYYAGYQELLRAGAVFPQRSERSAPVFTPPQTQPLTSYPPNLRNAGPGNDV----PEPS 180

Query: 181 ARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMH 240
           A  E   LS S  I+ A   ++VL E+L A++P + E  + E  +DLVEQC   KQ+++H
Sbjct: 181 AEPEFPTLSLSE-IQNAKGIMDVLAEMLSALEPGNKEDLKQEVMVDLVEQCRTYKQRVVH 240

Query: 241 LVLSSRDEKIVCGAIELNEKLQKVLARHDALLSG-QFMSTQNQFNGEEVGMSRLPANHYN 285
           LV S+ DE ++C  + LN+ LQ+VL  ++A+ SG    S+Q +    E G S +  +   
Sbjct: 241 LVNSTSDESLLCQGLALNDDLQRVLTNYEAIASGLPGTSSQIEKPKSETGKSLVDVDGPL 300

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011653749.11.2e-219100.00TOM1-like protein 5 isoform X2 [Cucumis sativus] >KAE8649605.1 hypothetical prot... [more]
XP_004142659.12.1e-21695.91TOM1-like protein 5 isoform X1 [Cucumis sativus][more]
XP_038882891.11.1e-19688.22TOM1-like protein 5 [Benincasa hispida] >XP_038882892.1 TOM1-like protein 5 [Ben... [more]
XP_016900689.13.1e-19696.74PREDICTED: target of Myb protein 1 isoform X2 [Cucumis melo][more]
XP_008449359.25.5e-19392.47PREDICTED: target of Myb protein 1 isoform X1 [Cucumis melo] >XP_016900684.1 PRE... [more]
Match NameE-valueIdentityDescription
Q9FFQ02.7e-11856.54TOM1-like protein 5 OS=Arabidopsis thaliana OX=3702 GN=TOL5 PE=1 SV=1[more]
Q9LPL64.4e-4435.09TOM1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=TOL3 PE=1 SV=1[more]
Q6NQK07.6e-4431.99TOM1-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=TOL4 PE=1 SV=1[more]
Q9C9Y19.9e-4430.81TOM1-like protein 8 OS=Arabidopsis thaliana OX=3702 GN=TOL8 PE=2 SV=1[more]
Q8L8601.7e-4034.73TOM1-like protein 9 OS=Arabidopsis thaliana OX=3702 GN=TOL9 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S4DXH71.5e-19696.74target of Myb protein 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 ... [more]
A0A1S3BMS22.7e-19392.47target of Myb protein 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 ... [more]
A0A6J1D6L63.0e-19285.82TOM1-like protein 5 OS=Momordica charantia OX=3673 GN=LOC111017804 PE=3 SV=1[more]
A0A6J1EH714.2e-17882.49TOM1-like protein 5 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432500 PE=... [more]
A0A6J1HRT04.3e-17581.77TOM1-like protein 5 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466022 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT5G63640.11.9e-11956.54ENTH/VHS/GAT family protein [more]
AT1G21380.13.2e-4535.09Target of Myb protein 1 [more]
AT1G76970.15.4e-4531.99Target of Myb protein 1 [more]
AT3G08790.17.0e-4530.81ENTH/VHS/GAT family protein [more]
AT4G32760.11.2e-4134.73ENTH/VHS/GAT family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002014VHS domainSMARTSM00288VHS_2coord: 2..117
e-value: 6.9E-16
score: 68.8
IPR002014VHS domainPFAMPF00790VHScoord: 4..76
e-value: 1.8E-17
score: 63.5
IPR002014VHS domainPROSITEPS50179VHScoord: 9..121
score: 23.486242
IPR004152GAT domainPFAMPF03127GATcoord: 175..248
e-value: 5.4E-9
score: 36.2
IPR004152GAT domainPROSITEPS50909GATcoord: 159..247
score: 13.119628
IPR008942ENTH/VHSGENE3D1.25.40.90coord: 2..119
e-value: 3.2E-26
score: 93.7
IPR008942ENTH/VHSSUPERFAMILY48464ENTH/VHS domaincoord: 4..122
IPR038425GAT domain superfamilyGENE3D1.20.58.160coord: 158..255
e-value: 3.4E-15
score: 57.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 381..399
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 378..399
NoneNo IPR availablePANTHERPTHR45898:SF3TOM1-LIKE PROTEIN 5coord: 1..397
NoneNo IPR availableCDDcd03561VHScoord: 2..113
e-value: 1.09327E-32
score: 117.751
NoneNo IPR availableSUPERFAMILY89009GAT-like domaincoord: 129..249
IPR044836TOM1-like protein, plantPANTHERPTHR45898TOM1-LIKE PROTEINcoord: 1..397

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_4G027580.1CsaV3_4G027580.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043328 protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway
molecular_function GO:0035091 phosphatidylinositol binding
molecular_function GO:0043130 ubiquitin binding