CsaV3_3G017550 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_3G017550
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Locationchr3: 13139930 .. 13162136 (-)
RNA-Seq ExpressionCsaV3_3G017550
SyntenyCsaV3_3G017550
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATAAACTGCTGCCATTGGGCTTGACTGCAATGCCGTAAGGATCCGAAGAGGTTTATTCGCTCGTACAGAAAATGGGATATTCAGTGACGCATAGGATATTTTCCACGTGAAGCCAAGACGAAGAGCAGGTACCTCATCCTAATACACATTTTCTCTCGAAAATGCCCGTGGATCTCCTACTTCAATAACTCATTACCAAGAACAAAGAAGGTTAAAGAATGATCGAAACTGCTCTGGCCGTTCGGTTTCCCGCCGGCGCAAACTTCTGCTACTCCTCTGCCGTGTCCTGTAATTCTCCCTCTCTTTCTTCCCCGCCATTACTTACTTCAGAAGACGAGCTATACTTTTTTTTATCCATGTTTTTTGGTCGAGTTATACGAAAGTGAAAGAGGAAACGTTTCTTTTGATTGCTTAACTTGCTTTCTGTTGAAATCAATGGAGCTCAAGAGTCCACGATTGCTTAACTTGCTTTCTGTTGAAATCAATGGAGCTTAAGAGTCCACATCTTTTGTTTTTGGCGTTCTTTTTCCTGCATTTTACATTGTTGCCAAGCTTACTAGCTGACTAATTTTCGTAGATGTCTTTCGAATTAGACGAAAGAGAAAGAGGAAACAATTGTTATTATTGCTTAACTTGAATTCTATTGAAAGTTTTGGAGTTTAAGATTCCACGGCTTTTCTTTTTGGCGTTCTTTTTTCTGCATTTTACATGGCTATCAACCTTACTAGCTGACTATTTTTCTTCTATGTTCTTCGAATTAAATGCAAGAGGGCGAGAAACGATTGCTATTATTGCTTAACCTGAATTCTATTATAAGCTTTGGAGCTTAAGATTCCACAGCTTTTCTTTTTGGCGTTTTCTCCAGCATTTTGCTTTGCTGCCGAGCTGAGAGTATTTTTCTTCTATGCTGTTGGAAATAAGGGAAAGCGAAAGAGGAAACATTCGTTTTGATTGCTTAATGTGATTTCCGTTTGAAACAGTGGAGCTCGAGATTCCATTGCCTTTCTTTTTGGTGTCCCCCCCGCATTTTCTTGGCTGCCAAGCTGAGACAAGTGTTCTATAATTTCTTTGAATTAAGTGTTAGTGGCAGATGAAACAATTTTTTTGGTTGCTTAAATTGAATGAAGCAAACGATTACGCAGCGTTTGTTTTTGGAAAATCCGGTGTTTTTTTGGATGTATAGCTGAGACTAATTTTCTTCCTTGTGTTTGGTTGTGTGCATGTACAGATCACCGTCCAGCTTGGACTTCGGAGGATGTAACTAGTATAGGCAATGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCACTTCCGACGTGCATTGGTACTTAGCTTGAGCTTGCTCCTTTCTTTTCCTCTTTTTATTATTTGTTTTTTATTAAATAATTTTGTCAATGAGTTTCTTGGTACCATATGCGTGTGGGGCAAAGCAATCACAGTTCAATGCAGCATGCGAGTGTTTAAGTAACCACGGGTCTCTAGTCCTTATACTGCTCCCACTGGCCAGAAATTGTAATAAATTGAAAAAAATTCTGACTTGAATATAAATTTATGATTAGAAAATACTTAGCCTAATGCAATGAGAAAGAACTCTTAATTGACAATATTTTTTTTATTCTTATAAGCCACTGTAATATGTGGTATTTACGGAATCGCAATTGGCTGTCAGGCTACCAATCCCAAATATCAAATAAAACACACAAACAATGGCAAACAGAGACGGTACTCTTGTGTTCTTTATTTATACATAAATGCCAAGATGTTCACAAGCTACGATATAAAGATAACAAGGAAAGAAAAACATAAAAGAAACATAATAGTCCCCTTTTACATCAATGGAGACCCCTGCCGGCTACAACAACCTACTCTCTTAAAGATAACAAAAAAAAAACCCAGCTACACTCTCTCTATCAATCACTATTTAAAGCCTTTCCTAAAATTCTTCCCTGTGGGCCCTACTCTACACAATCTTTTTCTCCAATCATTCCTCCTCATTAATATTAGGTGTATTTACATTTCTACTCTTTCTCACATACGTATGAATGATTGGGGGCCTTACAATACCCCTCGGTTCTAGGTTCACCTTGTCCTCAAGGTGAAAGGTGAGGAACTACTGGTTCATCTGGTAAACGGCTTCCCACATAGCTTCAGTCTCAGAATCTTTTCACTTTACCAACCATTCGTTGCCTTTGAGGTCCTTGTTCCAACGAACTCCTAGCACAGTTTCCGGCCATAGTTGCAGCTCGAAATCTTCAGTCAATATGGGTTGTTGATGTTGAACCACTTGCTGATTTCCTAACTTGAGTTTTAGCCTGGAGATGTGGAAGACATTGTGTATCACTGCTTCTGGGGGAAGTTGGAGTCTATAAACTACTTCACCTATTTCCTCAATTATCTTGTACGGCCCATAGAACTTAGGAGCCAATTTCTCACTCCTCTTGCATGCTAAGGAACGTTGCCGGTAGGGTCTCAGTTTCAGATACACCTCATCCCCAACCTTGAATTTGAGCTCTCTTCTCCTAGTATCCGCCATTTTTTTCATCCCATTTTGGGCTATGCTCAAATTCTCTTTCAATGCATTCAGTGCCAAGTCCCTTTCCTTCAGCATAGATTCTACCTCATCGTTCAGCGTTTTTTTATTCCCATATGATAACAGTGGAGGGGGTTGTCTTCCATACACTATCCGGAAAGGATTGCTTCTCGTGGAGGCATGAAAGGTGGTATTATACCACAACTCAGCCCATGGAATGAATTTATCCCAATTATGGGGTTCATTGCAGAAACATCTCAAATATGTTTTTAAACACTGATTTACCCTTTCGCTTTGCTTGTCGGTTTGTGGGTGAAAGGCTGTACTTCTCTTCAAGACTGTACCCATAGTAGCGAATAATTCCTTCCAAAAATTGCTGATGAAGATCTTATCCTTGTTTGAGATAGTCGATTTTGGTATGCCATGCCTGCTGCTTATTATGTCAATGAAGAATTTAGCTACTCTTTTGGCTGTAAAAGGATGCTTCAAGGTAATAAAATACGAGAACTTGTTGAGTCTATCAACTACCACCATGATCACATTCATCCCTCCAGCTTTTGACAGCCCTTCAATGAAGTTCATGGACCATTCTTCTAGAATTTTCTTTGCTATTGGAATTGGTTGTAAAACCCCTGCTGGCTTGGTTGCTTCATACTTATTCCTCTGGCAGATCTCACATTGCTCAACATAGTTTTTGACGTCTGTTTTCATTCTTTTCCAATGTAGTTCTCCATTCATCCTCTTATATGTCCTTAGAAATCCGGTGTGACCTCCTAAAACTGAATTATGAAACGTATGCACCAAGTTCGATATCAGTGATGAGTGTTTGGACAACACTATCCTGTTTTTATACCATAGTCTCCCATTCTCCCAACAATATTTACTTGTCTCTTTAGTCTTTTGTTTCAACTCTTCTATGATCTTCTTAAGATCTTCATCTTGTTGTACCTCCTCTTCAACTAACTCCATATTCACAATTCCAGTGGTAGTCATGGTATTGAGTTCTAGGGGTTGCTCTATCCGAGATAGGGCATCAATAGCTTTGTTCTGTAATCTCAGTTGATACAAAATCTCAAAATCATATCCTAGGAGTTTGGTCAACCACTTTTGGAACTTGGGTTGTGCCTCTCTCTGTTCTAAAAGAAATTTCAACGCTTTCTGATCAGATATGATTGTGAACTTCTTCCCTAGAAGATAATGTCCCCATTTCTGCGCAGAGAGCACTACAACCATAAGTTCCCTCTCATTTATAGATTTGGTTTGTGCCCTAGGAGATAGTTTTTGACTAAAGAAGGTGATGGGATGATTGTTCTGAGATAACACAGCCCCTAATCCTATTCCAGATGCCATCCGTTTCAACTATAAAAGGTAGGTTCCAATCAGGTAGTGCAAACACTGGTTTGGTTGTCATTGCTATCTTCAACTTATTGAAGGCAATTGTGGCCTCTTCATTCCATAGGAAGGAATTCTTTTGTAACAGTTTAGTTAGGAGTCACAATTTCCCCGTACCCTTTCACAAATCTTCGATAGTATTTGGTTAAGCCCAAGAATCCTCTCAAACTAGTAACATCCTTCGGTTGCGGCCAGTTCACCATATCTTGGATTTTCTCTTAGTCGGCTTCTACACCCTTATTGGAAATCAAGTGTCCTAAGTACTGGATTTGAGAATAAGCTATAACACACTTTTTCTTGTTGGCAAAAAGCTAATGATAACGTAGTACCGCAAATACCATTCCTAGGTGCTTCTCATGTTCCGTGATATCTGAACTATACATAGGTATATCGTAAAAAAAAAACCAATACACAACGCCTCAAAAAAGGTTTAAATACCTGGTTCATTAATGACTGAAAGGTGGTAGGTGCGCTCGTGAGGCCGAAGGGCATAACTACGAACTCATAATGGCCTTCGTGAGTTCTGAAAGTCTTCTCAATATCTTCCTTCTTATTTGGTGATACAGACTTCAAATCCATCTTTGAGAATATAGTGGCTCCTTGCAATTCATCCCGCAGTTCTTTGGGATAGGAAATTTGTCAGAGGTAGTCACCTGGTTTAGCTTTCGGTAATCTACACAAAATCTCCACCCTCCATCCTTTTTCTTCACTAACAAGACTGGACTGGAATAAGGACTATGACTGGGTCTTATCACTCCTGCTTGAAGCATCTCCACAACTAATTTCTCAATCTCTTGCTTTTGAATGTGTCCATATTTGTAAGGCCTCACGTTAATCGGTCTCTGTTCAAGCATCACCATGATTCGATGATCTACTTCTCTCTTAGGAGGTAACCCTTTTGGGTTTTCGAAAATGTCTAAATACTGTTGCAACAAGAATCTCATGGGTGTATCTTCTTCGTCCCCCTTAATTTCTTGTTCTTCTTCCAGGTCGTCATTCATTTCTATGTCATAGTTTTGTAGTTCCAAGAGGAAGCCTTGATCTTCCTTTTCCCATGTTTTCTCAATGGTTCTCAGGGAACATTCGACTCTAATAAGGGATAGATCCCCCTTAAGGCTAATTTGTTTTGTCCCTATCCAAAACGTCATATTTATGGAAGGCCAATGGATCTTCATAGTACCAGTGGTAGCAAGCCACTCCATCCCCAAAATTACATCCACATTCCCAGTTCAATAGCTAGAAAATTAGCGACCACGATGAGTCCTTTCAGTTTCAGCTCTAACCTCCTACACACTCCTCTTCCTTTGCACCTCGCACCATTCCCAATAGTTACTCTGAATTGAGTTCCCTCCTCCAACGGTATAATTTTTGTCTCTACTGTCTTGAGGTGTATAAAGTTGTGGGTGGCTCCACTGTCGTTTAGTACCACCACTTCCTTCCCTCTTATATCCCTTTATTTTCATGGTTCCCTTATTTGTCAATCCACTGATTGTCTTCAGCTCTACTTTGATTCCATCTGTGAGGTTTAGTTGTTTCAATTTCACCACCTCCTCTGGACTTTCCTCTGTTCGATCTTCTTCCTCAACACTCTCTTCTTCATTCATAATGAATAACATCAACTCCCTTTTTTCCTTCATTGTGCATCTATGTCATGGTGAGTCCTTTCATTACATTTAAAACATAAGCCCTTGTCCAGCCTGGCCTTAAACTCTGCATCGGACAAACGCTTAACAGATGGTTCATTCTTTTGGTAATTTCCCTTAATGGGTATGGTTACCTGTTTCATTGGAAAATCCGTTTATCTCAATATCCCTTTATCCGTTCCTTCTTGAACCTGGCTGGTATGCCCTTCTCCTTTTTTTTGTTCTCCCAACTTCCAGTATGTCTTAGACAATTTCAATGCTAAGTTCCGATCTTTGACTAGTTGAGCTTCTCTCATACAATCTTCCAACGTTTGTGGATACCTGCTGACTACTTCTGCTTGAAGTTTGGGTTCTAACCGGTTAAAAAAGCATCACGTAGAACACTCTCTGCCATGTGTGGTAGAGATGCTGAATAAGTAACAAACTTCTTCACACAATCGTTATAAGAACCATCTTGTTGAATTCAAATCAAATGAGCAACCAAACTTTTTTGCCCCAAGTCCTTAAAAAAATCAAACATCCTCCCCTTAAAATCTTCCCATGACTCCACCTTCTTCTATTATGAGTCCATCTATACCAATCTACTTCGTTCGACCCAAAACTTACCACAACCACTTTGATTTTCTCTGTTTTTGGTAGATTGTTGATTTCAAAGAAGTGCTCAGCCCTATATACCCAAGATTTCGGATTTTTGCCCAAGAACATAGGCATTTCCAACTTTTTGTACTTACTGCGGTCTACTGTGTTCAAGTTGATCTCAGTGGTAACTTTTGTTTCATCTATTTTTCCTTTCAATCTCATGACCGAACCATCTGATGTCCCTGACTCATCTCTTTTTTTATAGATATGGTTCTCCCTTAGCTCATCCGCCATCCTCTCCATCGATTTTTTCATTTCCATTAGTATTTCTTTCATACCGAATATCTCCTTCTCAGTCTCTTCCACTCTTTCCTCAATCTGTCTTTGCGCCATGAGTTGCATAACCACCCCTAGCTTTCGGGCTTTGATACCAAATGTTAGGCTACCAACCCCAAATATCAAATAAAACACACAAACAATGGGAAACAAAGACGATACTCTGGTGTTCTTTATTTATATATAAATGCCAAGATGTTCACAAGCTACGATATAAAGATAACAAGGAAAGAAAAACATAGAAGAAACATAATAGTCCCCTTTTACATCAATGGAGACCCCTGCTGGCTACAACAGCCTACATTTTTAAAGATGACAAAAAGAAAACCTAGCTACACTCTCCCTATCAATCACTATTTAAAGCCTTTCCTAAGTTCTTCCATGTGGGCCCTACTCTGCACGATCCTTTTCTCCAATCATTCCTCCTCATTAATATTAGGTGTATTTACCTTTCTACTCTTTCTCATATACGTATGAATGATGGGAGCCTTACACAAGCATCAAAAGGACTAATGTTATTGGCAGCTGTGTTGTTTGAGATCTTTCAGAAATGATTGTGTATCTGACTGTTACTCCATCAGAGTGGTGCTTAGATACATTATTTGTGTGTGTGTGTGTATAATTATTGTGTACAGAGGACATTGACCGAATTGTACCTTTTCTATTTGTGAACATCAAATTTGATTCAGTCTATGCATGTAAAAGGCAATAGGAGGACTGCACGAATGGAATCCTAAAACCTGATAACTAAATTGCAGGAAAAGATGCCAGAGACTTAATAGTAGGTCTCTCTTGGGAAGAAGCTATCTCAAAAAGATTGGAATCCAAGCATCAGCAGAGCCTTTGGGCTCTGCATCAGATCCAATTAAACAAAATAGGGGATTGCAGTATCATCCATCTGAGGAGCTTGTGAAATCAATAACTGAGATTGCTGATGATGTTAGACCTACCTCTGCAGAAACTACTAGAACAATCATTGAGGTATCAATTTATACTAATGCTGTTCTTATGTTCATACTTAATCATGATGTTTAATATGTTTGATGCATTATTAGAAGGAACGTTGTTCATACTAATGCTGTTCTTATGTTCATACTTAATCATGATGTTTAATATGTTTGATGCATTACTAGAAGGAACGTTGTTTGCATTTGTGCTTTATGCACCAACCAATTTTACCACTGGAAGCTACTCATTTGTTTGAATTGTATATTCTTTTTCCATCACGTTCTCAAGGACTGAAATGGAGGTCAAGGACTGGAATTGCTAATCTTGAAGTTTGGAAGATTATCTTAGAAACAAAAACAATGGACCCAATTTTTTCACTACCTTTTTACTTAAACTCTGTTTTTTCCTCCCTCTCTTTAGTACAAAACTTTATTTTTGCCCCTAGTACAACAAAGATAAGAATGGCAATTTGAACCTCCTACTTCAAAGGGAAAGTGGGTGGGAGCTATATTAAAGGGAGCTGCTTTTGTAATTCTCTATCGTATGTAGGTTGCATGATTTCGATAATTAATATTTTCAATAGCATTTCATGTTCAGGTAAATAGCAAAGCAACATTAATGTTTGCGGGTTTGATTAATGATGAAGTTCAGGAAAACATTATCTGGCCAGAACTACCGTATGTAACTGACGCACATGGAAGTATGTCCTACATTTTAGAGTTCACTATTCTCAAACTCAAGGCCATCCTGGGCCAACCTTCCCTATTATTTTATTGATTATGTTATTTTAATATATTGCAGATATCTACTTTCAAGCGAAGAATACTGAAGAGGCCATGAAAAATCTAACATCAGAAAATAACTTTGTGGTATGGATTTTGACTTACATATTAAACATAATGTTTTCACATCTCTCACATCTTGTCTTAAACCAATAATGAGTTCAGTTGTTTAATTTTCATTATTCCAAATTTTTACTTGCCATGTGAAGGTTCCCTTGTGATGTTAGTCATAGTGTACGCCCATAGGGTATTCTTTACTATTGAAAGCGGTAAAAGAAAATGTGACACCGAGTTACGTGGAAACCCGAGTATCGAGGGAAAAACCACTATATGACTTTCTTATAATTATTATTTCATATGAACAAAAGGTACAATGGGGGAAAATTAAATAGACAGCAAGCTTAAAGCTACATTTTGGCAGCCAATTGTTGATAAAGTGCAGAGAAAGTTAGACAAATGGAGAAGATTGAATAGTGAGAAGTGAAAAGGTAAGTTCTCTCCCCTCTCTCCTCTCTCCTCTTTCCTCCCCGGCAGCCAGCCCCTCCGGCCACCACCATCTCTCTAAGAATGGAGGTTTTTAGCTGCAAGATCAATCAGTTTCATTACTGCATATGGAAGGAAAATAATTATTGCATCATTGAAGACACGGAAGCCCATAGAATCCTTCCCCTATCTGATAACCAAACCCGTTGGCTCCAAGAAGCCATTGTTGAACTTCTCAAATTTCCAGCTGATCATTTATTTAGAAAGCACGCAAACATTGATAGAGGCCGGCTAAAAGTCTCGAAGTTTCAAGCAAATTCCAGCAGGATTTTATGTTGTGACTACTGGCCATATTCAGGAGGATATTCTATGATTAGAGTTAGTTCATGTGAAAATTTGCAAGGTTGGTGGTCTTTCTGTAACATGTTACTTAAATTTGTAGAAAAGGTCAACTACACCAGTTGGTTTTCAGATAATTCCCCAAAGACTTCTCCTCAGTTTCCAGTGAATAGACCTTCCTATGTAGAAATGGTCAATTCTTCTAGGACTACCTCTCCAATAAAAAATCCCCAGGACAAGGTACCTTGTCCTCGCAACACCCCGCTTAGAGCTCATCAAGTATCTGTCCAGCCTTCTCCCCCTAAACACAACCAGTGGCTTATCAAGAATCATGAAGTGGTAAAGGTTGATTTTGAAAATTTATGGATTATTATAAAACTGTTTGCTTCCGATGCTTGGAGATTGATTCGCAAACAGTTGGAGACGCTTTATCAATCTAAAATCATTATAAACCCCCTCTATGACGATAATGCTCTCATTAGTTTTGACAAAATTCCATCCAAGGATCTCATCGGAAAGGAAAAGAAATGGCAGGCTTGGGGCGAATTCCACATCAAGTTTGAAAAATGGGACTCAGTTCGACATAGTAGGCCCCTTGTGCTTAAAGGTTTTGGGGGCTGGATGAAAATAAAAAATCTTCCTTTAGAGGGGCAACATCTTTTTATCTTAAGAGGGGCAACATCTTTTTACACTATGGTGATTTTGAGTTTTTATATCCAGTCAATACACGTTCAGCCCCCATATTGCAAGGGAAATTCAGTAATTCAATTGACCGGTTGTGTGTCAAGGAAGTTTTGATTGACGAAGATCTTGATAGGAATCCGTCCAACCTCCTGAAGCACTTTCCGGTCAGTTCACCGGTGCCATTGCCGGAGAAGAAAACAGAAATCGGTGAACAGCTGGCTGTTCTTTCTAATGAGCGCGTAACTGAACAGTTTTTGTCTTCCTCCAACCCCTTGCAGCACAGCTATCTCCAAAGAGCTATGGGGCCCACTGATTTTAAATTAAACCCTCTTTTTGCAGCTGCTAATGACCCATCAGGTGAGAGAAACGTTGGAAAAGGCCCCCTAATTACTTCCTTGAGTATTAATGAGCCTCCAACCTCTTTATTTAAGTCCTCGGCCTATCCATTTCTGCAGCCAACTCCTGAACTGCCCAATGCCACAATAACTCCCTCTTCATTCTCCCCTAGGCCCCCCTCTGTTTTCAAAAAACAGAAATTTGAAGAGTCAGGAAAAGAAAATATCTCTTCTGCCTCTAATAATTCGATCCTTTTTGCCAGCTTTTGTCAAATTCCTAATTATCCCTCCAACAAAATTAATGATTCTGTTGGAGCGGCCTTTTCTTCCACCACAGAAGGATCCCCAAATAATTTTTGTTCAAGCAAAGGAAAGAAGAGTAAAATTTCCTCAATAGTACCGTTGAAACAATTTATTAGAAAGGACACTTCATTTGATAATGCTTGGACAGCTTTACTAAAGTCTGAAGCTATTGCAGCATGTTCCTTCAAAGATCTGGAAAAAGATTTAAAAAAGAATCTGCCAAGTAAGACCTCCTTATCAGAACAAAATCCAGATTTCTTGGAAATATGTACCTCCAAAATACAATCAGCTTCATATAATATCAGGTCTATCTCTCCCCGCCCCGCTTCTGACCATTTTAATTACTCATGCTTCACTGATCCTAACACTAAAGTCACTTTAGTATGAGGAACTCCATGCCATTCAGTTGAGTCAGTGCAGAAGCTATCAAATACTGAAATTATCTCCCCCTACAGTGTCAGCAGTGAAGACTCCCCGGGTTTAACTCACATTCAAGATGATAATGCAAATGATGAAATTGAAGCAGTGGATCTAATTTTCCTTTTCAAAGTGGAAGGTGGTGTTTTTAGTAATTTCCCTTGCTCTTCCCCAATCAAATCAGTAGAAATTCCAAAGGAATTGTTGCCAATTATTAAAGACTGTGAAGTGATGCTGGTTTAAGTAAGTCAGTGTAATGTATATGCTCAAAATTTTCACCCATGAAGATCATTTCCTGGAACACAAGCGGCCTAAAAGATCCAAATAAGCATTCCGCTCTTAAGAAGTTTATAAAAAATCATCACCCAGACATGGTGCTAATCCAAGAATCAAAAATGGAAATTCTGGAAGTAAATTTCATTAAAACAATTTGGAGTTCTATGGATATAGGATGGGAATCACTGGAATCTTATGGCGCTTCCGGAGGCATTCTTACCCTATGGGATAAAAGTAAAATCACAGTGGTGGAAACCATAAGAGGACATGGGTTACTTTGTGCAAGAAGTGTGGTTGGGTTAACAATGTTTGTGGTCCGTGTGGTTATAGAGAGAGGAAACTTGTTTGGCCAGAATTATTAACAATTACAGAATGTGGGGAAGAGTCTTGGTGTTTGGGTGGAGATTTTAATATCACTAGATGGGTCTATGAGAGGTTTCCAGTTGGCAGAAGCACAAGAGGGATGAGACAGTTTAATGCCTTTATAGAATCTGCCAATCTAATGGAAATTCCCCTTCAAAATGGTAAATTTACTTGGTCAAGAGAGGGTGGCACTGCTGCAAGATCTCTGTTGGACAGATTTTTTATTAACAATAAATTGAATTTGAAAACTCAAGAGTAACCCGTAAAGCAAGAACCTTCTCCGATCACTTTCCTTTATTATAGAGGCAGGTGCGTGCAATACTATGGGGGTCATCCCCATTCAGATTTTGCAATAGTTGGTTGTTGATAAAAGAGTGCAACAAGGTTATTGAGGAAGTCTTGAAAATCGCCCCCCTAGGCGTTAAGTGGGCTGGTTTCATTCTTCATGAGCAGCTTCGTAAAGTGAAACTTGCTGTTAAAAATTGGCATGCTACTCATTTGATTGATATAAAGCAGAAAGAGGAAAGGCTTTTGAGAGAGTTAGAAATCATTGATGGGCTTGCAAAAAGTGTTGGTCTGAATGAAGTGGAATTGGTTCACAACTCAGATATACAAACAGAGTTGCTTTGCTTATATGTATTTTTGCCGTTATGTTTTTATTTTTCAGCCTTGCTTTTTGTCTATCGTAGGTAGCTTCTGTTATTATTTTGATTGTTCCTTTGGTCTTTTCATGACCTTAGTATGTTTATTGTACTTTGATCTTTTCTTTGTATTTTGGATATGATGAGGGTGCTATGGGGATGTCATCCTAGTTGAGAAGTCCGGGTGCATCTACTTATCCTATCTATGTATCTCTTTCTCCCACATTTCTTGGCTTCCCTATTATTTTCGTTGTATAGCTCTCTTGTATATTGAGTTTATTAATAATAAAGAAGCTTGTGTCATTTTAAAAAAAAAATAGACAGCAAGCTTAGGATAAAAAAAGGAAAGAATATTAGGGTTAATCTTTCCTTGGGCCACTAATTCTAACACTCCCACTCAAGTTGGGACGTAAATATCAATAAGACCCAACTTGCTAACACATGAATCAAAGTTTTATCTGAGAATCCCCTTGGTGAGAACATCAACAACCTATTGACTTGAGGGGATGTAGGGAATGCATATGCTACCATTGTCTAGTCTTTCTTTAATAAATGTCGATCAATCTCCACATGTTTAGTTCTATCATGTTGAACTGGATTGTTTGCGATGCTAATAGCTACCTTATTATCATAGAATAATTTCATCGGCACCTCGTAGTCTTGATGAAGGTAAGATAAAACCTTATGTAGCCAAATTCCCTCATATGTCCCTAAATTCATAGCTCTGTACTCAACTTTAGCACTACTTCTGGCCACAATCCCTTGCTTCTTACTTCTCAAAGTTTTGAGATTACCCCATACAAAAGTACAATAGTCAAAGGTGGATTTCCTGTCAACAACGGTCCCTGCCCAGTTAGAATCAATATAGGCCTCAATAGCTCCTTTGTAGGTCTTCCCATCAACCCTTTACTAGGAGTTGTTTTTAAGTATCTCAAGATTCTGTTAGCTGCCTCCATGTGTTTCTCACAGGGGCTGCATGAACTGGCTAACGACACTCACAACATAGGAGATATTCGGTTTAGTGTGTGACAAGTAGATCAACTTTCACACACGGCGCTGGTATTTTTCCTTATCAACCGGAGCTTTATCACCTGCACTTTCCAGTTTACTGTTGAACTCAATAGGAGTATCAGTAGGACGTATACTTGATTTAGTCAGCAAATCAAGGGTATATTTCCTTTGAGATACAGAGATACATTCTTTTGATTTAGCTACCTCCATCCCAATAAAATACTTCAGATTTTCCAAGTCTTTAGTGTCAAACTCATCATCCATCTTCTCTTTCAGTTATATGGTCTCAACGATGTCATCCCAGACATAACAATGTCATAAACACAAACAATTAACACGGCAATCTTCCTAGCCTTGAAAACTTTCGTAAACAAAGTGTGATCATAGTGCTCCTGACTGAACCATTGGGACTTGACAAATGTAGTAAACCTATCAAACCATGCTCTCGGTGATCAAACTGGGCTTTAAACCATGGAGAGGGGCTCATATAGACTTTCCTCTTCCAAGTCTTTGTTGAAGAACACATTCTTGACATCTAGCTGATATGTGAACCAATCTTTGTTTACAATAGATAACAGGATTTTAAAGGTATTTAGTTTTGCAACTGGGGAAAAGGTTTCAGAGTAAACCCTTTCGCAATTAACTTGGCCTTGTGTCTATCAAGAGTTCCATTAGTATTTGAGTGTGAACACCCATTTTCATTCCACAGTTTTATGTCCCTTAAGGAGAGTACAAATCTCCCAAGTCTTGTTCTTTTAAGAGCTTTCATCTCTTCCATCACTGCAGTGTTTTACTCAAGACATTCGAAAGCAATGTGGATATTTTTAGGTATCATTGTATCATAGACCCGTTTTCTCAGTGTATACAGAAGAAAGGTGTTCTGACGCTTCAAATAGAGTTTTTGAGTAGTATCCCATAGGTCCTTCGCAGTTGTTGCATATAACAGAGGTTTGCCAATCTGTGGCTCCATACTATTGATCAACATGGACTAGATAAGAGAATCCTCCTCTCTCCAGAATCATTCCTAGGAGTCAGTAGGTGGGGTCGAAGAGTCTCCTTTGTCAGAAAACCAAATTGTTGGAGTCCCTAAAGAATCATTTTGGCCGATTGTGACCAGGGATTTTGTGGTCTTGGTTATTAGTATACCCTTATATTCTTTCATTTTCTTTTCATTCAATGGAAGTAGTTATTTCTATAACAAAAAAAAAAAAAAAAGGAATAGAAAAACCCTAAGATATATTTAGGATGTTTTAATGCATTCCATTTCATTATAAGTTTGGGAATATAGAGTGCATTCAAGAACTGACGTGCTTTTGAAGCAAAAAGGGTTCTCGTATACCTTTGTAATTCAGAAGAAACAAAAATTATAATGGTTATTCATTAGGTATCAAAACCTTTAGATTAAACTGAAAGATTTTTTTTATTTCATTTCTGTTGGAAAATAACGAGAGAACTGAAATAGTTCGTTGGCAGCTATTGTGGATTATGAGGTGAGTGTGTTGCCTGGCTCTTTGGTGGTCAGAATTGGAGGGGTATCTCCTTTTTTGCTTTACAGTAATTGAAAACGTCCACTAGCGACTTGCTTCTTGATATTGAAAAAAACTATCAATAGATAAGAAAAGGGGGAGGGTTGTGATGTTTTAGCAGGTCCACAATCACCTTGCTTCTTGAAAGAAAAATTTTCTTTTTCAAGTGAGGAAGATTTGCTCTCAAGTCTCGTCCAAGCAGTGGTATCTGGGATTCTACTGTACTCTCTCTCCTTTTTAGAATTCCTGCTTCGATCAATAAAAGCCTAGAGAATCTTATAAGGAATTTCTCCTAAGATGGGTTTGACAAAGGGATGATTTATCTTGTTAGTTGGGACATAGTTTCAAGGTCATTGAAGTTACCCATCCCTTTGATTAGGTCTTGAGAAGGATTTTCTTTTTAATGGAAACAAAAATTTAGTCGATATAATGAAAAAAGACTACTGCTTCAAAATTCAAGATCTCAAAATATAAGGAAAAAATAGGAGACAAACACAACTAATGAAGGACAAATGCTAAAAGAAGAAAAACTTCAACTAAGAAGCACAAGAAACTAAGAAAGAGAGTCCCATAAGAAAATAAGAGAAAATCCTAACCAAGAGAAACCAGGCCAAACAGAGAAACATCTATCCAAAAAACTGGCACAAAGTGTCAAAAACGTCTGAAAAATCCCATATGAGAGGGAAACGAGAGCCCAAGGCAAACGGGACTTCAAAAAATTCTAAACCTCAAACAATTCACAAAAATGCAAATAGCTCTAAATGAACAAGACGAGCACCCTTAAACTTTAGCTCCTTGAGAACTAGAGGGAATCACCTGCAACTGAAGACCACAAACCTTAACCAAAGAAGTGAATTTGTCAAGAAGCAAAGAAGGCGTAGGAGAAGCTGCAGCAAAAACCTAACATAAGTGAACCAATATCTTCACTTGCAAAACTTTGATTATAGACTTCTGCAAATATATTCTCTAGTAGGTTCTTCTATTGCACTAGTTCCTCCATTAGAACTTCATTACTCACACTTAACAATGATTTCTCATCATTTTTTGAAGGAGGAAGAACTTCTTTAGGAGAAGGGGGGATGTTACCACTTTAATAAATTGAACATCAAATGTAGGGAGAGAGGTAATAGATTTCTTAAAAAACTTGTTAGGATCCAAAGGCTGATGATGCTTGAAGGGACTGATGAAGAAATAAAAAAAATGAGTGGAAGGCTCATCTATTATCTCTCCCTAGAGAAGTAAGTTTCTGTTCCTCTAAAACTCCTGTTCACTTTTCCCTCATTCTCCTAGTTCTTTCAATGTCAATTGAGAATGCTTGAATACCTATTCTTGTGAAGATTACTGGGTGGGAAGAAATCTTTTGCACACCTCTACCCCATCTATACCACCGATTAGATATGCAATTACAGTCTAGGCCCCTTTCTTTTCCTTCTTCTTCGTTTGTTCTTTTGGGGTTTCATTGCCCTCTCAGAAATAGGGAATTGTCAATTTCTCCCTTACTTTGTTTAGCCTACTATTAAATTTTTGACAAATGCTGCACAGTTTTGTTTTTGTTTTTTTAGGGCGGTATTTTTCCTCCGAGGGTGATTAATAATGCTTTAAATTTATTTTGGAGCCTTTATTTCTTTATTTAACTTCATTTATGAGATTTTTCAACTTAAGTTTGTTAAAATCAAATGTGGAAGTCATATTATTATTATTTATTTGTATAAATGAAGATTCGATTCAATGTTTTAAAATTTAATAATATCACAATCAAAATTTGATTGTAACTATTTCAGGTTGTGAGGGAGTTTAATATGACATTGACGATAAATTTCACTCTTTAAACTTGCCATTGTCATCATAACATATAGGGAGTAGTGCCTAGTTTAAGAATGTTCATCATGATCAGACTTTTGGTAAACTTTTATACACCTTTCTCTACATCAGGTATTTTATTTACATTCTTCCCATCCAGCATCTCATGAGTAGTGAGATTTTTTAAGCAGACAGACTTTCATTCCGTAATATCTGCTGTATAAACTAATTTCTAGTGGAAAAAAAATTTCAAATGGATAATGCATTTGATGGTTTTAATGATACAGCAAGTGCTTATTGGCATTGATACTATGGAAATGATCAATGAGATGGAGTTGTTTGGTCCATCAGAAATTGATTTTGGATTTGAAGAGCTTGATGATGGAGCTTCAGATGATGGAGATGATGATGATGATGGTGATGGTGAAGATGAAGACGAAGACGACGATGAGGATGAGGATGATGACGACGACGATGCAGATGATGAATACAATAGGGTCTCTCTATATAAAAGAAAAAGTATACTGTTGGATTTCACTATTTTAGTTTGTTATTAAATGTTTCTCCATTCTGCAGGACTGGGTTTCTGTCATAGATGATGAAGATGATCAAAATCACTCTGATGAAACCTTGGGAGATTGGGCAAAACTGGAGACAATGCGTTCTTCACATCCGATGCATTTTGCCAACAAGCTTTCGGAGGTAATGATTTATGATGGATTACAATTTTTTCCCCCAGCGGGAACTACTAATGAGTCTTGGTCTCTCCATATATGTTAAATGAAGCATCTGATTTCCAATTATATTTATATTCATATGCTTGTTATGTCTTATAGATTGCATCAGATGATCCTATTGATTGGATGGAACAGCCCCCAGCAACTTTAGTGATTCAAGGTGTTCTAAGACCTGCATTCAATGAAGAGCAGACTGTTATCGAAAAGCATCTATCCAGCCGCCATTTGAGTAATGGTGACATAAACGAGGCTCAGGAACTTGAAGAGAACCTTGAAGGTCATGGTAGGATCAATCATCATGGTCATGAATCAAGTTCATCCAAAGATGGTTTAAACTTGATGGAGGCATTGGATGAAAGTATTCCAGCGAGTGAGGCTTCATTTTACCGGCTGGAGATGATAAAAGTTCAGCTATTTACAGGAAATTCACACCCAGTATGTTTAGTTTCTGTTTTATTTTGCAGTTTGTCGTTGATTTCTCTTTTACATTGTAGCATGTTTTCTTGAAACAGCATCATACACCACACCTTTTTCACCTGTAAGAATGAGTTCTCTTAGTGTACAGTTGCCGTGGCCTAATCTGAGTTTTTCTCATTTGGAAGCTACCGTTGTGCGTTTCCTCTCTAATACGAATCTTTGGCTCTTTCTTTGGGGGGGGGGGGGGGGGGGGGAGAGATAAAAAACCATTTCATTGATTCAATGAAATATCCAAGGGTATATGTATCTTGGACATTCATATTCGCACGCTATGCTTAATTTTCTGAAAAAATTATTAATGGTATTCATTACCGATTTTAAAAATAATTATGTCGTTCATCTTTCTATGGCGGTGGGTGGAAAAGTTGCTGCCTCAGAACTAAAGTAAGATATATTGCTCGTGACTGCTCCTTTGTCAGCTTCGGCCAGTGGAACTATAAACACACAACAGGAAATTAACTGATACAAACCACTTTGCCTTCATTTCTTGTATTAAATTTTTATTTGACCATGATATATTCTATCTAACAAAGAAAACCAGCACTTCCTGGGACTTGGGCGACTTCGCCGGTCATATTTTAGATACTTTATGCATTATAAATGTTTAATGCCTACAATTTGCAGTCCCTAAATTCTGGGCAGGCTTGTTGTTTTTCTTGAACCTGCTTCTGGTTGTTGCTGACAAAAATTCTTTGATCTTTCCCAGAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCGATTGCGCACTCAGCCGAAAAAATTATTTCTCGTCTAAGAGCAGGTGGAGAAAAGACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGTAAATATATAAATCTTCTCAAAATGAGAAATTATTTCAATCTTTCGTTAAGATATTTGCTTGTCTCTGAATTATTACCATTCATCTTCCCAAATGATACGTCTCCAAAAGGCTGTTCAGTTTATTATTTTTATTTAAATCTGATTATGTATTTTGACTTCTCAAAAATAAAAATAAAGAAGGATTATGTATTTTGAAATTTAATTAGTCATGTAATAGAATAACGACCTATTCATATTATAGAAGAGGTATGAGAATTTAGGTCTATGAGGTCATACCAATTACTCTGGGTATACTTTGGTTGACTATTAATTAGTCATTACTTACTTCATAGTGGACTTATTAATTTTATGGTTGGTTTTGAAAACTGTCAGTGTCTAAACATATGCCAAATTAGTAGAGGAATATAGAACGAAAGTAGCCTTCTTTCATTGGCTGTAACATAGAGATAAATCTACTAAGTAATGTACTTTTTTAGTTGTGTTGCCTGATTTGCGGTGTCGTGAACCGAAGTATTTTCTCAGTACACAGTTCACAGTTGAATAGTTCAACATGGATTTGAACATTGGCTATAATGGAATTTCACAGGAGTGTTCAATATTCAGGTTTTATATTTGTGTATATATATACACGTAATTAATTTTCCAAAGAAAGAAGCACATAATCGATGAATGAGTACTTATGCCGCCAATGGTGGAAGTTCTAGGACAGGAGAAATTTTCAATTCTAAAATTCTTAGCTATAAAGTTTCTCCAAAGACCTGTCCTTTCTAATTTTAGTTCCTTCATTTTTCATTCTAGTTCAACAATAGATATGCATATTATAATGATTCTCATGGTGCTTCGGATCTTTTGGTCTTCACTGAATAATTGCAGGAGGCAGTAATCAATGGTATCGATTCACTTGGATTTGATGTGAGGGTTTGCTCCGAAACTCAGGTCCAAACATTGCGGTTTGCTTTTGATACACGAGTAAGTCAACTATCACGTTAGTCTCAAGTTACTGTTTTTTCCTCTTGATTAATTACAACATATGATAACCAAAAAAGAAAATGCATATTCATTCATAGTTTGTGACACAATTGGCAGCTGCAATAACCATATACAATTACTAAAGCAGCAAATGCATAGTTATACGTAGTTTGTGACATAACTGGCAGCTGCAATATTTTTAACGTAACCAATTATGACGTGATTTGATGAACTTCCATCCATGGTGCTTTTCTCCGTTTTATAAATGGAGAATTGTTTGTCCTAATTGTCAGATTTGTGAAAACAATTTAAGAATGATTTGGAAGGACTTATCAGATTTCAAATATGCTCTTATGAAGCTTGATTAAACCGATTTTTATTCAAGTGCGGTTTTGCCTTTTGGCACACAGCCTTTAGACAGTCTACCGGATTTGGTTCAATCACTATACCCTATTTTTTTTTAACACAAAGAATGTTTCGATTTCCATTATGGCTTGGATCCATTTAGTTTTCTTGGTTCAGTTTCTTTCATCCTAATTGATGCTGCCCGTGCAGAGTAGAGCCATGAATTTGGACACCAAAATTTACCCGATTAGCCGATTTGGCTAATCTGTTTATATATATATATATATATATCACAGACCGGACGCATGTTGTCTCTTCTAAGATTCAGGCAGTGGATACTCTAGACCTTTAAGTAGCTCATCTAATCTTTTTCCCAATTCAAAACTAGAATGGAAAAAAACCAAACCCAAACCCTAGAAGGGAAAAAAACCAAACCCTAGAATTATCTAATATTCATCTTCAAAATGCAAAGAATAAGCTACCTAAATATCTTCAACTATTTAATGAAGCCAATGTTTTATTCTTTTCAGTGAATTACCTTTAGAATCTTTAGGTTAAATTTACATATATCCTGGAGTTTTCTGGAAAAGGAAAACGGATTATCTAACATATGCGTATCCTAGCTTTCGTAGCTGGATGGTATTGAATAATGATCCTCTCAAAAGAATGTTCAAGTATAAGATAAAAAATGTAAATTGAAAAACGGATTCATACAGCAAACCAGATAATGGTATGGGAGCCAAAGGAACGGAAGATTCAGAAGAGTCCAAACAAGACGGAAACTCCAAGTATGATCAACCACAAGATATGCACCAAGAGGAGAGCCTGGCCAGGAATAGGCGCAGAACCACCGTTGCCGACCTCACAAAATCCCATCTTGGCGACCTTGACACCTGAACACAAAGCATCGGCACAACCACACCAGTACGTAACGCCATCAACCCCGCAGACCGGGTCCGTTCTAAAGCATTTCACGGGGCAAGAAGAGGGCACCGATACAGGACAGAGATCTACATCGCCGTCGTTATTGGTGGCTTCAGAAGGCAAGCGGATGGCTGAGGAAACATCCTCAGATCGGACTGGAAAGACGAAGATGAGGAGTAAAAATAAGAAAATTATGGAAATTTGGGGTCTAGAGAAGAGGGTCATGTTTCGCGAACTTTGAAAAAGCTCGCGTGATTGGAGATCGATTGGAGATCATGAGCTTTTAGATTTAGGGCAGGAAGGAAGAGACCTCTGTCATGGAATTTTCTTGACAGTTAGGCAACGAACAAGGGGGGACAGCCTTAGCTAGGAATTTGGTGGCCCTACGCCCCCCTTTTTTTGTTTTTCAAAATGGGTTTCCTTTTTTCACATTTCTCACCTAACCATTTGAATTCTTACCTAATTTTTTTTTTTAAAAGAAGAAAGTTTTGGAATATATTTTAATATTCAAAATTTACCATTGAAATCTGTGGATTTTAACTACCCATATCTAATTACCATTGAAAATGATGTTTATGGAACTTGATTGTCAATAACAACAAACAAAAAACCAAATTGTTATGAAATGAGACATTTTAATATCTCAACAATAATTTTGTTTTTTTTCTAAATCATGGATAAATTACCCTCTCTTGGTGTTGAAGTTTTATATTTGTAATGGCGGTGAGTTGTCTTTGCAGGAAAATGCTTACTAGTGAGCTTAACCAATTAGTATGTTTGTGCAGGCAACCTCAGAATTCAGTGCGGAAAAACAGCTGAACGACTTGCTTTTTCTCGAGAATTAATTCCAAACATCAAAAAGTGAAACAAACATAACAAAATGATTCCTAAACATTGTAAGCCTTCTCACCACATTACAATTCAATTATGGGTGTGTTTGGCTTTCTCAGGTTCCGATCTTTAATGCTTCAAAGCAGAATGGAAGTAGGTAAATTTATTTTAAATTGTTGATTCATTAGATTATGGATCCGGTGATTGCGAACCCGCAATCATTCTTCAATTATTATCAATTGGCCTCTACCAACATCCCTTTTCTGCTGTTTTACTCTGCCATTAATCAGTTTCCTACCGTTTGCTTGCTTCAAGTCAATTTTTTTTAGTACAAGAGAAGTGAAAATTTAAGTCGATGATGACATGTTTTATTAGTTAAGTTAGACAGTAGACACAATCTGACTTTTAATTAAAGTGATTAAGCATGAAGGGCGTTAATCAAATTAACTGCTTTTAGATGAATTTCTCCTAGGTTAATTCCCAAGTTTAAACTTGGAGGTTTTATCAATTTGATTTGTGCCTGATATGTTCTCTTGACAAATTTGAACATTTTTTAAAAATTAGTATTGCTTTTAAGAATTTGGATTCTTGTTTAATTGTTTACTTTAAAATTTCTTTCCGTCAATATTAGAATTTAATTTTATTTCCCTTCTAACTGCAAAAACCTCTTCCTT

mRNA sequence

ATGATCGAAACTGCTCTGGCCGTTCGGTTTCCCGCCGGCGCAAACTTCTGCTACTCCTCTGCCGTGTCCTATCACCGTCCAGCTTGGACTTCGGAGGATGTAACTAGTATAGGCAATGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCACTTCCGACGTGCATTGGAAAAGATGCCAGAGACTTAATAGTAGGTCTCTCTTGGGAAGAAGCTATCTCAAAAAGATTGGAATCCAAGCATCAGCAGAGCCTTTGGGCTCTGCATCAGATCCAATTAAACAAAATAGGGGATTGCAGTATCATCCATCTGAGGAGCTTGTGAAATCAATAACTGAGATTGCTGATGATGTTAGACCTACCTCTGCAGAAACTACTAGAACAATCATTGAGGTAAATAGCAAAGCAACATTAATGTTTGCGGGTTTGATTAATGATGAAGTTCAGGAAAACATTATCTGGCCAGAACTACCGTATGTAACTGACGCACATGGAAATATCTACTTTCAAGCGAAGAATACTGAAGAGGCCATGAAAAATCTAACATCAGAAAATAACTTTGTGCAAGTGCTTATTGGCATTGATACTATGGAAATGATCAATGAGATGGAGTTGTTTGGTCCATCAGAAATTGATTTTGGATTTGAAGAGCTTGATGATGGAGCTTCAGATGATGGAGATGATGATGATGATGGTGATGGTGAAGATGAAGACGAAGACGACGATGAGGATGAGGATGATGACGACGACGATGCAGATGATGAATACAATAGGGACTGGGTTTCTGTCATAGATGATGAAGATGATCAAAATCACTCTGATGAAACCTTGGGAGATTGGGCAAAACTGGAGACAATGCGTTCTTCACATCCGATGCATTTTGCCAACAAGCTTTCGGAGATTGCATCAGATGATCCTATTGATTGGATGGAACAGCCCCCAGCAACTTTAGTGATTCAAGGTGTTCTAAGACCTGCATTCAATGAAGAGCAGACTGTTATCGAAAAGCATCTATCCAGCCGCCATTTGAGTAATGGTGACATAAACGAGGCTCAGGAACTTGAAGAGAACCTTGAAGGTCATGGTAGGATCAATCATCATGGTCATGAATCAAGTTCATCCAAAGATGGTTTAAACTTGATGGAGGCATTGGATGAAAGTATTCCAGCGAGTGAGGCTTCATTTTACCGGCTGGAGATGATAAAAGTTCAGCTATTTACAGGAAATTCACACCCAAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCGATTGCGCACTCAGCCGAAAAAATTATTTCTCGTCTAAGAGCAGGTGGAGAAAAGACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGAGGCAGTAATCAATGGTATCGATTCACTTGGATTTGATGTGAGGGTTTGCTCCGAAACTCAGGTCCAAACATTGCGGTTTGCTTTTGATACACGAGCAACCTCAGAATTCAGTGCGGAAAAACAGCTGAACGACTTGCTTTTTCTCGAGAATTAA

Coding sequence (CDS)

ATGATCGAAACTGCTCTGGCCGTTCGGTTTCCCGCCGGCGCAAACTTCTGCTACTCCTCTGCCGTGTCCTATCACCGTCCAGCTTGGACTTCGGAGGATGTAACTAGTATAGGCAATGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCACTTCCGACGTGCATTGGAAAAGATGCCAGAGACTTAATAGTAGGTCTCTCTTGGGAAGAAGCTATCTCAAAAAGATTGGAATCCAAGCATCAGCAGAGCCTTTGGGCTCTGCATCAGATCCAATTAAACAAAATAGGGGATTGCAGTATCATCCATCTGAGGAGCTTGTGAAATCAATAACTGAGATTGCTGATGATGTTAGACCTACCTCTGCAGAAACTACTAGAACAATCATTGAGGTAAATAGCAAAGCAACATTAATGTTTGCGGGTTTGATTAATGATGAAGTTCAGGAAAACATTATCTGGCCAGAACTACCGTATGTAACTGACGCACATGGAAATATCTACTTTCAAGCGAAGAATACTGAAGAGGCCATGAAAAATCTAACATCAGAAAATAACTTTGTGCAAGTGCTTATTGGCATTGATACTATGGAAATGATCAATGAGATGGAGTTGTTTGGTCCATCAGAAATTGATTTTGGATTTGAAGAGCTTGATGATGGAGCTTCAGATGATGGAGATGATGATGATGATGGTGATGGTGAAGATGAAGACGAAGACGACGATGAGGATGAGGATGATGACGACGACGATGCAGATGATGAATACAATAGGGACTGGGTTTCTGTCATAGATGATGAAGATGATCAAAATCACTCTGATGAAACCTTGGGAGATTGGGCAAAACTGGAGACAATGCGTTCTTCACATCCGATGCATTTTGCCAACAAGCTTTCGGAGATTGCATCAGATGATCCTATTGATTGGATGGAACAGCCCCCAGCAACTTTAGTGATTCAAGGTGTTCTAAGACCTGCATTCAATGAAGAGCAGACTGTTATCGAAAAGCATCTATCCAGCCGCCATTTGAGTAATGGTGACATAAACGAGGCTCAGGAACTTGAAGAGAACCTTGAAGGTCATGGTAGGATCAATCATCATGGTCATGAATCAAGTTCATCCAAAGATGGTTTAAACTTGATGGAGGCATTGGATGAAAGTATTCCAGCGAGTGAGGCTTCATTTTACCGGCTGGAGATGATAAAAGTTCAGCTATTTACAGGAAATTCACACCCAAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCGATTGCGCACTCAGCCGAAAAAATTATTTCTCGTCTAAGAGCAGGTGGAGAAAAGACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGAGGCAGTAATCAATGGTATCGATTCACTTGGATTTGATGTGAGGGTTTGCTCCGAAACTCAGGTCCAAACATTGCGGTTTGCTTTTGATACACGAGCAACCTCAGAATTCAGTGCGGAAAAACAGCTGAACGACTTGCTTTTTCTCGAGAATTAA

Protein sequence

MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQRLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPTSAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNLTSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDDDEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGHGRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN*
Homology
BLAST of CsaV3_3G017550 vs. NCBI nr
Match: KAE8650472.1 (hypothetical protein Csa_009877 [Cucumis sativus])

HSP 1 Score: 981.9 bits (2537), Expect = 2.2e-282
Identity = 513/513 (100.00%), Postives = 513/513 (100.00%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT
Sbjct: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
           TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD
Sbjct: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI
Sbjct: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM
Sbjct: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN 514
           ETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN
Sbjct: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN 513

BLAST of CsaV3_3G017550 vs. NCBI nr
Match: XP_004140749.2 (LOW QUALITY PROTEIN: uncharacterized protein At3g49140 [Cucumis sativus])

HSP 1 Score: 976.1 bits (2522), Expect = 1.2e-280
Identity = 510/510 (100.00%), Postives = 510/510 (100.00%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT
Sbjct: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
           TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD
Sbjct: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI
Sbjct: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM
Sbjct: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           ETQVQTLRFAFDTRATSEFSAEKQLNDLLF
Sbjct: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 510

BLAST of CsaV3_3G017550 vs. NCBI nr
Match: XP_008439307.1 (PREDICTED: uncharacterized protein At3g49140-like [Cucumis melo])

HSP 1 Score: 925.6 bits (2391), Expect = 1.8e-265
Identity = 483/510 (94.71%), Postives = 499/510 (97.84%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSAV YHRPAWTSED +SIGNASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSAVPYHRPAWTSEDASSIGNASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLGRS L+K GIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT
Sbjct: 61  RLNSRSLLGRSNLRKNGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTD HGNIYFQ KNTEEAMKNL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDEHGNIYFQMKNTEEAMKNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
           TSENNFVQVLIG+DTMEMINEMELFGPSEIDFGFEELDDGA++ GDDDDD DG+ EDED+
Sbjct: 181 TSENNFVQVLIGLDTMEMINEMELFGPSEIDFGFEELDDGATNVGDDDDDDDGDGEDEDE 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           D+DED+DDDDADDEYNRDWVSVIDDEDDQN+SDETLGDWAKLETMRSSHPMHFANKLSE+
Sbjct: 241 DDDEDNDDDDADDEYNRDWVSVIDDEDDQNNSDETLGDWAKLETMRSSHPMHFANKLSEV 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQGVLRPAF+EEQTVI+KHLSSRHLSNGDINEAQ+LEENLE H
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFSEEQTVIQKHLSSRHLSNGDINEAQKLEENLESH 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINHHGHESSSSKDGLNLM+ALDESIPASEASFYRLEMIKVQLFTGNSHPS+VEIEDLM
Sbjct: 361 GRINHHGHESSSSKDGLNLMDALDESIPASEASFYRLEMIKVQLFTGNSHPSDVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
            TQVQTLRFAFDTRATSEFSAEKQLNDLLF
Sbjct: 481 GTQVQTLRFAFDTRATSEFSAEKQLNDLLF 510

BLAST of CsaV3_3G017550 vs. NCBI nr
Match: XP_038880477.1 (uncharacterized protein At3g49140-like [Benincasa hispida])

HSP 1 Score: 874.8 bits (2259), Expect = 3.7e-250
Identity = 460/512 (89.84%), Postives = 486/512 (94.92%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSA+SYHRPAWTSEDV+SI +ASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSALSYHRPAWTSEDVSSIVHASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNS+SLLGR+ LKK GIQASAE LGSASDPIKQNRGLQYHPSEELVKSITEIA+DVRPT
Sbjct: 61  RLNSKSLLGRNNLKKNGIQASAEHLGSASDPIKQNRGLQYHPSEELVKSITEIAEDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLIN+EVQENIIWPELPYVTD HGNIYFQ KNTEEAM+NL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINNEVQENIIWPELPYVTDEHGNIYFQMKNTEEAMQNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
           TSENNFVQVLIG+DTMEMI+EMELFGPSEIDFGFEELDD A++DGDDDD      +DED+
Sbjct: 181 TSENNFVQVLIGLDTMEMIDEMELFGPSEIDFGFEELDDEATNDGDDDD-----RDDEDE 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           DEDED+D+DDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSE 
Sbjct: 241 DEDEDEDEDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEA 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQ--ELEENLE 360
           ASDDPIDWMEQPPATLVIQGVLRPAF+EE TVI+KHLSSRH   GDINEAQ  E+E+NLE
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFSEENTVIQKHLSSRHSITGDINEAQKLEVEDNLE 360

Query: 361 GHGRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIED 420
            HGRINHHGHESSSSKDG NL +ALDE+IP S+ASFYRLEMIKVQLFTG++HPSNVEIED
Sbjct: 361 NHGRINHHGHESSSSKDGSNLTDALDENIPVSDASFYRLEMIKVQLFTGHAHPSNVEIED 420

Query: 421 LMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRV 480
           LMKAQPDAIAHSAEKIISRLRAGGEKT QALKSLCWRCKGIQVEEAVINGIDSLGFDVRV
Sbjct: 421 LMKAQPDAIAHSAEKIISRLRAGGEKTAQALKSLCWRCKGIQVEEAVINGIDSLGFDVRV 480

Query: 481 CSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           CS TQVQTLRFAFDTRATSEFSAEKQLN+LLF
Sbjct: 481 CSGTQVQTLRFAFDTRATSEFSAEKQLNELLF 507

BLAST of CsaV3_3G017550 vs. NCBI nr
Match: XP_022141265.1 (uncharacterized protein At3g49140-like [Momordica charantia])

HSP 1 Score: 825.1 bits (2130), Expect = 3.4e-235
Identity = 431/511 (84.34%), Postives = 469/511 (91.78%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAV-SYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRC 60
           MIETALAVRFPAGANFC+SS   SYHR AW SEDVTSIG+ SSFCRLLHSC SDVHWKRC
Sbjct: 1   MIETALAVRFPAGANFCFSSTTSSYHRSAWISEDVTSIGHPSSFCRLLHSCASDVHWKRC 60

Query: 61  QRLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRP 120
           QRLNSR LLGR+ L++ GIQASAEPLGSASDPIKQN  LQYHPSEELVKSITE A+DVRP
Sbjct: 61  QRLNSRFLLGRNTLRRNGIQASAEPLGSASDPIKQNTRLQYHPSEELVKSITENAEDVRP 120

Query: 121 TSAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKN 180
           T+AETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTD HGNIYFQ KNTEE M+N
Sbjct: 121 TAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDEHGNIYFQVKNTEETMQN 180

Query: 181 LTSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED 240
           LTSENNFVQVL+G+DTMEMIN+MELFGPSE+DFGFEELDD A+ D  DDDDGD    D+D
Sbjct: 181 LTSENNFVQVLVGLDTMEMINDMELFGPSEVDFGFEELDDEATLDEGDDDDGD----DDD 240

Query: 241 DDEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSE 300
           D++ +D+D+DD DDEY+ DWVSVI+DEDD N SDETLGDWAKLETMRSSHPM+FANKLSE
Sbjct: 241 DEDKDDNDEDDTDDEYDTDWVSVIEDEDDSNDSDETLGDWAKLETMRSSHPMYFANKLSE 300

Query: 301 IASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEG 360
           +ASDDPIDWMEQPPATLVIQGVLRPAF+EE +VI++HLSSRH SNGDINEAQ+ E+NLE 
Sbjct: 301 VASDDPIDWMEQPPATLVIQGVLRPAFSEEHSVIQRHLSSRHSSNGDINEAQKHEDNLEN 360

Query: 361 HGRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDL 420
           HG INHH HESSSSKDGLNL + LD +IP SEASFYRLEMIK+QLFTG++HPSNVE+EDL
Sbjct: 361 HGMINHHDHESSSSKDGLNLADGLDGNIPMSEASFYRLEMIKIQLFTGHAHPSNVELEDL 420

Query: 421 MKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVC 480
           MKAQPDAIAHSAEKIISRLRAGGEKT QALKSLCWRCKGIQVEEAVING+DSLGFD+RVC
Sbjct: 421 MKAQPDAIAHSAEKIISRLRAGGEKTAQALKSLCWRCKGIQVEEAVINGVDSLGFDMRVC 480

Query: 481 SETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           S TQVQTLRFAF TRATSEFSAEKQLND+LF
Sbjct: 481 SGTQVQTLRFAFGTRATSEFSAEKQLNDVLF 507

BLAST of CsaV3_3G017550 vs. ExPASy Swiss-Prot
Match: Q0WMN5 (Uncharacterized protein At3g49140 OS=Arabidopsis thaliana OX=3702 GN=At3g49140 PE=1 SV=2)

HSP 1 Score: 389.4 bits (999), Expect = 6.2e-107
Identity = 242/516 (46.90%), Postives = 330/516 (63.95%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIE+ +AVR   G  FC S+A+  +R A +SE+  +  + +S           +      
Sbjct: 1   MIESVMAVRLSTG--FCSSTALLQYRTAPSSEEGGNCFHYASRRVFQPQRIHHIDGSGFL 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           + NS   + R +L+K   QA+AE + SASDP KQ    +YHPSEE+  S+ +   D R +
Sbjct: 61  KYNS-DYITRKHLRKNRTQATAEYVDSASDPEKQTGKSRYHPSEEIRASLPQNDGDSRLS 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
            AETTRTIIEVN+K TLM  G I D V ENI+WP++PY+TD +GN+YFQ K  E+ M+++
Sbjct: 121 PAETTRTIIEVNNKGTLMLTGSIGDGVHENILWPDIPYITDQNGNLYFQVKEDEDVMQSV 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED- 240
           TSENN+VQV++G DTMEMI EMEL G S+ DF  E+      + GDDD +  GEDEDE+ 
Sbjct: 181 TSENNYVQVIVGFDTMEMIKEMELMGLSDSDFETED-----DESGDDDSEDTGEDEDEEE 240

Query: 241 -----DDEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFA 300
                +DEDEDDDDDD DDE           +DD + SDE+LGDWA LETMRS HPM FA
Sbjct: 241 WVAILEDEDEDDDDDDDDDE-----------DDDDSDSDESLGDWANLETMRSCHPMFFA 300

Query: 301 NKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELE 360
            +++E+AS+DP+DWM+QP A L IQG+L     E+ + I+K L+  + +     +A+ L 
Sbjct: 301 KRMTEVASNDPVDWMDQPSAGLAIQGLLSHILVEDYSDIQKKLADSNSTTNGNKDAENLV 360

Query: 361 ENLEGHGRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNV 420
           + LE + +      E  SS+D              +  +FY+LEMI++QL T     + V
Sbjct: 361 DKLEDNSKAGGDESEIDSSQD----------EKARNVVAFYKLEMIRIQLITAQGDQTEV 420

Query: 421 EIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGF 480
           E+ED+ KAQPDAIAH++ +IISRL   G+K T+ALKSLCWR   IQ EE  + GIDSLGF
Sbjct: 421 EVEDVRKAQPDAIAHASAEIISRLEESGDKITEALKSLCWRHNSIQAEEVKLIGIDSLGF 480

Query: 481 DVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           D+R+C+  ++++LRFAF TRATSE +AE Q+  LLF
Sbjct: 481 DLRLCAGAKIESLRFAFSTRATSEENAEGQIRKLLF 487

BLAST of CsaV3_3G017550 vs. ExPASy TrEMBL
Match: A0A1S3AY35 (uncharacterized protein At3g49140-like OS=Cucumis melo OX=3656 GN=LOC103484131 PE=4 SV=1)

HSP 1 Score: 925.6 bits (2391), Expect = 8.9e-266
Identity = 483/510 (94.71%), Postives = 499/510 (97.84%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSAV YHRPAWTSED +SIGNASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSAVPYHRPAWTSEDASSIGNASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLGRS L+K GIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT
Sbjct: 61  RLNSRSLLGRSNLRKNGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTD HGNIYFQ KNTEEAMKNL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDEHGNIYFQMKNTEEAMKNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
           TSENNFVQVLIG+DTMEMINEMELFGPSEIDFGFEELDDGA++ GDDDDD DG+ EDED+
Sbjct: 181 TSENNFVQVLIGLDTMEMINEMELFGPSEIDFGFEELDDGATNVGDDDDDDDGDGEDEDE 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           D+DED+DDDDADDEYNRDWVSVIDDEDDQN+SDETLGDWAKLETMRSSHPMHFANKLSE+
Sbjct: 241 DDDEDNDDDDADDEYNRDWVSVIDDEDDQNNSDETLGDWAKLETMRSSHPMHFANKLSEV 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQGVLRPAF+EEQTVI+KHLSSRHLSNGDINEAQ+LEENLE H
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFSEEQTVIQKHLSSRHLSNGDINEAQKLEENLESH 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINHHGHESSSSKDGLNLM+ALDESIPASEASFYRLEMIKVQLFTGNSHPS+VEIEDLM
Sbjct: 361 GRINHHGHESSSSKDGLNLMDALDESIPASEASFYRLEMIKVQLFTGNSHPSDVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
            TQVQTLRFAFDTRATSEFSAEKQLNDLLF
Sbjct: 481 GTQVQTLRFAFDTRATSEFSAEKQLNDLLF 510

BLAST of CsaV3_3G017550 vs. ExPASy TrEMBL
Match: A0A6J1CHK0 (uncharacterized protein At3g49140-like OS=Momordica charantia OX=3673 GN=LOC111011706 PE=4 SV=1)

HSP 1 Score: 825.1 bits (2130), Expect = 1.6e-235
Identity = 431/511 (84.34%), Postives = 469/511 (91.78%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAV-SYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRC 60
           MIETALAVRFPAGANFC+SS   SYHR AW SEDVTSIG+ SSFCRLLHSC SDVHWKRC
Sbjct: 1   MIETALAVRFPAGANFCFSSTTSSYHRSAWISEDVTSIGHPSSFCRLLHSCASDVHWKRC 60

Query: 61  QRLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRP 120
           QRLNSR LLGR+ L++ GIQASAEPLGSASDPIKQN  LQYHPSEELVKSITE A+DVRP
Sbjct: 61  QRLNSRFLLGRNTLRRNGIQASAEPLGSASDPIKQNTRLQYHPSEELVKSITENAEDVRP 120

Query: 121 TSAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKN 180
           T+AETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTD HGNIYFQ KNTEE M+N
Sbjct: 121 TAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDEHGNIYFQVKNTEETMQN 180

Query: 181 LTSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED 240
           LTSENNFVQVL+G+DTMEMIN+MELFGPSE+DFGFEELDD A+ D  DDDDGD    D+D
Sbjct: 181 LTSENNFVQVLVGLDTMEMINDMELFGPSEVDFGFEELDDEATLDEGDDDDGD----DDD 240

Query: 241 DDEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSE 300
           D++ +D+D+DD DDEY+ DWVSVI+DEDD N SDETLGDWAKLETMRSSHPM+FANKLSE
Sbjct: 241 DEDKDDNDEDDTDDEYDTDWVSVIEDEDDSNDSDETLGDWAKLETMRSSHPMYFANKLSE 300

Query: 301 IASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEG 360
           +ASDDPIDWMEQPPATLVIQGVLRPAF+EE +VI++HLSSRH SNGDINEAQ+ E+NLE 
Sbjct: 301 VASDDPIDWMEQPPATLVIQGVLRPAFSEEHSVIQRHLSSRHSSNGDINEAQKHEDNLEN 360

Query: 361 HGRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDL 420
           HG INHH HESSSSKDGLNL + LD +IP SEASFYRLEMIK+QLFTG++HPSNVE+EDL
Sbjct: 361 HGMINHHDHESSSSKDGLNLADGLDGNIPMSEASFYRLEMIKIQLFTGHAHPSNVELEDL 420

Query: 421 MKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVC 480
           MKAQPDAIAHSAEKIISRLRAGGEKT QALKSLCWRCKGIQVEEAVING+DSLGFD+RVC
Sbjct: 421 MKAQPDAIAHSAEKIISRLRAGGEKTAQALKSLCWRCKGIQVEEAVINGVDSLGFDMRVC 480

Query: 481 SETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           S TQVQTLRFAF TRATSEFSAEKQLND+LF
Sbjct: 481 SGTQVQTLRFAFGTRATSEFSAEKQLNDVLF 507

BLAST of CsaV3_3G017550 vs. ExPASy TrEMBL
Match: A0A6J1E9X9 (uncharacterized protein At3g49140-like OS=Cucurbita moschata OX=3662 GN=LOC111430699 PE=4 SV=1)

HSP 1 Score: 807.0 bits (2083), Expect = 4.6e-230
Identity = 420/510 (82.35%), Postives = 461/510 (90.39%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRF  GANFCYSSA+S HRPAWTSEDVT IG+ +S CRL  SC SDV WKRCQ
Sbjct: 1   MIETALAVRFSGGANFCYSSALSNHRPAWTSEDVTCIGHVNSSCRLFQSCASDVQWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLG++ LKK GIQASAE LGSASDPIKQNR LQYHPSEE VKSITEIA+DVRPT
Sbjct: 61  RLNSRSLLGKNNLKKNGIQASAEHLGSASDPIKQNRRLQYHPSEEFVKSITEIAEDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVN KATLMFAGLINDEVQENIIWP+LPYVTD HGNIYFQ K+TEE ++NL
Sbjct: 121 SAETTRTIIEVNGKATLMFAGLINDEVQENIIWPDLPYVTDEHGNIYFQVKSTEETLQNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
            SENNFVQVLIG+D+MEMI+E+ELFGPSE++FGFEELDD  ++DGDD           +D
Sbjct: 181 NSENNFVQVLIGLDSMEMIDELELFGPSEVEFGFEELDDEVTNDGDD-----------ED 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           + D+ +DDDDADDEY+RDWVSVIDDEDDQN+SDETLGDWAKLETMRSSHPMHFANKLSE 
Sbjct: 241 EVDDGEDDDDADDEYDRDWVSVIDDEDDQNYSDETLGDWAKLETMRSSHPMHFANKLSES 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPID ME+PPATL+IQG LRPAF+EE TVI++HLSSRH SNGDI+EAQ+LE+NLE  
Sbjct: 301 ASDDPIDCMEEPPATLLIQGFLRPAFSEEHTVIQRHLSSRHSSNGDIHEAQKLEDNLENR 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINH GHESSSSKDGLN+++ L E+IP ++ASFYRLEMIKVQL TG++HPSNVEIEDLM
Sbjct: 361 GRINHQGHESSSSKDGLNMVDGLAENIPVNDASFYRLEMIKVQLCTGHAHPSNVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAI HSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIGHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
            TQVQTLRFAFDTRATSEFSAEKQL+DLLF
Sbjct: 481 GTQVQTLRFAFDTRATSEFSAEKQLDDLLF 499

BLAST of CsaV3_3G017550 vs. ExPASy TrEMBL
Match: A0A6J1JB77 (uncharacterized protein At3g49140-like OS=Cucurbita maxima OX=3661 GN=LOC111482881 PE=4 SV=1)

HSP 1 Score: 805.8 bits (2080), Expect = 1.0e-229
Identity = 419/510 (82.16%), Postives = 462/510 (90.59%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRF  GANFCYSSA+S HRPAWTSEDVT IG+A+S CRL  SC SDV WKRCQ
Sbjct: 1   MIETALAVRFSGGANFCYSSALSNHRPAWTSEDVTCIGHANSSCRLFQSCASDVQWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLG++ LKK GIQASAE LGSASDPIKQNR LQYHPSEE VKSITEIA+DVRPT
Sbjct: 61  RLNSRSLLGKNNLKKNGIQASAENLGSASDPIKQNRRLQYHPSEEFVKSITEIAEDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWP+LPYVTD HGNIYFQ K+TEE ++NL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPDLPYVTDEHGNIYFQVKSTEETLQNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
            SENNFVQVLIG+D+MEMI+E+ELFGPSE++FG+EELDD  ++DGD          DED+
Sbjct: 181 NSENNFVQVLIGLDSMEMIDELELFGPSEVEFGYEELDDEVTNDGD----------DEDE 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           D+D +DDDDDADDEY+RDWVSVIDDEDDQN+SDETLGDWAKLETMRSSHPMHFANKLSE 
Sbjct: 241 DDDGEDDDDDADDEYDRDWVSVIDDEDDQNYSDETLGDWAKLETMRSSHPMHFANKLSES 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPID ME+PPATL+IQG LRPAF+EE TVI++HLSSRH SNGDI+EAQ+LE+NLE  
Sbjct: 301 ASDDPIDCMEEPPATLLIQGFLRPAFSEEHTVIQRHLSSRHSSNGDIHEAQKLEDNLENR 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINH GHESSSSKDGLN+++ L E+IP  +ASFYRLEMIKVQL TG++HPSN+EIEDLM
Sbjct: 361 GRINHQGHESSSSKDGLNMVDGLAENIPVKDASFYRLEMIKVQLCTGHAHPSNIEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAI  +AEKIISRL+AGGEKT QALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIGRTAEKIISRLKAGGEKTKQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
            TQVQTLRFAFDTRATSEFSAEKQL+DLLF
Sbjct: 481 GTQVQTLRFAFDTRATSEFSAEKQLDDLLF 500

BLAST of CsaV3_3G017550 vs. ExPASy TrEMBL
Match: A0A6J1HTI8 (uncharacterized protein At3g49140-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467691 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 1.8e-221
Identity = 414/510 (81.18%), Postives = 451/510 (88.43%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSA+S+HR AWTSEDVT+IG+AS FCRLLHSC SDV WKRC+
Sbjct: 1   MIETALAVRFPAGANFCYSSALSHHRSAWTSEDVTNIGHASGFCRLLHSCASDVQWKRCR 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
            LNS+S L R+  +K GI ASAE LGSASDP+KQNR  QYHPSEELVKS +E A+DVRPT
Sbjct: 61  SLNSKSFLERNNFRKNGIHASAEHLGSASDPLKQNR-RQYHPSEELVKSRSENAEDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           +AETTRTIIEVNSKATLMF GLINDEVQENIIWPELPYVTD HGNIYFQ KNTEEAM+NL
Sbjct: 121 AAETTRTIIEVNSKATLMFVGLINDEVQENIIWPELPYVTDEHGNIYFQVKNTEEAMQNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240
           TSENNFVQVLIGIDTMEMI+E+  FGPSE+D GFEELDD A +D  DDDDGDG+ +DED 
Sbjct: 181 TSENNFVQVLIGIDTMEMIDEINWFGPSEVDLGFEELDDEALNDEVDDDDGDGDGDDED- 240

Query: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
               D D+DDADD+Y+ DWVSVI+DEDD NHSDET GDWAKLETMRSSHPMHFA KLSE 
Sbjct: 241 ----DVDEDDADDDYDADWVSVIEDEDDPNHSDETAGDWAKLETMRSSHPMHFAKKLSEA 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQG LRP   EE++VI++HLSSRH SN DINEAQ+LE+NLE H
Sbjct: 301 ASDDPIDWMEQPPATLVIQGALRPTRREERSVIQRHLSSRHSSNSDINEAQKLEDNLENH 360

Query: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRI++HGHESSSS         LD++IP +E SFYRLEM KVQLFTG+SHPSNVEIEDLM
Sbjct: 361 GRIDNHGHESSSS-------NGLDDNIPMNEVSFYRLEMTKVQLFTGHSHPSNVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           +AQPDAIAHSAEKIISRLR GGEKTTQALKSLCWRCKGIQVEEAVINGIDS+GFDVRVCS
Sbjct: 421 QAQPDAIAHSAEKIISRLREGGEKTTQALKSLCWRCKGIQVEEAVINGIDSIGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
            TQVQTLRFAFDTRATSEFSAEKQLNDLLF
Sbjct: 481 GTQVQTLRFAFDTRATSEFSAEKQLNDLLF 497

BLAST of CsaV3_3G017550 vs. TAIR 10
Match: AT3G49140.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 389.4 bits (999), Expect = 4.4e-108
Identity = 242/516 (46.90%), Postives = 330/516 (63.95%), Query Frame = 0

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIE+ +AVR   G  FC S+A+  +R A +SE+  +  + +S           +      
Sbjct: 1   MIESVMAVRLSTG--FCSSTALLQYRTAPSSEEGGNCFHYASRRVFQPQRIHHIDGSGFL 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           + NS   + R +L+K   QA+AE + SASDP KQ    +YHPSEE+  S+ +   D R +
Sbjct: 61  KYNS-DYITRKHLRKNRTQATAEYVDSASDPEKQTGKSRYHPSEEIRASLPQNDGDSRLS 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
            AETTRTIIEVN+K TLM  G I D V ENI+WP++PY+TD +GN+YFQ K  E+ M+++
Sbjct: 121 PAETTRTIIEVNNKGTLMLTGSIGDGVHENILWPDIPYITDQNGNLYFQVKEDEDVMQSV 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED- 240
           TSENN+VQV++G DTMEMI EMEL G S+ DF  E+      + GDDD +  GEDEDE+ 
Sbjct: 181 TSENNYVQVIVGFDTMEMIKEMELMGLSDSDFETED-----DESGDDDSEDTGEDEDEEE 240

Query: 241 -----DDEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFA 300
                +DEDEDDDDDD DDE           +DD + SDE+LGDWA LETMRS HPM FA
Sbjct: 241 WVAILEDEDEDDDDDDDDDE-----------DDDDSDSDESLGDWANLETMRSCHPMFFA 300

Query: 301 NKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELE 360
            +++E+AS+DP+DWM+QP A L IQG+L     E+ + I+K L+  + +     +A+ L 
Sbjct: 301 KRMTEVASNDPVDWMDQPSAGLAIQGLLSHILVEDYSDIQKKLADSNSTTNGNKDAENLV 360

Query: 361 ENLEGHGRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNV 420
           + LE + +      E  SS+D              +  +FY+LEMI++QL T     + V
Sbjct: 361 DKLEDNSKAGGDESEIDSSQD----------EKARNVVAFYKLEMIRIQLITAQGDQTEV 420

Query: 421 EIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGF 480
           E+ED+ KAQPDAIAH++ +IISRL   G+K T+ALKSLCWR   IQ EE  + GIDSLGF
Sbjct: 421 EVEDVRKAQPDAIAHASAEIISRLEESGDKITEALKSLCWRHNSIQAEEVKLIGIDSLGF 480

Query: 481 DVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           D+R+C+  ++++LRFAF TRATSE +AE Q+  LLF
Sbjct: 481 DLRLCAGAKIESLRFAFSTRATSEENAEGQIRKLLF 487

BLAST of CsaV3_3G017550 vs. TAIR 10
Match: AT5G24060.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 365.5 bits (937), Expect = 6.8e-101
Identity = 225/479 (46.97%), Postives = 301/479 (62.84%), Query Frame = 0

Query: 39  NASSFCRLLHSCTSDVHWKRCQRLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGL 98
           N SS C  L  C SD              + R YL++   QA AE LGSASDP K     
Sbjct: 43  NTSSGCGFL-KCYSD-------------YITRKYLRRNRTQAIAEYLGSASDPKKPTGKS 102

Query: 99  QYHPSEELVKSITE-IADDVRPTSAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELP 158
            YHPSE++   + E    D R +  ET RTIIEVN K TLM +GL+   V ENI+WP++P
Sbjct: 103 SYHPSEDIRAYVPEKNPGDSRLSPPETARTIIEVNKKGTLMLSGLLGIGVHENILWPDIP 162

Query: 159 YVTDAHGNIYFQAKNTEEAMKN-LTSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEE 218
           YVTD HGNIYFQ K  E+ M+  +TS+NN+VQV++G DTMEMI +MEL  PS I FG EE
Sbjct: 163 YVTDQHGNIYFQVKENEDIMQTVVTSDNNYVQVIVGFDTMEMIKDMELSSPSGIGFGIEE 222

Query: 219 LDDGASDDGDDDDDGDGEDEDEDDDEDEDDDDDDADDEYNRDWVSVIDDEDDQNH----S 278
           ++             DGE E ED+++ ++D+ +D DDE   +WV+V++D DD+++    S
Sbjct: 223 IE-------------DGESEVEDENKGDEDEGEDKDDE---EWVAVLEDGDDEDNYVSDS 282

Query: 279 DETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTV 338
           DE+LGDWA LETMR  HPM+FA +++E+AS DP++WM+QP A L IQG+L P   E+ + 
Sbjct: 283 DESLGDWANLETMRYCHPMYFARRMAEVASTDPVNWMDQPSAGLAIQGLLSPVIVEDHSD 342

Query: 339 IEKHLSSRHLSNGDIN-EAQELEENLEGHGRINHHGHESSSSKDGLNLMEALDESIPASE 398
           I+KH+S    +  D N E +  EE  EG G                N  E L      + 
Sbjct: 343 IQKHISGCISTGTDKNKERENSEEIFEGIGE---------------NESEILHVENSRNA 402

Query: 399 ASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKS 458
             +Y+LE+I++QL T   H + VE+ED+ KAQPD IA +++ I++RL   G+K T+AL+S
Sbjct: 403 IQYYKLEIIRIQLITAQGHQTEVEVEDVRKAQPDVIACASDGILTRLEEDGDKLTEALRS 462

Query: 459 LCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 511
           LCWR  GIQ EE  + GIDSLGFD+R+CS  Q++TLRFAF  RATSE +AE QL +LLF
Sbjct: 463 LCWRNNGIQAEEVKLIGIDSLGFDLRICSGMQIETLRFAFSIRATSEHNAEGQLRELLF 476

BLAST of CsaV3_3G017550 vs. TAIR 10
Match: AT5G24060.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 362.8 bits (930), Expect = 4.4e-100
Identity = 229/502 (45.62%), Postives = 309/502 (61.55%), Query Frame = 0

Query: 16  FCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQRLNSRSLLGRSYLKK 75
           F  S A+  H P   +ED    G  S F          V  +R  R +  +     YL++
Sbjct: 6   FFSSMALLRHCPVSNTED----GGGSFF---------HVAPRRTFRPHLLNTSSGKYLRR 65

Query: 76  IGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITE-IADDVRPTSAETTRTIIEVNSK 135
              QA AE LGSASDP K      YHPSE++   + E    D R +  ET RTIIEVN K
Sbjct: 66  NRTQAIAEYLGSASDPKKPTGKSSYHPSEDIRAYVPEKNPGDSRLSPPETARTIIEVNKK 125

Query: 136 ATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKN-LTSENNFVQVLIGI 195
            TLM +GL+   V ENI+WP++PYVTD HGNIYFQ K  E+ M+  +TS+NN+VQV++G 
Sbjct: 126 GTLMLSGLLGIGVHENILWPDIPYVTDQHGNIYFQVKENEDIMQTVVTSDNNYVQVIVGF 185

Query: 196 DTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDDDEDEDDDDDDADD 255
           DTMEMI +MEL  PS I FG EE++             DGE E ED+++ ++D+ +D DD
Sbjct: 186 DTMEMIKDMELSSPSGIGFGIEEIE-------------DGESEVEDENKGDEDEGEDKDD 245

Query: 256 EYNRDWVSVIDDEDDQNH----SDETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWM 315
           E   +WV+V++D DD+++    SDE+LGDWA LETMR  HPM+FA +++E+AS DP++WM
Sbjct: 246 E---EWVAVLEDGDDEDNYVSDSDESLGDWANLETMRYCHPMYFARRMAEVASTDPVNWM 305

Query: 316 EQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDIN-EAQELEENLEGHGRINHHGH 375
           +QP A L IQG+L P   E+ + I+KH+S    +  D N E +  EE  EG G       
Sbjct: 306 DQPSAGLAIQGLLSPVIVEDHSDIQKHISGCISTGTDKNKERENSEEIFEGIGE------ 365

Query: 376 ESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIA 435
                    N  E L      +   +Y+LE+I++QL T   H + VE+ED+ KAQPD IA
Sbjct: 366 ---------NESEILHVENSRNAIQYYKLEIIRIQLITAQGHQTEVEVEDVRKAQPDVIA 425

Query: 436 HSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLR 495
            +++ I++RL   G+K T+AL+SLCWR  GIQ EE  + GIDSLGFD+R+CS  Q++TLR
Sbjct: 426 CASDGILTRLEEDGDKLTEALRSLCWRNNGIQAEEVKLIGIDSLGFDLRICSGMQIETLR 463

Query: 496 FAFDTRATSEFSAEKQLNDLLF 511
           FAF  RATSE +AE QL +LLF
Sbjct: 486 FAFSIRATSEHNAEGQLRELLF 463

BLAST of CsaV3_3G017550 vs. TAIR 10
Match: AT3G59300.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 147.9 bits (372), Expect = 2.2e-35
Identity = 123/422 (29.15%), Postives = 195/422 (46.21%), Query Frame = 0

Query: 89  SDPIKQNRGLQYHPSEELVKSITEIADDVRPTSAETTRTIIEVNSKATLMFAGLINDEVQ 148
           SD +  +    YHP E+L  S  +   + + +++E  RT +E NS A L+F G I+ E  
Sbjct: 86  SDSVPDSSFYGYHPLEDLKPS--KRVQETKLSASEVARTTVEANSSAVLVFPGAIHCEPH 145

Query: 149 ENIIWPELPYVTDAHGNIYFQAKNTEEAMKNLTSENNFVQVLIGIDTMEMINEMELFGPS 208
           ++  W E  YV D +G+I+F+  + E  +++    +N V+   G+D     N       +
Sbjct: 146 DHNSWSEFKYVIDDYGDIFFEIPDDENILED-PGASNPVKAFFGMDVPRYENTRHHEEYN 205

Query: 209 EIDFGFEELDDGASDDGDDDDDGDGEDEDEDDDEDEDDDDDDADDEYNRDWVSVIDDEDD 268
             D G   LD    DD                                  +  ++D E  
Sbjct: 206 ISDIG--NLDQIIFDD---------------------------------HYFEIMDSE-- 265

Query: 269 QNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNE 328
              + +   DW   +T    HP++FA  LS+  S D    M+ P   + I G LRPAF +
Sbjct: 266 ---ARDIPIDWGMPDTSNGVHPIYFAKHLSKAISMDYDRKMDYPSNGVSILGCLRPAFLD 325

Query: 329 EQTVIEKHLSSRHLSNGDINEAQELEENLEGHGRINHHGHESSSSKDGLNLMEALDESIP 388
           E++ I +   S              E+  +    +    +  +SS+   N M        
Sbjct: 326 EESYIRRLFLS--------------EDRDDYSWEVQGDDNPITSSRRDENDM-------- 385

Query: 389 ASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQA 448
              +S YRLE++ ++L +     S++ ++D   A+PD + HS   II R    G  ++ A
Sbjct: 386 --SSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILVHSTSAIIERFNNRGINSSIA 439

Query: 449 LKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDL 508
           LK+LC + KG+  EEA +  +DSLG DVRV +  QVQT RF F TRAT+E +AEK+++ L
Sbjct: 446 LKALC-KKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRFPFKTRATTEMAAEKKIHQL 439

Query: 509 LF 511
           LF
Sbjct: 506 LF 439

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAE8650472.12.2e-282100.00hypothetical protein Csa_009877 [Cucumis sativus][more]
XP_004140749.21.2e-280100.00LOW QUALITY PROTEIN: uncharacterized protein At3g49140 [Cucumis sativus][more]
XP_008439307.11.8e-26594.71PREDICTED: uncharacterized protein At3g49140-like [Cucumis melo][more]
XP_038880477.13.7e-25089.84uncharacterized protein At3g49140-like [Benincasa hispida][more]
XP_022141265.13.4e-23584.34uncharacterized protein At3g49140-like [Momordica charantia][more]
Match NameE-valueIdentityDescription
Q0WMN56.2e-10746.90Uncharacterized protein At3g49140 OS=Arabidopsis thaliana OX=3702 GN=At3g49140 P... [more]
Match NameE-valueIdentityDescription
A0A1S3AY358.9e-26694.71uncharacterized protein At3g49140-like OS=Cucumis melo OX=3656 GN=LOC103484131 P... [more]
A0A6J1CHK01.6e-23584.34uncharacterized protein At3g49140-like OS=Momordica charantia OX=3673 GN=LOC1110... [more]
A0A6J1E9X94.6e-23082.35uncharacterized protein At3g49140-like OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A6J1JB771.0e-22982.16uncharacterized protein At3g49140-like OS=Cucurbita maxima OX=3661 GN=LOC1114828... [more]
A0A6J1HTI81.8e-22181.18uncharacterized protein At3g49140-like isoform X1 OS=Cucurbita maxima OX=3661 GN... [more]
Match NameE-valueIdentityDescription
AT3G49140.14.4e-10846.90Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G24060.26.8e-10146.97Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G24060.14.4e-10045.62Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G59300.12.2e-3529.15Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR037119Haem oxygenase HugZ-like superfamilyGENE3D3.20.180.10coord: 423..513
e-value: 5.9E-20
score: 73.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 213..255
NoneNo IPR availablePANTHERPTHR13343:SF28PENTATRICOPEPTIDE REPEAT (PPR) SUPERFAMILY PROTEINcoord: 2..510
NoneNo IPR availablePANTHERPTHR13343CREG1 PROTEINcoord: 2..510
NoneNo IPR availableSUPERFAMILY50475FMN-binding split barrelcoord: 309..509

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_3G017550.1CsaV3_3G017550.1mRNA