Cucsa.108220 (gene) Cucumber (Gy14) v1

NameCucsa.108220
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Locationscaffold00931 : 397624 .. 419965 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAGACGAAGAGCAGGTACCTCATCCTGATACACATTTTCTCTCGAAAATGCCCGTGGATCTCCTACTTCAATAACTCATTACCAAGAACAAAGAAGGTTAAAGAATGATCGAAACTGCTCTGGCCGTTCGGTTTCCCGCCGGCGCAAACTTCTGCTACTCCTCTGCCGTGTCCTGTAATTATCCCTCTCTTTCTTCCCCGCCATTACTTACTTCAGAAGACGAGCTATACTTTTCTTTATCCATGTTTTTTGGTCGAGTTATACGAAAGTGAAAGAGGAAACGTTTCTTTTGATTGCTTAACTTGCTTTCTGTTGAAATCAATGGAGCTTAAGAGTCCACGATTGCTTAACTTGCTTTCTGTTGAAATCAATGGAGCTTAAGAGTCCACATCTTTTGTTTTTGGCGTTCTTTTTCCTGCATTTTACATTGTTGCCAAGCTTACTAGCTGACTAATTTTCGTAGATGTCTTTCGAATTAGACGAAAGAGAAAGAGGAAACAATTGTTATTATTGCTTAACTTGAATTCTATTGAAAGTTTTGGAGTTTAAGATTCCACGGCTTTTCTTTTTGGCGTTCTTTTTTCTGCATTTTACATGGCTATCAACCTTACTAGCTGACTATTTTTCTTCTATGTTCTTCGAATTAAATGCAAGAGGGCGAGGAAACGATTGCTATTATTGCTTAACCTGAATTCTATTATAAGCTTTGGAGCTTAAGATTCCACAGCTTTTCTTTTTGGCGTTTTCTCCAGCATTTTTCTTTGCTGCCGAGCTGAGAATATTTTTCTTCTATGCTGTTGGAAATAAGGGAAAGCGAAAGAGGAAACATTCGTTTTGATTGCTTAATGTGATTTCCGTTTGAAACAGTGGAGCTCGAGATTCCATTGCCTTTCTTTTTGGTGTCCCCCCCGCATTTTCTTGGCTGCCAAGCTGAGACTAGTGTTCTATAATTTCTTTGAATTAAGTGTTAGTGGCAGATGAAACAATTTTTTTGGTTGCTTAAATTGAATGAAGCAAACGATTACGCAGCGTTTGTTTTTGGAAAATCCGGTGTTTTTTTGGATGTATAGCTGAGACTAATTTTCTTCCTTGTGTTTGGTTGTGTGCATGTACAGATCACCGTCCAGCTTGGACTTCGGAGGATGTAACTAGTATAGGCAATGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCACTTCCGACGTGCATTGGTACTTAGCTTGAGCTTGCTCCTTTCTTTTCCTCTTTTTATTATTTGTTTTTTATTAAATAATTTTGTCAATGAGTTTCTTGGTACCATATGCGTGTGGGGCAAAGCAATCACAGTTCAATGCAGCATGCGAGTGTTTAAGTAACCACGGGTCTCTAGTCCTTATACTGCTCCCACTGGCCAGAAATTGTAATAAATTGAATAAAATTCTGACTTGAATATAAATTTATGATTAGAAAATACTTAGCCTAATGCAATGAGAAAGAACTCTTAATTGACAATACTTTTTTTATTCTTATAAGCCACTGTAATATGTGGTATTTACGGAATCGCAATTGGCTGTCAGGCTACCAATCCCAAATATCAAATAAAACACACAAACAATGGCAAACAGAGACGGTACTCTTGTGTTCTTTATTTATATATAAATGCCAAGATGTTCACAAGCTACGATATAAAGATAACAAGGAAAGAAAAACATAAAAGAAACATAATAGTCCCCTTTTACATCAATGGAGACCCTTGCCGGCTACAACAACCTACTCTCTTAAAGATAACAAAAAGAAAACCCAGCTACACTCTCTCTATCGATCACTATTTAAAGCCTTTCCTAAAATTCTTCCCTGTGGGCCCTACTCTACACAATCTTTTTCTCCAATCATTCCTCCTCATTAATATTAGGTGTATTTACATTTCTACCCTTTCTCACATACGTATGAATGATTGGGGGCCTTACAATACCCCTTGGTTCTAGGTTCACCTTGTCCTCAAGGTGAAAGGTGAGGAACTACTGGTTCATCTGGTAAACGGCTTCCCACATAGCTTCAGTCTCGGGATCTTTTCACTTTACCAACCATTCGTTGCCTTCGAGGTCCTTGTTCCAACGAACTCCTAGCACAGTTTCCGGCCATAGTTGCAGCTCGAAATCTTCAGTCAATATGGGTTGTTGATGTTGAACCACTTGCTGATTTCCTAACTTGAGTTTTAGCTGGGAGATGTGGAAGACATTGTGTATCACTGCTTCTGGGGGAAGTTGGAGTCTATAAACTACTTCACCTATTTCCTCAATTATCTTGTACGGCCCATAGAACTTAGGAGCCAATTTCTCACTCCTCTTGCATGCTAAGGAACGTTGCCGGTAGGGTCTCAGTTTCAGATACACCTCATCCCCAACCTTGAATTTGAGCTCTCTTCTCCTAGTATCCGCCATTTTTTTCATCCCATTTTGGGCTATGCTCAAATTCTCTTTCAATGCATTCAGTGCCAAGTCCCTTTCCTTCAGCATAGATTCTACCTCATCGTTCAGCGTTTTTTTATTCCCATATGATAACAGTGGAGGGGGTTGTCTTCGATACACTATCCGGAAGGGATTGCTTCTCGTGGAGGCATGAAAGGTGGTATTATACCACAACTCAGCCCATGGAATGAATTTATCCCACTTATGGGGTTCATTGCAGAAACATCTCAAATATGTTTCTAAACACTGATTTACCCTTTCGGTTTGCTTGTCGGTTTGTGGGTGAAAGGCTGTACTTCTCTTCAAGACTGTACCCATAGTAGCGAATAATTCCTTCCAAAAATTGCTGATCAAGATCTTATCCTTGTTTGAGATAGTCGATTTTGGCATGCCATGCCTGCTGCTTATTATGTCAATGAAGAATTTAGCTACTCTTTTGGCTGTAAAAGGATGCTTCAAGGTAATAAAATACGAGAACTTGTTGAGTCTATCAACTACCACCATGATCACATTCATCCCTCCAGCTTTTGACAGCCCTTCAATGAAGTTCATGGACCATTCTTCTAGAATTTTCTCTGCTATTGGAATTGGTTGTAAAACCCCTGCTGGCTTGGTTGCTTCATACTTATTCCTCTGGCAGATCTCACATTGCTCAACATAGTTTTTGACGTCTGTTTTCATTCTTTTCCAATGTAGTTCTCCATTCATCCTCTTATATGTCCTTAGAAATCCGGTGTGACCTCCTAAAACTGAATTATGAAACGTATGAACAAGTTCGGTATCAGTGATGAGTGTTTGGACAACACTATCCTGTTTTTATACCATAGTTTCCCATTCTCCCAACAATATTTACTTGTCTCTTTAGTCTTTTGTTTCAACTCTTCTATGATCTTCTTAAGATCTTCATCTTGTTGTACCTCCTCTTCAACTAACTCCATATTCACAATTCCAGTGGTAGTCATGGTATTGAGTTCTAGGGGTTGCTCTATCCGAGATAGGGCATCAATAGCTTTGTTCTGTAATCTCAGTTGATACAAAATCTCAAAATCATATCCTAGGAGTTTGGTCAACCACTTTTGGAACTTGGGTTGTGCCTCTCTCTGTTCTAAAAGAAATTTCAACGCTTTCTGATCAGATATGATTGTGAACTTCTTCCCTAGAAGATAATGTCCCAATTTCTGCGCAGAGAGCACTACAACCATAAGTTCCCTCTCATTTATAGATTTGGTTTGTGCCCTAGGAGATAGTTTTTGACTAAAGAAGGTGATGGGATGATTGTTCTGAGATAACACAGCCCCTAATCCTATTCCAGATGCCATCTGTTTCAACTATAAAAGGTAGGTTCCAGTCAGGTAGTGCAAACACTGGTATGGTTGTCATTGCTATCTTCAACTTATTGAAGGCAATTGTGGCCTCTTCATTCCATAGGAAGGAATTCTTTTGTAACAGTTTAGTTAGGAGTCACAATTTCCCCGTACCCTTTCACAAATCTTCGATAGTATTCGGTTAAGCCCAAGAATCCTCTCAAACTAGTAACATCCTTCGGTTGCGGCCAGTTCACCATATCTTGGATTTTCTCTTAGTCGGCTTCTACACCCTTATTGGAAATCAAGTGCCCTAAGTACTGGATTTGAGAATAAGCTATAACACACTTTTTCTTGTTGGCAAAAAGCTAATGATAACGTAGTACCGCAAATACCATTCCTAGGTGCTTCTCATGTTCCGTGATATCTGAACTATACATAGGTATATCGTAAAAAAAAACCCATACACAACGCCTCAAAAAAGGTTTAAATACCTGGTTCTTTAATGACTGAAAGGTGGTAGGTGCGCTCGTGAGGCCGAAGGGCATAACTACGAACTCATAATGGCCTTCGTGAGTTCTGAAAGTCTTCTCAATATCTTCCTTCTTATTTGGTGATACAGACTTCAAATCCATCTTTGAGAATATAGTGGCTCCTTGCAATTCATCCCGCAGTTCTTTGGGATAGGAAATTTGTCAGAGGTAGTCACCTGGTTTAGCTTTCGGTAATCTACACAAAATCTCCACCCTCCATCCTTTTTCTTCACTAACAAGACTGGACTGGAATAAGGACTATGACTGGGTCTTATCACTCCTGCTTGAAGCATCTCCACAACTAATTTCTCAATCTCTTGCTTTTGAATGTGTCCATATTTGTAAGGCCTCACGTTAATCGGTCTCTGTTAAAGCATCACCATGATTCGATGATCTACTTCTCTCTTAGGAGGTAACCCTTTTGGGTTTTCGAAAATGTCTAAATACTGTTGCAACAAGAATCTCATGGGTGTATCTTCTTTGTCCCCCTTAATTTCTTGTTCTTCTTCCAGGTCGTCATTCATTTCTATGTCATAGTTTTGTAGTTCCAAGAGGAAGCCTTGATCTTCCTTTTCCCATGTTTTCTCAATGGTTCTCAGGGAACATTCGACTCTAATAAGGGATAGATCCCCCTTAAGGCTAGTTTGTTTTGTCCCTATCCAAAACGTCATATTTATGGAAGGCCAATGGATCTTCATAGTACCAGTGGTAGCAAGCCACTCCATCCCCAAAATTACATCCACATTCCCAGTTCAATAGCTAGAAAATCAGCGACCACGATGAGTCCTTTCAGTTTCAGCTCTAACCTCCTACACACTCCTCTTCCTTTGCACCTCGCACCATTCCCAATAGTTACTCTGAATTGAGTTCCCTCCTCCAACTGTATAATTTTTGTCTCTACTGTCTTGAGGTGTATAAAGTTGTGGGTGGCTCCACTGTCGATTAGTACCACCACTTCCTTCCCTCTTATATCCCTTTATTTTCATGGTTCCCTTATTTGTCAATCCACTGATTGTCTTCAGCTCTACTTTGATTCCATCTGTTTGGTTTAGTTGTTTCAATTTCACCACCTCCTCTGGACTTTCCTCTGTTCGATCTTCTTCCTCAACACTCTCTTCTTCATTCATAATGAATAACATCAACTCCCTTTTTTCCTTCATTGTGCATCTATGTCATGGTGAGTCCTTTCATTACATTTAAAACATAAGCCCTTGTCCAGCCTGGCCTTAAACTCTGCATCGGACAAACGCTTAACAGATGGTTCATTCTTTTTGTAATTTCCCTTAATGGGTATGGTTACCTGTTTCATTGGAAAATCCGTTTATCTCAATATCCCTTTATCCGTTCCTTCTTGAACCTTGCTGGTATGCCCTTCTCCTTTTTTTTGTTCTCCCAACTTCCAGTATGTCTTAGACAATTTCAATGCTAAGTTCCGATCTTTGACTAGTTGAGCTTCTCTCATACAATCTTCCAACGTTTGTGGATACCTGCTGACTACCTCTGCTTGAAGTTTGGGTTCTAACCGGTTAAAAAAGCATCACGTAGAACACTCTCTGCCATGTGTGGTAGAGGTGCTGAATAAGTAACAAACTTCTTCACACAATCGTTATAAGAACCATCTTGTTGAATTCAAATCAAATGAGCAACCAAACTTTTTTGCCCCAAGTCCTTAAAGAAATCAAACATCCTCCCCTTAAAATCTTCCCATGACTCCACCTTCTTCTATTATGAGTCCATCTATACCAATCTACTTCGTTCGACCCAAAACTTACCACAACCACTTTGATTTTCTCTGTTTTTGGTAGATTGTTGATTTCAAAGAAGTGCTCAGCCCTATATACCCAAGATTTCGGATTTTTGCCCAAGAACATAGGCATTTCCAACTTTTTGTACTTACTGCGGTCTACTGTGTTCAAGTTGATCTCAGTGGTAACTTTTGTTTCATCTATTTTTCCTTTCAATCTCATGACCGAACCATCTGATGTCCCTGACTCATCTTTTTTTTTTATAGATATGGTTCTCCCTTAGCTCATCCGCCATCCTCTCCATTGATTTTTTCATTTCCATTAGTATTTCTTTCATACCGAATATCTCCTTCTCAGTCTCTTCCACTCTTTCCTCAATCTGTCTTTGCGCCATGAGTTGCATAACCACCCCTAGCTTTCGGGCTTTGATACCAAATGTTAGGCTACCAACCCCAAATATCAAATAAAACACACAAACAATGGCAAACAAAGACGATACTCTGGTGTTCTTTATTTATATATAAATGCCAAGATGTTCACAAGCTAACGATATAAAGATAACAAGGAAAGAAAAACATAGAAGAAACATAATAGTCCCCTTTTACATCAATGGAGACCCCTGCTGGCTACAACAGCCTACTTTTTTAAAGATGACAAAAAGAAAACCTAGCTACACTCTCCCTATCAATCACTATTTAAAGCCTTTCCTAAGTTCTTCCATGTGGGCCCTACTCTGCACAATCTTTTTCTCCAATCATTCCTCCTCATTAATATTAGGTGTATTTACCTTTCTACCCTTTCTCATATACGTATGAATGATGGGAGCCTTACACAAGCATCAAAAGGACTAATGTTATTGGCAACTGTGTTGTTTGAGATCTTTCAGAAATGATTGTGTATCTGACGGTTACTCCATCAGAGTGGTGCTTAGATACATTATTTGTGTGTGTGTGTGTATAATTATTGTGTACAGAGGACATTGACCGAATTGTACCTTTTCTATTTGTGAACATCAAATTTGATTCAGTCTATGCATGTAAAAGGCAATAGGAGGACTGCACGAATGGAATCCTAAAACCTGATAACTAAATTGCAGGAAAAGATGCCAGAGACTTAATAGTAGGTCTCTCTTGGGAAGAAGCTATCTCAAAAAGATTGGAATCCAAGCATCAGCAGAGCCTTTGGGCTCTGCATCAGATCCAATTAAACAAAATAGGGGATTGCAGTATCATCCATCTGAGGAGCTTGTGAAATCAATAACTGAGATTGCTGATGATGTTAGACCTACCTCTGCAGAAACTACTAGAACAATCATTGAGGTATCAATTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTAGAAGGAACGTTGTTTGCATTTGTGCTTTATGCACCAACCAATTTTACCACTGGAAGCTACTCATTTGTTTGAATTGTATATTCTTTTTCCATCACGTTCTCAAAGACTGAAATGGAGGTCAAGGACTGGAATTGCTAATCTTGAAGTTTGGAAGATTATCTTAGAAACAAAAACAATGGACCCAATTTTTTCACTACCTTTTTACTTAAACTCTGTTTTTTCCTCCCTCTCTTTAGTACAAAACTTTATTTTTGCCCCTAGTACAACAAAGATAAGAATGGCAATTTGAACCTCCTACTTCAAAGGGAAAGTGGGTGGGAGCTATATTAAAGGGAGCTGCTTTTGTAATTCTCTATCGTATGTAGGTTGCATGATTTCGATAATTAATATTTTCAATAGCATTTCATGTTCAGGTAAATAGCAAAGCAACATTAATGTTTGCGGGTTTGATTAATGATGAAGTTCAGGAAAACATTATCTGGCCAGAACTACCGTATGTAACTGACGCACATGGAAGTATGTCCTACATTTTAGAGTTCACTATTCTCAAACTCAAGGCCATCCTGGGCCAACCTTCCCTATTATTTTATTGATTATGTTATTTTAATATATTGCAGATATCTACTTTCAAGCGAAGAATACTGAAGAGGCCATGAAAAATCTAACATCAGAAAATAACTTTGTGGTATGGATTTTGACTTACATATTAAACATAATGTTTTCACATCTCTCACATCTTGTCTTAAACCAATAATGAGTTCAGTTGTTTAATTTTCATTATTCCAAATTTTTACTTGCCATGTGAAGGTTCCCTTGTGATGTTAGTCATAGTGTACGCCCATAGGGTATTCTTTACTATTGAAAGCGGTAAAAGAAAATGTGACATCGAGTTACGTGGAAACCCGAGTATCGAGGGAAAAACCACTATATGACTTTCTTATAATTATTATTTCATATGAACAAAAGGTACAATGGGGGAAAATTAAATAGACAGCAAGCTTAAAGCTACATTTTGGCAGCCAATTGTTGATAAAGTGCAGAGAAAGTTAGACAAATGGAGAAGATTGAATAGTGAGAAGTGAAAAGGTAAGTTCTCTCCCCTCTCTCCTCTCTCCTCTTTCCTCCCCGGCAGCCAGCCCCTCCGGCCACCACCATCTCTCTAAGAATGGAGGTTTTTAGCTGCAAGATCAATCAGTTTCATTACTGCATATGGAAGGAAAATAATTATTGCATCATTGAAGACACGGAAGCCCATAGAATCCTTCCCCTATCTGATAACCAAACCCGTTGGCTCCAAGAAGCCATTGTTGAACTTCTCAAATTTCCAGCTGATCATTTATTTAGAAAGCACGCAAACATTGATAGAGGCCGGCTAAAAGTCTCGAAGTTTCAAGCAAATTCCAGCAGGATTTTATGTTGTGACTACTGGCCATATTCAGGAGGATATTCTATGATTAGAGTTAGTTCATGTGAAAATTTGCAAGGTTGGTGGTCTTTCTGTAACATGTTACTTAAATTTGTAGAAAAGGTCAACTACACCAGTTGGTTTTCAGATAATTCCCCAAAGACTTCTCCTCAGTTTCCAGTGAATAGACCTTCCTATGTAGAAATGGTCAATTCTTCTAGGACTACCTCTCCAATAAAAAATCCCCAGGACAAGGTACCTTGTCCTCGCAACACCCCGCTTAGAGCTCATCAAGTATCTGTCCAGCCTTCTCCCCCTAAACACAACCAGTGGCTTATCAAGAATCATGAAGTGGTAAAGGTTGATTTTGAAAATTTATGGATTATTATAAAACTGTTTGCTTCCGATGCTTGGAGATTGATTCGCAAACAGTTGGAGACGCTTTATCAATCTAAAATCATTATAAACCCCCTCTATGACGATAATGCTCTCATTAGTTTTGACAAAATTCCATCCAAGGATCTCATCGGAAAGGAAAAGAAATGGCAGGCTTGGGGCGAATTCCACATCAAGTTTGAAAAATGGGACTCAGTTCGACATAGTAGGCCCCTTGTGCTTAAAGGTTTTGGGGGCTGGATGAAAATAAAAAATCTTCCTTTAGAGGGGCAACATCTTTTTATCTTAAGAGGGGCAACATCTTTTTACACTATGGTGATTTTGAGTTTTTATATCCAGTCAATACACGTTCAGCCCCCATATTGCAAGGGAAATTTAGTAATTCAATTGACCGGTTGTGTGTCAAGGAAGTTTTGATTGACGAAGATCTTGACATTTCCTCCTTTCCTCCGGATATCAAAGTACCAAAGTCAGCGTTCGCGGTTCTCCCCAGTGATAGGAATCCGTTCAACCTCCTGAAGCACTTTCCGGTCAGTTCACCGGTGCCATTGCCGGAGAAGAAAACAGAAATCGGTGAACAGCTGGCTGTTCTTTCAAATGAGCGAGTAACTGAACAGTTTTTGTCTTCCTCCAACCCCTTGCAGCACAGCTATCTCCAAAGAGCTATGGGGCCCACTGATTTTAAATTAAACCCTCTTTTTGCAGCTGCTAATGACCCATCAGGTGAGAGAAACGTTGGAAAAGGCCCCCTAATTACTTCCTTGAGTATTAATGAGCCTCCAACCTCTTTATTTAAGTCCTCGGCCTATCCATTTCTGCAGCCAACTCCTGAACTGCCCAATGCCACAATAACTCCCTCTTCATTCTCCCCTAGGCCCCCCTCTGTTTTCAAAAAACAGAAATTTGAAGAGTCAGGAAAAGAAAATATCTCTTCTGCCTCTAATAATTCGATCCTTTTTGCCAGCTTTTGTCAAATTCCTAATTATCCCTCCAACAAAATTAATGATTCTGTTGGAGCGGCCTTTTCTTCCACCACAGAAGGATCCCCAAATAATTTTTGTTCAAGCAAAGGAAAGAAGAGTAAAATTTCCTCAATAGTACCGTTGAAACAATTTATTAGAAAGGACACTTCATTTGATAATGCTTGGACAGCTTTACTAAAGTCTGAAGCTATTGCAGCATGTTCCTTCAAAGATCTGGAAAAAGATTTAAAAAAGAATCTGCCAAGTAAGACCTCCTTATCAGAACAAAATCCAGATTTCTTGGAAATATGTACCTCCAAAATACAATCAGCTTCATATAATATCAGGTCTATCTCTCCCCGCCCCGCTTCTGACCATTTTAATTACTCAAGCTTCACTGATCCTAACACTAAAGTCACTTTAGTATGAGGAACTCCATGCCATTCAGTTGAGTCAGTGCAGAAGCTATCAAATACTGAAATTATCTCCCCCTATAGTGTCAGCAGTGAAGACTCCCCGGGTTTAACTCACATTCAAGATGATAATGCAAATGATGAAATTGAAGCAGTGGATCTAATTTTCCTTTTCAAAGTGGAAGGTGGTGTTTTTAGTAATTTCCCTTGCTCTTCCCCAATCAAATCAGTAGAAATTCCAAAGGAATTGTTGCCAATTATTAAAGACTGTGAAGTGATGCTGGTTTAAGTAAGTCAGTGTAATGTATATGCTCAAAATTTTCACCAATGAAGATCATTTCCTGGAACACAAGCGGCCTAAAAGATCCAAATAAGCATTCCGCTCTTAAGAAGTTTATAAAAAATCATCACCCAGACATGGTGCTAATCCAAGAATCAAAAATGGAAATTCTGGAAGTAAATTTCATTAAAACAATTTGGAGTTCTATGGATATAGGATGGGAATCACTGGAATCTTATGGCGCTTCCGGAGGCATTCTTACCCTATGGGATAAAAGTAAAATCACAGTGGTGGAAACCATAAGAGGACATGGGTTACTTTGTGCAAGAAGTGTGGTTGGGTTACCAATGTTTGTGGTCCGTGTGGTTATAGAGAGAGGAAACTTGTTTGGCCAGAATTATTAACAATTACAGAATGTGGGGAAGAGTCTTGGTGTTTGGGTGGAGATTTTAATATCACTAGATGGGTCTATGAGAGGTTTCCAGTTGGCAGAAGCACAAGAGGGATGAGACAGTTTAATGCCTTTATAGAATCTGCCAATCTAATGGAAATTCCCCTTCAAAATGGTAAATTTACTTGGTCAAGAGAGGGTGGCACTGCTGCAAGATCTCTGTTGGACAGATTTTTTATTAACAATAAATTGAATTTGAAAACTCAAGAGTAACCCGTAAAGCAAGAACCTTCTCCGATCACTTTCCTTTATTATAGAGGCAGGTGCAATACTATGGGGGTCATCCCCATTCAGATTTTGCAATAGTTGGTTGTTGATAAAAGAGTGCAACAAGGTTATTGAGGAAGTCTTGAAAATCGCCCCCTAGGCATTGGGTGGGCTGGTTTCATTCTTCATGAGCAGCTTCGTAAAGTGAAACTTGCTGTTAAAAATTGGCATGCTACTCATTTGATTGATATAAAGCAGAAAGAGGGAAAGGCTTTTGAGAGAGTTAGAAATCATTGATGGGCTTGCAAAAAGTGTTGGTCTGAATGAAGTGGAATTGGTTCACAACTCAGATATACAAACAGAGTTGCTTTGCTTATATGTATTTTTGCCGTTGTGTTTTTATTTTTCAGCCTTGCTTTTGTCTATCGTAGGTAGTTTCTGTTATTATTTTGATTGTTCCTTTGGTCTTTTCATGACCTTAGTATGTTTATTGTACTTTGATCTTTTCTTTGTATTTTGGATATGATGAGGGTGCTATGGGGATGTCATCCTAGTTGAGAAGTCCGGGTGCATCTACTTATCCTATCTATGTATCTCTTTCTCCCACATTTCTTGGCTTCCCTATTATTTTCATTGTATAGCTCTCTTGTACATTGAGTTTATTAATAATAAAGAAGCTTGTGTCATTTTAAAAAAAAAATAGACAGCAAGCTTAGGATAAAAAAAGGAAAGAATATTAGGGTTAATCTTTCCTTGGGCCACTAATTCTAACACTCCCACTCAAGTTGGGACGTAAATATCAATAAGACCCAACTTGCTAACACATGAATCAAAGTTTTATCTGAGAATCCCCTTAGTGAGAACATCAACAACCTATTGACTTGAGGGGATGTAGGGAATGCATATGCTACCATTGTCTAGTCTTTCTTTAATAAATGTCGATCAATCTCCACATGTTTAGTTCTATCATGTTGAACTGGATTGTTTGCGATGCTAATAGCTACCTTATTATCATAGAATAATTTCATCGGCACCTCGTAGTCTTGATGAAGGTAAGATAAAACCTTATGTAGCCAAATTCCCTCACATGTCCCTAAATTCATAGCTCTGTACTCAGCTTTAGCACTACTTCTGGCCACAATCCCTTGCTTCTTACTTCTCAAAGTTTTGAGATTACCCCATACGAAAGTACAATAGTCAAAGGTGGATTTCCTGTCAACAACGGTCCCTGCCCAGTTAGAATCAATATAGGCCTCAATAGCTCCTTTGTAGGTCTTCCCATCAACCCTTTACTAGGAGTTGTTTTTAAGTATCTCAAGATTCTGTTAGCTGCCTCCATGTGTTTCTCACAGGGGCTGCATGAACTGGCTAACGACACTCACAACATAGGAGATATTCGGTTTAGTGTGTGACAAGTAGATCAACTTTCACACATGGCGCTGGTATTTTTCCTTATCAACCGGAGCTTTATCACCTGCACTTTCCAGTTTACTGTTGAACTCAATAGGAGTATCAGTAGGACGTATACTTGATTTAGTCAGCAAATCAAGGGTATATTTCCTTTGAGATACAGAGATACATTCTTTTGATTTAGCTACCTCCATCCCAAGGAAATACTTCAGATTTTCCAAGTCTTTAGTGTCAAACTCATCATCCATCTTCTCTTTCAGTTATATGGTCTCAACGATGTCATCCCAGACATAACAATGTCATAAACACAAACAATTAACACGGCAATCTTCCTAGCCTTGAAAACTTTCGTAAACAAAGTGTGATCATAGTGCTCCTGACTGAACCATTGGGACTTGACAAATGTAGTAAACCTATCAAACCATGCTCTCGGTACTGGGCTTTAAACCATGGAGAGGGGCTCATATAGACTTTCCTCTTCCAAGTCTTTGTTGAAGAACACATTCTTGACATCTAGCTGATATGTGAGCCAATCTTTGTTTACAATAGATAACAGGATTTTAAAGGTATTTAGTTTTGCAACTGGGGAAAAAGTTTCAGAGTAAACCCTTTCGCAATTAACTTGGCCTTGTGTCTATCAAGAGTTCCATTAGTATTTGAGTGTGAACACCCATTTTCATTCCACAGTTTTATGTCCCTTAAGGAGAGTACAAATCTCCCAAGTCTTGTTCTTTTAAGAGCTTTCATCTCTTCCATCACTGCAGTGTTTTACTCAAGACATTCGAAAGCAATGTGGATATTTTTAGGTATCATGGTATCATAGACCCGTTTTCTCAGTGTATACAGAAGAAAGGTGTTCTGACACTTCAAATAGAGTTTTTGAGTAGTATCCTATAGGTCCTTCGCAGTTGTTGCATATAACAGAGGTTTGCCAATCTGTGGCTCCATACTATTGATCAACATGGACTAGATAAGAGAATCCTCCTCTCTCCAGAATCATTCCTAGGAGTCAGTAGGTGGGGTCGAAGAGTCTCCTTTGTCAGAAAACCAAATTGTTGGAGTCCCTAAAGAATCATTTTGGCCGATTGTGACCAGGGATTTTGTGGTCTTGGTTATTAGTATACCCTTATATTCTTTCATTTTCTTTTCATTCAATAGTAGTTATTTCTATAACCAAAAAAAAAAAAAAAGAATAGAAAAACCCTAAGATATATTTAGGATGTTTTAATGCATTCCATTTCATTATAAGTTTGGGAATATAGAGTGCATTCAAGAACTGACATGCTTTTGAAGCAAAAAGGGTTCTCGTATACCTTTGTAATTCAGAAGAAACAAAAATTATAATGGTTATTCATTAGGTATCAAAACCTTTAGATTAAACTGAAAGTTTTTTTTATTTCATTTCTGTTGGAAAATAAAGAGAGAACTGAAATAGTTCGTTGGCAGCTATTGTGGATTGTGAGGTGAGTGTGTTGCCTGGCTCTTTGGTGGTCAGAATTGGAGGGGTATCTCCTTTTTTGCTTTACAGTAATTGAAAACGTCCACTAGCGACTTGCTTCTTGATATTGAAAAAAACTATCAATAGATAAGAAAAGGGGGAGGGTTGTGATGTTTTAGCAGGTCCACAATCACCTTGCTTCTTGAAAGAAAAATTTTCTTTTTCAAGTGAGGAAGATTTGCTTTCAAGTCTCGTCCAAGCAGTGGTATCTGGGATTCTACTGTACTCTCTCTCCTTTTTAGAATTCCTGCTTCGATCAATAAAAGCCTAGAGAATCTTATAAGGAATTTCTCCTAAGATGGGTTTGACAAAGGGATGATTTATCTTGTTAGTTGGGACATAGTTTCAAGGTCATTGAAGTTACCCATCCCTTTGATTAGGTCTTGAGAAGGATTTTCTTTTTAATGGAAACAAAAATTTAGTCGATATAATGAAAAAAGACTACTGCTTCAAAATTCAAGATCTCAAAATATAAGGAAAAAATAGGAGACAAACACAACTAATGAAGGACAAATGCTAAAAGAAGAAAAACTTCAACTAAGAAGCACAAGAAACTAAGAAAGAGAGTCCCATAAGAAAATAAGAGAAAATCCTAACCAAGAGAAACCAGGCCAAACACAGAGAAACATCTATCCAAAAAACTGGTACAAAGTGTCAAAAACGTCTGAAAAATCCCATATGAGAGGGAAACGAGAGCCCAAGGCAAACGGGACTTCAAAAAATTCTAAACCTCAAACAATTCACAAAAATGCAAATAGCTCTAAATGAACAAGACGAGCACCCTTAAACTTTAGCTCCTTGAGAACTAGAGGGAATCACCTGCAACTGAAGACCACAAACCTTAACCAAAGAAGTGAATTTGTCAAGAAGCAAAGAAGGCGTAGGAGAAGCTGCAGCAAAAACCTAACATAAGTGAACCAATATCTTCACTTGCAAAACTTTGATTATAGACTTCTGCAAATATATTTTCTAGTAGGTTCTTCTATTGCACTAGTTCCTCCATTAGAACTTCATTACTCACACTTAACAATGATTTCTCATCATTTTTTGAAAGAGGAAGAACTTCTTTAGGAGAAGGGGGGATGTTACCACTTTAATAAATTGAACATCAAATGTAGGGAGAGAGGTAATAGATTTCTTAAAAAACTTGTTAGGATCCAAAGGCTGATGATGCTTGAAGGGACTGATGAAGAAATAAAAAAAATGAGTGGAAGGCTCATCTATTATCTCTCTCCCCAGAGAAGTAAGTTTCTGTTCCTCTAAAACTCCTGTTCACTTTTCTCTCATTCTCCTAGTTCTTTCAATGTCAATTGAGAATGCTTGAATACCTATTCTTGTGAAGATTACTGGGTGGGAAGAAATCTTTTGCACACCTCTACCCCATCTATACCACCGATTAGATATGCAATTACAGTCTAGGCCCCTTTCTTTTCCTTCTTCTTCGTTTGTTCTTTTGGGGTTTCATTGCCCTCTCAGAAATAGGGAATTGTCAATTTCTCCCTTACTTTGTTTAGCCTACTATTAAATTTTTGACAAATGCTGCACAGTTTTGTTTTTGTTTTTTTAGGGCGGTATTTTTCCTCCGAGGGTGATTAATAATTCTTTAAATTTATTTTGGAGCCTTTATTTCTTTATTTAACTTCATTTATGAGATTTTTCAACTTAAGTTTGTTAAAATCAAATGTGGAAGTCATATTATTATTATTTATTTGTATAAATGAAGATTCGATTCAATGTTTTAAAATTTAATAATATCACAATCAAAATTTGATTGTAACTATTTCAGGTTGTGAGGGAGTTTAATATGACATTGACGATAAATTTCACTCTTTAAACTTGCCATTGTCATCACAACATATAGGGAGTAGTGCTTAGTTTAAGAATGTTCATCATGATCAGACTTTTGGTAAACTTTTATACACCTTTCTCTACATCAGGTATTTTATTTACATTCTTCCCATCCAGCATCTCATGAGTAGTGAGATTTTTTAAGCAGACATACTTTCATTCCGTAATATCTGCTGTATAAATTAATTTCTAGTGGAAAAAAATTTTCAAATGGATAATGCATTTGACGGTTTTAATGATACAGCAAGTGCTTATTGGCATTGATACTATGGAAATGATCAATGAGATGGAGTTGTTTGGTCCATCAGAAATTGATTTTGGATTTGAAGAGCTTGATGATGGAGCTTCAGATGATGGAGATGATGATGATGATGGTGATGGTGAAGATGAAGACGAAGACCACGATGAGGATGATGACGACGACGATGCAGATGATGAATACAATAGGGTCTCTCTATATAAAAGAAAAAGTATACTGTTGGATTTCACTATTTTAGTTTGTTATTAAATGTTTCTCCATTCTGCAGGACTGGGTTTCTGTCATAGATGATGAAGATGATCAAAATCACTCTGATGAAACCTTGGGAGATTGGGCAAAACTGGAGACAATGCGCTCTTCACATCCGATGCATTTTGCCAACAAGCTTTCGGAGGTAATGATTTATGATGGATTACAATTTTTTCCCCCAGCGGGAACTACTAATGAGTCTTGGTCTCTCCATATATGTTAAATGAAGCATCTGATTTCTAATTATATTTATATTCATATGCTTGTTATGTCTTATAGATTGCATCAGATGATCCTATTGATTGGATGGAACAGCCCCCAGCAACTTTAGTGATTCAAGGTGTTCTAAGACCTGCATTCAATGAAGAGCAGACTGTTATCCAAAAGCATCTATCCAGCCGCCATTTGAGTAATGGTGACATAAACGAGGCTCAGGAACTTGAAGAGAACCTTGAAGGTCATGGTAGGATCAATCATCGTGGTCATGAATCAAGTTCATCCAAAGATGGTTTAAACTTGATGGAGGCATTGGATGAAAGTATTCCAGCGAGTGAGGCTTCATTTTACCGGCTGGAGATGATAAAAGTTCAGCTATTTACAGGAAATTCACACCCAGTATGTTTAGTTTCTGTTTTATTTTGCAGTTTGTCGTTGATTTCTCTTTTACATTGTAGCATGTTTTCTTGAAACAGCATCATACACCACACACCTTTTTCACCTGTAAGAATGAGTTCTCTTAGTGTACAGTTGCCGTGGCCTAATCTGAGTTTTTCTCATTTGGAAGCTACCGTTGTGCGTTTCCTTTCTAATACGAATCTTTGGCTCTTTCTTTGGGGGGGGAGAGATAAAAAACCATTTCATTGATTCAATGAAATATCCAAGGGTATATGTATCTTGGACATTCATATTCGCATGCTATGCTTAATTTTCTGAAAAAATTATTAATGGTATTCATTACCGATTTTAAAAATAATTATGTCGTTCATCTTTCTATGGCGGTGGGTGGAAAAGTTGCTGCCTCAGAACTAAAGTAAGATATATTGCTCGTGACTGCTCCTTTGTCAGCTTCGGCCAGTGGAACTAACACACAACAGGAAATTAACTGATACAAACCACTTTGCCTTCATTTCTTGTATTAAATTTTTATTTGACCATGATATATTCTATCTAACAAAGAAAACCAGCACTTCCTGGGACTTGGGCGACTTCGCCGGTCATATTTTAGATACTTTATGCATTATAAATGTTTAATGCCTACAATTTGCAGTCCCTAAATTCTGGGCAGGCTTGTTGTTTTTCTTGAACCTGCTTCTGGTTGTTGCTGACAAAAATTCTTTGATCTTTCCCAGAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCGATTGCGCACTCAGCCGAAAAAATTATTTCTCGTCTAAGAGCAGGTGGAGAAAAGACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGTAAATATATAAATCTTCTCAAAATGAGAAATTATTTCAATCTTTCGTTAAGATATTTGCTTGTCTCTGAATTATTACCATTCATCTTCCCAAATGATACGTCTCCAAAAGGCTGTTCAGTCTGTTATTTTTATTTAAATCTGATTATGTATTTTGATTTTTTAGTACAGATTATGTATTTTGACTTCTCAAAAATAAAAATAAAGAAGGATTATGTATTTTGAAATTTAATTAGTCATGTAATAGAATAACGACCTATTCATATTATAGAAGAGGTATGAGAATTTAGGTCTATGAGGTCATACCAATTACTCTGGGTATACTTTGGTTGACTATTAATTAGTCATTACTTACTTCATAGTGGACTCAGTAATTTTATGGTAGTGTCTAAACATATGCCAAATTAGTAGAGGAATATAGAACGAAAGTAGCCTTCTTTCATTGGCTGTAACATAGAGATAAATCTACTAAGTAATGTACTTTTTTAGTTGTGTTGCCTGATTTGCGGTGTCGTGAACCGAAGTATTTTCTAAAAATCAGTACACAGTTCACAGTTGAATAGTTCAACATGGATTTGAACATTGGCTATAATGGAATTTCACAGGAGTGTTCAATATTCAGGTTTTATATTTGTGTATATACACGTAATTAATTTTCCAAAGAAAGAAGCACATAATCGATGAATGAGTACTTATGCCGCCAATGGTGGAAGTTCTAGGACAGGAGAAATTTTCAATTCTAAAATTCTTAGCTATAAAGTTTCTCCAAAGACCTGTCCTTTCAAATTTTAGTTCCTTCATTTTTCATTCTAGTTCAACAATAGATATGCATATTATAATGATTCTCATGGTGCTTCGGATCTTTTGGTCTTCACTGAATAATTGCAGGAGGCAGTAATCAATGGTATCGATTCACTTGGATTTGATGTGAGGGTTTGCTCCGAAACTCAGGTCCAAACATTGCGGTTTGCTTTTGATACACGAGTAAGTCAACTATCACGTTAGTCTCAAGTTACTGTTTTTTCCTCTTGATTAATTACAACATATGATAACCAAAAAAGCAAATGCATATTCATACATAGTTTGTGACACAATTGGCAGCTGCAATAACCATATACAATTACTAAAGCAGCAAATGCATAGTTATACGTAGTTTGTGACATAACTGGCAGCTGCAATATTTTTAACGTAACCAATTATGACGTGATTTGATGAACTTCCATCCATGGTGCTTTTCTCCGTTTTTTAAATGGAGAATTGTTTGTCCTAATTGTCAGATTTGTGAAAACAATTTAAGAATGATTTGGAAGGACTTATCAGATTTCAAATATGCTCTTATGAAGCTTGATTAAACCGATTTTTATTCAAGTGCGGTTTTGCCTTTTGGCACACAGCCTTTAGACAGTCTACCGGATTTGGTTCAATCACTATACCCTATTTTTTTTAACACAAAGAATGTTTCGATTTCCATTATGGCTTGGATCCATTTAGTTTTCTTGGTTCAGTTTCTTTCATCCTAATTGATGCTGCCCGTGCAGAGTAGAGCCATGAATTTGGACACCAAAATTTACCCGATTAGCCGATTTGGCTAATCTGTTTATATATATATATATATATATATATATATATCACANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGTCGTAATACTGTTATTATATATATATATATATATATATATATATCACAGACCGGACGCATGTTGTCTCTTCTAAGATTCAGGCAGTGGATACTCTAGACCTTTAAGTAGCTCATCTAATCTTTTTCCCAATTCAAAACTAGAATGGAAAAAAACCAAACCCAAACCCTAGAAGGGAAAAAGACCAAACCCTAGAATTATCTAATATTCATCTTCAAAATGCAAAGAATAAGCTACCTAAATATCTTCAACTATTTAATGAAGCCAATGTTTTATTCTTTTCAGTGAATTACCTTTAGAATCTTTAGGTTAAATTTACATATATCCTGGAGTTTTCTGGAAAAGGAAAACGGATTATCTAACATATGCGTATCCTAGCTTTCGTAGCTGGATGGTATTGAATAATGATCCTCTCAAAAGAATGTTCAAGTATAAGATAAAGAATGTAAATTGAAAAACGGATTCATACAGCAAACCAGATAATGGTATGGGAGCCAAAGGAACGGAAGATTCAGAAGAGTCCAAACAAGACGGAAACTCCAAGTATGATCAACCACAAGATATGCACCAAGAGGAGAGCCTGGCCAGGAATAGGCGCAGAACCACCGTTGCCGACCTCACAAAATCCCATCTTGGCGACCTTGACACCTGAACACAAAGCATCGGCACAACCACACCAGTACGTAACGCCATCAACCCCGCAGACCGGGTCCGTTCTAAAGCATTTCACGGGGCAAGAAGAGGGCACCGATACAGGACAGAGATCTACATCGCCGTCGTTATTGGTGGCTTCAGAAGGCAAGCGGATGGCTGAGGAAACATCCTCAGATCGGACTGGAAAGACGAAGATGAGGAGTAAAAATAAGAAAATTATGGAAATTTGGGGTCTAGAGAAGAGGGTCATGTTTCGCGAACTTTGAAAAAGCTCGCGTGATTGGAGATCGATTGGAGATCATGAGCTTTTAGATTTAGGGCAGGAAGGAAGAGACCTCTGTCATGGAATTTTCTTGACAGTTAGGCAACGAACAAGAGGGGACAGCCTTAGCTAGGAATTTGGTGGCCCTACGCCCCCCTTTTTTTGTTTTTCAAAATGGGTTTCCTTTTTTCACATTTCTCACCTAACCATTTGAATTCTTACCTAATTTTTTTTTTTAAAAAAGAAAGTTTTGGAATATATTTTAATATTCAAAATTTACCATTGAAATCTGTGGATTTTAACTACCCATATCTAATTACCATTGAAAATGATGTTTATGGGACTTGATTGTCAATAACAACAAACAAAAAACCAAATTGTTATGAAATGAGACATTTTAATATCTCAACAATAATTTCGTTTTTTTTCTAAATCATGGATAAATTACCCTCTCTTGGTGTTGAAGTTTTATATTTGTAATGGCGGTGAGTTGTCTTTGCAGGAAAATGCTTACTAGTGAGCTTAACCAATTAGTTTGTTTGTGCAGGCAACCTCAGAATTCAGTGCGGAAAAACAGCTGAACGACTTGCTTTTTCTCGAGAATTAATTCCAAACATCAAAAAGTGAAACAAACATAACAAAATGATTCCTAAACATTGTAAGCCTTCTCACCACATTACAATTCAATTATGGGTGTGTTTGGCTTTCTCAGGTTCCGATCTTTAATGCTTCAAAGCAGAATGGAAATAGGTAAATTTATTTTAAATTGTTGATTCATTAGATTATGGATCCGGTGATTGCGAACCCGCAATCATTCTTCAATTATTATCAATTGGCCTCTA

mRNA sequence

CAAGACGAAGAGCAGGTACCTCATCCTGATACACATTTTCTCTCGAAAATGCCCGTGGATCTCCTACTTCAATAACTCATTACCAAGAACAAAGAAGGTTAAAGAATGATCGAAACTGCTCTGGCCGTTCGGTTTCCCGCCGGCGCAAACTTCTGCTACTCCTCTGCCGTGTCCTATCACCGTCCAGCTTGGACTTCGGAGGATGTAACTAGTATAGGCAATGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCACTTCCGACGTGCATTGGAAAAGATGCCAGAGACTTAATAGTAGGTCTCTCTTGGGAAGAAGCTATCTCAAAAAGATTGGAATCCAAGCATCAGCAGAGCCTTTGGGCTCTGCATCAGATCCAATTAAACAAAATAGGGGATTGCAGTATCATCCATCTGAGGAGCTTGTGAAATCAATAACTGAGATTGCTGATGATGTTAGACCTACCTCTGCAGAAACTACTAGAACAATCATTGAGGTAAATAGCAAAGCAACATTAATGTTTGCGGGTTTGATTAATGATGAAGTTCAGGAAAACATTATCTGGCCAGAACTACCGTATGTAACTGACGCACATGGAAATATCTACTTTCAAGCGAAGAATACTGAAGAGGCCATGAAAAATCTAACATCAGAAAATAACTTTGTGCAAGTGCTTATTGGCATTGATACTATGGAAATGATCAATGAGATGGAGTTGTTTGGTCCATCAGAAATTGATTTTGGATTTGAAGAGCTtgatgatggagcttcagatgatggagatgatgatgatgatggtgatggtgaagatgaagacgaagaccacgatgaggatgatgacgacgacgatgcagatgatgaATACAATAGGGACTGGGTTTCTGTCATAGATGATGAAGATGATCAAAATCACTCTGATGAAACCTTGGGAGATTGGGCAAAACTGGAGACAATGCGCTCTTCACATCCGATGCATTTTGCCAACAAGCTTTCGGAGATTGCATCAGATGATCCTATTGATTGGATGGAACAGCCCCCAGCAACTTTAGTGATTCAAGGTGTTCTAAGACCTGCATTCAATGAAGAGCAGACTGTTATCCAAAAGCATCTATCCAGCCGCCATTTGAGTAATGGTGACATAAACGAGGCTCAGGAACTTGAAGAGAACCTTGAAGGTCATGGTAGGATCAATCATCGTGGTCATGAATCAAGTTCATCCAAAGATGGTTTAAACTTGATGGAGGCATTGGATGAAAGTATTCCAGCGAGTGAGGCTTCATTTTACCGGCTGGAGATGATAAAAGTTCAGCTATTTACAGGAAATTCACACCCAAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCGATTGCGCACTCAGCCGAAAAAATTATTTCTCGTCTAAGAGCAGGTGGAGAAAAGACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGAGGCAGTAATCAATGGTATCGATTCACTTGGATTTGATGTGAGGGTTTGCTCCGAAACTCAGGTCCAAACATTGCGGTTTGCTTTTGATACACGAGCAACCTCAGAATTCAGTGCGGAAAAACAGCTGAACGACTTGCTTTTTCTCGAGAATTAATTCCAAACATCAAAAAGTGAAACAAACATAACAAAATGATTCCTAAACATTGTAAGCCTTCTCACCACATTACAATTCAATTATGGGTGTGTTTGGCTTTCTCAGGTTCCGATCTTTAATGCTTCAAAGCAGAATGGAAATAGGTAAATTTATTTTAAATTGTTGATTCATTAGATTATGGATCCGGTGATTGCGAACCCGCAATCATTCTTCAATTATTATCAATTGGCCTCTA

Coding sequence (CDS)

ATGATCGAAACTGCTCTGGCCGTTCGGTTTCCCGCCGGCGCAAACTTCTGCTACTCCTCTGCCGTGTCCTATCACCGTCCAGCTTGGACTTCGGAGGATGTAACTAGTATAGGCAATGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCACTTCCGACGTGCATTGGAAAAGATGCCAGAGACTTAATAGTAGGTCTCTCTTGGGAAGAAGCTATCTCAAAAAGATTGGAATCCAAGCATCAGCAGAGCCTTTGGGCTCTGCATCAGATCCAATTAAACAAAATAGGGGATTGCAGTATCATCCATCTGAGGAGCTTGTGAAATCAATAACTGAGATTGCTGATGATGTTAGACCTACCTCTGCAGAAACTACTAGAACAATCATTGAGGTAAATAGCAAAGCAACATTAATGTTTGCGGGTTTGATTAATGATGAAGTTCAGGAAAACATTATCTGGCCAGAACTACCGTATGTAACTGACGCACATGGAAATATCTACTTTCAAGCGAAGAATACTGAAGAGGCCATGAAAAATCTAACATCAGAAAATAACTTTGTGCAAGTGCTTATTGGCATTGATACTATGGAAATGATCAATGAGATGGAGTTGTTTGGTCCATCAGAAATTGATTTTGGATTTGAAGAGCTTGATGATGGAGCTTCAGATGATGGAGATGATGATGATGATGGTGATGGTGAAGATGAAGACGAAGACCACGATGAGGATGATGACGACGACGATGCAGATGATGAATACAATAGGGACTGGGTTTCTGTCATAGATGATGAAGATGATCAAAATCACTCTGATGAAACCTTGGGAGATTGGGCAAAACTGGAGACAATGCGCTCTTCACATCCGATGCATTTTGCCAACAAGCTTTCGGAGATTGCATCAGATGATCCTATTGATTGGATGGAACAGCCCCCAGCAACTTTAGTGATTCAAGGTGTTCTAAGACCTGCATTCAATGAAGAGCAGACTGTTATCCAAAAGCATCTATCCAGCCGCCATTTGAGTAATGGTGACATAAACGAGGCTCAGGAACTTGAAGAGAACCTTGAAGGTCATGGTAGGATCAATCATCGTGGTCATGAATCAAGTTCATCCAAAGATGGTTTAAACTTGATGGAGGCATTGGATGAAAGTATTCCAGCGAGTGAGGCTTCATTTTACCGGCTGGAGATGATAAAAGTTCAGCTATTTACAGGAAATTCACACCCAAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCGATTGCGCACTCAGCCGAAAAAATTATTTCTCGTCTAAGAGCAGGTGGAGAAAAGACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGAGGCAGTAATCAATGGTATCGATTCACTTGGATTTGATGTGAGGGTTTGCTCCGAAACTCAGGTCCAAACATTGCGGTTTGCTTTTGATACACGAGCAACCTCAGAATTCAGTGCGGAAAAACAGCTGAACGACTTGCTTTTTCTCGAGAATTAA

Protein sequence

MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQRLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPTSAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNLTSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEENLEGHGRINHRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN*
BLAST of Cucsa.108220 vs. Swiss-Prot
Match: Y3913_ARATH (Uncharacterized protein At3g49140 OS=Arabidopsis thaliana GN=At3g49140 PE=1 SV=2)

HSP 1 Score: 437.6 bits (1124), Expect = 1.9e-121
Identity = 242/514 (47.08%), Postives = 328/514 (63.81%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIE+ +AVR   G  FC S+A+  +R A +SE+  +  + +S           +      
Sbjct: 1   MIESVMAVRLSTG--FCSSTALLQYRTAPSSEEGGNCFHYASRRVFQPQRIHHIDGSGFL 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           + NS   + R +L+K   QA+AE + SASDP KQ    +YHPSEE+  S+ +   D R +
Sbjct: 61  KYNS-DYITRKHLRKNRTQATAEYVDSASDPEKQTGKSRYHPSEEIRASLPQNDGDSRLS 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
            AETTRTIIEVN+K TLM  G I D V ENI+WP++PY+TD +GN+YFQ K  E+ M+++
Sbjct: 121 PAETTRTIIEVNNKGTLMLTGSIGDGVHENILWPDIPYITDQNGNLYFQVKEDEDVMQSV 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDED--- 240
           TSENN+VQV++G DTMEMI EMEL G S+ DF   E +D  S D D +D G+ EDE+   
Sbjct: 181 TSENNYVQVIVGFDTMEMIKEMELMGLSDSDF---ETEDDESGDDDSEDTGEDEDEEEWV 240

Query: 241 ---EDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANK 300
              ED DEDDDDDD DDE           +DD + SDE+LGDWA LETMRS HPM FA +
Sbjct: 241 AILEDEDEDDDDDDDDDE-----------DDDDSDSDESLGDWANLETMRSCHPMFFAKR 300

Query: 301 LSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEEN 360
           ++E+AS+DP+DWM+QP A L IQG+L     E+ + IQK L+  + +     +A+ L + 
Sbjct: 301 MTEVASNDPVDWMDQPSAGLAIQGLLSHILVEDYSDIQKKLADSNSTTNGNKDAENLVDK 360

Query: 361 LEGHGRINHRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEI 420
           LE + +      E  SS+D              +  +FY+LEMI++QL T     + VE+
Sbjct: 361 LEDNSKAGGDESEIDSSQD----------EKARNVVAFYKLEMIRIQLITAQGDQTEVEV 420

Query: 421 EDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDV 480
           ED+ KAQPDAIAH++ +IISRL   G+K T+ALKSLCWR   IQ EE  + GIDSLGFD+
Sbjct: 421 EDVRKAQPDAIAHASAEIISRLEESGDKITEALKSLCWRHNSIQAEEVKLIGIDSLGFDL 480

Query: 481 RVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
           R+C+  ++++LRFAF TRATSE +AE Q+  LLF
Sbjct: 481 RLCAGAKIESLRFAFSTRATSEENAEGQIRKLLF 487

BLAST of Cucsa.108220 vs. Swiss-Prot
Match: MLRR1_PLAF7 (MATH and LRR domain-containing protein PFE0570w OS=Plasmodium falciparum (isolate 3D7) GN=PFE0570w PE=1 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 2.6e-09
Identity = 41/154 (26.62%), Postives = 77/154 (50.00%), Query Frame = 1

Query: 155  ELPYVTDAHGNIYFQAKNTEEAMKN---LTSENNFVQVLIGIDTMEMINEMELFGPSEID 214
            E  Y+ +    I+F+     E M N   L ++++++     +   +M  + +L    E +
Sbjct: 8254 ECNYLIEQINKIFFKVDKNNEHMLNGVLLENDDDYLDEEGKVSKKKM-KKKKLLNDKEHE 8313

Query: 215  FGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSD 274
               E+ D+   +D DD+DD + +++D+D D+DDDDDD DD+Y+ D+    D++  +N  +
Sbjct: 8314 KDNEDNDEDNDEDDDDEDDDEDDEDDDDDDDDDDDDDDDDDYDEDYDEDYDEKLVENKKN 8373

Query: 275  ETLGDWAKLETMRSSHPMHFANKLSEIASDDPID 306
            E        E M   + M   N  +  +S + ID
Sbjct: 8374 ERSNIIMSKENMNKLN-MQPKNTNNRSSSSNTID 8405


HSP 2 Score: 49.7 bits (117), Expect = 1.1e-04
Identity = 61/307 (19.87%), Postives = 128/307 (41.69%), Query Frame = 1

Query: 100  YHPSEELVKSITEIADDVRPT---SAETTRTIIEVNSKATLMFAGLINDEVQENI----- 159
            Y   +  VK+I  +A D++       E + ++ + NS    +    IND+  E +     
Sbjct: 7476 YRMPDRTVKAICPLAGDLQTVLDNELEISESLEDENSYINRILKMNINDKRNEAMKNARP 7535

Query: 160  IWPELPYVTDAH---GNIYFQAKNTEEAMKNLTSENNFVQVLIGIDTME--MINEMELFG 219
            I  +  +V  A      I   A+      + ++++   ++V   + T +  M++  +   
Sbjct: 7536 IIEDDKFVKHASMDSSKIIHSAREETNDKEKISTQAKIIEVGKKLTTKDEDMLHSKKT-- 7595

Query: 220  PSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDA---------------DD 279
               I  G EE DD   D+ DD++D + E+EDED ++ +D +D                DD
Sbjct: 7596 -DVIQHGDEEEDDEEDDEEDDEEDEEEEEEDEDEEDVEDVEDIEDVEDVEDIEDNYVDDD 7655

Query: 280  EYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPP 339
            +Y  ++    DD++D +  DE   D+   +   +       +K + +  +D  +  E+  
Sbjct: 7656 QYEDNY----DDDNDNDDDDEYDHDY---DEHINEEEQEDDDKKNNVNINDSYEEGEEEG 7715

Query: 340  ATLVIQGVLRPAFNEEQTVIQKHLS--SRHLSNGDINEAQELEENLEGHGRINHRGHESS 377
                  G L     +++   + ++S     + + DIN+ ++  + L    + NH+   ++
Sbjct: 7716 -----DGKLNTKIKKDKKSTENNISVVEEKVKSIDINKEEDFPDLLSNSKKKNHKDKRNA 7767


HSP 3 Score: 35.8 bits (81), Expect = 1.7e+00
Identity = 20/75 (26.67%), Postives = 34/75 (45.33%), Query Frame = 1

Query: 221  ASDDGDDDDDGDGEDEDEDHDED-----------DDDDDADDEYNRDWVSVIDDEDDQNH 280
            +S+  DDDDDGD  +   D+ +D           D++DD DD+ +  ++    D D + +
Sbjct: 2181 SSNSDDDDDDGDNNNNKNDNMDDDSYYERNSIFGDNNDDKDDDDDDSYLDNFSDSDGKKN 2240

Query: 281  SDETLGDWAKLETMR 285
                  D    E M+
Sbjct: 2241 DPSYKYDIENNEEMK 2255

BLAST of Cucsa.108220 vs. Swiss-Prot
Match: YHT1_YEAST (PH domain-containing protein YHR131C OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=YHR131C PE=1 SV=2)

HSP 1 Score: 64.7 bits (156), Expect = 3.4e-09
Identity = 29/50 (58.00%), Postives = 36/50 (72.00%), Query Frame = 1

Query: 218 DDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQ 268
           DDG  DDG+DDDD D +D+D+D DEDDDDDD DD+ + D     DD+D Q
Sbjct: 803 DDGDGDDGEDDDDDDDDDDDDDDDEDDDDDDDDDDDDDD-----DDDDGQ 847


HSP 2 Score: 57.8 bits (138), Expect = 4.1e-07
Identity = 27/53 (50.94%), Postives = 37/53 (69.81%), Query Frame = 1

Query: 221 ASDDGDDDDDG-DGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDE 273
           A D G+DD DG DGED+D+D D+DDDDDD +D+ + D     DD+DD +  D+
Sbjct: 797 AEDHGEDDGDGDDGEDDDDDDDDDDDDDDDEDDDDDD-----DDDDDDDDDDD 844


HSP 3 Score: 57.0 bits (136), Expect = 7.0e-07
Identity = 28/58 (48.28%), Postives = 37/58 (63.79%), Query Frame = 1

Query: 215 EELDDGASDDGD-----DDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQ 268
           ++ +D   DDGD     DDDD D +D+D+D DEDDDDDD DD+ + D     DD+D Q
Sbjct: 795 KDAEDHGEDDGDGDDGEDDDDDDDDDDDDDDDEDDDDDDDDDDDDDD-----DDDDGQ 847

BLAST of Cucsa.108220 vs. Swiss-Prot
Match: CASQ1_PELES (Calsequestrin-1 OS=Pelophylax esculentus PE=1 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 4.4e-09
Identity = 31/77 (40.26%), Postives = 49/77 (63.64%), Query Frame = 1

Query: 196 MEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNR 255
           M+M +E +L    E++   E++ +G  +  DDDDD D +D+D+D D+DDDDDD DD+ + 
Sbjct: 347 MDMDDEEDLPTVDELEDWIEDVLEGEVNTEDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD 406

Query: 256 DWVSVIDDEDDQNHSDE 273
           D     DD+DD +  D+
Sbjct: 407 D-----DDDDDDDDDDD 418

BLAST of Cucsa.108220 vs. Swiss-Prot
Match: KEX1_VANPO (Pheromone-processing carboxypeptidase KEX1 OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) GN=KEX1 PE=3 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 7.5e-09
Identity = 33/81 (40.74%), Postives = 48/81 (59.26%), Query Frame = 1

Query: 208 SEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQ 267
           S+ D G ++ D G  +  DDDDD D + +D+  D+DDDDDD DD+ + D  S  DD+DD 
Sbjct: 537 SDTDEG-KDTDKGKDEKNDDDDDDDDDSDDDSDDDDDDDDDDDDDDDDDDDSDDDDDDDD 596

Query: 268 NHSDETLGDWAKLETMRSSHP 289
           +  D    D ++ ET   +HP
Sbjct: 597 DSDDNEKDDKSESET--KTHP 614


HSP 2 Score: 61.6 bits (148), Expect = 2.9e-08
Identity = 38/93 (40.86%), Postives = 53/93 (56.99%), Query Frame = 1

Query: 201 EMELFGPSEIDFGFEELDDGA-SDDG----DDDDDGDGEDEDEDHDEDDDDDDADDEYNR 260
           E E  G +E D    + D+G  +D G    +DDDD D +D D+D D+DDDDDD DD+ + 
Sbjct: 523 EDEKDGVTEGDGEKSDTDEGKDTDKGKDEKNDDDDDDDDDSDDDSDDDDDDDDDDDDDDD 582

Query: 261 DWVSVIDDEDDQNHSDETLGDWAKLETMRSSHP 289
           D     DD+DD + SD+   D  K E+   +HP
Sbjct: 583 DDDDSDDDDDDDDDSDDNEKD-DKSESETKTHP 614


HSP 3 Score: 53.5 bits (127), Expect = 7.8e-06
Identity = 33/96 (34.38%), Postives = 49/96 (51.04%), Query Frame = 1

Query: 183 ENNFVQVLIGID--TMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDH 242
           E +   VLI  +  T   I E EL G  E +       DG   D D+  D D   ++++ 
Sbjct: 495 ETDSPDVLISTNEPTFSDIEEEELDGEKEDEKDGVTEGDGEKSDTDEGKDTDKGKDEKND 554

Query: 243 DEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGD 277
           D+DDDDDD+DD+ + D     DD+DD +  D++  D
Sbjct: 555 DDDDDDDDSDDDSDDDDDDDDDDDDDDDDDDDSDDD 590

BLAST of Cucsa.108220 vs. TrEMBL
Match: A0A0A0LBJ5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G184030 PE=4 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 1.8e-171
Identity = 302/308 (98.05%), Postives = 304/308 (98.70%), Query Frame = 1

Query: 188 QVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED--HDEDDD 247
           QVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED   DEDDD
Sbjct: 10  QVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDDDEDEDDD 69

Query: 248 DDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPID 307
           DDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPID
Sbjct: 70  DDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPID 129

Query: 308 WMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEENLEGHGRINHRG 367
           WMEQPPATLVIQGVLRPAFNEEQTVI+KHLSSRHLSNGDINEAQELEENLEGHGRINH G
Sbjct: 130 WMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGHGRINHHG 189

Query: 368 HESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAI 427
           HESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAI
Sbjct: 190 HESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAI 249

Query: 428 AHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTL 487
           AHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTL
Sbjct: 250 AHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTL 309

Query: 488 RFAFDTRA 494
           RFAFDTR+
Sbjct: 310 RFAFDTRS 317

BLAST of Cucsa.108220 vs. TrEMBL
Match: W9RSG8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014223 PE=4 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 1.9e-160
Identity = 300/511 (58.71%), Postives = 376/511 (73.58%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MI++ + +RF A A   Y      +RP W+SED++ + + SS CR+ H+C  DV W R +
Sbjct: 2   MIDSTVTLRFSAAATNLY------YRPMWSSEDLSGVVHVSS-CRISHACGFDVPWNRFR 61

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
             NS S   R  L K  I+ASA+ LG  SDPIK+N   QYHP EE  KS +E   +   T
Sbjct: 62  SANSGSFR-RCNLIKNRIRASAKHLGPGSDPIKKNGKPQYHPFEEFAKSTSENGGEATLT 121

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           S ET RTII+VNSKAT+MF+ L+ND+V ENIIWPE+PYVTD HGNIYFQ K+ E+ M+ L
Sbjct: 122 SEETARTIIKVNSKATVMFSNLVNDQVHENIIWPEMPYVTDEHGNIYFQVKDGEDTMQAL 181

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED- 240
           +SENNFVQV+IG+DT EMI EMEL GPSEIDFG +E+                E+ED D 
Sbjct: 182 SSENNFVQVIIGLDTTEMIREMELSGPSEIDFGIDEI----------------EEEDSDV 241

Query: 241 HDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIA 300
            DEDD++DD +D+Y+ DWV+V++DEDD+   DE LGDWAKLETMRSSHPM+FA KL+E+ 
Sbjct: 242 EDEDDEEDDENDDYDEDWVAVLEDEDDEEDEDEALGDWAKLETMRSSHPMYFAQKLAEVV 301

Query: 301 SDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINE-AQELEENLEGH 360
           SD+PIDWMEQPPA+L IQGV+RPAF EE +VI+KHLS++  SN ++N+  + +E   E  
Sbjct: 302 SDNPIDWMEQPPASLAIQGVVRPAFIEEHSVIRKHLSNQQSSNAELNQVGKPVEGGSEDP 361

Query: 361 GRINHRGHESSSSKDGLNLMEALD-ESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDL 420
            RIN    ES SSKD     E L+ + I  + A+FY+LE+IK++LF+ +   + VEIED 
Sbjct: 362 IRINGHESESESSKDSSTWEEELEKDEITPNGATFYKLEIIKIELFSAHGRQTLVEIEDF 421

Query: 421 MKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVC 480
           MKAQPD IAHSA KIISRL+AGGEKTTQALKSLCWR KGIQVEEAVI G+DSLG D+R+C
Sbjct: 422 MKAQPDPIAHSATKIISRLKAGGEKTTQALKSLCWRLKGIQVEEAVIIGVDSLGIDLRIC 481

Query: 481 SETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
           S TQVQTLRF FD+RATSE+SAE+QLND+LF
Sbjct: 482 SGTQVQTLRFGFDSRATSEYSAERQLNDILF 488

BLAST of Cucsa.108220 vs. TrEMBL
Match: A0A061G0G7_THECC (Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_015265 PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 1.4e-158
Identity = 296/510 (58.04%), Postives = 381/510 (74.71%), Query Frame = 1

Query: 2   IETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQR 61
           IE+ALAVRFPAGANFC SSA+ ++RP  +S++VT     S   RL      D+ W R +R
Sbjct: 6   IESALAVRFPAGANFCSSSALHHYRPTCSSDEVTCCHVTSR--RLFRRGGFDLTWDRFRR 65

Query: 62  LNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPTS 121
           +NS SLL R+ +K   I+A+AE LGSASDP KQNR   YHP E++ ++ ++ ++D   ++
Sbjct: 66  INSGSLLRRTLIKN-KIRATAEHLGSASDPTKQNRRPHYHPFEDIGEATSKNSNDAILSA 125

Query: 122 AETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNLT 181
           AETTRTII+VNSKATLMF G+INDEV ENI+WP+LPYVTD HGN+YFQ K+ E+ M++LT
Sbjct: 126 AETTRTIIKVNSKATLMFTGIINDEVHENIMWPDLPYVTDEHGNVYFQVKSDEDIMQSLT 185

Query: 182 SENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDED-EDH 241
            ENNFVQV+IG DT E++ E+EL GPS+IDFG EE++D              ED D ED 
Sbjct: 186 LENNFVQVIIGFDTTEIMKEIELSGPSDIDFGIEEIED--------------EDSDVEDV 245

Query: 242 DEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIAS 301
           DED+DD   +++Y+ +WV+ ++ EDDQ+ SDETLGDWAKLETMRSSHPM+FA KL+E+AS
Sbjct: 246 DEDEDDHAEEEDYDEEWVAALEHEDDQDDSDETLGDWAKLETMRSSHPMYFAKKLTEVAS 305

Query: 302 DDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQE-LEENLEGHG 361
           DDPIDWMEQP   L IQG++RPAF EE + IQKH+SS    + D ++ ++ +E+ LE  G
Sbjct: 306 DDPIDWMEQPSDGLAIQGLIRPAFVEEHSEIQKHMSSNQSRSSDTSQVEKVVEDKLEDLG 365

Query: 362 RINHRGHESSSSKDGLNLMEALD-ESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 421
            IN + +E   S D   + E  + + I  + +SFY+LE++K+QL T + H + VE+ED  
Sbjct: 366 IINGQSNELGWSGDSSTISEEPEKKEISINGSSFYKLEIVKIQLITAHGHQTVVELEDFK 425

Query: 422 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 481
           +AQPDAIA SA KIIS L+AGGEKTTQALKSLCWRCK IQVEE  I GIDSLGFD+RVC 
Sbjct: 426 QAQPDAIAQSAAKIISCLKAGGEKTTQALKSLCWRCKSIQVEEVAIIGIDSLGFDLRVCC 485

Query: 482 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
             Q+QTLRFAF+TRATSE+SAE+QLNDLLF
Sbjct: 486 GPQIQTLRFAFNTRATSEYSAERQLNDLLF 498

BLAST of Cucsa.108220 vs. TrEMBL
Match: A0A151T4D6_CAJCA (Uncharacterized protein At3g49135 family (Fragment) OS=Cajanus cajan GN=KK1_016421 PE=4 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 2.2e-156
Identity = 288/489 (58.90%), Postives = 366/489 (74.85%), Query Frame = 1

Query: 25  HRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQRLNSRSLLGRSYLKKIGIQASAEP 84
           +R  W++EDV  +   +S CRL  SC  DV W R Q+  S     R+ L K  I+AS+E 
Sbjct: 1   NRSMWSAEDVNGVRYVAS-CRLACSCGFDVPWVRSQKYTSIPFTRRNKLAKNRIRASSEH 60

Query: 85  LGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPTSAETTRTIIEVNSKATLMFAGLIN 144
           LGS  DP+K+N    YHP EE+  S +E ++D R T+AET+RTIIEVNSKATLMF+ LI+
Sbjct: 61  LGS-QDPVKKNEKPSYHPFEEVAASTSENSEDARLTAAETSRTIIEVNSKATLMFSSLIS 120

Query: 145 DEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNLTSENNFVQVLIGIDTMEMINEMEL 204
           DE  ENIIWP+LPY+TD HGNIYFQ KN+E+ +++LTSENNFVQV++G+++MEMI+EM+L
Sbjct: 121 DEFHENIIWPDLPYLTDEHGNIYFQVKNSEDVLQSLTSENNFVQVIVGVNSMEMISEMDL 180

Query: 205 FGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDE 264
            GPSEIDFG EE+DD    D +D DD + EDEDED DED+++D     Y+ +WV+V  D+
Sbjct: 181 SGPSEIDFGIEEIDD---QDTEDLDDSNEEDEDEDEDEDENED-----YDSEWVAVFSDD 240

Query: 265 DDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPPATLVIQGVLRPAF 324
           D+Q   DETL DWAKLETM+ SHPM+FA KL+EIASDDP+DWMEQPPA + IQGV+RPAF
Sbjct: 241 DEQEDDDETLADWAKLETMQYSHPMYFAKKLAEIASDDPVDWMEQPPACVAIQGVIRPAF 300

Query: 325 NEEQTVIQKHLSSRHLSNGDINEAQELEENLEGHGRINHRGHESSSSKDGLNLMEALD-- 384
            EE + IQKHLS+   S  D N+++ +E   E  G IN  GH  +S   G N  + ++  
Sbjct: 301 VEEHSTIQKHLSTNQSS--DTNKSKSIESKGENIGVIN--GHVLNSGSSGDNAAQQVEND 360

Query: 385 ---ESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAG 444
              +SIP SE SFY+LEMIK+Q+F+   +P+ +EIED +KAQPDAIAHSA KIISRL+  
Sbjct: 361 SNSDSIPFSETSFYKLEMIKIQVFSAQGNPTALEIEDYVKAQPDAIAHSASKIISRLKVD 420

Query: 445 GEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLRFAFDTRATSEFSA 504
           GE T QALKSLCWRCKG+QVEEA +  +DSLGFD+RVCS TQ+QTLRF+F  RATSE+SA
Sbjct: 421 GENTLQALKSLCWRCKGLQVEEAQLICLDSLGFDLRVCSGTQIQTLRFSFKNRATSEYSA 475

Query: 505 EKQLNDLLF 509
           E+QLNDLLF
Sbjct: 481 ERQLNDLLF 475

BLAST of Cucsa.108220 vs. TrEMBL
Match: A0A0D2RI62_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G069200 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 8.3e-156
Identity = 287/509 (56.39%), Postives = 369/509 (72.50%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIE+A+AVRFP   NFC SSA   +RP   S +VTS     S  RL       + WK  +
Sbjct: 4   MIESAMAVRFPTPTNFCSSSAFHNYRPMCNSGEVTSCH--VSCRRLFCHGGFGITWKGFR 63

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLN R+ L R  L K  I+A+AE LGSASDP K      YHP E++ ++ ++ ++D   T
Sbjct: 64  RLN-RASLSRRTLVKNNIRATAEHLGSASDPAKHKGRSHYHPFEDIGEATSKKSNDATLT 123

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           +AET+RTIIEVNSKAT+MF G+INDEV ENI+WP+LPY TD HGN+Y Q K+ E+ +++L
Sbjct: 124 AAETSRTIIEVNSKATVMFTGMINDEVHENIMWPDLPYATDEHGNVYLQVKSDEDILQSL 183

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDH 240
           T ENNFVQV+IG DT E++ E+EL GPSE+DFG EE+D+   D  DDDDD        D 
Sbjct: 184 TVENNFVQVIIGFDTTEIMKEIELSGPSEVDFGIEEIDNEDVDIEDDDDD--------DD 243

Query: 241 DEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIAS 300
           D+DDDDDD +++Y+ +WV+ ++DEDDQ+ SD TLGDWAKL+TMRSSHPM+FA KL+E AS
Sbjct: 244 DDDDDDDDDEEDYDEEWVAALEDEDDQDDSDGTLGDWAKLDTMRSSHPMYFAKKLTEAAS 303

Query: 301 DDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQ-ELEENLEGHG 360
           DDP+DWMEQP   L IQG+LRPA  EE + IQKH+S+      D N+A+ ++ + +E  G
Sbjct: 304 DDPVDWMEQPSDGLAIQGLLRPALTEEHSEIQKHMSTNQSHGSDTNQAEKDVGDKVEDLG 363

Query: 361 RINHRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMK 420
            IN  G+ES  S+   +        I  + +SFY+LEMIK+QL T + H ++VE+ED  +
Sbjct: 364 IINGYGNESELSRKSSSSERLGKNEISTNGSSFYKLEMIKIQLITAHGHQTDVELEDFKQ 423

Query: 421 AQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSE 480
           AQPDAIAH A KIISRL+AGGEKTTQALKSLCWRCKGIQVEE  I  +DSLGF +R+CS 
Sbjct: 424 AQPDAIAHLAAKIISRLKAGGEKTTQALKSLCWRCKGIQVEEVAIISVDSLGFVLRICSG 483

Query: 481 TQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
           TQ++TLRFAF+ RATSE+SAE+QLND+LF
Sbjct: 484 TQIETLRFAFNARATSEYSAERQLNDMLF 501

BLAST of Cucsa.108220 vs. TAIR10
Match: AT3G49140.1 (AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 437.6 bits (1124), Expect = 1.1e-122
Identity = 242/514 (47.08%), Postives = 328/514 (63.81%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIE+ +AVR   G  FC S+A+  +R A +SE+  +  + +S           +      
Sbjct: 1   MIESVMAVRLSTG--FCSSTALLQYRTAPSSEEGGNCFHYASRRVFQPQRIHHIDGSGFL 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           + NS   + R +L+K   QA+AE + SASDP KQ    +YHPSEE+  S+ +   D R +
Sbjct: 61  KYNS-DYITRKHLRKNRTQATAEYVDSASDPEKQTGKSRYHPSEEIRASLPQNDGDSRLS 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
            AETTRTIIEVN+K TLM  G I D V ENI+WP++PY+TD +GN+YFQ K  E+ M+++
Sbjct: 121 PAETTRTIIEVNNKGTLMLTGSIGDGVHENILWPDIPYITDQNGNLYFQVKEDEDVMQSV 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDED--- 240
           TSENN+VQV++G DTMEMI EMEL G S+ DF   E +D  S D D +D G+ EDE+   
Sbjct: 181 TSENNYVQVIVGFDTMEMIKEMELMGLSDSDF---ETEDDESGDDDSEDTGEDEDEEEWV 240

Query: 241 ---EDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANK 300
              ED DEDDDDDD DDE           +DD + SDE+LGDWA LETMRS HPM FA +
Sbjct: 241 AILEDEDEDDDDDDDDDE-----------DDDDSDSDESLGDWANLETMRSCHPMFFAKR 300

Query: 301 LSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEEN 360
           ++E+AS+DP+DWM+QP A L IQG+L     E+ + IQK L+  + +     +A+ L + 
Sbjct: 301 MTEVASNDPVDWMDQPSAGLAIQGLLSHILVEDYSDIQKKLADSNSTTNGNKDAENLVDK 360

Query: 361 LEGHGRINHRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEI 420
           LE + +      E  SS+D              +  +FY+LEMI++QL T     + VE+
Sbjct: 361 LEDNSKAGGDESEIDSSQD----------EKARNVVAFYKLEMIRIQLITAQGDQTEVEV 420

Query: 421 EDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDV 480
           ED+ KAQPDAIAH++ +IISRL   G+K T+ALKSLCWR   IQ EE  + GIDSLGFD+
Sbjct: 421 EDVRKAQPDAIAHASAEIISRLEESGDKITEALKSLCWRHNSIQAEEVKLIGIDSLGFDL 480

Query: 481 RVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
           R+C+  ++++LRFAF TRATSE +AE Q+  LLF
Sbjct: 481 RLCAGAKIESLRFAFSTRATSEENAEGQIRKLLF 487

BLAST of Cucsa.108220 vs. TAIR10
Match: AT5G24060.2 (AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 406.0 bits (1042), Expect = 3.5e-113
Identity = 227/477 (47.59%), Postives = 301/477 (63.10%), Query Frame = 1

Query: 39  NASSFCRLLHSCTSDVHWKRCQRLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGL 98
           N SS C  L  C SD              + R YL++   QA AE LGSASDP K     
Sbjct: 43  NTSSGCGFL-KCYSDY-------------ITRKYLRRNRTQAIAEYLGSASDPKKPTGKS 102

Query: 99  QYHPSEELVKSITEI-ADDVRPTSAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELP 158
            YHPSE++   + E    D R +  ET RTIIEVN K TLM +GL+   V ENI+WP++P
Sbjct: 103 SYHPSEDIRAYVPEKNPGDSRLSPPETARTIIEVNKKGTLMLSGLLGIGVHENILWPDIP 162

Query: 159 YVTDAHGNIYFQAKNTEEAMKNL-TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEE 218
           YVTD HGNIYFQ K  E+ M+ + TS+NN+VQV++G DTMEMI +MEL  PS I FG EE
Sbjct: 163 YVTDQHGNIYFQVKENEDIMQTVVTSDNNYVQVIVGFDTMEMIKDMELSSPSGIGFGIEE 222

Query: 219 LDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQNH----SDE 278
           ++DG S           E EDE+  ++D+ +D DDE   +WV+V++D DD+++    SDE
Sbjct: 223 IEDGES-----------EVEDENKGDEDEGEDKDDE---EWVAVLEDGDDEDNYVSDSDE 282

Query: 279 TLGDWAKLETMRSSHPMHFANKLSEIASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQ 338
           +LGDWA LETMR  HPM+FA +++E+AS DP++WM+QP A L IQG+L P   E+ + IQ
Sbjct: 283 SLGDWANLETMRYCHPMYFARRMAEVASTDPVNWMDQPSAGLAIQGLLSPVIVEDHSDIQ 342

Query: 339 KHLSSRHLSNGDINEAQE-LEENLEGHGRINHRGHESSSSKDGLNLMEALDESIPASEAS 398
           KH+S    +  D N+ +E  EE  EG G                N  E L      +   
Sbjct: 343 KHISGCISTGTDKNKERENSEEIFEGIGE---------------NESEILHVENSRNAIQ 402

Query: 399 FYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLC 458
           +Y+LE+I++QL T   H + VE+ED+ KAQPD IA +++ I++RL   G+K T+AL+SLC
Sbjct: 403 YYKLEIIRIQLITAQGHQTEVEVEDVRKAQPDVIACASDGILTRLEEDGDKLTEALRSLC 462

Query: 459 WRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
           WR  GIQ EE  + GIDSLGFD+R+CS  Q++TLRFAF  RATSE +AE QL +LLF
Sbjct: 463 WRNNGIQAEEVKLIGIDSLGFDLRICSGMQIETLRFAFSIRATSEHNAEGQLRELLF 476

BLAST of Cucsa.108220 vs. TAIR10
Match: AT3G59300.1 (AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 117.1 bits (292), Expect = 3.2e-26
Identity = 84/266 (31.58%), Postives = 128/266 (48.12%), Query Frame = 1

Query: 251 DEYNRDWVSVIDDEDDQNHSDETLG--------DWAKLETMRSSHPMHFANKLSEIASDD 310
           +EYN   +  +D     +H  E +         DW   +T    HP++FA  LS+  S D
Sbjct: 199 EEYNISDIGNLDQIIFDDHYFEIMDSEARDIPIDWGMPDTSNGVHPIYFAKHLSKAISMD 258

Query: 311 PIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEENLEGHGRIN 370
               M+ P   + I G LRPA       + +    R L   +  +    E   + +   +
Sbjct: 259 YDRKMDYPSNGVSILGCLRPA------FLDEESYIRRLFLSEDRDDYSWEVQGDDNPITS 318

Query: 371 HRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQP 430
            R  E+  S                  +S YRLE++ ++L +     S++ ++D   A+P
Sbjct: 319 SRRDENDMS------------------SSLYRLEIVGIELLSLYGAESSISLQDFQDAEP 378

Query: 431 DAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQV 490
           D + HS   II R    G  ++ ALK+LC + KG+  EEA +  +DSLG DVRV +  QV
Sbjct: 379 DILVHSTSAIIERFNNRGINSSIALKALCKK-KGLHAEEANLISVDSLGMDVRVFAGAQV 438

Query: 491 QTLRFAFDTRATSEFSAEKQLNDLLF 509
           QT RF F TRAT+E +AEK+++ LLF
Sbjct: 439 QTHRFPFKTRATTEMAAEKKIHQLLF 439

BLAST of Cucsa.108220 vs. TAIR10
Match: AT1G44050.1 (AT1G44050.1 Cysteine/Histidine-rich C1 domain family protein)

HSP 1 Score: 63.5 bits (153), Expect = 4.2e-10
Identity = 29/59 (49.15%), Postives = 35/59 (59.32%), Query Frame = 1

Query: 218 DDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGD 277
           DD   DDGDDD++ D ED+D D D+DDDDDD DD+         DD+DD    D   GD
Sbjct: 89  DDNKGDDGDDDNEDDNEDDDNDDDDDDDDDDDDDD---------DDDDDDGDDDNEDGD 138


HSP 2 Score: 60.5 bits (145), Expect = 3.6e-09
Identity = 30/69 (43.48%), Postives = 40/69 (57.97%), Query Frame = 1

Query: 208 SEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQ 267
           +E D   ++ +DG  D+  DD D D ED++ED D DDDDDD DD+ + D     DD+DD 
Sbjct: 75  AEDDDNGDDKEDGDDDNKGDDGDDDNEDDNEDDDNDDDDDDDDDDDDDD-----DDDDDD 134

Query: 268 NHSDETLGD 277
              D   GD
Sbjct: 135 GDDDNEDGD 138


HSP 3 Score: 47.8 bits (112), Expect = 2.4e-05
Identity = 20/31 (64.52%), Postives = 23/31 (74.19%), Query Frame = 1

Query: 218 DDGASDDGDDDDDGDGEDEDEDHDEDDDDDD 249
           DD   DD DDDDD DG+D++ED D DDDD D
Sbjct: 115 DDDDDDDDDDDDDDDGDDDNEDGDCDDDDGD 145


HSP 4 Score: 32.0 bits (71), Expect = 1.4e+00
Identity = 18/50 (36.00%), Postives = 26/50 (52.00%), Query Frame = 1

Query: 192 GIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDHD 242
           G D  E  NE +     + D   ++ DD   DD  DDD+ DG+ +D+D D
Sbjct: 96  GDDDNEDDNEDDDNDDDDDDDDDDDDDDDDDDDDGDDDNEDGDCDDDDGD 145


HSP 5 Score: 29.6 bits (65), Expect = 6.8e+00
Identity = 14/33 (42.42%), Postives = 18/33 (54.55%), Query Frame = 1

Query: 240 HDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDE 273
           H+ DDDDD  DD+   D     +D DD N  D+
Sbjct: 67  HNHDDDDDAEDDDNGDD----KEDGDDDNKGDD 95

BLAST of Cucsa.108220 vs. TAIR10
Match: AT1G48400.1 (AT1G48400.1 F-box/RNI-like/FBD-like domains-containing protein)

HSP 1 Score: 59.7 bits (143), Expect = 6.1e-09
Identity = 33/103 (32.04%), Postives = 52/103 (50.49%), Query Frame = 1

Query: 151 IIWPELPYVTDAHGNIYFQAKNTEEAMKNLTSENNFVQVLIGIDTMEMINEMELFGPSEI 210
           +IW EL Y  +A   +Y    +   A  ++   +  V+  + +      N+ +     + 
Sbjct: 231 MIWHELIYF-EAPSLVYLDYSSYVSAKYDVVDFDLLVEARLSLRLWVSTNDYDYSDDDDD 290

Query: 211 DFGFEELDDGASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEY 254
           D    + DD   DD DDDDD D +D+D+D D+DDDDDD D +Y
Sbjct: 291 D----DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDGDY 328


HSP 2 Score: 56.2 bits (134), Expect = 6.8e-08
Identity = 26/61 (42.62%), Postives = 40/61 (65.57%), Query Frame = 1

Query: 221 ASDDGDDDDDGDGEDEDEDHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKL 280
           +++D D  DD D +D+D+D D+DDDDDD DD+ + D     DD+DD +  D+  GD+  +
Sbjct: 277 STNDYDYSDDDDDDDDDDDDDDDDDDDDDDDDDDDD-----DDDDDDDDDDDDDGDYYIV 332

Query: 281 E 282
           E
Sbjct: 337 E 332

BLAST of Cucsa.108220 vs. NCBI nr
Match: gi|449445979|ref|XP_004140749.1| (PREDICTED: uncharacterized protein At3g49140 [Cucumis sativus])

HSP 1 Score: 1019.6 bits (2635), Expect = 1.9e-294
Identity = 508/513 (99.03%), Postives = 509/513 (99.22%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT
Sbjct: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED- 240
           TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED 
Sbjct: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDD 240

Query: 241 -HDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
             DEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI
Sbjct: 241 DEDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVI+KHLSSRHLSNGDINEAQELEENLEGH
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGH 360

Query: 361 GRINHRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINH GHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM
Sbjct: 361 GRINHHGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN 512
           ETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN
Sbjct: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLFLEN 513

BLAST of Cucsa.108220 vs. NCBI nr
Match: gi|659077638|ref|XP_008439307.1| (PREDICTED: uncharacterized protein At3g49140-like [Cucumis melo])

HSP 1 Score: 975.3 bits (2520), Expect = 4.1e-281
Identity = 485/510 (95.10%), Postives = 497/510 (97.45%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MIETALAVRFPAGANFCYSSAV YHRPAWTSED +SIGNASSFCRLLHSCTSDVHWKRCQ
Sbjct: 1   MIETALAVRFPAGANFCYSSAVPYHRPAWTSEDASSIGNASSFCRLLHSCTSDVHWKRCQ 60

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
           RLNSRSLLGRS L+K GIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT
Sbjct: 61  RLNSRSLLGRSNLRKNGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTD HGNIYFQ KNTEEAMKNL
Sbjct: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDEHGNIYFQMKNTEEAMKNL 180

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASD--DGDDDDDGDGEDEDE 240
           TSENNFVQVLIG+DTMEMINEMELFGPSEIDFGFEELDDGA++  D DDDDDGDGEDEDE
Sbjct: 181 TSENNFVQVLIGLDTMEMINEMELFGPSEIDFGFEELDDGATNVGDDDDDDDGDGEDEDE 240

Query: 241 DHDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEI 300
           D DED+DDDDADDEYNRDWVSVIDDEDDQN+SDETLGDWAKLETMRSSHPMHFANKLSE+
Sbjct: 241 DDDEDNDDDDADDEYNRDWVSVIDDEDDQNNSDETLGDWAKLETMRSSHPMHFANKLSEV 300

Query: 301 ASDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEENLEGH 360
           ASDDPIDWMEQPPATLVIQGVLRPAF+EEQTVIQKHLSSRHLSNGDINEAQ+LEENLE H
Sbjct: 301 ASDDPIDWMEQPPATLVIQGVLRPAFSEEQTVIQKHLSSRHLSNGDINEAQKLEENLESH 360

Query: 361 GRINHRGHESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 420
           GRINH GHESSSSKDGLNLM+ALDESIPASEASFYRLEMIKVQLFTGNSHPS+VEIEDLM
Sbjct: 361 GRINHHGHESSSSKDGLNLMDALDESIPASEASFYRLEMIKVQLFTGNSHPSDVEIEDLM 420

Query: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480
           KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS
Sbjct: 421 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 480

Query: 481 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
            TQVQTLRFAFDTRATSEFSAEKQLNDLLF
Sbjct: 481 GTQVQTLRFAFDTRATSEFSAEKQLNDLLF 510

BLAST of Cucsa.108220 vs. NCBI nr
Match: gi|700202288|gb|KGN57421.1| (hypothetical protein Csa_3G184030 [Cucumis sativus])

HSP 1 Score: 610.5 bits (1573), Expect = 2.6e-171
Identity = 302/308 (98.05%), Postives = 304/308 (98.70%), Query Frame = 1

Query: 188 QVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED--HDEDDD 247
           QVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED   DEDDD
Sbjct: 10  QVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDEDDDEDEDDD 69

Query: 248 DDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPID 307
           DDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPID
Sbjct: 70  DDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIASDDPID 129

Query: 308 WMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQELEENLEGHGRINHRG 367
           WMEQPPATLVIQGVLRPAFNEEQTVI+KHLSSRHLSNGDINEAQELEENLEGHGRINH G
Sbjct: 130 WMEQPPATLVIQGVLRPAFNEEQTVIEKHLSSRHLSNGDINEAQELEENLEGHGRINHHG 189

Query: 368 HESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAI 427
           HESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAI
Sbjct: 190 HESSSSKDGLNLMEALDESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLMKAQPDAI 249

Query: 428 AHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTL 487
           AHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTL
Sbjct: 250 AHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSETQVQTL 309

Query: 488 RFAFDTRA 494
           RFAFDTR+
Sbjct: 310 RFAFDTRS 317

BLAST of Cucsa.108220 vs. NCBI nr
Match: gi|703134830|ref|XP_010105734.1| (hypothetical protein L484_014223 [Morus notabilis])

HSP 1 Score: 573.9 bits (1478), Expect = 2.7e-160
Identity = 300/511 (58.71%), Postives = 376/511 (73.58%), Query Frame = 1

Query: 1   MIETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQ 60
           MI++ + +RF A A   Y      +RP W+SED++ + + SS CR+ H+C  DV W R +
Sbjct: 2   MIDSTVTLRFSAAATNLY------YRPMWSSEDLSGVVHVSS-CRISHACGFDVPWNRFR 61

Query: 61  RLNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPT 120
             NS S   R  L K  I+ASA+ LG  SDPIK+N   QYHP EE  KS +E   +   T
Sbjct: 62  SANSGSFR-RCNLIKNRIRASAKHLGPGSDPIKKNGKPQYHPFEEFAKSTSENGGEATLT 121

Query: 121 SAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNL 180
           S ET RTII+VNSKAT+MF+ L+ND+V ENIIWPE+PYVTD HGNIYFQ K+ E+ M+ L
Sbjct: 122 SEETARTIIKVNSKATVMFSNLVNDQVHENIIWPEMPYVTDEHGNIYFQVKDGEDTMQAL 181

Query: 181 TSENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDEDED- 240
           +SENNFVQV+IG+DT EMI EMEL GPSEIDFG +E+                E+ED D 
Sbjct: 182 SSENNFVQVIIGLDTTEMIREMELSGPSEIDFGIDEI----------------EEEDSDV 241

Query: 241 HDEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIA 300
            DEDD++DD +D+Y+ DWV+V++DEDD+   DE LGDWAKLETMRSSHPM+FA KL+E+ 
Sbjct: 242 EDEDDEEDDENDDYDEDWVAVLEDEDDEEDEDEALGDWAKLETMRSSHPMYFAQKLAEVV 301

Query: 301 SDDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINE-AQELEENLEGH 360
           SD+PIDWMEQPPA+L IQGV+RPAF EE +VI+KHLS++  SN ++N+  + +E   E  
Sbjct: 302 SDNPIDWMEQPPASLAIQGVVRPAFIEEHSVIRKHLSNQQSSNAELNQVGKPVEGGSEDP 361

Query: 361 GRINHRGHESSSSKDGLNLMEALD-ESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDL 420
            RIN    ES SSKD     E L+ + I  + A+FY+LE+IK++LF+ +   + VEIED 
Sbjct: 362 IRINGHESESESSKDSSTWEEELEKDEITPNGATFYKLEIIKIELFSAHGRQTLVEIEDF 421

Query: 421 MKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVC 480
           MKAQPD IAHSA KIISRL+AGGEKTTQALKSLCWR KGIQVEEAVI G+DSLG D+R+C
Sbjct: 422 MKAQPDPIAHSATKIISRLKAGGEKTTQALKSLCWRLKGIQVEEAVIIGVDSLGIDLRIC 481

Query: 481 SETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
           S TQVQTLRF FD+RATSE+SAE+QLND+LF
Sbjct: 482 SGTQVQTLRFGFDSRATSEYSAERQLNDILF 488

BLAST of Cucsa.108220 vs. NCBI nr
Match: gi|590673249|ref|XP_007038839.1| (Pentatricopeptide repeat superfamily protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 567.8 bits (1462), Expect = 2.0e-158
Identity = 296/510 (58.04%), Postives = 381/510 (74.71%), Query Frame = 1

Query: 2   IETALAVRFPAGANFCYSSAVSYHRPAWTSEDVTSIGNASSFCRLLHSCTSDVHWKRCQR 61
           IE+ALAVRFPAGANFC SSA+ ++RP  +S++VT     S   RL      D+ W R +R
Sbjct: 6   IESALAVRFPAGANFCSSSALHHYRPTCSSDEVTCCHVTSR--RLFRRGGFDLTWDRFRR 65

Query: 62  LNSRSLLGRSYLKKIGIQASAEPLGSASDPIKQNRGLQYHPSEELVKSITEIADDVRPTS 121
           +NS SLL R+ +K   I+A+AE LGSASDP KQNR   YHP E++ ++ ++ ++D   ++
Sbjct: 66  INSGSLLRRTLIKN-KIRATAEHLGSASDPTKQNRRPHYHPFEDIGEATSKNSNDAILSA 125

Query: 122 AETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDAHGNIYFQAKNTEEAMKNLT 181
           AETTRTII+VNSKATLMF G+INDEV ENI+WP+LPYVTD HGN+YFQ K+ E+ M++LT
Sbjct: 126 AETTRTIIKVNSKATLMFTGIINDEVHENIMWPDLPYVTDEHGNVYFQVKSDEDIMQSLT 185

Query: 182 SENNFVQVLIGIDTMEMINEMELFGPSEIDFGFEELDDGASDDGDDDDDGDGEDED-EDH 241
            ENNFVQV+IG DT E++ E+EL GPS+IDFG EE++D              ED D ED 
Sbjct: 186 LENNFVQVIIGFDTTEIMKEIELSGPSDIDFGIEEIED--------------EDSDVEDV 245

Query: 242 DEDDDDDDADDEYNRDWVSVIDDEDDQNHSDETLGDWAKLETMRSSHPMHFANKLSEIAS 301
           DED+DD   +++Y+ +WV+ ++ EDDQ+ SDETLGDWAKLETMRSSHPM+FA KL+E+AS
Sbjct: 246 DEDEDDHAEEEDYDEEWVAALEHEDDQDDSDETLGDWAKLETMRSSHPMYFAKKLTEVAS 305

Query: 302 DDPIDWMEQPPATLVIQGVLRPAFNEEQTVIQKHLSSRHLSNGDINEAQE-LEENLEGHG 361
           DDPIDWMEQP   L IQG++RPAF EE + IQKH+SS    + D ++ ++ +E+ LE  G
Sbjct: 306 DDPIDWMEQPSDGLAIQGLIRPAFVEEHSEIQKHMSSNQSRSSDTSQVEKVVEDKLEDLG 365

Query: 362 RINHRGHESSSSKDGLNLMEALD-ESIPASEASFYRLEMIKVQLFTGNSHPSNVEIEDLM 421
            IN + +E   S D   + E  + + I  + +SFY+LE++K+QL T + H + VE+ED  
Sbjct: 366 IINGQSNELGWSGDSSTISEEPEKKEISINGSSFYKLEIVKIQLITAHGHQTVVELEDFK 425

Query: 422 KAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCS 481
           +AQPDAIA SA KIIS L+AGGEKTTQALKSLCWRCK IQVEE  I GIDSLGFD+RVC 
Sbjct: 426 QAQPDAIAQSAAKIISCLKAGGEKTTQALKSLCWRCKSIQVEEVAIIGIDSLGFDLRVCC 485

Query: 482 ETQVQTLRFAFDTRATSEFSAEKQLNDLLF 509
             Q+QTLRFAF+TRATSE+SAE+QLNDLLF
Sbjct: 486 GPQIQTLRFAFNTRATSEYSAERQLNDLLF 498

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y3913_ARATH1.9e-12147.08Uncharacterized protein At3g49140 OS=Arabidopsis thaliana GN=At3g49140 PE=1 SV=2[more]
MLRR1_PLAF72.6e-0926.62MATH and LRR domain-containing protein PFE0570w OS=Plasmodium falciparum (isolat... [more]
YHT1_YEAST3.4e-0958.00PH domain-containing protein YHR131C OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
CASQ1_PELES4.4e-0940.26Calsequestrin-1 OS=Pelophylax esculentus PE=1 SV=1[more]
KEX1_VANPO7.5e-0940.74Pheromone-processing carboxypeptidase KEX1 OS=Vanderwaltozyma polyspora (strain ... [more]
Match NameE-valueIdentityDescription
A0A0A0LBJ5_CUCSA1.8e-17198.05Uncharacterized protein OS=Cucumis sativus GN=Csa_3G184030 PE=4 SV=1[more]
W9RSG8_9ROSA1.9e-16058.71Uncharacterized protein OS=Morus notabilis GN=L484_014223 PE=4 SV=1[more]
A0A061G0G7_THECC1.4e-15858.04Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma ca... [more]
A0A151T4D6_CAJCA2.2e-15658.90Uncharacterized protein At3g49135 family (Fragment) OS=Cajanus cajan GN=KK1_0164... [more]
A0A0D2RI62_GOSRA8.3e-15656.39Uncharacterized protein OS=Gossypium raimondii GN=B456_003G069200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G49140.11.1e-12247.08 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G24060.23.5e-11347.59 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G59300.13.2e-2631.58 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G44050.14.2e-1049.15 Cysteine/Histidine-rich C1 domain family protein[more]
AT1G48400.16.1e-0932.04 F-box/RNI-like/FBD-like domains-containing protein[more]
Match NameE-valueIdentityDescription
gi|449445979|ref|XP_004140749.1|1.9e-29499.03PREDICTED: uncharacterized protein At3g49140 [Cucumis sativus][more]
gi|659077638|ref|XP_008439307.1|4.1e-28195.10PREDICTED: uncharacterized protein At3g49140-like [Cucumis melo][more]
gi|700202288|gb|KGN57421.1|2.6e-17198.05hypothetical protein Csa_3G184030 [Cucumis sativus][more]
gi|703134830|ref|XP_010105734.1|2.7e-16058.71hypothetical protein L484_014223 [Morus notabilis][more]
gi|590673249|ref|XP_007038839.1|2.0e-15858.04Pentatricopeptide repeat superfamily protein, putative isoform 1 [Theobroma caca... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR012349Split_barrel_FMN-bd
IPR019595DUF2470
Vocabulary: Molecular Function
TermDefinition
GO:0010181FMN binding
GO:0016491oxidoreductase activity
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0010181 FMN binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.108220.1Cucsa.108220.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012349FMN-binding split barrelunknownSSF50475FMN-binding split barrelcoord: 307..346
score: 4.71E-12coord: 385..507
score: 4.71
IPR019595Domain of unknown function DUF2470GENE3DG3DSA:3.20.180.10coord: 445..506
score: 5.
NoneNo IPR availablePANTHERPTHR13343CREG1 PROTEINcoord: 386..508
score: 1.9E-139coord: 296..348
score: 1.9E-139coord: 56..194
score: 1.9E
NoneNo IPR availablePANTHERPTHR13343:SF13PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN-LIKE PROTEINcoord: 386..508
score: 1.9E-139coord: 296..348
score: 1.9E-139coord: 56..194
score: 1.9E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cucsa.108220Cucurbita pepo (Zucchini)cgycpeB0244
Cucsa.108220Melon (DHL92) v3.6.1cgymedB146
Cucsa.108220Silver-seed gourdcarcgyB0794
Cucsa.108220Cucurbita maxima (Rimu)cgycmaB0251
Cucsa.108220Cucurbita moschata (Rifu)cgycmoB0250
Cucsa.108220Melon (DHL92) v3.5.1cgymeB142