CmaCh20G010190.1 (mRNA) Cucurbita maxima (Rimu)

Name: CmaCh20G010190.1
Type: mRNA
Organism: Cucurbita maxima (Rimu)
Description: Glycosyl hydrolase family protein 43
Location: Cma_Chr20:7216757..7232636 (-)
Sequence length: 1611
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGTGATGCCGGGAGCAGATGTTTGATATCTGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGTATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTCGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCGCGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACGCTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGACAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGGAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTAAAATCTATACTCTGCCTTTCTGAGCCTTCATTTTCTGTAATGATAGTCTAATACTTCCATGGCTAAAAATATTGGTTAAATTATAAACACATCCTAAACCAAGAAGTTGGTATCAATAATGCACCCAATTTTTTTAAAGTGTAATTTTAACTCTTTAAGTTTGAAGTTTGTTTTAAAACAGCCCTCGTAGGCTTTTGCAGTTAAATTTGTAGATGGAAAAGTTTGTATATTCTTATGTGAACAAATCTAAACACTTACATGGAAAAAATAAAGACACATGGTGCATCTGTGAACCTTGTATGTGGCCTCCTTTCAACCACCACCATTTTTTGTTCTTTCGTTTATGATTTTTTTTGCTAATCCTTCTTAAATTTACTTCCGGGTGTTTAGATTTGTTCACTTAAGTAATACATAAGTAGAAACAAACGTTTCTATCTATAAATTTAACGAAAAAAGTCTTCAAGGACTGTAATTAAAAAGTTCAACCTTCAATGGTTAAAACGAAACTTTTCGAAGTTTAGCGGTATAGCTGAACCCAACCCCATAGTTGAGGAATAATTTTGATAATTTAATCTAAAAATATTCCTATTGCATAAATTGTTGGCATGCATCTCTATGAATTTGTATGGAAACAAAAGATTTTGATTAGTACTCCAGTCTTTTTCCCTCCTTTTCCATTATTAACAATAGTTTCTTCTTTATTATGGCAATTTCCAATCCCAATTATTGCTCCATGGTGGTTTCAATGACTTTATGTCTATATCTTTTTAAATATGAAATATCAGCTCCAGTGATCGTTGTATGTTGAATTCCTTCTAGTTCATCTTTTTGTACCTTGTTCCACTCTTGATAAGGTCTACGTCCAAATCATGAATCTATAGTTTTTGAGATGGTATGTGATGCTTGGGAGGTTACTTTAAGATCTTGGGAGTATGACAGTGAGAAGAAATTTTAAGGCAGAGGAAGTGAGTGTGCTTTGTGGCCTTTTGGAGCTGATCAAGAGTATGCTGATTTTGAGATTAGAGGTCAAGAAAATGGAATTTAGATAAAACAGGATTTTCTTGGTTAAACGTTCAGTTAAAGAGTCAGTTGATGTTCCCTCTTTTTCTAAAGTTTCGGTAAATGATTTGTGGAAATCAAGAGTCTAATAAATGTGATATTCTAATCTGTTTGGTTTTGGGAAAGCTACAACACAATACATCAAGAAGTTACGAGAAAATAATTATCAGACAATGAACCTTGCTTGGGGCAAGGAAAGAAGATGAAATTAGCATTAACCCAACTGCTCAACTTCTTTTATTAAACTCTTGCAGATTAGGACAAGTCCTTGGTGATTAGAGAAACACCAATCTAATTTTTTGAATCAAACGAATGGGGCCACAAGATTGAAGTACCCTTCGGCGAGCCTAATT
TTGAAATTCAAGGTTCCAAAAAAGGTGAAAAGTTCTTTTATGGCAGTTAGCCTGAAGAAGTTTGAAGAAGCTTTAAAAGAATTTTCAACTTTCAAAAATGGATGATCTCTCCTTCTAGGTGTTGCCTCTGCTTACAAGAGGAGAATCCCTTGATCACTTGTTCTTGCAATGTTCTTTTGTGGCTAAAGGGCAGCAGCCTATCTTCAATGTATTTGGGGTGGATTACTACTTCCCTAAGAACATTAATGATTGGCTTATGGAAGTGCTCAACGGTAAGGTGTTTGGCGGCAAGGGGGAAAGTCCTTTGGAGTTGTGATATGCCTTCCTTCCTTCAGAGCCTTTCGAAGGAAAGGAATAGCCAGGTTTTTAAAAGTAAGTATTCTTCTTGAGTCCTTTTGGCGTTTGGTGCAATACAAGGCCTCTTGGTGGTGCACAGTTCACACTAAATTCTCTTGCAATTAATGCCTTTTGATGATTCAATTGGAAGGCTCTCATTTGTTAGTTTTCTGGGGAGGGGTGACCACTGTGGGGTTGTTGTTTGGTTCTTTTGATCAATATGCATCAATATGCATCACTGCTTTTTTTATATAAAATAATGACCTGCCAGATATTTTGGGAAACTTGCAATTAGGAATCGACAGCATGATGTTACTGGTCTAGATGATAGTTATAGCGTGAGGAACACTACTGAGGAGTTATGTGAATGTTTGATTAATCCACCACACCTAAGTTGTCGAAGCATTCTTGACATCAAGTTGTCGAAGAGGCCATTTATTTGTGACTGCAATAGAAAGCACAACACGGACAATGGTAGCTTTGACAATCGGACTGAAAGTGTCAGTGTAGTTAAGACCAGGAACCTGAGTATAACTTTGGCAACGAGACGAGCCTTGAAACGCTCGACGGATTCATCGGGCAAATCTTTAATACGAAACACCCATTTAGAGCCCACGATGTTGGTGTTGGCAGGGCGAGGAACCAAAGTCCAAGTATCATTTTGTTGTAACGCTCAAATTTCTTCATCCATGGCAGCCACCCAAGCAGGATTCTTAGCCGCAGATTTGAATTCTTTTGGCTCAGTGGATGCAAGAAGAGCAGGGAGAAGTCCAGATGAGCCCAACATACCAATATTTGCTGGATGACGAGTCTTGAATAAACCAGCTTTGGCGCGTGTGATCATAGGATGAGTGCCCAAAGAAGAGAAATCAACAGGAGGTTCAATAGAGGTCGAATTAGAAGTCGAGGGTGGCAAAGTGGAACCTGCAAGAGAAGTATCAACCTGCACAGAATCATCTACAAGGTCAGAACAAATATCACACGGGGATGAACTGGATTGAGGAATGTGCGGTGATGAAGTGGTAGGGGGAGATGAATCAATATGATGAAGATGTGGTTCCAAGAAATTTGAAATAGGAATAGTGGAAAGAGATTGGGCTTGGGAGTTAGGGATAGCAAGAAAGTGAGTTTCATCAAATTGAGCATGGCAGGTGATATATAGCTTAGTGGTGGCGGGATCAAGACAGCGGAACCCTTTATGAACAGGACTATAACCCAAAAAAATATAAGGAATGCTGCGGGGAGAAAGTTTGTTAGGCATATAATCACGCAAATAAGGATAAACACGACAACCAAAGGGATGAAAATTGTCATAATGTGGAGTGTAGCCATAAAGGAGTTCAAAGGGTGACTTACCTCCAAGAAGTGGAGTAGGCAACCGGTTGATAATATAAGCTGCAGTGCTGAAGGCGTCAACCCAAAAACGAGGAGAAAGATGAGAGTGAACGAGAAGGGCCAAGCCAGTCTCAGTCACATGACGATGTTTTCTCTCAACACGACCATTATGAGCAGGTGTATATGGACAAGAGAGTTGATGGTGGATGCCAGAATTAAGTAAATGAGTTTTGAAACAAGTACTAGTAAATTCGGTACCACCATCGCTTTAAAATACCTTGATACGAGAAGAATATTGATTTTCCACAAATTTTTGAAATTGAAGAAAA
TATCAAAAAAATTAGATTTAAATTTTAAAGGGTAAAACCAAGTGAATCGAGAATAATCATCAATAAAAATAGCATAATAAAGAAAACCCGAATTTGATTTGATGGGAGAAGGACCCCAAAGATCCCAATGAATAAGATCTAACACATGAGACGACCTACGTTCATTGCGGGAATAAGGCAATCGATGACTTTTCGCAAGCTGACAAGTATTACATAATGATGGAGAAGGCAATAAAGACGTAAGAGAAAGATGACCTTTTTTATTTAAAAAAGAAATAACAGAATGATTCACATGACCCAGACGAGCATGCCATAAATCATATGAAGCACGTAAAGATTTGTTTTTAAGGGCTGAAATAAAAGCAGAGTTGCCGCGCTCCAGCACATATAGCCCTCCATCTCTTTTACCGGTTGCCACCACCCTTCCTGTTTGATGATTCTGGATAGTAATTAGATTATTAGTAAATGTAACGGAGAGAGGAAAATCAGACGTTAATTTACTTATGGAAAGAAGATTTTTAGTGAGGTGAGGGACAACCAAGACATCTAATAAGTGAATATTTGGAACATGAGAAAGAGTACCGGTGTGGGTAATGGGTAGGGATGCACCGTTTCCTACGATCACAGAGTCCTTACCCATGTAATTTTTAGACTGATCTAGAATTGATGGGTCGACAGTCATATGGGCCGAAGCTCAAGTGTCTAGAAACCAATCAGCAGCATCGGGTCCAGCAATAGAACATGACGTGTTAAAGGCTTCAGCAAGGTGAGCATGAGAAGAATCAGGTCGAACATACCGTTGGTTGCAGCGGTCAGCATAATGGCCCTCTTTGCGGCATATTTGGCAATGAGGTGGTCGACGACCCTGACTTGAGTGGGTTCGTCCTCGATTAGAAGAGTTGTTTTTGTGAGAATAAGAACGACCTCGCTGGTTGGTAAAGGAAGCAGGGTGACTTCCATGGGTGCGACCACGATTAGTGGCTGTGAATGCTGTAGGAGTGGAGTCAGAGGACTCAAGGGAGCGCTGGAACAACTCAAAACTTTCAGTTTTAGAGACTAGATGTGCAAAACAGGGGATAGGGGTGAGAGCTATCTGAGCAGTAGAAAAAGCTGAAAATTCGGTGCCGAGTCCACGAAGGAACCAGTGCACTTTATCAATGTCCTCGACGGGTCTGCCAATGGCATGAAGTTGGTCACAATTTTTTTTTGAAGGTACGGGCATACTCAGCAACAGGTTTTGTGCCACGTTTCATCAACTGCAAGTCATCCTTGAGTCTCAGTTCATGAGCTTTTGACTGATGGCTGAACGTAGTTTCCAACGCAAGCCAAACATCACGTGCAGTAGAGAGACCAACGACAACAGCCATGGCTTCCTCAGTGAGAGAGGAGAGCAGGAGACAGAGAAGTCGTTGATCGGCTGCTCTCCATGCCAAATATTTGGGGTTGAATGTTGAGGAGGTTTCTGGTTCAAAGCGAGGTGGTGGAACCATAGTTCCATCGACATAGCCCAGCATGTCTTGACTCTCAAGGAGAGGGAGAAGTTGGCTTTTCCAAAGAAGATAATTGGAGGGAGAAAGTTTGATGGTGATCATATGGATTAGAGTATTGAAAGGAAGAAGATGATAACAAGATTCGGAAGCCATAGAGAAGAAAGGATCAGGGTTGTTAAACCGAAAGGGCTCTGATACCATGTGAATGTTTGATTAATCCACCACACCCAAATGGTGGTGAGAGTTCTCTTATTTATAATCATTTACAGAATACAGAAATAAGGTAAATTACAAATATAAGGAAATAGCGATAAAAGGAAAGAATAATAAATGAATACAATATATTTGCATTTAATAAAAGGGCAAATATGGTAACCCGTAATGTAAGGCAAATATCTAATATTTGCTTTATAATGAACGGTCAACTATCCTTAACAAGTTCAGAATTGATTTAGAGTTTTTTGATGCACCTCCTTGAAATTGGAATGTTAATTAGTCATGGCCGACGAG
TTTATGGTCCTCTGAGCCCCAAGGGAAGGAATGCTGGTTGAGACGAGTTGGGTGATCTATTTTGTCTGTATGGGCCTTGTGTTGGAAGAAGGAATAGTGAAAGAGATATGAAAAGATAGAATCGATGAAAGCTGCAGTGCTTTATTATTGTCTACTGACTTGATGCTCAGAGAGTTACACAATTCCAGTATTTATAGCTTATACACTGGACCAGAAATAGTAAATCATGGAAATCTAGATAAATTCAAATCTACTAACAATCTCCTAAAATCTTGGTAATTTGAATATTTGAATCATATCTCAACTACTTTAAATCATATCTTCATTTTGAATACTTTGAATCATATCTCAACACTCCCCCTTGATTCAAAGTTGCAGATGTCAAGTACAGAACTTTTCTTTTGACAAGGCCTTCGTAAATATTTCTGCTTTAACTCGTACAATGTCTGTCTCAGCTTGTATACCTTTGTTTCTTTGTCTTCCTTTATGAATCCCTCTGACGGTGTTACATAAACCTCTTCTTGTAAATCTCCATAAAACTTTTCGTTTGGTAGGTCAACCATATCCCATGTTCCATCTTTCTTAATGGACTGCATCTCTACTGCCATGGCTTTCTTCCCGTCTTCTTTTTCAGTTGCTTCTTCATAATTCATAGGGTCTGAAACAGGAAGAGCAAATTGACATGATGCATATATGTGAGCTAAAGATTTATACTTCCTTGGAGGTGTTTCATCCGAAAGTTCTTCAAGAGAAGAATTGCTGTTCAGTGATGTTGAAGATGATGAATTTGACACACTTTGTGTTGCAGTGAATGCACTGGGAGATGCACTGGAGTCGACCCTTGTCTCTGCATTTGGGGAAACTTCAACTGTAATCTTTTGCTACTCCTGTTCTTTACTCCAATCCCAACTCACGTTTTCATTAAATATCACATCCCTTCTAACTAAAATCTTTTCACTGATGGGGTTGTATAATTTATATGCCTTTGATTGAGTACAATAGCCAATGAAAATGCATTTTTCAGATTTTTCATCCAGCTTTTGATGAATTTGAGGATGTTTCAAAGCATAAGCAACACAGCCAAAGATTCGTAAATGACTTATAGATGGTTTCCTTCCATGCCATGCTTCATAAGGAGTTCAATTCATAACCGCCTTTGTTGGTGAGATGTTTAATAGATAAACAAATGTTGCTACGGCTTCTGCCCAAAATTGATTTGGAAGTCTCCTTGCTTGTAACATACTTCTTGCCATCTCCACAATAGTTCGATTTTTTCGTTCAGCGATTCCATTTTGTTATGGAGTGTAGGGAGCTGTTAACTCTCTTTGGATGCCCTCTTCTTCACAAAACAAATTAAACTCTTTAGATATGAACTCGCCACCTCTGTCTGTGCGAAGAACTTTAATGTGACAGCCACTTTGGTTCTCCACCATAGCTTTAAAAATCTGGAACTTCTGAAAAGTTTATGACTTATGTTGTAGGAAGTAAATCCAGCTCATGCGACGACTATAGTCATCAATAAATAGTCAAAAGTATATACTACCACCCAAGGACTTCGTTTGCATTGGCCCACATAAATCAATATGAATTAGTTCTAGATAATGTGAAGCTCTTTTCGCTTTTACAACAGGAAAGGACTTCCTAGTCTGCTTGCCATAAATGCATCCTTCACACAAATTAGTTGAATCAATTCTTGGTAATCCGAAAACCAAACCTTTATCCCTGAGTATTTTCAGGCCGTTCATATGAAGATGTCCATATCTGAGATGCCATAATGTTGAGTCATCTTTCATGCTAGTAGTAAGAGCAAAATCCTCCATGTTAGACACATGGAGGGGAAACATCTTATTTGAAGTCATGACAATTTGAACTTTATGGCCTGATTTCTTATGTGTAATGACGGTTGTGTTGTCATCAAATAGAACTGAATGTCCTCTAGTCATCAATTGTTCAACCCTCAATAAATTGTACCATAAATCAAGTACAAATTGAATATTATC
TAATTGTTTTATCTTACCATTACTGGTTTCGACTTTCACTGTACCTTTGCCTTCAACTTGCATCTCTTTTGTATTTCCAAGTTGCACCTTAATCTTTTGTGTCTCTAGGATTTGGTGCCTTGTCATATGGTTCGAGCATCCGCTATCAACAAACCATAAGTCACCTTTTTTCGGATTAGTATCCATGCACGCCACAAACATCTTTTCTTCTTCTTCTTCATTCTCTGCTGCAAATTCATTCGTTGATTTTTATATCAACAATCAGATTTTGTGTGCCCATACCTTCTGCAATGGTAACATTGTATGACATTCCTTTGTTCATTGAATTGTCTCTGTCCATCACTTCTCCATCTACCCCTGTTATCACGACCACTATGGAAGTTACGAAATCCTCCTCTTCCACTACTTCTACCTGATAAATGAATATTTTCTCTTTCGTTGTTTTTGTGGTTGGTTGTCTCTTTCACTTGAAGTGCCTTTTTTTCGTTCTTTTCTAATGATCTATTGATTCTTGCCTCATGAGCCTGAAGCGAGCCCATCAGTTCATCAACGGAGAATATGGATAGATCCTCAGCTTCTTCTAAGGCAACCACCACATGGTCAAACTTTGGAGTCAAGCTTCTCAACACCTTTGCAACAATTGTTTCGTCTGAAATTTTCTCTCCATAAGTACGCATCTGACTGACTATTGTCATTGCTCTTGACAAAAAATCAGCAATTGATTTGCCATTCGTCATGAGTAGAGTTTCAAAATCACGTCTTAGAGATTGCAATTTCACTGTCATGACCCTTGAATCTTCTAGAAACTTCTTCTGTAGAATTGACCATGCCTACTTTGATGTGGTTGCTGCTGCAATTCGTGAAAAGAGTCTCATGAACTGCTTGCTGAATAATGAATAGAGCCTTGGCATCGTTTTTCTTGGTTTCTCTTAGTCTCTCCTTTTCTTCTATTGAGGGTTCTAATACATCAACAAACCCGTGCTCCACCAAGTCCCATAGCTCCTGCGATCTGAGCAAGGTCTTCATCTTGATGCTCCACCACTCATATTTCTCACCATTAAAGATTGGTAACAACTGTGCAGAAGAAAAACTAGCTACTGCCATTTTTGTTTTGTGAAAGAAAATAATCACTCGCCCTTGTGTCTAACGAACTTGGCTCTGATACCAAAATTGTTGGAAGAAAGAATAGTGAAAGAGATGTGAAAAGATAGAATTGATGAAAGCTCCAGTGCTTTATTATTGTCTATTGACTTGATGCTCAGAGGGATACACTTCTAGTATTTATAGCTTATACTGGATCAGAAATAGTAAATCATGGAAATCTAGATAAATTCAAATCTAATAACAATATCCTAAAATCTTGTTAGTTTGAATATTTGAATCATATCTCACACCTTTATGTAATTTTCTTTCGTACTGAGAAGATCTTGGCTGGCTGGACAATGAAATCAATAGGCTCTTCAATTATTTTATTGAGAGAAATATCCTCTTCGACTCTCCTCTATTGTACGGAAAGTTTACTTGGTTGAATAGTAGGGCACGTAGTAGACTTCATAGGTTACTAATATCTAAAGGGTAGACGAATATTCATGGGAGTGTGAGACAATTGCTAGGCCCTAGAATAACCTTCGACAGGTGTCCAGAGGTGATGATTGTGTCTTTCATATTTGAAAATATTTGGTTGGACCGTACCTCCTTATAATCTTCCTTTCCTTCATGTTGAATGCGGAAACCAATCGAAATTAAGAAATTCGGAACGGTATCTTCCATGGGGAAGCTAAGGGTCCTCAAAGGAGTGTCATTGTGGAACAAAGACTCACGTAGGAGTTTTAAAATTCTGGAAATTAAAAAGCTCAAAACAAAGCCCTTTTGGTTAAATGGTTATGCCAATTTCCCCTTGAGTCGATAGAGGACTATAGCAAGCAAATAGGGTCCTCATCCTTCTGAGTGGATGTCAGGTGGGTCAAAGGCACTTGCAAAAATATTTGGAAAGAAATTTC
CGATGAGCTACTTTCTTTCTTTCACTTGGTCCCTTGTTTTGTGGCTGATGGGGAAGATAAGTGTTTTTATGAGAACAAGTGAAAGGCGATGGAACCCTCTATGGTACGTTGTCTCGTTTATATTGACATAATTTTAATCCTGGAAGGGTAGGTTTACGAAAGTTTAGGGACCATTCCTTTTTCAACTGCATTTAAGTGATTTACATCTGCTGTATGTTGGCTGTTGGCGTTTCGAGTTCCCAAGTTTAGTAACTGCACAAAAATTTTCTTTATCTAAAAACAGTCTCACGAGATCATTGTCACTCGAAAGTGACATCCAGTGCAGACCTGAGGAAATAAATCTTAGAACAGGATCTCTAGTATTAGACTTCTTAAATTACCCCATTGTCTGTTTGAAGGTGCGCATCATTGGAAAAGGGACGCCTGTTTCTTTTTGCGTACTGATGAGGCCTGCACTTCCTTCCAGCATCAATTTTATTGAAGTAATTTGAATTTAAGCCTCCTCCTTAAAGCCATTGTAACCTTCAGTCCTATAGTAGGAAATGAATTTCATGTTGTTCCTTTTACATTTGCTTTCTCTAGACCTCTAGGAAAAATTTGAAAAAGAAAAACCTTAGAATGATATGATTCCTTCTTTAATGACGCGATTTTTGATGCCTTGAATCACAGGTCACCACGTGCACTTGAACTTAAAAAGAGGTGGTATTTTTTTTTTATGTAAAGCGGGAGACACAAGGAATTTAAAATTCAAATATGCATTCAAAACAACATAAAGGTAGTTTGAAGGTAATATCCGTGAGGTTAAGTCAAAACGGTTTACCAAAAAAATTCATGAAGTTTCGATAAATAGCATTTAAAATAATAATAAAATGACAAAAAGGAAGACGACTCGATCTAAAGGGCACCCCCTATAGCTGCACAGTGTCACAGTCATGCTTGTCCTAGGTGTGTTGCCCATGCATGCCCGTTGCATGCCAAAACCCTTGTGTTGTATCACTTCAATGCTTTCCCTGCTTTCATGATGTATAGTTTATTTATTGTATTTTCTAGGTGTATGATGTTTCAACAAAATCATGTATATGCCTGATAGACCTGAAATGCCTCAAAATGTAGCATAGGGAGATGTGACTAATCGTCAAATCTTCATACTCGGAGTTATATACTATCTAAGCCATTGTGTTGCCTTTTTCGCTTATAGCCTATAACATTGCTTTGTTCATTTCTCTCAATCTTGTTTTCAAATTCCCTTTCTAAAATCTCTCTCTGGTGGTGTTTTCTAAAACTTTTTCCTTAAAGCAAGGCCAAAGGCTTGAGTATACGTTGTCAGGAAGAAGGCAACATGGTTTCGAGTTGGCGGACTCACTCGATATTGAGAGTGAGTGCTGCACAATCGCCTAAAAAACATTCCGTAAGATTGCGATTGTGACAGTTGGTATTAGAGTCGAGTTGGCACTAAAAGTGATTCGGTAAAAATGGCTACAACCAAGCAGTTGAACAAGTCCCACGTCGATTGACTAGTCGACAACGAAGAAGAGATGCAATTCTTGTGAGAAGTTCCTTACAATGTTCAGTACTTGGACAAGCGGGTGAAAAAGTGATTCGGTAAATATGGTTGTTGTAGTTGCAGGTTGATTAGATGAGTTACCCATCCAGGAACTTATGTACCGAGTAGACAACCTAAAAGCAAAAGCCACAAAGACTGGTGGCTTCGAGCGAGGAGACAGCTCGACGGGCTTGACAGCTCCTAAAAAGGAGTTATGCAGATGGTCTCTGAGATATTTGACGATGTGAAATTAGCCTTCGACGTGGTCAGGGCAGAAATTGTTAGGAATCACGACTCTCCAAAATGGTATGATATTGTCCATTTTGAGCATAAGCTCTCATAGTTTTGCTTTGGGCTTCCCCAAAAGGCGGTTCCAATGGAGATCTATTCTTTGCTTATAAACCCATGATCCTTCCTTAAATTAGTCAATGCAGGACAAACTCCCAGTTCAGTTCTA
CAAGATCAAGGTTCTGGAGCCCAAACCCTTCTATGGGGTTTGAGATGCCGAGGTTCTAGCGACCTTGATCAATACTTTTGGGCGATGAACACAGCAACAGAGGAAGCAAAGGTCACCTTGGTCACCATACATCAAGCCGAGGATGCAAAATTTTGATGGACATCGAAGTACATTGATATTCAATTGGGCCAGTGCACAATAGACAAATGGAAAAGACTGAAACAAGAACTTCGATCTCAATTGTTCCCAGAGAATGTTGAAATCTTGGCTAGAAGGAAATTGCGGGATCTCAAGCACACAAGAAACATTTGAGAGTATGCCAAACAGTTCTCAGGGTTCATGTTGGACATCTGAAATATGTTCGAGAAAGACAAGATCTTCCACTTCATAGAAGGACTAAAGCCATGAGCAAAGTCCAAACTATATGAGCAAAGTTCAAACTATATGAGCAAAGAATACAAGACCTCTCCACAACCTATGCTACAGCTGAACGATTGTTTGATCTAAGTAACGAACAATTCCAAGATATAAGGCGAAGCCAAACCTCCTCGAGTGGATTGTTTGATATGAGCTGAACGATTGTTTGATTTGAGCTAAACAATTGTTTGATTTGAATCTACCATTTTACTTTTGGCCAATTTATGATTGACCCATGGTTCTACCCACATGAGACCTTTTTCGATCGGCTCTTTTGACTCCCCTACTTTCTTTTGTAGGGCAGATAGAAACTTTAGGGCCCCATACAAGGGTTTGCTCTGTTCTCCGACGACGACTTTTTGGTTTTCCCACCTTCTGACTTTGCCTCTAAGTCTGTGTCCAAAGCTGCTTGGAAGGCTTGTAGGGCCGCTTTCTTGGGGCACTCGTATACTCTATGGTTCTCTCTACATAAGGAGGGAAGTAGCCGATTGAAGTAATTTTGATGGGGTAGGGCCCTCGGCCTAAGACTAGTTTCAGGAAAATCACCCTAGGTTGGCCTGGTGGTCATCAAGGGACATGTAAACAATAAAAGACAAAATAAACTAGCTCTAACCATAGTAGCCGCTTACACAAGATTTAATATCCTACAAGTACCTTGACAACCAAATGTAGTTGGTAATAATCCTACAGGTACCTTGGTTGTGAGAATACTCAAAGTGCTTGTAAGCTTCCCATACACTAATGATATTAGAGATAACAGGAGTTCTAGGGAACACTGGATATAAAAATGTGAAATTCCTAATTTCATCATAAGCTTAGATTTCAATAATGTTTACAAACTGAGTCTAAACAAAATTCGAATTGGAGAGAATTTTAGCCAATATCTATTGTGAAACATTAAAAAAAATTTGTACACTTCACAATAACTATCAATCCATGCTAATTTATCTATTCCAATATAGAATATAGAAAATTTTATGTTTTACTTCTCCCAATCTTAACTGGGGAGAACTTGGTGTGGGTTTTAGACGGTTGTGGCTAAACATTTCTCTACGTTTATGCTGGTAGTAAATGCCAACAAGCTTTCGAGTCCCTCTTTAGGCTCAAAACTAAGCTGCAGAGTGAACGTATTGAAGAGGTATCTAGGGAATCCGTGTTATTGAGAGGACTCTAAGCGCACAATCCTAAAATACATCCTTAGCCATCTTTGGAAGATTCTTATACCTCGTAAAATTTACCTTCCACAATTCTAAGATGTCAAACCAGTGTGCCCCGGTATTCCCCAACATCCCGTGTCGTAGTTGAGCTATTTTCTCTTCGCTTCTAAGAGTGAAAACTATCCCCATAGACTAACATAAGTCCTTCTAGCATGCTTTGTTCTCACTCACATACAACTAAAAGAAAATTTTCAAGAAGTCACCCAACATAGAATTGCTCCAAGCAAAATACGTTTAACTGGAGTTTATATGATTGAGCCACCGAAAAGGAACGTACACCTTATTGGTTTAAGTAGTAACTTTCCATTCTTTTAATTCTTCATTAGTTATCCTATCCTTAAGATCGCTCTCATTCATTATCCTAAT
CTTTACTCAAAATATTTTCATTCTCTTTTTCCATCGCTTCTATCCTTTCAATTCTTTTACCAAGTTTTTAAAATTATTCCATGATATTACTTTGCAAGTGTCTGCTGTGTAACATCAAACCTACATTTTCAATGAAGTTTGTTCTAATTCAACAACACAATCAAGATTATTCAACATGTGGTAATCCTTTGTTGTGTGTCATGGCAAAAAGGATAGAAGCGATGTTTGGTAAATATTGGAAAGATTTTCTCGTGTGTTGCTGTTATGCTAGAATCTCATTATACGTTTGAGTTCGTGATGTTTATGCTTACAGATTTATATGATGAAGAGATTGCAAACCAAATTAGGAAGAAAAAAAGTTAAAGATTAGTTTTCTCGACACTTAAATATTAAATATATTTTATATTAGTTCTTGCTGACCTCCTTCGACAGCAAGAACAATAGAACAGCTCCTTCTGGGCCATCTTTTCTAAGAATAATGCGAAGATTTTGTCGAAATGTTACACTCAAAGGTGTTTTCTGGTAAACGTGGCTCGAAAGGAACCCCAAAAATTTAAAAAATACAAGTAGAAACTTCAACCAAAGTTGGAAATGCATGTTAGAACCTTTTCTTGAAGTTGAATTTCTGTTCTTCTGTTCTTATGTTCTTGCCTATTTTGGGTTGATTGCTTATCTCCAGCTTTTGCACCTCATCTTGCTTTAAATATAATTCTTAGTTTCTTACTATATATATATATTAGGCTGTGTTCTAGGATCTCTCCTGTGATCGAGATAATTGATATCAGGAGTTTGGAGGGCCGTCAGAATTGTTGATACTGAGTTGTTCGGAGGATGTAATATTCATCTAATGATCTTAAAACCAACGATTGTTCTGTTGTGACTTTCTTTCAAATCGGATAATCTTGAATCCTGCGCCACTAGTTTTAAATTTATCACATTTTTTCACCATCTTTTGTTCATCATGTGACTTTCAAGAGCAACCAATTAACCTTCAAAACTACACTAACCTTTGGCATTCTTATGATTAGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAAAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTTGAGAGGCCAAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCCAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTACAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGCTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAAGACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAAACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCGCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTCGATCAACCCCTCGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCAATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

mRNA sequence

ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGTGATGCCGGGAGCAGATGTTTGATATCTGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGTATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTCGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCGCGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACGCTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGACAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGGAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTGCGCATCATTGGAAAAGGGACGCCTGTTTCTTTTTGCGTACTGATGAGGCCTGCACTTCCTTCCAGCATCAATTTTATTGAATTCTTGCTGACCTCCTTCGACAGCAAGAACAATAGAACAGCTCCTTCTGGGCCATCTTTTCTAAGAATAATGCGAAGATTTTGTCGAAATGTTACACTCAAAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAAAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTTGAGAGGCCAAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCCAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTACAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGCTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAAGACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAAACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCGCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTCGATCAACCCCTCGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCAATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

Coding sequence (CDS)

ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGTGATGCCGGGAGCAGATGTTTGATATCTGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGTATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTCGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCGCGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACGCTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGACAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGGAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTGCGCATCATTGGAAAAGGGACGCCTGTTTCTTTTTGCGTACTGATGAGGCCTGCACTTCCTTCCAGCATCAATTTTATTGAATTCTTGCTGACCTCCTTCGACAGCAAGAACAATAGAACAGCTCCTTCTGGGCCATCTTTTCTAAGAATAATGCGAAGATTTTGTCGAAATGTTACACTCAAAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAAAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTTGAGAGGCCAAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCCAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTACAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGCTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAAGACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAAACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCGCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTCGATCAACCCCTCGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCAATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

Protein sequence

MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKGAARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFCRNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
BLAST of CmaCh20G010190.1 vs. TrEMBL
Match: A0A067GGY3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011114mg PE=3 SV=1)

HSP 1 Score: 656.0 bits (1691), Expect = 4.0e-185
Identity = 329/530 (62.08%), Positives = 382/530 (72.08%), Query Frame = 1

Query: 5   LGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRG 64
           LG K+ +KM MRN+Y+K T   C+AGS+C +S+++  L G +LLLH  S VS K      
Sbjct: 20  LGIKRKRKMRMRNKYKKPTTFPCNAGSKCSVSIILWILAGFLLLLHFFSLVSHKDGTSGE 79

Query: 65  IQLRTSRHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQLRHK 124
           I+L  S +  FREL EVEEENIQIPPPR KRSPRAAKRRPK+T TLIDEFLDE+SQLRH 
Sbjct: 80  IELHISHNPSFRELVEVEEENIQIPPPRGKRSPRAAKRRPKRTTTLIDEFLDENSQLRHV 139

Query: 125 FFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDG 184
           FFPD KT++DPM                                D  ++++Y+Y      
Sbjct: 140 FFPDMKTAIDPMK-------------------------------DNGNDSFYYY------ 199

Query: 185 PTYHAHKKGAARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSF 244
                      R+ +  +G P+        A    I + E   T F     +  P+  + 
Sbjct: 200 ---------PGRIWLDTEGAPIQ-------AHGGGILYDERSRTYFWYGEYKDGPTYHAH 259

Query: 245 LRIMRRFCRNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNS 304
            +   R    V + GVGCYSSKD+WTWKNEGIVL AEETNETHDL+K NVLERPKVIYN 
Sbjct: 260 KKAAAR----VDIIGVGCYSSKDMWTWKNEGIVLAAEETNETHDLYKLNVLERPKVIYND 319

Query: 305 RTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAY 364
           RT KYVMWMHIDD NYTKA+VGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDG AY
Sbjct: 320 RTGKYVMWMHIDDCNYTKAAVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGVAY 379

Query: 365 LAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAP 424
           L YSSEDNSELHIGPL+ DYLDV+NV +RILVGQHREAPALFK  GTYYM+TSGCTGWAP
Sbjct: 380 LVYSSEDNSELHIGPLTSDYLDVSNVVRRILVGQHREAPALFKHLGTYYMVTSGCTGWAP 439

Query: 425 NEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNP 484
           NEAL HA+ESIMGPWE +GNPCIGGNK+FRL TFF+QST+V+ L   PGL+IFMADRWNP
Sbjct: 440 NEALVHAAESIMGPWEDMGNPCIGGNKVFRLTTFFAQSTYVIPLAGLPGLYIFMADRWNP 492

Query: 485 ADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWN 534
           ADLR+SRYIWLPL+V G  D+PL+YNFGFPLWSRVSIYWH+KWRLP  W+
Sbjct: 500 ADLRESRYIWLPLIVRGPADRPLEYNFGFPLWSRVSIYWHKKWRLPSRWS 492
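
The percentages in the HSP summary line follow directly from the reported counts: each is a match count divided by the alignment length (530 columns for this hit, gaps included). The sketch below reproduces the two figures for the A0A067GGY3_CITSI hit; the helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP summary of the first TrEMBL hit above.
identity = percent(329, 530)   # identical residues / alignment length
positives = percent(382, 530)  # identical or conservatively substituted residues

print(f"Identity = {identity}%, Positives = {positives}%")
```

The same calculation applies to the other hits by substituting their counts, for example 231/290 for the AT3G49880.1 alignment.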

BLAST of CmaCh20G010190.1 vs. TrEMBL
Match: A0A061DHP7_THECC (Glycosyl hydrolase family protein 43 isoform 2 OS=Theobroma cacao GN=TCM_000802 PE=3 SV=1)

HSP 1 Score: 654.4 bits (1687), Expect = 1.2e-184
Identity = 329/521 (63.15%), Positives = 381/521 (73.13%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M +RN+YRK TA  C+AGSRC +S V+ SL+G +L+LHL S VS +  +G  IQLR SRH
Sbjct: 1   MRVRNKYRKPTAFPCNAGSRCSMSAVVWSLVGFVLMLHLYSLVSHRNPVGGDIQLRMSRH 60

Query: 73  LHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTS 132
              RELE+VEEENIQIPPPR KRSPRAAKRRPK+T TLIDEFLDE+SQLRH FFPD KT+
Sbjct: 61  PLVRELEQVEEENIQIPPPRGKRSPRAAKRRPKRTTTLIDEFLDENSQLRHVFFPDMKTA 120

Query: 133 VDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKK 192
           +DP                                 D R+++YY++              
Sbjct: 121 IDPTK-------------------------------DARNDSYYYH-------------- 180

Query: 193 GAARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFC 252
              R+ +  +G P+        A    I + E   T +     +  P+        ++  
Sbjct: 181 -PGRIWLDTEGNPIQ-------AHGGGILYDERSSTYYWYGEYKDGPT----YHAHKKGA 240

Query: 253 RNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMW 312
             V + GVGCYSSKDLWTWKNEGIVL AEET+ETHDLHKSNVLERPKVIYN    KYVMW
Sbjct: 241 ARVDVIGVGCYSSKDLWTWKNEGIVLAAEETDETHDLHKSNVLERPKVIYNDNMGKYVMW 300

Query: 313 MHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDN 372
           MHIDDANYTKA+VG+A SDYPTGPF+YL S+RPHG++SRDMTIFKDDDG AYL YSSEDN
Sbjct: 301 MHIDDANYTKAAVGIASSDYPTGPFEYLRSQRPHGYESRDMTIFKDDDGVAYLIYSSEDN 360

Query: 373 SELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHAS 432
           SELHIGPL+EDYLDV    +RILVGQHREAPALFK QGTYYMITSGCTGWAPNEALAHA+
Sbjct: 361 SELHIGPLTEDYLDVKPDMRRILVGQHREAPALFKYQGTYYMITSGCTGWAPNEALAHAA 420

Query: 433 ESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRY 492
           ESIMGPWET+GNPCIGGNK+FRLATFF+QSTFV+ LP  PG +IFMADRWNPADL+DSRY
Sbjct: 421 ESIMGPWETMGNPCIGGNKMFRLATFFAQSTFVIPLPGIPGSYIFMADRWNPADLKDSRY 464

Query: 493 IWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGW 533
           +WLPL+VGG  D+PL++NFGFPLW RVSIYWHRKWRLP  W
Sbjct: 481 VWLPLIVGGPADRPLEFNFGFPLWPRVSIYWHRKWRLPLRW 464

BLAST of CmaCh20G010190.1 vs. TrEMBL
Match: M5W5C1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005306mg PE=3 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 1.5e-184
Identity = 320/524 (61.07%), Positives = 376/524 (71.76%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M MRN+YRK T   C+AGSRC  S V+ SL+GC L+  L S V +   +   +Q R++ H
Sbjct: 1   MRMRNKYRKPTTFHCNAGSRCSTSAVVWSLVGCFLMFQLYSLVHQNDRMRGEMQFRSTHH 60

Query: 73  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 132
               ELEEVEEENIQIPPPRKRSPRAAKR+P++  TLIDEFLDE+SQ+RH FFP  K  +
Sbjct: 61  PQIHELEEVEEENIQIPPPRKRSPRAAKRKPRRPTTLIDEFLDENSQIRHVFFPGQKHVI 120

Query: 133 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 192
           DPM                                D  +++YY+Y               
Sbjct: 121 DPMK-------------------------------DTGNDSYYYY--------------- 180

Query: 193 AARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFCR 252
             R+ +   G P+        A    I + + L T +     +  P+        ++   
Sbjct: 181 PGRIWLDTDGNPIQ-------AHGGGILYDDKLRTYYWYGEYKDGPT----YHAHKKGAA 240

Query: 253 NVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWM 312
            V + GVGCYSS+DLW WKNEGIVL AE+TNETHDLH+ NVLERPKVIYN RT KYVMWM
Sbjct: 241 RVDIIGVGCYSSRDLWKWKNEGIVLAAEKTNETHDLHELNVLERPKVIYNERTGKYVMWM 300

Query: 313 HIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNS 372
           HIDD NYTKA+VG+A+SDYPTGPFDYLYSKRPHGF+SRDMTIFKDDDG AYL YSSEDNS
Sbjct: 301 HIDDVNYTKAAVGIAISDYPTGPFDYLYSKRPHGFESRDMTIFKDDDGVAYLIYSSEDNS 360

Query: 373 ELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASE 432
           ELHIGPL+EDYLDVTN+ +R+LVGQHREAPALFK +GTYYMITSGCTGWAPNEALAHA+E
Sbjct: 361 ELHIGPLTEDYLDVTNIMRRVLVGQHREAPALFKYEGTYYMITSGCTGWAPNEALAHAAE 420

Query: 433 SIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYI 492
           SIMGPWET+GNPC GGNK+ RL TFF+QSTFV+ +P+ PG FIF+ADRWNPADLRDSRY+
Sbjct: 421 SIMGPWETMGNPCAGGNKVSRLTTFFAQSTFVVPVPAFPGSFIFIADRWNPADLRDSRYV 467

Query: 493 WLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           WLPL+VGG  D+PLDYNFGFPLWSRVSIYWHRKWRLP+GW+  K
Sbjct: 481 WLPLIVGGPADRPLDYNFGFPLWSRVSIYWHRKWRLPRGWSSSK 467

BLAST of CmaCh20G010190.1 vs. TrEMBL
Match: A0A0D2T2K4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G083400 PE=3 SV=1)

HSP 1 Score: 649.8 bits (1675), Expect = 2.9e-183
Identity = 322/525 (61.33%), Positives = 382/525 (72.76%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M +RN+YRKSTA  C+ GSRC IS+V+ SL+G +L+L + S +S +  +   I+LR SRH
Sbjct: 1   MRVRNKYRKSTAFPCNVGSRCSISIVVWSLVGFLLMLQIYSLISHRNTVSGDIKLRMSRH 60

Query: 73  LHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTS 132
              RELE+VEEENIQIPPPR KRSPRAAKRRPK+T TL+DEFLDE+SQ+RH FFPD KT+
Sbjct: 61  PLVRELEQVEEENIQIPPPRGKRSPRAAKRRPKRTTTLVDEFLDENSQIRHVFFPDMKTA 120

Query: 133 VDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKK 192
           +DP                               L D  ++++Y+Y              
Sbjct: 121 IDP-------------------------------LKDAGNDSFYYY-------------- 180

Query: 193 GAARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFC 252
              R+ +  +G P+        A    + + E   T +     +  P+        ++  
Sbjct: 181 -PGRIWLDTEGNPIQ-------AHGGGMIYDERSSTYYWYGEYKDGPT----YHAHKKGA 240

Query: 253 RNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMW 312
             V + GVGCYSSKDLWTWKNEGIVLTAEE+NETHDLHKSNVLERPKVIYN  T KYVMW
Sbjct: 241 ARVDIIGVGCYSSKDLWTWKNEGIVLTAEESNETHDLHKSNVLERPKVIYNENTGKYVMW 300

Query: 313 MHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDN 372
           MHIDDANYTKA+VG+AVSDYPTGPFDYL S+RPHG++SRDMT+FKD+DG AYL YSSEDN
Sbjct: 301 MHIDDANYTKAAVGIAVSDYPTGPFDYLGSQRPHGYESRDMTVFKDEDGVAYLIYSSEDN 360

Query: 373 SELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHAS 432
           SELHIGPL++DYLDV    +RILVGQHREAPALFK +GTYYMITSGCTGWAPNEALAHA+
Sbjct: 361 SELHIGPLTKDYLDVKPDIRRILVGQHREAPALFKYRGTYYMITSGCTGWAPNEALAHAA 420

Query: 433 ESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRY 492
           +SIMGPWET+GNPCIGGNK+FRLATFFSQSTFV+ LP  PG +IFMADRWNPADL DSRY
Sbjct: 421 DSIMGPWETMGNPCIGGNKMFRLATFFSQSTFVIPLPGIPGSYIFMADRWNPADLSDSRY 468

Query: 493 IWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           +WLPL+VGG  D+P ++NFGFPLW RVSIYWHRKWRLP  W   K
Sbjct: 481 VWLPLIVGGPADRPFEFNFGFPLWPRVSIYWHRKWRLPSSWRVTK 468

BLAST of CmaCh20G010190.1 vs. TrEMBL
Match: V4SQH2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025552mg PE=3 SV=1)

HSP 1 Score: 649.4 bits (1674), Expect = 3.7e-183
Identity = 325/522 (62.26%), Positives = 376/522 (72.03%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M MRN+Y+K T   C+AGS+C +S+++  L G +LLLH  S VS K      I+L  S +
Sbjct: 1   MRMRNKYKKPTTFPCNAGSKCSVSIILWILAGFLLLLHFFSLVSHKDGTSGEIELHISHN 60

Query: 73  LHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTS 132
             FREL EVEEENIQIPPPR KRSPRAAKRRPK+T TLIDEFLDE+SQLRH FFPD KT+
Sbjct: 61  PSFRELVEVEEENIQIPPPRGKRSPRAAKRRPKRTTTLIDEFLDENSQLRHVFFPDMKTA 120

Query: 133 VDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKK 192
           +DPM                                D  ++++Y+Y              
Sbjct: 121 IDPMK-------------------------------DNGNDSFYYY-------------- 180

Query: 193 GAARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFC 252
              R+ +  +G P+        A    I + E   T F     +  P+  +  +   R  
Sbjct: 181 -PGRIWLDTEGAPIQ-------AHGGGILYDERSRTYFWYGEYKDGPTYHAHKKAAAR-- 240

Query: 253 RNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMW 312
             V + GVGCYSSKD+WTWKNEGIVL AEETNETHDL+K NVLERPKVIYN RT KYVMW
Sbjct: 241 --VDIIGVGCYSSKDMWTWKNEGIVLAAEETNETHDLYKLNVLERPKVIYNDRTGKYVMW 300

Query: 313 MHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDN 372
           MHIDD NYTKA+VGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDG AYL YSSEDN
Sbjct: 301 MHIDDCNYTKAAVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGVAYLVYSSEDN 360

Query: 373 SELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHAS 432
           SELHIGPL+ DYLDV+NV +RILVGQHREAPALFK  GTYYM+TSGCTGWAPNEAL HA+
Sbjct: 361 SELHIGPLTSDYLDVSNVVRRILVGQHREAPALFKHLGTYYMVTSGCTGWAPNEALVHAA 420

Query: 433 ESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRY 492
           ESIMGPWE +GNPCIGGNK+FRL TFF+QST+V+ L   PGL+IFMADRWNPADLR+SRY
Sbjct: 421 ESIMGPWEAMGNPCIGGNKVFRLTTFFAQSTYVIPLAGLPGLYIFMADRWNPADLRESRY 465

Query: 493 IWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWN 534
           IWLPL+V G  D+PL+YNFGFPLWSRVSIYWH+KWRLP  W+
Sbjct: 481 IWLPLIVRGPADRPLEYNFGFPLWSRVSIYWHKKWRLPSRWS 465

BLAST of CmaCh20G010190.1 vs. TAIR10
Match: AT3G49880.1 (AT3G49880.1 glycosyl hydrolase family protein 43)

HSP 1 Score: 515.0 bits (1325), Expect = 5.6e-146
Identity = 231/290 (79.66%), Positives = 259/290 (89.31%), Query Frame = 1

Query: 240 GPSFLRIMRRFCRNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKV 299
           GP++L   +   R V + GVGCYSSKDLWTWKNEG+VL AEET+ETHDLHKSNVLERPKV
Sbjct: 170 GPTYLSHKKGAAR-VDIIGVGCYSSKDLWTWKNEGVVLAAEETDETHDLHKSNVLERPKV 229

Query: 300 IYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDD 359
           IYNS T KYVMWMHIDDANYTKASVGVA+SD PTGPFDYLYS+ PHGFDSRDMT++KDDD
Sbjct: 230 IYNSDTGKYVMWMHIDDANYTKASVGVAISDNPTGPFDYLYSRSPHGFDSRDMTVYKDDD 289

Query: 360 GTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCT 419
             AYL YSSEDNS LHIGPL+E+YLDV  V KRI+VGQHREAPA+FK Q TYYMITSGCT
Sbjct: 290 NVAYLIYSSEDNSVLHIGPLTENYLDVKPVMKRIMVGQHREAPAIFKHQNTYYMITSGCT 349

Query: 420 GWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMAD 479
           GWAPNEALAHA+ESIMGPWETLGNPC+GGN +FR  TFF+QSTFV+ LP  PG+FIFMAD
Sbjct: 350 GWAPNEALAHAAESIMGPWETLGNPCVGGNSIFRSTTFFAQSTFVIPLPGVPGVFIFMAD 409

Query: 480 RWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 530
           RWNPADLRDSRY+WLPL+VGG  D+PL+Y+FGFP+WSRVS+YWHR+WRLP
Sbjct: 410 RWNPADLRDSRYLWLPLIVGGPADRPLEYSFGFPMWSRVSVYWHRQWRLP 458

BLAST of CmaCh20G010190.1 vs. TAIR10
Match: AT5G67540.2 (AT5G67540.2 Arabinanase/levansucrase/invertase)

HSP 1 Score: 510.8 bits (1314), Expect = 1.0e-144
Identity = 228/276 (82.61%), Positives = 250/276 (90.58%), Query Frame = 1

Query: 254 VTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWMH 313
           V + GVGCYSSKDLWTWKNEGIVL AEETN+THDLHKSNVLERPKVIYN +T KYVMWMH
Sbjct: 196 VDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTEKYVMWMH 255

Query: 314 IDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNSE 373
           IDDANYTKASVGVA+S+ PTGPF+YLYSKRPHGFDSRDMT+FKDDDG AYL YSSE NS 
Sbjct: 256 IDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIYSSEVNSV 315

Query: 374 LHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASES 433
           LHIGPL+EDYLDVT V KR++VGQHREAPA+FK Q  YYM+TS CTGWAPNEALAHA+ES
Sbjct: 316 LHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEALAHAAES 375

Query: 434 IMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYIW 493
           IMGPWE LGNPCIGGNK+FRL TFF+QST+V+ LP  PG FIFMADRWNPADLRDSRY+W
Sbjct: 376 IMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADLRDSRYVW 435

Query: 494 LPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 530
           LPL++GG  DQPL++NFGFP WSRVSIYWH KWRLP
Sbjct: 436 LPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 471

BLAST of CmaCh20G010190.1 vs. NCBI nr
Match: gi|764516698|ref|XP_011466090.1| (PREDICTED: uncharacterized protein LOC101313840 isoform X1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 667.2 bits (1720), Expect = 2.5e-188
Identity = 328/522 (62.84%), Positives = 380/522 (72.80%), Query Frame = 1

Query: 15  MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRHLH 74
           MRN+YRK T  RC AGSRC IS V+ SL+GC+L+ HL S V  K  +GR IQ R S H  
Sbjct: 1   MRNKYRKPTTFRCYAGSRCSISAVVWSLVGCLLMFHLYSLVHHKDGMGREIQFRASVHPQ 60

Query: 75  FRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSVDP 134
             ELE+VEEE+I++PPPRKRSPRAAKR+PK+  T+IDEFLDE+SQ+RH FFPD K ++DP
Sbjct: 61  LHELEKVEEESIRMPPPRKRSPRAAKRKPKRPTTIIDEFLDENSQIRHVFFPDQKLAIDP 120

Query: 135 MIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKGAA 194
                                          L D  +++YY+Y                 
Sbjct: 121 -------------------------------LKDAGNDSYYYY---------------PG 180

Query: 195 RVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFCRNV 254
           R+ +  +  P+        A    I + E   T +     +  P+        ++    V
Sbjct: 181 RIWLDTEENPIQ-------AHGGGILYDEKSGTYYWYGEYKDGPT----YHAHKKGAARV 240

Query: 255 TLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWMHI 314
            + GVGCYSSKDLW W NEGIVL AE+TNETHDLH+ NVLERPKVIYN +T KYVMWMHI
Sbjct: 241 DILGVGCYSSKDLWKWNNEGIVLAAEKTNETHDLHELNVLERPKVIYNHKTAKYVMWMHI 300

Query: 315 DDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNSEL 374
           DD NYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMT+FKDDDG AYL YSS+DNSEL
Sbjct: 301 DDVNYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTVFKDDDGIAYLIYSSDDNSEL 360

Query: 375 HIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASESI 434
           HIGPL+EDYLDVTN+ +RILVGQHREAPALFK  GTYYMITSGCTGWAPNEA+AHA+ESI
Sbjct: 361 HIGPLTEDYLDVTNIVRRILVGQHREAPALFKHDGTYYMITSGCTGWAPNEAMAHAAESI 420

Query: 435 MGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYIWL 494
           MGPWET+GNPCIGGNK+ RLATFF+QSTFV+ LP  PG FIFMADRWNPADLRDSRY+WL
Sbjct: 421 MGPWETMGNPCIGGNKMSRLATFFAQSTFVIPLPGFPGSFIFMADRWNPADLRDSRYVWL 465

Query: 495 PLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           PL+VGG VD+PLDYNFGFPLWSRVSIYWHRKW+LPQGW+  K
Sbjct: 481 PLIVGGPVDRPLDYNFGFPLWSRVSIYWHRKWKLPQGWSGWK 465

BLAST of CmaCh20G010190.1 vs. NCBI nr
Match: gi|747102428|ref|XP_011099379.1| (PREDICTED: uncharacterized protein LOC105177822 isoform X1 [Sesamum indicum])

HSP 1 Score: 665.2 bits (1715), Expect = 9.4e-188
Identity = 332/536 (61.94%), Positives = 385/536 (71.83%), Query Frame = 1

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60
           M H L D+K  KM MRN+YRK T L C+AGSRC  S ++ SL+  +L+LHL + +S    
Sbjct: 1   MQHCLEDRKGIKMRMRNKYRKPTTLHCNAGSRCSTSTLVWSLVVVLLMLHLYTLISHTDV 60

Query: 61  IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
             + I    S  +  RELEEVEEENIQ+PPPRKRSPRAAKR+P++  TLIDEFLDE SQ+
Sbjct: 61  QSKEIHRDMSHRILLRELEEVEEENIQMPPPRKRSPRAAKRKPRRPTTLIDEFLDESSQI 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RH FFP  KT+VDPM+                               D  ++++Y+Y   
Sbjct: 121 RHVFFPTIKTAVDPMV-------------------------------DAGNDSFYYY--- 180

Query: 181 KDGPTYHAHKKGAARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSG 240
                         R+ +  +G P+        A    I + E   T +     +  P+ 
Sbjct: 181 ------------PGRIWLDTEGNPIQ-------AHGGGILYDEKSRTYYWYGEYKDGPT- 240

Query: 241 PSFLRIMRRFCRNVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVI 300
                  ++    V + GVGCYSSKDLWTWKNEGIVL AEE NETHDLHKSNVLERPKVI
Sbjct: 241 ---YHAHKKGAARVDVIGVGCYSSKDLWTWKNEGIVLAAEERNETHDLHKSNVLERPKVI 300

Query: 301 YNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDG 360
           YN RT KYVMWMHIDDANYTKAS+GVA+SD PTGPFDYLYSKRPHGF+SRDMTIFKDDDG
Sbjct: 301 YNDRTGKYVMWMHIDDANYTKASIGVAISDSPTGPFDYLYSKRPHGFESRDMTIFKDDDG 360

Query: 361 TAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTG 420
            AYL YSSEDN+ELHIGPL E+YLDVT+VA+RILVGQHREAPALFK +GTYYMITSGCTG
Sbjct: 361 VAYLVYSSEDNTELHIGPLDENYLDVTHVARRILVGQHREAPALFKHEGTYYMITSGCTG 420

Query: 421 WAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADR 480
           WAPNEALAHA+ESIMGPWET+GNPCIGGNK+FRL TFF+QSTFVL LP  PGLFIFMADR
Sbjct: 421 WAPNEALAHAAESIMGPWETMGNPCIGGNKVFRLTTFFAQSTFVLPLPGLPGLFIFMADR 479

Query: 481 WNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           WNPADLRDSRY+WLPL  GG  DQPLDY+FGFPLWSRVSIYWH++WRLP  W+ +K
Sbjct: 481 WNPADLRDSRYVWLPLTAGGAADQPLDYSFGFPLWSRVSIYWHKRWRLPGEWSGMK 479

BLAST of CmaCh20G010190.1 vs. NCBI nr
Match: gi|645219903|ref|XP_008237861.1| (PREDICTED: uncharacterized protein LOC103336579 isoform X1 [Prunus mume])

HSP 1 Score: 663.7 bits (1711), Expect = 2.7e-187
Identity = 325/524 (62.02%), Positives = 378/524 (72.14%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M MRN+YRK T   C+AGSRC  S V+ SL+GC L+  L S V +   +G  +Q R++ H
Sbjct: 1   MRMRNKYRKPTTFHCNAGSRCSTSAVVWSLVGCFLMFQLYSLVHQNDRMGGEMQFRSTHH 60

Query: 73  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 132
             F ELEEVEEENIQIPPPRKRSPRAAKR+P++  TLIDEFLDE+SQ+RH FFP  K  +
Sbjct: 61  PQFHELEEVEEENIQIPPPRKRSPRAAKRKPRRPTTLIDEFLDENSQIRHVFFPGQKHVI 120

Query: 133 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 192
           DPM                                D  +++YY+Y               
Sbjct: 121 DPMK-------------------------------DTGNDSYYYY--------------- 180

Query: 193 AARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFCR 252
             R+ +   G P+        A    I + + L T +     +  P+        ++   
Sbjct: 181 PGRIWLDTDGNPIQ-------AHGGGILYDDKLRTYYWYGEYKDGPT----YHAHKKGAA 240

Query: 253 NVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWM 312
            V + GVGCYSS+DLW WKNEGIVL AE+TNETHDLH+ NVLERPKVIYN RT KYVMWM
Sbjct: 241 RVDIIGVGCYSSRDLWKWKNEGIVLAAEKTNETHDLHELNVLERPKVIYNERTGKYVMWM 300

Query: 313 HIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNS 372
           HIDD NYTKA+VG+A+SDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDG AYL YSSEDNS
Sbjct: 301 HIDDVNYTKAAVGIAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGVAYLIYSSEDNS 360

Query: 373 ELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASE 432
           ELHIGPL+EDYLDVTN+ +R+LVGQHREAPALFK +GTYYMITSGCTGWAPNEALAHA+E
Sbjct: 361 ELHIGPLTEDYLDVTNIVRRVLVGQHREAPALFKYEGTYYMITSGCTGWAPNEALAHAAE 420

Query: 433 SIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYI 492
           SIMGPWETLGNPC GGNK+ RL TFF+QSTFV+ +P+ PG FIFMADRWNPADLRDSRY+
Sbjct: 421 SIMGPWETLGNPCAGGNKVSRLTTFFAQSTFVVPVPAFPGSFIFMADRWNPADLRDSRYV 467

Query: 493 WLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           WLPL+VGG  D+PLDYNFGFPLWSRVSIYWHRKWRLP+GW+  K
Sbjct: 481 WLPLIVGGPADRPLDYNFGFPLWSRVSIYWHRKWRLPRGWSSSK 467

BLAST of CmaCh20G010190.1 vs. NCBI nr
Match: gi|657949096|ref|XP_008341189.1| (PREDICTED: uncharacterized protein LOC103404103 isoform X2 [Malus domestica])

HSP 1 Score: 659.4 bits (1700), Expect = 5.2e-186
Identity = 325/524 (62.02%), Positives = 376/524 (71.76%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M +RN+YRK T  RC+AGSRC  S V+ SL+GC+L+ HL + V +K  +G  IQ R S H
Sbjct: 1   MRIRNKYRKPTTFRCNAGSRCSTSAVVWSLVGCLLMFHLYTLVRQKDRLGGAIQFRASHH 60

Query: 73  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 132
             F+ELE+VEEENIQ+PPPRKRSPRA KR+P++  TLIDEFLDE+SQ+RH FFP  K  +
Sbjct: 61  PLFQELEQVEEENIQLPPPRKRSPRAEKRKPRRPTTLIDEFLDENSQIRHVFFPQ-KLDI 120

Query: 133 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 192
           DPM                                D  +++YY+Y               
Sbjct: 121 DPMK-------------------------------DTGNDSYYYY--------------- 180

Query: 193 AARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFCR 252
             R+ +   G P+        A    I F E   T +     +  P+        ++   
Sbjct: 181 PGRIWLDTDGYPIQ-------AHGGGILFNEKSRTYYWYGEYKDGPT----YHAHKKGAA 240

Query: 253 NVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWM 312
            V + GVGCYSS DLW WKNEG+VL AE+ NETHDLH+SNVLERPKVIYN  T KYVMWM
Sbjct: 241 RVDIIGVGCYSSNDLWKWKNEGVVLAAEKANETHDLHESNVLERPKVIYNEHTGKYVMWM 300

Query: 313 HIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNS 372
           HIDDANYTKASVGVA+SDYPTGPFDYLYS+RPHGF+SRDMTIFKDDDG AYL YSSEDNS
Sbjct: 301 HIDDANYTKASVGVAISDYPTGPFDYLYSQRPHGFESRDMTIFKDDDGVAYLIYSSEDNS 360

Query: 373 ELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASE 432
           ELHIGPL+EDYLDVTN  +RILVGQHREAPALFK +GTYYMITSGCTGWAPNEAL HA+E
Sbjct: 361 ELHIGPLTEDYLDVTNTVRRILVGQHREAPALFKHEGTYYMITSGCTGWAPNEALVHAAE 420

Query: 433 SIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYI 492
           SIMGPWET+GNPC GGNK+ RLATFF+QSTFVL +P  PG FIFMADRWNPADLRDSRY+
Sbjct: 421 SIMGPWETMGNPCAGGNKVSRLATFFAQSTFVLPVPGFPGAFIFMADRWNPADLRDSRYV 466

Query: 493 WLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           WLPL+VGG  D+P DYNFGFPLWSRVSIYWHRKW+LPQGW+  K
Sbjct: 481 WLPLIVGGPADRPFDYNFGFPLWSRVSIYWHRKWKLPQGWSGSK 466

BLAST of CmaCh20G010190.1 vs. NCBI nr
Match: gi|747102432|ref|XP_011099381.1| (PREDICTED: uncharacterized protein LOC105177822 isoform X3 [Sesamum indicum])

HSP 1 Score: 656.4 bits (1692), Expect = 4.4e-185
Identity = 326/524 (62.21%), Positives = 378/524 (72.14%), Query Frame = 1

Query: 13  MNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYEIGRGIQLRTSRH 72
           M MRN+YRK T L C+AGSRC  S ++ SL+  +L+LHL + +S      + I    S  
Sbjct: 1   MRMRNKYRKPTTLHCNAGSRCSTSTLVWSLVVVLLMLHLYTLISHTDVQSKEIHRDMSHR 60

Query: 73  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 132
           +  RELEEVEEENIQ+PPPRKRSPRAAKR+P++  TLIDEFLDE SQ+RH FFP  KT+V
Sbjct: 61  ILLRELEEVEEENIQMPPPRKRSPRAAKRKPRRPTTLIDEFLDESSQIRHVFFPTIKTAV 120

Query: 133 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 192
           DPM+                               D  ++++Y+Y               
Sbjct: 121 DPMV-------------------------------DAGNDSFYYY--------------- 180

Query: 193 AARVRIIGKGTPVSFCVLMRPALPSSINFIEFLLTSFDSKNNRTAPSGPSFLRIMRRFCR 252
             R+ +  +G P+        A    I + E   T +     +  P+        ++   
Sbjct: 181 PGRIWLDTEGNPIQ-------AHGGGILYDEKSRTYYWYGEYKDGPT----YHAHKKGAA 240

Query: 253 NVTLKGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWM 312
            V + GVGCYSSKDLWTWKNEGIVL AEE NETHDLHKSNVLERPKVIYN RT KYVMWM
Sbjct: 241 RVDVIGVGCYSSKDLWTWKNEGIVLAAEERNETHDLHKSNVLERPKVIYNDRTGKYVMWM 300

Query: 313 HIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLAYSSEDNS 372
           HIDDANYTKAS+GVA+SD PTGPFDYLYSKRPHGF+SRDMTIFKDDDG AYL YSSEDN+
Sbjct: 301 HIDDANYTKASIGVAISDSPTGPFDYLYSKRPHGFESRDMTIFKDDDGVAYLVYSSEDNT 360

Query: 373 ELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGCTGWAPNEALAHASE 432
           ELHIGPL E+YLDVT+VA+RILVGQHREAPALFK +GTYYMITSGCTGWAPNEALAHA+E
Sbjct: 361 ELHIGPLDENYLDVTHVARRILVGQHREAPALFKHEGTYYMITSGCTGWAPNEALAHAAE 420

Query: 433 SIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMADRWNPADLRDSRYI 492
           SIMGPWET+GNPCIGGNK+FRL TFF+QSTFVL LP  PGLFIFMADRWNPADLRDSRY+
Sbjct: 421 SIMGPWETMGNPCIGGNKVFRLTTFFAQSTFVLPLPGLPGLFIFMADRWNPADLRDSRYV 467

Query: 493 WLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 537
           WLPL  GG  DQPLDY+FGFPLWSRVSIYWH++WRLP  W+ +K
Sbjct: 481 WLPLTAGGAADQPLDYSFGFPLWSRVSIYWHKRWRLPGEWSGMK 467

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A067GGY3_CITSI | 4.0e-185 | 62.08 | Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011114mg PE=3 SV=1
A0A061DHP7_THECC | 1.2e-184 | 63.15 | Glycosyl hydrolase family protein 43 isoform 2 OS=Theobroma cacao GN=TCM_000802 ...
M5W5C1_PRUPE | 1.5e-184 | 61.07 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005306mg PE=3 SV=1
A0A0D2T2K4_GOSRA | 2.9e-183 | 61.33 | Uncharacterized protein OS=Gossypium raimondii GN=B456_008G083400 PE=3 SV=1
V4SQH2_9ROSI | 3.7e-183 | 62.26 | Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025552mg PE=3 SV=1

Match Name | E-value | Identity (%) | Description
AT3G49880.1 | 5.6e-146 | 79.66 | glycosyl hydrolase family protein 43
AT5G67540.2 | 1.0e-144 | 82.61 | Arabinanase/levansucrase/invertase

Match Name | E-value | Identity (%) | Description
gi|764516698|ref|XP_011466090.1| | 2.5e-188 | 62.84 | PREDICTED: uncharacterized protein LOC101313840 isoform X1 [Fragaria vesca subsp...
gi|747102428|ref|XP_011099379.1| | 9.4e-188 | 61.94 | PREDICTED: uncharacterized protein LOC105177822 isoform X1 [Sesamum indicum]
gi|645219903|ref|XP_008237861.1| | 2.7e-187 | 62.02 | PREDICTED: uncharacterized protein LOC103336579 isoform X1 [Prunus mume]
gi|657949096|ref|XP_008341189.1| | 5.2e-186 | 62.02 | PREDICTED: uncharacterized protein LOC103404103 isoform X2 [Malus domestica]
gi|747102432|ref|XP_011099381.1| | 4.4e-185 | 62.21 | PREDICTED: uncharacterized protein LOC105177822 isoform X3 [Sesamum indicum]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term | Definition
IPR006710 | Glyco_hydro_43
IPR023296 | Glyco_hydro_beta-prop_sf
Vocabulary: Molecular Function
Term | Definition
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
Term | Definition
GO:0005975 | carbohydrate metabolic process
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0005975 | carbohydrate metabolic process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmaCh20G010190 | CmaCh20G010190 | gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
CmaCh20G010190.1 | CmaCh20G010190.1-protein | polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmaCh20G010190.1.CDS.4 | CmaCh20G010190.1.CDS.4 | CDS
CmaCh20G010190.1.CDS.3 | CmaCh20G010190.1.CDS.3 | CDS
CmaCh20G010190.1.CDS.2 | CmaCh20G010190.1.CDS.2 | CDS
CmaCh20G010190.1.CDS.1 | CmaCh20G010190.1.CDS.1 | CDS


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmaCh20G010190.1.exon.4 | CmaCh20G010190.1.exon.4 | exon
CmaCh20G010190.1.exon.3 | CmaCh20G010190.1.exon.3 | exon
CmaCh20G010190.1.exon.2 | CmaCh20G010190.1.exon.2 | exon
CmaCh20G010190.1.exon.1 | CmaCh20G010190.1.exon.1 | exon


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006710 | Glycoside hydrolase, family 43 | PANTHER | PTHR22925 | GLYCOSYL HYDROLASE 43 FAMILY MEMBER | coord: 10..200, score: 5.6E-259; coord: 259..532, score: 5.6E
IPR006710 | Glycoside hydrolase, family 43 | PFAM | PF04616 | Glyco_hydro_43 | coord: 256..440, score: 2.9
IPR023296 | Glycosyl hydrolase, five-bladed beta-propellor domain | GENE3D | G3DSA:2.115.10.20 | | coord: 255..476, score: 1.4
IPR023296 | Glycosyl hydrolase, five-bladed beta-propellor domain | unknown | SSF75005 | Arabinanase/levansucrase/invertase | coord: 256..496, score: 5.89
None | No IPR available | PANTHER | PTHR22925:SF32 | SUBFAMILY NOT NAMED | coord: 10..200, score: 5.6E-259; coord: 259..532, score: 5.6E