MC00g0145 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC00g0145
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Polyglutamine tract-binding protein 1
Location: scaffold15: 350372 .. 376872 (-)
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAGGGAAGAAATAAAGATTCGGAAGAAATTTGTTTTTTTTGAATAATAAGAATATTTTAGTAATTTTATAAAGTAAAAAAATGAAAATACGTCAAAAAAAAAAAAAAAGAAAGAAAGAAAGAGAGGATCAAGAACGAGAGATTTTTGGCTAGTCATCGAAATTCAATTAAGAGGAACAAAAATCCCAATTTCTGAAAAATAAGGGGAGGTGGGTCACTAATTAGATTTTTTCGGAAGGAATCCAAAACTGGTTCTCGAAGGGAATCCCAATTTCCCAACAATCTCGGCATTTCATCGCTCGCTCTCTCTGGCACCCCATTTTGCACCATTCCGACCAAAAGCTTCTCTCTGCATATCGTCGAATTCGCCATATCCAAATGCTCCTCAAATTCCCATAATCAGTTGGTCAAGTGCAGCACAACTTCCAAGACTGAAACCCTAATCCAGAGGATGCCGACCTCAAATGCAGCAGCTGCAGTTTCCGGAGACTCGTTCATAACTACAATTGGGTCCAGCGTTGAAGATAGACCCCTCAAGGAATCGGGCGCCGCTCAATCTCAATCTTACGCCCAGAATGAAGTGCAAGAACTTGTAAAGTCTGGCAAACAAGACTCCTCTAGCCAACCGGGAGAAGCGCAGAGTTCTGTAGCAGTGTCTTCCGATCAAGATACCTTTGTTGAGCAGCAACTTGGGAAGAGCACTGCAACTGTGGATGAACTGCTCGTGCAGGGGAATGAAAAGTTCCAGGAGACAGAACCAAGTCGTGTAAATGATCAGAACAATGTTCCCCATGACGGCGTGTTTAAAATTGCTTGCTCGTCTTCTAGCAAATTCGGATCGCATGTTGGCGATACCAGGGACATTGACAGTGCTGTTCAGGATGCAGTGTTGAGGGAACAGGTATGCATGTTCAATATATTTCAGTTCTTCAAATCACTCATAATTTTGTATAGCTCTTGTTGTCTGCTTCACCTTTTATGGAAGACAACGAGATAAATTATTTTAGTTTTTCAAATGCTGTAATTAGTTTTTTTCCTAACAGGAACTTGCAACCCAAAATATTATTCGCAGCCAAAGGTGATAGTTCTTCATTTGTTTTTGAGCTGTTACAGTTCAAAATTAATAATGGAAACTGAATATATTTCCTTTTCCCAGGGAGTCCTTGGGTGCAGATGGACCTCCTAGCGAAAGATCAGATATCTTTTCAGAACGTTATGACCCAAGTACTCTTAAAGTATGTGATTTTCTACCTTCATTTCCATTCTTAAATTTTAATTAGCATGTTAAGGTTATCCATAAACTCAGTTGTATGTCATCATTACCACCACGGTCTCTACTTTTTACTTTCTTGGGAAACAAATGTCTTGTATTGAGTGTATGAATGTACAAAAAGGTGGACTAGGGTGGTGCATAACAATGAAAAGTTCGTAGCCCATTTATATAAGAACTTGGAGAAAATTATATATTGGATTAAATCAAACTGATGTACCATTACTAATTTGAATGCCTTTGAGCTCATCAAATTTGTCATTTTAGGTGCATATTTTTTGACCCATTGATTTAAAAATTTTTCGATGTGGTCTCCCCATTTCCTGCTTTCTTTTGAAACAAAATCCCATGTTTTCTCATCTGTGATTCAATTGTTGATCAAGAGTAACTAACCAATGTAATAGCTAGTCGAAACCTATTAACCAACCAAGGCTACATATGAGGTCCCATCCAACAAAATGATTAATGTCAAAATAGAAATAAGCAGGAAAAAGAAAGGAACACTGCAAAATATTACAGATTCGTAGAATTTGTTGCCCAAAGAGCAATAGTAGTCCAAAGAGAGCAACAAATACATGCTTTTTGCCAAAAATTGTCCAAAGTGGTCCTCCAATCTTGTTTCTTCTCCCTCGCTCTCCCTCTCTTCTAGCCAACAAGATATCCAACAGACGAGAAAAAGTTGTATTAGTTCAGAGTGTCATGGCCTTATTGAGATAGAACCTCAATCATT
AGCAACCAAAGGTTCTCAACATGGTTGGAATATGAAGTTTCCAATAAGTTTTCCTAGTGAATAGAAATTATTAACAGTGAATAGAAAGATTCTTAGTAGAAAAAAAGATCAGTGAGGAGAAAGATTACTCAAATCCTCTGTAATTATCTCACATCAGTCTTGGTCCTAATTACCTCACGGAAAATACATCTTTAAGAATTGAGAGCCATGAATGGATATCTTCTTTATCTTTGCACTTTTTCATAATTGTTGATATTGCATGACACCGTTGTCCGCACAAAGACTTATTAATTGTTATCAGGTTTCAGCCTTGTGACTTTAATGTTTTTTTCTTCATGAATATTATTTCAGGAGCATCTTTTGAAGATTACTTCTGATCATCGTGCTGAAATGGCTATGAAAAGAGGAAAGTCAAACCTTCCAGAAGAAGGTTGGTACTTTGTTATGTGGCTTCCTTGTCCTATTGAAGCTTTTGTCCTTCACTTGCTTCTGCAGTGGCTCAATGATAATGATGTTTTTGTACATTTTGGTAAAGATATTAATTGTTGAATGAGAAGAGTTTATATTGGTTGCACAGAGTTAGAGTTATTGATTATAGAAGTGCTTAGGTCTTTGAGAGAAAGAATTGAACAGGAAGAATTGAACTTTGGGGACAGTTTATCAGAATTAATTGATGCACATTGTATTTTCTTGACGAGAGGAAACGTTAACTGTGAAATGGCATCTATGTTCCATCAAAATTTGCTTCTTTTGTTCCATGCGTCTGATTTTAGAATGTGATGGTGATTCCATTGAGTTAGATTATTCTAATAATTATCATCATCGTTATAACTATTTTTCATTCTCTTTTCTTAAACTAGAGTCAATTTTGAGAAGAAATAAAATTATTTATCTTCTTTGAAGGTAAAATGTTAGGTCACCTTAAGTAGAGGTGAGACACGTTACTAAATAAGGAAACTAGACAAAAATCTACTCATTACCAAATCAGCCTTGGCCAAAACAACCGTCCAAAATTTACGACGGTCCTTGGGCTGGCATATCTAAGTGCATTAACAACATAAAGAATAGAGTCCATTGTAAAATTGGCAATGGTAGATTGGCTAGCTTCTGGTACGACAGATGGACCTCTAACACCCCCTTGGTTGAGCTTTTCCTCCGTTTCCCCTCGCTCTCTACAAAGAAAGATGCTAAGGTTAGCGATATGTGGGATCAACCCACAGCTTCATGGAATCACCATTTCCGCAGATCGCTCAAAGATGAAGAACTGGAAGAATGGATTGACCTTGAGCACAAAATCTCGGCAATTCATCTCTCTCACAGGGAAGACAAATGGATTTGGCCTATGGACCCCACTCAAACCTATACGGTTAAATCCCTCTTAAAGCACCTCACAGCTGTGCTGTCTTTTGAAATTATTTGGGCAGAAAATACCCCAAAAAAGTAAAATTCTTTTGTTGGGAGCTTAGTCATGAAGCTATCAACACCCAAGATAGGTTACAGAGGTGTCTTCCAATGATGGCTTTATCTCCAAATGGGTGCATAATGTGCCTCAAGGAAGCAGAAACCCAAAACCATCTCATTGTCACTTGCGAATTTGCTCACAGCTTTTGGAGTCTCATCATTGAAACTTTCGATTGGCAAATCCCTCTTCATGCAAATATAAACTCTCTGTTACAAGTAACTTTGGGGGGACATCCTTTAAAAAAAAAACAAGATGATTATGTGGTCCCTATTTATCAGAGCTTTTCTTTGGAATCTTTGGAGAGAGGAATAGCAGAAGTTTCAACAAGTCCCAGCCGTTTGCCAACATTTTTGATGGGGTTTTGGTTACAGCTCTTAGCTGGTGTAAAAGCTATTTTCCTTCTTTCTCTAGCTATAGTCTAAATGACCTTTTAAGGAATTGGAGAGATCTTTTGTAACTCCTTTCCTTTTTGTATATTTCATCAATGAAATATGTGTTCCTCATTAAAAAAAAAAATATATAAAAGGAAAAACAATTA
ATTTTATTTTATTTTTTTGATAAGAAACAAAACTTTCATTATCAAAAGAGAGTAGTACAAAAAGAGTGGGAGATGAGGTATCCCCACATGCCAAAGAAGGTTAAAAAAAGCCTCCCAATTGGCAAAGATTGTGTCACACCATAGCTACAAAACAATCTAGATAACGAGGTCCACGAAGAGGCAAAAATTTTGACAAGGTCCCAAAAAGCATGCACGTTCATTTCTTTGTCATGAAAAACCCGTTGATTCCTCTCGTGCCATATTCTCCAAAGGATACTCGCCACTGCATTTGACCATAAAGTGAATGCTTGACCCTTGAACTGAGTCCCCCAAAGAAGTTGTCTTAAGGCTTCACCCGCACTCTTAGGAAATACCCAATTAATGTTGAATGTCTCCATGAGCTTCTCCCAGCTATTAGCACTGTAAGGGCACGTAAAGAATAGGTGCATTTGGTCTTCTGCACTCAATTTGCACATACTACACCAATTCGGGGATATTTGCCAGTTTGGACAACTTTTTTGCAATCTAGTACAAGTTTGTAGTCGTTCTCTAGCAAAAATCCACAAGAAGGTCTTCACTTTTTTAGGAATTTCCAGCTTCCAAACAGCACTACTACCAAAGGCATCCAACTCCTGATGCCTTATTAGTTTCGGGACTAGGGATTTAACAGAGAATATACCACTATTCTCAAGCCCCCACGCCATGGAGTCCTTCTCTCTTGAGAGTGTGCAATTTCGCTGGATAACTTCAACCATTCCTCAATTTCCAGGTCCGATAAGTTTCTTCTGAAGCATAAGTTCCAACCTCTTTGCTCAACTAACCAACATTGAGCTATCGTGAATTCTTTAGAGTTAGCCACTCTAAAAAATCTTGGAAAGGCATCTGAGATTGGGGTGTTTGAGGTCTTTCCAAAATTGAATTTTTCCCCTGTCGCCCACCTTGAAATTGTACCATTCTTCTGCCTCTTTACAGCCATGAATTATAGTAACCCAGGGGCCTTTAAATCTAAATCTGCCTTTAGGATGTGGCCACCATTCATGAGGAAGATACCCAAATTTTCTAGCAATAACATTTCTCCACAGCGAGCTTTCCTCCGTTAGGAAGCACCAACTTCACTTAGCCAAAAGAACTGTATTTCTAGCTTCTAGGTTACCTAAACCGATGCCTCCCATGCTTATAGGCAGAGCAGTCACTTTCCATGAAATTGGGTTGGGACCTCCATCCTGTTTTCCTCCTTTCCAAAAGAAATTTCTGATTAAATTCTCCATCCTTGTAATGATTTGTTTTGGGCATTTAAAAATAGAGAAGAAGTGCAGGGGAATACTTGTCAGCACTGAGTTAATCAAAGTTACTCTACCTCTCGAGATGAAAGGATTTCCATCTAGCTAGCTTCCTCTCCATTTTCTCCACTATTAGCTCCCAAAAGGCACATTTACTCTGGATATGGAGATCGCTGTCCCCTACCGTGAGACCTTCCAACCAGCCTTCTCTAGAGCCTTTGTCCAGCATCCTAATTCCTACTCATCACGTTAACCACTAGAACAAACAGGAACGGGGCAAGCGGGTCCCCTTGCCTCAAAGCTCTTTGAGCTTTAAACTTCGCCTTGGCCTTCCATTAATAATAACAAAAAAAGAGGTGTTCGTTATGCATCCTTTTATCCACGAGATCCAATGTTTGAAAAAGCCCACCTAGGCACGCACAGGCGCTAAGCGCAGCCTCCATGCCTCGCCTTGAATAAGCGAGGCGATTTTAATGAAGCACGCACCTCAATTGAAGCCCCGAAGCGCAAAAGCGTGCACTTCTTTAATAAAATATTAAAAAAAAATGAAGCCCTAAAAGGCATAAAATCTTGCGTTTTGGGTTTTTGTTTTTTTAAAACTCAATGCTGCTGTTTGGAGTTGTTAGAGTATAGATATTAGTATATTTTATATTCTTGTATTAATATTAATCTGTCTTTTACCTTCTCCTTATAGGATTAGGTCAACCTTGTATATTTGTC
TATATATATAGACTTTAATAATGAGAATACATCACATTGGATTCTCTCCAACCTTAGTCTCTACATGGTATCAGAGCTCTAGGTTTCTTCTAAAATTTTTGTGCCCATTAGGGTTTTGTGCCCATTAGGGTTTTGATGCCCATTAGGGTTTTGTGCCCATTAGGGTTTGTCCCTTAGGGTTTTGTTCATTGCTATCCAACTCAACTCACTACTGCCGCTGCCGCCGACCGCCGCTGTTTGCTGATTCGCCAGCGCTATTGTTTTCCGATTGCTGTTCATGGGTTTCGTCTCGCCGCCAATCGCCGTCGCCGTGTGCTTTTGTCACTGCAAGTCGCCACCCGTCGCCTTCGCCCCTCTGGTTGTCGTGGGTTAGCACTTAGGGTTTCTGGTTTCAGAAAATACCTAATCCGGTTTGGTTCCAGCAGTTTGATTCATCTGTTTTGTTTGGTTCGGGTGGTTTGGTCTGGTTCTGTCGTTTTGTTCAAATAAGTTCGTTAGTTTGTTATGGTTGAGAAGAAACCAATCGTAACTTCTAAGGTGATCCCAATGATATCAAAAATAACAGAACACAAGTTGAATGAAGCACGAATAGATTATTTGTCTAACAAGTTGGGCATGTTTAATATATATGCACCAACTTGAGGGGGAGTGTTAGAGTATAGATATTAGTATATTTTATATTCTTGTATTAATATTAATTTGTCCTTTACCTTCTCCTTATAGGATTAGGTCAACCTTGTATATTTGTCTATATATATAGACTTTAATAATGAGAATACATCACATTGGATTCTCTCCAACCTTAGTCTCTACAGGAGTAAAAACACAGAGGCAGAAGATGGAAAGATTATATTGTTCTTTTTTTTTTTTGGGAAAAGATCTCAAAAGTCAATAATATTTCATGATTTTTTCCCAACATTTACCCATTTGGTCATTTCTTCTTCTCTTCCTTCACATGCATGAAAAAGAAATTCAGTCTTCAATAAAATAGACCTCCAGCCATAGCAAGTCTATTGTCTCGAGTTTCCTCTCTTGATTTTTCTCTGCAATTTTTCCATTCCATCCCTTAGCAAGAAATCAGACAACAGGCCTCCAATACAACACCAACGTAACCTCTCTTTCTCTCTCCTTTTTTCAATTTTTATTTCTCAAGCAATCTTCTTTCTTCAATTTTTATTAATCAAACAGTAGCTCTCTCCTCAATTTTTATTTCTCAAACAGTAGAGTAGGCTCTCTCCTCTATTTTCATTTCTCAATCGGTAGGCTCTCCCTTTTTTCAAATTCTCCAATCGACCGTCTGCCTCTCTATTTTTTCCTCTCTCTCATTTTATCTCCTTTTCTCTCTCTCAAACAGCTCTGTCTCCCTCTAAAACAGTAAGCTTTTTGTCCCTCTTTTTCTCTTTTTCTCTTTCTAAAACAGAACGTTAGAATTTCATTTTTTTTCTAACCATTGATTACAATTTAGATGTCTAACGAAGGCCCAAGAAAAGATCCGGCATGGACGTATGCTAGTTTGGCAAATCCTCAAGATATGACTACATTTATTTGTAGTTTTTGTTCCAAAATAACAAAAGGAGGAGTTTATAGGGTGAAACATCTTGTTGGCCGTTGTATTTAATACAACGGCATGCAAGAAATGTCCGAATCGTGTGAAGGAAGAAGTTAAGGAACACATGTCAAAGAAAAAGGAGATCAAAGAACAAAGAAATCTTATACTTGATATTGATGACCAACGTTTTGGTATGGATAATGAAGATGAGGATGATTTGATTATGACTAGTTCGAGTGAGAGGTTGTCTTCAAGTCAAAGACCAAATTTAGACCCTAAGAAGCCAAGATAGAAGGTGAATCGAAAGAATGAGAAAGGCAAACAAACTACGCTCAATGAAGCCTACAAAAAGGAAATGAGGGAGCGCACAGTACAAGAATTGTGCGATGGTTTTATGATTCCGGAATTCCTTTAAGTGCTTGCAATTATGAAAGCTTTGCTTATATGCTTG
AAGCAATAGGACAATACGACCTTGGATTAAAAGTACCATCTTATTATGAGCTTAGAGTGCTGTTGTTGAAAAAAAAGAGTTAGTAGTGACACATGAGTTGATGAAGAGTCACAAGAAAGAGTGAGCCAAGATTGGATGCACTATCATGGCTGATGAATGAACGGATGATGGACGGATAGGAGACAGAGGACATTAATTAACTTTTTAGTTAATAGTAAAAAAGGCACAATGTTTATTGAGTCCATTAGATAGGAGAGAGAGGACATTAATTAACTTTTTAGTTAATAGTCAAAAAGGCACCATGTTTATTGAGTCCATTACTGCTTCGTCTTATGTAAAGAATGGAAAGAAAATGTTTGAGCTACTCGATGATTTTGTTGAGCGCATTGGAGAAGCCAATGTCGCACATGTCGTTACTTATAGTGCTTCAGCAAATGTAATGGCGGGTAAGAAATCAAATTCCTTGCTTCTCTTCAAATTAGTCTTATGTTAAAAATTTCAAGTAGCCTAATATGTTTTCTTCTTATGTTATTTTGTTACTAGGGAGATTATTAGAAGCAAAACGACCAAATTTATTTTGGTCGCCACGTGCTCCTCATTGCTTGGACTTGATGTTGGAGGATATATTCAAGATCTCAAATATCCGCAACACATTGAAAAGAGGCATGAAGATCAATAATTTTATTTTTCTTCGCCTAGGATTGTTAAATATGATGAGGCAATTTACGAATCAAAAAGAGTTAGTTAGGCCAGCTAAGACTCGCTTTGCCACAGCTTGCATAACATTATCAAGCATACATCATTAGAAGAATAACTTGAGGAAGGTGTTTACTTCTGATTAGTGGAAGAATAGCAAATGGAGCAAGGAACATCAAGGCAAGCAAGTTGTTCACACAATATTGTATTCCAAAATGATGCCATCAATATTGTCTGAACAACTCGCTTGCCTAGTTATTCTTCTAATGATGTATGCTTGATAATGTTATGCAAGCTGTGGCAAAGCGAGACTTAGCTGACCTAACTAACTCTTTTTGATTCGTAAATCGCCTCATCATATTTAACAATCCTGGTCGAACATAAATAAAACTACTGATCTCCATGCCTCTTTTCAGTGTGTTGCGGATATTTGAGATCTTGAATATGTCCTCCAACATCAAGTCCAAGCAATGCGCAACACATGACAACACATGATGACCAAAATAAATTTGGTCGTTTTGCTTCTAATAATCTCCCTAGTAATAAAATAACATAAAAGAGATCATATTAGGCTCCTTGAAATTTTTAATATAAGACTAATTTGAAGAGAAGCAAGGAATTTGATTTCTCACCTGCCATTACATTTGCTGAAGCACTATCAGTAATGACAGGTGCGACGTTGCCTTTTCCAATGCGCTCAACAAAATCATTGAGTAGCTCAAACATTTTCTTTCCATTCTTTACCTAAGATGAAACATCAATGGACTCAATAAACTTTGTGCCTCTAGGACTATTAACTAGAAAGTTAATTAATGTCCTCTCTCTCTCTTATCTGTCCATCCATCGACCATGATAGTACATCCAACCTTGGCCCACTCTTTTCTCAATTCATGTGCCGTTGATAACTCTTTTTTCAACAACGGCACTCTAAACTCATAATAAGATGGTACTTTTAATCCAGGGCCGAATTGTCCTATTGCTTCAATGAAGCAAAGCTTTCATAATTGCAAGCATTTAAAGGAATAGCGGAATCATAAAACCATCGTGCAATTCTTTGTACTGTGCGCTCCCTCATTGTCTTTTTGTAGGCTCCATTTAGTGTTGTTTGTTTGCCTTTCTCATTCTTTCGATTCAGAACTACTGTCTCTGGATTAGGAGTAAAAAATGCATCGATCGGACCATTCTGTCTTGGCTTCAAAGGGTCTAAATTTGGCCTTTGACTGGAAGACAACCTTTTACTCGACCTAGTTATAATCAAATCATCCTTATCTTCATCATCCATATTGAAATCTTGGTTATCAATA
TCGGGTATAAGATTTCTTCGTTCTTTGATCAAGAGAGGGCTGTTTTAGAGAGAACCTGAGAAAGAAAGTAAAAAATAGAGAGAGCAAGAGCTTACTGTTGGATTGGTTGATTGGAGAATTTGAAAGAAGGGAGAGCTACTGTTTGATTAATAAAAATTGAAAGAAGATTGTTTGAGAAATAGAAATTGAAGAAAGGAGAGAGAAAGAGAGCTTCCGTTGGTGTTGGAGGCCTGCTGTCGAATTGCTTGCTAAGGGATGAAAAAATTGCAGAGGAAAATCGAGAGAGGCAGAGTAGAGCTGCTATGGCTGGAGTTGCTATGTCTGGAGGTTTAGTTTAGTGAAGAAGACTGTATTCTTTTTCGTGCATGTGAAAGAAGAGAAGAAACGCCAAATGGGTAACTGGAGAGAAAGAAAATCGTGAAATATTATTGACTTTTGAGATCCTTTCCAAGAAAAAGACAAAAACAATAAAATATTTCCATCTTCTGCCATCTGTGTTTTACATAAAAACGGCAGCATTGGGGTTTTCAAAAAACAAACCCTAAACACAAGATTTTATGCATTTAGGGCTTTAGTTTTTTATTTTTATATTTTTTATTAAAGAAGCCCACGTTATTGCGTTTCAGGGCTTCATTTGCGCGCCTCATTAAAATCACCTCACCTAGACGCTCCTTTTTAATCACTGTTCCTCGGAGTATTAGACTAGTTTATCGACAAGTCAACTCTTTGATCTTCATTTCTTTGATTTAATTACCGTACGTAGAGGTGAGATGTATTACTAGATAAATAAACTAGGATAATAATCTACTAAGCTGGGAGGCAATGGTTAGTCACTTGTGGTGTGAGGAGTCTCGATCTTGGACTCTTAGATCGAGGAGGAATCTTACAGATGAAGAGGTTGAGGAGTGGTCCACTTTGCTGCAAGTTTTATCCCCACTAAGGCCATCCATGAGACAAGACTATAGAGTTTGGAGTTTAAGTAGTGGTGGGAGTTTTACGGTGAAATCTTTGATGGATAATCTTACGGAGCAGTGTAGTGTGCTTCCCGATGAGTTCATCCCATGTTTATGGAGTTCAAAAGTCCCCAAAAAAGTGAAGGTTTTTATTTGGATCCTCGTCCAAGAGAAGTTAAATACATGCAATACATTGCAAGCTAAAAGGCCGAATCTGGCCTCATCTCCTGGTTCGTGCCTTTTATGTAAAGGAGAGATGGAATCTCATGCCTGTATCTTTTTCTTTTGTGAGTTTACTGCAACAGTGTGGTCCTTAGCACTCAGAGTTTGGGTTGAGTTGGTGCTTTCCAGCTGTAAAGAGGCCTTTTTTCAGCTGTTTTTTGGTTCTCCTAAAAAAGAAGTACTCGTTATCTGACGTAATGTAGTGGTAGTCACGGTTTGGAGGATGTGGTATGAAAGGAATGAAAGAATCTTCAAAGGCAAAGAAAACTCTATAATTTGCTATTGGGAAGTTGTTCATACTTTTGCTTCTTGGTGCCTTATTAGCAAGTTGTTTTGTAATTATAGCTTCATGAGTCTTTAACGCCAATTGTAATTATTTAGTAAAAATACATGAAATCAACAGTTGGAATAGTGGTTAACAATTGGAATGCTAAAAAAAATGTGCGCCTGAGCTTGGTAAGAAATAGGGACAAACTTTTGAGTCAAAAGTGTCCATTTCAACGTTAATAGTTCAGGTCCGCCTTTAAGGTCAACTGATAGTTTTCTCACCATAATGATAGGGGCAAACTTTTTCTGGTTGAGTTTGTTGTAAGTTCTAGCTGGAAGATTAAATATCCATTTCCTGTCCAAAGGGGCCAAGGTAAATCAAAAGGATTTAAAGGGAATGAGTTCAAACTCCGTGATGGCCACCTACCTAGGAATTAATTTCCTATGTGTTTCCTTAACAACTTGTCCCGTGAGATTAGTCGAGGTGCGTTCAAGCTGACCCGGATACTCACAGATATAAAAAAACAAACGAGTATAGTTGGAAGATGACTGAAATAT
TGCATGCTTCAGTTCCTCCTTTTGGGTTTCAACTCATTTTGTTCAAAATATTCTCAATAATATTGTTTATTGAAAATTCAGGATTTCACTTATGTTTCCTTTTGTTCTTGTTCTTTTCCTTGTGTGGCTTGTAGCTCTTATTTTTGTGCAACTCCCTTGACGCCAGCTGGAGAACCTTTGTTAACTTCTTGGCATTATTTACGTTATTTCATTTACATCATTGAATTTTTTTCTCTAAAAGAACCTTTTCTGGCCACTCACATTAATTTTTTTTGGAAGTGCAAAGACCAGTTTGCACTAACTAAAAAAATCAGCAGATCTATGACTATTCTTTTGTCATTGATACTGTGATACATTGTTATTCTATGTTGGAAAAACTAAAAAACTCAACATATATGTGAGTGTTCTTATTTTACCCTTTCAAAGAGATAATATCTTTTTGTCATGTGACATGTCAGGGAACTTGGAAATTGGAAATGGTTATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTAACCCCTGGTAAGAACTTTCTCCTTCTGCTTGACTTCTACTTTGATCTCCAAATTGGTTCATTTCATTCTTGTCATGCTTTCAGGAAATAATGCGATTGGCTGGAAAATCCAGGGACAGGTTACGGAAGCAGAACAAAACTCTGCTACTAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCCAGGGGTATTCTTAAAGAAGATACACAAAAAAGAAATTCTGTAAGAACTGATATGCCAAACTTCCCAATACTTCTTTGTCCATCTTTTTCTCCTGTTTTAGTGATTATTGGAAATGATGTGTTTTAGTCTTCTATTTTATTTTGCCACAGACTAACTTGGCCCAGTTAACTAACATTTTTTTTTTCTCCCTGATGAGGAAGTACTTCCGTTTTTCTCCTTTCTTATTTTTATAAAACAGATTTACTATGCACCTAATATTTTGTCCAGATTTACAAATAGCCTTTCGTGTCATTTTTTGGATGTCTAGAATCTGACTTCTGATAGCATTTATAGTTAAACCGGCTTTTGCATATTCTTTATTCTTGTGCTTCTTTCTTAATCTTGTTGTGCAAACAATAACTGAACCTTAAATTACAGACAAATTCTGATGCTATTTCAAATCAACCAGTGCAAGGAGATAAGCTGCCTCGTGGATGGGTGTGTACTTTGTGCATTTTCTTTACCAATTATATATAATGTTCTCTTTCACTCTACAAGCTTATGTTAGAGTTACCTATTGCAACTTGCAAGTTAGGTTTACTTCATTTGATATCAATTGATGGGTGTTTATTATATAGAAGACAATGTTGGTTTAACTAATTTTTCTATGATGTACTTGATAAACAATTATACTAAACTTAATTCATTTTAATAAATCATACGTTTATATAATATTAGTTATGTTATATCCTGTAAACGTAGTGCTATAGTTGCAAAAATCATATTAATTCAGTATCAATTTTATCAATGTGATTATAGATTACAAAAGGTTATGCCAATGTCACACTAGTGGTGTTTGCTAAAGCAGTTAAGCCTGGGAAATGAAAGGACTTGGAAGAATGATGAAAAGCCACACTTATGGGGTTGAGGCAGCACTTGCCATGATAAGATTGTTTTTGTCATTTCCTTGTATCAAATAATGTTGTTGGCGAGGCGACCTTAGGATTTTGGGTTTGGTTCTTGTACTGGAAATATTATCGACTCCTAAAGGGTATATTTTGCCTTATGATTTGCAAAAGGAAAAAAACCCACGATACGGAGTATTGGCTGACCTTTAAAGTAAGTCCCCTTCACAAATTGATCAAAAGCTTCTGCAACATTTTTAGGGAATACCCAAGGCTTGAAAGAATTCAGCAGCTTTATCCATCAATCAGCAACAAAATCACATTCCAGGAAGAGATGCGTTTGATTTTCTTCACTATTTCTGCATAGCACACACACCAATTTGGGGATATAGAC
ATTGAGGGGCATAATCTCTGAAGTACGAAGCGAAGATGTTCTTTTTTTCTGTGTAGACAGTTCTATTTTCTTCTTAACAATTTCAATAAAATATGAAGATGTTCTCAGACCTCTTCTTTATAAAATTGCATGTGTTTTCTGCATAGGTGGAGGCTAAAGACCCTGATAGTGGTGTTTTGTATTATTATAATGAAAGTACTGGGAAGAGTCAATGGGAAAGGCCCTCTGACTCCTCTTTTGATCTGCAACTTCCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCACTCGATGAAACAACAGGTTGGTAATGTATTTCTGATGCTTATATATCTCCCTCCATGTATGAGGAGAAAAGATGAGAAAGCTGAAGAAGTAGCAGCACCAGAATCCGTATCTTTTCACCTTTACCATACTTTCCTGAACGTGTATCATTTCTCCTAAAAATAGAAATAACGTATCTTTTAAAGGTTTCAAGTTGAGTAAGTGGCCTTCAATTCCCAGTCTCTTGTTTGGATTGGTAATGTAGCTATCAGGAAGACTCATTAAGTACATTAAATGAGACAGCATAAATATTTAAATCTCTGCATTTGACCTGATTTATCCATTAAAAAAAAAGCTGGGCCAGTTGATGTAGGTTTGACATTTGATCTTTAGTTGAATGATTTTCATTTGTTTGATGTGCGCCCCAGTTGGATGTGTTATCTTGAAAGAGGGGCAACGGGGATACGATCATAGGGTGTTGGTTAGAAGGTGACAGAGCAAGTTGTTAGAAGCACAGTAAGAAAGGGGAAGAGAGTAGAATAAGTTGTAATTGGGGCATGAAAGCGGAACGTCATGAAATTCAAGGATATTGTATTTAGGGAGAGTGATGAGGGTAAAGAAAATTTAAGGTTATTTTATAATCACATCGAATATCAATCCAAGTATCAGAAGGCTAAAATCAAAGGGTAATTTTTTGCCTTGTTGCATTAATACCAAGTCTTATTAAACCTACAACAGAAGGGGACATGGGATGCACGCACACAAATAACACAACACACAAACTGAAGGCTGCACACCAAGTCAAGTATGGGGAAATCCTATTTTTGAGAATTTCACTCATATTCTCTCCAAGGTGATACAAGTGTATGGAATGAACTGCTTGCACTAGTCATAAGCTGATCACCCTATTTATAGAGGAGATACTCCTTCACATTTCCCCCCACTATATGGTCCTTAAAGTACTTTAAGGAAGCAACTACCCACTACCTTACAATCTTGCCACTAACATTGTCTAGGTAGTTATTTAACTACTCCCGTTACATGCTGGTATTTCTCCATGTGACCCATGACTTTAGGGGACCATGAAGGTCCCCTTCACGTTCTTGTTGTAATTCGTGCTTTTCCTCGGTTCAGTGCAAACAAGTGATTAATTATAAATGTTCAGAAAAAAGTTGCTGAATAATATAATTTGCAGTTTTGTTCCCGTTGACAGCCTTTAATTTGATTGCAGGCCTTAAATACTACTACAATGTAAGAACCCACGTAACCCAGTGGGAGCATCCTGTTTCATCTCATCAGGGAACTTTGACACACTCGAATGGTAACGTTCCTGGGGTTTGGAACAACCAAACTTTTGAACAAAATAAATGCATCGCATGTGGAAGAGGAATGACCCTCATGCAGGGTTCTAGATACTGCAACAGTTATACATGGTAAATGATATATGCATATGCATATGCATGTCAACATGTGTGTGTCTATGTATGTATATGTATATGTATATTTATTTTGCTAAGAAAAGAAACAAAGGTCTTTTTTTAAGTAACTGGAAAGTATAAGAGAGGATGTACATAATAAAACTCATTTGATCAAAGAGGTATTGAGTTACATTTAGGGAAGTTTGAGAGTCCAAACCTCTCCCCTCCTACCAGTCTACTTTAGTCTAAGAAAAATAATGAATTCCTCCTCTCCAACTCACTGTTTTTTTTTTGGAACAAGGAACAACTTTTCAT
TGATATATGGATACAAACTCCCGAAGGAGCGAAAGAAAAAGGAAAAAAAAAGGTAATAAATGACAAGCTCAACGTAGAGCAATACAATTTGATTGGAAATCAGAAGTCACTAATGAAGAGGCTCGAAAGCAATAAATATTGAACACCAAGCTTCCTTAGAACACACTAGAAATGAATTTCGAGGAGAAACCACTGAGAAGGTACCAAAGCTCCAAAATTTGCAACAAAATCCAACAAAACAACCAAAAAAAACATTCACGAAACACCATCTTCATCAGAACCTCTCCAACTTATTAATCTCTACTACACTAACACATCCTTTTAGATTCTAACCAATAGACTTAGAGTCCCACGTATACATCCTACTCATCTGTGAGTACATCAATCCCCCCACGTGCTTCCTCTTGTACATGTTTATGGTTGGGGGCCTATTGTTGCATGCTTAGCCCAATGCTTAGGGCGTGGCATGCAAAAACTACATAAATACGTTTCTCCCAATTGACCACATTCATCTGACAGTCCATGCTTTTAGTTTGGCTGGGAGTGTTGGTCGTATTTGGTTTTAGTTGGGTTGTTTAAATTTATCTATTGGGAGTTATTTTGTAACTTCTATTGTAATTTTCATTTAATTAATAAAAACTTTTGTCTTTCCATGAAGAAGGTGGAATGGTGGGAGAATCAGATGATAAATAAAACCAACCTGTCACAATAATGTAGGAGCTCACCTTGAAAGAGTAGCCTTCTGGGGTGTTTCTTCCTGCCATATAGCTTGCTTAATACCCTGAAGCTCATGTTCACCATTGTTTTTTCTTTTTTTTTTCCTCACGATAAGAGGTGTATTAAGCCACCACTCACTAAGTTCCAGTGTATGTGGCATTCTCGAGAGTGTATTAGCAGCCTCGTTCTCTAGGTTGGTCCTGTATTGAATTGTAATTTTCTTATTTTTCCCCAGCAACCTGTCACTGATCAAGGAGAAATTTCAAGCTGTTCTAATCATTTTGCATAATTTTTTTTTGATATGAAACAAAAGTTACAAAACTTTCATGGAGAACAATGAAAAAGGTACAAAATACACCAAAAGGCCTAAAAAGGGCATACAAAAAGATAGCCCAGCAAGAGGAAGCCCCCTAAAGAAAAGGACTCCAATCCAAAAGAATAAGGCCAAGCGAATAATTACAAAAGAGATGAGAAATCGAGCCCCATAACGAAGTATTAAATCTAACAATGTCCCAAACATTGTCCACAGACTTTTCCACCCCCCTTAAACACTCTATGATTCCTCTCCAGCCAAATACCCCATAACATATCTAGAAAGCAAGCCTGCCACAGAAAACGATCCTTATCCCGAAAAGGGGGGAATAAGAGCACCTCCTCCATCATAGCCCTACAATCCCTGGCACAGACCCAAGAAATCCCGAAGCATAAAAAGAGACGATGCCAAAGATCCTGAGCAAACTGATGCCACTGTAAATGCCACTGTAAATTTTCTGTTTTGTCTAGAAAAAAGAAACAGTTGCCATTAGTTCAGGTTCATAAACCAGTTTACGTTGGGGTGTTGGGGTAAAGCTTGACTAAATTGGTAACTGGATGTTGATTTTGAATGACCCCAACACCAGACCCAAACACACCAATCCCCTAATTGAATGACTGAGAAATTGAGATGCCCTAGAACTGCCTGAAGTTCTGCTGCTGCATTCTTTAACCACCAAAAATGGTCTCTCTTGAGTTGTTGAGTTAATGGGAAGGCAATGGAGCGACAGTTAGATACGAATGCACTAGAGAGTTCCAAAGAAATATTGAGATATGCCTAGGAAAATTTGATACTCTCTCATCCTTCTAGATTACTCTACACTAAGCAAAAGTGTGACTTCCTTCTCCCCAACTTACTACTGTCCGTCCATTTAGATTCTGACTAAGGGCTTAGATATACAATTTATCCTCCATTAAGTACGTCAATTTTTCCCCCTCCTTCCTCTTATACACGTATATTAGACACAATC
TGCCAACTAACAATATGGTCATTTCCCAACTGTTTTCTTACTGCAAGATTTTATTTGGATAAAACCTCACACTTCATTCTTCATTTTGTTGGTTTTGGTGTCTAAATGATCAGTGAGGCTTCTACAAGTTCAACCAATGGGAATTGGCAGGAGCAATCGTTTGAGCTAAACAAATGCATGGGATGTGGCGGTTGGGGGCTTGGCCTTGTGCAATCTTGGGGTTACTGCAATCATTGTACACGGTATGTTGATGTCTATATCATTCAGCACTATTAATCAGCATACCCTTGTACTTGGTGTGATTATTTCCAACTATTAACTGATGTGTGGGTCACGGTGGGTAGTTAGTAATTGTGCATTGTTGTTCTGTTAGTCTTGAAGATAGAACGAGCTTTATTCATATACCTAAATTTCCTAGGGAGAATAATATCATAAGGAAAATTTCAAGTTAGAAAGATGTGGTCCCTGTGAATCTTGAGTATATTAGAACTTGACCTGTATCATCTTATTTTATCCAATGCATCCACTTGGGATTTCCCTTTCTTGGGGATAGCAGTTGGTTTTCAAAATATGATAATTTTTAAGTCTGATAGGCTTGAAAACTTATTAGTTACTATATAATCGCTAACGTAATTTCAGGTGTCTGAAAACAAAAAACGCAAATGTGTTTTGGAACTTTGGAATTTTCTTTAACATTAATCCATTCATTAATAAGGCATATTGCAGGTGCAATTATTTTGAAGTAAAAAATTTGATAAAATTTGATATTTCTATGCCGTGTATAGTTTGTTAGCCCAAACCCTGGATAAATTCATTAAAAGTTATTTGTGCTTATCATGGAGTATGTACTGCTATTGTTTGACTATGAAAACTTTTACAATGGCTTTCAGAATTCTCAGGCTTCCCCAGTGTGAATACTTGCCAACCAGCAGTGTTAGTAATCAGCATCAGCAGAAGACCGAGAGTATCGATCACAGCGCTGATGCTTCCATTAAAAAGTCTGCTATGGATAGGTAATGGAATCATCTTCTGTGTTTTAGGATAAGAATTAAATCAGTTTGTTTATAATATCAATATCATATACAGATTCTAATTTTTGCAATATTTGGAGTTTGGTGGTTGTGCTCAAAACTTCTTTTTTTTTTCTTTAAAAAAATAAGGTTAGTGACAATATGTGTGGATAATGGTATTTAATAGTTATTTTCTCTTGGACCTATTTTTTAATTCAACTAAAGTTGTATTGAAAATTATGGGTTGTGAATAGGTATTTTACATGCAGAGAACTTCATCCTTATAATCTCTGCCTCTAACTAGTTTTCCGGCATCCTTGAGTGGCCTTTTCATCCTTCAACAATGCTCTTTATTGTGTGTGCTAGTTCTATCTATCCTCCTTTCTTCTCCCTTTTTACATTGCTGTGAGTCTGTGACACTAAAGAGCTAGGTTAGCATCATGGCGTTGGAAATTTTGTTTTAAGTGTTCCAGTGAGGTGAGTGTCATGGGGCTTGGTCAACCATTTGAGAAATCGTTCAATTCTTTTGTCTTGTAAAATAGCAAATAGCTGACTATAAGCTTGCCAAAAGTACCAAAGACTTGATTTGGCATAAACTCACTATTTGAGCATACATATTTATAAGTATTAAATGGCGAAAGGTTTTCTAACCTGGCTTATCATATCATTCTGAACCATCACCCAACATTTTCTCCCTCTATTGTCAATTTGTGAACTTTGGCCATGCAACCTTTTTGGTTTGATTTTTTCCAATTCAAAGAAGATTGTCTTTCTCATCTTGATTCATTAAAGAACTTGGTGTATCTGAAGTACTGATAAGGTTTTTTGAAGACACTGCAAAGCCACTTGATAGATGTTGGTGCAAACGGCATAAAAAAGGATTTGAACACTTCAATCTCTGACATGAGATAGGAATGACCTCTAAAATGAACATCTAAAGTTATCTTGAGCTATTTTTTTCAACAGCACAAAAGACGTGATAGGTGAGAG
AGTCACCATTGGGAACAACTAATGAGTGAAAGCTAAGCTTAACAATTGATTAAAAAGAACTTTCCATGGTGGAGGAGATTGGCAATGAAATAAGGGACAAGGAGCCTGGTCTGGGGTCAGTTTTGGAAGAGGGGAAAGGAAAGTGGAGAGAGACGCCATAGTTCGAAGAGGGATAAAAAATTGGTTGGTAAGTTTGATAGAAATGAGGCCAAAGCGAGCTTAGCTCAGCGGTAATTGGCACATACCTTCAATCAACTTCGAATCCCCACACCCACATATTGTACTAAGAAATAAAAAAAGTTCGATAGGATTGAGGCATGGGCACACTTTATGGCATCTACTTGGTGTTATTTAGCTTAAATCTTTCTTTAATGGTTCACCGTTTGTAGTTTATTTGTCTTGGAGCTATTTTCTTGTGAGCTCATTTGATTGGGGTTCATATTGATTTTCCTTTAGTACTTTTCGTAGAATCAATGAAAAGCTTCGTTTGCGTTTAGAGAAAAATGTCTTCTCCATGCTTGCCATGTCTGACATTAAAGGAAAAAGTTATATTTATTATCGTGTATCCAACATGTGTCATCCTGTTCTGCTTTTTTGGATGCAGTGGATGCTAGTATGTTACCCTTACCACGGGAGTTCTCATGGTACATAGGTTTGTAGATAAAGTATCTTTGTCTACTAAGATATAGAATCAACCAATTATTTCTTTTTTTAATATGAAACAAATATTTCATTGATGGTATGAAATATACAAAAGGCCCCGAAGGCAATTACAAGAAAGCTCTCCAACTTGTCATAAGATCATTAAAAGAGTAATTGTTGAAAGGAGGTAGAAACATTTACACCAACCCAAGGCTAAAACAAAAATTCTATCAAAAAAGTTTCCAAAAGGGAGGGTTTTATCATTGAAAACCCGAAGATTCCTTTCTTTCTTTCCTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNATAAGAAACAAAATATGTGTATTCAAAACAAGAAGGCCCCTAAAGGGGATACAAACAAAAGGCGGAGGGTAGAGCTAACCCTCCCCAAAGCTCTATACAATCACCGCTTTCCAATCTTGTGTAAGCAAAAGAAGATTGTAGTTACAAAAAACTTTCTTGTGTAGTAAAGTCCACCAAGAAGCTAGATACTTAATATTCACACAAAAAGAATCTAACAAAATCAACTTATCTTCAAAAAGCCGACTATTCCGTTCCTTCCATATCGATTATAATAGAGCCTTGGCCCCACAATTCCAAAGAACCCAAGCTTTGCCCTTGAGCTGCCTTCCTCCATAACCTTCAGCAATGAAATCCTCAATTCTTCTAGGAATACAGAGAAAGAAAGAGAGAAATCTAGCAAAGAAGCTATAAACCCTCAAAATTTGGAAACAAAAGGGCAATGGAGAAAAAGGTGGTCCACTATTTCATTGTTGCTCATACACATTGTGCACACAGAAGGAGAAAGAGTCCAATGAGGTAATTTCCTTTGAAGTTTCTCTGTGGTATTTACCCCTCTATAGAACAAAGTCCAAATGAAGAATTTTACCTTTGGGATTTTAAAATTCCATATCAAGCTTGTAGTAGCTGAATTAAGTTTAGGAGAAGGGGACGTGAGAGCCATGAAGGCTGAATTGACTGTGAAACGGCCGGATGGGTCCGCTTTCCACCATAAAATGTCCCTTCCATTTCTGATAGCTGCACCTTCTATTTTTCCAGTGAAGTTTGCCCATCTACTCACTTCCCTTTCAAAGAGATTCCTCCTGAAACCAAGATTCCAGGTTGAGGCCTTTACACACCAACAATCCGCAATGGACATACCTTTCTTGGAGGATAACGCAAACAAATCTGGGAAAGTGGTGGAAAGAGGGGCAGAATCAATCCATGAGTCCTCCCAAAATCTGACTTGGACCCCCGAGGATATTTTGAAGAAGGAGAACTTTTGAAAAGCTCTAAGGTTCCCTTGTATGTCCCTCCACGATCTACCAATGGAAG
AACCACCCGAAGGTTTTGTCCACCAACCAAAAGACTCCACACCATAAATATTGGCTATCACTCGTCTCCAAAGGCTATCTCTTTCCATTGTGAACTGCCATAATCATTTCATTAATAGAGAATTGTTCCTTAGCCTCAAAGAACCAATTCCAATGCCCCCCTGTTTATGAGGCAAAGCCGACCAAGACCACTTCACCAAATTCCCCACCGATCTAGAAATCCCACCGTTCCATATGAAGTCCCGCATAAGCTTCTCCATAGACTTTATGACTGTCTCTGGAGCTTTGAGAATCGAAAAGAAGTATATGGGGATACTATTGAGGACCGACTGAGCAAGAGTAAGCCTACCACCCTTGGAAAGAAGCAAAGAACTCCATTTTTGGAGTCTTTTGTGCATTTTGTCCACCACCGGAGCCCAAAAAGTAGTGGTCTTGTGGTTCTCTCCGAGGGGGAAACCTAGATAATTAAAAGGGAAAGAATTCATCTTGTAGCCAAAAGAGGAAGTCCAGAAGCTAACCTCTGACCTTGTGCAATTAACACCAATCACAGCAGTTTTTGAGACATTCAGGGAAAGACCCGATGCCAATAAAAACACATGAATGAGACCCCACCAATTCTCCATTAATTGAGCTGAAAAGGGCAGAAAATAAGCACATCGTCGGCATATTGGAGGTGAGTCACCTCTAAGGAACTATTTCCAATATTGAAACCTTTGAGAATATTCCTCTCATTACAAAAGTGAATTAATCTACTCAACGCATCACCCACAATCGTGAAGAGAAAAGGGGAGAGGGGATCCCCTTGTCTTAAGCCCCTCTTGGCCTCAGAAAAGTTGGTAGAAGAAAGGCACCCCCTAATCCACCTACGCCACCGAACCCCAAACCCTTTTGCCTCCAGAACAACATCTAGGAGATCCCAGTCTACCTTATCACAAGCCTTTTCGAAATCAAGTTTGAGGAGAAAGCCTTTTTTCTTCGAGTTCTTCTACTGACTCACTGTCTTAGAAGCCACCATAATAGCATCAAGGATTTTCCGACCTTCTACAAAAGCTGCTTGGGCATCATGAATAATGCTAGGGAGCACTTGTTTCAAACGGGCAGTAAGCACCTTAGCAATAATTTTGTATAAAGAAGTGACCAGACTAATGGGCCTATAATCTTTGACTTTAAGAGGGTTTGAGTTTTTGGAAATCAAACAAATGTAGGCTTCGTTAGTCTTCTGGTTGACAATTCCCTTCTCGAAAAAATCGTGGAACACAGTAACCAAATCCGGTTTAATAATGTTTCAAGCCTTTTTCAAAAATTCATTGCTCATACCATCCGGACCCGGAGATTTAAGAGTGCCCAAATCTGCCACTGTCTGTCCTATCTCATCCGCTGTGAAAGGGTTTTCTAGAAGAGCCATATGGGAGTCTTGGAGGGGAACCCAATCCAAATTCTCAAATAGAAAGCCTCTGTCCATGGTGGTGGAGTACAAGGAGGAGAAATATCTCAAGAATTCTGATTCTATATCTTCCCATTTAGTCAAAGAGGTGCCCTGTTCAGTTTGAATTTCAGCTATGTAGGCTCGGTTTTTCTTGTATGCAATCCATCGGTGAAAGAAGTTCGTGTTCTCGTCTCCTTGCAGCCATTTTATTTTCCCTTTCTGCATCTACAATCTTTGCTCACTGAAAGTAGTTTCAATCAAGCACTCTTTTAGCCCCTGTCTTCTATTGAAATCATCTTCAGAGAGAGAATGAACTTCTTCCTCTCTATCCAAAGCATCTATGGCTTTCATAAGATCCATCTTCTTGAGGCCAATGGGGCCAAAAACCTCTCTGTTCCAGTTGGTGAGCACCCCTTTCAGACCTTGAAATTTCTTCATGAAACCATAACCCGGCAAAAAACAATCTAGAAGAATGGCTGGTTGCGAATGAATAGCTGTTGACCATCCAAAAGCCAAGAGAACACTGTTCCTAAAACGAGTGGCAAAAGGACAGAAAATAAAGAGGTGACCTTGTG
TCTCTGATGCTCCAAAACAGAGGTGACAGTAGTGAGGGGAGAGTGATATGTAAGGGTGCTTCCTCTGTATCCTGTCATGTGTATTCAAAGCCCAATGGCTGAGCTCCTAGCAAAAAAATTTGATTTTCTTGGGACATTTTCCTTGCCATAGGCAAAGGTAAAGGGTGGGAGCCAATTGGTCAGGTTGAGTGTTGAGGAGATTGAAGGCAGATTTAGTGAACACTCCCGATGGTTCTAGGAATTATGATATGACCCAAACCATTAGAATAATATTGAAATATAAAGATAAAATAACTACTAATTTGTGGTACTTGGACCCTCCCCACTCTTGAATATTCTTACTCAAGCCTAAACCACACTAGACTCCCACACCCTATTTATAACAAAATCCCCCCAATACTCTTAATACCCTAATAATATTGCTAGTTTTATCCTAATAACAGTTGGAATTGTCTTAGCGCCAGTATCATTATATTTTCATGACTTAATATGTAGATACAAATCTATATGTACATGAATTATGGTTGTTGTTTATTATTAAATATACTTCATCGAACTTTTTGGTACACTTCTTTATGTATTTTCTTTTCCTAGTTTTTGTAATGTAAAGTTTACACTGCACGTCTTTCCGGGGAAGCGGATGCATTACTCTGTCCACAAATAGGGAAAGAGTAGATTATTAAGCTATAGAAAATGGATTGCCAGGTCCAAATGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAGTAGGAAGCGATCCTACAGCGAGGATGATGAATTGGATCCGATGGACCCTAGCTCCTATTCAGATGCTCCTCGTGGTGGCTGGTAGGTGTTCGAAAATTAGCTTTCTGGGTTTAATACTGATTGATTATCTAGGATTTCTGCCAATAAGGGGAGTGATGATAGTTAATTGTTTTATTTACTTTCCAGGGTTGTGGGTCTAAAAGGAGTACAACCTCGAGCAGCAGATACTACTGCTACAGTAAGTAGTAACTAATCATTTTGTTGAATTTTGAACTTCAAAGAACGATTAAGTTCGTACTTTAGGTTGCAACTAGTCCACCTGTATCTGGTGGAGAGTAAACTCAATGAAAAGTAAGAATATCTCAATTGCCTTTATTAAGAGAGGATCTCTATTTATAGGGAGATCTTAACCAACCAACTAACAAACTAATAGTAACTTGTAGTTACTACACAGTAAAAATGATGTTTTAAAAGAAATCCAGCTACAAGCTAAGCTTTAAGACTTAAAGCTATCCGAAAGGCCATACACGGCTACATCAGTATCTCATTTGTTATTTCATGTATGGAAATGTCACGTACAGCATATAAATACCAAAACTTCCCGTATACATTTGAATGCTCAACGTTTATGTTTTAGAATGCCAATACATTTTCCAAAAGACAAATTGCACACTGAACAGGAAATAATTGAAAATATTAACCATGCTTTCATGTTCATTTCTGTGATATTTTTCTAGCACGACACTTTGTTTTTTCTGCCAATCTTAAAACAGTTCTTTCACGATGCCACGAGTCTTCTCTATGACTTGTACTGATTCTATAGTCTGCAATGAACACAAGATAGTGGCTAGAACATGGGTTGTCCTCTTGTTCCTCTATCAAATCTTCATTGTGTAGGATCTTAGTTTTTTCCCCCCTCTTTGTAATGGAAATACTTTGCTGTGGATAATTGGTTCCCCTCATACATGGTGAAATTGGCAGGGTCCTCTATTTCAACAGCGGCCATACCCATCACCTGGAGCTGTTCTGAGGAAGAATGCCGAAATTGCTTCACAAACCAAGAAGGGAAGCTCTCACTATGCACCTATTTCCAAGAAAGGAGATGGGAGTGATGGCCTTGGTGATGCTGACTGATCTTCTACGATACAACAGCATGCCACGTGTTTCTCTAGCGCTGGAATGCGTTTTCCACATGAGATGCTGTGGACTTTTTTACCCCTATAGCCTTAACTTGAAGGTATGGATCAAAT
GAACGTGATTGCTGCTGACTTTTTGAAGGATTGGCCGTATGTTCAAGTGGGAGTCATTGAGGCATTTTAATGTGTCCTGTTTACATGTTCTTGGTTCTGTGTCACTTTTTCTACCCCTTTGTCTTTTTGCCGCCCCCTCTTGTACATATATATACTTCTATAAGAAAAAGTAAAATTTTGTGTAACTAACTACGTTGAGCTTGTGGGCCTTGGGGCGGGCGCGAGTGCCAAGAGGTGAGTAAAGCTCCGACTCCAAATTATCCACAAAAAAAAAAAAAAAAAAAATTACACTGCATTGAGCTTGGACGTAGGTTTGTTTATCACAACTTAAGAGCTTTCTAATCCTAGTAGTTATGTAAACTCAAAGCCTACTTTCTATATTACATTTTTGCTTTTACAAATTATAGATTCACGAGTCAAATATAGAGTTATGTAAACTTGAGTTTGTCAATTCTAAATTTCATAAATTCAGAAAGATTAGATTCATGAGATGAACATTCA

mRNA sequence

AGAGGGAAGAAATAAAGATTCGGAAGAAATTTGTTTTTTTTGAATAATAAGAATATTTTAGTAATTTTATAAAGTAAAAAAATGAAAATACGTCAAAAAAAAAAAAAAAGAAAGAAAGAAAGAGAGGATCAAGAACGAGAGATTTTTGGCTAGTCATCGAAATTCAATTAAGAGGAACAAAAATCCCAATTTCTGAAAAATAAGGGGAGGTGGGTCACTAATTAGATTTTTTCGGAAGGAATCCAAAACTGGTTCTCGAAGGGAATCCCAATTTCCCAACAATCTCGGCATTTCATCGCTCGCTCTCTCTGGCACCCCATTTTGCACCATTCCGACCAAAAGCTTCTCTCTGCATATCGTCGAATTCGCCATATCCAAATGCTCCTCAAATTCCCATAATCAGTTGGTCAAGTGCAGCACAACTTCCAAGACTGAAACCCTAATCCAGAGGATGCCGACCTCAAATGCAGCAGCTGCAGTTTCCGGAGACTCGTTCATAACTACAATTGGGTCCAGCGTTGAAGATAGACCCCTCAAGGAATCGGGCGCCGCTCAATCTCAATCTTACGCCCAGAATGAAGTGCAAGAACTTGTAAAGTCTGGCAAACAAGACTCCTCTAGCCAACCGGGAGAAGCGCAGAGTTCTGTAGCAGTGTCTTCCGATCAAGATACCTTTGTTGAGCAGCAACTTGGGAAGAGCACTGCAACTGTGGATGAACTGCTCGTGCAGGGGAATGAAAAGTTCCAGGAGACAGAACCAAGTCGTGTAAATGATCAGAACAATGTTCCCCATGACGGCGTGTTTAAAATTGCTTGCTCGTCTTCTAGCAAATTCGGATCGCATGTTGGCGATACCAGGGACATTGACAGTGCTGTTCAGGATGCAGTGTTGAGGGAACAGGAACTTGCAACCCAAAATATTATTCGCAGCCAAAGGGAGTCCTTGGGTGCAGATGGACCTCCTAGCGAAAGATCAGATATCTTTTCAGAACGTTATGACCCAAGTACTCTTAAAGAGCATCTTTTGAAGATTACTTCTGATCATCGTGCTGAAATGGCTATGAAAAGAGGAAAGTCAAACCTTCCAGAAGAAGGGAACTTGGAAATTGGAAATGGTTATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTAACCCCTGGAAATAATGCGATTGGCTGGAAAATCCAGGGACAGGTTACGGAAGCAGAACAAAACTCTGCTACTAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCCAGGGGTATTCTTAAAGAAGATACACAAAAAAGAAATTCTACAAATTCTGATGCTATTTCAAATCAACCAGTGCAAGGAGATAAGCTGCCTCGTGGATGGGTGGAGGCTAAAGACCCTGATAGTGGTGTTTTGTATTATTATAATGAAAGTACTGGGAAGAGTCAATGGGAAAGGCCCTCTGACTCCTCTTTTGATCTGCAACTTCCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCACTCGATGAAACAACAGGCCTTAAATACTACTACAATGTAAGAACCCACGTAACCCAGTGGGAGCATCCTGTTTCATCTCATCAGGGAACTTTGACACACTCGAATGGTAACGTTCCTGGGGTTTGGAACAACCAAACTTTTGAACAAAATAAATGCATCGCATGTGGAAGAGGAATGACCCTCATGCAGGGTTCTAGATACTGCAACAGTTATACATGTGAGGCTTCTACAAGTTCAACCAATGGGAATTGGCAGGAGCAATCGTTTGAGCTAAACAAATGCATGGGATGTGGCGGTTGGGGGCTTGGCCTTGTGCAATCTTGGGGTTACTGCAATCATTGTACACGAATTCTCAGGCTTCCCCAGTGTGAATACTTGCCAACCAGCAGTGTTAGTAATCAGCATCAGCAGAAGACCGAGAGTATCGATCACAGCGCTGATGCTTCCATTAAAAAGTCTGCTATGGATAGGTCCA
AATGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAGTAGGAAGCGATCCTACAGCGAGGATGATGAATTGGATCCGATGGACCCTAGCTCCTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGGAGTACAACCTCGAGCAGCAGATACTACTGCTACAGGTCCTCTATTTCAACAGCGGCCATACCCATCACCTGGAGCTGTTCTGAGGAAGAATGCCGAAATTGCTTCACAAACCAAGAAGGGAAGCTCTCACTATGCACCTATTTCCAAGAAAGGAGATGGGAGTGATGGCCTTGGTGATGCTGACTGATCTTCTACGATACAACAGCATGCCACGTGTTTCTCTAGCGCTGGAATGCGTTTTCCACATGAGATGCTGTGGACTTTTTTACCCCTATAGCCTTAACTTGAAGGTATGGATCAAATGAACGTGATTGCTGCTGACTTTTTGAAGGATTGGCCGTATGTTCAAGTGGGAGTCATTGAGGCATTTTAATGTGTCCTGTTTACATGTTCTTGGTTCTGTGTCACTTTTTCTACCCCTTTGTCTTTTTGCCGCCCCCTCTTGTACATATATATACTTCTATAAGAAAAAGTAAAATTTTGTGTAACTAACTACGTTGAGCTTGTGGGCCTTGGGGCGGGCGCGAGTGCCAAGAGGTGAGTAAAGCTCCGACTCCAAATTATCCACAAAAAAAAAAAAAAAAAAAATTACACTGCATTGAGCTTGGACGTAGGTTTGTTTATCACAACTTAAGAGCTTTCTAATCCTAGTAGTTATGTAAACTCAAAGCCTACTTTCTATATTACATTTTTGCTTTTACAAATTATAGATTCACGAGTCAAATATAGAGTTATGTAAACTTGAGTTTGTCAATTCTAAATTTCATAAATTCAGAAAGATTAGATTCATGAGATGAACATTCA

Coding sequence (CDS)

ATGCCGACCTCAAATGCAGCAGCTGCAGTTTCCGGAGACTCGTTCATAACTACAATTGGGTCCAGCGTTGAAGATAGACCCCTCAAGGAATCGGGCGCCGCTCAATCTCAATCTTACGCCCAGAATGAAGTGCAAGAACTTGTAAAGTCTGGCAAACAAGACTCCTCTAGCCAACCGGGAGAAGCGCAGAGTTCTGTAGCAGTGTCTTCCGATCAAGATACCTTTGTTGAGCAGCAACTTGGGAAGAGCACTGCAACTGTGGATGAACTGCTCGTGCAGGGGAATGAAAAGTTCCAGGAGACAGAACCAAGTCGTGTAAATGATCAGAACAATGTTCCCCATGACGGCGTGTTTAAAATTGCTTGCTCGTCTTCTAGCAAATTCGGATCGCATGTTGGCGATACCAGGGACATTGACAGTGCTGTTCAGGATGCAGTGTTGAGGGAACAGGAACTTGCAACCCAAAATATTATTCGCAGCCAAAGGGAGTCCTTGGGTGCAGATGGACCTCCTAGCGAAAGATCAGATATCTTTTCAGAACGTTATGACCCAAGTACTCTTAAAGAGCATCTTTTGAAGATTACTTCTGATCATCGTGCTGAAATGGCTATGAAAAGAGGAAAGTCAAACCTTCCAGAAGAAGGGAACTTGGAAATTGGAAATGGTTATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTAACCCCTGGAAATAATGCGATTGGCTGGAAAATCCAGGGACAGGTTACGGAAGCAGAACAAAACTCTGCTACTAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCCAGGGGTATTCTTAAAGAAGATACACAAAAAAGAAATTCTACAAATTCTGATGCTATTTCAAATCAACCAGTGCAAGGAGATAAGCTGCCTCGTGGATGGGTGGAGGCTAAAGACCCTGATAGTGGTGTTTTGTATTATTATAATGAAAGTACTGGGAAGAGTCAATGGGAAAGGCCCTCTGACTCCTCTTTTGATCTGCAACTTCCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCACTCGATGAAACAACAGGCCTTAAATACTACTACAATGTAAGAACCCACGTAACCCAGTGGGAGCATCCTGTTTCATCTCATCAGGGAACTTTGACACACTCGAATGGTAACGTTCCTGGGGTTTGGAACAACCAAACTTTTGAACAAAATAAATGCATCGCATGTGGAAGAGGAATGACCCTCATGCAGGGTTCTAGATACTGCAACAGTTATACATGTGAGGCTTCTACAAGTTCAACCAATGGGAATTGGCAGGAGCAATCGTTTGAGCTAAACAAATGCATGGGATGTGGCGGTTGGGGGCTTGGCCTTGTGCAATCTTGGGGTTACTGCAATCATTGTACACGAATTCTCAGGCTTCCCCAGTGTGAATACTTGCCAACCAGCAGTGTTAGTAATCAGCATCAGCAGAAGACCGAGAGTATCGATCACAGCGCTGATGCTTCCATTAAAAAGTCTGCTATGGATAGGTCCAAATGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAGTAGGAAGCGATCCTACAGCGAGGATGATGAATTGGATCCGATGGACCCTAGCTCCTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGGAGTACAACCTCGAGCAGCAGATACTACTGCTACAGGTCCTCTATTTCAACAGCGGCCATACCCATCACCTGGAGCTGTTCTGAGGAAGAATGCCGAAATTGCTTCACAAACCAAGAAGGGAAGCTCTCACTATGCACCTATTTCCAAGAAAGGAGATGGGAGTGATGGCCTTGGTGATGCTGACTGA

Protein sequence

MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQCEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
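
The protein sequence above should correspond to a codon-by-codon translation of the coding sequence. The following is a minimal Python sketch of that check, assuming the standard nuclear genetic code applies to this gene. The function name `translate` is illustrative and not part of any database tool, and only the first 30 and last 12 bases of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing unknown characters (e.g. N) become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the CDS listed above.
cds_start = "ATGCCGACCTCAAATGCAGCAGCTGCAGTT"
cds_end = "GATGCTGACTGA"

print(translate(cds_start))  # MPTSNAAAAV
print(translate(cds_end))    # DAD
```

Both fragments agree with the start (MPTSNAAAAV) and end (DAD) of the listed protein sequence. Passing the full CDS string to `translate` would allow the whole protein to be compared in the same way.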
Homology
BLAST of MC00g0145 vs. ExPASy Swiss-Prot
Match: Q91VJ5 (Polyglutamine-binding protein 1 OS=Mus musculus OX=10090 GN=Pqbp1 PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 1.0e-18
Identity = 85/267 (31.84%), Positives = 116/267 (43.45%), Query Frame = 0

Query: 338 DSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVP 397
           D   D +      LP  W +  D + GL YY+NV T +  W   +S H      +  +  
Sbjct: 35  DDPVDYEATRIEGLPPSWYKVFDPSCGLPYYWNVETDLVSW---LSPHDPNFVVTK-SAK 94

Query: 398 GVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGG 457
            V NN    ++K     R +  +  +   +  + E    S      +++ E N       
Sbjct: 95  KVRNNNADAEDK---SDRNLEKVDRNHEKSDRSHEKPDRSHEK--ADRNHEKND------ 154

Query: 458 WGLGLVQSWGYCNHCTRILRLPQCEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSK 517
                              R  +  Y       ++ +++  + D +     K     R +
Sbjct: 155 -------------------RERERNYDKVDRERDRDRERERAFDKADREEGKDRRHHRRE 214

Query: 518 WKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATG 577
              P  K  K  SRK     D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA G
Sbjct: 215 ELAPYPKNKKATSRK-----DEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAG 261

Query: 578 PLFQQRPYPSPGAVLRKNAEIASQTKK 603
           PLFQQRPYPSPGAVLR NAE AS+TK+
Sbjct: 275 PLFQQRPYPSPGAVLRANAE-ASRTKQ 261

BLAST of MC00g0145 vs. ExPASy Swiss-Prot
Match: Q2HJC9 (Polyglutamine-binding protein 1 OS=Bos taurus OX=9913 GN=PQBP1 PE=2 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 1.7e-18
Identity = 51/76 (67.11%), Positives = 59/76 (77.63%), Query Frame = 0

Query: 529 ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSP 588
           +S+K +  +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSP
Sbjct: 187 KSKKAASRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSP 246

Query: 589 GAVLRKNAEIASQTKK 603
           GAVLR NAE AS+TK+
Sbjct: 247 GAVLRANAE-ASRTKQ 261

BLAST of MC00g0145 vs. ExPASy Swiss-Prot
Match: A1YFA7 (Polyglutamine-binding protein 1 OS=Gorilla gorilla gorilla OX=9595 GN=PQBP1 PE=3 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 3.8e-18
Identity = 51/76 (67.11%), Positives = 58/76 (76.32%), Query Frame = 0

Query: 529 ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSP 588
           +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSP
Sbjct: 189 KSKKAVSRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSP 248

Query: 589 GAVLRKNAEIASQTKK 603
           GAVLR NAE AS+TK+
Sbjct: 249 GAVLRANAE-ASRTKQ 263

BLAST of MC00g0145 vs. ExPASy Swiss-Prot
Match: O60828 (Polyglutamine-binding protein 1 OS=Homo sapiens OX=9606 GN=PQBP1 PE=1 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 3.8e-18
Identity = 51/76 (67.11%), Positives = 58/76 (76.32%), Query Frame = 0

Query: 529 ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSP 588
           +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSP
Sbjct: 189 KSKKAVSRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSP 248

Query: 589 GAVLRKNAEIASQTKK 603
           GAVLR NAE AS+TK+
Sbjct: 249 GAVLRANAE-ASRTKQ 263

BLAST of MC00g0145 vs. ExPASy Swiss-Prot
Match: A2T806 (Polyglutamine-binding protein 1 OS=Pongo pygmaeus OX=9600 GN=PQBP1 PE=3 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 3.8e-18
Identity = 51/76 (67.11%), Positives = 58/76 (76.32%), Query Frame = 0

Query: 529 ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSP 588
           +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSP
Sbjct: 189 KSKKAVSRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSP 248

Query: 589 GAVLRKNAEIASQTKK 603
           GAVLR NAE AS+TK+
Sbjct: 249 GAVLRANAE-ASRTKQ 263

BLAST of MC00g0145 vs. NCBI nr
Match: XP_022154433.1 (uncharacterized protein LOC111021704 [Momordica charantia])

HSP 1 Score: 1244 bits (3220), Expect = 0.0
Identity = 624/624 (100.00%), Positives = 624/624 (100.00%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG
Sbjct: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI
Sbjct: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE
Sbjct: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP
Sbjct: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD
Sbjct: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM
Sbjct: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ
Sbjct: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE
Sbjct: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISKKGDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKKGDGSDGLGDAD 624

BLAST of MC00g0145 vs. NCBI nr
Match: KAG6580903.1 (Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 924 bits (2387), Expect = 0.0
Identity = 481/624 (77.08%), Positives = 517/624 (82.85%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS AA A  GDS  TTIGSSVED  LKESG+AQSQSYAQNEVQEL K G Q S  QPG
Sbjct: 1   MPTSTAAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           E  SSV +SSD                           QE  PS  NDQN V HDGVF I
Sbjct: 61  EVHSSVVISSD---------------------------QEKPPSFGNDQNIVSHDGVFNI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           A SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRS+R+S+ ADG P ERSDIFSE
Sbjct: 121 AVSSSSKFGSHVVDTRDIDNAVRDAVLREQELATQNIIRSRRDSVDADGLPEERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPS LKEHLLKITS+HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSALKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           T GNN I  KIQGQV EAEQ+ + KELPEYLKQKL+ARGILKED +  NS NSDAISNQ 
Sbjct: 241 THGNNTIDQKIQGQVREAEQSPSAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQM 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           +QG+KLP GWVEAKDP SGV YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D
Sbjct: 301 LQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           +TTG KYYYN RT VTQWE PV+SHQ TL HSN + PG WNNQT  Q+KC+ CG GMTL+
Sbjct: 361 QTTGHKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNNQTSGQSKCVTCGSGMTLV 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QGSRYCN      STSSTNG WQ+Q  + +KCMGCGGWGLGLVQ+WGYCNHCTR L LPQ
Sbjct: 421 QGSRYCNCCASGVSTSSTNGKWQDQLSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           C+YLPTS++ NQ  QKTE+I ++AD SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDE
Sbjct: 481 CQYLPTSNIYNQ--QKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 595

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of MC00g0145 vs. NCBI nr
Match: XP_022983732.1 (uncharacterized protein LOC111482260 [Cucurbita maxima])

HSP 1 Score: 922 bits (2384), Expect = 0.0
Identity = 477/624 (76.44%), Positives = 517/624 (82.85%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS AA A SGDS  TTIGSSVED  LKESG+AQSQSYAQNEVQEL K G Q S  QPG
Sbjct: 1   MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           E  SSV + SD                           QE  PS  NDQN VPH GVF I
Sbjct: 61  EVHSSVVIYSD---------------------------QEKTPSFGNDQNIVPHAGVFNI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           A SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRSQR+S+GADG P ERSDIFSE
Sbjct: 121 AVSSSSKFGSHVVDTRDIDNAVRDAVLREQELATQNIIRSQRDSVGADGLPEERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPSTLKEHLLKIT++HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSTLKEHLLKITTEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           T GNN I  KIQGQV E +Q+S+ KELPEYLKQKL+ARGILKED +  NS N+DAISNQ 
Sbjct: 241 THGNNTIDQKIQGQVRETKQSSSAKELPEYLKQKLKARGILKEDAKHSNSANADAISNQM 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           +QG+KLP GWVEAKDP SG  YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D
Sbjct: 301 LQGEKLPHGWVEAKDPGSGASYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           + TG KYYYN RT VTQWE P +SHQ TL HSN   PG WN+QT  Q+KC+ CG GMTL+
Sbjct: 361 QITGHKYYYNRRTQVTQWEPPAASHQATLAHSNVGAPGSWNDQTSGQSKCVTCGSGMTLV 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QGSRYCN      STSSTNG WQ+Q  +L+KCMGCGGWGLGLVQ+WGYCNHCTR L LPQ
Sbjct: 421 QGSRYCNCCASGVSTSSTNGKWQDQPSDLHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           C+YLPTS+++NQH  KTE+I +++D SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDE
Sbjct: 481 CQYLPTSNINNQH--KTENIKNNSDPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 595

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of MC00g0145 vs. NCBI nr
Match: XP_022935213.1 (uncharacterized protein LOC111442162 [Cucurbita moschata])

HSP 1 Score: 921 bits (2380), Expect = 0.0
Identity = 478/624 (76.60%), Positives = 518/624 (83.01%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS AA A  GDS  TTIGSSVED  LKESG+AQSQSYAQNEVQEL K G Q S  QPG
Sbjct: 1   MPTSTAAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           E +SSV +SSD                           QE  PS  NDQN VPHDGVF I
Sbjct: 61  EVRSSVVISSD---------------------------QEKPPSFGNDQNIVPHDGVFNI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           A SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRS+R+S+ ADG P ERSDIFSE
Sbjct: 121 AVSSSSKFGSHVVDTRDIDNAVRDAVLREQELATQNIIRSRRDSVDADGLPEERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPS LKEHLLKITS+HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSALKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           T GNN I  KIQGQV EAEQ+ + KELPEYLKQKL+ARGILKED +  NS NSDAISNQ 
Sbjct: 241 THGNNTIDQKIQGQVREAEQSPSAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQM 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           +QG+KLP GWVEAKDP SGV YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D
Sbjct: 301 LQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           +TTG +YYYN RT VTQWE PV+SHQ TL HS  + PG WN+QT  Q+KC+ CG GMTL+
Sbjct: 361 QTTGHRYYYNRRTQVTQWEPPVASHQATLAHSTVSAPGSWNDQTSGQSKCVTCGSGMTLV 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QG+RYCN      STSSTNG WQ+Q  + +KCMGCGGWGLGLVQ+WGYCNHCTR L LPQ
Sbjct: 421 QGTRYCNCCASGVSTSSTNGKWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           C+YLPTS++ NQ  QKTE+I ++AD SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDE
Sbjct: 481 CQYLPTSNIYNQ--QKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 595

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of MC00g0145 vs. NCBI nr
Match: XP_038906175.1 (uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida])

HSP 1 Score: 916 bits (2367), Expect = 0.0
Identity = 480/624 (76.92%), Positives = 512/624 (82.05%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPT+ AA A SGDS  T IGSSVED+ LKE  AAQSQ + QNEVQEL K GK     Q G
Sbjct: 1   MPTTTAAIAGSGDSSNTIIGSSVEDKSLKELAAAQSQYHPQNEVQELEKLGKHIYPCQQG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           EAQ S                                 QET  S  ND + VPHD VF I
Sbjct: 61  EAQGS---------------------------------QETNRSFGNDPDIVPHDSVFNI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           A SSSSKF SHV DTRDIDSAVQDAVLREQELATQNIIRSQR+S+GADG P ERSDIFSE
Sbjct: 121 AVSSSSKFRSHVNDTRDIDSAVQDAVLREQELATQNIIRSQRDSVGADGLPVERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPST+KEHLLKITS+HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPG+V
Sbjct: 181 RYDPSTIKEHLLKITSEHRAEMAMKRGKPNLPEEGNLEIGNGYGVPGGCAFYGASKPGVV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
             GNN IG KIQGQV E EQ+SA K LPEYLKQKLRARGILKE+ +  NST+SDAISNQ 
Sbjct: 241 AIGNNTIGQKIQGQVRETEQSSAVKALPEYLKQKLRARGILKEEAEHSNSTSSDAISNQT 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           +QG+KLP GWVEAKDP SGV YYYNES+GKSQWERPS+SS D QL SA SLPEDWMEALD
Sbjct: 301 LQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDTQLSSAESLPEDWMEALD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           + TGLKYYYN+RT VTQWE PV+SHQ TLTHSN NV G WNNQT EQ+KCI CG G+TL+
Sbjct: 361 QATGLKYYYNMRTQVTQWEPPVASHQTTLTHSNDNVLGSWNNQTLEQSKCITCGSGITLV 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QGSRYCN  T   STSSTNG WQ+QS E NKCMGC GWGLGLVQ+WGYCNHCTRIL LPQ
Sbjct: 421 QGSRYCNC-TSGVSTSSTNGRWQDQSSEQNKCMGCCGWGLGLVQAWGYCNHCTRILGLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           C+YLPTS++SNQ  QKTE+I HSAD SIKKSA D SKWKPP+GKGGKRESRKRSYSEDDE
Sbjct: 481 CQYLPTSNISNQ--QKTENIKHSADPSIKKSATDGSKWKPPIGKGGKRESRKRSYSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 588

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKRGDGSDGLGDAD 588

BLAST of MC00g0145 vs. ExPASy TrEMBL
Match: A0A6J1DKA8 (Polyglutamine tract-binding protein 1 OS=Momordica charantia OX=3673 GN=LOC111021704 PE=4 SV=1)

HSP 1 Score: 1244 bits (3220), Expect = 0.0
Identity = 624/624 (100.00%), Positives = 624/624 (100.00%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG
Sbjct: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI
Sbjct: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE
Sbjct: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP
Sbjct: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD
Sbjct: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM
Sbjct: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ
Sbjct: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE
Sbjct: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISKKGDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKKGDGSDGLGDAD 624

BLAST of MC00g0145 vs. ExPASy TrEMBL
Match: A0A6J1J063 (Polyglutamine tract-binding protein 1 OS=Cucurbita maxima OX=3661 GN=LOC111482260 PE=4 SV=1)

HSP 1 Score: 922 bits (2384), Expect = 0.0
Identity = 477/624 (76.44%), Positives = 517/624 (82.85%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS AA A SGDS  TTIGSSVED  LKESG+AQSQSYAQNEVQEL K G Q S  QPG
Sbjct: 1   MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           E  SSV + SD                           QE  PS  NDQN VPH GVF I
Sbjct: 61  EVHSSVVIYSD---------------------------QEKTPSFGNDQNIVPHAGVFNI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           A SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRSQR+S+GADG P ERSDIFSE
Sbjct: 121 AVSSSSKFGSHVVDTRDIDNAVRDAVLREQELATQNIIRSQRDSVGADGLPEERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPSTLKEHLLKIT++HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSTLKEHLLKITTEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           T GNN I  KIQGQV E +Q+S+ KELPEYLKQKL+ARGILKED +  NS N+DAISNQ 
Sbjct: 241 THGNNTIDQKIQGQVRETKQSSSAKELPEYLKQKLKARGILKEDAKHSNSANADAISNQM 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           +QG+KLP GWVEAKDP SG  YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D
Sbjct: 301 LQGEKLPHGWVEAKDPGSGASYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           + TG KYYYN RT VTQWE P +SHQ TL HSN   PG WN+QT  Q+KC+ CG GMTL+
Sbjct: 361 QITGHKYYYNRRTQVTQWEPPAASHQATLAHSNVGAPGSWNDQTSGQSKCVTCGSGMTLV 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QGSRYCN      STSSTNG WQ+Q  +L+KCMGCGGWGLGLVQ+WGYCNHCTR L LPQ
Sbjct: 421 QGSRYCNCCASGVSTSSTNGKWQDQPSDLHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           C+YLPTS+++NQH  KTE+I +++D SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDE
Sbjct: 481 CQYLPTSNINNQH--KTENIKNNSDPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 595

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of MC00g0145 vs. ExPASy TrEMBL
Match: A0A6J1F9X9 (Polyglutamine tract-binding protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111442162 PE=4 SV=1)

HSP 1 Score: 921 bits (2380), Expect = 0.0
Identity = 478/624 (76.60%), Positives = 518/624 (83.01%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS AA A  GDS  TTIGSSVED  LKESG+AQSQSYAQNEVQEL K G Q S  QPG
Sbjct: 1   MPTSTAAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFKI 120
           E +SSV +SSD                           QE  PS  NDQN VPHDGVF I
Sbjct: 61  EVRSSVVISSD---------------------------QEKPPSFGNDQNIVPHDGVFNI 120

Query: 121 ACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSE 180
           A SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRS+R+S+ ADG P ERSDIFSE
Sbjct: 121 AVSSSSKFGSHVVDTRDIDNAVRDAVLREQELATQNIIRSRRDSVDADGLPEERSDIFSE 180

Query: 181 RYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240
           RYDPS LKEHLLKITS+HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV
Sbjct: 181 RYDPSALKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV 240

Query: 241 TPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQP 300
           T GNN I  KIQGQV EAEQ+ + KELPEYLKQKL+ARGILKED +  NS NSDAISNQ 
Sbjct: 241 THGNNTIDQKIQGQVREAEQSPSAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQM 300

Query: 301 VQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALD 360
           +QG+KLP GWVEAKDP SGV YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D
Sbjct: 301 LQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 360

Query: 361 ETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLM 420
           +TTG +YYYN RT VTQWE PV+SHQ TL HS  + PG WN+QT  Q+KC+ CG GMTL+
Sbjct: 361 QTTGHRYYYNRRTQVTQWEPPVASHQATLAHSTVSAPGSWNDQTSGQSKCVTCGSGMTLV 420

Query: 421 QGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLPQ 480
           QG+RYCN      STSSTNG WQ+Q  + +KCMGCGGWGLGLVQ+WGYCNHCTR L LPQ
Sbjct: 421 QGTRYCNCCASGVSTSSTNGKWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 480

Query: 481 CEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDE 540
           C+YLPTS++ NQ  QKTE+I ++AD SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDE
Sbjct: 481 CQYLPTSNIYNQ--QKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDE 540

Query: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 600
           LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Sbjct: 541 LDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT 595

Query: 601 KKGSSHYAPISKKGDGSDGLGDAD 624
           KKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 KKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of MC00g0145 vs. ExPASy TrEMBL
Match: A0A5D3DPP7 (Polyglutamine tract-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G006960 PE=4 SV=1)

HSP 1 Score: 915 bits (2365), Expect = 0.0
Identity = 477/625 (76.32%), Positives = 514/625 (82.24%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS  A A SGDS  T IGSS ED+ LKES AAQ      NEVQEL K  KQ    QPG
Sbjct: 1   MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQ------NEVQELEKFSKQIYPCQPG 60

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFK- 120
           EAQ SVA+S+D                           QET  S  NDQN VPH+GVF  
Sbjct: 61  EAQCSVAISAD---------------------------QETNQSFGNDQNIVPHEGVFNN 120

Query: 121 IACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFS 180
           IA S+SS F S+V D RDI+ AVQDAVLREQELATQNIIRSQRES+GADG P+E+SDIFS
Sbjct: 121 IAVSTSSNFRSNVDDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFS 180

Query: 181 ERYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGI 240
           ERYDPST+KEHLLKITS+HRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCA YGASKPGI
Sbjct: 181 ERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASYGASKPGI 240

Query: 241 VTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSTNSDAISNQ 300
           V  GNN  G KIQGQV E EQ+SA K LPEYLKQKLRARGILKED +  N TNSDA+SN 
Sbjct: 241 VANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNT 300

Query: 301 PVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEAL 360
            + G+KLP GWVEAKDP SGV YYYNES+GKSQWERPS+ S D QL SAVSLPEDWMEA+
Sbjct: 301 KLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAI 360

Query: 361 DETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTL 420
           D+T+GLKYYYN+RTH+TQWE PV+SHQ TLTHSN  VPG WN+QT EQ+KCI CG GMTL
Sbjct: 361 DQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTL 420

Query: 421 MQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRILRLP 480
           +QGSRYCN+ T   STSSTNG WQ+QS E NKCMGCGGWGLGLVQ+WGYCNHCTRIL LP
Sbjct: 421 VQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP 480

Query: 481 QCEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDD 540
           QC+YLPT+++SNQ  QKTE+I HSAD SIKKSA DRSKWKPPMGKGGKRESRKRSYSEDD
Sbjct: 481 QCQYLPTNNISNQ--QKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDD 540

Query: 541 ELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQ 600
           ELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQ
Sbjct: 541 ELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQ 590

Query: 601 TKKGSSHYAPISKKGDGSDGLGDAD 624
           TKKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 601 TKKGSSHYAPISKRGDGSDGLGDAD 590

BLAST of MC00g0145 vs. ExPASy TrEMBL
Match: A0A0A0LFL2 (Polyglutamine tract-binding protein 1 OS=Cucumis sativus OX=3659 GN=Csa_3G822240 PE=4 SV=1)

HSP 1 Score: 910 bits (2353), Expect = 0.0
Identity = 476/629 (75.68%), Positives = 515/629 (81.88%), Query Frame = 0

Query: 1   MPTSNAAAAVSGDSFITTIGSSVEDRPLKESGAAQSQSYAQNEVQELVKSGKQDSSSQPG 60
           MPTS A  A SGDS  T IGSS ED+ LKES AAQSQ  AQNEVQEL KS KQ    QPG
Sbjct: 103 MPTSTAGIAGSGDSSNTIIGSSAEDKSLKESAAAQSQYRAQNEVQELEKSSKQLYPCQPG 162

Query: 61  EAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRVNDQNNVPHDGVFK- 120
           EAQ +VA+ +D                           QET  S  NDQN VPH G F  
Sbjct: 163 EAQGAVAIPAD---------------------------QETNRSSGNDQNIVPHHGTFNN 222

Query: 121 IACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFS 180
           IA SSSS F S+V D RDID AVQDAVLREQELATQNIIRSQR+S+GADG P ERSDIFS
Sbjct: 223 IAVSSSSNFRSNVDDARDIDIAVQDAVLREQELATQNIIRSQRDSVGADGLPVERSDIFS 282

Query: 181 ERYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGI 240
           ERYDPS+LKEHLLKITS+HRAEMA+KRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGI
Sbjct: 283 ERYDPSSLKEHLLKITSEHRAEMAIKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGI 342

Query: 241 VTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNS----TNSDA 300
           V  GNN  G KIQGQ+ EAEQ+SA+K LPEYLKQKLRARGILKED +  NS    TNSDA
Sbjct: 343 VANGNNVTGQKIQGQIKEAEQSSASKALPEYLKQKLRARGILKEDAEHSNSVRADTNSDA 402

Query: 301 ISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDW 360
           +SN  +QG+KLP GWVEAKDP SGV YYYNES+GKSQWERPS+ S + QL SAVSLPEDW
Sbjct: 403 VSNTKLQGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSNTQLSSAVSLPEDW 462

Query: 361 MEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGR 420
           MEA+D+T+G+KYYYN+RTHVTQWE PV+SHQ TLTHSN   PG WN+QT EQ+KCI CG 
Sbjct: 463 MEAIDQTSGVKYYYNMRTHVTQWERPVASHQTTLTHSNDKFPGPWNDQTLEQSKCITCGS 522

Query: 421 GMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYCNHCTRI 480
           GMTL+QGSRYCNS T   STSSTNG WQ+Q  E NKCMGCGGWGLGLVQ+WGYC HCTRI
Sbjct: 523 GMTLVQGSRYCNSCTSGVSTSSTNGIWQDQPSEQNKCMGCGGWGLGLVQAWGYCIHCTRI 582

Query: 481 LRLPQCEYLPTSSVSNQHQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSY 540
           L LPQC+YLPT+++SNQ  QK E++ HSAD SIKKS  DRSKWKPP+GKGGKRESRKRSY
Sbjct: 583 LGLPQCQYLPTNNISNQ--QKIENVKHSADPSIKKSVTDRSKWKPPIGKGGKRESRKRSY 642

Query: 541 SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAE 600
           SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAE
Sbjct: 643 SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAE 702

Query: 601 IASQTKKGSSHYAPISKKGDGSDGLGDAD 624
           IASQTKKGSSHYAPISK+GDGSDGLGDAD
Sbjct: 703 IASQTKKGSSHYAPISKRGDGSDGLGDAD 702

BLAST of MC00g0145 vs. TAIR 10
Match: AT2G41020.1 (WW domain-containing protein )

HSP 1 Score: 407.1 bits (1045), Expect = 2.5e-113
Identity = 242/518 (46.72%), Positives = 312/518 (60.23%), Query Frame = 0

Query: 123 SSSSKFGSHVG--DTRDIDSAVQDAVLREQELATQNIIRSQRES-LGADGPPSERSDIFS 182
           +S+  +GS +    ++DI+SA   A+LREQE+ TQ II+ QRE+     G     +DI  
Sbjct: 15  TSNYGYGSSLAYDQSQDIESAANTALLREQEIETQKIIQGQREAGTSVAGDSKHNTDILR 74

Query: 183 ERYDPSTLKEHLLKITSDHRAEMAMKRGKS-NLPEEGNLEIGNGYGVPGGCAFYGASKPG 242
           +R DP+ LKEHLLK T++HRAE A KRG S +   EGN+++GNGYG+PGG A+ G S   
Sbjct: 75  DRADPNALKEHLLKFTANHRAEAAAKRGGSVSTCGEGNVDVGNGYGIPGGVAYAGHS--- 134

Query: 243 IVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKE--DTQKRNSTNSDAI 302
                      ++ G   + E  +A+  LPEYLKQKL+ARGIL++       N  ++ A+
Sbjct: 135 -----------ELSG---KPEPTNASNNLPEYLKQKLKARGILRDGAGAVTSNPEDTSAV 194

Query: 303 S-----NQPVQGD--KLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAV 362
           S       P Q +   LP GWV+AKDP SG  YYYN+ TG  QWERP + S+       V
Sbjct: 195 SWNRQATLPFQANASTLPLGWVDAKDPASGATYYYNQHTGTCQWERPVELSYATSSAPPV 254

Query: 363 SLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNK 422
              E+W+E  DE +G KY+YN RTHV+QWE P S  +   T+SN  V             
Sbjct: 255 LSKEEWIETFDEASGHKYFYNTRTHVSQWEPPASLQKPAATNSNNAV------------- 314

Query: 423 CIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYC 482
                                   + S+ NG  +    +L +C GCGGWG+GLVQ WGYC
Sbjct: 315 ------------------------TQSTANGKGEHPPSQLPRCSGCGGWGVGLVQRWGYC 374

Query: 483 NHCTRILRLPQCEYLPTSSVSNQHQQKTESIDHSADA--SIKKSAMDRSKWKPPMGKGGK 542
            HCTR+  LP+ ++LP              ++H  +A  S +K    RS  KPPM    K
Sbjct: 375 VHCTRVFNLPEKQFLPA------------HLNHFTNAGDSGQKDPNQRSSSKPPM---KK 434

Query: 543 RESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPG 602
              +KR+++EDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTA+GPLFQQRPYPSPG
Sbjct: 435 VIGKKRAHAEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTASGPLFQQRPYPSPG 463

Query: 603 AVLRKNAEIA-SQTKKGSSHYAPISKKGDGSDGLGDAD 625
           AVLR+NAE+A SQ KK +S +  I+K+GDGSDGLGDAD
Sbjct: 495 AVLRRNAEVASSQKKKPNSQFTEITKRGDGSDGLGDAD 463

BLAST of MC00g0145 vs. TAIR 10
Match: AT2G41020.2 (WW domain-containing protein )

HSP 1 Score: 259.2 bits (661), Expect = 8.3e-69
Identity = 153/376 (40.69%), Positives = 208/376 (55.32%), Query Frame = 0

Query: 123 SSSSKFGSHVG--DTRDIDSAVQDAVLREQELATQNIIRSQRES-LGADGPPSERSDIFS 182
           +S+  +GS +    ++DI+SA   A+LREQE+ TQ II+ QRE+     G     +DI  
Sbjct: 15  TSNYGYGSSLAYDQSQDIESAANTALLREQEIETQKIIQGQREAGTSVAGDSKHNTDILR 74

Query: 183 ERYDPSTLKEHLLKITSDHRAEMAMKRGKS-NLPEEGNLEIGNGYGVPGGCAFYGASKPG 242
           +R DP+ LKEHLLK T++HRAE A KRG S +   EGN+++GNGYG+PGG A+ G S   
Sbjct: 75  DRADPNALKEHLLKFTANHRAEAAAKRGGSVSTCGEGNVDVGNGYGIPGGVAYAGHS--- 134

Query: 243 IVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKE--DTQKRNSTNSDAI 302
                      ++ G   + E  +A+  LPEYLKQKL+ARGIL++       N  ++ A+
Sbjct: 135 -----------ELSG---KPEPTNASNNLPEYLKQKLKARGILRDGAGAVTSNPEDTSAV 194

Query: 303 S-----NQPVQGD--KLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAV 362
           S       P Q +   LP GWV+AKDP SG  YYYN+ TG  QWERP + S+       V
Sbjct: 195 SWNRQATLPFQANASTLPLGWVDAKDPASGATYYYNQHTGTCQWERPVELSYATSSAPPV 254

Query: 363 SLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNK 422
              E+W+E  DE +G KY+YN RTHV+QWE P S  +   T+SN  V             
Sbjct: 255 LSKEEWIETFDEASGHKYFYNTRTHVSQWEPPASLQKPAATNSNNAV------------- 314

Query: 423 CIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQSWGYC 482
                                   + S+ NG  +    +L +C GCGGWG+GLVQ WGYC
Sbjct: 315 ------------------------TQSTANGKGEHPPSQLPRCSGCGGWGVGLVQRWGYC 336

Query: 483 NHCTRILRLPQCEYLP 486
            HCTR+  LP+ ++LP
Sbjct: 375 VHCTRVFNLPEKQFLP 336

BLAST of MC00g0145 vs. TAIR 10
Match: AT3G19840.1 (pre-mRNA-processing protein 40C )

HSP 1 Score: 48.1 bits (113), Expect = 2.9e-05
Identity = 29/76 (38.16%), Positives = 39/76 (51.32%), Query Frame = 0

Query: 313 AKDPDSGVLYYYNESTGKSQWERP------SDSSFDLQLP-SAVSLPEDWMEALDETTGL 372
           A   ++GVLYYYN  TG+S +E+P       D      +P S  SLP      +    G 
Sbjct: 251 AHKSEAGVLYYYNSVTGQSTYEKPPGFGGEPDKVPVQPIPVSMESLPGTDWALVSTNDGK 310

Query: 373 KYYYNVRTHVTQWEHP 382
           KYYYN +T V+ W+ P
Sbjct: 311 KYYYNNKTKVSSWQIP 326

BLAST of MC00g0145 vs. TAIR 10
Match: AT1G44910.1 (pre-mRNA-processing protein 40A )

HSP 1 Score: 43.1 bits (100), Expect = 9.4e-04
Identity = 33/104 (31.73%), Positives = 46/104 (44.23%), Query Frame = 0

Query: 281 LKEDTQKRNSTNSDAISNQPVQGDKLPRG---WVEAKDPDSGVLYYYNESTGKSQWERPS 340
           L    Q+       A+S  P  G+  P+    W E    D G  YYYN+ T +S WE+P 
Sbjct: 160 LVSPVQQTGQQTPVAVSTDP--GNLTPQSASDWQEHTSAD-GRKYYYNKRTKQSNWEKPL 219

Query: 341 DSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP 382
           +    L+   A ++   W E      G KYYYN  T  ++W  P
Sbjct: 220 ELMTPLERADASTV---WKE-FTTPEGKKYYYNKVTKESKWTIP 256

BLAST of MC00g0145 vs. TAIR 10
Match: AT1G44910.2 (pre-mRNA-processing protein 40A )

HSP 1 Score: 43.1 bits (100), Expect = 9.4e-04
Identity = 33/104 (31.73%), Positives = 46/104 (44.23%), Query Frame = 0

Query: 281 LKEDTQKRNSTNSDAISNQPVQGDKLPRG---WVEAKDPDSGVLYYYNESTGKSQWERPS 340
           L    Q+       A+S  P  G+  P+    W E    D G  YYYN+ T +S WE+P 
Sbjct: 160 LVSPVQQTGQQTPVAVSTDP--GNLTPQSASDWQEHTSAD-GRKYYYNKRTKQSNWEKPL 219

Query: 341 DSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP 382
           +    L+   A ++   W E      G KYYYN  T  ++W  P
Sbjct: 220 ELMTPLERADASTV---WKE-FTTPEGKKYYYNKVTKESKWTIP 256
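
The percentages on each HSP statistics line follow directly from the residue counts (for example, 242 identical positions out of 518 aligned gives 46.72%). As a minimal sketch, assuming the line layout shown in the alignments above, the counts can be parsed and the percentages recomputed as a sanity check. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative, not part of any BLAST tool.

```python
import re

# Pattern for the per-HSP statistics line as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, _, positive, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": length,
        # Percentages are recomputed from the counts, rounded to 2 decimals.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positive / pos_length, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 153/376 (40.69%), Positives = 208/376 (55.32%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing from the counts rather than trusting the printed percentage makes it easy to spot a line that was damaged during copy and paste.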

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q91VJ5          1.0e-18   31.84         Polyglutamine-binding protein 1 OS=Mus musculus OX=10090 GN=Pqbp1 PE=1 SV=1
Q2HJC9          1.7e-18   67.11         Polyglutamine-binding protein 1 OS=Bos taurus OX=9913 GN=PQBP1 PE=2 SV=1
A1YFA7          3.8e-18   67.11         Polyglutamine-binding protein 1 OS=Gorilla gorilla gorilla OX=9595 GN=PQBP1 PE=3...
O60828          3.8e-18   67.11         Polyglutamine-binding protein 1 OS=Homo sapiens OX=9606 GN=PQBP1 PE=1 SV=1
A2T806          3.8e-18   67.11         Polyglutamine-binding protein 1 OS=Pongo pygmaeus OX=9600 GN=PQBP1 PE=3 SV=1

Match Name      E-value   Identity (%)  Description
XP_022154433.1  0.0       100.00        uncharacterized protein LOC111021704 [Momordica charantia]
KAG6580903.1    0.0       77.08         Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. sororia]
XP_022983732.1  0.0       76.44         uncharacterized protein LOC111482260 [Cucurbita maxima]
XP_022935213.1  0.0       76.60         uncharacterized protein LOC111442162 [Cucurbita moschata]
XP_038906175.1  0.0       76.92         uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida]

Match Name      E-value   Identity (%)  Description
A0A6J1DKA8      0.0       100.00        Polyglutamine tract-binding protein 1 OS=Momordica charantia OX=3673 GN=LOC11102...
A0A6J1J063      0.0       76.44         Polyglutamine tract-binding protein 1 OS=Cucurbita maxima OX=3661 GN=LOC11148226...
A0A6J1F9X9      0.0       76.60         Polyglutamine tract-binding protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111442...
A0A5D3DPP7      0.0       76.32         Polyglutamine tract-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A0A0LFL2      0.0       75.68         Polyglutamine tract-binding protein 1 OS=Cucumis sativus OX=3659 GN=Csa_3G822240...

Match Name      E-value   Identity (%)  Description
AT2G41020.1     2.5e-113  46.72         WW domain-containing protein
AT2G41020.2     8.3e-69   40.69         WW domain-containing protein
AT3G19840.1     2.9e-05   38.16         pre-mRNA-processing protein 40C
AT1G44910.1     9.4e-04   31.73         pre-mRNA-processing protein 40A
AT1G44910.2     9.4e-04   31.73         pre-mRNA-processing protein 40A

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term   IPR Description        Source       Source Term    Source Description   Alignment
IPR001202  WW domain              SMART        SM00456        ww_5                 coord: 350..383; e-value: 1.2E-9; score: 48.0
IPR001202  WW domain              SMART        SM00456        ww_5                 coord: 305..338; e-value: 1.8E-10; score: 50.8
IPR001202  WW domain              PFAM         PF00397        WW                   coord: 351..381; e-value: 6.5E-11; score: 42.1
IPR001202  WW domain              PFAM         PF00397        WW                   coord: 306..336; e-value: 5.2E-14; score: 52.0
IPR001202  WW domain              PROSITE      PS01159        WW_DOMAIN_1          coord: 310..336
IPR001202  WW domain              PROSITE      PS01159        WW_DOMAIN_1          coord: 355..381
IPR001202  WW domain              PROSITE      PS50020        WW_DOMAIN_2          coord: 349..383; score: 12.412801
IPR001202  WW domain              PROSITE      PS50020        WW_DOMAIN_2          coord: 304..338; score: 13.7918
IPR001202  WW domain              CDD          cd00201        WW                   coord: 352..382; e-value: 7.8837E-7; score: 43.6706
IPR001202  WW domain              CDD          cd00201        WW                   coord: 307..338; e-value: 2.16573E-8; score: 48.293
None       No IPR available       GENE3D       2.20.70.10     -                    coord: 302..341; e-value: 6.5E-12; score: 46.9
None       No IPR available       GENE3D       3.40.30.10     Glutaredoxin         coord: 516..603; e-value: 5.7E-24; score: 86.9
None       No IPR available       GENE3D       2.20.70.10     -                    coord: 347..399; e-value: 9.7E-11; score: 43.1
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 27..73
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 281..310
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 498..540
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 159..179
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 31..73
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 490..624
None       No IPR available       PANTHER      PTHR21737:SF3  POLYGLUTAMINE-BINDING PROTEIN 1  coord: 74..430; coord: 436..624
None       No IPR available       PANTHER      PTHR21737      POLYGLUTAMINE BINDING PROTEIN 1/MARVEL MEMBRANE-ASSOCIATING DOMAIN CONTAINING 3  coord: 74..430; coord: 436..624
IPR036020  WW domain superfamily  SUPERFAMILY  51045          WW domain            coord: 299..338
IPR036020  WW domain superfamily  SUPERFAMILY  51045          WW domain            coord: 346..383
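
Several member databases report slightly different coordinates for the same two WW domains. A minimal sketch of one way to collapse the IPR001202 hits listed above into consensus spans is to merge overlapping ranges. The helper `merge_ranges` is a hypothetical name, and merging on simple overlap is an assumption made for illustration, not how InterPro itself integrates signatures.

```python
def merge_ranges(ranges):
    """Merge overlapping (start, end) residue ranges into sorted, disjoint spans."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous span: extend it if this hit reaches further.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


# IPR001202 (WW domain) coordinates copied from the table above:
# SMART, PFAM, PROSITE (PS01159 and PS50020) and CDD hits.
ww_hits = [
    (350, 383), (305, 338),  # SMART SM00456
    (351, 381), (306, 336),  # PFAM PF00397
    (310, 336), (355, 381),  # PROSITE PS01159
    (349, 383), (304, 338),  # PROSITE PS50020
    (352, 382), (307, 338),  # CDD cd00201
]

if __name__ == "__main__":
    for start, end in merge_ranges(ww_hits):
        print("WW domain span: %d..%d" % (start, end))
```

Under these assumptions the ten hits collapse to two spans, matching the two WW domains the individual databases each report.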

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MC00g0145.1   MC00g0145.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0000380      alternative mRNA splicing, via spliceosome
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0016604      nuclear body
molecular_function  GO:0005515      protein binding
molecular_function  GO:0043021      ribonucleoprotein complex binding