Cp4.1LG03g08750 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g08750
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Polyglutamine tract-binding protein 1
Location: Cp4.1LG03: 2671329 .. 2685561 (+)
RNA-Seq Expression: Cp4.1LG03g08750
Synteny: Cp4.1LG03g08750
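The location entry in the overview above can be split into sequence ID, start, end, and strand, from which the genomic span follows. The snippet below is a minimal sketch, not part of the database page: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Parse a 'SEQID: START .. END (STRAND)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG03: 2671329 .. 2685561 (+)")
print(seqid, strand, span_length(start, end))
```

Under that coordinate assumption, the gene spans 14,233 bp on the plus strand of Cp4.1LG03.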
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGGTCAATTTCCGGACAGTTTTGGGAACCGCCAAGAAATACAACCAGGTTTTTCCAATTCGGCGCTCTATTATATCCTTTCAATTCGACGCATACCGTCGATCCGCATCAGCAATCCGCCATACTCAAATCTGCCTCAAATTCCCACAAGCAGCTGGTCCAGTACAGCAAAGCTTTCAAGATCCAAGCCCTTCCGGGGATGCCGACTTCTACTGCAGCAATTGCAGATTCAGGAGATTCGTCCAAAACTACAATTGGTTCTAGTGTTGAAGATAGTTCTCTCAAGGAATCAGGTTCTGCTCAATCTCAATCTTGTGCCCAAAATGAAGTGCAAGAACTTGAAAAGTTTGGCAACCAAATTTCTCCTTGTCAACTGGGAGAAGTTCATAGTTCTGTAGTAATCTCTTCTGATCAAGAGAAACCCCTAAGTTTTGGAAATGATCAGAACATTGTTTCCCATGATGGTGTGTTTAATATTGCTGTCTCGGCTTCTAGCAAATTCGGGTCACATGTCGATACCAGAGACATTGATAATGCTGTTCGGGATGCCGTGTTGAGGGAACAGGTATGCACGTTTAATATGTTTCAGTTTTTCAAATTCCTCTAAATGTTGTCAGCTTAGTTCTGAATACTATGACGGCGCTCTCTTCCATTGCTATCCCCATTCAGCCTGACTCTTTTTGCTCTCTGCATAGCCCTCATGTTATGTTTCATTTTTGATAGATGTTTTAGTTAATCTTCTTTTACAATTTTCTTCCTCCTATTTTCGATGGCGCTTCCCACTCATCACCTTCGTTCTGATGGGGACTAATTTTATGGTTTTCTAGAGTAGAGAGAATACATTTGGACGTTGCAAATAACATGAGTTAAATTAGAATTTGTGTACCGGATTGTAGTGACCAAAATGGTAGTTTAATAATAATAATAAAAAAAAAGTTCAGAACTTCTGTGGGGGATGGAGGAATGAAATGTTGCGTCTTCTTCCAAAAGCAATATGGAAGCAGCTTCTTCTGAAGATAATGAATCTGTTCATCAACGCGTTGATTTGTTGAAGTCAACATTTATTTCGTTGGGTTCGAAAAGGGTTCTTGGAAGAAGCCTTTTTAGTCTCTATTTGATATAATTTTCTTAGAAGAATCGGGAGGCTTTTATTGTTCTTTCTCATTAAGATGTTGGATCACTTAAGTCAACATTTGTATTGTTGGTTCAAATTTCAAAAGGTTTCTTGGAAGAAGCCTTTTTTCGTCCAACATGTGCTCTAGTTTTTTATTGGCATTTATGGTTACTCCAAAATTTCTTTTATTTCAGATAACTTCGATTTTAGCTTGTTCGGTTTTCATTAATGTTTACGAGGAAGAAGATTGTAGTCGTTTCTTAGTTCTTGATTTTGTAACCTTTGAACACTTTTTGCTTTTCATTATATCAATGAGATGCTTTTGTTGTTAAAAAGAAAAAGAAAAAGAAAGAAACGGATGTAGGTTCCTGCCAACTAACTTCAATTGGTATCCAATCTCTCCCCAAAAAGCTAAATATCTTGAAAAAGCTTTTGTAGAGCAGAAATTTGTTAGGCTATGGGGACAAACAAAATGGGGCCAGATGGTTTTACCGTTGAATTCCATAAGAAGTCTTGGAACGTCCCTTAACCTGAAATAATGAGTATTCCAAGATTTTCAAGAATTGTATCATCAATGCTAGCTTGAGCGAGACCTATATTTTCCTCATTCCAAAGAAATCGGATGCTAGAAATGTGGGAGACTACCACCCAATTATCCGCATATCATGTATGTACAAGATCATTGCTAGGATTCTCTCTTAAAGGCTTAAGAAGGCAATTCCTTATACTATCACTGAACAACAATCAGCTTTTGTGTCTAGTAGACAAATACTTGATGCCTGTTTAATAGCCGATGAGCTCATTGATGATTGGAAATGGAAAAAGAAGCAAGGGCTGATCATAAAACTGGACGTTGAAAAAGCCTTTTATAAGGTGAATTTGGGATG
TTCTTGAAGAAATTTTTGTGGGTAATGGTTTTGGTCCAACGTGGAGGAAATGGATTTGGGGTTGTTATCCTCTCCTAATTTTTCAATCATCATCAACAACCGTTTGAGAGGAAAGATCCTGGCTACTGGAGGCCTCCATCAAGGTGGCCCCCTCTCCCCCTTCCTCTTTATTTTGGTTGTGGATTGCTTTAGTCTTCTTATGGTTCATGCTTCAAATAGAAAAGTCATTCGAGGTTTTTCTATGGGAAGAAGTTCCCTAGATATGTATGCATCACCTATAGTTAGCCGATTACATTATGTTGTTCTCATCTAAAGAACAAGTGCTTGATAATTTTTTCAACATTGTGCAGCTATTTGACGTAGCATCATGGCTTGATGTTAGGTGGAAAGAAAGTTCCTCTCATGGAAAAACAATCATATCTCAAAGGGTGGAAGACTGCAACTCTTGCCAATCCTCCGTCCAACTTATTATATGGCCATCTATAGAATGCCAACCAAAGTGAGTAACATTATGGATCAATTGCCCATAAATTTCCTCTGGAAAGGTAACAATGATGGCAAGGGTCTTCATCTTGTCAAATGGGATAATCTCCTTCACCCCTTGGAGAAAGGAGAGCTTGGTTTACACAAATTGAAAGATCGAAATGGAGCCCTCGTTGCAAAATGGATTTGGTGTTACAATGTGAGAAAACAGTTCTTTAGAGGATGGTCATTGATACAAAATACGGGACTACCCTTCTCAACAAAAAATCGGGCAGCAACTCCTTGGAACAGCCAAGGGTCCATTGAAAGCCATTATGAAACATTTCTATCTGATTCACGACAGATTATCCTACACTGGGTGATGGGGCAAATATTCCCTTCTTGCAGGACATATGAATTGGAAATGACTCTTGCAAGAAAATTTCCCTTGATTTTTTTTTTTTTTTTTTTTTTTTTAGTAAAGATGGGGCTATCAAGGATTTCTGGAACACAGAGTTTGGCTTTTGGAACTTGAGATTGAGATGAAACCAGAGACAGAAATTGACGAATTTGGGCCTACTTCTTACACTGCCTGTCCACTGTTTCCTTATTTCACAGTTCTTAATTAGGAATTTGGAACCTAGAGACAAATAGTGTCTTGTCCAACAGCTCTCTCTTGCGTGATATAGCCCCCTCCAATACAGAAACTGAATTTTCTCTTTATGATTAGATTAGGAAAGGACACTACCCAAAAAATATCATTTTTCCCATGGGAGCTATCTCACAAAGCTATTAATATACTCTTGAAAAGTTAAAGAGAAGACTGCCTGAGATGGGTTTGTCTCCACAATGGTGCTTTATACGCAAACAGAAATCAGAATCTCAAAGTCACCTCTTGGTGTCATGTAAATTTGCGAAGACCTTTGGAACTATCTCCTCTCTTCATTTAACTGGTATGCGGACATTTCGGAAAATCCCCATGTCCTCGCTTACACCCTTGGTGGGCATCCCTTAAAAAAGGAGAAAAAAGACGATTTGGGAGAATTATATAAGAGCCTTCTTCTGGAATTTGTGGCCAAAAAGAAATAGAAGATCTTGAGAAAGAGCTTTCTTTTATAGATTTTTTTTATGCTATAGTTAATACGGTTGTGAATTGGTGTAATGTTCTCCCTTATTTCACCACTATAGCTTTACCATACTCCTTTGTAACTCCTTTGTGGGGTCTCCCTGCTTCTGTAATTTTTATACCATCAATGAAATTGTTTTTGACAAAAAAAAGTTGTCAGAAGAGGTCCATCCAACCCTAACATAATTGTTTTTTTTTTTTTCATATTATTTATACATTTTCAATCTATTTTGAAGTACATATATATTCTTTTGAAAAATGTATCTTCAACGTGTTGTCCTAATTTTTTAGAATTGACATGTTTTTGTATTCGTGTCATGTTGTATATGTTTTCCATGTCCATGTCTAACCTTTTTAATGGGTAGATTTCGTTGAGGGAGACGAGAGTATTCAATACAAAGAGAAACCTGCA
CCACAAAAATGTGGGGATATTGGTTCTCTTTCAGCTTCCTCATGGTGCCTGAAGTTTCCCTTGCATATGAATTGGAGAGGATTATATATTTATTTTAATTCTCCTTTAGATTTCTTTAGCTCTCCTCTTGTAATTTTATTCATATTCCGGACACTCTCTTATCTATCTAAATAAATCATTTTAGTTTTTCAAATGCTTTAATTAGTTTTTTCCCTAACAGGAACTTGCTACCCAAAATATTATTCGTAGCCGAAGGTGATGGTTCTTCATTTGTTTTTGAGTTATTACAGTTCAAAATTAATAATGGAAACTAAATATATTTTCTCTTCCCAGAGACTCCGTGGATGCAGATGGACTTCCAGAGGAGAGATCAGATATCTTTTCAGAACGTTATGACCCAAGTGCTCTTAAAGTAAGTAATTTTCTACCTTCATACAACCTTCATTCATAAATTTTCTTTAGAATGTTGGATAGCTACACCTTCAACAGAGTTAGCTGTATGTCATCACTGCCACCATGGTCCCTATTTATTTACTCTCTTGAGAAACAGTTGTCTTGTATTGAGTCTATGAATGTACAACAGGTGGACTTGGATGGTGTAGTACAATGCGAGAACAATAGGATTCTCTTAAGTAAGACTGATTATTACTTTTTAAGCATTAGTCTAACCAATAAACTTCATAAGGATTAATGCATTAGTCCTGTGGGTAAGTTCTTGGGTTCTCTTGCTCATACTGGCTTAGAGGTTCCAAGTTTGAACCTTTGAGTGAGTTCAATACCAAAAACCTTCGATGTCACATGGGTCTGGTCCTTGGGGCGGGTGCGGGTGCCTAAGCTCCAACTCCCGATTATGTATGTAAAATACTTCTCGAAGAATTGTGAGAGCCATGGATATCTTCTTTGGACTCTTTCATTCGTGTTGATAATGCATGACCTGTTATACACACAAAAGCTTATTATTCATTATCATATTTTGTAGCTTTGATGTATTTTTCCTTCATTAATGTTACTTCAGGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGTTGAATCTTCCCGAAGAAGGTTGGTACTTTGTAATCTTTCTATTGAACTTTTCTCTTCACTTGCTCTTGTTGTCGCTCAATATTAGTTAATACAGAATGGGCCAAATTAAGCCTTAACCTTGCCCCAGTGATATTGTCAAATGAAGCAAACTTATTGACTTGGCTCTCCAGCACTGACGGGGTCTACTCTACAAAATCCTTCATGATGGACATAGGTGAAAAAATAGAAGCAATAAATCCCACACTAGCAAAGACAATATGGAAAGGACATCACCCTAAAAATAGAAGCAATAAATCCCACACTTTCAAGGAGAATGACATCTTGTTCTCATGCAGGGAACTTGGAAATTGGAAATGGTTATGGCGTACCTGGTGGATGTGCTTTCTATGGGGCTTCAAAGCCTGGAATTGTTACCCACGGTAAAAACCTTTTCTTTTTGCTTGACTTCGATGTTGACCTCCAAATTGGTTCATTTCATTGTTGTCATGCTTTCAGGAAATAATACGATTGACCAGAAAATCCAGGGACAGGTTAGGGAAGCAGAACAAAGTTCTTCTGCCAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAAAGCTAGGGGTATTCTTAAAGAAGATGCAAAACATAGCAATTCTGTAAGAATTGATATGCCTTCGTATTCTTTTTTTTTTTTTTTTTTTCAATGAAAGCTTGGTTATTCATTAAAAGAAATGATATGCCAAACTTCCTAAAACAAGCCTTGGCAGCTGCGTCTTAGTCTTATTAGTTAGCCACAGACCAATTTGACCCGGTAAAACCGATATTACCTGTTCTTCCTGCTGAGTAAACACTTCAATTTTCTCCTTTCTTATCTTTAAAAAGCAGATTTATAAAGAAAAGCTTATGCACCTAAATTTCGTTCAGATTCATAAATAGTATTTT
GTATAATGTTTGGGATTTGAATTTGCGGCCTCCTGATAGCAATTATAGTTCAATTTTCCTTCGCATACTCTTTATTCAATTGCTTCTCTCAATCAGTTATCAGAACAGTAACCGGACCTTAAATTAAATTACAGGCAAATTCTGATGCTATTTCAAATCAAATGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTGTGTACTGTTTGCAATTTCCTTACCAATTCTATGTTGTGTTCTCTTTCACTATCCAACTTATGTTAGTATTGCCTATTCCATCTTGTGAGTTAGGTTGCTTTTCTTTGATATCGATTGTTATGTGTTTCGTGACTGAGTTCGAATTGGATGTATTTAACAAAATATTTCCAACGAGTAAGAATGTAACATGCTTGACTATTTATTCTAGAGTAATGTTAGTTTAACCATTTTTCCCTATGTACTTGATCAATAAATATACTAAATCTACTTTGTTATAATATATCATACATTTAAATATTACTAGCCTCTGTTTCATATGAACAATAATCATAATATTAGTGCATATGATCCCAAGGGGCGGTGTCGTCGGGTACCTGGTCTCTGATGTCACTTGTTTGAATCTTTGGGTAAGATTGAAACAATTGTTTTTTTTTTTTTTGTTTTTTTTTTTTATGTGTTTGGACCCTTGGCGGGTGTGGACTGTTAAAGAATAGTGTGTTAGATGTTTTAGTCAATGTAGCACTTTAGTTTCAGAAATAATATTATTTAATATCAACTATAATTATGTTATTTTGGATTACAGTAGATTACTTCATTGTCACACCAGTCTGGTGTTTGCTAAAACAGTTAAGTTTGTGAAGTGAATGAGTTTTGACAGAATAATTACGGAAGGTCAGTTTCCCATATGTGAGTGAGTATACCTAGCGTCTAGGGGACCTTGTTATGGGGTTGAGGCGGCCATTACTGTAATAGGGACTTCAGATTTTTTTTTTTTCTTGTAATTTGTTTGTATCATATATGGTTGTTGGTAAACCATCAACTGGATGGGAATATTGTAATTTATGCTTTTTTATAATTAAGAGTAAAATACACTTTTTGGTCCCTAAAATTTGAACGTAATTTCCATTTGATCCGAGAGTTTTGAAGCTAACACTTTTAATTCTTAAGATTTGAATTTAGTTTCTATTTGATCTTTTGAGATTTCAAAAGTGACATGTTTGGTTTTCAACATCTAAATTTTATTTCTAATTGGCGCCTGGTATTTGAAATTTGTTTAAATGCCCATGTAAAGAAAATAAAAAATAGAAGACTGTTTCAACAAAGTCCACTGCAATGACATTATTTTGTTATATCAACTCGCAGACAACTTTATTTAGGAACCTATTTTAAACCTCAGGACTAGTTAAAAATCAAATTCAAACCTCGGAGTAAACATTGTTCGAAATTTCAAGGAACAAATACAAACTAAATTTAAAAATCAAGAATCAAATGTGTTAGCCTTAAAATCCGTTTGACTAGCAGAAGTTTAGTATCCCAGGGACCAAAAGTATGAGTTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANAGTGTGATTGAGAATATTGACTGGTCTAAAAGAGAGAACTCTTTTTATTTCCAAACAGTTTCAGTAAAATATGAAGATGTGTTGGGATACATTTTTTTGGAAGTATGTCCTTAGGCCTCCTTTTTTATAAACTTGTATGTTTCTACTGCATAGGTGGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTACTATAATGAAAGCACTGGGAAGAGTCAATGGGAAAGGCCCACTGAATCGTCTTTTGGTTTGCAACTTTCATCTGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAGTCGATCAAACAACAGGTTGGTGAGAGATTTAATGTATTTATGATTTTGATATACTTATGCATTATCTGGAGGAGAAAAGATAAGAAAGCTGGAGGATTCCAACACCATAATCCTTGTCGTTTCAGCTTTACCATA
CTTTTTCTCGATGCATGTTTAAATTTCTGATTTCCCCTAAAAATAAGAATGATGTATCTTTAAAGGTTTATACTTGAGAAAGTGGCCTTCAGTTCCAAGTTTCTTGTTTGGCATTGTTGTAACGTAGCTATCAGGAAGGCATTCAGTTAGTGTAATTTTGACTTTGTTTGTAGCAAGTTCCATTTAAACGTGGAAGGCATTCTTTCTGAGGCCTAGTTACGGCCATCCACTTCCTGGTTTGCATTTTAAAGCTTCTAACCATTGGTTACATCTTCCCGATATATATATAAAAAAAAAAAATAATAATAATAAAAATAAATAACGTTATCTAATTGTGAGTTGCTGGACATCTGGATTCATTCTGTCCTTTGTTCTTTGCTGGTTGCTCAAAATCAGACTGGAGGTAGCACTTAACAACTAAACCATAGAGAATCTTCATGTTTAGTCCTGTCCTGCATCCCAACCTTTTTTGTGAGTCATTGATGTGGAGAGTCTAGACGTTGAAGGCTTGAGTAATCAGCTTGAATTGTGTGGAAACTTGTTTATTTGGCATCTCATGAATCTTCTCTACAAAGTTCGCAAGGCTTGAGAATGGAATTATACATTGGGATGCTTATGTTGTACCTTTACTATTATGTCCTTGTGCAAATTCGTAGCTTTCCTAACTTCAATGAAAAAGATGCTCAGAAGAAAGTTGCTGATTAATATAGTTTGCAGTTTTGTTCCTGTTGACAACTTTTCATATGATTGCAGGCCATAAATACTACTACAATAGGAGAACCCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGGCAACTTTGGCACACTCGAATGTTAGTGCTCCTGGGTCTTGGAACGACCAAACTTCAGGGCAAAGTAAATGCGTCACATGTGGAAGCGGAATGACCCTCGTGCAGGGTTCAAGATACTGCAACTGTTGTGCAAGGTAAATGAGATATATATATATATCTAGCTAAAAAAATGAAGATCTTTTCATTAAGAAACTTCAAAGTAAAGAGATGGTGCATGAGTGAACACACTATATCAAAAAAGCTGTTGAGATACATCCTTGTTTGGCAAATCGATTTGGAATTTTTATCATTAATCGTCTGCATTATTACATAAGGTGATTGTTGGTGTTTAAAGAAAATCTGAAGCAGGACTGTTCTTCTATAGTCATAAAAGCTGCCAGTCCAAGGAGTCCTTGGGTGAGCATTAGTTAGCATTAAGTCCTTTAGATTCTTCTGCCTTCAAATTAGGCCCTGGTAACATGAATCATGAACCATTCTTGGAATATGGCTTCTTTTTCCCTGATCGGATTATTACATAAGTTCCTTATTGGATTTCCTTCCATTTTCAGAGTTGCTTCTAATCCAAACAGCTTAATTTCTGGTTTGTGGGATCATTTGAACCATTCTTGGAAAATTACATTGAGGAGAAATTTGAAGTATGATAAAATTATGCACTTTAGTGATCTAATTTTACCATTAGAAGAGGTTCATATAAGCTTTAGGGACAACAAAAAATGTATTGTTGCTTTGCCTTACGGATTCCTTAGTCAAATCCCTAATTAATAAGCTGTTTGGGAATCCCAATGCCCTCTTTCTTGTTTAAAGCTATTTGAAAATCGATATTTGTTTCCCATTATTTCACAGTATTCATGAGCAGGTTTACGTTGGTGTGTTAGTGGTAAAGCTGGACTGGAATAGGTAATTAGACGATGGTTTTGCACAAGCACAGCCTGAACGCCATCCAGAGATTCATCAATCTCTAAAAGGAATGGCTATGAGAAATCGAGAAGCCCCAACATCATCATAGATGTCAACTTTTGAAGGGCTGCTGTTACATTCTGTCCATGAAAAACTGTCTTTTTAGAGTCATTGAGTTAATGGAAAGGAAATGGACCATAGTTAGCTACGTATTCACTAGCTCAAAAGGGATGTGGAGATATACTTGGGAAAATCAAAGAATCCATTAGCTCTCTCATCCGA
CTAAAAGACTCTAGTCTGAGAAAAAGCATGAATTCCCCATCTCCATCTCACTACTATATTCTATGCTACTTATCCGCCCTTTTTAGATTCTGACTAAGGGCCCACATATACATCTTACTCATCCATTAAGTACATCTTCCTCCCCCCTCCTCACACTTCCTCTCATACATGATACATGTTTATTAGACACAATCTGCCAATCAACTAGTTGGACACCCACTAACTGTTTTTTTACTGCAATTTTATATGGATAAAATCTGACACTTCGCTCTTCTTTGTGTTGGTTTTGGTGGCTAAATTCATCAGTGGGGTTTCTACAAGTTCAACCAATGGGACGTGGCAGGATCAACCGTCTGACCAACATAAATGCATGGGATGTGGAGGTTGGGGACTAGGCCTTGTTCAAGCTTGGGGTTACTGTAATCATTGTACACGGTATGTTGATCTCTTATATCATTCGGCAGTATTACTCTTTAGAACTTTACTTAACATTAATATATCCATTGATGTGGCGTACTGCAGATGCAATTATTTTAATTTGCAATCTTAAATGTTTCTATGACCTTACAGTTTTCAGCCCAAACCTTCGACAAATTCATACAAATATCTTGGTTCTTCTCATGGATTACGTGATATATGCTGCTATTGTTTTACTATAAAAAAATTACATTGACTTTCAGAACTCTCGGCCTTCCCCAGTGTCAGTACTTGCCAACCAGCAATATTTATAATCAGCAGAAGACCGAGAACATCAAGAATAACGCTGATCCTTCCATCAAAAAATCTGCTTCAGATAGGTAATGGAACAGTCTTCTCACTTTAGACTAGCTTAAGTATTGATTCTGTTTCTTTATATAATATTAATATGACATACAGTACTAATTTTGCAATATTTGAAGTTGTTTAGTAGTTGAGCATAATTTTTTAAAAAGATTTGTGGGTTATGGTTTTCTAGAAGTTATTTTCTATTAGTTGCTTTGAGCGACATATGTAAGATCAACTAAAGCCATGTTGAAAATTATGGGTTGAAATGATTGACATACACAGTAACTTCAATCTTTAACCTTGATCACAGTTGTTCGTTCTAACTATTGTATTGAACAAACTGCATCTAAATAGTTTTCTCCAGTGTTAATATTTGGTCGAACTTTGACCTCATGATCCCTAGTCTTAAAATTTATGACCTCTTTATCCTTCAATGCACTTTTTGTTCTCTGTACTAGTTTTATCCTCTTTTCTTTTTCTCCCTTTTCATGGTGCGTTGGACGTTGGACTGACCTAATAATTAAGATATTGAAGATGTGCGCTAGCACTGTGGGTTTGTATCACTAAAGATCTAGAATCGTGGCATTGGATATTTTGTTCAAGTGTTCTGGCAACACGAGTGTCTTGGTGCTGATATTAACTTGGTCAACAATTTGAAAAATCCTTTGATTCTTTCATTTTGCGAAATAACAAATAACCAGCTATAAGCTTGTCAAAAATATAAAAAATTTGATGTGGCATGAACCCACTGTGTAAGCATACATACTCATAGGTAGTAAATTGTGAAAGATTTTCTTACTATGTTTACATATCCGTGTGGAACCATTACACATTTTCTTCCTATCACATCAAGTATGGGAACTTTGGCCATGCAACCCCTTTGGTTTAATTTTTCCTAATCCATGAACATTCTCACCTCTTGATTCTTTAGAGAACTCGGAGGATCTAGGTGTATGGATGAGGTTTCTGAAGACACTGCATAGCCACTTGATAGAAGCTAATTCAATTGGCGTAGAAAATGATTTTTACTCTCTAATCTCTCTGAGGAGATAGCAATGGCCTCTACAACAAACTTCTAAATCTAGCTTGTACTCTTTATTTCAATAAAACAAAAGGCATGGTAAGCAAGAGAGATTTGCCTGTGGGAACAACTTACGAGTGATAACTAAGCTTGATAAAGGGCATATACTTGGTGATACTTATCTGAACCTTTCTTAACTATTCACCTTTTATAT
TTTATTTGTCTTGAAGCAGTTTTCTTGTGAGCTGATTTGATTGGGATTTGTTTTCATTCTCCTCTAGTACTTCTCAATAAATCGAGCTCTTTTTCCCACTAAAAAATGTCTGTTATGTATGACATTAAATGGACAAAGTATATATTTATTATGGAGCATTCAACATGTTTCACCATGTTTTGGTTTTCTGGGTGTGTTGTATGGATGTATGTTGCAGACTTTTTATGGCACAGGTTGCTACCATACTATTCTGCCTTCTGAAGTCTACAACATTAAATTTGAATTATTATAACAGTTGGAATGATCTTAATCCTAGTATCATTATAATTTCATAACTTGATATGCAGACACAATTATCTATGCATACATGAAATTGTGGTTGGTCTTTATAATTCAAAATACTTCATCAAACTATTTAATACTCTTTATGTGTCCTGTTTTCTTCTTTGTTATAGTCTATGTAATGTAAAGTTTCCACATTTCTTTTTGGGGAAGGGGATGCATTACTCTGTCCAGAAATAGGGGGAGGATTAATTATTAAGCTATTGAGTATGGGTTTCCAGGTCCAAAGGGAAGCCTCCAATAGGGAAAGGTGGAAAGCGAGAAAGTAGGAAGCGTTCCCACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTCCTATTCAGATGCTCCTCGTGGTGGCTGGTAGGTGTTCAAAATTTAGCTTTTTGGGTTTAATACTAAATGATTATTTAGGCCCATTTGATCACAACTTAGTTTTTGGTTTTTGAAAATTATGCTTATCTTTTCATCGATTGAATTCTTAGCCAAATTCCCCCCCCCCCCCCNAAGTACTACTCTCTTAAAAAATTTCAGAACTTGATTTAGATTTTGGCCCTAAGAAGTAGATAACAAAACAAAAAGTACTTCATTTTCAAAAACTAAAAACCAAAATGGTTGTCAAACGAGGACTCATTTTGGATATTTTTATCGAAGAAGTGTGATATTTAATGGTTTGTTTTGATTACTTTTCAGGGTTGTGGGACTAAAAGGAGTGCAACCTCGAGCAGCAGATACTACTGCTACAGTAAGTAACTAATCATTTTGTTCTATGTTTTAACTTCGAAGAATGAGTAAGATTGAGATGAATTAGGATTTTAGACTTTTAGCTGTCTGCTGTTGCAACTGTTTCACCTTATGATAACGCAATTATCATTTCTTGTATGGCCATTGTATGTAGTTTCACATATAAAATGTTTCAGAGTTGCAGTACATTTTCCAAAATACGAACTGCATATGGAAGATACAATAATTAAAAACATAAACTGCTCGGTGCCTTCTTCATGTTCCTTCCCGAGAGTTTTTTCCAGCCCCAACATTTTTCGGTTCTTTTTTTTCTGCCAATCATAAATCATTTTCTTCATACGGAGCCTTGGTTTCAGTCTGTTTTATGTATTGCGCTGACTCTACCCCTTCTGCGATGAACTCTAGACACTGGCTAGAACATGCGTTATTCTCTTGTTCTTCAATCCAATGCTCATTGTGTTGGACATCAGTTTTTTTTTTTTTTTTTTNTTGGATAATAAATGCTTCCCTTCGTCCATGGTGAATTGCAGGGTCCTCTCTTTCAACAGCGGCCATACCCATCACCTGGAGCTGTACTGAGGAAGAATGCTGAAATTGCTTCACAGACCAAGAAAGGAAGCTCTCACTACGCGCCTATTTCCAAGAGAGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGATCTTCTACTTTATTCTGCTTCGGCATTTACAACAAGCACTATTTCTGTAGCGCAGGAATGCACTTTCCACTTGTGGAAATGCTGCTAACCCTATATTCTTACTTGAAGTTATGGATCAGATTGCATGTGATTGCTGCTGACTTTCTGGATAGGATGTGGCCGTATAATCAAGTGGGAGTTAGTTATTGAGCCATTTAAAGGTCTTCTTCTATTCATGAGTTTCTTGGTTTTCTTGGGGCTCACTTGT
ATGTAGAATAGCTAATGATGCAATTACTGGAAAGTAGAACATTTGAACATATTGAACCTTTTTCTCCTTCTTTCGTGATCATTTACTTCAAACGTTATACTTACGTAACTTAGAGTTGCATATCTTTATCATGTTGCTTTTCTTGTCTCACCTTATGAGTTTGTTAATCCTACAATTATGTACACTCAAGAGCTGAACAGTTTCTCAATATGAACACTGCTTTACTGGGTTGC

mRNA sequence

CGGTCAATTTCCGGACAGTTTTGGGAACCGCCAAGAAATACAACCAGGTTTTTCCAATTCGGCGCTCTATTATATCCTTTCAATTCGACGCATACCGTCGATCCGCATCAGCAATCCGCCATACTCAAATCTGCCTCAAATTCCCACAAGCAGCTGGTCCAGTACAGCAAAGCTTTCAAGATCCAAGCCCTTCCGGGGATGCCGACTTCTACTGCAGCAATTGCAGATTCAGGAGATTCGTCCAAAACTACAATTGGTTCTAGTGTTGAAGATAGTTCTCTCAAGGAATCAGGTTCTGCTCAATCTCAATCTTGTGCCCAAAATGAAGTGCAAGAACTTGAAAAGTTTGGCAACCAAATTTCTCCTTGTCAACTGGGAGAAGTTCATAGTTCTGTAGTAATCTCTTCTGATCAAGAGAAACCCCTAAGTTTTGGAAATGATCAGAACATTGTTTCCCATGATGGTGTGTTTAATATTGCTGTCTCGGCTTCTAGCAAATTCGGGTCACATGTCGATACCAGAGACATTGATAATGCTGTTCGGGATGCCGTGTTGAGGGAACAGGAACTTGCTACCCAAAATATTATTCGTAGCCGAAGAGACTCCGTGGATGCAGATGGACTTCCAGAGGAGAGATCAGATATCTTTTCAGAACGTTATGACCCAAGTGCTCTTAAAGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGTTGAATCTTCCCGAAGAAGGAAATAATACGATTGACCAGAAAATCCAGGGACAGGTTAGGGAAGCAGAACAAAGTTCTTCTGCCAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAAAGCTAGGGGTATTCTTAAAGAAGATGCAAAACATAGCAATTCTGCAAATTCTGATGCTATTTCAAATCAAATGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTGGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTACTATAATGAAAGCACTGGGAAGAGTCAATGGGAAAGGCCCACTGAATCGTCTTTTGGTTTGCAACTTTCATCTGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAGTCGATCAAACAACAGGCCATAAATACTACTACAATAGGAGAACCCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGGCAACTTTGGCACACTCGAATGTTAGTGCTCCTGGGTCTTGGAACGACCAAACTTCAGGGCAAAGTAAATGCGTCACATGTGGAAGCGGAATGACCCTCGTGCAGGGTTCAAGATACTGCAACTGTTGTGCAAGTGGGGTTTCTACAAGTTCAACCAATGGGACGTGGCAGGATCAACCGTCTGACCAACATAAATGCATGGGATGTGGAGGTTGGGGACTAGGCCTTGTTCAAGCTTGGGGTTACTGTAATCATTGTACACGAACTCTCGGCCTTCCCCAGTGTCAGTACTTGCCAACCAGCAATATTTATAATCAGCAGAAGACCGAGAACATCAAGAATAACGCTGATCCTTCCATCAAAAAATCTGCTTCAGATAGGTCCAAAGGGAAGCCTCCAATAGGGAAAGGTGGAAAGCGAGAAAGTAGGAAGCGTTCCCACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTCCTATTCAGATGCTCCTCGTGGAGTGCAACCTCGAGCAGCAGATACTACTGCTACAGGTCCTCTCTTTCAACAGCGGCCATACCCATCACCTGGAGCTGTACTGAGGAAGAATGCTGAAATTGCTTCACAGACCAAGAAAGGAAGCTCTCACTACGCGCCTATTTCCAAGAGAGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGATCTTCTACTTTATTCTGCTTCGGCATTTACAACAAGCACTATTTCTGTAGCGCAGGAATGCACTTTCCACTTGTGGAAATGCTGCTAACCCTATATTCTTACTTGAAGTTATGGATCAGATTGCA
TGTGATTGCTGCTGACTTTCTGGATAGGATGTGGCCGTATAATCAAGTGGGAGTTAGTTATTGAGCCATTTAAAGGTCTTCTTCTATTCATGAGTTTCTTGGTTTTCTTGGGGCTCACTTGTATGTAGAATAGCTAATGATGCAATTACTGGAAAGTAGAACATTTGAACATATTGAACCTTTTTCTCCTTCTTTCGTGATCATTTACTTCAAACGTTATACTTACGTAACTTAGAGTTGCATATCTTTATCATGTTGCTTTTCTTGTCTCACCTTATGAGTTTGTTAATCCTACAATTATGTACACTCAAGAGCTGAACAGTTTCTCAATATGAACACTGCTTTACTGGGTTGC

Coding sequence (CDS)

CGGTCAATTTCCGGACAGTTTTGGGAACCGCCAAGAAATACAACCAGGTTTTTCCAATTCGGCGCTCTATTATATCCTTTCAATTCGACGCATACCGTCGATCCGCATCAGCAATCCGCCATACTCAAATCTGCCTCAAATTCCCACAAGCAGCTGGTCCAGTACAGCAAAGCTTTCAAGATCCAAGCCCTTCCGGGGATGCCGACTTCTACTGCAGCAATTGCAGATTCAGGAGATTCGTCCAAAACTACAATTGGTTCTAGTGTTGAAGATAGTTCTCTCAAGGAATCAGGTTCTGCTCAATCTCAATCTTGTGCCCAAAATGAAGTGCAAGAACTTGAAAAGTTTGGCAACCAAATTTCTCCTTGTCAACTGGGAGAAGTTCATAGTTCTGTAGTAATCTCTTCTGATCAAGAGAAACCCCTAAGTTTTGGAAATGATCAGAACATTGTTTCCCATGATGGTGTGTTTAATATTGCTGTCTCGGCTTCTAGCAAATTCGGGTCACATGTCGATACCAGAGACATTGATAATGCTGTTCGGGATGCCGTGTTGAGGGAACAGGAACTTGCTACCCAAAATATTATTCGTAGCCGAAGAGACTCCGTGGATGCAGATGGACTTCCAGAGGAGAGATCAGATATCTTTTCAGAACGTTATGACCCAAGTGCTCTTAAAGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGTTGAATCTTCCCGAAGAAGGAAATAATACGATTGACCAGAAAATCCAGGGACAGGTTAGGGAAGCAGAACAAAGTTCTTCTGCCAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAAAGCTAGGGGTATTCTTAAAGAAGATGCAAAACATAGCAATTCTGCAAATTCTGATGCTATTTCAAATCAAATGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTGGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTACTATAATGAAAGCACTGGGAAGAGTCAATGGGAAAGGCCCACTGAATCGTCTTTTGGTTTGCAACTTTCATCTGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAGTCGATCAAACAACAGGCCATAAATACTACTACAATAGGAGAACCCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGGCAACTTTGGCACACTCGAATGTTAGTGCTCCTGGGTCTTGGAACGACCAAACTTCAGGGCAAAGTAAATGCGTCACATGTGGAAGCGGAATGACCCTCGTGCAGGGTTCAAGATACTGCAACTGTTGTGCAAGTGGGGTTTCTACAAGTTCAACCAATGGGACGTGGCAGGATCAACCGTCTGACCAACATAAATGCATGGGATGTGGAGGTTGGGGACTAGGCCTTGTTCAAGCTTGGGGTTACTGTAATCATTGTACACGAACTCTCGGCCTTCCCCAGTGTCAGTACTTGCCAACCAGCAATATTTATAATCAGCAGAAGACCGAGAACATCAAGAATAACGCTGATCCTTCCATCAAAAAATCTGCTTCAGATAGGTCCAAAGGGAAGCCTCCAATAGGGAAAGGTGGAAAGCGAGAAAGTAGGAAGCGTTCCCACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTCCTATTCAGATGCTCCTCGTGGAGTGCAACCTCGAGCAGCAGATACTACTGCTACAGGTCCTCTCTTTCAACAGCGGCCATACCCATCACCTGGAGCTGTACTGAGGAAGAATGCTGAAATTGCTTCACAGACCAAGAAAGGAAGCTCTCACTACGCGCCTATTTCCAAGAGAGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGA

Protein sequence

RSISGQFWEPPRNTTRFFQFGALLYPFNSTHTVDPHQQSAILKSASNSHKQLVQYSKAFKIQALPGMPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLGEVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHVDTRDIDNAVRDAVLREQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGKLNLPEEGNNTIDQKIQGQVREAEQSSSAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
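The protein sequence above appears to be the listed coding sequence translated from its first base with the standard genetic code: the first four codons (CGG TCA ATT TCC) give RSIS, and the final codons (GGT GAT GCT GAC TGA) give GDAD followed by a stop. The following is a minimal sketch of that translation. The function name `translate` is hypothetical, and the use of the standard table (NCBI translation table 1) is an assumption, since the page does not name the table used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds, to_stop=True):
    """Translate a nucleotide string in frame 0.

    Codons containing characters outside T/C/A/G (for example N) become 'X'.
    A trailing partial codon is ignored. With to_stop=True, translation ends
    at the first stop codon, which is not included in the result.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

print(translate("CGGTCAATTTCC"))     # first 12 bases of the CDS above
print(translate("GGTGATGCTGACTGA"))  # last 15 bases of the CDS above
```

Passing the full coding sequence to `translate` allows a direct comparison with the protein sequence listed here.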
Homology
BLAST of Cp4.1LG03g08750 vs. ExPASy Swiss-Prot
Match: Q2HJC9 (Polyglutamine-binding protein 1 OS=Bos taurus OX=9913 GN=PQBP1 PE=2 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 5.2e-15
Identity = 48/76 (63.16%), Positives = 55/76 (72.37%), Query Frame = 0

Query: 537 ESRKRSHSEDDELDPMDPSSYSDAPRGV----------QPRAADTTATGPLFQQRPYPSP 596
           +S+K +  +D+ELDPMDPSSYSDAPRG               ADTTA GPLFQQRPYPSP
Sbjct: 187 KSKKAASRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSP 246

Query: 597 GAVLRKNAEIASQTKK 603
           GAVLR NAE AS+TK+
Sbjct: 247 GAVLRANAE-ASRTKQ 261
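The percentages in each HSP summary line are the matched count divided by the alignment length. For the hit above, 48 identical positions over 76 aligned columns gives 63.16%. The sketch below recomputes them from a summary line. The function name `hsp_percentages` is hypothetical, and the regular expression assumes only the `label = matched/total` layout seen in these reports.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages for every 'label = matched/total' field."""
    result = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        result[label] = round(100 * int(matched) / int(total), 2)
    return result

line = "Identity = 48/76 (63.16%), Positives = 55/76 (72.37%), Query Frame = 0"
print(hsp_percentages(line))
```

Fields without a fraction, such as the query frame, are skipped by the pattern.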

BLAST of Cp4.1LG03g08750 vs. ExPASy Swiss-Prot
Match: Q6PCT5 (Polyglutamine-binding protein 1 OS=Rattus norvegicus OX=10116 GN=Pqbp1 PE=2 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 3.4e-14
Identity = 46/76 (60.53%), Positives = 55/76 (72.37%), Query Frame = 0

Query: 537 ESRKRSHSEDDELDPMDPSSYSDAPRGV----------QPRAADTTATGPLFQQRPYPSP 596
           +++K +  +D+ELDPMDPSSYSDAPRG               ADTTA GPLFQQRPYPSP
Sbjct: 187 KNKKATSRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSP 246

Query: 597 GAVLRKNAEIASQTKK 603
           GAVLR NAE AS++K+
Sbjct: 247 GAVLRANAE-ASRSKQ 261

BLAST of Cp4.1LG03g08750 vs. ExPASy Swiss-Prot
Match: Q91VJ5 (Polyglutamine-binding protein 1 OS=Mus musculus OX=10090 GN=Pqbp1 PE=1 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 4.4e-14
Identity = 56/102 (54.90%), Positives = 61/102 (59.80%), Query Frame = 0

Query: 521 SDRSKGK----------PPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGV------ 580
           +DR +GK           P  K  K  SRK     D+ELDPMDPSSYSDAPRG       
Sbjct: 166 ADREEGKDRRHHRREELAPYPKNKKATSRK-----DEELDPMDPSSYSDAPRGTWSTGLP 225

Query: 581 ----QPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK 603
                   ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Sbjct: 226 KRNEAKTGADTTAAGPLFQQRPYPSPGAVLRANAE-ASRTKQ 261

BLAST of Cp4.1LG03g08750 vs. ExPASy Swiss-Prot
Match: A1YFA7 (Polyglutamine-binding protein 1 OS=Gorilla gorilla gorilla OX=9595 GN=PQBP1 PE=3 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 1.3e-13
Identity = 54/96 (56.25%), Positives = 59/96 (61.46%), Query Frame = 0

Query: 517 KKSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGV----------QP 576
           K+    R +   P  K  K  SRK     D+ELDPMDPSSYSDAPRG             
Sbjct: 174 KERRHHRREELAPYPKSKKAVSRK-----DEELDPMDPSSYSDAPRGTWSTGLPKRNEAK 233

Query: 577 RAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK 603
             ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Sbjct: 234 TGADTTAAGPLFQQRPYPSPGAVLRANAE-ASRTKQ 263

BLAST of Cp4.1LG03g08750 vs. ExPASy Swiss-Prot
Match: O60828 (Polyglutamine-binding protein 1 OS=Homo sapiens OX=9606 GN=PQBP1 PE=1 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 1.3e-13
Identity = 54/96 (56.25%), Positives = 59/96 (61.46%), Query Frame = 0

Query: 517 KKSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGV----------QP 576
           K+    R +   P  K  K  SRK     D+ELDPMDPSSYSDAPRG             
Sbjct: 174 KERRHHRREELAPYPKSKKAVSRK-----DEELDPMDPSSYSDAPRGTWSTGLPKRNEAK 233

Query: 577 RAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK 603
             ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Sbjct: 234 TGADTTAAGPLFQQRPYPSPGAVLRANAE-ASRTKQ 263

BLAST of Cp4.1LG03g08750 vs. NCBI nr
Match: KAG7017655.1 (Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1144 bits (2959), Expect = 0.0
Identity = 590/622 (94.86%), Positives = 600/622 (96.46%), Query Frame = 0

Query: 12  RNTTRFFQFGALLYPFNSTHTVDPHQQSAILKSASNSHKQLVQYSKAFKIQALPGMPTST 71
           ++  + FQFGAL+YPFNST TV P+QQSA+LKSASNSHKQLV+Y +AFKIQALPGMPTST
Sbjct: 33  KSYNQVFQFGALIYPFNSTRTVFPYQQSAVLKSASNSHKQLVKYREAFKIQALPGMPTST 92

Query: 72  AAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLGEVHSS 131
           AAIAD GDSSKTTIGSSVEDSSLKESGSAQSQS AQNEVQELEKFGNQISPCQ GEVHSS
Sbjct: 93  AAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPGEVHSS 152

Query: 132 VVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHV-DTRDIDNAVRDAVLREQEL 191
           VVISSDQEKP SFGNDQNIVSHDGVFNIAVS+SSKFGSHV DTRDIDNAVRDAVLREQEL
Sbjct: 153 VVISSDQEKPPSFGNDQNIVSHDGVFNIAVSSSSKFGSHVVDTRDIDNAVRDAVLREQEL 212

Query: 192 ATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGKLNLP 251
           ATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGKLNLP
Sbjct: 213 ATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGKLNLP 272

Query: 252 EEGNNTIDQKIQGQVREAEQSSSAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQM 311
           EEGNNTIDQKIQGQVREAEQS SAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQM
Sbjct: 273 EEGNNTIDQKIQGQVREAEQSPSAKELPEYLKQKLKARGILKEDAKHSNSANSDAISNQM 332

Query: 312 LQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 371
           LQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD
Sbjct: 333 LQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVD 392

Query: 372 QTTGHKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLV 431
           QTTGHKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWN+QTSGQSKCVTCGSGMTLV
Sbjct: 393 QTTGHKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNNQTSGQSKCVTCGSGMTLV 452

Query: 432 QGSRYCNCCASGVSTSSTNGTWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 491
           QGSRYCNCCASGVSTSSTNG WQDQ SDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ
Sbjct: 453 QGSRYCNCCASGVSTSSTNGKWQDQLSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQ 512

Query: 492 CQYLPTSNIYNQQKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDELD 551
           CQYLPTSNIYNQQKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDELD
Sbjct: 513 CQYLPTSNIYNQQKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDELD 572

Query: 552 PMDPSSYSDAPRG--------VQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK 611
           PMDPSSYSDAPRG        VQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
Sbjct: 573 PMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK 632

Query: 612 GSSHYAPISKRGDGSDGLGDAD 624
           GSSHYAPISKRGDGSDGLGDAD
Sbjct: 633 GSSHYAPISKRGDGSDGLGDAD 654

BLAST of Cp4.1LG03g08750 vs. NCBI nr
Match: XP_023527950.1 (uncharacterized protein LOC111791012 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1075 bits (2780), Expect = 0.0
Identity = 558/594 (93.94%), Positives = 558/594 (93.94%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG
Sbjct: 1   MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHVDTRDIDNAVRDAVLR 186
           EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHVDTRDIDNAVRDAVLR
Sbjct: 61  EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHVDTRDIDNAVRDAVLR 120

Query: 187 EQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGK 246
           EQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGK
Sbjct: 121 EQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRGK 180

Query: 247 LNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKELP 306
           LNLPEEGN                            NTIDQKIQGQVREAEQSSSAKELP
Sbjct: 181 LNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTHGNNTIDQKIQGQVREAEQSSSAKELP 240

Query: 307 EYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNEST 366
           EYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNEST
Sbjct: 241 EYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNEST 300

Query: 367 GKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQAT 426
           GKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQAT
Sbjct: 301 GKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQAT 360

Query: 427 LAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPSD 486
           LAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPSD
Sbjct: 361 LAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPSD 420

Query: 487 QHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIKK 546
           QHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIKK
Sbjct: 421 QHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIKK 480

Query: 547 SASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRAAD 606
           SASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG        VQPRAAD
Sbjct: 481 SASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAAD 540

Query: 607 TTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           TTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 TTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 594

BLAST of Cp4.1LG03g08750 vs. NCBI nr
Match: KAG6580903.1 (Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1047 bits (2708), Expect = 0.0
Identity = 549/595 (92.27%), Positives = 551/595 (92.61%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTSTAAIAD GDSSKTTIGSSVEDSSLKESGSAQSQS AQNEVQELEKFGNQISPCQ G
Sbjct: 1   MPTSTAAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHV-DTRDIDNAVRDAVL 186
           EVHSSVVISSDQEKP SFGNDQNIVSHDGVFNIAVS+SSKFGSHV DTRDIDNAVRDAVL
Sbjct: 61  EVHSSVVISSDQEKPPSFGNDQNIVSHDGVFNIAVSSSSKFGSHVVDTRDIDNAVRDAVL 120

Query: 187 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 246
           REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG
Sbjct: 121 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 180

Query: 247 KLNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKEL 306
           KLNLPEEGN                            NTIDQKIQGQVREAEQS SAKEL
Sbjct: 181 KLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTHGNNTIDQKIQGQVREAEQSPSAKEL 240

Query: 307 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 366
           PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES
Sbjct: 241 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 300

Query: 367 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA 426
           TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA
Sbjct: 301 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA 360

Query: 427 TLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPS 486
           TLAHSNVSAPGSWN+QTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNG WQDQ S
Sbjct: 361 TLAHSNVSAPGSWNNQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGKWQDQLS 420

Query: 487 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 546
           DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK
Sbjct: 421 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 480

Query: 547 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRAA 606
           KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG        VQPRAA
Sbjct: 481 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAA 540

Query: 607 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of Cp4.1LG03g08750 vs. NCBI nr
Match: XP_022935213.1 (uncharacterized protein LOC111442162 [Cucurbita moschata])

HSP 1 Score: 1043 bits (2698), Expect = 0.0
Identity = 546/595 (91.76%), Positives = 549/595 (92.27%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTSTAAIAD GDSSKTTIGSSVEDSSLKESGSAQSQS AQNEVQELEKFGNQISPCQ G
Sbjct: 1   MPTSTAAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHV-DTRDIDNAVRDAVL 186
           EV SSVVISSDQEKP SFGNDQNIV HDGVFNIAVS+SSKFGSHV DTRDIDNAVRDAVL
Sbjct: 61  EVRSSVVISSDQEKPPSFGNDQNIVPHDGVFNIAVSSSSKFGSHVVDTRDIDNAVRDAVL 120

Query: 187 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 246
           REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG
Sbjct: 121 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 180

Query: 247 KLNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKEL 306
           KLNLPEEGN                            NTIDQKIQGQVREAEQS SAKEL
Sbjct: 181 KLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTHGNNTIDQKIQGQVREAEQSPSAKEL 240

Query: 307 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 366
           PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES
Sbjct: 241 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 300

Query: 367 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA 426
           TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGH+YYYNRRTQVTQWEPPVASHQA
Sbjct: 301 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHRYYYNRRTQVTQWEPPVASHQA 360

Query: 427 TLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPS 486
           TLAHS VSAPGSWNDQTSGQSKCVTCGSGMTLVQG+RYCNCCASGVSTSSTNG WQDQPS
Sbjct: 361 TLAHSTVSAPGSWNDQTSGQSKCVTCGSGMTLVQGTRYCNCCASGVSTSSTNGKWQDQPS 420

Query: 487 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 546
           DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK
Sbjct: 421 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 480

Query: 547 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRAA 606
           KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG        VQPRAA
Sbjct: 481 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAA 540

Query: 607 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 595
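
The percentages in the HSP header for this match are ratios over the alignment length (546/595 identical residues, 549/595 positives). The following is a minimal sketch, not part of the original report, that parses such a header line and recomputes both values. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some exports of this report carry.

```python
import re

# Pattern for the HSP summary line. "Posi?tives" tolerates the "Postives"
# spelling seen in some exports of this report.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed from the raw counts, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 546/595 (91.76%), Positives = 549/595 (92.27%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick sanity check when these lines are scraped or copied by hand.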

BLAST of Cp4.1LG03g08750 vs. NCBI nr
Match: XP_022983732.1 (uncharacterized protein LOC111482260 [Cucurbita maxima])

HSP 1 Score: 1019 bits (2636), Expect = 0.0
Identity = 534/595 (89.75%), Positives = 540/595 (90.76%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQS AQNEVQELEKFGNQISPCQ G
Sbjct: 1   MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHV-DTRDIDNAVRDAVL 186
           EVHSSVVI SDQEK  SFGNDQNIV H GVFNIAVS+SSKFGSHV DTRDIDNAVRDAVL
Sbjct: 61  EVHSSVVIYSDQEKTPSFGNDQNIVPHAGVFNIAVSSSSKFGSHVVDTRDIDNAVRDAVL 120

Query: 187 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 246
           REQELATQNIIRS+RDSV ADGLPEERSDIFSERYDPS LKEHLLKIT+EHRAEMAMKRG
Sbjct: 121 REQELATQNIIRSQRDSVGADGLPEERSDIFSERYDPSTLKEHLLKITTEHRAEMAMKRG 180

Query: 247 KLNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKEL 306
           KLNLPEEGN                            NTIDQKIQGQVRE +QSSSAKEL
Sbjct: 181 KLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTHGNNTIDQKIQGQVRETKQSSSAKEL 240

Query: 307 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 366
           PEYLKQKLKARGILKEDAKHSNSAN+DAISNQMLQGEKLPHGWVEAKDPGSG SYYYNES
Sbjct: 241 PEYLKQKLKARGILKEDAKHSNSANADAISNQMLQGEKLPHGWVEAKDPGSGASYYYNES 300

Query: 367 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA 426
           TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQ TGHKYYYNRRTQVTQWEPP ASHQA
Sbjct: 301 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQITGHKYYYNRRTQVTQWEPPAASHQA 360

Query: 427 TLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPS 486
           TLAHSNV APGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNG WQDQPS
Sbjct: 361 TLAHSNVGAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGKWQDQPS 420

Query: 487 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 546
           D HKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNI NQ KTENIKNN+DPSIK
Sbjct: 421 DLHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNINNQHKTENIKNNSDPSIK 480

Query: 547 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRAA 606
           KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG        VQPRAA
Sbjct: 481 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAA 540

Query: 607 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of Cp4.1LG03g08750 vs. ExPASy TrEMBL
Match: A0A6J1F9X9 (Polyglutamine tract-binding protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111442162 PE=4 SV=1)

HSP 1 Score: 1043 bits (2698), Expect = 0.0
Identity = 546/595 (91.76%), Positives = 549/595 (92.27%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTSTAAIAD GDSSKTTIGSSVEDSSLKESGSAQSQS AQNEVQELEKFGNQISPCQ G
Sbjct: 1   MPTSTAAIADLGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHV-DTRDIDNAVRDAVL 186
           EV SSVVISSDQEKP SFGNDQNIV HDGVFNIAVS+SSKFGSHV DTRDIDNAVRDAVL
Sbjct: 61  EVRSSVVISSDQEKPPSFGNDQNIVPHDGVFNIAVSSSSKFGSHVVDTRDIDNAVRDAVL 120

Query: 187 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 246
           REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG
Sbjct: 121 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 180

Query: 247 KLNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKEL 306
           KLNLPEEGN                            NTIDQKIQGQVREAEQS SAKEL
Sbjct: 181 KLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTHGNNTIDQKIQGQVREAEQSPSAKEL 240

Query: 307 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 366
           PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES
Sbjct: 241 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 300

Query: 367 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA 426
           TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGH+YYYNRRTQVTQWEPPVASHQA
Sbjct: 301 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHRYYYNRRTQVTQWEPPVASHQA 360

Query: 427 TLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPS 486
           TLAHS VSAPGSWNDQTSGQSKCVTCGSGMTLVQG+RYCNCCASGVSTSSTNG WQDQPS
Sbjct: 361 TLAHSTVSAPGSWNDQTSGQSKCVTCGSGMTLVQGTRYCNCCASGVSTSSTNGKWQDQPS 420

Query: 487 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 546
           DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK
Sbjct: 421 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 480

Query: 547 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRAA 606
           KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG        VQPRAA
Sbjct: 481 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAA 540

Query: 607 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of Cp4.1LG03g08750 vs. ExPASy TrEMBL
Match: A0A6J1J063 (Polyglutamine tract-binding protein 1 OS=Cucurbita maxima OX=3661 GN=LOC111482260 PE=4 SV=1)

HSP 1 Score: 1019 bits (2636), Expect = 0.0
Identity = 534/595 (89.75%), Positives = 540/595 (90.76%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQS AQNEVQELEKFGNQISPCQ G
Sbjct: 1   MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSYAQNEVQELEKFGNQISPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFNIAVSASSKFGSHV-DTRDIDNAVRDAVL 186
           EVHSSVVI SDQEK  SFGNDQNIV H GVFNIAVS+SSKFGSHV DTRDIDNAVRDAVL
Sbjct: 61  EVHSSVVIYSDQEKTPSFGNDQNIVPHAGVFNIAVSSSSKFGSHVVDTRDIDNAVRDAVL 120

Query: 187 REQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKRG 246
           REQELATQNIIRS+RDSV ADGLPEERSDIFSERYDPS LKEHLLKIT+EHRAEMAMKRG
Sbjct: 121 REQELATQNIIRSQRDSVGADGLPEERSDIFSERYDPSTLKEHLLKITTEHRAEMAMKRG 180

Query: 247 KLNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKEL 306
           KLNLPEEGN                            NTIDQKIQGQVRE +QSSSAKEL
Sbjct: 181 KLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTHGNNTIDQKIQGQVRETKQSSSAKEL 240

Query: 307 PEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNES 366
           PEYLKQKLKARGILKEDAKHSNSAN+DAISNQMLQGEKLPHGWVEAKDPGSG SYYYNES
Sbjct: 241 PEYLKQKLKARGILKEDAKHSNSANADAISNQMLQGEKLPHGWVEAKDPGSGASYYYNES 300

Query: 367 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQA 426
           TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQ TGHKYYYNRRTQVTQWEPP ASHQA
Sbjct: 301 TGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQITGHKYYYNRRTQVTQWEPPAASHQA 360

Query: 427 TLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPS 486
           TLAHSNV APGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNG WQDQPS
Sbjct: 361 TLAHSNVGAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGKWQDQPS 420

Query: 487 DQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIK 546
           D HKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNI NQ KTENIKNN+DPSIK
Sbjct: 421 DLHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNINNQHKTENIKNNSDPSIK 480

Query: 547 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRAA 606
           KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG        VQPRAA
Sbjct: 481 KSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAA 540

Query: 607 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 DTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 595

BLAST of Cp4.1LG03g08750 vs. ExPASy TrEMBL
Match: A0A0A0LFL2 (Polyglutamine tract-binding protein 1 OS=Cucumis sativus OX=3659 GN=Csa_3G822240 PE=4 SV=1)

HSP 1 Score: 874 bits (2257), Expect = 8.48e-313
Identity = 479/666 (71.92%), Positives = 522/666 (78.38%), Query Frame = 0

Query: 1   RSISGQFWEPPRNTTRFFQFGALLYPFNSTHTVDPHQQSAILKSASNSHKQLVQYSKAFK 60
           RS+S   W    NT   FQ    ++P  S   V  H++   ++ +    + + Q  ++F+
Sbjct: 47  RSVSRVVW----NT---FQKQNQVFPIRS---VVSHRRFRHIQISLQFRQAIDQLQESFR 106

Query: 61  IQALPGMPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQI 120
              LP MPTSTA IA SGDSS T IGSS ED SLKES +AQSQ  AQNEVQELEK   Q+
Sbjct: 107 NLKLPAMPTSTAGIAGSGDSSNTIIGSSAEDKSLKESAAAQSQYRAQNEVQELEKSSKQL 166

Query: 121 SPCQLGEVHSSVVISSDQEKPLSFGNDQNIVSHDGVFN-IAVSASSKFGSHVD-TRDIDN 180
            PCQ GE   +V I +DQE   S GNDQNIV H G FN IAVS+SS F S+VD  RDID 
Sbjct: 167 YPCQPGEAQGAVAIPADQETNRSSGNDQNIVPHHGTFNNIAVSSSSNFRSNVDDARDIDI 226

Query: 181 AVRDAVLREQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRA 240
           AV+DAVLREQELATQNIIRS+RDSV ADGLP ERSDIFSERYDPS+LKEHLLKITSEHRA
Sbjct: 227 AVQDAVLREQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPSSLKEHLLKITSEHRA 286

Query: 241 EMAMKRGKLNLPEEGN----------------------------NTIDQKIQGQVREAEQ 300
           EMA+KRGKLNLPEEGN                            N   QKIQGQ++EAEQ
Sbjct: 287 EMAIKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNVTGQKIQGQIKEAEQ 346

Query: 301 SSSAKELPEYLKQKLKARGILKEDAKHSNSA----NSDAISNQMLQGEKLPHGWVEAKDP 360
           SS++K LPEYLKQKL+ARGILKEDA+HSNS     NSDA+SN  LQGEKLPHGWVEAKDP
Sbjct: 347 SSASKALPEYLKQKLRARGILKEDAEHSNSVRADTNSDAVSNTKLQGEKLPHGWVEAKDP 406

Query: 361 GSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVT 420
            SGVSYYYNES+GKSQWERP+E S   QLSSAVSLPEDWMEA+DQT+G KYYYN RT VT
Sbjct: 407 HSGVSYYYNESSGKSQWERPSELSSNTQLSSAVSLPEDWMEAIDQTSGVKYYYNMRTHVT 466

Query: 421 QWEPPVASHQATLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTS 480
           QWE PVASHQ TL HSN   PG WNDQT  QSKC+TCGSGMTLVQGSRYCN C SGVSTS
Sbjct: 467 QWERPVASHQTTLTHSNDKFPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNSCTSGVSTS 526

Query: 481 STNGTWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTE 540
           STNG WQDQPS+Q+KCMGCGGWGLGLVQAWGYC HCTR LGLPQCQYLPT+NI NQQK E
Sbjct: 527 STNGIWQDQPSEQNKCMGCGGWGLGLVQAWGYCIHCTRILGLPQCQYLPTNNISNQQKIE 586

Query: 541 NIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--- 600
           N+K++ADPSIKKS +DRSK KPPIGKGGKRESRKRS+SEDDELDPMDPSSYSDAPRG   
Sbjct: 587 NVKHSADPSIKKSVTDRSKWKPPIGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWV 646

Query: 601 -----VQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSD 624
                VQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSD
Sbjct: 647 VGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSD 702

BLAST of Cp4.1LG03g08750 vs. ExPASy TrEMBL
Match: A0A5D3DPP7 (Polyglutamine tract-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G006960 PE=4 SV=1)

HSP 1 Score: 865 bits (2236), Expect = 2.18e-311
Identity = 461/596 (77.35%), Positives = 490/596 (82.21%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTST AIA SGDSS T IGSS ED SLKES      + AQNEVQELEKF  QI PCQ G
Sbjct: 1   MPTSTTAIAGSGDSSNTIIGSSAEDKSLKES------AAAQNEVQELEKFSKQIYPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFN-IAVSASSKFGSHVDT-RDIDNAVRDAV 186
           E   SV IS+DQE   SFGNDQNIV H+GVFN IAVS SS F S+VD  RDI+ AV+DAV
Sbjct: 61  EAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNVDDGRDIEIAVQDAV 120

Query: 187 LREQELATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAMKR 246
           LREQELATQNIIRS+R+SV ADGLP E+SDIFSERYDPS +KEHLLKITSEHRAEMAMKR
Sbjct: 121 LREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKR 180

Query: 247 GKLNLPEEGN----------------------------NTIDQKIQGQVREAEQSSSAKE 306
           GKLNLPEEGN                            N   QKIQGQV+E EQSS+AK 
Sbjct: 181 GKLNLPEEGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKA 240

Query: 307 LPEYLKQKLKARGILKEDAKHSNSANSDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNE 366
           LPEYLKQKL+ARGILKEDA+HSN  NSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNE
Sbjct: 241 LPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNE 300

Query: 367 STGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQ 426
           S+GKSQWERP+E S   QLSSAVSLPEDWMEA+DQT+G KYYYN RT +TQWE PVASHQ
Sbjct: 301 SSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQ 360

Query: 427 ATLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQP 486
            TL HSN   PG WNDQT  QSKC+TCGSGMTLVQGSRYCN C SGVSTSSTNG WQDQ 
Sbjct: 361 TTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQS 420

Query: 487 SDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSI 546
           S+Q+KCMGCGGWGLGLVQAWGYCNHCTR L LPQCQYLPT+NI NQQKTENIK++ADPSI
Sbjct: 421 SEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSI 480

Query: 547 KKSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDPSSYSDAPRG--------VQPRA 606
           KKSA+DRSK KPP+GKGGKRESRKRS+SEDDELDPMDPSSYSDAPRG        VQPRA
Sbjct: 481 KKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRA 540

Query: 607 ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 624
           ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Sbjct: 541 ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD 590

BLAST of Cp4.1LG03g08750 vs. ExPASy TrEMBL
Match: A0A5A7UK56 (Polyglutamine tract-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold135G002510 PE=4 SV=1)

HSP 1 Score: 823 bits (2127), Expect = 2.84e-294
Identity = 451/630 (71.59%), Positives = 481/630 (76.35%), Query Frame = 0

Query: 67  MPTSTAAIADSGDSSKTTIGSSVEDSSLKESGSAQSQSCAQNEVQELEKFGNQISPCQLG 126
           MPTST AIA SGDSS T IGSS ED SLKES      + AQNEVQELEKF  QI PCQ G
Sbjct: 1   MPTSTTAIAGSGDSSNTIIGSSAEDKSLKES------AAAQNEVQELEKFSKQIYPCQPG 60

Query: 127 EVHSSVVISSDQEKPLSFGNDQNIVSHDGVFN-IAVSASSKFGSHVDT-RDIDNAVRDAV 186
           E   SV IS+DQE   SFGNDQNIV H+GVFN IAVS SS F S+VD  RDI+ AV+DAV
Sbjct: 61  EAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNVDDGRDIEIAVQDAV 120

Query: 187 LREQE--LATQNIIRSRRDSVDADGLPEERSDIFSERYDPSALKEHLLKITSEHRAEMAM 246
           LREQ   +     +   R+SV ADGLP E+SDIFSERYDPS +KEHLLKITSEHRAEMAM
Sbjct: 121 LREQFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAM 180

Query: 247 KRGKLNLPEEGN------------------------------------------------ 306
           KRGKLNLPEEGN                                                
Sbjct: 181 KRGKLNLPEEGNLEIGNGYGVPGGCASYGASKPGIVANGLELKTFFLLLDFDVDFQLVHF 240

Query: 307 --------NTIDQKIQGQVREAEQSSSAKELPEYLKQKLKARGILKEDAKHSN----SAN 366
                   N   QKIQGQV+E EQSS+AK LPEYLKQKL+ARGILKEDA+HSN      N
Sbjct: 241 IFVLLSGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPVRADTN 300

Query: 367 SDAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLP 426
           SDA+SN  L GEKLPHGWVEAKDP SGVSYYYNES+GKSQWERP+E S   QLSSAVSLP
Sbjct: 301 SDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLP 360

Query: 427 EDWMEAVDQTTGHKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNDQTSGQSKCVT 486
           EDWMEA+DQT+G KYYYN RT +TQWE PVASHQ TL HSN   PG WNDQT  QSKC+T
Sbjct: 361 EDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCIT 420

Query: 487 CGSGMTLVQGSRYCNCCASGVSTSSTNGTWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHC 546
           CGSGMTLVQGSRYCN C SGVSTSSTNG WQDQ S+Q+KCMGCGGWGLGLVQAWGYCNHC
Sbjct: 421 CGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHC 480

Query: 547 TRTLGLPQCQYLPTSNIYNQQKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRS 606
           TR L LPQCQYLPT+NI NQQKTENIK++ADPSIKKSA+DRSK KPP+GKGGKRESRKRS
Sbjct: 481 TRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRS 540

Query: 607 HSEDDELDPMDPSSYSDAPRG--------VQPRAADTTATGPLFQQRPYPSPGAVLRKNA 624
           +SEDDELDPMDPSSYSDAPRG        VQPRAADTTATGPLFQQRPYPSPGAVLRKNA
Sbjct: 541 YSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNA 600

BLAST of Cp4.1LG03g08750 vs. TAIR 10
Match: AT2G41020.1 (WW domain-containing protein)

HSP 1 Score: 364.0 bits (933), Expect = 2.4e-100
Identity = 227/499 (45.49%), Positives = 295/499 (59.12%), Query Frame = 0

Query: 160 AVSASSKFGSHV---DTRDIDNAVRDAVLREQELATQNIIRSRRDS-VDADGLPEERSDI 219
           +V+++  +GS +    ++DI++A   A+LREQE+ TQ II+ +R++     G  +  +DI
Sbjct: 13  SVTSNYGYGSSLAYDQSQDIESAANTALLREQEIETQKIIQGQREAGTSVAGDSKHNTDI 72

Query: 220 FSERYDPSALKEHLLKITSEHRAEMAMKR-GKLNLPEEGNNTIDQ--KIQGQVREA---- 279
             +R DP+ALKEHLLK T+ HRAE A KR G ++   EGN  +     I G V  A    
Sbjct: 73  LRDRADPNALKEHLLKFTANHRAEAAAKRGGSVSTCGEGNVDVGNGYGIPGGVAYAGHSE 132

Query: 280 -----EQSSSAKELPEYLKQKLKARGILKE--DAKHSNSANSDAIS-NQ------MLQGE 339
                E ++++  LPEYLKQKLKARGIL++   A  SN  ++ A+S N+           
Sbjct: 133 LSGKPEPTNASNNLPEYLKQKLKARGILRDGAGAVTSNPEDTSAVSWNRQATLPFQANAS 192

Query: 340 KLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTG 399
            LP GWV+AKDP SG +YYYN+ TG  QWERP E S+    +  V   E+W+E  D+ +G
Sbjct: 193 TLPLGWVDAKDPASGATYYYNQHTGTCQWERPVELSYATSSAPPVLSKEEWIETFDEASG 252

Query: 400 HKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSR 459
           HKY+YN RT V+QWEPP +  +    +SN                               
Sbjct: 253 HKYFYNTRTHVSQWEPPASLQKPAATNSN------------------------------- 312

Query: 460 YCNCCASGVSTSSTNGTWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYL 519
                 + V+ S+ NG  +  PS   +C GCGGWG+GLVQ WGYC HCTR   LP+ Q+L
Sbjct: 313 ------NAVTQSTANGKGEHPPSQLPRCSGCGGWGVGLVQRWGYCVHCTRVFNLPEKQFL 372

Query: 520 PTSNIYNQQKTENIKNNADPSIKKSASDRSKGKPPIGKGGKRESRKRSHSEDDELDPMDP 579
           P           N   NA  S +K  + RS  KPP+    K   +KR+H+EDDELDPMDP
Sbjct: 373 PAH--------LNHFTNAGDSGQKDPNQRSSSKPPM---KKVIGKKRAHAEDDELDPMDP 432

Query: 580 SSYSDAPR--------GVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQTKKGSS 625
           SSYSDAPR        GVQPRAADTTA+GPLFQQRPYPSPGAVLR+NAE+A SQ KK +S
Sbjct: 433 SSYSDAPRGGWVVGLKGVQPRAADTTASGPLFQQRPYPSPGAVLRRNAEVASSQKKKPNS 463

BLAST of Cp4.1LG03g08750 vs. TAIR 10
Match: AT2G41020.2 (WW domain-containing protein)

HSP 1 Score: 237.3 bits (604), Expect = 3.4e-62
Identity = 144/361 (39.89%), Positives = 200/361 (55.40%), Query Frame = 0

Query: 160 AVSASSKFGSHV---DTRDIDNAVRDAVLREQELATQNIIRSRRDS-VDADGLPEERSDI 219
           +V+++  +GS +    ++DI++A   A+LREQE+ TQ II+ +R++     G  +  +DI
Sbjct: 13  SVTSNYGYGSSLAYDQSQDIESAANTALLREQEIETQKIIQGQREAGTSVAGDSKHNTDI 72

Query: 220 FSERYDPSALKEHLLKITSEHRAEMAMKR-GKLNLPEEGNNTIDQ--KIQGQVREA---- 279
             +R DP+ALKEHLLK T+ HRAE A KR G ++   EGN  +     I G V  A    
Sbjct: 73  LRDRADPNALKEHLLKFTANHRAEAAAKRGGSVSTCGEGNVDVGNGYGIPGGVAYAGHSE 132

Query: 280 -----EQSSSAKELPEYLKQKLKARGILKE--DAKHSNSANSDAIS-NQ------MLQGE 339
                E ++++  LPEYLKQKLKARGIL++   A  SN  ++ A+S N+           
Sbjct: 133 LSGKPEPTNASNNLPEYLKQKLKARGILRDGAGAVTSNPEDTSAVSWNRQATLPFQANAS 192

Query: 340 KLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTG 399
            LP GWV+AKDP SG +YYYN+ TG  QWERP E S+    +  V   E+W+E  D+ +G
Sbjct: 193 TLPLGWVDAKDPASGATYYYNQHTGTCQWERPVELSYATSSAPPVLSKEEWIETFDEASG 252

Query: 400 HKYYYNRRTQVTQWEPPVASHQATLAHSNVSAPGSWNDQTSGQSKCVTCGSGMTLVQGSR 459
           HKY+YN RT V+QWEPP +  +    +SN                               
Sbjct: 253 HKYFYNTRTHVSQWEPPASLQKPAATNSN------------------------------- 312

Query: 460 YCNCCASGVSTSSTNGTWQDQPSDQHKCMGCGGWGLGLVQAWGYCNHCTRTLGLPQCQYL 496
                 + V+ S+ NG  +  PS   +C GCGGWG+GLVQ WGYC HCTR   LP+ Q+L
Sbjct: 313 ------NAVTQSTANGKGEHPPSQLPRCSGCGGWGVGLVQRWGYCVHCTRVFNLPEKQFL 336

BLAST of Cp4.1LG03g08750 vs. TAIR 10
Match: AT3G19840.1 (pre-mRNA-processing protein 40C)

HSP 1 Score: 48.5 bits (114), Expect = 2.2e-05
Identity = 34/97 (35.05%), Positives = 45/97 (46.39%), Query Frame = 0

Query: 304 DAISNQMLQGEKLPHGWVEAKDPGSGVSYYYNESTGKSQWERPTESSFGLQLS------- 363
           D  +   L G +L   W   K   +GV YYYN  TG+S +E+P    FG +         
Sbjct: 234 DDRAGSQLVGNRL-DAWTAHKSE-AGVLYYYNSVTGQSTYEKP--PGFGGEPDKVPVQPI 293

Query: 364 --SAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQWEPP 392
             S  SLP      V    G KYYYN +T+V+ W+ P
Sbjct: 294 PVSMESLPGTDWALVSTNDGKKYYYNNKTKVSSWQIP 326

BLAST of Cp4.1LG03g08750 vs. TAIR 10
Match: AT1G44910.1 (pre-mRNA-processing protein 40A)

HSP 1 Score: 48.1 bits (113), Expect = 2.9e-05
Identity = 30/81 (37.04%), Positives = 42/81 (51.85%), Query Frame = 0

Query: 329 GVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQW 388
           G  YYYN+ T +S WE+P E    L+ + A ++   W E      G KYYYN+ T+ ++W
Sbjct: 198 GRKYYYNKRTKQSNWEKPLELMTPLERADASTV---WKE-FTTPEGKKYYYNKVTKESKW 257

Query: 389 EPP----VASHQATLAHSNVS 406
             P    +A  QA LA    S
Sbjct: 258 TIPEDLKLAREQAQLASEKTS 274

BLAST of Cp4.1LG03g08750 vs. TAIR 10
Match: AT1G44910.2 (pre-mRNA-processing protein 40A)

HSP 1 Score: 48.1 bits (113), Expect = 2.9e-05
Identity = 30/81 (37.04%), Positives = 42/81 (51.85%), Query Frame = 0

Query: 329 GVSYYYNESTGKSQWERPTESSFGLQLSSAVSLPEDWMEAVDQTTGHKYYYNRRTQVTQW 388
           G  YYYN+ T +S WE+P E    L+ + A ++   W E      G KYYYN+ T+ ++W
Sbjct: 198 GRKYYYNKRTKQSNWEKPLELMTPLERADASTV---WKE-FTTPEGKKYYYNKVTKESKW 257

Query: 389 EPP----VASHQATLAHSNVS 406
             P    +A  QA LA    S
Sbjct: 258 TIPEDLKLAREQAQLASEKTS 274

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
Q2HJC9      5.2e-15  63.16     Polyglutamine-binding protein 1 OS=Bos taurus OX=9913 GN=PQBP1 PE=2 SV=1
Q6PCT5      3.4e-14  60.53     Polyglutamine-binding protein 1 OS=Rattus norvegicus OX=10116 GN=Pqbp1 PE=2 SV=1
Q91VJ5      4.4e-14  54.90     Polyglutamine-binding protein 1 OS=Mus musculus OX=10090 GN=Pqbp1 PE=1 SV=1
A1YFA7      1.3e-13  56.25     Polyglutamine-binding protein 1 OS=Gorilla gorilla gorilla OX=9595 GN=PQBP1 PE=3...
O60828      1.3e-13  56.25     Polyglutamine-binding protein 1 OS=Homo sapiens OX=9606 GN=PQBP1 PE=1 SV=1
Match Name      E-value  Identity  Description
KAG7017655.1    0.0      94.86     Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. argyrosp...
XP_023527950.1  0.0      93.94     uncharacterized protein LOC111791012 [Cucurbita pepo subsp. pepo]
KAG6580903.1    0.0      92.27     Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. sororia]
XP_022935213.1  0.0      91.76     uncharacterized protein LOC111442162 [Cucurbita moschata]
XP_022983732.1  0.0      89.75     uncharacterized protein LOC111482260 [Cucurbita maxima]
Match Name  E-value    Identity  Description
A0A6J1F9X9  0.0        91.76     Polyglutamine tract-binding protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111442...
A0A6J1J063  0.0        89.75     Polyglutamine tract-binding protein 1 OS=Cucurbita maxima OX=3661 GN=LOC11148226...
A0A0A0LFL2  8.48e-313  71.92     Polyglutamine tract-binding protein 1 OS=Cucumis sativus OX=3659 GN=Csa_3G822240...
A0A5D3DPP7  2.18e-311  77.35     Polyglutamine tract-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A5A7UK56  2.84e-294  71.59     Polyglutamine tract-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=...
Match Name   E-value   Identity  Description
AT2G41020.1  2.4e-100  45.49     WW domain-containing protein
AT2G41020.2  3.4e-62   39.89     WW domain-containing protein
AT3G19840.1  2.2e-05   35.05     pre-mRNA-processing protein 40C
AT1G44910.1  2.9e-05   37.04     pre-mRNA-processing protein 40A
AT1G44910.2  2.9e-05   37.04     pre-mRNA-processing protein 40A
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description        Source       Source Term    Source Description               Alignment
IPR001202  WW domain              SMART        SM00456        ww_5                             coord: 360..393; e-value: 7.3E-9; score: 45.4
IPR001202  WW domain              SMART        SM00456        ww_5                             coord: 315..348; e-value: 1.9E-10; score: 50.7
IPR001202  WW domain              PFAM         PF00397        WW                               coord: 361..391; e-value: 4.6E-11; score: 42.6
IPR001202  WW domain              PFAM         PF00397        WW                               coord: 316..346; e-value: 4.0E-13; score: 49.2
IPR001202  WW domain              PROSITE      PS01159        WW_DOMAIN_1                      coord: 365..391
IPR001202  WW domain              PROSITE      PS01159        WW_DOMAIN_1                      coord: 320..346
IPR001202  WW domain              PROSITE      PS50020        WW_DOMAIN_2                      coord: 359..393; score: 13.062901
IPR001202  WW domain              PROSITE      PS50020        WW_DOMAIN_2                      coord: 314..348; score: 13.7721
IPR001202  WW domain              CDD          cd00201        WW                               coord: 317..348; e-value: 9.36974E-10; score: 52.145
IPR001202  WW domain              CDD          cd00201        WW                               coord: 362..392; e-value: 8.78123E-7; score: 43.6706
None       No IPR available       GENE3D       2.20.70.10     -                                coord: 356..407; e-value: 1.8E-11; score: 45.4
None       No IPR available       GENE3D       2.20.70.10     -                                coord: 310..351; e-value: 1.0E-12; score: 49.4
None       No IPR available       GENE3D       3.40.30.10     Glutaredoxin                     coord: 511..603; e-value: 3.7E-21; score: 77.7
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 73..103
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 516..550
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 72..103
None       No IPR available       MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 505..624
None       No IPR available       PANTHER      PTHR21737:SF3  POLYGLUTAMINE-BINDING PROTEIN 1  coord: 107..441
None       No IPR available       PANTHER      PTHR21737:SF3  POLYGLUTAMINE-BINDING PROTEIN 1  coord: 447..624
None       No IPR available       PANTHER      PTHR21737      POLYGLUTAMINE BINDING PROTEIN 1/MARVEL MEMBRANE-ASSOCIATING DOMAIN CONTAINING 3  coord: 107..441
None       No IPR available       PANTHER      PTHR21737      POLYGLUTAMINE BINDING PROTEIN 1/MARVEL MEMBRANE-ASSOCIATING DOMAIN CONTAINING 3  coord: 447..624
IPR036020  WW domain superfamily  SUPERFAMILY  51045          WW domain                        coord: 308..348
IPR036020  WW domain superfamily  SUPERFAMILY  51045          WW domain                        coord: 357..394
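
The InterPro hits assigned to IPR001202 and IPR036020 come from several member databases and report slightly different coordinates for the same regions. As an illustrative sketch only (not part of the InterPro analysis itself), overlapping coordinate ranges can be merged to see how many distinct WW-domain regions the hits describe. The helper name `merge_intervals` is hypothetical; the coordinates are copied from the table, and the GENE3D rows are left out because they carry no IPR term.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges, both ends inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit ends later.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# coord values for the IPR001202 / IPR036020 rows of the InterPro table
ww_hits = [
    (360, 393), (315, 348),  # SMART SM00456
    (361, 391), (316, 346),  # PFAM PF00397
    (365, 391), (320, 346),  # PROSITE PS01159
    (359, 393), (314, 348),  # PROSITE PS50020
    (317, 348), (362, 392),  # CDD cd00201
    (308, 348), (357, 394),  # SUPERFAMILY 51045
]

print(merge_intervals(ww_hits))  # [(308, 348), (357, 394)]
```

Under these assumptions the twelve hits collapse into two non-overlapping regions, consistent with a protein carrying two WW domains.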

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG03g08750.1  Cp4.1LG03g08750.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0000380      alternative mRNA splicing, via spliceosome
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0016604      nuclear body
molecular_function  GO:0005515      protein binding
molecular_function  GO:0043021      ribonucleoprotein complex binding