Cp4.1LG02g01820 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g01820
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Armadillo-like helical
Location: Cp4.1LG02: 4340715 .. 4352776 (+)
RNA-Seq Expression: Cp4.1LG02g01820
Synteny: Cp4.1LG02g01820
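
The Location field in the overview can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re

def parse_location(location: str):
    """Split 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG02: 4340715 .. 4352776 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 12062 bp on the plus strand of Cp4.1LG02.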
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAACTCCCTTTCATAAAATACTTTTCTTAGGCGGTGGGCTTCGTCTTCCTCAAGGAAAAAGCCGACTGAAAATTCTCTGCACAGTCCCTTTCAACTATCGTTTCTTTCCATTTTTTTACTTCCACACGAAAACTCTGGGCATAGAAATGGAGGAATTTGCTGTGGATGATCCGACTCAGCTACTTGAAGCAGCTGCAGATTTCGCAAATTATCCCGGTTAACCATCTGATACATAATTTCAACTTCAAATTCTACATATTCTACTGAAAATGCATGAGAATCTGACATCGTTAATCATTTCCCATTTGTGAAGGTGTTCGGACTGATGCGTCGGTGAAGGAATTCTTCAGCCGCTTTCCCCTTCCCGTCGTAATCAAGTATTTATACACGATATGAAAATCGAAGCATCTCTTGTTTGATTGATTGTTTTTCAATTTTCTTTTATGTATTTTCATTGCTCTTTTGGTTCTCTTTGTTGACGTGGTGTTGTTTTCTTCTTCAAGTGCTTTACAAGCAAAAGCGGAAATTCCTGGTTTGGAAAACACTTTGGTTGCATGTCTCGACAGGATATTCAAAACCAAGTATGGTGCTTCACTTATACCACATTATATGGTATGAACACTTCCCTTGTATTTTTACTCATGTTGGGGATGTGCTGAGTTTTCTGATACTACAGTGAATAGGGTGCCATGGGGGAGGTTTTTTTGACATTTAATTCTTTACATGTTTATCGCTATTGCGGGCCTAGTTGATTGAGTTTAAATTCATTGAAATCCCCCATTATTTTGTTCAACTGCAATAGTAATTCCAATCATGTTATTAGAATCCCTGGATGTGCCTCATGGTCTGTCCGTTGCATTGATTGTTTCTATACCCCCTCCTCCTGCAGCCCTTTGTACAGGTTGGACTACAAGCAGATTCTCAAGCAGTTAGAGGCTTAGCTTGTAAAACGGTAGGTTTTTGTTGCTGCTATGTTCAACTTAAACTAGTTATCCCTCTTTCTTGTCAATTACTTTTCCCTTTCTTTTTGAACTTCATTGTTTGTTATCACTATGCTAATTATAGCCTTCTGTGTTCTTAAACTAAGAATTCTAAATATCTTCAGGTCACTCGCCTGCTGGAGGAGACCGATCCGACTACTCAGTTGGCCCCACAACTTATTGTTGACTATAACATCTATCCACTTTTGATTGAGTGCCTTCTCAATGGGTAAGAACACATACGGATGATTGTGAATTAAACTATGAATGTGATTGAGCCTTGGATGGTACCAGTATAGTGCCTGTGTGTTTTTAATTAAATACTTATTGTTCATACATTTTTTCTACTTAAAACCTTACTTTAAGTTTGAAACTATAAATAGCACAGCTTTCTCGAATCATTAAAGAACTGTCGAATTATTCATCCAATACACACTTCTTTTACAACTTAGGAACATTTTGTTAATGCTGCAAAACTGTTATATAATTGAATTCTATGATTCCTTTATGGGTCTAGGGGTTCTCAACCCCACGAACTCCCCACCTGTCCCTCAGCTTCTTTGTGTTAAGCATCATTGACCTGATAAAACTCCTAAACTATGCAAATCTATTTAAAGCCCTCAATATTGGAACAAATGACGGTGTTCCTTGCATTTTCAACGTTCATGTTGTTAACTCCAAAAAAGGAAGAAAAAAATATCAGCTTTCTTTAGATTATGAATCCGTTAGTTTCATCTGTTTTTATGTTGGTGTGGAACTAAGCGTGAGGTTGTCACTCTTTTTTGTAGTAACGAACAAGTTGCTAACTCATCAATGGATGCAATAAAGAAATTAGCTGCATTTCCAAAGGGGATGGTACATATAATATACTATAGCGGTTATTTATCTGCTCAATATGGTTAACTTGTCCTGCATCTTGTTACAATAAATATTTTTTTGCCAAAACTAACTGTAATTTATTGCTCTGTAGGAAATCATCTTCCCAACAAATAAAACGGAAGCAACACACCTAGGAACTGT
AGCTTCAACATGCTCATCTCTGGTATTGTAGCGGGTACTTTGACTTTGAGATATTTATTTTATTGTGCAACACTTAATGCTTGCTGAATATCATTTTAAAGAAAAATTCCTCCAACCATGCTCTCTTTTAATAGAGCTTATGTTTATTTATTTATTTATTTTCCCTCAAGAAGTTTAACGTGGTCTCTTTTTAGCATCAGTGCCATTTTTTCAATTTGTACTTGTTTTCTTATCATTTCTCTACGACGGGGATCCATCTTCCTTAAGAAAATATTCGAATTCTTAGCCAAATTTTAAAAACAAAAGGAAGCTTTTGAAAACTGCATTTTGTAGTTTTTGAAAACACCTGTAAAGAGTAGATAATGAGGCTAATTTTTTTTTATCAAACAGGGCTTTAGTTACTAATAATTTGTGTTTCTGATTCTTATTGAAATTAAAACTTCCCAGGGAAGAGTCCGAGTTATGGCTTTGATAGTGAAACTGTTTTCAGTTTCTAGCTCTGTGGCATCTGCAGTATACAATTCAAATTTACTAAACCTACTGGAAAGTGAAATCAGCAACTCAAACGACACACTTGTAACTTTAAGCGTGTTGGAGCTCTTGTACGAGGTTTACTATCTCCTACTCCCTTTTTCTTGTTCACTAACATGTGTATCCCCCGTCTGTTTTTATTGTTGGTCTTGAGAGACATTAGTAAAAATGGTGTTAGAAATTATGCTATCCCTTCAGTTTTTTTCTTTCTTTTTGTAAATGTTATTAAACTCGTACAAGTTGGTTACTGTGATATCCAACAACATATTCTTAACATGAATTTGAAACAGACATGATTTGAATTGTCATGGCTGAATGAGCTGCCTCTCTGACTTCCTGTTGTTAGTTAAGAAGCAAGATGGTAGTTGCTTAGTGGTTTGATCATGCAAGAAAAATCATTATGGTCGATCATTGTAGATGAAGTGGAATGGGCAATTTGACACGGCAATGTAGCCTTGCTTGCTTTTGGCAAAACTAACGCAAAGAATTTGTTTAGAATGTTATTAAAATAATTGTTAGGAATAATATTACGATGTTAGGGTTATATTGATCATTAGATAAAGAGTTCGCTAGCAGGGTGGTTATAAATAATGGAGGTGGGAAGGGTTTGTAAGTCATGCATGTTTTTGAACTGGTCTTGTGAGCCAACTAAGCCTCTTTAATTCTTCAGTAACTTCATAATTTTTTCTCGACATTCCAATAAATTTGCAGTCCATATTTTCTTGATTTATTATCTCCAGATTGGGAATCTATTAGAATTATGGCCAGGATATATTTTTACTCAAGCATGCCTGATAGAGCTAGAGCAATCATCATGAGTTACACCATGCTCTCTTTTGTTCTTCCTTTGTTTTATCATCCCACCTTGAAAATTTTCATTCTGATATAATTGATTTTCTTTTTATGCAATTTATAACATAATTTTATTATATTTCCAATTGATGAAGTTAGTGGAGATTGAACATGGTACAAATTTTTTGCCAAGGACCAGCTTTCTCCAACAACTAAGCTGTATAATCAGGTTTTGAAGAGAGAATTATAAACTTCAGTTATACAAGCTAGTGGTCTGACTTTGGATATTGATGGTGTTCTTAATTTTTTTGTTGTATATTGCAGCAACCGGTCGGCAGAGTCTATATTAAGATCCAGAGCAATGGTCATTAGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTCGCTTGTAGATGAATCTTGTAAGCAATAAGTTTACTGCATTTCATTTGATTAGAAAAAGGTGAATTAGATTGAATTAGATTAACTTGTCAAATTTCAATTTTCACATTCCAAACCTCTTCCCATCAAGTTTTACTTGGGACACTCGTATCCCTTGGACCTATGATTACTCGTTTTGGCACCATGGATATACAAGCTACATATAGATCATGAAGATCACATCTATATATAACCTATGTGAAAGAACTAGGAAGCATTACACTGTTTTAGCTCTG
TTTTAGGTAGAGCTTCTGTGTCAAAACACGTGTTAAAAGATCACCACTAGTCCAATAGGGTCAAGCCCAATAATTGTATGAGCCCATTGTCTAGATATTTTGGAAATATAGATGTCTAGATATTTTAAGAATAAAAATGTCCACATATTTTAGGAAAAAGAATATGTAGACATTTTAGGAATAAATCTAGAATATCAATATGGTCTATATTGTCTAGCTTCTACTAAAAATACCCTTTCACCCTACTAAGTTTTTCATCCCAAGAAATTTCAAGCAAAGCAAAGTGTAGTAGTTTCTACCGAGTGTCTTGTAGTCTCTTTTCGATTGGTCTTAATAAAGAGTGTGAATGTTTTCCATAAAAGAGTTGTGTTCAATATGTTGATCATGCACATGTTCGTCTGGACACACTCCGAACATGTCTGACATGCTAAAAACATGGGTAATATTTATTCTATTTTCTTTTTATCTAGTTTTGGACACAGCTAAAAATAAAGCCAGACCTAAAAAGTTTATAAAAGAAAAGGAGTGAACGACCTTACTTGACCTTGATCTGTTTTATAGGTTATACAATTGTGTCCTTTGAGTTCTAAAAGAAAAGGAGAGAGGATTCTATAACAATTTCTAATCTTTTAGGATGACATCTTCTTCATCACGCTATGGCCAATTAAATTCTTCTATATTTTGAAGCTCTTAAAATATAGTTAGTTGACTCGACTTTCTATGTTTTTATCTATGATGAAGTATAATAAAAAAACTGTCCAGAACATATCTCTGTCCTACATTTTTAGAAATCGATGTGCCTGTGTTGTATTGAGTCCATGTTTCTTAGAGAAGGAATCATGTGCCTTTTATTATATTTTCAATTCAAGTTTTGCATAAATATCATTTAGATAAAAAATTACAAAACAAATTGCAAGGCTAGGTAGAAAAATTGCACGAACAATGTCCTACCCAGGTTATGAGTTGGATTATTTACTTTTACGAGTAAGGGTTTTCTAGACTGAGAGCCTTTTATGATCGATTTTCTTGCTGTACAAATTCAAAAATATGAAAATCACTTGAGAGATTAGCACAATCTAATTGATTGTGTTGAAAGCAGTTGACAGTAATCTATAATCACCTAATACAGGTGACCCTATTGCCTAATTTAGTCTTCTGAATCTGTAAATTTTGTCATTATCAGTAGGACAAAGAAATCTTTTGATAGTTTCTTGTTTTGCTTTGATATTCTCAGGTGTACGAATTTTAATATCTGCTATAGATGAAATTCTTGGATCATCTGAAGGCCAGGATGTAAATGTATGTGAATCTGCATTCGAAGCACTGGGTCAAATTGGTTCGAGTAAGTATAGCTACTGCCACTTGTAATTTAAGATTTATTTATTTAATGCGAGCAACTGGATTGTGATTTTCTGTTACCATTTTTATAATAAGAGATAGAGTGAAGTTGGCATGTATTTCTAAATGCATGAGGACTCCAGAAAATAAACCTCGACCTCTTTAGAACATGTTAGATATTATCTTGTTTTTAATTTTTTCCCTTCCTAATGGAAAACTATGAACATGAAAGCTCGTATATACCCACTAAGATAAGTATTGGATGCTTAGAAACTATTGGCATTTTGGTTAATAATATCTATGATCTGATTTTGACTGTTACAATGAGAATGCAATGTACAGTAACTATTAAGGTTTCTACTTTTCAGAAGGGGTTTGTAGTAGATATCTTTTTCTAATAGGAAACCATAACATTTCATTGATATGATGAAATTACTTAAGGATAAGTTCCTATCCAATGAATTGCAAAAAACTTGCCCAATTGGTCGTGAGAGAAGATAAACTATAGGTACAAAAGGAGCAATCGATTTACACTAAGAGAGAGCCAAAAAAGTAATAGATGTCCGAAAGGAGTGTATGTTTAATTTCTTGTCTTTAAAGATATGGGAATTGCGCTTGTTCCAAGTAGACCAAAAGGAAGCCCAGATAAAATTCAGCCAAAG
GATTTTCATTTTCTTTTTTGAATGGATGACCATTGAACACATTAGAAAGGAATGCTGATGATTTTGAGGGAATAGGAGAAAAGGATAGATGGTAGATGGGGAGATAAGCCAAAATAGCTTGTATGAGAGTCAGTCGACCTTACTTTGAAATATGAGCACATTTTCAACTATTGGCATCCAAAAGGTATTGAGACTTTGGCTTACCATTCAGAGGCAAACCCAAATAATCAGCAGGCCACTTTCCAATTTTGCAACCCCATTTCGCTGCCAATGCCTCTTGCAAGAGAAGTATCAAGATTAATCCCCAAAAAGTCAGTTTTCTGACAATTAATATTCATCCCAGATGCTTCCTTAAACTCATTATTGGTGTTGAACAAATTATTAATGTGATAGACATCAGGGGATGGAAAAAAAGGTGAGTAATCGATAAGCAAAGATCCACCCTGTCCACTTCAAACCCTTTTAATCAAACCTTTCATTTTCGCGAGGGAGAGAATTCTGCTAAGAGCGTCCATAACAATGGTAAAAAGGAAGTGTGAAGTACACCTTGTCTTAGGCGTCGAGAAGCAATGATCTTACCAAGGGGTTTCCACTGATAAGAGGGAGAAGTTAGTAGATGGCATATAACCCCTATTCCACCTCCTCAGATGCCCAAACCCTTTGCTTGCAGAATAAGGTCTAGGAAATTCCAATCAACCTTGTCAAAAGCCTTTTCAATATCTAGTTTGATCACCATAACCGATTCCATCTGTCAATGTATCTCATCTATTCGCTCATAAGCAATCAAAGATGCATCAATAATCTGTCTATTAGCCACAAAAGCTCACTGATTATCTGTTTGACCCCAGTGATTTGTTAGTGCTCAATTTATTAACAAAATGAATGTCTTCTTCGGGAAATGGTGCCTCCAAAAACATTTATTTTCCGTGGATTCTTCTAGAAACGAATTTTCTTATAATTTTTTTTTACCTTTTTTATTAAACATTTTAGAAAACAAGAACAACGGTTATCAAACCCGTTTCTATTTTTTTAATTATTTTTTATTATTCAATAAAAATGAAAGATCAAGAAACAAAATACAGTTAACAAGCGTAATTGATAATTCTATTTTTCTTTATTAAAAAAGTGGAAACGAGTAACAATAAATGGAAGTTACCAAACGTGCCTGAAGGTGTTTTACATCATTTTAACATTTTATTTAGACCTTTATAAACATTTCATCCCCAAAAGCCAACATGTATAGATCAAATTTGTCTCAGGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTATCCAACTTGTGTGAAGTATGTAATTAACGCAGCGTTTGATCGGCATGAACATGGTAAACAGCTGGTAAGTATGTTAAGCTCTTTGACTTATATTTGCGTTTATGCCATCGTTTTCTACACCCCAGTTTCTTTCGTTAAACCATTTGGGAAGGTAAGTATATGGTAAAAAAACTAGATGTTTCAGCTTGCTTTGTAAATGATTAAGTTTGGAAAATCCCTTGGAAGCTAGACATTTCTCGGATGACAAAATTCTTCGCTAAAGTTCTTGCAAGGCTTATTAGTGGGAGAGTAGAGTGATAATGTTTTCAACAATGATTAATGTGGAACAGTGGTATCTTCCCTGCTATGCCATGGTTTTAACAGAAAATATGGAATTATGAATATTTTTTGGATTGTAAGGTTGTAAGTGTTGCTATTCTCTTTTTGCACTGGTTTTCTATTTATCAAAAATCATTAACTCTTTATCTTTTGTGTAAAGCGTTATATTGACAACGGAAAGGATTTTGTTTTAAAAAGATGTTTGTGATAGAATGTGAACTGAAATATGCCATCGCCAATCATTTACTCAAACCTAGGCGAGTATCTAATGATATGTGCTTTATGTACCTATAATATTTCACTTATATTTCAACTGTCAATACAGGCAGCCATGCACGCTCTTGGTAACATCTTTGGTGAAACTCGATCTGAGAATGATGTTCTGCTGA
ATGATAATGCAGAAGAAAATTTACGGGACTTAATTTATCAAACTGCATCCAGAAGTCCAAAAATGACGCCATCAGTGAGTGAAGGGTTTAGGGTTTATTTCTTTTGATAATATTTTCCTGCTTCAAGTATGTTTCCTATCTAAAGCAATTTGGAATTGGATATCTAATCTGCTTTAGTTGCAGAATATATGTCATTCCAATTCAAGCCAATCCTTCTGCACTCACTTGACAAGATTAGACATTCCTCTGTTTCCTTACATGTTAGAATTAAAACAAAATGCTTAAGCTGATGGTAATGGAGTTAAGTTCATATATCAAACATGTTGGTCCACAGGCTTCGAAGTGTAACAGATTCTTGAAATAAGAATTGCTGTCTGTTCGTGAAATTAATTTGGTTGCTAAGCAATGTTTTAGAAAACATTCCTCAAGGCAACCTCGAGAAGCAAGACATAAGCCTTGAGGCAGTATGAGATGTAACATTACAATAGTACAAAACATACTCGAAAATTAATAACTATAAAGGAAAATATCTCAATAGTCCATCTATATTCATTACTACATCTAACTTGCTTGTGTTAATAAGAATCTAATCAAAATGAAAACAAAGGTAAAATAGGAAAAAGAATAAAACTCACCGTGAACAAATAAAAACAAAAAGAGAACAAGAGTGAAGGGGAAAAATAAAACAAAATTCACCTTCTTGTGTTGTTGTCAAAGATTTAGCTACCTTGCAGTTTTTTTTCTTTGGTTAATGCAGACACTCGCAGTTAGCTCTATTTTAAAATAACACAGCAAGGTGAAAACTTTTTTGTTTGTTAAGTAAACCAATTGAACTGCTTATGCCTTCTTAGTTTTCAGTGCATTTTTAAGTTCTCTCAGATATTTTCATGGCCCTATAGCTCACTACCAATGAACACAAGGCTTATGCCTTGTAGCCTCAGAGGCGTTCATGGACAGGCGTAGGTCTTTTTTTATGTGTTGAAGGCATAAGCCTTGAGACGCAATGTTTCCGCCTCACCCTTGAGGTGCAATATTTCCTCCTTACCTCAAGGCTTATGCCTTGAATTGAACGCATTATAAAGCATTGTTCCCAAGATATATGATGTGATGCTTCATACATTACTTGGTTATCACTGTTGTTAAGCCTTATGGAAAGATGTTTTTAGTGTTCTTTTAGAGTCTGGAGGGTATCATCCTTTTTCACTCAAGACTGCTTCAACTGAACAAATTGACCTATTTTCACTTAATTATCTTCAGCAAGCAACAATTTTGACATTGTTCTTCATTTCTTTCATGATCATAGTTTTTAAAAGTCTTAACAAGCTGCTATGGCAGGGCCTTATTCTAGCTGTCCTTCAACAGGACTCTGAGATTCGCTTGGCGGTGAGTGTTTCAGTCATTTTTTGTTACTTCCAACACTCTTTACCTGTCTTTTAACTGGCCTTAAAGAATCTTGATCTTAGTTTTGTAGAAAAATTCCCAAAAGATAAAAATACATGATATTGGTCAAGCCAATACTTAAATTAAAAATAGTCTGCAAGGTGCAAAGAATTCAAAATGTATTGTTCACGATGATTTATCATCCTTTATTATTGTGGATGGGGTAACTTGCAACCTTAGCGTATGGTCATTGGCATGGAGGCCTATTTTTCATAAAAAATGAGAACGCAGATGCTAAAAAACATGGATATGAACGCATAGAAATGGGTGCATGTATGAAGATTTGATATAGATATGTAGTACTTTGCGTTTTTTCATGTAAATTAGGCATAGCAAGACATTTTGATGTTTAGAGGATAAATATCTTATTTAGAAGCACATCCTATAAATTTGCATCAAATGGGTCTTCCTTGGTCTAGACAATTTTCTTAGTTTGTTTTTTGTTTAGGAAGTTTCAACTTCCAGTAAGAAGGTTGCTTGGTTATGGGTTTAGTATTATCCTGGGCAAGAAACTATGTGGGTTCTAGTTTCCTATGTATTTAAATTTGTATGACTTGCTA
ACACAACTTTTTCTTCAGAGTTATAGAATGATAACTGGGTTGGTCGCTCGACCGTGGTGCCTTGTGGAAATCTGCTCGAAACAAGACATAATAAATATAGTTTCTGATGCAAGTACCGAGACTACAAAAATAGGTATATGTGCTTGCTCTCCCTTGCATCCATCTTTTGAAGTAAATCAGAGATTTTTTTCATGATTGTCGTGTATGCATCATTAGAAATCCCACGGTCGCTACAAGCATAAATCTTACTTAATTAGTTATTTATCTCAGGAATTTAAACAGTATTATAATTAAGCGAGCATTCTCCTCCTTTTATGGTGTTCCAGTTAAGGGTTGTTGATTTCTTTAGTTGCGACATAGTTAAGCATTCTCAAGCTTTTGTATTAGCCGTTGGCATGCAAACCTGCAATGTTCTTCAGATGTATAGAACATTAGTTCTTCCAGTGTCACAGTTTCAAACGAAATGTTCTAAGCATGTTCAGAAAAGAATTATTGTTGATGGCCTATGCTCATGTTCAATTTTAACTTCCTTTCTTCAAATTAATATCCAAAATCTTGCAAATCATATTCACCCACTTGTTAATAACAAGTTTGCTGAATATATTGCTTTCTAGACCCTGTGAAAGCAACCCTACGAACTCCAAGCAATTCGAAATATGCTGTTGGGTATTAGAATTGCTCCCATTGATCAAATCCCTTTGCTGTTCTTGTCTGTTGTTGGATCAAAGACATTTATTAGCATTTATTTCTGAATCATATCTTGGTGTCGTTAAGTCTTTTTTGTAGTCTAATTATTTCAATACATTGACATGCATTATATCATTATGAAAAATGTAATGTAGGGTGCCGCTATTTTTTGGTGTGCATTTCAGGAGTGTCTTTCATGATGCAATTCTTCATGTATGAACGCATTTCTTATTGTATGAATCTGCTTTTGTGAGATTCATAAACTCTTTTGAAACAAATATTTCATGCACGGGTTGGCATGTAATTAGATAGATTTGAGCGATCCTCCCCCTATGTTTGATGCTAATCTCAATCACATGGTTTATATTTTATTATAATTTTATATATAATTTTTTTTTTCATTTTTTTTTCAGGAATGGAAGCTAGATATAACTGTTGTTTGGCTATCCATAAGACATTCATGTCTTCAACAAGGCTTACGGGCGATCCTGCCCTTGCTGGAATAGCTTCGAAGGTTAGATGGGATGCATAAATATTTCATCCTTTTCTTCAACATTTTCCTTTTCAAGGCAGTTCCTTCTAGTAGTGCTCACAAAACTGGCTCTGGTTGTATGGCTCATGTCTGATTAAGAAAGAAAAATCTAATTGATATAATATGTGTATGGCTATGCAATAATACCTTAAAATAGGAAAGGAAGGGAGAAAAACAACAAAAACAAAACTAAGCAGATAGCCCGGAAAGGCCATCTCCATGTTTTCAGTTTATCTTGTATACATCCTTTTATTGTTTTTTTTATAATTAATGTGGGGTGAAGATTTGAACATCTAACCCCTTGATCGACGTTATATGCTTTATGCTGTCTCAATGTCTGAATCAAGCCTCTTGTCGTCTTACTTGGGAAAAAAGTAGTGTGTAATTCTGCTGTTTGAGTTTGAAAATTTGAACTTCGTAGTGGTTATATGCCTGCTAATCGTTACACGGGTTTGTCTGCAGTTGCAGGAAGCTGTTCAAAATGGTCCATATCTTAGTAGAAGAAAACTGGAAACTCAACCAGCAATAATGACAGCTGAGAGATTTTAGGAGTCTAGTATGGATCAAAAGTAAGCAAGCCTTAGATACAATGGTTCGTAGGGTAGTTGCATCTTTGGTTTCTAGTTGATCCGAGGAGTTCACAGAAGAAGTTGAAACTAGATTGAGCATTTTCCATGGATTGTGTGCACTAATATTTCTCTACTTTGTTCAGTTTCCTGAATGGATTGCTTAGTCAAATTTGCTTGGGACAATGAATTTGTTTAAGTCAATTTTTTTTTTT
AAATATTTCATATAAAACTTTTTTTTAAGAAAAAAATTAATCCAATTGTTTGAATCACTTAT

mRNA sequence

CAACTCCCTTTCATAAAATACTTTTCTTAGGCGGTGGGCTTCGTCTTCCTCAAGGAAAAAGCCGACTGAAAATTCTCTGCACAGTCCCTTTCAACTATCGTTTCTTTCCATTTTTTTACTTCCACACGAAAACTCTGGGCATAGAAATGGAGGAATTTGCTGTGGATGATCCGACTCAGCTACTTGAAGCAGCTGCAGATTTCGCAAATTATCCCGGTGTTCGGACTGATGCGTCGGTGAAGGAATTCTTCAGCCGCTTTCCCCTTCCCGTCGTAATCAATGCTTTACAAGCAAAAGCGGAAATTCCTGGTTTGGAAAACACTTTGGTTGCATGTCTCGACAGGATATTCAAAACCAAGTATGGTGCTTCACTTATACCACATTATATGCCCTTTGTACAGGTTGGACTACAAGCAGATTCTCAAGCAGTTAGAGGCTTAGCTTGTAAAACGGTCACTCGCCTGCTGGAGGAGACCGATCCGACTACTCAGTTGGCCCCACAACTTATTGTTGACTATAACATCTATCCACTTTTGATTGAGTGCCTTCTCAATGGTAACGAACAAGTTGCTAACTCATCAATGGATGCAATAAAGAAATTAGCTGCATTTCCAAAGGGGATGGAAATCATCTTCCCAACAAATAAAACGGAAGCAACACACCTAGGAACTGTAGCTTCAACATGCTCATCTCTGGGAAGAGTCCGAGTTATGGCTTTGATAGTGAAACTGTTTTCAGTTTCTAGCTCTGTGGCATCTGCAGTATACAATTCAAATTTACTAAACCTACTGGAAAGTGAAATCAGCAACTCAAACGACACACTTGTAACTTTAAGCGTGTTGGAGCTCTTGTACGAGTTAGTGGAGATTGAACATGGTACAAATTTTTTGCCAAGGACCAGCTTTCTCCAACAACTAAGCTGTATAATCAGCAACCGGTCGGCAGAGTCTATATTAAGATCCAGAGCAATGGTCATTAGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTCGCTTGTAGATGAATCTTGTGTACGAATTTTAATATCTGCTATAGATGAAATTCTTGGATCATCTGAAGGCCAGGATGTAAATGTATGTGAATCTGCATTCGAAGCACTGGGTCAAATTGGTTCGAGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTATCCAACTTGTGTGAAGTATGTAATTAACGCAGCGTTTGATCGGCATGAACATGGTAAACAGCTGGCAGCCATGCACGCTCTTGGTAACATCTTTGGTGAAACTCGATCTGAGAATGATGTTCTGCTGAATGATAATGCAGAAGAAAATTTACGGGACTTAATTTATCAAACTGCATCCAGAAGTCCAAAAATGACGCCATCAGGCCTTATTCTAGCTGTCCTTCAACAGGACTCTGAGATTCGCTTGGCGAGTTATAGAATGATAACTGGGTTGGTCGCTCGACCGTGGTGCCTTGTGGAAATCTGCTCGAAACAAGACATAATAAATATAGTTTCTGATGCAAGTACCGAGACTACAAAAATAGGAATGGAAGCTAGATATAACTGTTGTTTGGCTATCCATAAGACATTCATGTCTTCAACAAGGCTTACGGGCGATCCTGCCCTTGCTGGAATAGCTTCGAAGTTGCAGGAAGCTGTTCAAAATGGTCCATATCTTAGTAGAAGAAAACTGGAAACTCAACCAGCAATAATGACAGCTGAGAGATTTTAGGAGTCTAGTATGGATCAAAAGTAAGCAAGCCTTAGATACAATGGTTCGTAGGGTAGTTGCATCTTTGGTTTCTAGTTGATCCGAGGAGTTCACAGAAGAAGTTGAAACTAGATTGAGCATTTTCCATGGATTGTGTGCACTAATATTTCTCTACTTTGTTCAGTTTCCTGAATGGATTGCTTAGTCAAATTTGCTTGGGACAATGAATTTGTTTAAGTCAATTTTTTTTTTTAAATATTTCATATAAAACTTTTTTTTAAGAAAAAAATTAATCCA
ATTGTTTGAATCACTTAT

Coding sequence (CDS)

ATGGAGGAATTTGCTGTGGATGATCCGACTCAGCTACTTGAAGCAGCTGCAGATTTCGCAAATTATCCCGGTGTTCGGACTGATGCGTCGGTGAAGGAATTCTTCAGCCGCTTTCCCCTTCCCGTCGTAATCAATGCTTTACAAGCAAAAGCGGAAATTCCTGGTTTGGAAAACACTTTGGTTGCATGTCTCGACAGGATATTCAAAACCAAGTATGGTGCTTCACTTATACCACATTATATGCCCTTTGTACAGGTTGGACTACAAGCAGATTCTCAAGCAGTTAGAGGCTTAGCTTGTAAAACGGTCACTCGCCTGCTGGAGGAGACCGATCCGACTACTCAGTTGGCCCCACAACTTATTGTTGACTATAACATCTATCCACTTTTGATTGAGTGCCTTCTCAATGGTAACGAACAAGTTGCTAACTCATCAATGGATGCAATAAAGAAATTAGCTGCATTTCCAAAGGGGATGGAAATCATCTTCCCAACAAATAAAACGGAAGCAACACACCTAGGAACTGTAGCTTCAACATGCTCATCTCTGGGAAGAGTCCGAGTTATGGCTTTGATAGTGAAACTGTTTTCAGTTTCTAGCTCTGTGGCATCTGCAGTATACAATTCAAATTTACTAAACCTACTGGAAAGTGAAATCAGCAACTCAAACGACACACTTGTAACTTTAAGCGTGTTGGAGCTCTTGTACGAGTTAGTGGAGATTGAACATGGTACAAATTTTTTGCCAAGGACCAGCTTTCTCCAACAACTAAGCTGTATAATCAGCAACCGGTCGGCAGAGTCTATATTAAGATCCAGAGCAATGGTCATTAGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTCGCTTGTAGATGAATCTTGTGTACGAATTTTAATATCTGCTATAGATGAAATTCTTGGATCATCTGAAGGCCAGGATGTAAATGTATGTGAATCTGCATTCGAAGCACTGGGTCAAATTGGTTCGAGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTATCCAACTTGTGTGAAGTATGTAATTAACGCAGCGTTTGATCGGCATGAACATGGTAAACAGCTGGCAGCCATGCACGCTCTTGGTAACATCTTTGGTGAAACTCGATCTGAGAATGATGTTCTGCTGAATGATAATGCAGAAGAAAATTTACGGGACTTAATTTATCAAACTGCATCCAGAAGTCCAAAAATGACGCCATCAGGCCTTATTCTAGCTGTCCTTCAACAGGACTCTGAGATTCGCTTGGCGAGTTATAGAATGATAACTGGGTTGGTCGCTCGACCGTGGTGCCTTGTGGAAATCTGCTCGAAACAAGACATAATAAATATAGTTTCTGATGCAAGTACCGAGACTACAAAAATAGGAATGGAAGCTAGATATAACTGTTGTTTGGCTATCCATAAGACATTCATGTCTTCAACAAGGCTTACGGGCGATCCTGCCCTTGCTGGAATAGCTTCGAAGTTGCAGGAAGCTGTTCAAAATGGTCCATATCTTAGTAGAAGAAAACTGGAAACTCAACCAGCAATAATGACAGCTGAGAGATTTTAG

Protein sequence

MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTLVACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQLIVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTCSSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVEIEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
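
The correspondence between the coding sequence and the protein sequence above can be spot-checked by translating the first codons of the CDS. The sketch below assumes the standard genetic code (the record does not name a translation table) and uses a hypothetical helper `translate`; only the first 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt of the CDS listed above.
cds_start = "ATGGAGGAATTTGCTGTGGATGATCCGACT"
print(translate(cds_start))  # MEEFAVDDPT
```

The output matches the first ten residues of the protein sequence. The last codon of the CDS, TAG, translates to `*` in this table, which is why the 1578-nt CDS yields a 525-residue protein.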
Homology
BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match: XP_023525525.1 (uncharacterized protein LOC111789113 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1008 bits (2605), Expect = 0.0
Identity = 525/525 (100.00%), Positives = 525/525 (100.00%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM
Sbjct: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
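
The percentages on the identity lines of these BLAST reports are the match count divided by the alignment length. A small sketch of that arithmetic, using counts from the hits on this page (the function name `percent_identity` is hypothetical):

```python
def percent_identity(matches: int, aligned_length: int) -> float:
    """Percentage of aligned columns that are matches."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * matches / aligned_length

# (matches, alignment length) pairs taken from the reports on this page.
for matches, length in [(525, 525), (518, 525), (517, 548)]:
    print(f"{matches}/{length} = {percent_identity(matches, length):.2f}%")
```

For the 517/548 hit the denominator is larger than the 525-residue query because the alignment length counts gap columns as well.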

BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match: XP_022940916.1 (uncharacterized protein LOC111446360 [Cucurbita moschata])

HSP 1 Score: 994 bits (2571), Expect = 0.0
Identity = 518/525 (98.67%), Positives = 521/525 (99.24%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525

BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match: KAG6607893.1 (26S proteasome non-ATPase regulatory subunit 5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 993 bits (2566), Expect = 0.0
Identity = 517/525 (98.48%), Positives = 521/525 (99.24%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQA+SQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQANSQAVRSLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLV RPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM
Sbjct: 421 EIRLASYRMITGLVPRPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525

BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match: XP_022981236.1 (uncharacterized protein LOC111480433 [Cucurbita maxima])

HSP 1 Score: 986 bits (2550), Expect = 0.0
Identity = 513/525 (97.71%), Positives = 519/525 (98.86%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTD SVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDESVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDA+KKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDALKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           S+IDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SSIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGETRSEND++LNDNAEENL DLIYQTASRSPK+TPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLGDLIYQTASRSPKITPSGLILAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525

BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match: KAG7037420.1 (26S proteasome non-ATPase regulatory subunit 5 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 979 bits (2532), Expect = 0.0
Identity = 517/548 (94.34%), Positives = 521/548 (95.07%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQA+SQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQANSQAVRSLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYE--- 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYE   
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYEVYY 240

Query: 241 --------------------LVEIEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVI 300
                               LVEIEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVI
Sbjct: 241 LLLPDFLFTSMCIPRLFLLLLVEIEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVI 300

Query: 301 SGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATL 360
           SGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATL
Sbjct: 301 SGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATL 360

Query: 361 LLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIY 420
           LLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIY
Sbjct: 361 LLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIY 420

Query: 421 QTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVARPWCLVEICSKQDIINIVSDAS 480
           QTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLV RPWCL+EICSKQDIINIVSDAS
Sbjct: 421 QTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVPRPWCLMEICSKQDIINIVSDAS 480

Query: 481 TETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQP 525
           TETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQP
Sbjct: 481 TETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQP 540

BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match: A0A6J1FJQ7 (uncharacterized protein LOC111446360 OS=Cucurbita moschata OX=3662 GN=LOC111446360 PE=4 SV=1)

HSP 1 Score: 994 bits (2571), Expect = 0.0
Identity = 518/525 (98.67%), Positives = 521/525 (99.24%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525

BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match: A0A6J1IVZ7 (uncharacterized protein LOC111480433 OS=Cucurbita maxima OX=3661 GN=LOC111480433 PE=4 SV=1)

HSP 1 Score: 986 bits (2550), Expect = 0.0
Identity = 513/525 (97.71%), Positives = 519/525 (98.86%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFANYPGVRTD SVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDESVKEFFSRFPLPVVINALQAKAEIPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           IVDYNIYPLLIECLLNGNEQVANSSMDA+KKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDALKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           S+IDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SSIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGETRSEND++LNDNAEENL DLIYQTASRSPK+TPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLGDLIYQTASRSPKITPSGLILAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
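
As a reading aid for the HSP summary above: the Identity and Positives percentages are the counts of identical residues and of positive-scoring substitutions divided by the alignment length (here 513/525 and 519/525). The Python sketch below is illustrative only and is not part of the database record. The function names `tally_midline` and `percent` are hypothetical, and the sketch assumes the usual BLAST midline convention: a letter marks an identical residue, `+` marks a positive-scoring substitution, and a space marks any other column.

```python
from __future__ import annotations


def tally_midline(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST midline segment.

    Assumed convention: letters are identical residues, '+' is a
    positive-scoring substitution, spaces are mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include the identical residues plus the '+' columns.
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * count / length, 2)


# Midline of the block covering query residues 361-420 of the
# Cucurbita maxima (A0A6J1IVZ7) alignment, copied from the record.
MIDLINE = "KQLAAMHALGNIFGETRSEND++LNDNAEENL DLIYQTASRSPK+TPSGLILAVLQQDS"

if __name__ == "__main__":
    ident, pos = tally_midline(MIDLINE)
    print(f"block 361-420: {ident} identities, {pos} positives, {len(MIDLINE)} columns")
    # Header values for the full HSP: 513 and 519 out of 525 columns.
    print(f"Identity  = {percent(513, 525)}%")
    print(f"Positives = {percent(519, 525)}%")
```

For this single 60-column block the tally is 56 identities and 59 positives. Summing the same tally over every block of the HSP should reproduce the header counts, provided the midlines are copied without losing their spaces.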

BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match: A0A6J1CFN3 (uncharacterized protein LOC111010386 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010386 PE=4 SV=1)

HSP 1 Score: 886 bits (2290), Expect = 0.0
Identity = 458/525 (87.24%), Positives = 489/525 (93.14%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEFAVDDPTQLLEAAADFA+YPGVRTDASVKEF  RFPLPV+INALQ KAE PGLENTL
Sbjct: 1   MEEFAVDDPTQLLEAAADFASYPGVRTDASVKEFLDRFPLPVIINALQTKAETPGLENTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGAS IPH+MPF+QVGL+ADSQ VR LACKTVT LLEE+D    LA QL
Sbjct: 61  VACLDRIFKTKYGASFIPHFMPFIQVGLRADSQTVRDLACKTVTFLLEESDNDAVLAIQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           I+DY IYPLL+ECLLNGNEQVANSSMDAIKKLAAFPKGME+IFPTN+TEATHLGT+ASTC
Sbjct: 121 IIDYGIYPLLLECLLNGNEQVANSSMDAIKKLAAFPKGMEVIFPTNETEATHLGTLASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMAL+VKLFSVS SVASA+YNSNLLNLLESEI+NSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALVVKLFSVSRSVASAIYNSNLLNLLESEINNSNDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGT FLPRTS LQ LS IISN S ESILRSRAMVISGRLLSKEN++ LVDESCVRILI
Sbjct: 241 IEHGTKFLPRTSILQLLSSIISNSSTESILRSRAMVISGRLLSKENMYLLVDESCVRILI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SAIDE LGSSEGQDVNVCESAFEALGQIGS+  GATLLLSS+ TCVK +I+AAFDRHEHG
Sbjct: 301 SAIDEALGSSEGQDVNVCESAFEALGQIGSTNRGATLLLSSFSTCVKLLIHAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNI GETRSEND++LND AEENLRDL+YQ ASRS K+ PSGL LAVLQQDS
Sbjct: 361 KQLAAMHALGNICGETRSENDIMLNDMAEENLRDLMYQIASRSSKIMPSGLFLAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL+EICSKQ+IINIV+DASTETTKIGMEARYNCC+AIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQEIINIVTDASTETTKIGMEARYNCCMAIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SSTRLTGDPALAGIASKLQEAV+NGPYL+RR  ETQPA+MTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVRNGPYLTRRNFETQPAVMTAERF 525

BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match: A0A0A0L246 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G629990 PE=4 SV=1)

HSP 1 Score: 881 bits (2276), Expect = 0.0
Identity = 455/525 (86.67%), Positives = 487/525 (92.76%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           MEEF+V+DPT+LL+AAA+FANYPGVRTDASVKEF  RFPLP +INALQ KAE PGLE+TL
Sbjct: 1   MEEFSVNDPTRLLQAAANFANYPGVRTDASVKEFLDRFPLPAIINALQTKAEFPGLEDTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQ VR LACKTVTRLL+E+D T     QL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQTVRTLACKTVTRLLQESDETALSPIQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           I+DY IYPLL++CLLNGNEQVANSSMD+IK LAAFP+GMEII P+NKTEATHLGTVASTC
Sbjct: 121 IIDYGIYPLLLDCLLNGNEQVANSSMDSIKTLAAFPQGMEIIIPSNKTEATHLGTVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMAL+VKLFSVSSSVASAVYN+NLL+LLESEI+NS DTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALVVKLFSVSSSVASAVYNANLLSLLESEINNSKDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGT FLPRTSFLQ L  IISN SAESILRSRAMVI GRLLSKENIFSLVDESC+R LI
Sbjct: 241 IEHGTKFLPRTSFLQLLGSIISNSSAESILRSRAMVICGRLLSKENIFSLVDESCLRNLI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SA+D ILGSSEG+DVNV E+A EALGQIGSS WGATLLLSS+PTCVK+VI AAFDRHEHG
Sbjct: 301 SAVDGILGSSEGEDVNVSEAAIEALGQIGSSTWGATLLLSSFPTCVKHVIYAAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGE RSEND++LNDNAEENLRDLIYQ ASRS KMTPSGL LAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGEGRSENDIMLNDNAEENLRDLIYQIASRSSKMTPSGLFLAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL EICSKQDI+NIV DAS+ETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLTEICSKQDIVNIVGDASSETTKIGMEARYNCCLAIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SS RLTGDPALAGIASKLQEAV+NGPYL+RR +ETQPAIMTAERF
Sbjct: 481 SSPRLTGDPALAGIASKLQEAVRNGPYLNRRNVETQPAIMTAERF 525

BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match: A0A1S3CKC0 (uncharacterized protein LOC103501781 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501781 PE=4 SV=1)

HSP 1 Score: 877 bits (2267), Expect = 0.0
Identity = 452/525 (86.10%), Positives = 487/525 (92.76%), Query Frame = 0

Query: 1   MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
           ME+F+V+DPTQLL+AAA+FANYPGVRTDASVKEF  RFPLP +INALQ KAE PG+E+TL
Sbjct: 1   MEDFSVNDPTQLLQAAANFANYPGVRTDASVKEFLDRFPLPAIINALQTKAEFPGVEDTL 60

Query: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
           VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQ VR LACKTVTRLL+E+D T   A QL
Sbjct: 61  VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQTVRTLACKTVTRLLQESDETVPSAIQL 120

Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
           I+DY IYPLL++CLLNGNEQVANSSMD+IK LAAFP+GMEII P+NKTEATHLG VASTC
Sbjct: 121 IIDYGIYPLLLDCLLNGNEQVANSSMDSIKTLAAFPQGMEIIIPSNKTEATHLGIVASTC 180

Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
           SSLGRVRVMAL+VKLFSVSSSVASAVYN+NLL+LLESEI+NS DTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALVVKLFSVSSSVASAVYNANLLSLLESEINNSKDTLVTLSVLELLYELVE 240

Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
           IEHGT FLPRTSFLQ LS IISN SAESILRSRAMVI GRLLSKENIFSLVDESCVR LI
Sbjct: 241 IEHGTKFLPRTSFLQLLSSIISNSSAESILRSRAMVICGRLLSKENIFSLVDESCVRNLI 300

Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
           SA+D ILGSSEG+DVNV E+A EALGQIGSS WGATLLLSS+PTCVK+ I  AFDRHEHG
Sbjct: 301 SAVDGILGSSEGEDVNVSEAAIEALGQIGSSTWGATLLLSSFPTCVKHAIYTAFDRHEHG 360

Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
           KQLAAMHALGNIFGE+RSEND++LNDNAEENLRDLIYQ ASRS KMTPSGL LAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGESRSENDIVLNDNAEENLRDLIYQIASRSSKMTPSGLFLAVLQQDS 420

Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
           EIRLASYRMITGLVARPWCL EICSKQ+I+NIV DAS+ETTKIGMEARYNCCL+IHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLTEICSKQEIVNIVCDASSETTKIGMEARYNCCLSIHKAFM 480

Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
           SS RLTGDPALAGIASKLQEAV+NGPYL+RR +ETQPAIMTAERF
Sbjct: 481 SSPRLTGDPALAGIASKLQEAVRNGPYLNRRNVETQPAIMTAERF 525

BLAST of Cp4.1LG02g01820 vs. TAIR 10
Match: AT3G15180.1 (ARM repeat superfamily protein )

HSP 1 Score: 565.5 bits (1456), Expect = 4.6e-161
Identity = 292/520 (56.15%), Positives = 385/520 (74.04%), Query Frame = 0

Query: 6   VDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTLVACLD 65
           ++D  QL +AA +FA+YPG + + SVKEF  RFPLPV+ NALQ   +IPG ENTLV CL+
Sbjct: 1   MEDVNQLFDAAFEFAHYPGAQNETSVKEFLDRFPLPVIFNALQTDPDIPGFENTLVTCLE 60

Query: 66  RIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQLIVDYN 125
           R+FKTKYGASLIP YMP +QVGL+ADS  V+ LACKTV  LLE+ D     + QL+V+  
Sbjct: 61  RLFKTKYGASLIPQYMPVLQVGLKADSAVVKSLACKTVLCLLEDCDTNDVSSVQLVVNNG 120

Query: 126 IYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTCSSLGR 185
           IYPLL++ ++N +++VAN++ + IK LA FP  M +IFP+   + THL  +A+ CSSL R
Sbjct: 121 IYPLLLDYIINSDDEVANAASETIKSLARFPDAMSVIFPSETNDPTHLRNLAARCSSLAR 180

Query: 186 VRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVEIEHGT 245
           VRV++LIVKLFS+S  VAS V  S LL+LLE+E+  + DTLV L+VLEL YEL+E+EH +
Sbjct: 181 VRVLSLIVKLFSISRLVASEVKKSGLLDLLEAEMKGTKDTLVILNVLELYYELMEVEHSS 240

Query: 246 NFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILISAIDE 305
            F+P+TS +Q L  IIS  S     + RAM+ISGRLLSKENI+ +V+E+ V+ LISAID 
Sbjct: 241 EFVPQTSLIQLLCSIISGTSTGPYEKLRAMMISGRLLSKENIYKVVEEASVKALISAIDG 300

Query: 306 ILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHGKQLAA 365
            L S E  D +  E+A +ALGQ+GS+  GA L+LS+ P   ++V+ +AFDR+ HGKQLAA
Sbjct: 301 SLESVEMNDTDAQEAAIDALGQMGSTTKGADLVLSTSPPAARHVVASAFDRNAHGKQLAA 360

Query: 366 MHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDSEIRLA 425
           +HAL NI GETR +++ +++  AEE+LR LIY  A++S K+TPSGL L+VLQQ SEIRLA
Sbjct: 361 LHALANIAGETRPKSNRIVDGKAEESLRCLIYDAAAQSTKLTPSGLFLSVLQQSSEIRLA 420

Query: 426 SYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFMSSTRL 485
            YR +T LVARPWCLVEI +K++IINIV+DA+TET KI MEARYNCC AIH+ F+ S   
Sbjct: 421 GYRTLTALVARPWCLVEILAKEEIINIVTDATTETAKIAMEARYNCCKAIHEAFLCS-NF 480

Query: 486 TGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 526
             DP       KLQEAV++GPY+S++    +P +MT E F
Sbjct: 481 VDDPRRLKTGDKLQEAVRSGPYMSKKHRGARPEVMTGEGF 519

BLAST of Cp4.1LG02g01820 vs. TAIR 10
Match: AT3G15180.2 (ARM repeat superfamily protein )

HSP 1 Score: 552.7 bits (1423), Expect = 3.1e-157
Identity = 293/552 (53.08%), Positives = 386/552 (69.93%), Query Frame = 0

Query: 6   VDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTLVACLD 65
           ++D  QL +AA +FA+YPG + + SVKEF  RFPLPV+ NALQ   +IPG ENTLV CL+
Sbjct: 1   MEDVNQLFDAAFEFAHYPGAQNETSVKEFLDRFPLPVIFNALQTDPDIPGFENTLVTCLE 60

Query: 66  RIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQLIVDYN 125
           R+FKTKYGASLIP YMP +QVGL+ADS  V+ LACKTV  LLE+ D     + QL+V+  
Sbjct: 61  RLFKTKYGASLIPQYMPVLQVGLKADSAVVKSLACKTVLCLLEDCDTNDVSSVQLVVNNG 120

Query: 126 IYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTCSSLGR 185
           IYPLL++ ++N +++VAN++ + IK LA FP  M +IFP+   + THL  +A+ CSSL R
Sbjct: 121 IYPLLLDYIINSDDEVANAASETIKSLARFPDAMSVIFPSETNDPTHLRNLAARCSSLAR 180

Query: 186 VRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVEIEHGT 245
           VRV++LIVKLFS+S  VAS V  S LL+LLE+E+  + DTLV L+VLEL YEL+E+EH +
Sbjct: 181 VRVLSLIVKLFSISRLVASEVKKSGLLDLLEAEMKGTKDTLVILNVLELYYELMEVEHSS 240

Query: 246 NFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDES----------- 305
            F+P+TS +Q L  IIS  S     + RAM+ISGRLLSKENI+ +V+E+           
Sbjct: 241 EFVPQTSLIQLLCSIISGTSTGPYEKLRAMMISGRLLSKENIYKVVEEARPVVPCLKASV 300

Query: 306 ---------------------CVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKW 365
                                CV+ LISAID  L S E  D +  E+A +ALGQ+GS+  
Sbjct: 301 CCAHKTSDEVEKLTTNCFVSECVKALISAIDGSLESVEMNDTDAQEAAIDALGQMGSTTK 360

Query: 366 GATLLLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDVLLNDNAEENLR 425
           GA L+LS+ P   ++V+ +AFDR+ HGKQLAA+HAL NI GETR +++ +++  AEE+LR
Sbjct: 361 GADLVLSTSPPAARHVVASAFDRNAHGKQLAALHALANIAGETRPKSNRIVDGKAEESLR 420

Query: 426 DLIYQTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVARPWCLVEICSKQDIINIV 485
            LIY  A++S K+TPSGL L+VLQQ SEIRLA YR +T LVARPWCLVEI +K++IINIV
Sbjct: 421 CLIYDAAAQSTKLTPSGLFLSVLQQSSEIRLAGYRTLTALVARPWCLVEILAKEEIINIV 480

Query: 486 SDASTETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKL 526
           +DA+TET KI MEARYNCC AIH+ F+ S     DP       KLQEAV++GPY+S++  
Sbjct: 481 TDATTETAKIAMEARYNCCKAIHEAFLCS-NFVDDPRRLKTGDKLQEAVRSGPYMSKKHR 540

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023525525.1 | 0.0 | 100.00 | uncharacterized protein LOC111789113 [Cucurbita pepo subsp. pepo]
XP_022940916.1 | 0.0 | 98.67 | uncharacterized protein LOC111446360 [Cucurbita moschata]
KAG6607893.1 | 0.0 | 98.48 | 26S proteasome non-ATPase regulatory subunit 5, partial [Cucurbita argyrosperma ...
XP_022981236.1 | 0.0 | 97.71 | uncharacterized protein LOC111480433 [Cucurbita maxima]
KAG7037420.1 | 0.0 | 94.34 | 26S proteasome non-ATPase regulatory subunit 5 [Cucurbita argyrosperma subsp. ar...

Match Name | E-value | Identity | Description
A0A6J1FJQ7 | 0.0 | 98.67 | uncharacterized protein LOC111446360 OS=Cucurbita moschata OX=3662 GN=LOC1114463...
A0A6J1IVZ7 | 0.0 | 97.71 | uncharacterized protein LOC111480433 OS=Cucurbita maxima OX=3661 GN=LOC111480433...
A0A6J1CFN3 | 0.0 | 87.24 | uncharacterized protein LOC111010386 isoform X1 OS=Momordica charantia OX=3673 G...
A0A0A0L246 | 0.0 | 86.67 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G629990 PE=4 SV=1
A0A1S3CKC0 | 0.0 | 86.10 | uncharacterized protein LOC103501781 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name | E-value | Identity | Description
AT3G15180.1 | 4.6e-161 | 56.15 | ARM repeat superfamily protein
AT3G15180.2 | 3.1e-157 | 53.08 | ARM repeat superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR019538 | 26S proteasome non-ATPase regulatory subunit 5 | PFAM | PF10508 | Proteasom_PSMB | coord: 64..518; e-value: 2.7E-16; score: 59.2
IPR019538 | 26S proteasome non-ATPase regulatory subunit 5 | PANTHER | PTHR13554 | 26S PROTEASOME NON-ATPASE REGULATORY SUBUNIT 5-RELATED | coord: 2..525
IPR011989 | Armadillo-like helical | GENE3D | 1.25.10.10 | | coord: 6..406; e-value: 9.4E-16; score: 59.4
IPR016024 | Armadillo-type fold | SUPERFAMILY | 48371 | ARM repeat | coord: 87..483

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG02g01820.1 | Cp4.1LG02g01820.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0043248 | proteasome assembly