CmoCh01G001070 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh01G001070
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionDNA-directed RNA polymerase subunit beta
LocationCmo_Chr01: 474128 .. 483053 (+)
RNA-Seq ExpressionCmoCh01G001070
SyntenyCmoCh01G001070
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAAGCCCAAGATTCCAAGCGCCAAGCAAACTCTTCGTTAAGAACTCGACCATAGCAACGCCAGCATGAGCTTCGACACTATGAGAGTTCAATCCTCAACCCCTCAAAGCCCCACTTCCAATCGGAGGCTCGAGCGTGCCTTCTCTTCTCGCAGAGCCCCTCATCACAGCGGCGATTTTGATGACGACGATGACCATGATGTTTCCAAGACGAAGAAGAACAGATTTTCCTTATTCACTCACCGCCTTTCCATTTACTTCACTCGAATCGGACCCATTTGGGCCTGCCTTGCGCTCGTTGGTTTAATCCTTCTCATGATTTCGTCCTTGATATTCTTTCACTCCCGCAGATTTGTTTGCGTTTCGTCTTACGATCCTGTTTCCCGCTCTGGGTTCTTTGGCATGGATGGGCTCGATTCCGATTTCGGTTCTCTTGGTGTGCCCTGGTGTAAGTTTTCTTGCTTATCACTTTTGGATCTGATCCATTGTAGTTCTCTGGAAATGTTGAGTTTTAGATCCGTGTTCCGGGTTTATGGCGGTCTGTATATAATGCAAAAAATGGAGAAGAACTATCGTTAATTTGTTTTGGGTATGATGAATTGGTTACATCGAAAACGGGGCGTTGTTTATAGAATTGAATCTGTGTTATCGTAGTTTGAATTTGTTTCGTATCATCCGAGCTTTTCCTAATTTGGAACATTAAATATTTTCTTTACTCAATCTTGGACTGTCTATTGTGGCCTCGGCATTGATCCGTTCCTCGAGTAATGGCTAGGCTATTGTTGAGCAACCTTTAATTTCAAGTTATGTGATAAAAGTGAAGAGTCAAGGCAATATTTAATAATGTTAGAATTGTTGGTTTACGATTAAGATGCCTTATTAATGAACTACTATTCTCATTGACTTCACAGTCCTGGGTTCTAGTTCTAGTCTGATTAAATGTTATGAAATTCCTAAAATTTTTGCTTGTTTGAATCCTTATAATCCCATCCTTGGCAAAAGGCAGATCGAAACATGGAAAGACAGTTGAATGGACTGCAAAAGATTTACTAAAGGGCTTGGAAGAGTTTGTACCAATTTATGAGACTCGACCAATAAAGAACAACCTGTTTGGTATGGGCTTTGATCATAGCTTTGGCCTTTGGTTCATTGCTCGTTGGCTAAAACCAGATTTGATGATTGAAAGTGGCGCATTCAAGGGACATTCAACTTGGGTGTTGCGGCAAGCAATGCCAGACACACCGATTATTTCACTCTCACCCCGTCACCCCGAAAAATACTTGAAGAAGGGACCTGCTTACGTTGATGCTAACTGCACATATTTTGCTGGAAAGGACTTCGTAGATTTTGGAAGTGTTTCCTGGAATAGTGTGATGAAGCAACATGGAATTGATGATCTTAGCCGTGTTCTTGTATTTTTCGACGACCATCAGAATGAATTAAAGAGGTATCCTTGAGTTTCATTGCAATCAATTATTTTTGTTCACCGTCTATTTATGCACACTCTATTATGCTGAAAATTTGTCTTGTAATTTGAGGGCAGAATAAGTCAGGCTCTGAAAGTTGGCTTTCAACACCTTGTTTTTGAGGATAACTACGATACTGGCACAGGAGATCACTATTCTTTAAGGCAGATGTGCGATCAGTTCTATATTAGAGGTGCCTGCTCTTCCACTTGTATCTCGTTCCACTTTGCTTTGTTTTTTATACTCTATATGAATGTTCCCACCATCTATTTGGCTAATCAGATAATGGTTGTCTCATAAAGTGCCTTCCTTAGTACAAAGAAACTGTAACCTAATGAGAAATTTAGAACTGATTTATGTGATCCTCATAATATGTAACAGCCCCGCTGCCCCCGCCGCTAGCAGATATTGTCCTCTTTGGGCTTTTCCTTTTGGACTTCCCTCAAGGTTTTAAAACACGTCTCTTCCCCTCAAGGTCTTTTGAACTTTCCCTTATGGGCTTTTTCTTTTGGGTTTTCCCACAAGGTTTTAAAACGCGTCTCTTCCCCACACCCTTATAAGGAATGTTTCGTTCTCCTCTCCGACTGATGTGAGATCTCACGTAATATTTCTAGGGCATAATAAAATCAAAATTATATTCCAGCGCTACTTCCAATGATTTGAAAGTGATTCTGCGAGTCTGTGACAATTTAAGTTGAAGTTAATTTTAATGCTGGTATATTTCTGATTTAGGAGGTGGGCACAGTTGCTTCAAGGACAGCGATGAAGCCAGAATCAGAGCAAAAAGGAAGTTGTTCTGGGAAAAGGCAGTGGATGTAGAAGAACTTTGTGGACCGTATGAGGCTTGGTGGGGTGTCCGAGGCTACATGCGTGATGATTTTAACCACAGCAATAGGGCCATCTCCCACGCAGAGCACCTCCAGAACAGCAGGTACTTGGAGTCGATTCTTGATGTGTATTGGGAGCTCCCTCCAGTTGCTGGCCCTTCTTTAACACATCAGACTAGGTACGATCCCGCTCGTGTTTCGATCCCTATTGTGGAAGATGGCAGGTACGGTTTGTTCCAGCGACTTGGTTTAACTCGACTTGAGACTTGTGTATTTAATGGATACACACAAATGGTCTATATTCAGATATCTAAACAATAGTTGTTAGGCATTTTACTCTACCTGTCAATAGCTTGTAGTCCTTTTTCCTATACCACACTGTTATTCAAAATTTGAGTCAATTAAGGGACAAATGCTTTGTGTTCAAACATATTCCTAGTGGTTTTGTTGGTGGCAGTTTTATATATTGGCCACACTCAGATCCCTTGCTTGTCTTATCTCGCACTGTTGTAGCATATCGGACCTCAAATTGAGTTTAGCAAACTGTTTACCCCTTCGTAGCATATCGGATCTCCTGTTACCACGTCAAATTGTTGGTCTCGTGAACATATAATTTGACCCATAAATTTTAACAAGTTTGGACCTTTGGATACATTTAATAAATTTATAGTTTATACTTTGCTCAACTAAATAATTGTTAATGAATTTAACTCAAGTATATTCAATTATTAATACCGTCCTGAAACAATTAATGAATTTTAAATGAATACCAAACTCATTTTTTAATGGTTAAAAGTGAATATAATTTAACAGAAATCAAATATACTCTTAATTTAATTTGATTTAATTGAGAGGTTAAAGGTGGCGTCATATTTATCCCAATTTGCCACATTTATTTTATTTTGAGGAAAAAATAATTGGACCGTGTTTGGAGAAGAGGATCCAATTTCTGCTGCCCGTGAGAAAAAGGGCACACAGACCCAAGCGGAGCCGATTGATGAGGTAGCGGTTGCTCCGCGTTTGGCGTTCTGTACGAGAACTGAGCGTCGCTCTTTACCAATCACTCCTCTTCGTCTCCACTTACTCTCCCGCCATTAACAGCGCCACGGACCTCTCCCTTCTTCGCTTCGCGCTGCAACTCATCCATTTGTCATGGTCATAGAATCCTCAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGACTCTTGCTCATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGGTACCTTCATCCCTCTCTCTTTGTCTCGCACCCAGCCTAATTTTGTCTGCTTATTTGTTCCCTTTTTTATTTGTTCATAGTTGCCTTTTGATTCTGTGGCGTACGTCTTTTTGTCCGTTCGTTTCTATGTATATACTAGAGTTTGTTTCTAATTAATAGCTTTACCGAAGATCATTTCTGACACATGGTTGGGAAAGATTGCATTCGTGTTAATTCGGCTTCACTTTCTGTTTGTTTCGTTGTCCAAGCCATGCGAGAGTTAAATTTTCTTCTACTTGCACCTTGAACAGTACATGTATTGTAATTTGATATTGGTTGATACATGTGTACTAATTTATGATACTGGAGTCTTTTTTTTTTCTTTTCCTTTTCTTTTCTATCTATTCTGCAGTCGCCAATTCCAATTTCGATATCGAAAGTTAATCAAGTGGATGCGGCGCGGCTAGATATTGAAATGTCAGCCATGTTGAAAGAGCAGTTGGTTAAGGTCTTCGCTTTGATGAAGGTATATGTTGCTTGTTATTCGGGCATTGAGATGTAGATACTTGGCTAGTAAAGTTATTTAGGCGTCATAAAGTATTTTCTTCTACTTTGATGGATTAGCCAGGAATGTTGTTTCAATATGAAGCAGAGCTTGATGCTTTTCTGGCGTTCCTTATTTGGCGCTTTTCAATTTGGGTAGACAAGCCCACACCAGGAATTGCTCTGATGAATCTGCGGTATAGAGATGAGCGTGCAATGGAAATTCCAGGAAAAGGTGAAATGATCTAAACATCTCCCTGCCATGCTCTACTTGTTTAACTGATTGTTACGGTCAGTACGGTGTAGGAATGTAATTGAATTTATCCAAACTAACATGTAAAATAAATAATTATAAATTGAGTACGAAGTTGAGAATTAAAACTGTAGCAACCATGGAACTGCTGTTGTCAAATCTTCATAGCAATGTGTGGCAGTCACTTACTAATTCCTTTATTGAAAACTATAATCAATTTTTTCTGGCTCTGGACTTCTTCTGCAATACAGTCAGAACTGGATTGGAAGGACCTGGCCTCACAGTTGCTCAAAAGATTTGGTATTGCGTGGCCACTGTGGGTGGTCAATACATTTGGACTCGGTTACAATCATTTTCTGCTTTTCGTAGATGGGGAGATTCAGAGCAGGTACTGGTTTTCAGAAACTCACCACCCCTACCATGTGGACTACCTTCTTGTTATGTTGGTATGGTTGTTGAATAAAATAGGTATTCATATGTCCCTGTATTTTTTTTTCAGAGGTCCTTGGCGAGGCGAGCATGGCTTTTGATTCAGCGCATTGAAGGAATATACAAAGCTGCTGCATTTGGCAACTTGCTCATATTTCTTTACACAGGAAGGTAGGTTTTCTGCATCTGATAAGCCGGATTAGACTATCGATATCAAGATTTCAAATTTTCACATACCTTGATGAATATTTTTGTGAATAATTTTTTTTATATAAATTATTTAAATTAATAATTAAGTTTTCCCCCCTTTTTCTTAACTTTTAAACTAAGTCATAGATCTTGTTATCAATGTTTTAAAAGGCTTAAGGCGGGCCTTGGGGCATGAGGTGGTATGAGGCATAAGCCTTATTTTAAATTTAAAAAATGTACATAAAACATAAAGCATTATATGCTCCTAACGATATAATATTTCTTAATGTACCGGACATACTAAATGTTCAATAACTAATGCATAATAGTAGTAAGAACTACAAACCAATTAGACATTTGGGAAATAAAAAGTCTTCTTCAAAAGAAAATAATGAATAATAGTAGTAAGTAGCTAGTTAGAACCATTAAGAAAGAGAACTTTAGGCGGGAAAACAAATCAGAACATAAAAATAACTGCTAATTAAAAGAAGAAAACAGAAAAAAATTGAAATGATAATTATTTAACTAATATTAAGAAAGAAAACAAAGAAGACAATCGAATGGGAAAGAAAATAATAATCAAGATTCAAATATAGAGAAAAAAAAAAAGCAAAATAGAACGGAAAGAAAAATAATAATCGAAACTCAAAAGAAAAGAAAAGAAAAGAAAGTCGAACAAACAAAGAACAAAATGAAAACGGAAGTAGTAAGAAAAAAATATTAAACTCGCTATGCAAGTGAACAAATAAAACAAAAACAAGAACAACAAGATGAAGAAGAATGAAACTCGCTTTTGTTGTGTTGTGTTTTCGTCAAATAGTGATTCCTTGCAGTTTTTTTTCTTTGCGTAATGTTCACTTCAAATAACACTTGCACAAAAAGGAAAAGCTTTCAAGTATTTTGTTTCCCAAGTACAGTTCTTTATGTTTTCTTCAATATTCTCGCTACCATAAATTAGTATGAAGACATCACCAACTTCAATTAAATGCACCGTTGGATGATGCATTGCATGGGCACTTGGCCCACCAACCAATATCAAAGGCTTACGCCTTGTAGCCTTTGAGACTTACGCCTCTCTCAACAGAGGCGCGCAAGCCTCATATTACACTCTCAAGGCGTAAGCCTTAAAGCATGAATCTGCCGCATCACCTTGAGGCGCGCCTCAAGATTTATGCCGTGACCGATTTTTAAAACGTGGCCTGTTATTAGTTTTTATATTTATATTGTGATTGTATATAAAAGATATTAGAGATATCGATTAATCTTCAATATTTATGTTGAACTCTCCAATTTACGGAAATATCATCATATTAATGGTTGACGGATATTTTCATCCTTGCTGATAAGTTAACTTTTCAGTGAACTGTTATACTTTGAGCGGCTTGAAGTGATTATTTGTTAAATATTAGTGCTAAAAGGGCATTTTCGAAAGAATCTGCCCAAGTAAAAAGCATACTTACCTGTGTCAAATGGCATCTCTTCAACTATTATGTTTCAGTTTGCAGTGGAAAAAGGAAATTATTTTTCCATCAATAATGTAGAAGTTTATTCAGTAGAAACAAAAGAATGGATGCTTCATTTTTATTTATCATTTAAAAATTAATATTTAATGCATCTGCTAATCTGTAGGGAGCGGTAAGTTATTTTGTCATCAGTCTGAATTGGTATTTTGCAGATACATCTGACTGCACCACTTTTCACTCTATTGCTGAATCTTGTGCATTGTTGGTTGTCACATATGCTTTTCACTCAGTTGTGTTGGTTAAATTGTTGTAAGTTTTTGTAGTGGAACTCAAATAAAAATTGGTGGCTTGTCGACTATCATAAGTAGTTTACTTTCTGCCCAACTCAGCATCGTATTTTTTGTGTGGTGCAGGTATAGAAATCTTGTCGAGAGAGTTCTCAGAGCCAGGCTTGTTTATGGGAGTCCTAATATGAACAGGGCTGTCAGCTTTGAGTATATGAATCGCCAGTTAGTGTGGAATGAATTCTCGGTAATTTTCTGTCCTTTATAGTTAGTTGCAGAAGTATTGGAACTCTTATTTTGGCTTAGAACTTATTCACTTATTGCTTCATTTCAACAAAATATAGTCTCCGAACGTACTATTGTTCTTAACCATATGATGATTAGCTCACCCTCTTCCCTTTCCTCCTGAAGTAGAAATCTATCTCAGGTCGAGCATGGTATATATTCTCAGATTGTGTGCTGTTTTAAGTTTAGCGTGGAAAAACTGTGTGTTTGACGAAAAGTTATCATCTAGATAGAATTCAAATCAACTACAAGTTTCAAGTAGTTCATATAGGATATTCATTTGGTTTACAGTGGTTAAGATCGTCTACTATACTCATGCGTGCATGAATAATATGGGAAGGGTTTCAATTTTCTTTTGCCCAATGCTTCATCACAATTTTATTATCCATAGTAGGGAACATTTGCCCTCGAGAATTTATTTTTCTGTGCTAATGGCAGGAAATGTTGCTGTTGCTTCTTCCTCTTCTAAATTCTTCCTCTGTTAGAAACTTTCTTCGTCCATTTTCCAAGGAGAAGTCCTCAAGCTCAGCCGAGGATGACAGTGCTTGTCCAATTTGCCTGGCAAGTCCAACGATTCCATTTCTGGCTTTGCCTTGTCAACACAGGTCAGGCCTTTTTATCACATCACTCTATCTTTTAAACCAAAAAACTACTATTTGTTATATAAAAAAGGCAGAAACTCAACGATAGAATTAGGATGCATTTGTTCGAAAGACAGACAAAACAAGGAGGAGACGTGACGATAGAATTATTTCTTTAATGAATATAGCTTGGACGTGCATATTACAGATTTCGTTCAAAAACAAAAAAAATTGACTATGGTAATAAAATAAACGTTGTAAATCTAACAACGTCGATTGATGTAATTAGTTCTAAATGTTAAAACTTCGCAAAGTGAAAGTATATACGCTAACCAAATAAACAAATTTTAAATAATTCAATGTTTAACGTAAAGTTTTGTAATGAAAATTCCAGAGAGATTTATGTCCATACGATCATATATTTAAAATAGAAACATAAGAAACAAGTCTATTTCGCATTCATCATCTTTCACCAATGTTTTTGAGTTGTTTCGACAGTCCAGGCTTTTTATGTTCACAATTAGACATTGAAAAGAACAACAGAATATTTGATTCAAGATTTAAGATTCTGATTTAACTGTCGGTTACTGTCGACCAACGCTCGTAGACAAAGATGGCTGCATTTCATGTCTATATAATTGTTTACATTGAATCTCACAGGTTCAACTGGGGTGAGATGGAGATAAGTATTACTAACGTCCATTGGTTTTTTTTTTTTCTGACAGATACTGTTACTATTGCCTCCGAACACGATGCATGGCAGCTCAATCATTTAGATGTTCAAGATGCAGCGAGCCTGTGGTGGCCATGCAGCGGTATGTCGAAGGCACTAGTGCAAATCCCAAACGGTAATCCCTGGGCAGAGGGAGCAAATACAATTTATAGTGATTATAATAAGAAAAAGGAAAATTGCTTCAATGTTGCAATTAATTTTTTTTTGTTGCTTGAAGCATAAATGCTTACAGAAAATCAAATTCCCTGGAAAAGCATATGTTGTATTGTGTAGTAGCCGTAGGGGGGCTCATTCTTTTTAACTTTCCTGGTACATTTGTAGTATAGGTACTCGCTGATTATCTTGTGATTAATTCCCTGCAGTTACAAGAGGAAACATTATGCTGTCTTTCTTTCTTTTTCCTTTTTTTCTTATAAAGGCACAAAGAGAGGTAGGAGAGTATTTCAACCCAAGATAGAAATTTATATATTTTCAGTTCCTGGTTCGACCATGGTTCAATTAGTTTAGTCATGTATCCTCAACTAAGAGGTTAGATGTTCGGAGCCTCCTCAAAACATGTTCTTGAACTTGAAATCTTGTGC

mRNA sequence

ATGGCAAGCCCAAGATTCCAAGCGCCAAGCAAACTCTTCGTTAAGAACTCGACCATAGCAACGCCAGCATGAGCTTCGACACTATGAGAGTTCAATCCTCAACCCCTCAAAGCCCCACTTCCAATCGGAGGCTCGAGCGTGCCTTCTCTTCTCGCAGAGCCCCTCATCACAGCGGCGATTTTGATGACGACGATGACCATGATGTTTCCAAGACGAAGAAGAACAGATTTTCCTTATTCACTCACCGCCTTTCCATTTACTTCACTCGAATCGGACCCATTTGGGCCTGCCTTGCGCTCGTTGGTTTAATCCTTCTCATGATTTCGTCCTTGATATTCTTTCACTCCCGCAGATTTGTTTGCGTTTCGTCTTACGATCCTGTTTCCCGCTCTGGGTTCTTTGGCATGGATGGGCTCGATTCCGATTTCGGTTCTCTTGGTGTGCCCTGGTGCAGATCGAAACATGGAAAGACAGTTGAATGGACTGCAAAAGATTTACTAAAGGGCTTGGAAGAGTTTGTACCAATTTATGAGACTCGACCAATAAAGAACAACCTGTTTGGTATGGGCTTTGATCATAGCTTTGGCCTTTGGTTCATTGCTCGTTGGCTAAAACCAGATTTGATGATTGAAAGTGGCGCATTCAAGGGACATTCAACTTGGGTGTTGCGGCAAGCAATGCCAGACACACCGATTATTTCACTCTCACCCCGTCACCCCGAAAAATACTTGAAGAAGGGACCTGCTTACGTTGATGCTAACTGCACATATTTTGCTGGAAAGGACTTCGTAGATTTTGGAAGTGTTTCCTGGAATAGTGTGATGAAGCAACATGGAATTGATGATCTTAGCCGTGTTCTTGTATTTTTCGACGACCATCAGAATGAATTAAAGAGAATAAGTCAGGCTCTGAAAGTTGGCTTTCAACACCTTGTTTTTGAGGATAACTACGATACTGGCACAGGAGATCACTATTCTTTAAGGCAGATGTGCGATCAGTTCTATATTAGAGGAGGTGGGCACAGTTGCTTCAAGGACAGCGATGAAGCCAGAATCAGAGCAAAAAGGAAGTTGTTCTGGGAAAAGGCAGTGGATGTAGAAGAACTTTGTGGACCGTATGAGGCTTGGTGGGGTGTCCGAGGCTACATGCGTGATGATTTTAACCACAGCAATAGGGCCATCTCCCACGCAGAGCACCTCCAGAACAGCAGGTACTTGGAGTCGATTCTTGATGTGTATTGGGAGCTCCCTCCAGTTGCTGGCCCTTCTTTAACACATCAGACTAGGTACGATCCCGCTCGTGTTTCGATCCCTATTGTGGAAGATGGCAGGTACGAATCCTCAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGACTCTTGCTCATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGTCGCCAATTCCAATTTCGATATCGAAAGTTAATCAAGTGGATGCGGCGCGGCTAGATATTGAAATGTCAGCCATGTTGAAAGAGCAGTTGGTTAAGGTCTTCGCTTTGATGAAGCCAGGAATGTTGTTTCAATATGAAGCAGAGCTTGATGCTTTTCTGGCGTTCCTTATTTGGCGCTTTTCAATTTGGGTAGACAAGCCCACACCAGGAATTGCTCTGATGAATCTGCGGTATAGAGATGAGCGTGCAATGGAAATTCCAGGAAAAGTCAGAACTGGATTGGAAGGACCTGGCCTCACAGTTGCTCAAAAGATTTGGTATTGCGTGGCCACTGTGGGTGGTCAATACATTTGGACTCGGTTACAATCATTTTCTGCTTTTCGTAGATGGGGAGATTCAGAGCAGGGAGCGGTAAGTTATTTTGTCATCAGTCTGAATTGGTATTTTGCAGATACATCTGACTGCACCACTTTTCACTCTATTGCTGAATCTTGTGCATTGTTGGTTGTCACATATGCTTTTCACTCAGTTGTGTTGGTTAAATTGTTGTATAGAAATCTTGTCGAGAGAGTTCTCAGAGCCAGGCTTGTTTATGGGAGTCCTAATATGAACAGGGCTGTCAGCTTTGAGTATATGAATCGCCAGTTAGTGTGGAATGAATTCTCGGAAATGTTGCTGTTGCTTCTTCCTCTTCTAAATTCTTCCTCTGTTAGAAACTTTCTTCGTCCATTTTCCAAGGAGAAGTCCTCAAGCTCAGCCGAGGATGACAGTGCTTGTCCAATTTGCCTGGCAAGTCCAACGATTCCATTTCTGGCTTTGCCTTGTCAACACAGATACTGTTACTATTGCCTCCGAACACGATGCATGGCAGCTCAATCATTTAGATGTTCAAGATGCAGCGAGCCTGTGGTGGCCATGCAGCGGTATGTCGAAGGCACTAGTGCAAATCCCAAACGGTAATCCCTGGGCAGAGGGAGCAAATACAATTTATAGTGATTATAATAAGAAAAAGGAAAATTGCTTCAATGTTGCAATTAATTTTTTTTTGTTGCTTGAAGCATAAATGCTTACAGAAAATCAAATTCCCTGGAAAAGCATATGTTGTATTGTGTAGTAGCCGTAGGGGGGCTCATTCTTTTTAACTTTCCTGGTACATTTGTAGTATAGGTACTCGCTGATTATCTTGTGATTAATTCCCTGCAGTTACAAGAGGAAACATTATGCTGTCTTTCTTTCTTTTTCCTTTTTTTCTTATAAAGGCACAAAGAGAGGTAGGAGAGTATTTCAACCCAAGATAGAAATTTATATATTTTCAGTTCCTGGTTCGACCATGGTTCAATTAGTTTAGTCATGTATCCTCAACTAAGAGGTTAGATGTTCGGAGCCTCCTCAAAACATGTTCTTGAACTTGAAATCTTGTGC

Coding sequence (CDS)

ATGAGCTTCGACACTATGAGAGTTCAATCCTCAACCCCTCAAAGCCCCACTTCCAATCGGAGGCTCGAGCGTGCCTTCTCTTCTCGCAGAGCCCCTCATCACAGCGGCGATTTTGATGACGACGATGACCATGATGTTTCCAAGACGAAGAAGAACAGATTTTCCTTATTCACTCACCGCCTTTCCATTTACTTCACTCGAATCGGACCCATTTGGGCCTGCCTTGCGCTCGTTGGTTTAATCCTTCTCATGATTTCGTCCTTGATATTCTTTCACTCCCGCAGATTTGTTTGCGTTTCGTCTTACGATCCTGTTTCCCGCTCTGGGTTCTTTGGCATGGATGGGCTCGATTCCGATTTCGGTTCTCTTGGTGTGCCCTGGTGCAGATCGAAACATGGAAAGACAGTTGAATGGACTGCAAAAGATTTACTAAAGGGCTTGGAAGAGTTTGTACCAATTTATGAGACTCGACCAATAAAGAACAACCTGTTTGGTATGGGCTTTGATCATAGCTTTGGCCTTTGGTTCATTGCTCGTTGGCTAAAACCAGATTTGATGATTGAAAGTGGCGCATTCAAGGGACATTCAACTTGGGTGTTGCGGCAAGCAATGCCAGACACACCGATTATTTCACTCTCACCCCGTCACCCCGAAAAATACTTGAAGAAGGGACCTGCTTACGTTGATGCTAACTGCACATATTTTGCTGGAAAGGACTTCGTAGATTTTGGAAGTGTTTCCTGGAATAGTGTGATGAAGCAACATGGAATTGATGATCTTAGCCGTGTTCTTGTATTTTTCGACGACCATCAGAATGAATTAAAGAGAATAAGTCAGGCTCTGAAAGTTGGCTTTCAACACCTTGTTTTTGAGGATAACTACGATACTGGCACAGGAGATCACTATTCTTTAAGGCAGATGTGCGATCAGTTCTATATTAGAGGAGGTGGGCACAGTTGCTTCAAGGACAGCGATGAAGCCAGAATCAGAGCAAAAAGGAAGTTGTTCTGGGAAAAGGCAGTGGATGTAGAAGAACTTTGTGGACCGTATGAGGCTTGGTGGGGTGTCCGAGGCTACATGCGTGATGATTTTAACCACAGCAATAGGGCCATCTCCCACGCAGAGCACCTCCAGAACAGCAGGTACTTGGAGTCGATTCTTGATGTGTATTGGGAGCTCCCTCCAGTTGCTGGCCCTTCTTTAACACATCAGACTAGGTACGATCCCGCTCGTGTTTCGATCCCTATTGTGGAAGATGGCAGGTACGAATCCTCAGCTGCAGCTCCCGCCAGCTCCTCCTCTTCGTCTTCGCCTTCCACCAGCACCTCCCATCTTTCTCCACCTCCTGAAGATGCATGGACTCTTGCTCATCAGCGGCTACTTCCTCGCTGGAAATCTCTTTCCCAGTCTCACTTGTCGCCAATTCCAATTTCGATATCGAAAGTTAATCAAGTGGATGCGGCGCGGCTAGATATTGAAATGTCAGCCATGTTGAAAGAGCAGTTGGTTAAGGTCTTCGCTTTGATGAAGCCAGGAATGTTGTTTCAATATGAAGCAGAGCTTGATGCTTTTCTGGCGTTCCTTATTTGGCGCTTTTCAATTTGGGTAGACAAGCCCACACCAGGAATTGCTCTGATGAATCTGCGGTATAGAGATGAGCGTGCAATGGAAATTCCAGGAAAAGTCAGAACTGGATTGGAAGGACCTGGCCTCACAGTTGCTCAAAAGATTTGGTATTGCGTGGCCACTGTGGGTGGTCAATACATTTGGACTCGGTTACAATCATTTTCTGCTTTTCGTAGATGGGGAGATTCAGAGCAGGGAGCGGTAAGTTATTTTGTCATCAGTCTGAATTGGTATTTTGCAGATACATCTGACTGCACCACTTTTCACTCTATTGCTGAATCTTGTGCATTGTTGGTTGTCACATATGCTTTTCACTCAGTTGTGTTGGTTAAATTGTTGTATAGAAATCTTGTCGAGAGAGTTCTCAGAGCCAGGCTTGTTTATGGGAGTCCTAATATGAACAGGGCTGTCAGCTTTGAGTATATGAATCGCCAGTTAGTGTGGAATGAATTCTCGGAAATGTTGCTGTTGCTTCTTCCTCTTCTAAATTCTTCCTCTGTTAGAAACTTTCTTCGTCCATTTTCCAAGGAGAAGTCCTCAAGCTCAGCCGAGGATGACAGTGCTTGTCCAATTTGCCTGGCAAGTCCAACGATTCCATTTCTGGCTTTGCCTTGTCAACACAGATACTGTTACTATTGCCTCCGAACACGATGCATGGCAGCTCAATCATTTAGATGTTCAAGATGCAGCGAGCCTGTGGTGGCCATGCAGCGGTATGTCGAAGGCACTAGTGCAAATCCCAAACGGTAA

Protein sequence

MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHRLSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDFGSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDGRYESSAAAPASSSSSSSPSTSTSHLSPPPEDAWTLAHQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMNLRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQGAVSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSANPKR
Homology
BLAST of CmoCh01G001070 vs. ExPASy Swiss-Prot
Match: Q9CA86 (Peroxisome biogenesis protein 2 OS=Arabidopsis thaliana OX=3702 GN=PEX2 PE=1 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 8.9e-121
Identity = 230/344 (66.86%), Postives = 265/344 (77.03%), Query Frame = 0

Query: 446 SPPPEDAWTLAHQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKV 505
           S P +DAW  ++QRLLP  +SL  S  S IP++IS+VNQ DAARLD+EMSAMLKEQLVKV
Sbjct: 4   STPADDAWIRSYQRLLPESQSLLASRRSVIPVAISRVNQFDAARLDVEMSAMLKEQLVKV 63

Query: 506 FALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMNLRYRDERAM--EIPGKVR 565
           F LMKPGMLFQYE ELDAFL FLIWRFSIWVDKPTPG ALMNLRYRDER +  +  GKVR
Sbjct: 64  FTLMKPGMLFQYEPELDAFLEFLIWRFSIWVDKPTPGNALMNLRYRDERGVVAQHLGKVR 123

Query: 566 TGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQGAVSYFVISLNWYFAD 625
           TGLEGPGLT  QKIWYCVA+VGGQY+++RLQSFSAFRRWGDSEQ  ++  + +L      
Sbjct: 124 TGLEGPGLTSPQKIWYCVASVGGQYLFSRLQSFSAFRRWGDSEQRPLARRLWTL------ 183

Query: 626 TSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRARLVYGSPNMNRAVSFEY 685
                  +  A    LL   Y           YRNL+E+ L+ARLVY SP+MNR+VSFEY
Sbjct: 184 VQRIEGIYKAASFLNLLSFLYTGR--------YRNLIEKALKARLVYRSPHMNRSVSFEY 243

Query: 686 MNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFL 745
           MNRQLVWNEFSEMLLLLLPLLNSS+V+N L PF+K+KSSS+ ED   CPIC   P IPF+
Sbjct: 244 MNRQLVWNEFSEMLLLLLPLLNSSAVKNILSPFAKDKSSSTKEDTVTCPICQVDPAIPFI 303

Query: 746 ALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSA 788
           ALPCQHRYCYYC+RTRC +A SFRC RC+EPVVA+QR  EG S+
Sbjct: 304 ALPCQHRYCYYCIRTRCASAASFRCLRCNEPVVAIQR--EGVSS 331

BLAST of CmoCh01G001070 vs. ExPASy Swiss-Prot
Match: Q75JQ3 (Peroxisome biogenesis factor 2 OS=Dictyostelium discoideum OX=44689 GN=pex2 PE=3 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 4.7e-37
Identity = 102/331 (30.82%), Postives = 170/331 (51.36%), Query Frame = 0

Query: 478 SISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVD 537
           SI +V+Q+D+ARLD E+  +L+ Q +K+F   KP  +  ++ E++  L  +I++ SI+  
Sbjct: 107 SIVRVSQLDSARLDEEILDLLRSQFMKIFTFFKPNFIHNFQPEINLVLKSVIYKLSIFNL 166

Query: 538 KPTPGIALMNLRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFS 597
             T G  L NL YR+E+A +    +R   +   LT+ QK    +  +GG+++WTR+  + 
Sbjct: 167 GTTYGNQLQNLTYRNEKAFD---PIRGSDQLNKLTMRQKWLSGLINIGGEWLWTRINRYL 226

Query: 598 AFRRWGDSEQGAVSYFVISLNWYFADTSDCTTFHSIAESCALL-VVTYAFHSVVLVKLLY 657
               W +     +        W F + ++     S  ++ ALL  +T+ F+        Y
Sbjct: 227 INNNWSEHPPNDIR----KKFWNFLNFAE-----SAYKALALLNFLTFLFNG------KY 286

Query: 658 RNLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPF 717
             LV R+L  RLVY  P ++R +SFEYMNR LVW+ F+E +L ++PL+N   +++FL   
Sbjct: 287 VTLVNRILHMRLVYAHPTLSRNISFEYMNRLLVWHGFTEFILFIMPLINIDRIKSFLYRL 346

Query: 718 SKEKS---SSSAEDDSA-----------------------CPICLASPTIPFLALPCQHR 777
             + S   SS   +++A                       CPIC+  P     +  C H 
Sbjct: 347 LVKTSFGNSSGNNNNTASNPLQQLQKQQLLIQQQQMALAKCPICMNDPISMPYSADCGHL 406

Query: 778 YCYYCLRTRCMAAQSFRCSRCSEPVVAMQRY 782
           +CYYC++T CM   SF C RC+  +  ++R+
Sbjct: 407 FCYYCIKTSCMIDSSFTCPRCNSLISNIKRF 419

BLAST of CmoCh01G001070 vs. ExPASy Swiss-Prot
Match: P24392 (Peroxisome biogenesis factor 2 OS=Rattus norvegicus OX=10116 GN=Pex2 PE=2 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 1.4e-25
Identity = 87/313 (27.80%), Postives = 152/313 (48.56%), Query Frame = 0

Query: 479 ISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDK 538
           + +++Q+DA  L+  +  ++  Q  + F   KPG+L ++E E+ AFL   +WRF+I+   
Sbjct: 14  VLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKAFLWLFLWRFTIYSKN 73

Query: 539 PTPGIALMNLRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSA 598
            T G +++N++Y+++ +   P  V    + P     QK+ Y V T+GG+  W   + +  
Sbjct: 74  ATVGQSVLNIQYKNDSS---PNPV---YQPPSKN--QKLLYAVCTIGGR--WLEERCYDL 133

Query: 599 FRRWGDSEQGAVSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVV-LVKLLYR 658
           FR    +               F     C  F        LL +    + ++ L K  + 
Sbjct: 134 FRNRHLAS--------------FGKAKQCMNF-----VVGLLKLGELMNFLIFLQKGKFA 193

Query: 659 NLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFS 718
            L ER+L    V+  P   R V FEYMNR+L+W+ F+E L+ LLPL+N   ++  L  + 
Sbjct: 194 TLTERLLGIHSVFCKPQSMREVGFEYMNRELLWHGFAEFLVFLLPLINIQKLKAKLSSWC 253

Query: 719 KEKSSSSAEDDS------ACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRC 778
              +S++  D +       C +C   PT+P   + C+H +CYYC+++  +    F C +C
Sbjct: 254 IPLTSTAGSDSTLGSSGKECALCGEWPTMPH-TIGCEHVFCYYCVKSSFLFDMYFTCPKC 296

Query: 779 SEPVVAMQRYVEG 785
              V ++Q    G
Sbjct: 314 GTEVHSVQPLKSG 296

BLAST of CmoCh01G001070 vs. ExPASy Swiss-Prot
Match: Q06438 (Peroxisome biogenesis factor 2 OS=Cricetulus griseus OX=10029 GN=PEX2 PE=1 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 7.0e-25
Identity = 86/312 (27.56%), Postives = 150/312 (48.08%), Query Frame = 0

Query: 479 ISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDK 538
           + +++Q+DA  L+  +  ++  Q  + F   KPG+L ++E E+ A L   +WRF+I+   
Sbjct: 13  VLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKACLWLFLWRFTIYSKN 72

Query: 539 PTPGIALMNLRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSA 598
            T G +++N++Y+++ +        +  + P     QK+WY V T+GG+  W   + +  
Sbjct: 73  ATVGQSVLNIQYKNDFSS------NSRYQPPSKN--QKLWYAVCTIGGR--WLEERCYDL 132

Query: 599 FRRWGDSEQGAVSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRN 658
           FR                 N + A         ++      L     F  + L K  +  
Sbjct: 133 FR-----------------NRHLASFGKVKQCMNVMVGLLKLGELINF-LIFLQKGKFAT 192

Query: 659 LVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLN----SSSVRNFLR 718
           L ER+L    V+  P   R V F+YMNR+L+W+ F+E L+ LLPL+N     + + ++  
Sbjct: 193 LTERLLGIHSVFCKPQNIREVGFDYMNRELLWHGFAEFLIFLLPLINIQKFKAKLSSWCI 252

Query: 719 PFSKEKSSSSAEDDSA--CPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCS 778
           P +   SS SA   S   C +C   PT+P   + C+H +CYYC+++  +    F C +C 
Sbjct: 253 PLTGAASSDSALASSGKECALCGEWPTMPH-TIGCEHVFCYYCVKSSFLFDMYFTCPKCG 295

Query: 779 EPVVAMQRYVEG 785
             V ++Q    G
Sbjct: 313 IEVHSVQPLKSG 295

BLAST of CmoCh01G001070 vs. ExPASy Swiss-Prot
Match: P55098 (Peroxisome biogenesis factor 2 OS=Mus musculus OX=10090 GN=Pex2 PE=2 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 7.7e-24
Identity = 84/308 (27.27%), Postives = 150/308 (48.70%), Query Frame = 0

Query: 479 ISKVNQVDAARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDK 538
           + +++Q+DA  L+  +  ++  Q  + F   KPG+L ++E E+ AFL   +WRF+I+   
Sbjct: 14  VLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKAFLWLFLWRFTIYSKN 73

Query: 539 PTPGIALMNLRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSA 598
            T G +++N++++++ +   P  V    + P     QK+ Y V T+GG+  W   + +  
Sbjct: 74  ATVGQSVLNIQHKNDSS---PNPV---YQPPSKN--QKLLYAVCTIGGR--WLEERCYDL 133

Query: 599 FRRWGDSEQGAVSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVV-LVKLLYR 658
           FR    +               F     C  F        LL +    + ++ L K  + 
Sbjct: 134 FRNRHLAS--------------FGKAKQCMNF-----VVGLLKLGELMNFLIFLQKGKFA 193

Query: 659 NLVERVLRARLVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFS 718
            L ER+L    V+  P   R V FEYMNR+L+W+ F+E L+ LLPL+N   ++  L  + 
Sbjct: 194 TLTERLLGIHSVFCKPQNMREVGFEYMNRELLWHGFAEFLIFLLPLINIQKLKAKLSSWC 253

Query: 719 KEKSSSSAEDDS------ACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRC 778
              + ++  D +       C +C   PT+P   + C+H +CYYC+++  +    F C +C
Sbjct: 254 TLCTGAAGHDSTLGSSGKECALCGEWPTMPH-TIGCEHVFCYYCVKSSFLFDIYFTCPKC 291

Query: 779 SEPVVAMQ 780
              V ++Q
Sbjct: 314 GTEVHSVQ 291

BLAST of CmoCh01G001070 vs. ExPASy TrEMBL
Match: A0A6J1GC96 (uncharacterized protein LOC111452845 OS=Cucurbita moschata OX=3662 GN=LOC111452845 PE=4 SV=1)

HSP 1 Score: 890.6 bits (2300), Expect = 4.9e-255
Identity = 422/422 (100.00%), Postives = 422/422 (100.00%), Query Frame = 0

Query: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60
           MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120
           LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180
           GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 421 RY 423
           RY
Sbjct: 421 RY 422

BLAST of CmoCh01G001070 vs. ExPASy TrEMBL
Match: A0A6J1KCQ5 (uncharacterized protein LOC111492689 OS=Cucurbita maxima OX=3661 GN=LOC111492689 PE=4 SV=1)

HSP 1 Score: 882.5 bits (2279), Expect = 1.3e-252
Identity = 417/422 (98.82%), Postives = 419/422 (99.29%), Query Frame = 0

Query: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60
           MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120
           LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180
           GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300
           VDFGS+SWNSVMKQHGI+DLS VLVFFDDHQNELKRISQALK GFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSISWNSVMKQHGINDLSHVLVFFDDHQNELKRISQALKAGFQHLVFEDNYDTGTGD 300

Query: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVS PIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSSPIVEDG 420

Query: 421 RY 423
           RY
Sbjct: 421 RY 422

BLAST of CmoCh01G001070 vs. ExPASy TrEMBL
Match: A0A5A7U6D1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G00750 PE=4 SV=1)

HSP 1 Score: 817.0 bits (2109), Expect = 6.8e-233
Identity = 387/424 (91.27%), Postives = 402/424 (94.81%), Query Frame = 0

Query: 1   MSFDTMRVQSS-TPQSPTSNRRLERAFSSRRAPHHSGDF-DDDDDHDVSKTKKNRFSLFT 60
           MSFDTMRVQSS TPQSPTS+R LERA SSRR PHHSGD  DDDDD DVSKTKK+ FS FT
Sbjct: 26  MSFDTMRVQSSTTPQSPTSSRMLERALSSRRVPHHSGDIDDDDDDDDVSKTKKHNFSFFT 85

Query: 61  HRLSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS 120
           HR+S YF RIGPIWACLALV LILL+ISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS
Sbjct: 86  HRISNYFVRIGPIWACLALVALILLLISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS 145

Query: 121 DFGSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIA 180
           DFGSLGVPWCRSK GKTVEWTAKDLLK LEEFVPIYETRPIKNN++GMGFDHSFGLWFIA
Sbjct: 146 DFGSLGVPWCRSKQGKTVEWTAKDLLKALEEFVPIYETRPIKNNMYGMGFDHSFGLWFIA 205

Query: 181 RWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGK 240
           RWLKPDLMIESGAFKGHSTWVLRQAMP T IISLSPRHPEKYLKKGPAYVDANCTYFAGK
Sbjct: 206 RWLKPDLMIESGAFKGHSTWVLRQAMPYTRIISLSPRHPEKYLKKGPAYVDANCTYFAGK 265

Query: 241 DFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGT 300
           DFVDFGSV+W +VMK+HGIDDLS+VLVFFDDHQNELKRI QAL  GF+HLVFEDNYDTGT
Sbjct: 266 DFVDFGSVAWKNVMKEHGIDDLSQVLVFFDDHQNELKRIKQALNAGFRHLVFEDNYDTGT 325

Query: 301 GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRG 360
           GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVD+EELCGPYE+WWGV+G
Sbjct: 326 GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDIEELCGPYESWWGVQG 385

Query: 361 YMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVE 420
           YMRDDFNHSNRAISHAEH QNSRYLESILDVYWE+PPVAGPSLTHQTRYDPARVS PIVE
Sbjct: 386 YMRDDFNHSNRAISHAEHFQNSRYLESILDVYWEVPPVAGPSLTHQTRYDPARVSSPIVE 445

Query: 421 DGRY 423
           DGRY
Sbjct: 446 DGRY 449

BLAST of CmoCh01G001070 vs. ExPASy TrEMBL
Match: A0A1S3BHM5 (uncharacterized protein LOC103489955 OS=Cucumis melo OX=3656 GN=LOC103489955 PE=4 SV=1)

HSP 1 Score: 817.0 bits (2109), Expect = 6.8e-233
Identity = 387/424 (91.27%), Postives = 402/424 (94.81%), Query Frame = 0

Query: 1   MSFDTMRVQSS-TPQSPTSNRRLERAFSSRRAPHHSGDF-DDDDDHDVSKTKKNRFSLFT 60
           MSFDTMRVQSS TPQSPTS+R LERA SSRR PHHSGD  DDDDD DVSKTKK+ FS FT
Sbjct: 26  MSFDTMRVQSSTTPQSPTSSRMLERALSSRRVPHHSGDIDDDDDDDDVSKTKKHNFSFFT 85

Query: 61  HRLSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS 120
           HR+S YF RIGPIWACLALV LILL+ISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS
Sbjct: 86  HRISNYFVRIGPIWACLALVALILLLISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS 145

Query: 121 DFGSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIA 180
           DFGSLGVPWCRSK GKTVEWTAKDLLK LEEFVPIYETRPIKNN++GMGFDHSFGLWFIA
Sbjct: 146 DFGSLGVPWCRSKQGKTVEWTAKDLLKALEEFVPIYETRPIKNNMYGMGFDHSFGLWFIA 205

Query: 181 RWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGK 240
           RWLKPDLMIESGAFKGHSTWVLRQAMP T IISLSPRHPEKYLKKGPAYVDANCTYFAGK
Sbjct: 206 RWLKPDLMIESGAFKGHSTWVLRQAMPYTRIISLSPRHPEKYLKKGPAYVDANCTYFAGK 265

Query: 241 DFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGT 300
           DFVDFGSV+W +VMK+HGIDDLS+VLVFFDDHQNELKRI QAL  GF+HLVFEDNYDTGT
Sbjct: 266 DFVDFGSVAWKNVMKEHGIDDLSQVLVFFDDHQNELKRIKQALNAGFRHLVFEDNYDTGT 325

Query: 301 GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRG 360
           GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVD+EELCGPYE+WWGV+G
Sbjct: 326 GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDIEELCGPYESWWGVQG 385

Query: 361 YMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVE 420
           YMRDDFNHSNRAISHAEH QNSRYLESILDVYWE+PPVAGPSLTHQTRYDPARVS PIVE
Sbjct: 386 YMRDDFNHSNRAISHAEHFQNSRYLESILDVYWEVPPVAGPSLTHQTRYDPARVSSPIVE 445

Query: 421 DGRY 423
           DGRY
Sbjct: 446 DGRY 449

BLAST of CmoCh01G001070 vs. ExPASy TrEMBL
Match: A0A0A0LAU9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G469200 PE=4 SV=1)

HSP 1 Score: 815.1 bits (2104), Expect = 2.6e-232
Identity = 384/423 (90.78%), Postives = 401/423 (94.80%), Query Frame = 0

Query: 1   MSFDTMRVQSS-TPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTH 60
           MSFDTMRVQSS TPQSPTS+R LERA SSRR PHH+GD DDDDD DVSKTKK+ FS FTH
Sbjct: 26  MSFDTMRVQSSTTPQSPTSSRMLERALSSRRVPHHTGDIDDDDD-DVSKTKKHHFSFFTH 85

Query: 61  RLSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSD 120
           R+S YF RIGPIWACLA+V LILL+I SLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSD
Sbjct: 86  RISNYFVRIGPIWACLAIVALILLLIFSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSD 145

Query: 121 FGSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIAR 180
           FGSLGVPWCRSKHGKTVEWTAKDLLK LEEFVPIYETRPIKNN++GMGFDHSFGLWFIAR
Sbjct: 146 FGSLGVPWCRSKHGKTVEWTAKDLLKALEEFVPIYETRPIKNNMYGMGFDHSFGLWFIAR 205

Query: 181 WLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKD 240
           WLKPDL+IESGAFKGHSTWVLRQAMP T IISLSPRHPEKYLKKGPAYVDANCTYFAGKD
Sbjct: 206 WLKPDLLIESGAFKGHSTWVLRQAMPYTRIISLSPRHPEKYLKKGPAYVDANCTYFAGKD 265

Query: 241 FVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTG 300
           FVDFGSV+W +VMK+HGI+DLSRVLVFFDDHQNELKRI QAL  GFQHLVFEDNYDTGTG
Sbjct: 266 FVDFGSVAWKNVMKEHGINDLSRVLVFFDDHQNELKRIKQALNAGFQHLVFEDNYDTGTG 325

Query: 301 DHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGY 360
           DHYSLRQMCDQFYIRGGGHSCFKDSDEARIR KRKLFWEKAVD+EELCGPYE+WWGV+GY
Sbjct: 326 DHYSLRQMCDQFYIRGGGHSCFKDSDEARIRGKRKLFWEKAVDIEELCGPYESWWGVQGY 385

Query: 361 MRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVED 420
           MRDDFNHSNRAISHAEH QNSRYLESILDVYWE+PPVAGPSLTHQTRYDPARVS PIVED
Sbjct: 386 MRDDFNHSNRAISHAEHFQNSRYLESILDVYWEVPPVAGPSLTHQTRYDPARVSSPIVED 445

Query: 421 GRY 423
           GRY
Sbjct: 446 GRY 447

BLAST of CmoCh01G001070 vs. NCBI nr
Match: KAG6606758.1 (Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1463.7 bits (3788), Expect = 0.0e+00
Identity = 740/841 (87.99%), Postives = 747/841 (88.82%), Query Frame = 0

Query: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60
           MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120
           LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180
           GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 421 R---------------------------------------------------YESSAAAP 480
                                                                ES AAAP
Sbjct: 421 SISDLKLSLANCLPLRSISDLLLPPLYVNVKLLRHGPLPSSLRAATHPFVMVIESLAAAP 480

Query: 481 ASSSSSSSPSTSTSHLSPPPEDAWTLAHQRLLPRWKSLSQSHLSPIPISISKVNQVDAAR 540
           ASSSSSSSPSTS SHLSPPPEDAWTLA+QRLLPRWKSLSQSHLSPIPISISKVNQVDAAR
Sbjct: 481 ASSSSSSSPSTSASHLSPPPEDAWTLAYQRLLPRWKSLSQSHLSPIPISISKVNQVDAAR 540

Query: 541 LDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMNLR 600
           LDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFL F IWRFSIWVDKPTPGIALMNLR
Sbjct: 541 LDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFFIWRFSIWVDKPTPGIALMNLR 600

Query: 601 YRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQGA 660
           YRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQ +
Sbjct: 601 YRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQRS 660

Query: 661 VSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRARLV 720
           ++       W      +    +  A    LL+  Y           YRNLVERVLRARLV
Sbjct: 661 LA----RRAWLLIQRIE--GIYKAAAFGNLLIFLYTGR--------YRNLVERVLRARLV 720

Query: 721 YGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDS 780
           YGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDS
Sbjct: 721 YGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDS 780

Query: 781 ACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSANP 791
           ACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSANP
Sbjct: 781 ACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSANP 827

BLAST of CmoCh01G001070 vs. NCBI nr
Match: KAG7018321.1 (Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1339.7 bits (3466), Expect = 0.0e+00
Identity = 680/845 (80.47%), Postives = 717/845 (84.85%), Query Frame = 0

Query: 1   MSFD-TMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDF-DDDDDHDVSKTKKNRFSLFT 60
           MSFD TMR QSSTPQSPTS R L+RA SSRR PHHSGD  DDDDD DVSKTKK+ FS FT
Sbjct: 26  MSFDTTMRAQSSTPQSPTSKRMLDRALSSRRVPHHSGDLDDDDDDDDVSKTKKHNFSFFT 85

Query: 61  HRLSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS 120
           HRLS YF RIGPI ACLAL+ LILL+ISSLIFFHSRRFVCVSSYD +SRSGFFG+DGLDS
Sbjct: 86  HRLSNYFARIGPISACLALLALILLLISSLIFFHSRRFVCVSSYDHISRSGFFGVDGLDS 145

Query: 121 DFGSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIA 180
           DFGSLGVPWCRSKHGKTVEWT KDLLKGLEEFVPIYETRPI+NN++GMGFDHSFGLWFIA
Sbjct: 146 DFGSLGVPWCRSKHGKTVEWTTKDLLKGLEEFVPIYETRPIQNNMYGMGFDHSFGLWFIA 205

Query: 181 RWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGK 240
           RWLKPDLMIESGAFKGHSTWVLRQAMPDT IISLSPRHPEKYLKKGPAYVDANCTYFAGK
Sbjct: 206 RWLKPDLMIESGAFKGHSTWVLRQAMPDTAIISLSPRHPEKYLKKGPAYVDANCTYFAGK 265

Query: 241 DFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGT 300
           DFVDFGSV+W  VMK+HGIDDLSRVLVFFDDHQNELKRI QA+K GFQHLVFEDNYDTGT
Sbjct: 266 DFVDFGSVAWKKVMKEHGIDDLSRVLVFFDDHQNELKRIKQAVKAGFQHLVFEDNYDTGT 325

Query: 301 GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRG 360
           GDHYSLRQMCDQFYI+GGGHSCFKDSDEARIRAKRKLFWEKAVD+EELCGPYE+WWGVRG
Sbjct: 326 GDHYSLRQMCDQFYIKGGGHSCFKDSDEARIRAKRKLFWEKAVDIEELCGPYESWWGVRG 385

Query: 361 YMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVE 420
           YMRDDFNHSNRAISHAEH QNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVS PIVE
Sbjct: 386 YMRDDFNHSNRAISHAEHFQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSSPIVE 445

Query: 421 DGRY-----------ESSA----------------------------------------A 480
           DGRY           E+S                                         A
Sbjct: 446 DGRYGLFRRLGLAQLETSVFNGYTQMVYIQFLFRHYQHDGAPSSSPLLSATLIFVMVLEA 505

Query: 481 APASSSSSSSPSTSTSHLSPPPEDAWTLAHQRLLPRWKSLSQSHLSPIPISISKVNQVDA 540
             A+SS++S PSTS  +L PPPEDAW+ A+QRL PRWKSLS SHLS IPISISKVNQVDA
Sbjct: 506 LAAASSTASPPSTSNFNLPPPPEDAWSRAYQRLHPRWKSLSHSHLSAIPISISKVNQVDA 565

Query: 541 ARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMN 600
           ARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFL FLIWRFSIWVDKPTPGI+LMN
Sbjct: 566 ARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGISLMN 625

Query: 601 LRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQ 660
           LRYRDERA+E+PGKVRTGLEGPGLTVAQKIWYCVATVGGQY+WTRLQSFSAFRRWGDSEQ
Sbjct: 626 LRYRDERALEVPGKVRTGLEGPGLTVAQKIWYCVATVGGQYMWTRLQSFSAFRRWGDSEQ 685

Query: 661 GAVSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRAR 720
            +++       W      +    +  A    LL+  Y           YRNLVERVLRAR
Sbjct: 686 RSLA----RRAWLLIQRIE--GIYKAAAFGNLLIFLYTGR--------YRNLVERVLRAR 745

Query: 721 LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAED 780
           LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSS+VRNFLRPFSK+K SSSA+D
Sbjct: 746 LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSTVRNFLRPFSKDKPSSSAKD 805

Query: 781 DSACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTS- 792
           DSACPICLA+PTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQR+VEGTS 
Sbjct: 806 DSACPICLANPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRHVEGTST 856

BLAST of CmoCh01G001070 vs. NCBI nr
Match: KAG6581888.1 (Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1339.3 bits (3465), Expect = 0.0e+00
Identity = 680/845 (80.47%), Postives = 717/845 (84.85%), Query Frame = 0

Query: 1   MSFD-TMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDF-DDDDDHDVSKTKKNRFSLFT 60
           MSFD TMR QSSTPQSPTS R L+RA SSRR PHHSGD  DDDDD DVSKTKK+ FS FT
Sbjct: 26  MSFDTTMRAQSSTPQSPTSKRMLDRALSSRRVPHHSGDLDDDDDDDDVSKTKKHNFSFFT 85

Query: 61  HRLSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDS 120
           HRLS YF RIGPI ACLAL+ LILL+ISSLIFFHSRRFVCVSSYD +SRSGFFG+DGLDS
Sbjct: 86  HRLSNYFARIGPISACLALLALILLLISSLIFFHSRRFVCVSSYDHISRSGFFGVDGLDS 145

Query: 121 DFGSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIA 180
           DFGSLGVPWCRSKHGKTVEWT KDLLKGLEEFVPIYETRPI+NN++GMGFDHSFGLWFIA
Sbjct: 146 DFGSLGVPWCRSKHGKTVEWTTKDLLKGLEEFVPIYETRPIQNNMYGMGFDHSFGLWFIA 205

Query: 181 RWLKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGK 240
           RWLKPDLMIESGAFKGHSTWVLRQAMPDT IISLSPRHPEKYLKKGPAYVDANCTYFAGK
Sbjct: 206 RWLKPDLMIESGAFKGHSTWVLRQAMPDTAIISLSPRHPEKYLKKGPAYVDANCTYFAGK 265

Query: 241 DFVDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGT 300
           DFVDFGSV+W  VMK+HGIDDLSRVLVFFDDHQNELKRI QA+K GFQHLVFEDNYDTGT
Sbjct: 266 DFVDFGSVAWKKVMKEHGIDDLSRVLVFFDDHQNELKRIKQAVKAGFQHLVFEDNYDTGT 325

Query: 301 GDHYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRG 360
           GDHYSLRQMCDQFYI+GGGHSCFKDSDEARIRAKRKLFWEKAVD+EELCGPYE+WWGVRG
Sbjct: 326 GDHYSLRQMCDQFYIKGGGHSCFKDSDEARIRAKRKLFWEKAVDIEELCGPYESWWGVRG 385

Query: 361 YMRDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVE 420
           YMRDDFNHSNRAISHAEH QNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVS PIVE
Sbjct: 386 YMRDDFNHSNRAISHAEHFQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSSPIVE 445

Query: 421 DGRY-----------ESSA----------------------------------------A 480
           DGRY           E+S                                         A
Sbjct: 446 DGRYGLFRRLGLAQLETSVFNGYTQMVYIQFLFRHYQHDGAPSSSPLLSATLIFVMVLEA 505

Query: 481 APASSSSSSSPSTSTSHLSPPPEDAWTLAHQRLLPRWKSLSQSHLSPIPISISKVNQVDA 540
             A+SS++S PSTS  +L PPPEDAW+ A+QRL PRWKSLS SHLS IPISISKVNQVDA
Sbjct: 506 LAAASSTASPPSTSNFNLPPPPEDAWSRAYQRLHPRWKSLSHSHLSAIPISISKVNQVDA 565

Query: 541 ARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMN 600
           ARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFL FLIWRFSIWVDKPTPGI+LMN
Sbjct: 566 ARLDIEMSAMLKEQLVKVFALMKPGMLFQYEAELDAFLEFLIWRFSIWVDKPTPGISLMN 625

Query: 601 LRYRDERAMEIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQ 660
           LRYRDERA+E+PGKVRTGLEGPGLTVAQKIWYCVATVGGQY+WTRLQSFSAFRRWGDSEQ
Sbjct: 626 LRYRDERALEVPGKVRTGLEGPGLTVAQKIWYCVATVGGQYMWTRLQSFSAFRRWGDSEQ 685

Query: 661 GAVSYFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRAR 720
            +++       W      +    +  A    LL+  Y           YRNLVERVLRAR
Sbjct: 686 RSLA----RRAWLLIQRIE--GIYKAAAFGNLLIFLYTGR--------YRNLVERVLRAR 745

Query: 721 LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAED 780
           LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNS +VRNFLRPFSK+K SSSAED
Sbjct: 746 LVYGSPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSFTVRNFLRPFSKDKPSSSAED 805

Query: 781 DSACPICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTS- 792
           DSACPICLA+PTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQR+VEGTS 
Sbjct: 806 DSACPICLANPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRHVEGTST 856

BLAST of CmoCh01G001070 vs. NCBI nr
Match: XP_022949517.1 (uncharacterized protein LOC111452845 [Cucurbita moschata])

HSP 1 Score: 890.6 bits (2300), Expect = 1.0e-254
Identity = 422/422 (100.00%), Postives = 422/422 (100.00%), Query Frame = 0

Query: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60
           MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120
           LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180
           GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420

Query: 421 RY 423
           RY
Sbjct: 421 RY 422

BLAST of CmoCh01G001070 vs. NCBI nr
Match: KAG7036471.1 (hypothetical protein SDJN02_00088, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 889.4 bits (2297), Expect = 2.2e-254
Identity = 421/422 (99.76%), Postives = 422/422 (100.00%), Query Frame = 0

Query: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60
           MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR
Sbjct: 1   MSFDTMRVQSSTPQSPTSNRRLERAFSSRRAPHHSGDFDDDDDHDVSKTKKNRFSLFTHR 60

Query: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120
           LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF
Sbjct: 61  LSIYFTRIGPIWACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDF 120

Query: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180
           GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW
Sbjct: 121 GSLGVPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARW 180

Query: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240
           LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF
Sbjct: 181 LKPDLMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDF 240

Query: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300
           VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD
Sbjct: 241 VDFGSVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGD 300

Query: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360
           HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM
Sbjct: 301 HYSLRQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYM 360

Query: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDG 420
           RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARV+IPIVEDG
Sbjct: 361 RDDFNHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVTIPIVEDG 420

Query: 421 RY 423
           RY
Sbjct: 421 RY 422

BLAST of CmoCh01G001070 vs. TAIR 10
Match: AT3G16200.1 (unknown protein; Has 97 Blast hits to 97 proteins in 15 species: Archae - 0; Bacteria - 8; Metazoa - 0; Fungi - 0; Plants - 36; Viruses - 0; Other Eukaryotes - 53 (source: NCBI BLink). )

HSP 1 Score: 645.2 bits (1663), Expect = 6.9e-185
Identity = 304/418 (72.73%), Postives = 352/418 (84.21%), Query Frame = 0

Query: 10  SSTPQSPTSNRRLERAFSSRRAPHHSGDF--DDDDDHDVSKTKKNRFSLFTHRLSIYFTR 69
           S +P++PT+   L+RA SSRR PH   D     +   D SKTK+    L     S + +R
Sbjct: 14  SQSPKTPTT--MLDRALSSRR-PHSDADLSASGESGTDESKTKRPHIYLLA---SNFLSR 73

Query: 70  IGPIW---ACLALVGLILLMISSLIFFHSRRFVCVSSYDPVSRSGFFGMDGLDSDFGSLG 129
           IG  W     LAL+ L+LL + S + FHS  FVC+S +DP +R GFFG+DGL+SDFG+LG
Sbjct: 74  IGHQWWPCLILALLFLVLLFLIS-VAFHSHSFVCISRFDPAARIGFFGLDGLESDFGALG 133

Query: 130 VPWCRSKHGKTVEWTAKDLLKGLEEFVPIYETRPIKNNLFGMGFDHSFGLWFIARWLKPD 189
           VPWCRSKHGK VEWT+KDLLKGLEEFVPIYETRPIKNN++GMGFDHSFGLWF+ARWLKPD
Sbjct: 134 VPWCRSKHGKEVEWTSKDLLKGLEEFVPIYETRPIKNNMYGMGFDHSFGLWFMARWLKPD 193

Query: 190 LMIESGAFKGHSTWVLRQAMPDTPIISLSPRHPEKYLKKGPAYVDANCTYFAGKDFVDFG 249
           +MIESGAFKGHSTWVLRQAMPDTP+ISL+PRHPEKYL+KGPAYVD NCTYFAGKDFVDFG
Sbjct: 194 MMIESGAFKGHSTWVLRQAMPDTPMISLTPRHPEKYLRKGPAYVDGNCTYFAGKDFVDFG 253

Query: 250 SVSWNSVMKQHGIDDLSRVLVFFDDHQNELKRISQALKVGFQHLVFEDNYDTGTGDHYSL 309
           SV W +V+++HGI DLSRV+VFFDDHQNELKR+ QALK GF+HL+FEDNYDTGTGDHYSL
Sbjct: 254 SVDWKNVLRKHGITDLSRVIVFFDDHQNELKRLKQALKAGFRHLIFEDNYDTGTGDHYSL 313

Query: 310 RQMCDQFYIRGGGHSCFKDSDEARIRAKRKLFWEKAVDVEELCGPYEAWWGVRGYMRDDF 369
           RQ+CDQ +IRGGGHSCFKDSDEAR+R+KRK FWEKAVD EELCGP E WWGV+G MRDDF
Sbjct: 314 RQICDQSHIRGGGHSCFKDSDEARMRSKRKKFWEKAVDTEELCGPGETWWGVKGEMRDDF 373

Query: 370 NHSNRAISHAEHLQNSRYLESILDVYWELPPVAGPSLTHQTRYDPARVSIPIVEDGRY 423
           NH+N  IS+ +H QNSRY+ESILDVYWELPPVAGPSLTHQ+RYDPAR + PIV DG++
Sbjct: 374 NHTNTPISYNQHFQNSRYVESILDVYWELPPVAGPSLTHQSRYDPARATPPIVADGKH 424

BLAST of CmoCh01G001070 vs. TAIR 10
Match: AT1G79810.1 (Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 436.0 bits (1120), Expect = 6.3e-122
Identity = 230/344 (66.86%), Postives = 265/344 (77.03%), Query Frame = 0

Query: 446 SPPPEDAWTLAHQRLLPRWKSLSQSHLSPIPISISKVNQVDAARLDIEMSAMLKEQLVKV 505
           S P +DAW  ++QRLLP  +SL  S  S IP++IS+VNQ DAARLD+EMSAMLKEQLVKV
Sbjct: 4   STPADDAWIRSYQRLLPESQSLLASRRSVIPVAISRVNQFDAARLDVEMSAMLKEQLVKV 63

Query: 506 FALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMNLRYRDERAM--EIPGKVR 565
           F LMKPGMLFQYE ELDAFL FLIWRFSIWVDKPTPG ALMNLRYRDER +  +  GKVR
Sbjct: 64  FTLMKPGMLFQYEPELDAFLEFLIWRFSIWVDKPTPGNALMNLRYRDERGVVAQHLGKVR 123

Query: 566 TGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQGAVSYFVISLNWYFAD 625
           TGLEGPGLT  QKIWYCVA+VGGQY+++RLQSFSAFRRWGDSEQ  ++  + +L      
Sbjct: 124 TGLEGPGLTSPQKIWYCVASVGGQYLFSRLQSFSAFRRWGDSEQRPLARRLWTL------ 183

Query: 626 TSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRARLVYGSPNMNRAVSFEY 685
                  +  A    LL   Y           YRNL+E+ L+ARLVY SP+MNR+VSFEY
Sbjct: 184 VQRIEGIYKAASFLNLLSFLYTGR--------YRNLIEKALKARLVYRSPHMNRSVSFEY 243

Query: 686 MNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSACPICLASPTIPFL 745
           MNRQLVWNEFSEMLLLLLPLLNSS+V+N L PF+K+KSSS+ ED   CPIC   P IPF+
Sbjct: 244 MNRQLVWNEFSEMLLLLLPLLNSSAVKNILSPFAKDKSSSTKEDTVTCPICQVDPAIPFI 303

Query: 746 ALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSA 788
           ALPCQHRYCYYC+RTRC +A SFRC RC+EPVVA+QR  EG S+
Sbjct: 304 ALPCQHRYCYYCIRTRCASAASFRCLRCNEPVVAIQR--EGVSS 331

BLAST of CmoCh01G001070 vs. TAIR 10
Match: AT1G79810.2 (Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 384.0 bits (985), Expect = 2.8e-106
Identity = 202/296 (68.24%), Postives = 229/296 (77.36%), Query Frame = 0

Query: 494 MSAMLKEQLVKVFALMKPGMLFQYEAELDAFLAFLIWRFSIWVDKPTPGIALMNLRYRDE 553
           MSAMLKEQLVKVF LMKPGMLFQYE ELDAFL FLIWRFSIWVDKPTPG ALMNLRYRDE
Sbjct: 1   MSAMLKEQLVKVFTLMKPGMLFQYEPELDAFLEFLIWRFSIWVDKPTPGNALMNLRYRDE 60

Query: 554 RAM--EIPGKVRTGLEGPGLTVAQKIWYCVATVGGQYIWTRLQSFSAFRRWGDSEQGAVS 613
           R +  +  GKVRTGLEGPGLT  QKIWYCVA+VGGQY+++RLQSFSAFRRWGDSEQ  ++
Sbjct: 61  RGVVAQHLGKVRTGLEGPGLTSPQKIWYCVASVGGQYLFSRLQSFSAFRRWGDSEQRPLA 120

Query: 614 YFVISLNWYFADTSDCTTFHSIAESCALLVVTYAFHSVVLVKLLYRNLVERVLRARLVYG 673
             + +L             +  A    LL   Y           YRNL+E+ L+ARLVY 
Sbjct: 121 RRLWTL------VQRIEGIYKAASFLNLLSFLYTGR--------YRNLIEKALKARLVYR 180

Query: 674 SPNMNRAVSFEYMNRQLVWNEFSEMLLLLLPLLNSSSVRNFLRPFSKEKSSSSAEDDSAC 733
           SP+MNR+VSFEYMNRQLVWNEFSEMLLLLLPLLNSS+V+N L PF+K+KSSS+ ED   C
Sbjct: 181 SPHMNRSVSFEYMNRQLVWNEFSEMLLLLLPLLNSSAVKNILSPFAKDKSSSTKEDTVTC 240

Query: 734 PICLASPTIPFLALPCQHRYCYYCLRTRCMAAQSFRCSRCSEPVVAMQRYVEGTSA 788
           PIC   P IPF+ALPCQHRYCYYC+RTRC +A SFRC RC+EPVVA+QR  EG S+
Sbjct: 241 PICQVDPAIPFIALPCQHRYCYYCIRTRCASAASFRCLRCNEPVVAIQR--EGVSS 280

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9CA868.9e-12166.86Peroxisome biogenesis protein 2 OS=Arabidopsis thaliana OX=3702 GN=PEX2 PE=1 SV=... [more]
Q75JQ34.7e-3730.82Peroxisome biogenesis factor 2 OS=Dictyostelium discoideum OX=44689 GN=pex2 PE=3... [more]
P243921.4e-2527.80Peroxisome biogenesis factor 2 OS=Rattus norvegicus OX=10116 GN=Pex2 PE=2 SV=1[more]
Q064387.0e-2527.56Peroxisome biogenesis factor 2 OS=Cricetulus griseus OX=10029 GN=PEX2 PE=1 SV=1[more]
P550987.7e-2427.27Peroxisome biogenesis factor 2 OS=Mus musculus OX=10090 GN=Pex2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1GC964.9e-255100.00uncharacterized protein LOC111452845 OS=Cucurbita moschata OX=3662 GN=LOC1114528... [more]
A0A6J1KCQ51.3e-25298.82uncharacterized protein LOC111492689 OS=Cucurbita maxima OX=3661 GN=LOC111492689... [more]
A0A5A7U6D16.8e-23391.27Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3BHM56.8e-23391.27uncharacterized protein LOC103489955 OS=Cucumis melo OX=3656 GN=LOC103489955 PE=... [more]
A0A0A0LAU92.6e-23290.78Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G469200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
KAG6606758.10.0e+0087.99Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7018321.10.0e+0080.47Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. argyrosp... [more]
KAG6581888.10.0e+0080.47Peroxisome biogenesis protein 2, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022949517.11.0e-254100.00uncharacterized protein LOC111452845 [Cucurbita moschata][more]
KAG7036471.12.2e-25499.76hypothetical protein SDJN02_00088, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
AT3G16200.16.9e-18572.73unknown protein; Has 97 Blast hits to 97 proteins in 15 species: Archae - 0; Bac... [more]
AT1G79810.16.3e-12266.86Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING f... [more]
AT1G79810.22.8e-10668.24Pex2/Pex12 N-terminal domain-containing protein / zinc finger (C3HC4-type RING f... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001841Zinc finger, RING-typeSMARTSM00184ring_2coord: 731..771
e-value: 0.0084
score: 25.3
IPR001841Zinc finger, RING-typePROSITEPS50089ZF_RING_2coord: 731..772
score: 10.055033
IPR006845Pex, N-terminalPFAMPF04757Pex2_Pex12coord: 493..714
e-value: 3.6E-25
score: 88.9
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10Zinc/RING finger domain, C3HC4 (zinc finger)coord: 707..787
e-value: 7.1E-9
score: 37.2
IPR018957Zinc finger, C3HC4 RING-typePFAMPF00097zf-C3HC4coord: 731..771
e-value: 5.8E-5
score: 22.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 426..448
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..22
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 23..46
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 425..450
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..46
NoneNo IPR availablePANTHERPTHR36362:SF2BNAC05G37190D PROTEINcoord: 10..422
NoneNo IPR availablePANTHERPTHR36362DNA-DIRECTED RNA POLYMERASE SUBUNIT BETAcoord: 10..422
NoneNo IPR availableCDDcd16526RING-HC_PEX2coord: 731..772
e-value: 3.20495E-18
score: 76.6562
NoneNo IPR availableSUPERFAMILY57850RING/U-boxcoord: 724..778
IPR017907Zinc finger, RING-type, conserved sitePROSITEPS00518ZF_RING_1coord: 747..756

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh01G001070.1CmoCh01G001070.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032774 RNA biosynthetic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity
molecular_function GO:0046872 metal ion binding