CmaCh12G001570 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh12G001570
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: TMEM135_C_rich domain-containing protein
Location: Cma_Chr12: 693072 .. 703860 (+)
RNA-Seq Expression: CmaCh12G001570
Synteny: CmaCh12G001570
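As a rough illustration of how the Location field can be read programmatically, the sketch below parses the location string and derives the length of the gene span. The function name `parse_location` is hypothetical, and the calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention); the record itself does not state this.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cma_Chr12: 693072 .. 703860 (+)'.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),        # chromosome / scaffold name
        "start": start,
        "end": end,
        "strand": m.group(4),       # '+' or '-'
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Cma_Chr12: 693072 .. 703860 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span covers 10,789 bp on the plus strand of Cma_Chr12.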
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAGAACAGAAAAGGAAGGGAAAAAAATATGGGCGGAGGGTTGTGAGTCGCAAACTGACAGGGATTTCGCGTTAATTTAATGCTTGAGAAATTAGAGCACAGGATTTGGTTTTATCCGCGCCAATTCGTCTATGCTGCCTTTGGGATCTCACTTCCTCTTCAACTCACGCTCGGAAGGTGAGCCCTAGTCTCCGCTGCAGTGGCCGTCATGTCGCCGGGCGGATGTATTGAGAATGGGGACGCGGAAGCTGCTGGTAACTGCAAATCCGGTGATTCGTATTGCGAGCATTGTGGTCGTGGTTGTGGCAGTGGTTCGGCGGATTATTCTTCTTTGCCGGCTTTTTCTTGCTCTTCTTATTCACTTTGGCTTGATTCTGCGCGGTTGAGAGAGTCGGGGAAGTTTTGGCGGATTCTGGTTGCTTCTGCTAAAGGCTTCACAATCGGAGCTGGTCTTAAAGGTGGACTCTCCCTGTTTTCGATCCTTGCTGGATTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGTAAGGTTGTTTCAAAATTGGTTATGAAGTTTATTTGCATGGTATCGTGCAATTTCTGTATGCTGTAAACAAAGATGATATCGAATTATGTTCGTTTAGGAAGAAAGGAGTGATTACGAATCGTGATGTAATTTCCATGGCTTTGAAGGAGACTTTGAGGTACGGCCTGTTTCTTGGAACCTTTGCTGGGACATTCGTTTCCATCGATGAGATAATAGGCAATCTGGCAGGTCACCGTAGGTATACTCTTTTCTATTCTGATTTCTGTTCATGTAATCCTGAATCTTTGCTGTTTCATGTTCTCTGTTATCTTTTTTTTATTATTTAAATGTTCTCTTTTAGTTCAGGACTGCAAGATGGAGGGCTCTATCAGCGGGAGCATTAGCTGGGCCATCAATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTTATGCGTGCTGCGGTCTTGGCAGCGCGTTGTGGGATTAAAAGCAAGCGGTTCGGCCATATTTGTAAGCCGCTCACGTGGTCATATGGTGACATCTTCCTTATGTGTCTCTCCTCTTCGCAGATCTTGTAAGAAATCTGCCGACCCAATACATTGCTGCTAATGCTAATACTATAGGTCTTGAGCGTTTGTTCTAAATTATCGCCCATTTTCACTTCGCTTGTTTTCCTGCATGGTATCTGAATCAACAACTTTACTTTTAGCCTGCAAATCACTACATGTAATCCCTAACGCAAGCAAGCAGATGGGAAATGTGATGTCCCACATTGGTTGGGGAGGAGAACAAAACACCTTTTATAAGGGTGTGGAAACCTTCCTCTAGTAGACGCGTTTTAAAGCCTTGAGGGGAAGCCCGAAAGGGAAAGCCCAAAGAGGACTATATCTGCTAGTGGTGGATCTGGGGTGTTACAAATGGTATCAGAGCCAGACATCGGACTATGTGCCAGTGGGGAGGCTGTTCCCCATAGGGGGTAGACACGAGAGGGTGTGCTAGTAAGGACACTGGGCCCCAAAGAGGGGTGGATTTGGCAGGGGTCCCACATTGATTGGAGAAAGTAAAGAGTATCAGCGAGGACGCTGGGCCCTAAAGGGGGTGGATTGTGATGTCCCACATTGGTTGGGGAGGAGAACAAAACACCCTTTATAAGGGTGTGGAAACCTTCCCCTAGCATACATGTTTTAAAGCCTTGAGGGGAAGCCCGAAAGGGAAAGTCCAAAGAGGACAATATCTGCTAGCGGTGGATCTGGGGTGTTACAGGAAACCAGTCAATGACTTGCATCGTTATGATTATGAACTAATTATGTAACCGTTGAACTAGATCTATTGTTGGAATGAATGCAGGTGAGAAAACTGTGTTCCGTATCTTATGACACCAGGATGGTATATGCGTTAGCCAAAATGTTCTTCTATGTTATTACTTTTGATCACCCAGATGAAACTTATGGTCCTTTATGCTGTCATATGAAATTACT
TTGTGCTATTTTTTTTGTAGTATCTATTGTAAGTGACTAATTTTTATAATAATAACATACAGCTAATTAGAAGAATCATTTTCAGGTCTGCGTATGTGTTGAAGCAAGACAGCTTACCACCATCATTTAGTTCCTTTCTCAATATACATGGTGGAAAAGATACTGTAATCCTGGAAGGTTTAAAGAGTTTTGTATCAGGCATGCCATCTTCTAATAAATTTAAAGCAATAGAAAAGTACTACAGGGCAATGGGTGCAGATGTCAAATTAGATCTGCAAATGAAGACTCCATGCACGGTATTGTTTATATATATATATTGATTTATTTTGTGTTTCTCTGTTCAACTAGAAGAAATTAAATTCTTTGCTCTCAAAGTAAAATTAGTTGTTCCTTATGTGTTGCACAAAATATGCACGCACCTTAAACGTCGATGCACTGAGATATTAGAATATCTGCCATTGTAGCAGACAACTCAAGGCGTTAATGTACTTTCCTCTCACCTGATTCAAAATCCAAGACAAACTTTTGTTCTATCCAACTCATATCCTGTTCCCCAGTTTAGTTGGCATCTTGAAGTATTTTCAATTGTCAGCTGGTTGAGAAGTCTAGTGACATTATAAGGTTTATGGTTCTCCATGAATGAAAATTGCACAAGTTAGAGCTATTACTTGAACATGAATTCTATGTGAACAGATTGCAACGAAGAAATGGAAATGCACGCTAAAAATAGTAATTATAGATGATCCAAAGGGAGAAATTTATAAGTTTTTCTTTATCTTTGATTATTGAACCAATTATCGACAAATAAAATAAAATAATAAAAATCATCATTCCTGCTTAGCCAAATGCTATCAAATATTCGAAAAAAATTGATAATCTGTAAGTTTGACAATGCTGAAAGCACATTTCAGAGCTGTAGAAATGAATTATGCTCATTCCTGCCCCCACTATTCTTGGAGGTGAGCGGAGGCCAATTCTGGGGGAAAAACAAAGATTATAGCATAAAACTTTTTCCTTGGAGGCTCGTACCAAGACCTTCCAGAGGATTTTTCCAACCCATTCCCTTTCTTCCTCAAGTACTGGGGAACAATCTTTCCGAAATGTATCTAGTTCTTTTTGTTTTCTCCACATATTGTGTGATGTCAATTGTGATGCTGATTTGATGTCTGCATGCTTGAAATCATTCTGAAGACTCCCTGCTTAGAAAATTGAGGTGGTAATCATGGTATTTTTGAGAGTTTACCCTAACTATCTGCAGGTATCTTTTAATTTTAAATTTTCACTTTGCATATGCTCCTGCCAAGACTAGTAAGATCAGACCTGAAGGTTTCTTTCTTTCCTAATGCAGATTATACATGGAAATCAATCATGTGGTGGCCATGTTCTTAACTTTCTCATTCAAGGATATAAAAGAGCATTGCCAGTTTACCTACCTGTTTATCTTGTCCCAGCCCTGATAGTTCATCGTGAAGGTCTCATGAATAGGTTACATTTCAATTTGAATATCATTTGATTTATTTATACTGTGAAATGTTTGGTTTCAATTTATGATTAGAGGTAATGATTTCAAGTTAGGTTTTATTATTTTTCATGGCTCGTTGTGGTTGAACTATTGCTTAAACTGTCGATTATGTCGCAAATGTCCACTTATTAATTTTTAGGAAATGGAGATAAGTTCCCTAGTTACTCGAGTGCCTTCATAGGATTTTATTTCATGAGCAAACAAGGGCTATGAGTTTTGGGCTCGTTTATGTCTTTATTATAAACAAAATATTTAAAAAAAAATGAAAATTACCCATGCAAACCTATGGATAGGAAAAACAATTAAAAGAATGGAAAAAACTAAAACTACTTTAACTTTTTGAGACAGTTCTTAAATGGTGTTTTGTTCTAATTTGATTCCTTCATAGTTGAAGAAGTTATTAGATTACTTTACTTTTTTAATATAATCTTGACCTCGAGGATTGTTACAAAATACTTGCATGTTGAGATAACTTTTC
TGGCTGCCTTTTTCCATCCATTGTTTTGTATTACGGAATTTCCATGTTTCCGAGGGATGAAAAGTGTTATGACCTTTTGTGCAGGCCTTACGAAATTTTAGCAAGGGGGCTTCTAGGAACTGCCAGATCAAGTTTGTTCCTTTCTGTATATTGTTCATCTGCTTGGTCAGTATCTCTTCCACTATATTCTTCTAATGCATTGGCTTCTTTTCTTTTCTTTTTTTTTTTCTTTTTTCTTTTTCTTTTATCGTTCTTGTTACTATTTTCTTCTACCTCGTTTGTGCAACTTTTTGTTTCACTTTAAAATGAAAATCCTAAGGAACCCTTTGGTTATATTAAGTGAACTCCTAATTATTCATTTCTAAAACCTTCAAAATCAACCACTACACTCCAAGGGAGAGTTAAAGAAGGTTATCTTGTGATCAATGACTAGGAAGTAAGGAGAAGTGGTTTCTTACTAGAAGTTTGAACACTCCACAAGAAAGTTGATTAGAATTGCATGAATGCTCAAACATGCAATCCTAGATCAAGAATCACAAAGAGGTTGAGTCTTGTGGCTCTCAAGTGTGTAGATAACATCAAAATAACGTAGTTAAATAGTCAATTTTAGGAATGATGTCCTAATTTGATATCAATTCATCATTCTAAACAAACTAGCTAAAATACCCTTACTTAAAACATGCTTAGAAGGGTAAAAATAGAAAACAATAAAAACTCTAAAACATAAATTTGGATGATAATTTGAAGTCACTTCAAAGTTTCATCTTCTGGAGATTATATCGGCTCAGATAAAAACAATAGTCTCTAATAGATCGTGTTTTTCCACCTACTTTTCAGGATGTGGACATGCCTTACTGCAAGAACTTTCAAGAAAATAAACATTCCCTTGGTTGCTCTGGCAACGGTATAGCTTTCAGTTCTTGATTCCAATTAATTTTTATAGAGCGAGAGCTCCTGTTCTTGTCCTGAAAAGCATATACTTTTTGAGCAGTTCTTAGCTGGTTTGGCTCTGGCCATTGAGAAGAAGAGCCGGAGGATTGAAATCTCACTCTATTGCCTTGCGAGAGGCATCGAGAGCTTCTTCAGTTGCATGACGGATCTGGGATACTTGCCACAATCATTGAATTTTAAGAGAGCGGATGTGATAATCTTTAGCATATCAACTGCCATTATAATGCACTGCTATGCACAGGAAAGAGAGGTATTTCGATCCAAGTATTTGAACGTCCTTGACTGGGTTTTTGGTGTGCCTCCTCCTCCCCCTTGTGATGAAACACCAAGCTGCAAAAATTGCAAGCAGCATTGAGGCGACGTCGGATCTTCTTCGACGAACTAGACTGACCAGGATTTTAGGGTGCCAAATATGTGCATTATTTGGTCATCAATTTTCATTCTACACAATATTACAATGTTTATGTGATCATTAGCTGATATAGAATTAGCATTCAATCAAGCTTTAGATGATTCATATGAAGATATACTTTCCCCACAGGGCCTCACATTTGGTAAATAACAGATGGCATTATTGTCCCGTCACTTTTCATTTGTTAAATAAGAGCCCCTTTTCATCAATTTTAACATCCCATAGAATCTCAAACTCAGACCTCGTCTTCGTGTAGAATCGGCGGCCAGTCTCCGAGCAAACAAGAGCGCCTTGCTCGAGATGCATCTCCAGAAGAGCATGGTGGAATCGTTGAAGGAACTCATGATTCCTTGAAGTATAATCTTTGATTATACCTTTGTAAGCTTGCAAGATTTCTTGAAGTGTTTTTAAGAGTGTAGAAGTTCACAAGATCAATGCTTATGGGAGCCTTTGAAAATTCAAAGCGCGTAGATGTTCTTCCCACTCCACAAAATCTTTCTTATTTTGGCCCGAATACAAGACATAATCTCTTTGAAAGAAAGTCTTGAAGAGATTTGAGAAGCATTTTAAAGAATTCTATGGAGAGGAAGTTTCATTTCAAGAGAAAACGTGAGAAATCAGTATCATCTTAGATCAA
TCTAAGCTATTTGTAAAAACTCTCGAATAATCTTTTGATGAATAATATATTTGAATATTTAATATATTAAAATATAATATTTTATTTAAATCATATTTTTAATATTTCAATATATATTTAGTAATTTGAATCTCATATAAATACATTAATTACGTTAAATATTTAGTTAATTTGAATCTCATATAAATATATTAATTATTTTAATATTTAACGTATTAAATATTAATTTGAAGAGTTATCCTAAAATTTTACCAAATATCTATTTTTATTAAGTTTGCGAATTTTCAAATTTACGTTGAAAATAATATTTGTAATTGGAAGGAATAGAAATTTAGAATTAAATTCGTAATTTTGCCGTTTAAAATTTAAAAGACCGAACAGTCTGCGTCTGTATACTTTAAACAAAAAGAAAAAGAAGAAGAAAGCGCGCTCACCGCCCCCAGGTCGGACTGCGCGGGAAGATGCCGAGGTCTGTAAACCAGGGAGACACTTGTGGAGGTTTCTGAAGCTTCTCTAAAATTTCGACATCACTTTCGCCGGAAGTTTCAAAGATTTCCCGTCGCTGTTTTCAGCGCTTGAATTCTCCTATTTATCTTATCGTCGGCTTAGTCCGGGGTAAGCTATGGAATTACTTCAGTTTTATTATCAATAGTCTCTCATGTTTTGTCAATATTGTCCATTAGAATCTCTTTTCATCCTCGATCAAGCCCGACCTTTCTGCTAAAATCGTTCTTGTTGATATATATAGTGATTTTTGAAATCCACCGTTGTTCTGTCATGACTTCAGTTGTTGATTTCGATTGCGTATGTTCATTTGACCTCGAATTGATTTTGATTTTCGTTTTATTGTAAGGGAAATTGTTGAGTATGGCAAGTTCAGGAGATTATCTTTCCCAGATTGACATTTCGAACGAGGTATGAGATTGTTCTCTCTGCAATAAGCCCTGTATGACTATTTGGAAATTGAAGGTTTGCTCCTTGTATCGCCCTCGCATGGTTAATATTTCTGATTCCGAAACCACTTTTAATCGATAGGAAAAGGACAAGCTCGTGGCGGAAGTTATCCGCTATATCATATTCAAAACTCATCAAAATTCTGGATGTCCCATCAAAAGGGAAGAGCTTACTCAGATCGTAACAAAAAATTATAGGAATCGAGGATTGCCGGCTATTGTGATAGAAGAAGCTAAAGAGAAGCTGTCAAGCATTTTTGGTTATGAGTTGAGGGAACTTCAAAGAGCACGGCCTTCATCCACTGCCCAAGGGCACCCTCCTCAACATAGTAAGAAGTGATCTAACTTATCATCTTACTCTTGATTATTACGATCGAGATCTTCAGACGTCGGTCCAACTCCCATTTATTCACGTTATCTTACTGCAATCCTAGTTATCAGAATTTTTGTAGATATGACTACTGCAAGTCACCACAGGTTTCTTTTAACAAACAGTACAGGGTTATGGTTCAGTCGTCATGAGTCGAAGCATACAAATTTTAAGAATGAGCCAACCTTTTGCAACTTTAGTATAGAAAAAGTGTCCAACTAATGTTCTTGTCAGTGGAACAAAATGTTCTTGTGGGTGACGAGTCCATAGGATTAGAAATTCATTTCTTCTAGTAATGTTATTGTTACTACTCCAATCCATGAGCTGTGCACATGCTTTATATTAAATGTGGTTCATTGATCATTTTATTACTTATGCTACACACTCCTATCTCAAGAACCATATATGTGAAATAGCACGTATTCGAATTTATTCTCATCTCAGAAACTGCGTATGCCCTAGGTTTTGGTTCATTCTTCAAAATAAACGTCTGATGTATCTGATATACAATCCTCTTTATTCATTGCATCGAAATAAAACAACTGGAACATGAGGGTTGGCATTTTTTTAGCATGAAAGTTGTGGCTATGGGTTGTAGAAAGTCATACTAAATTCAGTGGATGTACAGACTGCTGAATGTGGTTATTTGGTTATAAATTTTCTTAGTCGCTGAATAA
GCTCTCATTTCGTCAAGAACTAGGATCTGCTGTATGCTTCATGGCTTGCCATTAATATTCTAATTTTCAGGTTACTTCCATCTGACATGGCTTACTGCAAAACTATATCGTTATTGGATCTTGCTTTAACTCTTGTCACTACATGCCTTCAGATGTTGTAGATTCAAAATCCTATGTTCTCATAAGCAAGCTTCCTGCGGAGGTTTATAGAAAGTATGTTGAGGATGTCGATACCGCACACGTGACTGGTTTTAATTTTGTTGTAGTTAGTATTGTACATCTGGCTGGAGGAAAAATTCCTGAAGGTTAGCTCTATGGCTGTAATTTCTTAATCCAATGCATACTTGGTAGATTTAGAGTGTGATAGAAAGACAGCTTACACATCAAATGGATTTCATTCATAAACTTTTGCTTTCATCTTATTTTGGAGACACTAATTTAGTACGTGAAACAAAATATTTCAAAGAGTTTCTTAGGGCAGAGTGCCACTTTTTGCTTCATAATGGAACCTGTCAATACACTGCAGAAAGCCTTTGGCATCACTTGAGACGAATGGGGTTATCTGAAAATGATGAAAAACATCCAGTCCTTGGAAACATCAAGCAGGCAGTAGAGTTGCTTGTCCAGCAAAGGTCAGTAAACTCTGTACTATATGCTTGGTAACTGCAATTGAGAAACTTCAACATAATTCGATGACCATAGAAGATACTGGAAAGACGGGTTTACAGGTTCATCTTATTCTTGCCGTGCTTTGTAGATATCTGCAGAAGGCCAAAATAAATGGTCCTGAAGGGAATATCACGTTCTACGAGCTTGCTGAAAGAGCTATAGATGGACCAGTTAGTGAAAAGATTAAAGAATATGTAGCACAGGTATGCAGCTTGTCTATATGTTTATATAGTGATGGATGAATATGTTGTTTTTAGTATAATTAAATGATAATCTTGTAGCATAAAATCTCATTAAGTTGCTAAATTTAAACTTATTATTTGCTTACTTTGGAAGAAGATTAAATGAACACGAGGAATAGAATCCCATAATCGTGTTGTCGTGCATTTTAGATTATGTGTCTATCAGTAATTAGTTTTTCGTAATTTCTTAAGTTGCATAAAAATTGTGCTTAATATGGATGGTTGGATGGACGAATCAATGCTACCATGAATGATGGAGCCCATCTTCTGAAGGATTCGCTGTTTATCTGTCTGGAGTGATATCTTTCTGGTTTCTGTTGAATCTAGTTTGCTTTCAGCTTATCATTGCATGTTTCTAATAATTGTGCTTCGACAGCATCAGATCATAGCTTCACGCATTCGACTTAGTTTCTCCTATTCCAAGCTGTCTTTTGCTGCTACAGAATGTCCAAGCATATTATAATTTAGAGTCAAATAGCATAACTTTACGATTCTTTTAACTAAGTTAACCACATGCCAACCTTTCAGTCCAGCTTGCTGATCTATTGGGTTTGTAAAGTTTCAAGATGAAGAGTTATATGTTTTTTTGGGTGTGTTTTAATCTTTTTATACATGTGAACAGCCGCTACTTTATGCGTTGCGGCTTATTATTATCATCACTAACAATCAAGAATATGAACTTACAAACAACTTCTTCCTTCATCATATTGGCTGGCCCATATTAATGTTTTTAAGATTTGATATTACATTAGAAGATTGTTTGTCAGAAGAAAATTATTCACGCAGTACGAATATGGCTATGATTTTGCTGACTATCATATCCCTTCTCCCAGATCGTCAATAAAAATGTTACTACAGAAGAGGGCTGACTATTGTCTCAACATTGCCTCCAGCTAATCTTGATGACCTCAAACTTCGCAAGGTGTGTTCAGATTGGTCATTTTGAATTGAATGTAATTAGTCTACATGTATGTCTGATACTTTAATTCCAATTGAATAGATTGGAGTTGTATTTATTGCATTAAATATTGTATGTTACAGCTATTTAATATTTTAGGTGATTTAGGGAATAGCCCATCACATTCAATT
ACACATCTAGCTCTATGCCTGTCTATGCCTATCTTCCTCTTTACTCGAGTGTCACAATTCCTTTTATTAAATTGCATCATAGAATCATTGACTTGGAGAAATTACTGTCGAATTGAAGTCTAACCTTGATATAGCTAGTGAAAGTTAGGAATTTGGTTGTCTGATCTTGATTCGAGTCAACTAGGGGAAGTTGCTATTCCTTTAAAAGAATCGACCAATTTGGTCTCACAGTTTGTCTATGGCATTTGAGATCGAATTCGTTGCACGATGTTACATACATAAGGTTTTGGACACAACTTCTAGTTAGCAGTTCAAAGTTGAAAGAAAAAGAATAGAATGAACACTGGTTTAGAACCTACCATAGATGATTTTGGGGCATTGGCAGTCGCAATAGCAAGAAGTTACACAAAGGGAGGGAAAGTCTGTTGGTTATTCCTATGCACAAATTGGGTTCAGGCATCGTTTGAAAATGCGTAACAAATTTTGAACTAACATTCTACAACAAGAAAAAATAAAAAATAAATGGATTAGGACGAGTATCCTTGCTTTACCTTCGTCGTTCACAAGATGTCGAACCCAAAAGCTACTCGAGGCTCTAGAATGGAAATTTCTACAGCTCACCATGTCGTTGATGCACATCGACAACATTCTTATTGGCACACTCGGCTACTAGAGCTTGCATCGCCTATAGAAACTGTGGCAAAAGACTTGGCCCATGCCCTGGCATTCATGGCACCTCCTTGTAAATTGCTTCTCTTGCAAGGATCCTTGAAATGGCTAAATAACTGA

mRNA sequence

AAGAACAGAAAAGGAAGGGAAAAAAATATGGGCGGAGGGTTGTGAGTCGCAAACTGACAGGGATTTCGCGTTAATTTAATGCTTGAGAAATTAGAGCACAGGATTTGGTTTTATCCGCGCCAATTCGTCTATGCTGCCTTTGGGATCTCACTTCCTCTTCAACTCACGCTCGGAAGGTGAGCCCTAGTCTCCGCTGCAGTGGCCGTCATGTCGCCGGGCGGATGTATTGAGAATGGGGACGCGGAAGCTGCTGGTAACTGCAAATCCGGTGATTCGTATTGCGAGCATTGTGGTCGTGGTTGTGGCAGTGGTTCGGCGGATTATTCTTCTTTGCCGGCTTTTTCTTGCTCTTCTTATTCACTTTGGCTTGATTCTGCGCGGTTGAGAGAGTCGGGGAAGTTTTGGCGGATTCTGGTTGCTTCTGCTAAAGGCTTCACAATCGGAGCTGGTCTTAAAGGTGGACTCTCCCTGTTTTCGATCCTTGCTGGATTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGTGATGTAATTTCCATGGCTTTGAAGGAGACTTTGAGGTACGGCCTGTTTCTTGGAACCTTTGCTGGGACATTCGTTTCCATCGATGAGATAATAGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGAGGGCTCTATCAGCGGGAGCATTAGCTGGGCCATCAATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTTATGCGTGCTGCGGTCTTGGCAGCGCGTTGTGGGATTAAAAGCAAGCGGTTCGGCCATATTTGTAAGCCGCTCACGTGGTCATATGGTGACATCTTCCTTATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCGTATGTGTTGAAGCAAGACAGCTTACCACCATCATTTAGTTCCTTTCTCAATATACATGGTGGAAAAGATACTGTAATCCTGGAAGGTTTAAAGAGTTTTGTATCAGGCATGCCATCTTCTAATAAATTTAAAGCAATAGAAAAGTACTACAGGGCAATGGGTGCAGATGTCAAATTAGATCTGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGTGGTGGCCATGTTCTTAACTTTCTCATTCAAGGATATAAAAGAGCATTGCCAGTTTACCTACCTGTTTATCTTGTCCCAGCCCTGATAGTTCATCGTGAAGGTCTCATGAATAGGATGTGGACATGCCTTACTGCAAGAACTTTCAAGAAAATAAACATTCCCTTGGTTGCTCTGGCAACGAGCGAGAGCTCCTGTTCTTGTCCTGAAAAGCATATACTTTTTGAGCAGTTCTTAGCTGGTTTGGCTCTGGCCATTGAGAAGAAGAGCCGGAGGATTGAAATCTCACTCTATTGCCTTGCGAGAGGCATCGAGAGCTTCTTCAGTTGCATGACGGATCTGGGATACTTGCCACAATCATTGAATTTTAAGAGAGCGGATGTGATAATCTTTAGCATATCAACTGCCATTATAATGCACTGCTATGCACAGGAAAGAGAGACCTCGTCTTCGTGTAGAATCGGCGGCCAGTCTCCGAGCAAACAAGAGCGCCTTGCTCGAGATGCATCTCCAGAAGAGCATGGTGGAATCGTTGAAGGAACTCATGATTCCTTGAAACCGAACAGTCTGCGTCTGTATACTTTAAACAAAAAGAAAAAGAAGAAGAAAGCGCGCTCACCGCCCCCAGGTCGGACTGCGCGGGAAGATGCCGAGCGCTTGAATTCTCCTATTTATCTTATCGTCGGCTTAGTCCGGGGGAAATTGTTGAGTATGGCAAGTTCAGGAGATTATCTTTCCCAGATTGACATTTCGAACGAGGAAAAGGACAAGCTCGTGGCGGAAGTTATCCGCTATATCATATTCAAAACTCATCAAAATTCTGGATGTCCCATCAAAAGGGAAGAGCTTACTCAGATCGTAACAAAAAATTATAGGAA
TCGAGGATTGCCGGCTATTGTGATAGAAGAAGCTAAAGAGAAGCTGTCAAGCATTTTTGGTTATGAGTTGAGGGAACTTCAAAGAGCACGGCCTTCATCCACTGCCCAAGGGCACCCTCCTCAACATAATGTTGTAGATTCAAAATCCTATGTTCTCATAAGCAAGCTTCCTGCGGAGGTTTATAGAAAGTATGTTGAGGATGTCGATACCGCACACGTGACTGAGTGCCACTTTTTGCTTCATAATGGAACCTGTCAATACACTGCAGAAAGCCTTTGGCATCACTTGAGACGAATGGGGTTATCTGAAAATGATGAAAAACATCCAGTCCTTGGAAACATCAAGCAGGCAGTAGAGTTGCTTGTCCAGCAAAGATATCTGCAGAAGGCCAAAATAAATGGTCCTGAAGGGAATATCACGTTCTACGAGCTTGCTGAAAGAGCTATAGATGGACCAGTTAGTGAAAAGATTAAAGAATATGTAGCACAGCCGCTACTTTATGCGTTGCGGCTTATTATTATCATCACTAACAATCAAGAATATGAACTTACAAACAACTTCTTCCTTCATCATATTGGCTGGCCCATATTAATAAGAGGGCTGACTATTGTCTCAACATTGCCTCCAGCTAATCTTGATGACCTCAAACTTCGCAAGGACGAGTATCCTTGCTTTACCTTCGTCGTTCACAAGATGTCGAACCCAAAAGCTACTCGAGGCTCTAGAATGGAAATTTCTACAGCTCACCATGTCGTTGATGCACATCGACAACATTCTTATTGGCACACTCGGCTACTAGAGCTTGCATCGCCTATAGAAACTGTGGCAAAAGACTTGGCCCATGCCCTGGCATTCATGGCACCTCCTTGTAAATTGCTTCTCTTGCAAGGATCCTTGAAATGGCTAAATAACTGA

Coding sequence (CDS)

ATGTCGCCGGGCGGATGTATTGAGAATGGGGACGCGGAAGCTGCTGGTAACTGCAAATCCGGTGATTCGTATTGCGAGCATTGTGGTCGTGGTTGTGGCAGTGGTTCGGCGGATTATTCTTCTTTGCCGGCTTTTTCTTGCTCTTCTTATTCACTTTGGCTTGATTCTGCGCGGTTGAGAGAGTCGGGGAAGTTTTGGCGGATTCTGGTTGCTTCTGCTAAAGGCTTCACAATCGGAGCTGGTCTTAAAGGTGGACTCTCCCTGTTTTCGATCCTTGCTGGATTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGTGATGTAATTTCCATGGCTTTGAAGGAGACTTTGAGGTACGGCCTGTTTCTTGGAACCTTTGCTGGGACATTCGTTTCCATCGATGAGATAATAGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGAGGGCTCTATCAGCGGGAGCATTAGCTGGGCCATCAATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTTATGCGTGCTGCGGTCTTGGCAGCGCGTTGTGGGATTAAAAGCAAGCGGTTCGGCCATATTTGTAAGCCGCTCACGTGGTCATATGGTGACATCTTCCTTATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCGTATGTGTTGAAGCAAGACAGCTTACCACCATCATTTAGTTCCTTTCTCAATATACATGGTGGAAAAGATACTGTAATCCTGGAAGGTTTAAAGAGTTTTGTATCAGGCATGCCATCTTCTAATAAATTTAAAGCAATAGAAAAGTACTACAGGGCAATGGGTGCAGATGTCAAATTAGATCTGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGTGGTGGCCATGTTCTTAACTTTCTCATTCAAGGATATAAAAGAGCATTGCCAGTTTACCTACCTGTTTATCTTGTCCCAGCCCTGATAGTTCATCGTGAAGGTCTCATGAATAGGATGTGGACATGCCTTACTGCAAGAACTTTCAAGAAAATAAACATTCCCTTGGTTGCTCTGGCAACGAGCGAGAGCTCCTGTTCTTGTCCTGAAAAGCATATACTTTTTGAGCAGTTCTTAGCTGGTTTGGCTCTGGCCATTGAGAAGAAGAGCCGGAGGATTGAAATCTCACTCTATTGCCTTGCGAGAGGCATCGAGAGCTTCTTCAGTTGCATGACGGATCTGGGATACTTGCCACAATCATTGAATTTTAAGAGAGCGGATGTGATAATCTTTAGCATATCAACTGCCATTATAATGCACTGCTATGCACAGGAAAGAGAGACCTCGTCTTCGTGTAGAATCGGCGGCCAGTCTCCGAGCAAACAAGAGCGCCTTGCTCGAGATGCATCTCCAGAAGAGCATGGTGGAATCGTTGAAGGAACTCATGATTCCTTGAAACCGAACAGTCTGCGTCTGTATACTTTAAACAAAAAGAAAAAGAAGAAGAAAGCGCGCTCACCGCCCCCAGGTCGGACTGCGCGGGAAGATGCCGAGCGCTTGAATTCTCCTATTTATCTTATCGTCGGCTTAGTCCGGGGGAAATTGTTGAGTATGGCAAGTTCAGGAGATTATCTTTCCCAGATTGACATTTCGAACGAGGAAAAGGACAAGCTCGTGGCGGAAGTTATCCGCTATATCATATTCAAAACTCATCAAAATTCTGGATGTCCCATCAAAAGGGAAGAGCTTACTCAGATCGTAACAAAAAATTATAGGAATCGAGGATTGCCGGCTATTGTGATAGAAGAAGCTAAAGAGAAGCTGTCAAGCATTTTTGGTTATGAGTTGAGGGAACTTCAAAGAGCACGGCCTTCATCCACTGCCCAAGGGCACCCTCCTCAACATAATGTTGTAGATTCAAAATCCTATGTTCTCATAAGCAAGCTTCCTGCGGAGGTTTATAGAAAGTATGTTGAGGATGTCGA
TACCGCACACGTGACTGAGTGCCACTTTTTGCTTCATAATGGAACCTGTCAATACACTGCAGAAAGCCTTTGGCATCACTTGAGACGAATGGGGTTATCTGAAAATGATGAAAAACATCCAGTCCTTGGAAACATCAAGCAGGCAGTAGAGTTGCTTGTCCAGCAAAGATATCTGCAGAAGGCCAAAATAAATGGTCCTGAAGGGAATATCACGTTCTACGAGCTTGCTGAAAGAGCTATAGATGGACCAGTTAGTGAAAAGATTAAAGAATATGTAGCACAGCCGCTACTTTATGCGTTGCGGCTTATTATTATCATCACTAACAATCAAGAATATGAACTTACAAACAACTTCTTCCTTCATCATATTGGCTGGCCCATATTAATAAGAGGGCTGACTATTGTCTCAACATTGCCTCCAGCTAATCTTGATGACCTCAAACTTCGCAAGGACGAGTATCCTTGCTTTACCTTCGTCGTTCACAAGATGTCGAACCCAAAAGCTACTCGAGGCTCTAGAATGGAAATTTCTACAGCTCACCATGTCGTTGATGCACATCGACAACATTCTTATTGGCACACTCGGCTACTAGAGCTTGCATCGCCTATAGAAACTGTGGCAAAAGACTTGGCCCATGCCCTGGCATTCATGGCACCTCCTTGTAAATTGCTTCTCTTGCAAGGATCCTTGAAATGGCTAAATAACTGA

Protein sequence

MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRMWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERETSSSCRIGGQSPSKQERLARDASPEEHGGIVEGTHDSLKPNSLRLYTLNKKKKKKKARSPPPGRTAREDAERLNSPIYLIVGLVRGKLLSMASSGDYLSQIDISNEEKDKLVAEVIRYIIFKTHQNSGCPIKREELTQIVTKNYRNRGLPAIVIEEAKEKLSSIFGYELRELQRARPSSTAQGHPPQHNVVDSKSYVLISKLPAEVYRKYVEDVDTAHVTECHFLLHNGTCQYTAESLWHHLRRMGLSENDEKHPVLGNIKQAVELLVQQRYLQKAKINGPEGNITFYELAERAIDGPVSEKIKEYVAQPLLYALRLIIIITNNQEYELTNNFFLHHIGWPILIRGLTIVSTLPPANLDDLKLRKDEYPCFTFVVHKMSNPKATRGSRMEISTAHHVVDAHRQHSYWHTRLLELASPIETVAKDLAHALAFMAPPCKLLLLQGSLKWLNN
Homology
BLAST of CmaCh12G001570 vs. ExPASy TrEMBL
Match: A0A6J1KK52 (uncharacterized protein LOC111496453 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111496453 PE=4 SV=1)

HSP 1 Score: 842.4 bits (2175), Expect = 1.7e-240
Identity = 437/483 (90.48%), Positives = 437/483 (90.48%), Query Frame = 0

Query: 1   MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLR 60
           MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLR
Sbjct: 1   MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLR 60

Query: 61  ESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMAL 120
           ESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMAL
Sbjct: 61  ESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMAL 120

Query: 121 KETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKT 180
           KETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKT
Sbjct: 121 KETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKT 180

Query: 181 LAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPP 240
           LAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPP
Sbjct: 181 LAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPP 240

Query: 241 SFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTII 300
           SFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTII
Sbjct: 241 SFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTII 300

Query: 301 HGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR---------------- 360
           HGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR                
Sbjct: 301 HGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYEILARGLLGTARSS 360

Query: 361 -----------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIEK 420
                      MWTCLTARTFKKINIPLVALAT                FLAGLALAIEK
Sbjct: 361 LFLSVYCSSAWMWTCLTARTFKKINIPLVALAT----------------FLAGLALAIEK 420

Query: 421 KSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERET 457
           KSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERE 
Sbjct: 421 KSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQEREV 467
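The percentages in an HSP header follow directly from the counts beside them: matched positions divided by the alignment length, gaps included. The minimal sketch below reproduces the figures from the headers on this page; the helper name `percent` is hypothetical and not part of any BLAST tool.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP headers on this page.
print(percent(437, 483))  # identity for the A0A6J1KK52 match
print(percent(434, 487))  # identity for the A0A6J1GJ26 match
print(percent(435, 487))  # positives for the A0A6J1GJ26 match
```

Because the denominator is the alignment length rather than the query length, the gapped region in the middle of this alignment lowers the percentage even though the flanking blocks match exactly.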

BLAST of CmaCh12G001570 vs. ExPASy TrEMBL
Match: A0A6J1GJ26 (uncharacterized protein LOC111454723 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454723 PE=4 SV=1)

HSP 1 Score: 830.9 bits (2145), Expect = 5.2e-237
Identity = 434/487 (89.12%), Positives = 435/487 (89.32%), Query Frame = 0

Query: 1   MSPGGC----IENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDS 60
           MSPGGC    IENGDAEA+GNCKSGDSYCEHCG GCGSGSADYSSLPAFSCSSYSLWLDS
Sbjct: 1   MSPGGCACVAIENGDAEASGNCKSGDSYCEHCGCGCGSGSADYSSLPAFSCSSYSLWLDS 60

Query: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVI 120
           ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRD I
Sbjct: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDAI 120

Query: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180
           SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT
Sbjct: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180

Query: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240
           QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD
Sbjct: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240

Query: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300
           SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP
Sbjct: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300

Query: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR------------ 360
           CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR            
Sbjct: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYEILARGLLGT 360

Query: 361 ---------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLAL 420
                          MWTCLTARTFKKINIPLVALAT                FLAGLAL
Sbjct: 361 ARSSLFLSVYCSSAWMWTCLTARTFKKINIPLVALAT----------------FLAGLAL 420

Query: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 457
           AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ
Sbjct: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 471

BLAST of CmaCh12G001570 vs. ExPASy TrEMBL
Match: A0A5A7VHD2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G004870 PE=4 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 3.6e-214
Identity = 398/484 (82.23%), Positives = 408/484 (84.30%), Query Frame = 0

Query: 4   GGCI----ENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARL 63
           GGC     +NGDAE A NCKSGDSYCEH    C  GSAD SS P+FSCSS SLWLDS RL
Sbjct: 13  GGCACLAQQNGDAETAANCKSGDSYCEH----CRYGSADSSSFPSFSCSSSSLWLDSTRL 72

Query: 64  RESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMA 123
           RE GK WRILVASAKGFTIGAGLKGGLSLFS+LAGLKRRKALASLGKKGVITNRD ISMA
Sbjct: 73  REYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKKGVITNRDAISMA 132

Query: 124 LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHK 183
           LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL AGALAGPSMLLTGLNTQHK
Sbjct: 133 LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK 192

Query: 184 TLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP 243
           TLAIYIFMRAAVLA+RCGIKSKR GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP
Sbjct: 193 TLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP 252

Query: 244 PSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTI 303
           PSF SFLN HGGKDTVILEGLKSFVSGMPSS+KFKAIEKYY AMGA VKLD QMKTPCTI
Sbjct: 253 PSFRSFLNTHGGKDTVILEGLKSFVSGMPSSDKFKAIEKYYSAMGAAVKLDPQMKTPCTI 312

Query: 304 IHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR--------------- 363
           IHGNQSCGGH L+FLIQGYKRALPVYLPVYL+PALIVHREGLMNR               
Sbjct: 313 IHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARS 372

Query: 364 ------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIE 423
                       MWTCLT+RTFKKINIPLVALAT                FL GLALAIE
Sbjct: 373 SLFLSAYCASAWMWTCLTSRTFKKINIPLVALAT----------------FLTGLALAIE 432

Query: 424 KKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERE 457
           KKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVI+FSIST+IIMHCYAQERE
Sbjct: 433 KKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQERE 476

BLAST of CmaCh12G001570 vs. ExPASy TrEMBL
Match: A0A1S3BAT4 (uncharacterized protein LOC103488060 OS=Cucumis melo OX=3656 GN=LOC103488060 PE=4 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 3.6e-214
Identity = 398/484 (82.23%), Positives = 408/484 (84.30%), Query Frame = 0

Query: 4   GGCI----ENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARL 63
           GGC     +NGDAE A NCKSGDSYCEH    C  GSAD SS P+FSCSS SLWLDS RL
Sbjct: 13  GGCACLAQQNGDAETAANCKSGDSYCEH----CRYGSADSSSFPSFSCSSSSLWLDSTRL 72

Query: 64  RESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMA 123
           RE GK WRILVASAKGFTIGAGLKGGLSLFS+LAGLKRRKALASLGKKGVITNRD ISMA
Sbjct: 73  REYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKKGVITNRDAISMA 132

Query: 124 LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHK 183
           LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL AGALAGPSMLLTGLNTQHK
Sbjct: 133 LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK 192

Query: 184 TLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP 243
           TLAIYIFMRAAVLA+RCGIKSKR GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP
Sbjct: 193 TLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP 252

Query: 244 PSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTI 303
           PSF SFLN HGGKDTVILEGLKSFVSGMPSS+KFKAIEKYY AMGA VKLD QMKTPCTI
Sbjct: 253 PSFRSFLNTHGGKDTVILEGLKSFVSGMPSSDKFKAIEKYYSAMGAAVKLDPQMKTPCTI 312

Query: 304 IHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR--------------- 363
           IHGNQSCGGH L+FLIQGYKRALPVYLPVYL+PALIVHREGLMNR               
Sbjct: 313 IHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARS 372

Query: 364 ------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIE 423
                       MWTCLT+RTFKKINIPLVALAT                FL GLALAIE
Sbjct: 373 SLFLSAYCASAWMWTCLTSRTFKKINIPLVALAT----------------FLTGLALAIE 432

Query: 424 KKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERE 457
           KKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVI+FSIST+IIMHCYAQERE
Sbjct: 433 KKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQERE 476

BLAST of CmaCh12G001570 vs. ExPASy TrEMBL
Match: A0A6J1BRD3 (uncharacterized protein LOC111004892 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111004892 PE=4 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 9.0e-213
Identity = 393/484 (81.20%), Positives = 408/484 (84.30%), Query Frame = 0

Query: 4   GGCI----ENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARL 63
           GGC     +NGD E+AGNCKSG+SYC+H    CG GSAD SSLPAFSCSS SLW DS RL
Sbjct: 13  GGCACVARQNGDVESAGNCKSGESYCQH----CGCGSADSSSLPAFSCSSSSLWFDSMRL 72

Query: 64  RESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMA 123
           RESGK WRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASL KKGVITNRD ISMA
Sbjct: 73  RESGKLWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLRKKGVITNRDAISMA 132

Query: 124 LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHK 183
           LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL AGALAGPSMLLTGLNTQHK
Sbjct: 133 LKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK 192

Query: 184 TLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP 243
           +LAIYIFMRAAVLA+RCGIKSK+ GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP
Sbjct: 193 SLAIYIFMRAAVLASRCGIKSKQLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLP 252

Query: 244 PSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTI 303
           PSF SFLN HGGKDTVILEGLKSFVSGMPSSNKF+ IEKYYRAMGAD KLDLQMKTPCTI
Sbjct: 253 PSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFEVIEKYYRAMGADTKLDLQMKTPCTI 312

Query: 304 IHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR--------------- 363
           IHGNQSCGGH ++FLIQGYKRALPVYLPVYLVPALIVHR+ L+NR               
Sbjct: 313 IHGNQSCGGHFISFLIQGYKRALPVYLPVYLVPALIVHRQDLLNRPNEILARGLLGTARS 372

Query: 364 ------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIE 423
                       MWTCLT+RTFKKINIPLVALAT                FL GLALAIE
Sbjct: 373 SLFLSLYCSSAWMWTCLTSRTFKKINIPLVALAT----------------FLTGLALAIE 432

Query: 424 KKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERE 457
           KKSRRIEISLYCLARGIESFF C TD GYLPQSLNFKRADVI+FS+STAIIMHCYAQERE
Sbjct: 433 KKSRRIEISLYCLARGIESFFICATDAGYLPQSLNFKRADVIVFSLSTAIIMHCYAQERE 476

BLAST of CmaCh12G001570 vs. NCBI nr
Match: KAG6585294.1 (Non-structural maintenance of chromosomes element 3-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1258.4 bits (3255), Expect = 0.0e+00
Identity = 672/805 (83.48%), Positives = 686/805 (85.22%), Query Frame = 0

Query: 1   MSPGGC----IENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDS 60
           MSPGGC    IENGDAEA+GNCKSG SYCEHCG GCGSGSADYSSLPAFSCSSYSLWLDS
Sbjct: 1   MSPGGCACIAIENGDAEASGNCKSGGSYCEHCGCGCGSGSADYSSLPAFSCSSYSLWLDS 60

Query: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVI 120
           ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRD I
Sbjct: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDAI 120

Query: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180
           SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT
Sbjct: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180

Query: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240
           QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD
Sbjct: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240

Query: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300
           SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP
Sbjct: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300

Query: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR------------ 360
           CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR            
Sbjct: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYEILARGLLGT 360

Query: 361 ---------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLAL 420
                          MWTCLTARTFKKINIPLVALAT                FLAGLAL
Sbjct: 361 ARSSLFLSVYCSSAWMWTCLTARTFKKINIPLVALAT----------------FLAGLAL 420

Query: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 480
           AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ
Sbjct: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 480

Query: 481 ERE---------TSSSCRIGGQSPSKQERLARDASPEEHGGIVEGTHDSLKPNSLRLYTL 540
           ERE           S+ R+     +K   L         GGIV GTHDSL          
Sbjct: 481 EREKQIWDAFIDRESAARL---RANKSALLEMQLQKSMVGGIVAGTHDSL---------- 540

Query: 541 NKKKKKKKARSPPPGRTAREDAERLNSPIYLIVGLVRGKLLSMASSGDYLSQIDISNEEK 600
               K   + SP   + +R   +RLNSP+YLIVGLVRGKLLSMASSGDYLSQIDISNEEK
Sbjct: 541 -NFSKIPTSLSPEVSKISRRSFQRLNSPLYLIVGLVRGKLLSMASSGDYLSQIDISNEEK 600

Query: 601 DKLVAEVIRYIIFKTHQNSGCPIKREELTQIVTKNYRNRGLPAIVIEEAKEKLSSIFGYE 660
           DKLVAEVIRYIIFKTHQNSGCPIKREELTQIVTKNYRNRGLPAIVIEEAKEKLSSIFGYE
Sbjct: 601 DKLVAEVIRYIIFKTHQNSGCPIKREELTQIVTKNYRNRGLPAIVIEEAKEKLSSIFGYE 660

Query: 661 LRELQRARPSSTAQGHPPQHNVVDSKSYVLISKLPAEVYRKYVEDVDTAHVTECHF---- 720
           LRELQRARPSSTAQGHPPQHNVVDSKSYVLISKLPAEVYRKYVEDVDTAHVT  +F    
Sbjct: 661 LRELQRARPSSTAQGHPPQHNVVDSKSYVLISKLPAEVYRKYVEDVDTAHVTGFNFVVVS 720

Query: 721 LLHNGTCQYTAESLWHHLRRMGLSENDEKHPVLGNIKQAVELLVQQRYLQKAKINGPEGN 762
           ++H    +   ESLWHHLRRMGLSENDEKHPVLGNIKQAVELLVQQRYLQKAK+NG EGN
Sbjct: 721 IVHLAGGKIPEESLWHHLRRMGLSENDEKHPVLGNIKQAVELLVQQRYLQKAKVNGSEGN 775

BLAST of CmaCh12G001570 vs. NCBI nr
Match: XP_023002662.1 (uncharacterized protein LOC111496453 isoform X1 [Cucurbita maxima])

HSP 1 Score: 842.4 bits (2175), Expect = 3.6e-240
Identity = 437/483 (90.48%), Positives = 437/483 (90.48%), Query Frame = 0

Query: 1   MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLR 60
           MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLR
Sbjct: 1   MSPGGCIENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDSARLR 60

Query: 61  ESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMAL 120
           ESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMAL
Sbjct: 61  ESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVISMAL 120

Query: 121 KETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKT 180
           KETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKT
Sbjct: 121 KETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKT 180

Query: 181 LAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPP 240
           LAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPP
Sbjct: 181 LAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPP 240

Query: 241 SFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTII 300
           SFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTII
Sbjct: 241 SFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTII 300

Query: 301 HGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR---------------- 360
           HGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR                
Sbjct: 301 HGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYEILARGLLGTARSS 360

Query: 361 -----------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIEK 420
                      MWTCLTARTFKKINIPLVALAT                FLAGLALAIEK
Sbjct: 361 LFLSVYCSSAWMWTCLTARTFKKINIPLVALAT----------------FLAGLALAIEK 420

Query: 421 KSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERET 457
           KSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERE 
Sbjct: 421 KSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQEREV 467

BLAST of CmaCh12G001570 vs. NCBI nr
Match: XP_022952006.1 (uncharacterized protein LOC111454723 isoform X1 [Cucurbita moschata])

HSP 1 Score: 830.9 bits (2145), Expect = 1.1e-236
Identity = 434/487 (89.12%), Positives = 435/487 (89.32%), Query Frame = 0

Query: 1   MSPGGC----IENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDS 60
           MSPGGC    IENGDAEA+GNCKSGDSYCEHCG GCGSGSADYSSLPAFSCSSYSLWLDS
Sbjct: 1   MSPGGCACVAIENGDAEASGNCKSGDSYCEHCGCGCGSGSADYSSLPAFSCSSYSLWLDS 60

Query: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVI 120
           ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRD I
Sbjct: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDAI 120

Query: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180
           SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT
Sbjct: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180

Query: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240
           QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD
Sbjct: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240

Query: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300
           SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP
Sbjct: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300

Query: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR------------ 360
           CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR            
Sbjct: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYEILARGLLGT 360

Query: 361 ---------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLAL 420
                          MWTCLTARTFKKINIPLVALAT                FLAGLAL
Sbjct: 361 ARSSLFLSVYCSSAWMWTCLTARTFKKINIPLVALAT----------------FLAGLAL 420

Query: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 457
           AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ
Sbjct: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 471

BLAST of CmaCh12G001570 vs. NCBI nr
Match: XP_023538421.1 (uncharacterized protein LOC111799208 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 827.8 bits (2137), Expect = 9.1e-236
Identity = 432/487 (88.71%), Positives = 434/487 (89.12%), Query Frame = 0

Query: 1   MSPGGC----IENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDS 60
           MSPGGC    IENGDAEA+GNCKSGDSYCEHCG GCGSGSADYSSLPAFSCSSYSLWLDS
Sbjct: 1   MSPGGCACVAIENGDAEASGNCKSGDSYCEHCGCGCGSGSADYSSLPAFSCSSYSLWLDS 60

Query: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVI 120
           ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRD I
Sbjct: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDAI 120

Query: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180
           SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTA WRALSAGALAGPSMLLTGLNT
Sbjct: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTAGWRALSAGALAGPSMLLTGLNT 180

Query: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240
           QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD
Sbjct: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQD 240

Query: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTP 300
           SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKA+EKYYRAMGADVKLDLQMKTP
Sbjct: 241 SLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYRAMGADVKLDLQMKTP 300

Query: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR------------ 360
           CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR            
Sbjct: 301 CTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYEILARGLLGT 360

Query: 361 ---------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLAL 420
                          MWTCLTARTFKKINIPLVALAT                FLAGLAL
Sbjct: 361 ARSSLFLSVYCSSAWMWTCLTARTFKKINIPLVALAT----------------FLAGLAL 420

Query: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 457
           AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ
Sbjct: 421 AIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQ 471

BLAST of CmaCh12G001570 vs. NCBI nr
Match: KAG7020214.1 (putative mitochondrial adenine nucleotide transporter BTL3 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 820.5 bits (2118), Expect = 1.5e-233
Identity = 433/496 (87.30%), Positives = 434/496 (87.50%), Query Frame = 0

Query: 1   MSPGGC----IENGDAEAAGNCKSGDSYCEHCGRGCGSGSADYSSLPAFSCSSYSLWLDS 60
           MSPGGC    IENGDAEA+GNCKSG SYCEHCG GCGSGSADYSSLPAFSCSSYSLWLDS
Sbjct: 455 MSPGGCACIAIENGDAEASGNCKSGGSYCEHCGCGCGSGSADYSSLPAFSCSSYSLWLDS 514

Query: 61  ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDVI 120
           ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRD I
Sbjct: 515 ARLRESGKFWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKKGVITNRDAI 574

Query: 121 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 180
           SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT
Sbjct: 575 SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNT 634

Query: 181 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQIL-------- 240
           QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQIL        
Sbjct: 635 QHKTLAIYIFMRAAVLAARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILQLIRRIIF 694

Query: 241 -SAYVLKQDSLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADV 300
            SAYVLKQDSLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADV
Sbjct: 695 RSAYVLKQDSLPPSFSSFLNIHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADV 754

Query: 301 KLDLQMKTPCTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR--- 360
           KLDLQMKTPCTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNR   
Sbjct: 755 KLDLQMKTPCTIIHGNQSCGGHVLNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRPYE 814

Query: 361 ------------------------MWTCLTARTFKKINIPLVALATSESSCSCPEKHILF 420
                                   MWTCLTARTFKKINIPLVALAT              
Sbjct: 815 ILARGLLGTARSSLFLSVYCSSAWMWTCLTARTFKKINIPLVALAT-------------- 874

Query: 421 EQFLAGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSIST 457
             FLAGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSIST
Sbjct: 875 --FLAGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIIFSIST 934

BLAST of CmaCh12G001570 vs. TAIR 10
Match: AT1G34630.1 (BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1); Has 323 Blast hits to 315 proteins in 124 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 110; Plants - 73; Viruses - 0; Other Eukaryotes - 45 (source: NCBI BLink). )

HSP 1 Score: 494.2 bits (1271), Expect = 2.2e-139
Identity = 264/465 (56.77%), Positives = 321/465 (69.03%), Query Frame = 0

Query: 23  SYCEHCGRGCGSGSADYSSLPAFS---CSSYSLWLDSARLRESGKFWRILVASAKGFTIG 82
           S C  C      G+ D+     F+   C +      S  + +S K  RI+VAS KGFTIG
Sbjct: 11  SRCSSCISSDNDGN-DFGKSDEFTCKQCKNSDSKSKSKSVNDSDKLRRIIVASVKGFTIG 70

Query: 83  AGLKGGLSLFSILAGLKRRKALASLGKK-GVITNRDVISMALKETLRYGLFLGTFAGTFV 142
            GLKGGL++FSI+A   RR+  +   +K G  +N + I+M +KETLRYGLFLGTFAGTFV
Sbjct: 71  TGLKGGLAIFSIVARFARRRRSSPQSRKTGEFSNSEAIAMGIKETLRYGLFLGTFAGTFV 130

Query: 143 SIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLAARCGI 202
           S+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLA+RCGI
Sbjct: 131 SVDEAIAALAGDKRTAKWRALFAGLVAGPSMLLTGPNTQHTSLAVYILMRAAVLASRCGI 190

Query: 203 KSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFSSFLNIHGGKDTVILE 262
           KSKRFG ICKPLTW +GD+FLMCLSSSQILSAY+LKQ+SLP S+ SFLN  GGKD  IL+
Sbjct: 191 KSKRFGTICKPLTWKHGDLFLMCLSSSQILSAYILKQESLPSSYKSFLNKQGGKDLSILQ 250

Query: 263 GLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTIIHGNQSCGGHVLNFLIQGY 322
           G+K   +  P +N  +AIEKYY+++G D+KLD  MK PCTIIHGN+SC  H + F +Q Y
Sbjct: 251 GVKDIATAQPFTN-LRAIEKYYKSVGVDIKLDPTMKVPCTIIHGNESCVKHGVTFFLQAY 310

Query: 323 KRALPVYLPVYLVPALIVHREGLMNRM---------------------------WTCLTA 382
           KRALPVY+PVYL+PALIVHR+ L+ +                            WTCL  
Sbjct: 311 KRALPVYVPVYLIPALIVHRQDLLKKQYSILGKGLLGTARSSLFLATYCSSAWAWTCLLF 370

Query: 383 RTFKKINIPLVALATSESSCSCPEKHILFEQFLAGLALAIEKKSRRIEISLYCLARGIES 442
           RTF+  NIPLVA+AT                F  GLALAIEKKSRRIEISLYCLAR IES
Sbjct: 371 RTFETCNIPLVAIAT----------------FPTGLALAIEKKSRRIEISLYCLARAIES 430

Query: 443 FFSCMTDLGYLPQSLNFKRADVIIFSISTAIIMHCYAQERETSSS 457
           FF+CMT+ GY+    + +RADV++FS+STAIIMHCYAQER+   S
Sbjct: 431 FFTCMTEAGYIRPPKSLRRADVVVFSVSTAIIMHCYAQERDVFRS 457

BLAST of CmaCh12G001570 vs. TAIR 10
Match: AT1G34630.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 399.1 bits (1024), Expect = 9.8e-111
Identity = 211/378 (55.82%), Positives = 261/378 (69.05%), Query Frame = 0

Query: 23  SYCEHCGRGCGSGSADYSSLPAFS---CSSYSLWLDSARLRESGKFWRILVASAKGFTIG 82
           S C  C      G+ D+     F+   C +      S  + +S K  RI+VAS KGFTIG
Sbjct: 11  SRCSSCISSDNDGN-DFGKSDEFTCKQCKNSDSKSKSKSVNDSDKLRRIIVASVKGFTIG 70

Query: 83  AGLKGGLSLFSILAGLKRRKALASLGKK-GVITNRDVISMALKETLRYGLFLGTFAGTFV 142
            GLKGGL++FSI+A   RR+  +   +K G  +N + I+M +KETLRYGLFLGTFAGTFV
Sbjct: 71  TGLKGGLAIFSIVARFARRRRSSPQSRKTGEFSNSEAIAMGIKETLRYGLFLGTFAGTFV 130

Query: 143 SIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLAARCGI 202
           S+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLA+RCGI
Sbjct: 131 SVDEAIAALAGDKRTAKWRALFAGLVAGPSMLLTGPNTQHTSLAVYILMRAAVLASRCGI 190

Query: 203 KSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFSSFLNIHGGKDTVILE 262
           KSKRFG ICKPLTW +GD+FLMCLSSSQILSAY+LKQ+SLP S+ SFLN  GGKD  IL+
Sbjct: 191 KSKRFGTICKPLTWKHGDLFLMCLSSSQILSAYILKQESLPSSYKSFLNKQGGKDLSILQ 250

Query: 263 GLKSFVSGMPSSNKFKAIEKYYRAMGADVKLDLQMKTPCTIIHGNQSCGGHVLNFLIQGY 322
           G+K   +  P +N  +AIEKYY+++G D+KLD  MK PCTIIHGN+SC  H + F +Q Y
Sbjct: 251 GVKDIATAQPFTN-LRAIEKYYKSVGVDIKLDPTMKVPCTIIHGNESCVKHGVTFFLQAY 310

Query: 323 KRALPVYLPVYLVPALIVHREGLMNRM---------------------------WTCLTA 370
           KRALPVY+PVYL+PALIVHR+ L+ +                            WTCL  
Sbjct: 311 KRALPVYVPVYLIPALIVHRQDLLKKQYSILGKGLLGTARSSLFLATYCSSAWAWTCLLF 370

BLAST of CmaCh12G001570 vs. TAIR 10
Match: AT1G34770.1 (CONTAINS InterPro DOMAIN/s: MAGE protein (InterPro:IPR002190); Has 1274 Blast hits to 1260 proteins in 85 species: Archae - 0; Bacteria - 0; Metazoa - 1104; Fungi - 45; Plants - 49; Viruses - 0; Other Eukaryotes - 76 (source: NCBI BLink). )

HSP 1 Score: 263.5 bits (672), Expect = 6.4e-70
Identity = 137/226 (60.62%), Positives = 171/226 (75.66%), Query Frame = 0

Query: 543 MASSGDYLSQIDISNEEKDKLVAEVIRYIIFKTHQNSGCPIKREELTQIVTKNYRNRGLP 602
           MA   D LSQ DIS EE DKLV+EVIR+I+FK HQ+SG PIKRE+LTQIVTKNYR R L 
Sbjct: 1   MADEEDSLSQFDISKEETDKLVSEVIRFILFKFHQSSGTPIKREDLTQIVTKNYRQRNLA 60

Query: 603 AIVIEEAKEKLSSIFGYELRELQRARPSSTAQGHPPQ-HNVVDSKSYVLISKLPAEVYRK 662
             VI EAK+KLS++FGY+L+ELQRAR SST Q   PQ  + VDSKSYVL+S+LP EV++K
Sbjct: 61  THVINEAKKKLSNVFGYDLKELQRARSSSTGQSRLPQSQSSVDSKSYVLVSELPLEVFKK 120

Query: 663 YVEDVDTAHVTECHF----LLHNGTCQYTAESLWHHLRRMGLSENDEKHPVLGNIKQAVE 722
           +V D  T+ VT   F    ++     +   E+LWHHL+RMGL ENDE +PV GN KQ +E
Sbjct: 121 HVVDETTSPVTGFTFVVLAIVQLAGGKIPEETLWHHLKRMGLHENDEHNPVFGNNKQTLE 180

Query: 723 LLVQQRYLQKAKINGPEGNITFYELAERAIDGPVSEKIKEYVAQPL 764
            LVQQR+LQK K++GPEG+  FY+LAERA+D  VSEK+K+Y++Q L
Sbjct: 181 TLVQQRFLQKEKVSGPEGSTLFYDLAERALDPQVSEKVKDYISQIL 226

BLAST of CmaCh12G001570 vs. TAIR 10
Match: AT5G51150.1 (Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein )

HSP 1 Score: 105.9 bits (263), Expect = 1.7e-22
Identity = 95/394 (24.11%), Positives = 179/394 (45.43%), Query Frame = 0

Query: 74  KGFTIGAGLKGGLSLFSILAGLKRRKALAS-LGKKGVITNRDVISMALKETLRYGLFLGT 133
           + F +  G++ G+ +      L R ++ +S L  K +++ +D+I    +E  R GL  G 
Sbjct: 80  QSFLLSYGVRVGIGILLRAFKLARGQSYSSLLDLKQLVSEKDLI--VREEACRIGLLFGG 139

Query: 134 FAGTFVSIDEIIGNLAGHRRTARWRALSAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVL 193
           F G++ ++   +      ++     ++ AG++AG S+L    + Q +TLA+Y+  R    
Sbjct: 140 FTGSYHALRCCLRK--WRKKETPLNSVLAGSVAGLSILALDDSNQRRTLALYLLARLG-Q 199

Query: 194 AARCGIKSKRFGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFSSFLNIHGGK 253
           AA    KSK   H+     W +GD  L  L+ +Q++ +++++ ++LP S+  F+   G  
Sbjct: 200 AAYNSAKSKNKFHLWGS-HWRHGDSLLFSLACAQVMYSFIMRPETLPKSYREFIQKTGPV 259

Query: 254 DTVILEGLKSFVSGMPSSNKFKAIEKYYRAMGADVKL-DLQMKTPCTIIHGN-QSCGGHV 313
              + + ++    G P      +     +   +DVK+ +     PC  IH N  SC    
Sbjct: 260 ARPVYQAVRECCRGGPIDVASLSAYISSKNEASDVKVEEFASIIPCAAIHPNTNSCLAQN 319

Query: 314 LNFLIQGYKRALPVYLPVYLVPALIVHREGLMNRMW--TCLTARTFKKINIPLVALATSE 373
            N +   +K+  P+Y  +  VP +++H +  M   +  + L  R   +    L A     
Sbjct: 320 ANAMSATFKKTFPLYFSLTFVPYVVLHLQKFMASPYRTSWLAIRDSVRSTSFLSAFVGIF 379

Query: 374 SSCSCPEKHIL---------FEQFLAGLALAIEKKSRRIEISLYCLARGIESFFSCMTDL 433
            +  C  + +          F    A L++ +EKK RR E++LY L R  +S +  + + 
Sbjct: 380 QAFICAHRKVATKDHKLVYWFAGGAAALSVMLEKKPRRSELALYVLPRAGDSLWEILVNR 439

Query: 434 GYLPQSLNFKRADVIIFSISTAIIMHCYAQERET 454
             LP   + K A+V +F      IM+    E +T
Sbjct: 440 HLLP---DIKNAEVALFCGCMGGIMYYLEYEPDT 464
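
The percentages on each HSP summary line above are simply the matched-column counts divided by the alignment length. As a minimal sketch (the function name `parse_hsp_stats` and the returned keys are illustrative assumptions, not part of this database or of any BLAST tool), the following Python snippet parses one such line and recomputes both percentages so they can be checked against the printed values:

```python
import re

# Matches the statistics line that follows each "HSP n Score: ..." line.
# "Posi?tives" accepts both the correct spelling and the "Postives" spelling
# that some exports of this page contain.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Percent = matched columns / alignment length, rounded to 2 decimals.
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


if __name__ == "__main__":
    example = "Identity = 264/465 (56.77%), Positives = 321/465 (69.03%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Note that the denominator is the alignment length including gap columns, not the length of the query or subject protein.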

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
A0A6J1KK52      1.7e-240  90.48     uncharacterized protein LOC111496453 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1GJ26      5.2e-237  89.12     uncharacterized protein LOC111454723 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A5A7VHD2      3.6e-214  82.23     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3BAT4      3.6e-214  82.23     uncharacterized protein LOC103488060 OS=Cucumis melo OX=3656 GN=LOC103488060 PE=...
A0A6J1BRD3      9.0e-213  81.20     uncharacterized protein LOC111004892 isoform X1 OS=Momordica charantia OX=3673 G...

Match Name      E-value   Identity  Description
KAG6585294.1    0.0e+00   83.48     Non-structural maintenance of chromosomes element 3-like protein, partial [Cucur...
XP_023002662.1  3.6e-240  90.48     uncharacterized protein LOC111496453 isoform X1 [Cucurbita maxima]
XP_022952006.1  1.1e-236  89.12     uncharacterized protein LOC111454723 isoform X1 [Cucurbita moschata]
XP_023538421.1  9.1e-236  88.71     uncharacterized protein LOC111799208 isoform X1 [Cucurbita pepo subsp. pepo]
KAG7020214.1    1.5e-233  87.30     putative mitochondrial adenine nucleotide transporter BTL3 [Cucurbita argyrosper...

Match Name      E-value   Identity  Description
AT1G34630.1     2.2e-139  56.77     BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane ...
AT1G34630.2     9.8e-111  55.82     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT1G34770.1     6.4e-70   60.62     CONTAINS InterPro DOMAIN/s: MAGE protein (InterPro:IPR002190); Has 1274 Blast hi...
AT5G51150.1     1.7e-22   24.11     Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                               Source       Source Term     Source Description                            Alignment
IPR002190  MAGE homology domain                          SMART        SM01373         MAGE_2                                        coord: 567..755; e-value: 9.0E-18; score: 75.0
IPR002190  MAGE homology domain                          PFAM         PF01454         MAGE                                          coord: 567..746; e-value: 1.0E-24; score: 87.4
IPR041898  MAGE homology domain, winged helix WH1 motif  GENE3D       1.10.10.1200    MAGE homology domain, winged helix WH1 motif  coord: 554..656; e-value: 1.1E-22; score: 81.7
IPR041899  MAGE homology domain, winged helix WH2 motif  GENE3D       1.10.10.1210    MAGE homology domain, winged helix WH2 motif  coord: 674..763; e-value: 4.3E-11; score: 44.5
None       No IPR available                              MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 501..521
None       No IPR available                              MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 456..489
None       No IPR available                              MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 470..485
None       No IPR available                              PANTHER      PTHR12459:SF18  BNAANNG02190D PROTEIN                         coord: 11..455
IPR026749  Transmembrane protein 135                     PANTHER      PTHR12459       UNCHARACTERIZED                               coord: 11..455

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh12G001570.1  CmaCh12G001570.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane