CmaCh12G011040 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh12G011040
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionWD repeat-containing protein 48
LocationCma_Chr12: 8736338 .. 8756951 (-)
RNA-Seq ExpressionCmaCh12G011040
SyntenyCmaCh12G011040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTAGGTATTTTACAAGTGTTTGATTATTGTTCTTATTTTCTGTTTGAAATTGAGCAATTTAATAGAAAATTTAACTGATAAAGAAAAACCTCCCAATGGAAATATATCATTATATTGTGATTTGAATTAGGCCAATTTTAGCTGCAAATCCTCATAAATTCTACCTTTTTTTTTTAATTTTTTATTATTGTTCTTATTTGAAATCTTCCCAAACATAGGTTTTTAATTTTATTACTATTAGGGCACGCACAGGTTTGATTAGAAGAACGTTTTGGACCGTTTTGGACCAATTAGAAATTTCGAGTGGGTGAGCTTAAGCAACTATTAGATGAACACGACTCTCTACAATGGTATGATATTATCCACTTTGAGCATAAGTTCTCATAGCTTTGCTTTGGGCTTCCCAAAAGACCTTGTACTAATGGAGATAGTATTTCTTGATTATTAAAAGACCTCGTACTAATGTAGATAGTATTTCTTGATTATTAATAGATGTGGGACTTTCATCCAACCCCTCCAAAAAAAATTATTAATTATGCAACAAAACTGTTCATACCCGGCAATGCAAGGGAAGTCATCGATAAGATACTAAAAGTCGTGAATATTTTTGGAATTGGAATAGTCATTCCACTCATTTCATCATGATAAACAAGACATATTCGCCAAGAAAAAAAGTCCAACACCAATAAATCTATGAGAGATTATTTGTAAAATAGTTCCATTGAGTCCCGTGTCGCGTATAGAACAAATTTCTATGATTATGAAACCCATATGAAATACAGAGGAATAAGCTATTCTTTTTTATTTCAGAGATGTGTGACATTACTAATTAAGATCTTATAAAGTTCAATATAAAATATTTACCCGTCATAATGTGTCACTAATATCGATTACAAACGTTGAATATTATGAATCTTCTTGATAGTAGGGTATGCTTGGGTTGTAACCCTATGTTTTAGGTTGAGTAGTCCAGGTTGGTCGGGTTATTGAGTCATATAAACACTTTTATAATTTAACGAGTTTATGTACTCTCATTGCCTATACAAACAAATTAATATGTTCTTTTTTTTAGGAATGTGTAAGAGGGTCATATTTGCTAGTTAGTGGATTGAAATGCTAACAGTTTTGTCTCATTGTCGAGAACATAATCAAAACCATTTAATTGCATCCCTGCTTTTATTAAAAAAAACTTTGCTCTTCTTTTATCATTTGGAGGTATAGATTATGTAGTCGAATCAATGCACGTGGTTGTACGAAACTGGCAAACTGTTGAGGATTGTTAGGAGATGAGTCCTACACCGGCTAATTAAGTAGTTGATCATGGATTACAAACATCGACGACAGAGAGACCCAACTCCATTCCACCCTTAATTCCATTGACGTTTCTAAAGAAGACAACTCACCCACCTCCGATGATCAAACGACGGAGAAGTTCAACAAGCATTGATTGGAGGAGAAATTTGCAGAGTTGAACACAGGGATTTATGAATGTAGATCATGTGGGCATAAATTCGACGAGGCAGTCGGAAATCCGCCGTATCCAGTAGCACCGAGACTGTCGTTCGAACAGCTACCGGAGGACTGGCAGTGCCCGACGTGTGGGGCGGTGAAGAGTTTATAAAGTTTGAAGCTCGAACCCACAATTATAAAGTTCGAATCCACAAACTAAATTAATAAAAATATTACTATATTTTAAAAAGAATAAATTTAAGATTTAGATTAGTTTTCTATGGAAACGCTAGTATTTGTTTAAAAAAAGTACCCATTAACTTTTAAAATTAACATTACTATCCTTAATCTTAATAAAAAAATATATATAATTTTTAAAATGTTTTTAATATTAATGTCTACGTAGATTATTTCTCCCTTCAAGTTTTTTAAAATTTTTAAAAGTATTACTCATTTTTTTAAAGTTTATGAATATTTTTTTTTAGATCATCTCAGCTTTATTTTAATTATTTTAAAAAAAATTAAAATATTTTATTGAGAGGTATTTAAAAAATTAAATATTTTAGATTCTTTTTCATTTAAGGATATTTTTTATTTTTTTATTTAATTTAAGAATATTATTGAGAATTAGGAATATTTTAAAGATAATTTAGAGAATCGAATAATTTATTTTTTTTGTTTTTTTTTGTTTTTTTTTGTTTTTGTTTTTGTTTTTGTTTTTGTTTTTGTTTTTGTTTTTGTTTTATGCTATTTGTGAGCCTTTTCCCCTCGTAATACCCGCACGAGGACTGAAATCTCGACAAGGTACACCTTCGGCCCGGCATTTCTTCGCTGTGCTCTTCATCCGTTTGTGTGTAAAAATCGAGAAAAATCAAACCCCATTTGGCATTTCGTCTACCGGTGAGAGTTTCGAGCTCGATTACCTGTCATTTTCATATTATTTTCATCGGTTTCTATCCATTTTCATCAGTATTGTTGTTTTCGAGCAACTTCTGGGAATTGGATCATCATTTTACTGTGGAAATCATCAACATCTCGACTGTTTCTTGTGAAAAATCTCTCTCGGTTGGTTTTATTTCTGATGGATTTCTCTGTTTTCTGTTTTTATTATGGATTTCATATGTAATTTCCATGGTTTTTGGGTCAGATTTCTTGGATCACCTTGTTTTTCTGCTTTATATTTTCCTCAAACCTGTTTTGTTCTATGAATTTATGAATGGATTGGCTGGTTGAAAGAATCTCTGTTCGTTTTGGTATTTGGTTGAACTTTTTACTCATTTTTTAGGCTCTAAGACGAGTTTCTTCGTATTTATTGGAGTATGGTTTGTGTGATCTGATCGGAACATATCATATTAGGAAGATCAAGATCATTTCAATTGTGTTTTTATGTTATGTTCTCTGAATCTCAATTATTCAACAGTTTAGAGAAGTGGGGAGTTCATTTCTTTTAATTTTGTGCCAAGAACTGTAATGATACTGAAACATGTTGTTTGATTGAAATATATTTGCTTTGATTTGCTGATTTGTGATTATGTTTTCATCTGCAGGTTTGTTGCTCAAAATGGTTGATGCAAAATCTTCAATAGCTAAAGATGTTACTGAAGTAAGCCACTAGCCTTCCTTTCCCTCTTTTTTCTCCTGTGCATTGAATGATTGATTTAGCTTATCACCAAGAACTTCAACTCATTTCGTACATGTTTCATATCTTTAAATTTACAGCTGATCGGTAAAACACCGCTTGTATATCTCAACCGTGTTGTTGATGGCTGCGTGGCCCGGGTTGCTGCCAAGTTGGAGATGATGGAGCCCTGCTCCAGTGTCAAAGATAGGTAGTTTTTATTTTTAAATTATAATGATCCTAAATTTGGCCTTAAAACTTGAGTAGTGAGCATTTTCTTGCCACAAGTGAACATAGAAGCACCTTGTCGGCGTGTTCATTATTTACTCGTGTTCTTTATTATTTCCTTTCTTTTGGAAAACCATCTTTCATTAAGAAAATGAAAGAACACATCTATTTATTTACTTGTTCATATGCTTCTTGCTGTTTCTTTGTAATTGGATTTACACTTTTAGAATGTAATTCTGATAAATTTTGTTCATGCTGATATATATCATCAAGTTTCCTTCAAGTTCTACTTCCATTAGAATTTAGAATTTTCTGCAACTAGTTTGATGAGAAAAATCTTTCTTTCCTTTCAGGATTGGATATAGCATGATTTCAGATGCAGAGGAAAATGGTCTTATCACTCCGGGGAAGGTTAGCAATTTAGAATCGTCTTTTCGAAATCTAAACGAAACAGTGTTATATTTTTGTTGGTTTATTCACTTTTCTTTTTCATTTCATAGAGTGCCTTGATTGAACCTACCAGTGGTAATACTGGTATAGGCTTGGCTTTCATTGCTGCTGCCAAGGGTTACAAACTTATAATCACCATGCCTGCCTCAATGAGTCTCGAAAGAAGAATCATTCTTCGAGCTTTTGGAGCTGAACTGATTCTCACAGATCCAGCTCGAGGGATGAAAGGAGCCGTTCAGAAGGCCGAAGAAATAAAAGCAAAGACGCCAGATTCATACATCCTTCAGCAATTTGAAAACCCTGCTAACCCAAAGGTATCCTCGGAGTCGATTCAATGTGAAACATCATTTTATGTTTAAGAGATTTATGTAATATTTTCATGTTTTTTCAGATCCACTATGAGACTACTGGTCCAGAAATCTGGAGTGGTTCAGGTGGGAAGGTCGATGCACTCGTCTCTGGTATAGGGACCGGAGGTACAATAACCGGTGCAGGGAGGTATCTCAAAGAACAAAATCCTTCTATTAAGGTTTGTTATTTCTAATCATCCGAGCTCTTGCATTGCTGCTGTTATCTAAGACAACCGTTCACGGTTGCTGATGTGATTTCTTCCTTTTCGTGGTTAGTTATATGGTGTAGAACCAGTTGAAAGTGCAATTCTATCCGGAGGAAAGCCCGGTGAGTCATGGCATCCTTTCTCTTCCCCCGGAAAATGAAAATATATTTTTGGCGATATGATGTCTTGTTGGGCGATATGATATCTAGTGTGGTGTACAGAGTATATTGTTTGCAATTCATGAGGCCTATTGGAAATTCTGGATGAGCTTGTTTTGTTTTCTCACATTTTTTCTGATAAAGAAAACTTGTTTCAACTTTCAGGCCCACATAAAATCCAAGGAATAGGGGCAGGCTTCATCCCTGGAGTTTTGGATGTCCCTCTGCTGGATGAAGTTGTTCAAGTAAGGATCAAGCTTTTATTAGATTTCTTTCAATAGATATGGGAAGTGGGTGACTATCTTTGTAACGGCCCGAGCTCATCGCTAGTAGATATTGCCCTCTTTGGACTTTCCCTTTCGGTATTCCCCTGAAGGTTTCTCAAACGTGTCTGTTAGGGAGAGGTTTCCACACCCTTATAAGGAAAAAAACATTCCTTATAAGGGTGTGGAAACCTCTCTCTAGTAGACGCGTTTTAAAACCATGAGACTGATGACAATACGTAATGGGCCAAAGCAGACAATATTTGCTAGTTGTGGGCTTGGGCTTGGGCTGTTACAGATGGTATTAGAGCCAGACATCGAACGGTATGCTAGCGAGGACGTTGGGCCCCCAAAGGGTGGGGTGGATTGTGAGATTCCACACAAGAACGAAACATTCCTTATTAGGGTGTGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCGTGAGACTGACGGCGTTACGTAACGGGCCAAAGCAGACAATATCTGCTAGCGGTGAGCTTGGTCCAATTATTGGTCTAACCATAAAGCATCTCTGACTTGAATAGTTTGCAGGTATCAAGTGAAGAAGCCATCGAAACTGCAAAGCAGCTTGCATTGAAAGAAGGTCTACTGGTAGGTTGCAACATAATGAAGATAAGGCATCAAAAAAACATTTTCCAATGCTATTTGATTCTCTTTTCATCACGATCATTCGAGCCAATTAGGCTCGAGATCTGATTGTCAACACTTTTGGTTCGGTATGCTATGCTATGATGTGATAATATCGTCTACATGTAGGTTGGTATATCATCCGGTGCTGCCGCGGCCGCTGCAATTAAGGTTGCAAAGAGACCAGAAAATGCTGGAAAGCTTATTGTTGTAAGCGTTCCTTTTCTTTCTCATGTAGTATACTTCCTTTGGCATTACAACACTGCTTTAGCTTGTTTACATACATCAATTCTTCTATGATAGATATTGATGTGCTGAAGAACAAAAAAAACTACATATGTTTTAACAATTGAAACAGCTTTTAAGGATTTTTTCAACAAAAAAACGAACAAGAGTGAGTCTAGGGAGCTCGGGAAACTTCAGCAATGGAAAATAGACTGACCGAGCGAATATAACATTGTACTTAGGAATGAGAATTTGAACCTATAACTTACGAGAAGTAAGAACGACTGTCTAACTAATCTTAGTTTTGGAAATTTGAACAGGTGATCTTCCCGAGCTTTGGGGAACGATACCTCTCAACCGTGTTGTTCGAGTCTGTGAAACAAGAGACCGAGAATATGGTTTTCGAGCCATGAGAGTTTACGCAATGTTTCGTTGCTTCAAATAGATGCCACTGAGAATAAGAACTGGCTTTTATGTATTGAATGAGTATAATTGATTTAGTTTTATGTGAGAAAATAAGAAATTAACTCGTAGGCTTTTGGAGAATTTCGAACCGATTTATTAACATCTTAGTCTATATACTTTCGATAAATTTTATTAGAACAAATTTGCAATAAAATATGCAAGAGAGTTCTTGGCTCGGGTTGTGTCGGGGTTAGACATTTTGTTAATCAACACGGGATCGAGTTTGAAAAAAGCATTTTGATTTAAATAATAAAAAATAAAATGATGATTTTTAAAGTAAAATAAAATCATAAAAATAAAATAATAAGAACATTATTTATTTATTTATTTTTTTTTTTTTTTGTCGTCGGTGTCATTCAGATTCAATATTTGTTCTCGAAAATCTTTATTTTTTCAGTTGAAGCTCACTCATGTCCCGTCCTTGATAGAGAGAACTAAACGGGTTTAATCCTTTGTAATGTCTCACATTGGTTCGATAAGAGAACAAAACACTTTTTATAAGGGTGTGAAAATCTTCCCGTAATAGATGTGTTTTAGAAAAGTCCAAAGATGACAATATCTGCTAGCGATGGATCTGGGCCATTATAAATGGTATTAGAGTCAGACATCGAACTATGTGTCAATGAGGAGGCTATTCCCCAAAGGAAGTAGACACGAGACAATGTGCTAGTAAGGACGCTGGGACCCAAAGGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGTGGAAGAGTGTCAACGAGGACGTTGGGTCTCAAAGGGGGTGGATTGTGAAGACGCGTCTTCAAGCTTTGAGGGAAAGTCGAAAGAGGACAAAATCTGCTAGCGGTGGATTTGGACTGCTACATTCTTACTCTTCATTTTCACATCTGTTCAGACTAACGGGACTCTATGAGGAGAGTGAAGTCTCGAGAAGGTCACCCAGACTACCGAGACTTTACGAAGAGAGTAAAGACTAGAGAAGGTTTAGAGGAAGATGCTTAACCCTATTCATTCTCTAGTAGTAATAATAAATAAAACTTTAAATGCTACGAATATCATTTAAGAAATAATACAAAGATTATTAGATATATTTCTACAGTATTAGGGGTGAATTAGTAGTTTCGCCTTGGTTCAGTGTACGGTATTCCACAGCCGTCGATTCCTTACGAGATCCCGGCTCTATCTAGGGCTAAAATCGCGAGTACAAAATATTTTTATACTCTCAAGCGCGTCGATTATGCACAGTTTCATGTGTTTTTGTGCAACAATTTTACCCTCGATCAAATCCCAAAAACCGAGTCAAAGAGTTGGCCACAACAAAATCGAAACAGAGAGAGCCCAGTTCCGAGTCCTCTCTCTCTCTCAACGGGATTTTCTCTTTCTAGAAATATTTTTGCAACTTCCGTAGTGGAACAAGAAGGACGATACATAAAATCGTTGAATACAGCTTCAAATTCAATCCTGAAGGTTAGGCTTTCATTTCATCTATCGTAATTTAGGTGTTTTCAACTCTGATTTTTTTTTTTTGTTCTTTCTTTTGGCGGGTGTTCGTCGATATAATACGTACATTTGTTTTGGTTTGATTTTTGTTTTTCGAATTCTGGATTGTGGACTTAGTTTAAATTTTGTTCGTCGTAGGCTTTAAATTTCACTTCTGTGATCTGATTGGTGTGTTTGGCGAAGATCAGTGCTTCTTCCTTTTCCATTTAGGCATTTTTTGGGTGTTTGAGAGCTCTATGGGGTAGGAATAGTAATTTATTGTGTAACTTTTGGCGTGGTATTGAAATTTAAGATAGTTCTTGGTTAGTTCGGGATGTTAATTGGAAATCTATTAGAGATGCCCGCTGCCGTATAATCTTTGGCAATGGAAGCGATTATTGTTCATTTATGATTCATCATATCAAGAATGTGTTGAAGCTCAAGTAAGTTCAGAGTACTGAGTAATATTTTTCTCGTGAGACTACACATTCTAAGTTCTGGCTTACTAGTTATTGCTAGTTTCATGTGTTTCTAGTGAGCTCTGCAATGCACCGAGTGGGTAGTGCGGGGAATACATCCAATTCTTCTCGCCCTCGCAAAGAGAAAAGGTTGACATATGTTTTGAGTGATGCCAATGATAGTAAGGTAATGTTTTCCCCCCTTTTTTTTCTAAAAAATAATTATGTTATTATAATTGTGTTTTATTTTGGTTAAACTGATTGATTTCCTGCTTTTGTCACTGTCATCATAAGGATCATTGTTTGAATAAATGTACTTTTAATTGGACGTATATTCGTGAAGCTGTCGTGTATATTTATTGATTGCCATGCTTTTGTATGCATACACAGTACCAAAATTTGAAAAATATATGCATGAAACTTGTCTTCTTGTGGAGTATCAAGCCTTTATCCATGTTATTTTCTTTATGCTACCTAATAAACATCTTTTGCAATTGACATGAGTCCATGAATAAGCTGAAACTTCTCATGAAATCCTTCTTGTAAATTTGAAAAATATGGCTTGGAAACTGAGGGAAACACGACCTTGTTTTTCAATGCTTTAATTATATCCCACGAAGGCTTCCAATTTTGTATTGTGTCTTCTTCAGAATTTACCTATATATGTGCTAGTAGTTTAAATCTGTTTATGCACTTTTAATCTAAGAAGGACCTTATGAGAGAACTATTACGCCAAGGATTTGATTATATTGGTTTAGAGAACGTTTTTTTTTTAAATTTTTTTCCAGATACATACAACGAAGGGTGAGGTAACTAAATTGAGCTATAAGTTAGATATGTTAATAGGGTAAACCTTAGTTTTTTTATGGTTCGTAATTCAATTTTCCTTTGGATATGCTATTTGCATTTGTTTAGGCTCTTGTGAGTCTTTACAAACCTAAAATAAAATATGAATGCCTTGCACACTTGTATTTGAATGCTTTTACATCTTTCGATTTATGTAGCATTCTGCCGGTATAAATTGCTTGGCTGTGCCAAAGTCTTCTATTGATGGCTACAATTACCTTTTTACTGGGAGCCGTGATGGTACACTGAAAAGATGGTCACTAGATGAAGATGCAGCTTCCTGCTCTGCTACCTTTGAGTCTCACGTGGACTGGGTATTTCTATTTACCTTCTCCTTTCCCCATATAATTTATGCTCTTAATAGTAGTTATATTATAGATTCATTGTGCTTTCATGCCTCTTTTGGTTTCTAATAAGAAATATTTGAAATTTTCTTGAATAGGAGTAAGTATTATCCTATCGAGAAGGCAACTGTTAGTGATACTGTAAATAGTAACTCAAGCTAAAAAGTGAATAATGGAACAGCATACAAGGATGTATTACATGTAAAAGGTTAACTCATCAGCATCACTTTTCTACTTTTCAGGTTAATGATGCGGTCCTTGTAGGCAATAATAGGCTTATTTCTTGTTCTTCAGATGGCACTGTGAAGGTTATTTGTTTTTTATATTCTTTTTTTTCCCCATCAACTTCTCTTCATTTTCTTTTCCATCCAATTCTCTATATTGTTATATGGTTTGATTGACTTATTTTAATTATGTTGATCTACTCTTTTGACCTATGCTAGACATGGAACTGCTTAACTGATGGGGGTTGCACTAAGACTCTGCGTCAGCATTCTGACTATGTGACTTGTCTTGCTGCAGCAGAAAAAAATGTAATTTCTTTGCCAAACTTTGTGTAAACCTTGTGGTATGCCCTGATGAAAAGATTGCTGATTGTGAAAACCTGTAGTGAGTTTGTTGGATTTTGTTATGTTTTCTGGGACTAGTATTTTATTTTATGTGGGTTGTCTTCTGATCCTATCGTAGTATCATGCATAATAGAGCAATGTTGTTGCATCTGGAGGTCTTGGTGGGGAGGTTTTCATATGGGATCTGGAGGCTGCATATGCACCAAGTTCAAAGTCGAGCGATGCAACCGGCGATGAGTGTTCAAATGGTATCATTGCTTCAGGAAATTCACTTCCAGTTACAAGTTTACATGCTATCAACTCCAGTAACAACATTTCTACACACCCAAACCATTCTCATGGTTATGTTCCAATTGCTGCAAAAGGCCATAAGGAATCAGTTTATGCATTAGCAATGAATGATAGTGGGACGCTTCTTGTTTCTGGCGGAACTGAAAAGGTATATTCCGACTCCTACACACCCTTGTTTGATTGACATATTTAATTATCACAAGCTTAGGAGTCTATTTTCAGGTGGTTAAAATTAGTTGTTTCTTCTCTCACTTCCTCTAGGTTGTTAAATTAGTTGTTTCTTCTCACTTCCTCTAGGTTGTTCGTGTTTGGGACCCAAGAACTGGTTCAAAGACCATGAAGTTAAGAGGACATACAGATAATATTAGGGCTCTGCTTTTGGACTCGACTGGCAGGTTAGTTAGAGGGATTCTTTCTTTTCTGTTTGGACGTTTTTTTTTCAGGTGTTACACTAAAACCTAGTTATCATGACAATAAGTTTCTTGAAAGTGGTTTTTTGGGAGTTAATTTTAATTTTTTTTTAAAAAGACGATGGAAGTTTTTTTTTTTTTTGTTATAGTTTTTATAATTTTTTTTAATATATAAATCAGAGGTTCCTATTCAATAGTGATTTTAAGGACGCGTCCAAGGAGCGGTCCATTGGATAAGACCCATGTCTCGAGTAACAAGGTCACATTTTTGAAGCTTTGGTGAGCTTAATGTGAAAAACCCTTGAATGTTGATGTCTGCAAAGTCTGAGTAACACCTAGCAAAGGTCACGTGTCTGGAGCTTTGGTTGAGCTTAATGTGAAAAACCCTTGAATTTTGATGTCTGCAGAGTCTTGGCCTTAGGGTGGGCACATACTTTTTAAAATATAGTGATTATGAGTCTAAATATTTACTAGTCACTTGATGACTTCTGAAAATGAGTTATAGAGAGTCATTTGCTAAACAGCCTTACTCTTTCTCCTCTCAGGGCTATCTTTTGGAATGGGTCTCATTGATTTGTCTGTTTTATTTTTCTTTTTAGTAATTATAATAAGATTCTATTTACATGGCTGCAGGTTCTGCCTATCCGGATCTTCTGATTCCATGATCAGGTAGATGAGTCTTGCCATTTCATTTTCTGTAGTTTTTATGTAACTTGCTTCAATGTTTTTGCTTCCTTTTCTCACAAAATTATTTCAAGTATTATAATTTGGAAGCCTGTAATTCTCTCCTTGCAGACTGTGGGATCTTGGTCAACAACGATGTGTGCATTCCTATGCAGTACATACTGATTCTGTTTGGGCACTAGCAAGTACGCCATCATTTAGTTATGTTTACAGTGGTGGTAGAGATCTCTCTGTAAGTCCCTTTTTGTTGTTAGAGAATCATTTTTCTGTTTGATTTGGTTTTAATGCATAGTTTTATCTTTTAATGCCCATTTCTTTTATATTCAGCTGTATGTAACAGACTTGTCGACAAGAGAGAGTCTTTTGCTTTGTACAGGGGAGTATCCCATTCAACAATTAGCAATACATGATGAGAATATATGGGTTGCGACTACAGATTCTTCAGTCCATAGATGGCCTGCTGAAGGGCGTAACCCTCATAAGGCTTTCGAAAGAGGTGGTTCATTTGTAGCTGGAAATCTGTCATTTTCTAGGGCAAGGGCTTCCTTAGAAGGATCTACTCCTGTGAGTAAATCATCTAGTTTAAAATTTCTTGATCATATTAGTTTAAATTTTACTCTCATTTTTCCCCTTGACTAAGTTGCATATGTACATATTTCTTTTCAGGGTGTTCTTTGGTTATTTTCTTAGAAACAACTTGTCTTTTATGGTTGTCAATTGGACAGTCCTTTTGTTATTTCTCTGGCTTTTGTGAGCTTCTTGCCTACTTTTCCATTTTGTTTGTTGTGAAGAAAAGATTGCACATGTTCTGGTATTTCCACCATCCTTCCACTAAATTTCACGGTTTGTTTTTGTTGTATGCGTTTTTAATTTGTTAAAAAATCAATGGCAAAAGGTTTCATCAAGACTAGATTTTTTTTATACTGATTCTTAGATTTAGGAAACTTTATTTTTCTTCTTGTTTGTTTCGTTAAAATGATTTTCCTAAATTGTTCGTTAAATTCCTTGCATATTCTGGTTTTCCACCATCTTTGCACCAAACTCCGTAGTGTCTTTTATGGAAAAAGGGTTTAGGGTTTAATCAATATTAGATTTTTTTATGTTGGTTCTTGGATTTAGGAAACTTGATTTCTATACTGGTTTATTGTGGTAAATTGTTTTTTTTTTTTTTTTTTTTTTCTGTTTTACAGGTTCCTGTGTATAAAGAACCAACATTTACTATTTCTGGAGCTCCAGCAATTGTGCAGCATGAAATTTTAAATAATAGGAGACATATCCTGACAAAGGTAATTGTATTCCTTAAATGGTGAACTAACTATGATGAACTCTCTACCGGCATATGGCTTTTTTGTGCTATTTCTATGATTTTTTTCCTCAGTGATGTGTGATCTTCCGTCACAGGATGCTGCAGGTTCTGTGAAGCTATGGGAAGTTACCAGGGGTATTGTAATTGAGGATTATGGAAAGGTTGCACTCGTCATTCTTTTTTTGTTATAATATTCTCTGCAAAAAATGATTGAATTGCTGAAAACACATTTCAATGCTTGACGNGGGGGGGGGGGGGTTGATGGACGCATGGAAGACATTAAATTCCTCTGTGGAACTAAGTGTCATCTAATATGCTTTAGTAAAATGTTCCATTTTTATAACATAGCTTTTTTTGGTTTCAATGCAAACATTTTAGGTCTCATATGAAGAGAAAAAGGAAGAGTTATTTGAGATGGTAAGATTTCTTTTATACATTACTTATAGTTTTACTATTTCTGCTGGATGGACAGTGACACGTCGATTTTGACATGACTTCAGGTCAGTATTCCTGCATGGTTTACTGTGGATACCAGGCTTGGGAGTTTGTCAGTTCATCTGGACACTCCCCAGTGCTTTTCCGCAGAGATGTATTCAGCAGACCTTAATATTACAGGAAAACCGGAAGACGATAAGGTTAAATCATTATTTTCTTTTTCTAGTTATGTTTAGGGTCTGTCTCTTTTTTTTTTACTTCTTTCCTATCAGCTTTGTTTTTTTTTTTTTTGGATTCAAGTCTGTTTCTGCTTTGTTTTGGTTTGGACTCCTGCTTTGTTTTGGTTTCTCACCTTTTTTGTCTTCTTTTTGATTCCTTGGAGCTTTCTTCATAAGACTTCTATTAGCAATTATAGTTTATCATTGATTTGTACTTTTTGGTGAGGTTTTTTAGACTCCTTTAGAGCTTTCCCTTTGTAGTTTCAGTCATACAATGATGTTGTGTTTCTTGTTTTTGGAATAATAATCATAATGATAATAAATGAAGATCAAACCATGGGCTGTATCCTAAGAATCTTTGGAATTTATTATGAGGTTATGTATTTAATACCTTCAATCCTCTTACCTGAGGTAGTGATATTCCTTAGTTTCATGATTATCTCATCACGTCAAAGGTGCTAAAATATGCTTGCTGACTTTGAACGGTGACCAAAACAAACTTTTGGGTTAGAATCATGATTGATGGAAGAGTTAATCAATTGATAGCTTGGACGTTCTTATCTTTCCGTAAGTTATTAGTCAATTATCATTGAGTACTGAATTTTGTATTTGTATATTTTTTTTTCAGGTTAATCTAGCTCGAGAAAACCTTAAAGGACTTATGGCTCATTGGTTAGCCAAAAGAAAACAAAGATTTGGATCACAAGCTTCAGCTAATGGAGAAGTTATTTCCAGCAAGGATATTAGTGCAAGAAGTCTTTCCCATTCAAGGCTTGAAGCAGTTGATGGCAATGCTGAAAATGATTCCATGGTCTATTCTCCATTTGAATTTTCAACAGTTTCTCCTCCTTCAATCATTACGGAGGGTTCTCAAGGTGGTCCTTGGAGAAAAAATATTACTGAATTTGATGGAACAGAAGATGAGAAAGATTTTCCATGGTGGTGTTTGGACTGCGTGTTGAACAATCGTCTCCCTCCGAGAGAAAATACAAAGTAATATTCTAAATTCAGTACCCACCTTCCTGGTTATGGACTTTAAAAGTTCAAAGAAAACTTTCTTATTTTTAAAACAGATATAGTGCATGCTTAAAACCTGTACGACACCCAGTCAATTATTATTATTTATGACCAAAACTGGGGAGTGAGATACTTCCATTTTTATCCCTCTTTACTTGTAACTCTTTCTCAAATTTGTTTGGGTATTGAGATTCAACAATCTGTTTATGAGGGATTTCTTGTGGGAGGGAGTCAATGGGGGATGTTGATTAGTTGGGAGATGGTATCTAGACCGTTGGAGTTTGGGGATTTGGGTATTTGTAGCCTGAGACAGTGGAATAAGGTCATTGTAACCATGTGGTTGTGGCGCCTATTTAGATGGAGCTTAGAACTTTGTGGCACAAAGTTCTTGTGAGTGGGTTTTTGGTTGTAGGCCTTTGAGATTGCTTGAAGACTCATAGAAAGATGTTGCTTCTCAGTTCCTCCTCGAGTGTTTTGTTGGGGATGACTCCAGATATCTATTTTTGGGAGAAACGTTTCTGTCTTTCTTTCCCCCTTTTGTATCATCTTTTGGATATGAAGATAAAGTTTTTGGCGTCTATTTTTTAGGTGGGCTCCATTCTCTTGTCTTTAATTTTGGGATTTCATCATCGTCTTACTGACAGGGAGGTTGCCTCTCTACCCTAGTTTTTTGAGCAACCCTTTTGTTGTTCAAGGAGAGATTCTTGTATCTAGTCCCCTCTTCCTCCTAAGGTAGGTTTTTCTTCCTATTGACCTTGCTTTCTCCAAAGGGCTACGTTTCTTTCTTCTGTTACCCCTTCATTTTCTTCACTTAGGAAGATTTAGGTTCCCAAGAAGATTAAGTTCTTTGCCCGATAGATCCTGGATGGAAAAATGAATCACTTAAGATTGCATGTAGAAACCCTTTTTCTTCTTTTTGGGGTCTTCCCTTTGTCGAGATGAGTTGGAGAGTCTGGACTATATTCTCTAAAGTTGTTGTTATACTTAGTTTATATGGTCCAGGCTCTAGATGCTTTTGGAGTCTTGTTGGCTTGGAGCAGGGATTGTAGGACTATGATGGTGAAGGGCCTCTAATTCTATGGTATGCTTGCTTGAGAGGAATGGTAGGGCTGGATGGAGGAAGCTATGCTCCCCTCCCCCCACCCCTCTAATGCTATGGTAGGCTTGCTTGAGAGAAATTGCAGGGTTTTTAGTGGTGGAGAGAGATTCGAGTGAGAGTCGTGGGAGGTTATTAGGTTTAAAGTGGCTAATGTTTTTTATGTAATTATCAATAGGTTTATTCTTTTAGATCGATGTACTTTCTTATAGTTCTTTGAACTCCCTTCTTGTGGGACTTCTTTTTTTGCCTGGCATATACTGGGGCTGGTTTTTGAGCTTATTAATAAAGCTAGAGGTAGTAAAGACATAATGGAATATTTATTTTTTCTTGCTAATAAATCCATCTCTGTTTCCTTGCACGTGTTCTAATGGACAATACTCTCTGCAGGTGTAGCTTTTATCTACATCCTTGTGAAGGCTCATCCATTCAGATCCTAACACAAGGAAAACTAAGTGCTCCTCGTATCTTGAGAGTGAATAAGGTCAGTTTTCTATTTTGTTACTTTTTTTTAATAGAGATAAAAACATTTCATTAAAGAAATGGCAAGAGATTAATGCTCAAAAGACACAAACTTCAATAGGAATGAAAAAGAAATAGAAAAAATTATAAAGGAGAAATAAAAGCATTCCAATCGAAAAAATTATAAAGGAGAAATAAAAGCATTCCAATCTAAACAAATGTCCTGCAAAGAAATACCAGAAAGAGAACATGTATCAGTCCAGGCCCACTGCCAGCAGATATCGTTCTCTTTGGTCTTTTCTTTTCGAGCTTCCTCTCAAGTTTTTTAAAGCACATCTGCTTGGGAGAGGTTTCCACACTCTTATAAAGGGTGTTTCGTTGTCCTCTCCAACCGATGTGGGATCTCACAAAACACCAAAGAGAGGCTTTGTGATGGGATGGCTCATATCGGTTGACCCATGAACTATATTTATCTGAAAAAAATCCTTTGATTTCTCACAACTGTAAATCAGAAATCAAAGCTTTGACCCCTTTGACCCAAAGACGATGAGACCACAAAGGCAAAGGAAGTTGCTTTTGGCTGTAGAATGAAAAATCCATCGTGGTTTGTTGCCATTATTTGGTTAGAGAGGAACAGCAAATTAAGGTTGGAGAGACCTAAGGAGGAATGGATGATGAGTGGGCCCTCCGCCTTTGTTTATGTTCTCTATCTTTCTTTAGGCTTTTTTGGTTATTAAGTATTTTTATAATTACGTTTTAGATCTCATTTTGCCTGCTTGAAGTTGTTTTTGTTTTAGGCTTCATTTGAGATTGCTGTAGAGAAATCAGTTGTGACGTGAAATAAATAGTCTTCGATAAATTTGACGAAGTAAATTTGAACTAAATTGTTGTGGTTTAGTTATAATGACCGAAGTAGGCTTATTATTTCAATTATTTTAAAATTTATATAGTTCAGATTGCATTTTGTTTGATCTAAATTATTTGATAAAAGTATAATGAAAATAATTATTATAAAAAAAATTGAATCAATAAAATTTTTGTTGATATAAAAGAGAATAGATGTGGAAATGTATGTATATGGAAGGAAAAATTTGAGGTTTGTAAACGGTTAGGGAAGACATATTTTTTGGGAAGTTGAATATAGAAGTCTGTATCTATCAGCTTTTTCAGGTCAAGAGTTGATTGAGGAAGTGTAGATTGAGAAAGTGACTCTTTCATAGATCACGTCTAATTATTAGACTGTTGTTTTTTATTCTTATTTATTTTTAAAGAATTAAATGCCTCTTTTACCCGAATGACTAAATCTAGATTGATAGATAATCTAAATGATAATCCCTACATATCAAGTAACTAAATTTAATGGGATTACACATATATGATTGGACACTCCCCTTCAAGCTCAGTAATAGATATCATTATTCTGAGCTTGGATACTCATTCATGAGTTACACAAATATCTCCAAATTAAGGCTAATCCTTTGATACTCCCTCTTAAGTTGAGAATGAGTATCTATTACTCAGCTTGTACTTGATAGGTTTATGTGCAATATTGATCGCATGACTTCAATGATAGCTATGAAGTCTCCAAATAAATTTGGTCCAGGTGGATGATCAGTAGCTATGATGGTTTCGGATGAATTTGTCAATGGTAGCTATGAAGGCTCTGAGTAAATTTATAAATTTGTCAATGGTAGCTATGAAGGCTTTGAGTAAATTTGTCAATGGTAGCTATGAAGGTTTTGAATAAATTTGTCAACGGTAGCTACGAAGGCTCTAAATAAGTTTGTCAGCAGTAGCTACGAAGGCTCTGAATAAGTTTGTCAGCGGTAGCTACGAAGGCTCTGAATAAGTTTGTCAGTGGTAGCTACGAAGGCTCTGAATAAGTTTGTCAACGGTAGCTACGAAGGCTCTAAATAAGTTTGTCAACGGTAGCTATGAAGGCTTTGAATAAGTTTGTCAACGGTTGTCAATGGTGGCTACGAAGGCTCTGAATAAATTTGTCAACGGTAGCTATGTCAGCACATTTGTCAGTTGAAGCCGTGAGAGCACATAACTCATCCATACTATCCTCTTGGAGCACTCCCACAGGCGTTGCTCGCATCTCTTGTGCTCACTCCTAGAGTGTTTGGTTCAGTTGGTAACTCTACAACAGTAGGGGACTCATAATCTACTAACTAGGCTTTCTATAGTCGCCTTTCTTAAGGTAGCGTCCAATTCTTCCATTTGAGAATTGAAATTGGAGTCCATGATGGATTTAGACATCTTAGTTTTGTAGCCTCCTTTTCTTCCTACTGACGCCTTTGGTTCCCTTCATTATTGTATACTTCAAACCTGAATCTCAGAATTTGATACTTGATGAGGATGATTGGAAGTATTTTTCTAACATAATGAATTGATGAGTCTTATTTCGGAGGGAAATTTAGTATGGAGCAAAATTTATTCAAGTCACGACTGAAAAATCCCTCCTTGAACGCAGTGGGTTCGAAGAACAGTGGGTAAACCTATTTATTCAATTTGAAAAACAGCGGGTGGCTTGCTGAATGAAATTTTCTGGGCGGTGTCTTGCCGAATTAGCTTGCGATTGGGCGGTGGAAGTAAGGCAGCTTGGTCACACTGACGGCAACTACGGAGACTCCAAATAAAACTTGATCACAGATGATTGGTGGCTACAAGGCTTCAAGTTTGGGTTGGCTTTGATACATGTTGCAAGTCTAAATTAGCAGCTGTTTTTGTTCTTATTTATTTTCAGAGAATTGAAGGCAGGGAAAGACTGACTAAATCTAAATTGTAGATAATCTAAATAATAATCTATAGTACGGGATAACTAAATTTAAATAAATCCAACGTGATAAATTAGGGATTACACAAATATATACTAACTCATGAATTACACAATATCTCCAATTCAAGGCTAATCCTTCGACAATACATAATTTTGTTTTGCCAAACAACCTAGGCAGTCAAGATTGTCCTATAATTGAATTTTAGACTTACAAATGCAATATGAAATGGGGCCATAGTTTGATCCTTTTGTTTGGGATTGATTTTTGTGTATTCCCTGTTTGTTCTTTCGTTTTCTCTCTATTAAAGTTTGGTATTCTACTTTAAAAAGACCTATTGTGTCATGTAATGCCTTAGCGATGGAAGAGACTTCTTATGCATCATAGAACCATGCTATATAAATTTTCTTGTCTGTTTCTTTATCCTATCCTATATTGAAGGTCTTGGAACTGTATCCACCACATGATAAGAATGACTTTGAAATTCAGTCATAAAACATAGTATCTAGGTTAATTTGACAAACTACCTTCTATTTATATTCTAACCCCAATTGCTATAATGTATACTTCTTCGACCCCTTCAAAGAGCCAAGGGCTAGGATAAGAGAGAGAATGATTAACTATAAGGGGAACCAAAAGCAAAAACATTGAATGTTTTTGTAGTATAGATTTAATGTTTTTACAGGATTATCAACACTGTCACATCCTGTCACTGGGAGATAACTTGTAATTTTTGGTGGATTTAATTTGTAACAGCCCAAGCCCAAGCCCACCGCTAGCAGATATGGTCCTTTTTGAGCTTTCCTTTTCAAGCTTCCCCTTAAGGTTTTTAAAGCGTGTTTATTGGGGAGAGGTTTCCATACCCTTATAAAAAATGTTTCGTTCTCCTCCCCAACCAATGTGGGATCTCACAATCCATCCCCCTTTGGGACTCAGTGTCCTTGCTAGCACTTGTTCCCTTCTCCAATTGATGTGGGACCCCCCATCCACCACCCCCTTTGGGGCCCAGTATCCTTATTGACACATCGCCTCGTGTCCACCTCCCTTCGAGGCTCAGCCTCCTCGCTGGCACATCACCCGATGTCTAGCTTCGATACCATTTGTAACAGCTCAAGCCCACCGCTAACAGATATTGTCCTCTTTGGGTCTTTCCTTTTGGACTTCCTCTCAAGCTTTTTAAAACGCGTCTGCTATGGAGAGGTTTTCGTACCCTTATAAAGAATGGTTCGTTCTCCTCTCCGACCAATGTGGGTACTCACATAATTAGTTGGCAGCTGAAATTTAGTTTACCTTTGTGGAAGAACTTTATATATGATAAATTTATGTTATAATTTCATTCTGATTGAGTTTGATTTGATATAACCCATTTCCCTTCGCTATTTAACAGGTTGTAAATTATGTCGTAGAGAAGATGGTTCTTGACAAGCCATTGGATAATGTAAATCCGGACATTTCTTTTGGTCCTGGACTCTCTTCAACCGTTGGAGACGGATCATTTCGGTCTGGATTAAAGCCTTGGCAAAAGCTTAAACCTTCAATAGAAATCTTATGCAATAATCAGGTCTTGTCCCTAATGTTTATGCAATGATCTGATATTTTTTGGTGTTGTATTCTGCAGCTCTTTTTACTTCTTTCTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTAACATTACAGGTCTTGTCCCCTGATATGAGTTTAGCTACAGTTAGGAATTACATCTGGAAGAAGCCCGAGGACTTGGTCCTCAATTACAGAATGCTTCAAGGCAGATGA

mRNA sequence

ATGGTTAGATCATGTGGGCATAAATTCGACGAGGCAGTCGGAAATCCGCCGTATCCAGTAGCACCGAGACTGTCGTTCGAACAGCTACCGGAGGACTGGCAGTGCCCGACGTGTGGGGCGGCTCTAAGACGAGTTTCTTCGTATTTATTGGAGTATGGTTTGTTGCTCAAAATGGTTGATGCAAAATCTTCAATAGCTAAAGATGTTACTGAACTGATCGGTAAAACACCGCTTGTATATCTCAACCGTGTTGTTGATGGCTGCGTGGCCCGGGTTGCTGCCAAGTTGGAGATGATGGAGCCCTGCTCCAGTGTCAAAGATAGGATTGGATATAGCATGATTTCAGATGCAGAGGAAAATGGTCTTATCACTCCGGGGAAGAGTGCCTTGATTGAACCTACCAGTGGTAATACTGGTATAGGCTTGGCTTTCATTGCTGCTGCCAAGGGTTACAAACTTATAATCACCATGCCTGCCTCAATGAGTCTCGAAAGAAGAATCATTCTTCGAGCTTTTGGAGCTGAACTGATTCTCACAGATCCAGCTCGAGGGATGAAAGGAGCCGTTCAGAAGGCCGAAGAAATAAAAGCAAAGACGCCAGATTCATACATCCTTCAGCAATTTGAAAACCCTGCTAACCCAAAGATCCACTATGAGACTACTGGTCCAGAAATCTGGAGTGGTTCAGGTGGGAAGGTCGATGCACTCGTCTCTGGTATAGGGACCGGAGGTACAATAACCGGTGCAGGGAGGTATCTCAAAGAACAAAATCCTTCTATTAAGTTATATGGTGTAGAACCAGTTGAAAGTGCAATTCTATCCGGAGGAAAGCCCGGCCCACATAAAATCCAAGGAATAGGGGCAGGCTTCATCCCTGGAGTTTTGGATGTCCCTCTGCTGGATGAAGTTGTTCAAGTTGGTATATCATCCGGTGCTGCCGCGGCCGCTGCAATTAAGGTTGCAAAGAGACCAGAAAATGCTGGAAAGCTTATTGTTGTAAGCGTTCCTTTTCTTTCTCATGTAGTATACTTCCTTTGGCATTACAACACTGCTTTAGCTTGTGATCTTCCCGAGCTTTGGGGAACGATACCTCTCAACCGTGTTGTTCGAGTCTGTGAAACAAGAGACCGAGAATATGGTTTTCGAGCCATGAGAGTTTACGCAATTGGAACAAGAAGGACGATACATAAAATCGTTGAATACAGCTTCAAATTCAATCCTGAAGTGAGCTCTGCAATGCACCGAGTGGGTAGTGCGGGGAATACATCCAATTCTTCTCGCCCTCGCAAAGAGAAAAGGTTGACATATGTTTTGAGTGATGCCAATGATAGTAAGCATTCTGCCGGTATAAATTGCTTGGCTGTGCCAAAGTCTTCTATTGATGGCTACAATTACCTTTTTACTGGGAGCCGTGATGGTACACTGAAAAGATGGTCACTAGATGAAGATGCAGCTTCCTGCTCTGCTACCTTTGAGTCTCACGTGGACTGGGTTAATGATGCGGTCCTTGTAGGCAATAATAGGCTTATTTCTTGTTCTTCAGATGGCACTGTGAAGACATGGAACTGCTTAACTGATGGGGGTTGCACTAAGACTCTGCGTCAGCATTCTGACTATGTGACTTGTCTTGCTGCAGCAGAAAAAAATAGCAATGTTGTTGCATCTGGAGGTCTTGGTGGGGAGGTTTTCATATGGGATCTGGAGGCTGCATATGCACCAAGTTCAAAGTCGAGCGATGCAACCGGCGATGAGTGTTCAAATGGTATCATTGCTTCAGGAAATTCACTTCCAGTTACAAGTTTACATGCTATCAACTCCAGTAACAACATTTCTACACACCCAAACCATTCTCATGGTTATGTTCCAATTGCTGCAAAAGGCCATAAGGAATCAGTTTATGCATTAGCAATGAATGATAGTGGGACGCTTCTTGTTTCTGGCGGAACTGAAAAGGTTGTTCGTGTTTGGGACCCAAGAACTGGTTCAAAGACCATGAAGTTAAGAGGACATACAGATAATATTAGGGCTCTGCTTTTGGACTCGACTGGCAGGTTCTGCCTATCCGGATCTTCTGATTCCATGATCAGACTGTGGGATCTTGGTCAACAACGATGTGTGCATTCCTATGCAGTACATACTGATTCTGTTTGGGCACTAGCAAGTACGCCATCATTTAGTTATGTTTACAGTGGTGGTAGAGATCTCTCTCTGTATGTAACAGACTTGTCGACAAGAGAGAGTCTTTTGCTTTGTACAGGGGAGTATCCCATTCAACAATTAGCAATACATGATGAGAATATATGGGTTGCGACTACAGATTCTTCAGTCCATAGATGGCCTGCTGAAGGGCGTAACCCTCATAAGGCTTTCGAAAGAGGTGGTTCATTTGTAGCTGGAAATCTGTCATTTTCTAGGGCAAGGGCTTCCTTAGAAGGATCTACTCCTGTTCCTGTGTATAAAGAACCAACATTTACTATTTCTGGAGCTCCAGCAATTGTGCAGCATGAAATTTTAAATAATAGGAGACATATCCTGACAAAGGATGCTGCAGGTTCTGTGAAGCTATGGGAAGTTACCAGGGGTATTGTAATTGAGGATTATGGAAAGGTCTCATATGAAGAGAAAAAGGAAGAGTTATTTGAGATGGTCAGTATTCCTGCATGGTTTACTGTGGATACCAGGCTTGGGAGTTTGTCAGTTCATCTGGACACTCCCCAGTGCTTTTCCGCAGAGATGTATTCAGCAGACCTTAATATTACAGGAAAACCGGAAGACGATAAGGTTAATCTAGCTCGAGAAAACCTTAAAGGACTTATGGCTCATTGGTTAGCCAAAAGAAAACAAAGATTTGGATCACAAGCTTCAGCTAATGGAGAAGTTATTTCCAGCAAGGATATTAGTGCAAGAAGTCTTTCCCATTCAAGGCTTGAAGCAGTTGATGGCAATGCTGAAAATGATTCCATGGTCTATTCTCCATTTGAATTTTCAACAGTTTCTCCTCCTTCAATCATTACGGAGGGTTCTCAAGGTGGTCCTTGGAGAAAAAATATTACTGAATTTGATGGAACAGAAGATGAGAAAGATTTTCCATGGTGGTGTTTGGACTGCGTGTTGAACAATCGTCTCCCTCCGAGAGAAAATACAAATGGGTTCGAAGAACAGTGGGTTGTAAATTATGTCGTAGAGAAGATGGTTCTTGACAAGCCATTGGATAATGTAAATCCGGACATTTCTTTTGGTCCTGGACTCTCTTCAACCGTTGGAGACGGATCATTTCGGTCTGGATTAAAGCCTTGGCAAAAGCTTAAACCTTCAATAGAAATCTTATGCAATAATCAGGTCTTGTCCCCTGATATGAGTTTAGCTACAGTTAGGAATTACATCTGGAAGAAGCCCGAGGACTTGGTCCTCAATTACAGAATGCTTCAAGGCAGATGA

Coding sequence (CDS)

ATGGTTAGATCATGTGGGCATAAATTCGACGAGGCAGTCGGAAATCCGCCGTATCCAGTAGCACCGAGACTGTCGTTCGAACAGCTACCGGAGGACTGGCAGTGCCCGACGTGTGGGGCGGCTCTAAGACGAGTTTCTTCGTATTTATTGGAGTATGGTTTGTTGCTCAAAATGGTTGATGCAAAATCTTCAATAGCTAAAGATGTTACTGAACTGATCGGTAAAACACCGCTTGTATATCTCAACCGTGTTGTTGATGGCTGCGTGGCCCGGGTTGCTGCCAAGTTGGAGATGATGGAGCCCTGCTCCAGTGTCAAAGATAGGATTGGATATAGCATGATTTCAGATGCAGAGGAAAATGGTCTTATCACTCCGGGGAAGAGTGCCTTGATTGAACCTACCAGTGGTAATACTGGTATAGGCTTGGCTTTCATTGCTGCTGCCAAGGGTTACAAACTTATAATCACCATGCCTGCCTCAATGAGTCTCGAAAGAAGAATCATTCTTCGAGCTTTTGGAGCTGAACTGATTCTCACAGATCCAGCTCGAGGGATGAAAGGAGCCGTTCAGAAGGCCGAAGAAATAAAAGCAAAGACGCCAGATTCATACATCCTTCAGCAATTTGAAAACCCTGCTAACCCAAAGATCCACTATGAGACTACTGGTCCAGAAATCTGGAGTGGTTCAGGTGGGAAGGTCGATGCACTCGTCTCTGGTATAGGGACCGGAGGTACAATAACCGGTGCAGGGAGGTATCTCAAAGAACAAAATCCTTCTATTAAGTTATATGGTGTAGAACCAGTTGAAAGTGCAATTCTATCCGGAGGAAAGCCCGGCCCACATAAAATCCAAGGAATAGGGGCAGGCTTCATCCCTGGAGTTTTGGATGTCCCTCTGCTGGATGAAGTTGTTCAAGTTGGTATATCATCCGGTGCTGCCGCGGCCGCTGCAATTAAGGTTGCAAAGAGACCAGAAAATGCTGGAAAGCTTATTGTTGTAAGCGTTCCTTTTCTTTCTCATGTAGTATACTTCCTTTGGCATTACAACACTGCTTTAGCTTGTGATCTTCCCGAGCTTTGGGGAACGATACCTCTCAACCGTGTTGTTCGAGTCTGTGAAACAAGAGACCGAGAATATGGTTTTCGAGCCATGAGAGTTTACGCAATTGGAACAAGAAGGACGATACATAAAATCGTTGAATACAGCTTCAAATTCAATCCTGAAGTGAGCTCTGCAATGCACCGAGTGGGTAGTGCGGGGAATACATCCAATTCTTCTCGCCCTCGCAAAGAGAAAAGGTTGACATATGTTTTGAGTGATGCCAATGATAGTAAGCATTCTGCCGGTATAAATTGCTTGGCTGTGCCAAAGTCTTCTATTGATGGCTACAATTACCTTTTTACTGGGAGCCGTGATGGTACACTGAAAAGATGGTCACTAGATGAAGATGCAGCTTCCTGCTCTGCTACCTTTGAGTCTCACGTGGACTGGGTTAATGATGCGGTCCTTGTAGGCAATAATAGGCTTATTTCTTGTTCTTCAGATGGCACTGTGAAGACATGGAACTGCTTAACTGATGGGGGTTGCACTAAGACTCTGCGTCAGCATTCTGACTATGTGACTTGTCTTGCTGCAGCAGAAAAAAATAGCAATGTTGTTGCATCTGGAGGTCTTGGTGGGGAGGTTTTCATATGGGATCTGGAGGCTGCATATGCACCAAGTTCAAAGTCGAGCGATGCAACCGGCGATGAGTGTTCAAATGGTATCATTGCTTCAGGAAATTCACTTCCAGTTACAAGTTTACATGCTATCAACTCCAGTAACAACATTTCTACACACCCAAACCATTCTCATGGTTATGTTCCAATTGCTGCAAAAGGCCATAAGGAATCAGTTTATGCATTAGCAATGAATGATAGTGGGACGCTTCTTGTTTCTGGCGGAACTGAAAAGGTTGTTCGTGTTTGGGACCCAAGAACTGGTTCAAAGACCATGAAGTTAAGAGGACATACAGATAATATTAGGGCTCTGCTTTTGGACTCGACTGGCAGGTTCTGCCTATCCGGATCTTCTGATTCCATGATCAGACTGTGGGATCTTGGTCAACAACGATGTGTGCATTCCTATGCAGTACATACTGATTCTGTTTGGGCACTAGCAAGTACGCCATCATTTAGTTATGTTTACAGTGGTGGTAGAGATCTCTCTCTGTATGTAACAGACTTGTCGACAAGAGAGAGTCTTTTGCTTTGTACAGGGGAGTATCCCATTCAACAATTAGCAATACATGATGAGAATATATGGGTTGCGACTACAGATTCTTCAGTCCATAGATGGCCTGCTGAAGGGCGTAACCCTCATAAGGCTTTCGAAAGAGGTGGTTCATTTGTAGCTGGAAATCTGTCATTTTCTAGGGCAAGGGCTTCCTTAGAAGGATCTACTCCTGTTCCTGTGTATAAAGAACCAACATTTACTATTTCTGGAGCTCCAGCAATTGTGCAGCATGAAATTTTAAATAATAGGAGACATATCCTGACAAAGGATGCTGCAGGTTCTGTGAAGCTATGGGAAGTTACCAGGGGTATTGTAATTGAGGATTATGGAAAGGTCTCATATGAAGAGAAAAAGGAAGAGTTATTTGAGATGGTCAGTATTCCTGCATGGTTTACTGTGGATACCAGGCTTGGGAGTTTGTCAGTTCATCTGGACACTCCCCAGTGCTTTTCCGCAGAGATGTATTCAGCAGACCTTAATATTACAGGAAAACCGGAAGACGATAAGGTTAATCTAGCTCGAGAAAACCTTAAAGGACTTATGGCTCATTGGTTAGCCAAAAGAAAACAAAGATTTGGATCACAAGCTTCAGCTAATGGAGAAGTTATTTCCAGCAAGGATATTAGTGCAAGAAGTCTTTCCCATTCAAGGCTTGAAGCAGTTGATGGCAATGCTGAAAATGATTCCATGGTCTATTCTCCATTTGAATTTTCAACAGTTTCTCCTCCTTCAATCATTACGGAGGGTTCTCAAGGTGGTCCTTGGAGAAAAAATATTACTGAATTTGATGGAACAGAAGATGAGAAAGATTTTCCATGGTGGTGTTTGGACTGCGTGTTGAACAATCGTCTCCCTCCGAGAGAAAATACAAATGGGTTCGAAGAACAGTGGGTTGTAAATTATGTCGTAGAGAAGATGGTTCTTGACAAGCCATTGGATAATGTAAATCCGGACATTTCTTTTGGTCCTGGACTCTCTTCAACCGTTGGAGACGGATCATTTCGGTCTGGATTAAAGCCTTGGCAAAAGCTTAAACCTTCAATAGAAATCTTATGCAATAATCAGGTCTTGTCCCCTGATATGAGTTTAGCTACAGTTAGGAATTACATCTGGAAGAAGCCCGAGGACTTGGTCCTCAATTACAGAATGCTTCAAGGCAGATGA

Protein sequence

MVRSCGHKFDEAVGNPPYPVAPRLSFEQLPEDWQCPTCGAALRRVSSYLLEYGLLLKMVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAEENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELILTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVSGIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVPLLDEVVQVGISSGAAAAAAIKVAKRPENAGKLIVVSVPFLSHVVYFLWHYNTALACDLPELWGTIPLNRVVRVCETRDREYGFRAMRVYAIGTRRTIHKIVEYSFKFNPEVSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNGFEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Homology
BLAST of CmaCh12G011040 vs. ExPASy Swiss-Prot
Match: Q43317 (Cysteine synthase OS=Citrullus lanatus OX=3654 PE=1 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 2.0e-137
Identity = 258/311 (82.96%), Postives = 273/311 (87.78%), Query Frame = 0

Query: 58  MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 117
           M DAKS+IAKDVTELIG TPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA
Sbjct: 1   MADAKSTIAKDVTELIGNTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 60

Query: 118 EENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI 177
           E  GLITPG+S LIEPTSGNTGIGLAFIAAAKGY+LII MPASMSLERR ILRAFGAEL+
Sbjct: 61  ENKGLITPGESVLIEPTSGNTGIGLAFIAAAKGYRLIICMPASMSLERRTILRAFGAELV 120

Query: 178 LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV 237
           LTDPARGMKGAVQKAEEIKAKTP+SYILQQFENPANPKIHYETTGPEIW GSGGK+DALV
Sbjct: 121 LTDPARGMKGAVQKAEEIKAKTPNSYILQQFENPANPKIHYETTGPEIWRGSGGKIDALV 180

Query: 238 SGIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 297
           SGIGTGGT+TGAG+YLKEQNP+IKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV
Sbjct: 181 SGIGTGGTVTGAGKYLKEQNPNIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 240

Query: 298 PLLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP- 344
            LLDEV+Q                    VGISSGAAAAAAI++AKRPENAGKLIV   P 
Sbjct: 241 NLLDEVIQVSSEESIETAKLLALKEGLLVGISSGAAAAAAIRIAKRPENAGKLIVAVFPS 300

BLAST of CmaCh12G011040 vs. ExPASy Swiss-Prot
Match: Q00834 (Cysteine synthase OS=Spinacia oleracea OX=3562 PE=1 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 6.2e-131
Identity = 245/311 (78.78%), Postives = 268/311 (86.17%), Query Frame = 0

Query: 58  MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 117
           MV+ K+ IAKDVTELIGKTPLVYLN V DGCVARVAAKLE MEPCSSVKDRIG+SMI+DA
Sbjct: 1   MVEEKAFIAKDVTELIGKTPLVYLNTVADGCVARVAAKLEGMEPCSSVKDRIGFSMITDA 60

Query: 118 EENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI 177
           E++GLITPG+S LIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERR ILRAFGAELI
Sbjct: 61  EKSGLITPGESVLIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRTILRAFGAELI 120

Query: 178 LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV 237
           LTDPA+GMKGAVQKAEEI+ KTP+SYILQQFENPANPK+HYETTGPEIW G+GGK+D  V
Sbjct: 121 LTDPAKGMKGAVQKAEEIRDKTPNSYILQQFENPANPKVHYETTGPEIWKGTGGKIDIFV 180

Query: 238 SGIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 297
           SGIGTGGTITGAG+YLKEQNP +KL G+EPVESA+LSGGKPGPHKIQG+GAGFIPGVLDV
Sbjct: 181 SGIGTGGTITGAGKYLKEQNPDVKLIGLEPVESAVLSGGKPGPHKIQGLGAGFIPGVLDV 240

Query: 298 PLLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP- 344
            ++DEVVQ                    VGISSGAAAAAAIKVAKRPENAGKLIV   P 
Sbjct: 241 NIIDEVVQISSEESIEMAKLLALKEGLLVGISSGAAAAAAIKVAKRPENAGKLIVAVFPS 300

BLAST of CmaCh12G011040 vs. ExPASy Swiss-Prot
Match: O81154 (Cysteine synthase OS=Solanum tuberosum OX=4113 PE=2 SV=1)

HSP 1 Score: 467.2 bits (1201), Expect = 5.2e-130
Identity = 244/311 (78.46%), Postives = 265/311 (85.21%), Query Frame = 0

Query: 58  MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 117
           M   K  IAKDVTELIG TPLVYLN VVDGCVARVAAKLE MEPCSSVKDRIGYSMI+DA
Sbjct: 1   MAGEKIGIAKDVTELIGNTPLVYLNNVVDGCVARVAAKLESMEPCSSVKDRIGYSMITDA 60

Query: 118 EENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI 177
           EE GLI PG+S LIEPTSGNTG+GLAF+AAAKGYKLIITMP+SMSLERRIILR F +EL+
Sbjct: 61  EEKGLIKPGESVLIEPTSGNTGVGLAFMAAAKGYKLIITMPSSMSLERRIILRGFRSELV 120

Query: 178 LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV 237
           LTDPA+GMKGA+ KAEEIKAKTP+SYILQQFENPANPKIHYETTGPEIW GS GKVDAL 
Sbjct: 121 LTDPAKGMKGAISKAEEIKAKTPNSYILQQFENPANPKIHYETTGPEIWKGSNGKVDALA 180

Query: 238 SGIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 297
           SGIGTGGTITG+G+YL+EQNP++KLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVL+V
Sbjct: 181 SGIGTGGTITGSGKYLREQNPNVKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLEV 240

Query: 298 PLLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP- 344
            L+D+VVQ                    VGISSGAAAAAAIKVAKRPENAGKLIVV  P 
Sbjct: 241 NLIDDVVQVSSEESIEMAKLLALKEGLLVGISSGAAAAAAIKVAKRPENAGKLIVVIFPS 300

BLAST of CmaCh12G011040 vs. ExPASy Swiss-Prot
Match: Q9XEA6 (Cysteine synthase OS=Oryza sativa subsp. japonica OX=39947 GN=RCS1 PE=2 SV=2)

HSP 1 Score: 455.7 bits (1171), Expect = 1.6e-126
Identity = 235/305 (77.05%), Postives = 260/305 (85.25%), Query Frame = 0

Query: 64  SIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAEENGLI 123
           +IAKDVTELIG TPLVYLNRV DGCV RVAAKLE MEPCSSVKDRIGYSMI+DAEE GLI
Sbjct: 4   TIAKDVTELIGNTPLVYLNRVTDGCVGRVAAKLESMEPCSSVKDRIGYSMITDAEEKGLI 63

Query: 124 TPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELILTDPAR 183
           TPGKS LIEPTSGNTGIGLAF+AAAKGY+L++TMPASMS+ERRIIL+AFGAELILTDP  
Sbjct: 64  TPGKSVLIEPTSGNTGIGLAFMAAAKGYRLVLTMPASMSMERRIILKAFGAELILTDPLL 123

Query: 184 GMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVSGIGTG 243
           GMKGAVQKAEE+ AKT +S+ILQQFENPANPKIHYETTGPEIW G+GGKVD LVSGIGTG
Sbjct: 124 GMKGAVQKAEELAAKTNNSFILQQFENPANPKIHYETTGPEIWKGTGGKVDGLVSGIGTG 183

Query: 244 GTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVPLLDEV 303
           GTITGAGRYL+EQNP IK+YGVEPVESA+LSGGKPGPHKIQGIGAGF+PGVLDV L++E 
Sbjct: 184 GTITGAGRYLREQNPDIKIYGVEPVESAVLSGGKPGPHKIQGIGAGFVPGVLDVDLINET 243

Query: 304 VQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP-----FL 344
           VQ                    VGISSGAAAAAA+++A+RPEN GKL VV  P     +L
Sbjct: 244 VQVSSDEAIEMAKALALKEGLLVGISSGAAAAAAVRLAQRPENEGKLFVVVFPSFGERYL 303

BLAST of CmaCh12G011040 vs. ExPASy Swiss-Prot
Match: P80608 (Cysteine synthase OS=Zea mays OX=4577 PE=1 SV=2)

HSP 1 Score: 454.5 bits (1168), Expect = 3.5e-126
Identity = 235/311 (75.56%), Postives = 260/311 (83.60%), Query Frame = 0

Query: 58  MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 117
           M +A  SIAKDVTELIG TPLVYLN+V DGCV R  AKLE MEPCSSVKDRIGYSMI+DA
Sbjct: 1   MGEASPSIAKDVTELIGNTPLVYLNKVTDGCVGRSRAKLESMEPCSSVKDRIGYSMITDA 60

Query: 118 EENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI 177
           EE GLITPG S LIEPTSGNTGIGLAF+AAAKGYKL +TMPASMS+ERRIIL+AFGAEL+
Sbjct: 61  EEKGLITPGVSVLIEPTSGNTGIGLAFMAAAKGYKLTLTMPASMSMERRIILKAFGAELV 120

Query: 178 LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV 237
           LTDP  GMKGAV+KAEEI+AKTP+SYILQQFENPANPKIHYETTGPEIW  + GK+D LV
Sbjct: 121 LTDPLLGMKGAVKKAEEIQAKTPNSYILQQFENPANPKIHYETTGPEIWKATAGKIDGLV 180

Query: 238 SGIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 297
           SGIGTGGTITG GRYL+EQNP++KLYGVEPVESA+L+GGKPGPHKIQGIGAGFIPGVLDV
Sbjct: 181 SGIGTGGTITGTGRYLREQNPNVKLYGVEPVESAVLNGGKPGPHKIQGIGAGFIPGVLDV 240

Query: 298 PLLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP- 344
            LLDE +Q                    VGISSGAAAAAA+++AKRPENAGKL VV  P 
Sbjct: 241 DLLDETLQVSSDEAIETAKALALKEGLLVGISSGAAAAAAVRLAKRPENAGKLFVVVFPS 300

BLAST of CmaCh12G011040 vs. ExPASy TrEMBL
Match: A0A5J5T780 (Cysteine synthase OS=Gossypium barbadense OX=3634 GN=ES319_A13G191000v1 PE=3 SV=1)

HSP 1 Score: 1585.5 bits (4104), Expect = 0.0e+00
Identity = 809/1138 (71.09%), Postives = 904/1138 (79.44%), Query Frame = 0

Query: 59   VDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAE 118
            ++ K +I KDVTELIG TP+VYLN +VDGC AR+AAKLE+MEPCSSVKDRI YSMI DAE
Sbjct: 1    MEDKYAIKKDVTELIGNTPMVYLNNIVDGCGARIAAKLELMEPCSSVKDRIAYSMIKDAE 60

Query: 119  ENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELIL 178
            + GLIT GKS LI+ TSGNTGI +AFI AA+GYK+I+TMPA MS+ERRI+LRAFGAE+ L
Sbjct: 61   DKGLITSGKSVLIDTTSGNTGIAMAFIGAARGYKVIVTMPAYMSIERRIVLRAFGAEVRL 120

Query: 179  TDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVS 238
            TDPA+G KG++ KA EI   T + Y+L+QFENP+NPKIHYETTGPEIW  S GKVD LV+
Sbjct: 121  TDPAKGFKGSLDKALEILKNTRNGYMLRQFENPSNPKIHYETTGPEIWKDSEGKVDVLVA 180

Query: 239  GIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVP 298
            GIGTGGT+TGAGR+LKE+N  IK+YGVEPVESA+L+GGKPG H IQGIGAG IP VLDV 
Sbjct: 181  GIGTGGTVTGAGRFLKEKNSKIKVYGVEPVESAVLNGGKPGSHLIQGIGAGIIPDVLDVG 240

Query: 299  LLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVPFL 358
            LLDEVVQ                    VGISSGAAAAAAIK+AKRPEN+GKLIVV  P  
Sbjct: 241  LLDEVVQISSEEAIETAKLLALKEGLLVGISSGAAAAAAIKLAKRPENSGKLIVVIFPSA 300

Query: 359  SHVVYFLWHYNTALACDLPELWGTIPLNRVVRVCETRDREYGFRAMRVYAIGTRRTIHKI 418
                                                                        
Sbjct: 301  G----------------------------------------------------------- 360

Query: 419  VEYSFKFNPEVSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKS 478
                        +AMHRVGSAGN SNSSRPRKEKRLTYVL+D++D+KHSAGINCLAV KS
Sbjct: 361  -----------LTAMHRVGSAGNNSNSSRPRKEKRLTYVLNDSDDTKHSAGINCLAVLKS 420

Query: 479  SI-DGYNYLFTGSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGT 538
            S+ DG NYLFTGSRDGTLKRW+L EDAA+CSATFESHVDWVND V+ G N L+SCSSD T
Sbjct: 421  SVSDGCNYLFTGSRDGTLKRWALAEDAATCSATFESHVDWVNDTVIAGENTLVSCSSDTT 480

Query: 539  VKTWNCLTDGGCTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKS 598
            +K WNCL+DG CT+TLRQHSDYVTCLAAAE+N+NVVASGGLGGEVF+WD+EAA  P SKS
Sbjct: 481  LKIWNCLSDGTCTRTLRQHSDYVTCLAAAERNANVVASGGLGGEVFVWDIEAAVTPLSKS 540

Query: 599  SDATGDECSNGIIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAM 658
            SD   D+ SNGI  S NSLPV+SL  I+S+N+I+ H     GYVPIAAKGHKESVYALAM
Sbjct: 541  SDVMEDDFSNGINGSANSLPVSSLRPISSNNSITAHTTQCPGYVPIAAKGHKESVYALAM 600

Query: 659  NDSGTLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRL 718
            ND+G+LLVSGGTEKVVRVWDPRTGSKTMKLRGHTDN+RALLLDSTGR+CLSGSSDSMIRL
Sbjct: 601  NDNGSLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNVRALLLDSTGRYCLSGSSDSMIRL 660

Query: 719  WDLGQQRCVHSYAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPI 778
            WDLGQQRCVHSYAVHTDSVWALASTP+F++VYSGGRDLSLY+TDL+TRESLLLCT E+P+
Sbjct: 661  WDLGQQRCVHSYAVHTDSVWALASTPTFTHVYSGGRDLSLYLTDLTTRESLLLCTKEHPV 720

Query: 779  QQLAIHDENIWVATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVP 838
             QLA+HD++IWVATTDSSVHRWPAEGRNP K F+RGGSF+AGNLSFSRAR SLEGSTP  
Sbjct: 721  LQLALHDDSIWVATTDSSVHRWPAEGRNPQKVFQRGGSFLAGNLSFSRARVSLEGSTPAA 780

Query: 839  VYKEPTFTISGAPAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKE 898
            VYKEP F+I G PAIVQHEILNNRRH+LTKD AGSVKLWE+TRG+V+EDYG+VS++EKK+
Sbjct: 781  VYKEPIFSIPGTPAIVQHEILNNRRHVLTKDTAGSVKLWEITRGVVVEDYGQVSFDEKKQ 840

Query: 899  ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKG 958
            ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYS DLNITGKPEDDKVNLARE LKG
Sbjct: 841  ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSVDLNITGKPEDDKVNLARETLKG 900

Query: 959  LMAHWLAKRKQRFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFST 1018
            L+AHW+ KR+QR GSQASANG+V+S KD +ARSL+HSR+E VDGNAENDSMV+ PFEFS 
Sbjct: 901  LLAHWMTKRRQRLGSQASANGDVLSGKDNTARSLAHSRIE-VDGNAENDSMVHPPFEFSM 960

Query: 1019 VSPPSIITEGSQGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT--------- 1078
            VSPPSII+EGSQGGPWRK ITE DGTEDEKDFPWW LDCVLNNRLPPRENT         
Sbjct: 961  VSPPSIISEGSQGGPWRKKITELDGTEDEKDFPWWVLDCVLNNRLPPRENTKCSFYLHPC 1020

Query: 1079 NGFEEQ----------------WVVNYVVEKMVLDKPLDNVNPDISFGPG-----LSSTV 1138
             G   Q                 VVNYVVEKMVLDKP+D  N D S  PG       S V
Sbjct: 1021 EGTAVQILTQGKLSAPRILRINKVVNYVVEKMVLDKPIDTGNTDGSLAPGHGGQLQHSAV 1067

Query: 1139 GDGSFRSGLKPWQKLKPSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
             DGSF+SGLKPW K +PS+EILCNNQVLS DMSLATVR YIWKKPEDLVLNYR+ QGR
Sbjct: 1081 VDGSFKSGLKPWPKPRPSVEILCNNQVLSTDMSLATVRAYIWKKPEDLVLNYRVAQGR 1067

BLAST of CmaCh12G011040 vs. ExPASy TrEMBL
Match: A0A6J1HP49 (WD repeat-containing protein 48-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465442 PE=3 SV=1)

HSP 1 Score: 1468.4 bits (3800), Expect = 0.0e+00
Identity = 729/762 (95.67%), Postives = 730/762 (95.80%), Query Frame = 0

Query: 409  VSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 468
            +SSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT
Sbjct: 1    MSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 60

Query: 469  GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 528
            GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG
Sbjct: 61   GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 120

Query: 529  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG 588
            CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG
Sbjct: 121  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG 180

Query: 589  IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 648
            IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG
Sbjct: 181  IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 240

Query: 649  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 708
            TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS
Sbjct: 241  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 300

Query: 709  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 768
            YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW
Sbjct: 301  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 360

Query: 769  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 828
            VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG
Sbjct: 361  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 420

Query: 829  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 888
            APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW
Sbjct: 421  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 480

Query: 889  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ 948
            FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ
Sbjct: 481  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ 540

Query: 949  RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 1008
            RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS
Sbjct: 541  RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 600

Query: 1009 QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG------------------ 1068
            QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                    
Sbjct: 601  QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQG 660

Query: 1069 -------FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 1128
                        VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK
Sbjct: 661  KLSAPRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 720

Query: 1129 PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 762

BLAST of CmaCh12G011040 vs. ExPASy TrEMBL
Match: A0A6J1HKR1 (WD repeat-containing protein 48-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465442 PE=3 SV=1)

HSP 1 Score: 1463.7 bits (3788), Expect = 0.0e+00
Identity = 726/758 (95.78%), Postives = 726/758 (95.78%), Query Frame = 0

Query: 413  MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD 472
            MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD
Sbjct: 1    MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD 60

Query: 473  GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT 532
            GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT
Sbjct: 61   GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT 120

Query: 533  LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS 592
            LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS
Sbjct: 121  LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS 180

Query: 593  GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV 652
            GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV
Sbjct: 181  GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV 240

Query: 653  VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH 712
            VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH
Sbjct: 241  VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH 300

Query: 713  TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT 772
            TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT
Sbjct: 301  TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT 360

Query: 773  DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI 832
            DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI
Sbjct: 361  DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI 420

Query: 833  VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD 892
            VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD
Sbjct: 421  VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD 480

Query: 893  TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS 952
            TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS
Sbjct: 481  TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS 540

Query: 953  QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP 1012
            QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP
Sbjct: 541  QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP 600

Query: 1013 WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG---------------------- 1072
            WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                        
Sbjct: 601  WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQGKLSA 660

Query: 1073 ---FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE 1132
                    VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE
Sbjct: 661  PRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE 720

Query: 1133 ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 758

BLAST of CmaCh12G011040 vs. ExPASy TrEMBL
Match: A0A6J1FHE9 (WD repeat-containing protein 48-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443999 PE=3 SV=1)

HSP 1 Score: 1455.3 bits (3766), Expect = 0.0e+00
Identity = 722/762 (94.75%), Postives = 726/762 (95.28%), Query Frame = 0

Query: 409  VSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 468
            +SSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT
Sbjct: 1    MSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 60

Query: 469  GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 528
            GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG
Sbjct: 61   GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 120

Query: 529  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG 588
            CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPS KSSDATGDECSNG
Sbjct: 121  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSLKSSDATGDECSNG 180

Query: 589  IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 648
            IIASGNSLPVTSLHAI+SSNNISTH NHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG
Sbjct: 181  IIASGNSLPVTSLHAISSSNNISTHSNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 240

Query: 649  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 708
            TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS
Sbjct: 241  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 300

Query: 709  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 768
            YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW
Sbjct: 301  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 360

Query: 769  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 828
            VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG
Sbjct: 361  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 420

Query: 829  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 888
            APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW
Sbjct: 421  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 480

Query: 889  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ 948
            FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARE LKGLMAHWL KRKQ
Sbjct: 481  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARETLKGLMAHWLGKRKQ 540

Query: 949  RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 1008
            RFGSQASANGEV+SSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS
Sbjct: 541  RFGSQASANGEVLSSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 600

Query: 1009 QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG------------------ 1068
            QGGPWR+NITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                    
Sbjct: 601  QGGPWRRNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQG 660

Query: 1069 -------FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 1128
                        VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK
Sbjct: 661  KLSAPRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 720

Query: 1129 PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 762

BLAST of CmaCh12G011040 vs. ExPASy TrEMBL
Match: A0A6J1FAZ8 (WD repeat-containing protein 48-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111443999 PE=3 SV=1)

HSP 1 Score: 1450.6 bits (3754), Expect = 0.0e+00
Identity = 719/758 (94.85%), Postives = 722/758 (95.25%), Query Frame = 0

Query: 413  MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD 472
            MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD
Sbjct: 1    MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD 60

Query: 473  GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT 532
            GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT
Sbjct: 61   GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT 120

Query: 533  LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS 592
            LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPS KSSDATGDECSNGIIAS
Sbjct: 121  LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSLKSSDATGDECSNGIIAS 180

Query: 593  GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV 652
            GNSLPVTSLHAI+SSNNISTH NHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV
Sbjct: 181  GNSLPVTSLHAISSSNNISTHSNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV 240

Query: 653  VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH 712
            VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH
Sbjct: 241  VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH 300

Query: 713  TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT 772
            TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT
Sbjct: 301  TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT 360

Query: 773  DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI 832
            DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI
Sbjct: 361  DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI 420

Query: 833  VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD 892
            VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD
Sbjct: 421  VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD 480

Query: 893  TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS 952
            TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARE LKGLMAHWL KRKQRFGS
Sbjct: 481  TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARETLKGLMAHWLGKRKQRFGS 540

Query: 953  QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP 1012
            QASANGEV+SSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP
Sbjct: 541  QASANGEVLSSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP 600

Query: 1013 WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG---------------------- 1072
            WR+NITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                        
Sbjct: 601  WRRNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQGKLSA 660

Query: 1073 ---FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE 1132
                    VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE
Sbjct: 661  PRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE 720

Query: 1133 ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 758

BLAST of CmaCh12G011040 vs. NCBI nr
Match: KAG6586292.1 (hypothetical protein SDJN03_19025, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1963.3 bits (5085), Expect = 0.0e+00
Identity = 1005/1133 (88.70%), Postives = 1017/1133 (89.76%), Query Frame = 0

Query: 58   MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 117
            MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA
Sbjct: 1    MVDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDA 60

Query: 118  EENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI 177
            E+NGLITPGKS LIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI
Sbjct: 61   EKNGLITPGKSVLIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELI 120

Query: 178  LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV 237
            LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV
Sbjct: 121  LTDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALV 180

Query: 238  SGIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 297
            SGIGTGGT+TGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV
Sbjct: 181  SGIGTGGTVTGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDV 240

Query: 298  PLLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVPF 357
            PLLDEVVQ                    VGISSGAAAAAAIKVAKRPENAGKLIVV  P 
Sbjct: 241  PLLDEVVQVSSDEAIETAKQLALKEGLLVGISSGAAAAAAIKVAKRPENAGKLIVVIFP- 300

Query: 358  LSHVVYFLWHYNTALACDLPELWGTIPLNRVVRVCETRDREYGFRAMRVYAIGTRRTIHK 417
                                                     +G R +         ++ +
Sbjct: 301  ----------------------------------------SFGERYLSTVLF---ESVKQ 360

Query: 418  IVEYSFKFNPEVSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPK 477
              E   +FNPEVSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPK
Sbjct: 361  ETENMLQFNPEVSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPK 420

Query: 478  SSIDGYNYLFTGSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGT 537
            SSIDGYNYLFTGSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGT
Sbjct: 421  SSIDGYNYLFTGSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGT 480

Query: 538  VKTWNCLTDGGCTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKS 597
            VKTWNCLTDGGCTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKS
Sbjct: 481  VKTWNCLTDGGCTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKS 540

Query: 598  SDATGDECSNGIIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAM 657
            SDATGDECSNGIIASGNSLPVTSLHAI+SSNNISTH NHSHGYVPIAAKGHKESVYALAM
Sbjct: 541  SDATGDECSNGIIASGNSLPVTSLHAISSSNNISTHSNHSHGYVPIAAKGHKESVYALAM 600

Query: 658  NDSGTLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRL 717
            NDSGTLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRL
Sbjct: 601  NDSGTLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRL 660

Query: 718  WDLGQQRCVHSYAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPI 777
            WDLGQQRCVHSYAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPI
Sbjct: 661  WDLGQQRCVHSYAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPI 720

Query: 778  QQLAIHDENIWVATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVP 837
            QQLAIHDENIWVATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVP
Sbjct: 721  QQLAIHDENIWVATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVP 780

Query: 838  VYKEPTFTISGAPAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKE 897
            VYKEPTFTISGAPAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKE
Sbjct: 781  VYKEPTFTISGAPAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKE 840

Query: 898  ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKG 957
            ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARE LKG
Sbjct: 841  ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARETLKG 900

Query: 958  LMAHWLAKRKQRFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFST 1017
            LMAHWLAKRKQRFGSQASANGEV+SSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFST
Sbjct: 901  LMAHWLAKRKQRFGSQASANGEVLSSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFST 960

Query: 1018 VSPPSIITEGSQGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG------- 1077
            VSPPSIITEGSQGGPWR+NITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT         
Sbjct: 961  VSPPSIITEGSQGGPWRRNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPC 1020

Query: 1078 ------------------FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSF 1137
                                   VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSF
Sbjct: 1021 EGSSIQILTQGKLSAPRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSF 1080

Query: 1138 RSGLKPWQKLKPSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            RSGLKPWQKLKPSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNY+MLQGR
Sbjct: 1081 RSGLKPWQKLKPSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYKMLQGR 1089

BLAST of CmaCh12G011040 vs. NCBI nr
Match: KAB2049639.1 (hypothetical protein ES319_A13G191000v1 [Gossypium barbadense])

HSP 1 Score: 1585.5 bits (4104), Expect = 0.0e+00
Identity = 809/1138 (71.09%), Postives = 904/1138 (79.44%), Query Frame = 0

Query: 59   VDAKSSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAE 118
            ++ K +I KDVTELIG TP+VYLN +VDGC AR+AAKLE+MEPCSSVKDRI YSMI DAE
Sbjct: 1    MEDKYAIKKDVTELIGNTPMVYLNNIVDGCGARIAAKLELMEPCSSVKDRIAYSMIKDAE 60

Query: 119  ENGLITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELIL 178
            + GLIT GKS LI+ TSGNTGI +AFI AA+GYK+I+TMPA MS+ERRI+LRAFGAE+ L
Sbjct: 61   DKGLITSGKSVLIDTTSGNTGIAMAFIGAARGYKVIVTMPAYMSIERRIVLRAFGAEVRL 120

Query: 179  TDPARGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVS 238
            TDPA+G KG++ KA EI   T + Y+L+QFENP+NPKIHYETTGPEIW  S GKVD LV+
Sbjct: 121  TDPAKGFKGSLDKALEILKNTRNGYMLRQFENPSNPKIHYETTGPEIWKDSEGKVDVLVA 180

Query: 239  GIGTGGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVP 298
            GIGTGGT+TGAGR+LKE+N  IK+YGVEPVESA+L+GGKPG H IQGIGAG IP VLDV 
Sbjct: 181  GIGTGGTVTGAGRFLKEKNSKIKVYGVEPVESAVLNGGKPGSHLIQGIGAGIIPDVLDVG 240

Query: 299  LLDEVVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVPFL 358
            LLDEVVQ                    VGISSGAAAAAAIK+AKRPEN+GKLIVV  P  
Sbjct: 241  LLDEVVQISSEEAIETAKLLALKEGLLVGISSGAAAAAAIKLAKRPENSGKLIVVIFPSA 300

Query: 359  SHVVYFLWHYNTALACDLPELWGTIPLNRVVRVCETRDREYGFRAMRVYAIGTRRTIHKI 418
                                                                        
Sbjct: 301  G----------------------------------------------------------- 360

Query: 419  VEYSFKFNPEVSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKS 478
                        +AMHRVGSAGN SNSSRPRKEKRLTYVL+D++D+KHSAGINCLAV KS
Sbjct: 361  -----------LTAMHRVGSAGNNSNSSRPRKEKRLTYVLNDSDDTKHSAGINCLAVLKS 420

Query: 479  SI-DGYNYLFTGSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGT 538
            S+ DG NYLFTGSRDGTLKRW+L EDAA+CSATFESHVDWVND V+ G N L+SCSSD T
Sbjct: 421  SVSDGCNYLFTGSRDGTLKRWALAEDAATCSATFESHVDWVNDTVIAGENTLVSCSSDTT 480

Query: 539  VKTWNCLTDGGCTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKS 598
            +K WNCL+DG CT+TLRQHSDYVTCLAAAE+N+NVVASGGLGGEVF+WD+EAA  P SKS
Sbjct: 481  LKIWNCLSDGTCTRTLRQHSDYVTCLAAAERNANVVASGGLGGEVFVWDIEAAVTPLSKS 540

Query: 599  SDATGDECSNGIIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAM 658
            SD   D+ SNGI  S NSLPV+SL  I+S+N+I+ H     GYVPIAAKGHKESVYALAM
Sbjct: 541  SDVMEDDFSNGINGSANSLPVSSLRPISSNNSITAHTTQCPGYVPIAAKGHKESVYALAM 600

Query: 659  NDSGTLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRL 718
            ND+G+LLVSGGTEKVVRVWDPRTGSKTMKLRGHTDN+RALLLDSTGR+CLSGSSDSMIRL
Sbjct: 601  NDNGSLLVSGGTEKVVRVWDPRTGSKTMKLRGHTDNVRALLLDSTGRYCLSGSSDSMIRL 660

Query: 719  WDLGQQRCVHSYAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPI 778
            WDLGQQRCVHSYAVHTDSVWALASTP+F++VYSGGRDLSLY+TDL+TRESLLLCT E+P+
Sbjct: 661  WDLGQQRCVHSYAVHTDSVWALASTPTFTHVYSGGRDLSLYLTDLTTRESLLLCTKEHPV 720

Query: 779  QQLAIHDENIWVATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVP 838
             QLA+HD++IWVATTDSSVHRWPAEGRNP K F+RGGSF+AGNLSFSRAR SLEGSTP  
Sbjct: 721  LQLALHDDSIWVATTDSSVHRWPAEGRNPQKVFQRGGSFLAGNLSFSRARVSLEGSTPAA 780

Query: 839  VYKEPTFTISGAPAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKE 898
            VYKEP F+I G PAIVQHEILNNRRH+LTKD AGSVKLWE+TRG+V+EDYG+VS++EKK+
Sbjct: 781  VYKEPIFSIPGTPAIVQHEILNNRRHVLTKDTAGSVKLWEITRGVVVEDYGQVSFDEKKQ 840

Query: 899  ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKG 958
            ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYS DLNITGKPEDDKVNLARE LKG
Sbjct: 841  ELFEMVSIPAWFTVDTRLGSLSVHLDTPQCFSAEMYSVDLNITGKPEDDKVNLARETLKG 900

Query: 959  LMAHWLAKRKQRFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFST 1018
            L+AHW+ KR+QR GSQASANG+V+S KD +ARSL+HSR+E VDGNAENDSMV+ PFEFS 
Sbjct: 901  LLAHWMTKRRQRLGSQASANGDVLSGKDNTARSLAHSRIE-VDGNAENDSMVHPPFEFSM 960

Query: 1019 VSPPSIITEGSQGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT--------- 1078
            VSPPSII+EGSQGGPWRK ITE DGTEDEKDFPWW LDCVLNNRLPPRENT         
Sbjct: 961  VSPPSIISEGSQGGPWRKKITELDGTEDEKDFPWWVLDCVLNNRLPPRENTKCSFYLHPC 1020

Query: 1079 NGFEEQ----------------WVVNYVVEKMVLDKPLDNVNPDISFGPG-----LSSTV 1138
             G   Q                 VVNYVVEKMVLDKP+D  N D S  PG       S V
Sbjct: 1021 EGTAVQILTQGKLSAPRILRINKVVNYVVEKMVLDKPIDTGNTDGSLAPGHGGQLQHSAV 1067

Query: 1139 GDGSFRSGLKPWQKLKPSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
             DGSF+SGLKPW K +PS+EILCNNQVLS DMSLATVR YIWKKPEDLVLNYR+ QGR
Sbjct: 1081 VDGSFKSGLKPWPKPRPSVEILCNNQVLSTDMSLATVRAYIWKKPEDLVLNYRVAQGR 1067

BLAST of CmaCh12G011040 vs. NCBI nr
Match: XP_022965590.1 (WD repeat-containing protein 48-like isoform X1 [Cucurbita maxima] >XP_022965591.1 WD repeat-containing protein 48-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1468.4 bits (3800), Expect = 0.0e+00
Identity = 729/762 (95.67%), Postives = 730/762 (95.80%), Query Frame = 0

Query: 409  VSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 468
            +SSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT
Sbjct: 1    MSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 60

Query: 469  GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 528
            GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG
Sbjct: 61   GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 120

Query: 529  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG 588
            CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG
Sbjct: 121  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG 180

Query: 589  IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 648
            IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG
Sbjct: 181  IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 240

Query: 649  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 708
            TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS
Sbjct: 241  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 300

Query: 709  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 768
            YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW
Sbjct: 301  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 360

Query: 769  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 828
            VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG
Sbjct: 361  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 420

Query: 829  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 888
            APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW
Sbjct: 421  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 480

Query: 889  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ 948
            FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ
Sbjct: 481  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ 540

Query: 949  RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 1008
            RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS
Sbjct: 541  RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 600

Query: 1009 QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG------------------ 1068
            QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                    
Sbjct: 601  QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQG 660

Query: 1069 -------FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 1128
                        VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK
Sbjct: 661  KLSAPRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 720

Query: 1129 PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 762

BLAST of CmaCh12G011040 vs. NCBI nr
Match: XP_022965592.1 (WD repeat-containing protein 48-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 1463.7 bits (3788), Expect = 0.0e+00
Identity = 726/758 (95.78%), Postives = 726/758 (95.78%), Query Frame = 0

Query: 413  MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD 472
            MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD
Sbjct: 1    MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFTGSRD 60

Query: 473  GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT 532
            GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT
Sbjct: 61   GTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTKT 120

Query: 533  LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS 592
            LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS
Sbjct: 121  LRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIAS 180

Query: 593  GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV 652
            GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV
Sbjct: 181  GNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEKV 240

Query: 653  VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH 712
            VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH
Sbjct: 241  VRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAVH 300

Query: 713  TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT 772
            TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT
Sbjct: 301  TDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVATT 360

Query: 773  DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI 832
            DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI
Sbjct: 361  DSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPAI 420

Query: 833  VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD 892
            VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD
Sbjct: 421  VQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTVD 480

Query: 893  TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS 952
            TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS
Sbjct: 481  TRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFGS 540

Query: 953  QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP 1012
            QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP
Sbjct: 541  QASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGGP 600

Query: 1013 WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG---------------------- 1072
            WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                        
Sbjct: 601  WRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQGKLSA 660

Query: 1073 ---FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE 1132
                    VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE
Sbjct: 661  PRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLKPSIE 720

Query: 1133 ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  ILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 758

BLAST of CmaCh12G011040 vs. NCBI nr
Match: XP_022937659.1 (WD repeat-containing protein 48-like isoform X1 [Cucurbita moschata] >XP_022937660.1 WD repeat-containing protein 48-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1455.3 bits (3766), Expect = 0.0e+00
Identity = 722/762 (94.75%), Postives = 726/762 (95.28%), Query Frame = 0

Query: 409  VSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 468
            +SSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT
Sbjct: 1    MSSAMHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSIDGYNYLFT 60

Query: 469  GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 528
            GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG
Sbjct: 61   GSRDGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGG 120

Query: 529  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNG 588
            CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPS KSSDATGDECSNG
Sbjct: 121  CTKTLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSLKSSDATGDECSNG 180

Query: 589  IIASGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 648
            IIASGNSLPVTSLHAI+SSNNISTH NHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG
Sbjct: 181  IIASGNSLPVTSLHAISSSNNISTHSNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGG 240

Query: 649  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 708
            TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS
Sbjct: 241  TEKVVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHS 300

Query: 709  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 768
            YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW
Sbjct: 301  YAVHTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIW 360

Query: 769  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 828
            VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG
Sbjct: 361  VATTDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISG 420

Query: 829  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 888
            APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW
Sbjct: 421  APAIVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAW 480

Query: 889  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQ 948
            FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARE LKGLMAHWL KRKQ
Sbjct: 481  FTVDTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARETLKGLMAHWLGKRKQ 540

Query: 949  RFGSQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 1008
            RFGSQASANGEV+SSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS
Sbjct: 541  RFGSQASANGEVLSSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGS 600

Query: 1009 QGGPWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTNG------------------ 1068
            QGGPWR+NITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT                    
Sbjct: 601  QGGPWRRNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENTKCSFYLHPCEGSSIQILTQG 660

Query: 1069 -------FEEQWVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 1128
                        VVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK
Sbjct: 661  KLSAPRILRVNKVVNYVVEKMVLDKPLDNVNPDISFGPGLSSTVGDGSFRSGLKPWQKLK 720

Query: 1129 PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR
Sbjct: 721  PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 762

BLAST of CmaCh12G011040 vs. TAIR 10
Match: AT3G05090.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 1065.4 bits (2754), Expect = 3.1e-311
Identity = 536/762 (70.34%), Postives = 615/762 (80.71%), Query Frame = 0

Query: 413  MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSI-DGYNYLFTGSR 472
            MHRVGSAG+   S R RKEK+LTYVL+DAND+KH AGINCL V KSS+ +  +YLFTGSR
Sbjct: 1    MHRVGSAGSNGGSVRTRKEKKLTYVLNDANDTKHCAGINCLDVLKSSVSNDQSYLFTGSR 60

Query: 473  DGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTK 532
            DGTLKRW+ DEDA  CSATFESHVDWVNDA L G + L+SCSSD TVKTW+ L+DG CT+
Sbjct: 61   DGTLKRWAFDEDATFCSATFESHVDWVNDAALAGESTLVSCSSDTTVKTWDGLSDGVCTR 120

Query: 533  TLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIA 592
            TLRQHSDYVTCLA A KN+NVVASGGLGGEVFIWD+EAA +P +K +DA  D  SNG  A
Sbjct: 121  TLRQHSDYVTCLAVAAKNNNVVASGGLGGEVFIWDIEAALSPVTKPNDANEDSSSNG--A 180

Query: 593  SGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEK 652
            +G   PVTSL  + SSNNIS   + SHGY P  AKGHKESVYALAMND+GT+LVSGGTEK
Sbjct: 181  NG---PVTSLRTVGSSNNISVQSSPSHGYTPTIAKGHKESVYALAMNDTGTMLVSGGTEK 240

Query: 653  VVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAV 712
            V+RVWDPRTGSK+MKLRGHTDN+R LLLDSTGRFCLSGSSDSMIRLWDLGQQRC+H+YAV
Sbjct: 241  VLRVWDPRTGSKSMKLRGHTDNVRVLLLDSTGRFCLSGSSDSMIRLWDLGQQRCLHTYAV 300

Query: 713  HTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVAT 772
            HTDSVWALA  PSFS+VYSGGRD  LY+TDL+TRES+LLCT E+PIQQLA+ D +IWVAT
Sbjct: 301  HTDSVWALACNPSFSHVYSGGRDQCLYLTDLATRESVLLCTKEHPIQQLALQDNSIWVAT 360

Query: 773  TDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPA 832
            TDSSV RWPAE ++P   F+RGGSF+AGNLSF+RAR SLEG  P P YKEP+ T+ G   
Sbjct: 361  TDSSVERWPAEVQSPKTVFQRGGSFLAGNLSFNRARVSLEGLNPAPAYKEPSITVPGTHP 420

Query: 833  IVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTV 892
            IVQHEILNN+R ILTKDAAGSVKLW++TRG+V+EDYGK+S+EEKKEELFEMVSIP+WFTV
Sbjct: 421  IVQHEILNNKRQILTKDAAGSVKLWDITRGVVVEDYGKISFEEKKEELFEMVSIPSWFTV 480

Query: 893  DTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFG 952
            DTRLG LSVHL+TPQCFSAEMYSADL ++G+PEDDK+NLARE LKGL+ HWLAK+K +  
Sbjct: 481  DTRLGCLSVHLETPQCFSAEMYSADLKVSGRPEDDKINLARETLKGLLGHWLAKKKHKPK 540

Query: 953  SQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGG 1012
             Q  A+G+ +S KD + ++LS S+ E  + +A +D  VY PFEFS+VSPPSIITEGSQGG
Sbjct: 541  PQVLASGDTLSVKD-TKKNLSASKTE--ESSAASDP-VYPPFEFSSVSPPSIITEGSQGG 600

Query: 1013 PWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT---------NGFEEQ-------- 1072
            PWRK ITEF GTEDEKDFP WCLD VLNNRLPPRENT          G   Q        
Sbjct: 601  PWRKKITEFTGTEDEKDFPLWCLDAVLNNRLPPRENTKLSFFLHPCEGSSVQVVTLGKLS 660

Query: 1073 --------WVVNYVVEKMVLDKPLDNVNPD---ISFGPGLSSTVGDGSFRSGLKPWQKLK 1132
                     V NYVVEKMVLD PLD++  D   +S G       G+G  +SGLKPWQKL+
Sbjct: 661  APRILRVHKVTNYVVEKMVLDNPLDSLAIDAASVSGGQPQPLFSGNGLLQSGLKPWQKLR 720

Query: 1133 PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            PSIEILCN+QVLSPDMSLATVR Y+WKKPEDL+LNYR+   R
Sbjct: 721  PSIEILCNSQVLSPDMSLATVRAYVWKKPEDLILNYRVAIAR 753

BLAST of CmaCh12G011040 vs. TAIR 10
Match: AT3G05090.2 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 1065.4 bits (2754), Expect = 3.1e-311
Identity = 536/762 (70.34%), Postives = 615/762 (80.71%), Query Frame = 0

Query: 413  MHRVGSAGNTSNSSRPRKEKRLTYVLSDANDSKHSAGINCLAVPKSSI-DGYNYLFTGSR 472
            MHRVGSAG+   S R RKEK+LTYVL+DAND+KH AGINCL V KSS+ +  +YLFTGSR
Sbjct: 1    MHRVGSAGSNGGSVRTRKEKKLTYVLNDANDTKHCAGINCLDVLKSSVSNDQSYLFTGSR 60

Query: 473  DGTLKRWSLDEDAASCSATFESHVDWVNDAVLVGNNRLISCSSDGTVKTWNCLTDGGCTK 532
            DGTLKRW+ DEDA  CSATFESHVDWVNDA L G + L+SCSSD TVKTW+ L+DG CT+
Sbjct: 61   DGTLKRWAFDEDATFCSATFESHVDWVNDAALAGESTLVSCSSDTTVKTWDGLSDGVCTR 120

Query: 533  TLRQHSDYVTCLAAAEKNSNVVASGGLGGEVFIWDLEAAYAPSSKSSDATGDECSNGIIA 592
            TLRQHSDYVTCLA A KN+NVVASGGLGGEVFIWD+EAA +P +K +DA  D  SNG  A
Sbjct: 121  TLRQHSDYVTCLAVAAKNNNVVASGGLGGEVFIWDIEAALSPVTKPNDANEDSSSNG--A 180

Query: 593  SGNSLPVTSLHAINSSNNISTHPNHSHGYVPIAAKGHKESVYALAMNDSGTLLVSGGTEK 652
            +G   PVTSL  + SSNNIS   + SHGY P  AKGHKESVYALAMND+GT+LVSGGTEK
Sbjct: 181  NG---PVTSLRTVGSSNNISVQSSPSHGYTPTIAKGHKESVYALAMNDTGTMLVSGGTEK 240

Query: 653  VVRVWDPRTGSKTMKLRGHTDNIRALLLDSTGRFCLSGSSDSMIRLWDLGQQRCVHSYAV 712
            V+RVWDPRTGSK+MKLRGHTDN+R LLLDSTGRFCLSGSSDSMIRLWDLGQQRC+H+YAV
Sbjct: 241  VLRVWDPRTGSKSMKLRGHTDNVRVLLLDSTGRFCLSGSSDSMIRLWDLGQQRCLHTYAV 300

Query: 713  HTDSVWALASTPSFSYVYSGGRDLSLYVTDLSTRESLLLCTGEYPIQQLAIHDENIWVAT 772
            HTDSVWALA  PSFS+VYSGGRD  LY+TDL+TRES+LLCT E+PIQQLA+ D +IWVAT
Sbjct: 301  HTDSVWALACNPSFSHVYSGGRDQCLYLTDLATRESVLLCTKEHPIQQLALQDNSIWVAT 360

Query: 773  TDSSVHRWPAEGRNPHKAFERGGSFVAGNLSFSRARASLEGSTPVPVYKEPTFTISGAPA 832
            TDSSV RWPAE ++P   F+RGGSF+AGNLSF+RAR SLEG  P P YKEP+ T+ G   
Sbjct: 361  TDSSVERWPAEVQSPKTVFQRGGSFLAGNLSFNRARVSLEGLNPAPAYKEPSITVPGTHP 420

Query: 833  IVQHEILNNRRHILTKDAAGSVKLWEVTRGIVIEDYGKVSYEEKKEELFEMVSIPAWFTV 892
            IVQHEILNN+R ILTKDAAGSVKLW++TRG+V+EDYGK+S+EEKKEELFEMVSIP+WFTV
Sbjct: 421  IVQHEILNNKRQILTKDAAGSVKLWDITRGVVVEDYGKISFEEKKEELFEMVSIPSWFTV 480

Query: 893  DTRLGSLSVHLDTPQCFSAEMYSADLNITGKPEDDKVNLARENLKGLMAHWLAKRKQRFG 952
            DTRLG LSVHL+TPQCFSAEMYSADL ++G+PEDDK+NLARE LKGL+ HWLAK+K +  
Sbjct: 481  DTRLGCLSVHLETPQCFSAEMYSADLKVSGRPEDDKINLARETLKGLLGHWLAKKKHKPK 540

Query: 953  SQASANGEVISSKDISARSLSHSRLEAVDGNAENDSMVYSPFEFSTVSPPSIITEGSQGG 1012
             Q  A+G+ +S KD + ++LS S+ E  + +A +D  VY PFEFS+VSPPSIITEGSQGG
Sbjct: 541  PQVLASGDTLSVKD-TKKNLSASKTE--ESSAASDP-VYPPFEFSSVSPPSIITEGSQGG 600

Query: 1013 PWRKNITEFDGTEDEKDFPWWCLDCVLNNRLPPRENT---------NGFEEQ-------- 1072
            PWRK ITEF GTEDEKDFP WCLD VLNNRLPPRENT          G   Q        
Sbjct: 601  PWRKKITEFTGTEDEKDFPLWCLDAVLNNRLPPRENTKLSFFLHPCEGSSVQVVTLGKLS 660

Query: 1073 --------WVVNYVVEKMVLDKPLDNVNPD---ISFGPGLSSTVGDGSFRSGLKPWQKLK 1132
                     V NYVVEKMVLD PLD++  D   +S G       G+G  +SGLKPWQKL+
Sbjct: 661  APRILRVHKVTNYVVEKMVLDNPLDSLAIDAASVSGGQPQPLFSGNGLLQSGLKPWQKLR 720

Query: 1133 PSIEILCNNQVLSPDMSLATVRNYIWKKPEDLVLNYRMLQGR 1146
            PSIEILCN+QVLSPDMSLATVR Y+WKKPEDL+LNYR+   R
Sbjct: 721  PSIEILCNSQVLSPDMSLATVRAYVWKKPEDLILNYRVAIAR 753

BLAST of CmaCh12G011040 vs. TAIR 10
Match: AT4G14880.1 (O-acetylserine (thiol) lyase (OAS-TL) isoform A1 )

HSP 1 Score: 447.2 bits (1149), Expect = 4.0e-125
Identity = 233/306 (76.14%), Postives = 255/306 (83.33%), Query Frame = 0

Query: 63  SSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAEENGL 122
           S IAKDVTELIG TPLVYLN V +GCV RVAAKLEMMEPCSSVKDRIG+SMISDAE+ GL
Sbjct: 3   SRIAKDVTELIGNTPLVYLNNVAEGCVGRVAAKLEMMEPCSSVKDRIGFSMISDAEKKGL 62

Query: 123 ITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELILTDPA 182
           I PG+S LIEPTSGNTG+GLAF AAAKGYKLIITMPASMS ERRIIL AFG EL+LTDPA
Sbjct: 63  IKPGESVLIEPTSGNTGVGLAFTAAAKGYKLIITMPASMSTERRIILLAFGVELVLTDPA 122

Query: 183 RGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVSGIGT 242
           +GMKGA+ KAEEI AKTP+ Y+LQQFENPANPKIHYETTGPEIW G+GGK+D  VSGIGT
Sbjct: 123 KGMKGAIAKAEEILAKTPNGYMLQQFENPANPKIHYETTGPEIWKGTGGKIDGFVSGIGT 182

Query: 243 GGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVPLLDE 302
           GGTITGAG+YLKEQN ++KLYGVEPVESAILSGGKPGPHKIQGIGAGFIP VL+V L+DE
Sbjct: 183 GGTITGAGKYLKEQNANVKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPSVLNVDLIDE 242

Query: 303 VVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP-----F 344
           VVQ                    VGISSGAAAAAAIK+A+RPENAGKL V   P     +
Sbjct: 243 VVQVSSDESIDMARQLALKEGLLVGISSGAAAAAAIKLAQRPENAGKLFVAIFPSFGERY 302

BLAST of CmaCh12G011040 vs. TAIR 10
Match: AT4G14880.2 (O-acetylserine (thiol) lyase (OAS-TL) isoform A1 )

HSP 1 Score: 447.2 bits (1149), Expect = 4.0e-125
Identity = 233/306 (76.14%), Postives = 255/306 (83.33%), Query Frame = 0

Query: 63  SSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAEENGL 122
           S IAKDVTELIG TPLVYLN V +GCV RVAAKLEMMEPCSSVKDRIG+SMISDAE+ GL
Sbjct: 3   SRIAKDVTELIGNTPLVYLNNVAEGCVGRVAAKLEMMEPCSSVKDRIGFSMISDAEKKGL 62

Query: 123 ITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELILTDPA 182
           I PG+S LIEPTSGNTG+GLAF AAAKGYKLIITMPASMS ERRIIL AFG EL+LTDPA
Sbjct: 63  IKPGESVLIEPTSGNTGVGLAFTAAAKGYKLIITMPASMSTERRIILLAFGVELVLTDPA 122

Query: 183 RGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVSGIGT 242
           +GMKGA+ KAEEI AKTP+ Y+LQQFENPANPKIHYETTGPEIW G+GGK+D  VSGIGT
Sbjct: 123 KGMKGAIAKAEEILAKTPNGYMLQQFENPANPKIHYETTGPEIWKGTGGKIDGFVSGIGT 182

Query: 243 GGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVPLLDE 302
           GGTITGAG+YLKEQN ++KLYGVEPVESAILSGGKPGPHKIQGIGAGFIP VL+V L+DE
Sbjct: 183 GGTITGAGKYLKEQNANVKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPSVLNVDLIDE 242

Query: 303 VVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP-----F 344
           VVQ                    VGISSGAAAAAAIK+A+RPENAGKL V   P     +
Sbjct: 243 VVQVSSDESIDMARQLALKEGLLVGISSGAAAAAAIKLAQRPENAGKLFVAIFPSFGERY 302

BLAST of CmaCh12G011040 vs. TAIR 10
Match: AT4G14880.3 (O-acetylserine (thiol) lyase (OAS-TL) isoform A1 )

HSP 1 Score: 447.2 bits (1149), Expect = 4.0e-125
Identity = 233/306 (76.14%), Postives = 255/306 (83.33%), Query Frame = 0

Query: 63  SSIAKDVTELIGKTPLVYLNRVVDGCVARVAAKLEMMEPCSSVKDRIGYSMISDAEENGL 122
           S IAKDVTELIG TPLVYLN V +GCV RVAAKLEMMEPCSSVKDRIG+SMISDAE+ GL
Sbjct: 3   SRIAKDVTELIGNTPLVYLNNVAEGCVGRVAAKLEMMEPCSSVKDRIGFSMISDAEKKGL 62

Query: 123 ITPGKSALIEPTSGNTGIGLAFIAAAKGYKLIITMPASMSLERRIILRAFGAELILTDPA 182
           I PG+S LIEPTSGNTG+GLAF AAAKGYKLIITMPASMS ERRIIL AFG EL+LTDPA
Sbjct: 63  IKPGESVLIEPTSGNTGVGLAFTAAAKGYKLIITMPASMSTERRIILLAFGVELVLTDPA 122

Query: 183 RGMKGAVQKAEEIKAKTPDSYILQQFENPANPKIHYETTGPEIWSGSGGKVDALVSGIGT 242
           +GMKGA+ KAEEI AKTP+ Y+LQQFENPANPKIHYETTGPEIW G+GGK+D  VSGIGT
Sbjct: 123 KGMKGAIAKAEEILAKTPNGYMLQQFENPANPKIHYETTGPEIWKGTGGKIDGFVSGIGT 182

Query: 243 GGTITGAGRYLKEQNPSIKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPGVLDVPLLDE 302
           GGTITGAG+YLKEQN ++KLYGVEPVESAILSGGKPGPHKIQGIGAGFIP VL+V L+DE
Sbjct: 183 GGTITGAGKYLKEQNANVKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPSVLNVDLIDE 242

Query: 303 VVQ--------------------VGISSGAAAAAAIKVAKRPENAGKLIVVSVP-----F 344
           VVQ                    VGISSGAAAAAAIK+A+RPENAGKL V   P     +
Sbjct: 243 VVQVSSDESIDMARQLALKEGLLVGISSGAAAAAAIKLAQRPENAGKLFVAIFPSFGERY 302

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q433172.0e-13782.96Cysteine synthase OS=Citrullus lanatus OX=3654 PE=1 SV=1[more]
Q008346.2e-13178.78Cysteine synthase OS=Spinacia oleracea OX=3562 PE=1 SV=1[more]
O811545.2e-13078.46Cysteine synthase OS=Solanum tuberosum OX=4113 PE=2 SV=1[more]
Q9XEA61.6e-12677.05Cysteine synthase OS=Oryza sativa subsp. japonica OX=39947 GN=RCS1 PE=2 SV=2[more]
P806083.5e-12675.56Cysteine synthase OS=Zea mays OX=4577 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A5J5T7800.0e+0071.09Cysteine synthase OS=Gossypium barbadense OX=3634 GN=ES319_A13G191000v1 PE=3 SV=... [more]
A0A6J1HP490.0e+0095.67WD repeat-containing protein 48-like isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1HKR10.0e+0095.78WD repeat-containing protein 48-like isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1FHE90.0e+0094.75WD repeat-containing protein 48-like isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1FAZ80.0e+0094.85WD repeat-containing protein 48-like isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
KAG6586292.10.0e+0088.70hypothetical protein SDJN03_19025, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAB2049639.10.0e+0071.09hypothetical protein ES319_A13G191000v1 [Gossypium barbadense][more]
XP_022965590.10.0e+0095.67WD repeat-containing protein 48-like isoform X1 [Cucurbita maxima] >XP_022965591... [more]
XP_022965592.10.0e+0095.78WD repeat-containing protein 48-like isoform X2 [Cucurbita maxima][more]
XP_022937659.10.0e+0094.75WD repeat-containing protein 48-like isoform X1 [Cucurbita moschata] >XP_0229376... [more]
Match NameE-valueIdentityDescription
AT3G05090.13.1e-31170.34Transducin/WD40 repeat-like superfamily protein [more]
AT3G05090.23.1e-31170.34Transducin/WD40 repeat-like superfamily protein [more]
AT4G14880.14.0e-12576.14O-acetylserine (thiol) lyase (OAS-TL) isoform A1 [more]
AT4G14880.24.0e-12576.14O-acetylserine (thiol) lyase (OAS-TL) isoform A1 [more]
AT4G14880.34.0e-12576.14O-acetylserine (thiol) lyase (OAS-TL) isoform A1 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR020472G-protein beta WD-40 repeatPRINTSPR00320GPROTEINBRPTcoord: 509..523
score: 37.66
coord: 686..700
score: 38.53
coord: 466..480
score: 37.43
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 660..699
e-value: 1.9E-7
score: 40.8
coord: 702..741
e-value: 4.3E-4
score: 29.6
coord: 484..522
e-value: 2.4E-5
score: 33.8
coord: 431..479
e-value: 0.054
score: 22.6
coord: 526..566
e-value: 7.4E-6
score: 35.5
coord: 618..657
e-value: 9.6E-5
score: 31.8
IPR001680WD40 repeatPFAMPF00400WD40coord: 663..699
e-value: 2.5E-4
score: 21.8
coord: 486..522
e-value: 0.084
score: 13.8
coord: 528..566
e-value: 0.0062
score: 17.4
coord: 626..657
e-value: 0.0011
score: 19.7
coord: 704..738
e-value: 0.15
score: 13.0
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 533..575
score: 9.906923
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 625..666
score: 15.019908
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 667..708
score: 14.652308
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 600..886
e-value: 3.0E-40
score: 140.1
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 401..595
e-value: 1.3E-25
score: 92.4
IPR001926Pyridoxal-phosphate dependent enzymePFAMPF00291PALPcoord: 70..322
e-value: 2.6E-55
score: 187.9
IPR036052Tryptophan synthase beta subunit-like PLP-dependent enzymeGENE3D3.40.50.1100coord: 71..305
e-value: 5.8E-106
score: 355.9
IPR036052Tryptophan synthase beta subunit-like PLP-dependent enzymeGENE3D3.40.50.1100coord: 107..209
e-value: 5.8E-106
score: 355.9
IPR036052Tryptophan synthase beta subunit-like PLP-dependent enzymeGENE3D3.40.50.1100coord: 306..344
e-value: 1.3E-5
score: 26.9
IPR036052Tryptophan synthase beta subunit-like PLP-dependent enzymeSUPERFAMILY53686Tryptophan synthase beta subunit-like PLP-dependent enzymescoord: 65..336
IPR005859Cysteine synthase CysKTIGRFAMTIGR01139TIGR01139coord: 70..306
e-value: 1.4E-110
score: 367.1
IPR005856Cysteine synthaseTIGRFAMTIGR01136TIGR01136coord: 70..306
e-value: 1.3E-105
score: 350.9
IPR024935Rubredoxin domainPFAMPF00301Rubredoxincoord: 4..41
e-value: 9.9E-11
score: 41.5
IPR024935Rubredoxin domainCDDcd00730rubredoxincoord: 3..40
e-value: 3.55864E-12
score: 60.0122
IPR021772WDR48/Bun107PFAMPF11816DUF3337coord: 1057..1141
e-value: 2.6E-12
score: 47.0
NoneNo IPR availableGENE3D2.20.28.10coord: 1..44
e-value: 1.0E-11
score: 46.3
NoneNo IPR availablePIRSRPIRSR038945-3PIRSR038945-3coord: 69..256
e-value: 1.1E-5
score: 22.7
NoneNo IPR availablePANTHERPTHR19862:SF18GUANINE NUCLEOTIDE-BINDING PROTEIN, BETA SUBUNIT-RELATEDcoord: 1055..1114
coord: 413..1049
NoneNo IPR availablePANTHERPTHR19862WD REPEAT-CONTAINING PROTEIN 48coord: 1055..1114
coord: 413..1049
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 667..700
score: 11.444681
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 625..657
score: 11.550137
NoneNo IPR availableCDDcd01561CBS_likecoord: 74..336
e-value: 2.70069E-121
score: 373.387
NoneNo IPR availableCDDcd00200WD40coord: 446..779
e-value: 7.26981E-55
score: 191.009
NoneNo IPR availableSUPERFAMILY57802Rubredoxin-likecoord: 3..41
IPR002052DNA methylase, N-6 adenine-specific, conserved sitePROSITEPS00092N6_MTASEcoord: 12..18
IPR001216Cysteine synthase/cystathionine beta-synthase, pyridoxal-phosphate attachment sitePROSITEPS00901CYS_SYNTHASEcoord: 95..113
IPR018527Rubredoxin, iron-binding sitePROSITEPS00202RUBREDOXINcoord: 29..39
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 686..700
IPR024934Rubredoxin-like domainPROSITEPS50903RUBREDOXIN_LIKEcoord: 3..52
score: 9.027646
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 445..862

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh12G011040.1CmaCh12G011040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006535 cysteine biosynthetic process from serine
biological_process GO:0000724 double-strand break repair via homologous recombination
biological_process GO:0032259 methylation
biological_process GO:0016579 protein deubiquitination
molecular_function GO:0004124 cysteine synthase activity
molecular_function GO:0005506 iron ion binding
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0043130 ubiquitin binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0005515 protein binding