Cp4.1LG12g05070 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG12g05070
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionDRBM domain-containing protein
LocationCp4.1LG12: 4995937 .. 5014714 (+)
RNA-Seq ExpressionCp4.1LG12g05070
SyntenyCp4.1LG12g05070
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTATACATCACAAGCACATTTTTTAAACTCATATATCAGACGACGTAATTTATTGACGGAATCACAAGCAGTTGACATGCAGTGTTCGGTCCACTCACACAGTGCGCAGTCACCTCCTCCGCCTTCCGGACACTTGCAGACTGATTCCTGTACTCTCGAACTTCGCATTAAAACTTTCATGGCTGATCGCCCATCCTCTCAACTACTCCCCCTCCTCTCCGAACCCTAATCTTCGCGCTTCAGCTTCCATTCGAGGTAATGCGAACTTCTTCCGCCTTTTTACTTGAACTTTTCCTGAGGTTCGATGTGGTGAAGGTTCGTCTACTGGAACTCGGCCGTTTTTGGGCTGTTTCTTCTCTTTGTTTTGGTATAACATGCACGAGGATCAGATAAAAGTTTGTCGCTCAATGTGTAGTTAATTTGTGGTTAAGGTCTTTCTTGAATTGGAGTTTCTCCGAGTGTGGTTGAGTCATGGTTCATTCGGCCTTGTGTTTATTTGAATTTATGAGGAAGTAGGATACGGCGATGGTTTTTCTTCCTGAACTCACCCTATTTTGAGGCATTTTTTGCTTTTGGACATCGAAATTTTTTCTTCTGTTTCTTCTGTGTAACATGCATTAGGATCAGGTAAAATTTTCCGTTCAACGTGTTATTAGTTTGTTATGGAAGTGAATAGATTATGTGAACGTTAGTTTATTGAAGGTAGAATCGTTATGGAATTGGAACTTTATAGAGGTAAGCAGTTTTGTTTGTAGTAAGATAATGTACTTGGTTTGTAGCAAGATAATGTANTAAGGTGTGATTGAGGCACAGGTCGTTCTACTAATCTGATGGACAACTTTGTGAATTTTAAACACACAGTATTGGAAATCTCTTAGAATGTTTTGGGTCTCTTTGACAATGTTAATATTTTCTGGCTTCTGTTTTTGAAACATTGGCTCTCTTTAATACTTTCGTATCTTTTGTTGGATGACTTTGAGGAGACCCGGAACTTCATATTCTCTAGTCCTTAGTCTTAGCAATGTCTCCAAAATTGTGCATAAAAGAAGTAGAAATTGTAAGTCTCTTGTTTTCTATATTGTTAAAGACAAATCTTCAGCTAATTGAAGCTTCCAGATGCTGTTCTTCGCCTAAATTCATCTGGTGGGCTTTCCTGCAATGATTTTTAACAACTTTCAGCTGTGTTTTTCCTTGTATTTCTTTTCTAGATTTTCATTTATGCGTTGCTTCTTTTTCTCAGTCTAAACTCGGTAGACTTTGACTTGCTCTACTCCCTTAGCATTTTTCAAGCATTAATGAAAAGTTGTGCCTTTTTGTTTATGAATCTGTCTTTTATCTTATTCTTCTTGTTTTTATATATATATATTTTGTTTTTCGGTTAGTTACATTTTAATGCTACAAAAATAAAAGGATAATTCTCTGTTTTCTATCAAGCATGTATTATTACTGGCAATGTTTAAGGGAATTTGGTTAAATTTTGTTCCTCTACTCGGTGCTCCCTCTACAATGAGTTGTCCTGTTGTGTTTTTCTTTTTGTAGATTTACGAGTCTTTTCTGTAATTCCCTTGGGTTTTCAGGAATTTTTTATCTCCTCATTTTGTACATCCTCTTTTGATTAATACCCACATGTTTCCTACATATATAAGAAAAGGAATATTTTTCTTGTTTTCTTGTTGAAATTGAATCAGCGAGTGTTTTATTTTTATTTTTATTTTTTGATTTAAAGATTACAAAGTAAAGAAACAACTTATAGTTATCGAATAAAAAAGTACATATATAAGTAAAAAATGAAAAATAAAGAAAATTAAACACAAGAAACTAAAACTAAGAAATATATGGCTGATGAAGATTTGAACACGTTACTTTACCATGACATATTCTCAAAGAATAAAAAGGAAAAACTACGATATGTTGTCAACCATTTGAAGCAACAGTAGTCAAAATGGAACAACTAGAAGGGTTTCTTCACTTTCACTTCCACACACTTGGTACAACATCAACGTGAACCTGATTTGAGTAATTTCTTCATGAAGAGTTAGTTATACAGTATGATTACTGGTAATCAAGCTGCTAAAGTATACACATGAATTGATTTTTTTACTCTGTGCCAGCAGTCTGTCTATTCTGACAAGGATCATAAAGAACTGAAAAATTGATTGTTATTTTCTTGTTGGCAAATCTTAATTTTTCTTTGTCTCGTGGCTATGAAAATGGACAATTATTGTTTATTGGATGGAAAAGAGTTTCTGACGTGTTAATTCAGCTGTTCATCTGGCAGTTCTTTTCGTGGGCTCCCCAATTTTCATTCCAAGAAGGAAGTTTGAATCCTTTGTTATTTTCCTCCGTTGATGGAAAGGTCTTGCTCCTGATTGTGGGAAGAGGATATCATATATAAGAAACTGAAAAAGCAGTAGTGGTAAAAAAAACACCCTTACCATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATACTTGCATTATTGGATTATTTAGTTGAACCTATGCTTCCTTCAAAGTCATCTTCGATAGAAAATCCACCACTAGCTCTACTGCAATCAGTTGCAAAACAGGTACTTCTGAAATTGACTTTCGTGTGTTTAATATTTTAATTCACTTTCATTCTTGAAATATTCGAACAATTATAATTAGGTTAATAATTTATTTTGGTTTCGGACTGGGAGATTGGAGTTGATTTACTGGAAACATAAAAGTTGTACAAGCAGCCGTCTGTGCTTTAATATCCATCATCTTTTGAAAGTACCAAAAGATTATGATAATACTATTACTTGTGCTTAAAAAAATTTAATTTCTACAGTGCTCTCTTCATTGACTAACCGAATGCTTTAATTTTTTCTAATCTTTTGAGTGTACCAGCGTCAAGCATAAAACTACCTATGTCAGTAAATATATACTTGTCCAAAGAACTCTCTCTCACTATCTTACACACACACAAATGCACACGCACATTCACACAGACATGATAACTGAAAGAAGGCACATAGTGAGAGGAGGGATATTTGGGGGCTATGTATCCAGGATTCAGCTATACTCCATAACAATAGAAATTACAAGGAATGTCCATTCATTAAATTTATGTGAGCAAATTGATTAACATATTTTCCAATATTTGAAACAACTGCAAATAAAAAGTAATTCATGGCACTTCTGCTGCTTTTGCTAGTCTATGATTTATTTGATTTTTCTCATTTACACGTTTTAGTTTACCACCATTCTTCTTGCTGATAAACTTTGGATTTTCCTATATTTAATAGCACCCGAAATTGACTAGTACTGTTCATATGATCAACACGGAATAGGTTTAAAGTAGGTTAGTTTTAACTATATGTGGCAGCTTATTTCTCACTACTCCTTCCCATAAGAAAAAAAATTGTTAAGAGAGAATAAAAGTGTCATCTATATTTAAAATCGTTATTGTTAAAAAATAATGTTATCTAACTTAGTGTTTTTCCTTTGAATGTACTATTAGATGCATGCCGTTGTTTTGTTATACAACTACTACCACAGGAAACAACACCCACATCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAATTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGCTCTGATGATATTGAATTGGAAAATCCCGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCATGTGGTTTAGCCACTTGTCTATATACATCAAAAGATGAAAATATAGAGGGCTGGCCCCTTTCCAAGGTTGCCGTTTTTTTGATCGACTCCAAGAAGGAGCATTGCCATTTGCTGTTTAGTTCCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAAATTTAGATACCTCTGAATGCCAACCAAAAAGCGTGGAAGAGGAAAAACATGTAAATAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGTTGGAACTAAGACTCAGCAGCTTGCATATTCAGCAGTCAAGGAAGCAACTGGTATGTATTCTACTTTGTTAACTCATCGATAGTTCAATGGCTATATCTTGTTAAGTATACCAAGGAGAACTGCTTTTATTTCCTACCTTTTTATATTGAATGAGATGTCCCCCTTTGTTCGTGTTAGATTCAAAGATATGTTTTAGAATTCTAATGACTTATTGCAGAGATGATTGTTTCATATATATTTTCTTCCTCAATTAAGGATCATGTATGGTTAAGTATTAGGAAGTGAATTGTAACTCGAATCACATGTCGCCACGTACACTCAAGGGTAAAGAGGGCGGCATTAAAATTTAAATATGCATTCAAAACCAACATAAAAGTAGTGTAAAGGTAATATTTGCGGAGTTAAATCAAAACGGCTTACAAAAAATGCATAAGTTTTGGGAAGTAGCATTTAAAATAATAATAATATGATAAGAAGGAAGATGACTCGATCTAAAGAGCACCCCCTGCAGTTGCACGGCTCTGCACGCTCCCACCATCAACAATGATCTACACTCCACCTAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAGAATAACATGGGATGAGTATAAAAATACCCAGTAAGCAACCTACTTGTAGATTTTTTATTTATTTTTTTGAAAAAAATCTTTTATTTATTTTCTTAAAATTCCAGGTGTTACATGAATATATTGATTGTCCTTTTCTTTTGTTTTTTCTGAATTGTATCAATATCTAATTATGGTATTGGTGAGATGGTCTGATGGTGCCTCATCATATTGGTTGCATTTTGTGTTCTTTCTATTCTCTTTTGTACTTACCTCAGGAAAAAGGGAAGAATTTGGATTGCTTAAGATATGTATTTGTTTTGGAACAAGAAAAAACTCCTCATTGATTATGAAAAAGAGTAAATAATGTTCAAGGAATCCAAAATCCTTAAGTGAGTGAAAAAAAAAAAAAAAAAAAAGACAAACAACAAATACAACCATATAGAGATTAGCCATGAAAATAAATAAACATTTTAAGAACAGAGAAAATTTTCCAATATCTTTGCCTGAAAGAATGATATTGTTTGGAGTGTATGTACGAAAATAAACTAAACTTTCTAAGAGAAATATATTATTTAGAACTTGGATCCATCTATCATGACGTTGTTAGGCCAATCTTGAAAACATTGTTTGGAGTAAAGGGCTTCTTTACTTGTTATTACTGTTTTAGGAATTTCGTCTCATAACGTCTTGGAAGAATAAACTTAGTTAACAAGGAACATAATATCATATTGCCTTGCTTGTAAATTTAGTCAACAAGAACTTCCTTGCTTGTAATTACTACTTTAGGAATTCAGTCTTATAATCTTGCTACCCCCCTCCACCCACACACACTTACCCCCCATTGGCCTTATTCAGACATTGTTTTCTTTTCTTTTGAGGCCTTATTGATGCAAAGTTGATAAGATTGCCTAATGGATATGGAACGTTCTATGGGGATGTGGGGATGCTTGTTTGCAGGCATGATTTGGGACAATATCAGGAGGCTCTTGAAGCAAATGGCATGTATGTTGTCATGCTGGTTTATGATTGAAGAGGTGTAGTCTTGTTGATGTAGAATTAAGGGAAATTCAGAATTGTATTCAATTCAAGGTAATGATACAAACTGGAAGCTGACCAATTATAGGTCACAAGTCAGCAAACAAATCAAAACTAACCCAAGGCTTCCTAACTAATTGACCAATTCGGTTGAGGGGGTAAATAAGAGAAAACTTGGGTAAACTAAAGAAACTAAAGAGTACAAGAATGGCAGATCTAGTGAGAATTAAAAAAAGATGCAAGAACTATGAAGAAACTCAAATTAAATATCTTAAAAAGTACTTCCAAAGGGATTATGATTAATGGAACTTAAATCTTCAAATATTTTACAGAGAAACCTCCCTTCACACGATTTATCACCCTCAATTTGTCCTTTTTGTTTCTTGGATGCGGATTGTCTTCTGCATATTTTCTTTGGTTGTTCCTATTTGCAGTTATGTTGGTTTAAGTTGTTTCCGAACATTTCTCATTAGCCCAAAATTGGCTCCGAAATCTATGCTTTAGTGGGGTAATGCAGTAAAGCCTTTCTTTTTTCCTTGTCAAAATTATGTTTGAAAGAAACCAAAGGCTGTTTCACATTAAGCATTTCCTTTGGTTAGATCATTTTGAGTCAGCTCATCTGCTTCATCATGGTGTTCTCTTTCCAAATATTTTGTTAATTTACATTTTCAGGATATCACTCTTAATTGGAATGCTTGTATATACCCTTTGTAGATTTCTTTATTTGAGTCTTACTATTTCTTATTGCTTTTATTCCCTCCTTGGAGTTTGCATCCATTAAGCATTAGGCTCTTTTCATTACATCCTTTTCGTATCTCTAGATTTTGGGAGGGTACAAATTATATTGCTATATGTTAAAACTGTGCAACTTGGGTCTTCTTGTGTAGAAGAGAGACCTATTACATACATGGAAAGAATAATGCAATTAAGACTAAATATTGACCACTAATGTTTACACTTAAATAGTAGAATTCCAAGACTTCCCTTCAGGTTGGTGTAAAGATACCTTCCATAGTTAGCTTGCCAATTACTTTGTCACATTACTTCTTTGGGAGTCCTTTAATTGATACATCAACAATTTGCTGTGACGTTGGAAGGTAAGGGATACATATTACTTCTGCATCAATCTTCTTTTTAATAAAATGCTTATCCACCTTAACATTTTTGTTCTATCATGTAAACTGGATTATGAGCTATCAAAATGATAACTTTATTGTCACAACAAATCCGTATAGGCGTCTTTTGAAGGAACTTCAATTCTCCAAGGACCCTTTTTTGTCCGTATACCCCCACAAATTCCACGTGCTAAAGCTCTTGTATGCTTCAGTACTGCTTCTAACTACCACATTTTGTTTTTCTGCTCCGCCACATAATTAAGTTCCTTCCAACAATAGAACGATAGCCTGAGGTAGATCTTCTATCAGTAGTACTTTTTACCCAATCTGCATTAGTATAAGAGGGTGCTTTTTGAATAAGATACCTTTCCCATGAGTTTCTGTAAGTCAAAATTTCAACTTCAAAATGAATTGGTCCATGCATACAACATGTGTAATATCAGGATGGGTATGGGATAGATACACTAACCCTTCAACAAGTTTTTGGTATCGCTCCTTGTCCTTTATTTCTTCTGCTTTTGCAACTTGTAATTTTAGATTGAGTTTAATGGGGGCTTTCTGCAACCTTACGTTCAAGTAGTCCAGTTTCTTCAAGTAAGTCAAAGACATATTTTCTTTGATTACAAAGATACCCTTTTTAGATGTTGCAAATTCCATTTCTAAGAAATATTTTAATGTTCTCAAATCCTTGATCTGAATAATGTCAACTAAGTATCATAAGTAAATTATTCCCGATTGTTCAATCATTTGATAAACAGGGCCTTTCACTCTTGATGGTAAGGTTTTTTCTTTTTTCTTATTGTCTTTTCATCCAAANAAGGTGGAGAACTGAACCAAGAATTATCTTTCTTTATCATTAATCTAATCACAGTTTGAGACTAGGATTCTATTTTATTATCAATCCAATCAATGTTCACATTTTTTTCTTATCAATGGATATATCTCTTTACTTGTATGACTTAGATGTCTCGCATTTCTCAAAAAAGTGATTCGATTGATGGGATTTGGTACGATACTTGTGAGATTGATGAGATTGATATTCCGATCCTTCTTCTTCGAACATATTGATTTGATCCCGTAAGCAGGACCATCATCCCAATAACTTGTTGCCACCAAAAGGAGTACCCATATTTCTTCTAGAGAATCTCCTAATTGTTCCAAGGAAACTAGAAAGAGATTCTTTAACCGGAAAGAATTTAGTTTAGATATGGGATACTTATCTAGAAATCTTCGCAACTCAATCATGTACGATGGAATCATCAAAGACTTGACCTTTTCGAACTCTATTTGTAACTCACTATAGGCTTGAGAAACAAGGAGAAGATGTGTACGATTTCTCTAACAAACTTTTTCTTGGGTTCAATTATACTTGTTTCATCATTACCTATAAGAATAATATCATCCACTTAGAGGATCAATATTGAGATTTTATTATTCTTCTAATGTGATGAGGAAGGGGTTGGAGAAGAAGCTAAGTAATGAAATAGGGAAATTTAGAAGAAGGATCCCGACAGTTTGAGAGACCCCAATCATCAGGCTCTACAATCAGGAGAGAGAGCAACCGATTGAAGTAAAGACAAGAGAGCATCGTAAGGGCAGTTAAACGGATGATAAGGAATGAAAGAAACTCCAGGATTTCCCAAGACAAAGAAAGATCTCCCCAAGAAGTCCTTGACTCGTCCCCCCATATGGCTTTTTATTGGTGTAAAAATGTTCATCCTTATAGCAACTATAGTTTGAATTCTTTCATATCCAATTGTTGAAGTCTTTTGTAATCACTTTGGATGGAGCTTACCTTTTGTATCATTTCATTTAATCAGTGAAAGTATTTCCTATACAAAATAGTAAAGGCTCAGAAGCTACTTCTCTATCTCCTGGTCATGGATAGCCTGCCGAATGTGAAGTCATCTGAAGTGGTTTCTCCTCCTGAATTGAGGTCAAGAAGTTTTGGGGACCCGTATAAAGGATTCGACACTGAAAAGCAATAGGGGGCATAAGAAGCCAAAGATACTCCTAGAATCAAACGCTTGTATCATCTCCCACAATAGATGGAAGAACGAAAGGAAGCTTGGAAGGCAGAAGGGGATTTCCTTCCACAAGTTTCTAAAAGCGCCTATAGAAGCTAAATTGGAAACCTAGCTGTTGGGATGACACCTGTATTTGCTCACAATTTCCGAATGCCATAAAGTGTAAGGTTTTGAAGGGACCTGTCATGTCCATTATGAGGCAAAGCTTTTGTTGCCCAACTTTCCTATCCTCAAACTACCTCCTTATTAGGTTTCAACACCACACTCCAATTCACTGAATGATTTCCTATCCTCAAACCACTCCCTTGTTAGGTTGCCACAACATACTGCAATTCACTAAATGTGACCCACCTCCCTCCTCAACCCCTCCCTGCAGAAATTCCCATGTGACTTTTCCTTGATAGATCCATGTATGTCGAAAAGAAAGAAAATGCACGGGATTTCAGGATAGTCTTCCCCTTCAGAGAAGAAATCTTTTTTTTCCTTGAAGAAGCTCTTGTAAAAATTTCAATTACCTTGAATGAATTTTAAATCTTTCACCTCTGAAGATTAATTTTCTTTTGTAAATTTCATGCATCAATGAAATTTGTTTCTTATTAAAGAAATGTTTAAACTTAGTTCTTAAACGCAGATGCTAGTTCAATAAAGTTGTTCATTCAGTACCAATATTTTTAAAAATAAAATTCAATTAATCTAAAGTCCCCATTGAACATATTATTGGACTTGGTGTAGCCTGATGCAGCAATGGCGTGGAAGCAAAATTCATGGAGTCTCCTTTTTGTTGCTATTCACCCTTGGGGTCCTATATTATGGATATGTCTCTTATCTTGGTACTTACAGAGAGTTGCTAATCCTGTTTATTTTCATGCAAAACAGGCATTAATCAACACGATCTCAAAATTTTAGAAAGTCACGTTGCCTACTCTCTAAGTAAAGAAAAATCAGCAGTCTACTTTTATATGATGCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCCATAAAAGATGCCGTTGACAGGTTTCATTGTTTTCCTTTCACGCCTGACCCCTTCAATTCTGTGTCTTGGTTTGAATGCATGTTCCAGGAGCTCATGTGAAATGAGCATGGATGAAGGTCAGTCCTCAAATGCATAGATACCCTTTTTTGCTCTTGATTTTTCTCTTTTCCATGGTATTAGTTTGCAGGATTCGTTGTTTAAAAAAAATGGTAGGAGATGGAGCGTTACTTCAAAAGTTGAGTACTACCACATTCTTCCTTATGTGAAGATGGTGCTAACCTGGTTTCACAGGTATACTTTTCATTGCTATGGATATTATTTTTGATGTTGGTATTTATATATCGGTCTGCATAACTTGATACTTAAAGCCTGATTCTTGATCATTGTGACCTTGTTTATTTCTGCTTCCTGGTGCCGCTGTGTCCAGCCTCTTGGCTCTTGCTAGTTGCCAGTAATGTGAGGTTTAACTATTATTGGATATTAGTTGGAAAATAGTAGAAAGAAGACTGCCAACCCCATAAGCCTCTCGTTCAGTAAGGGGTTGAGGAGATCTAAGAAAGGAAGAATTATCAGAAGATGGCAAGAAAAAGCTACTATCTAAAAAATGATACAGATGAGGAAACAAGCCGCATGAGGTTTTTCCCCACTCAAAAATCCTCCTAAAATAGGTGTTCGACTCGTCTCCAATAGAACACTTAACAAATTGAGAGAATAGAGGATCCTCCACCAGAGAGTATTAGGCTCCTTGTGGAAACGCTACAACAGTTTTGCCAAAAGAGTCTCATTACATAATTACAAAAACATCAGACATTAGCAGAAGTGGTTGGTAGAGAAGTCTCCCAGAGGTGGAGATAGAGATTGAACGAAGGTGGGGAGTGAGAATTGGGGGAGGGGAAGGAGTTTTCTAGAGTTGGTGTTTATTTTTTTGTTTTTTTTTTTTTCATTACAAACTTTTTTAGAAATTTTAATTTGTGGGCTTTTAATGGACACTTTGTAGCATGTTTCTTTACTGGGTTTAAATGGGCTCATATAGCTTATTGGGTTTTATGTTCTTGTAAATTTTAAGCTAAATTCAAGGACCAATTAAATCTTCCTTCACATTTTTTTTAAAAAAAAATTAATTATTTTTAATATTTAGTAAAAAAACTAAAGTATTTTGGGTTTGATCTTTTAATATATTTATGAACGTGTCCCTAACGTGTCCATAATCCTATGTTTTAGAAAATTGGTGTGTCGTAATGTCATGTCATGTCTATGCTTCTCACGCGAAGCTTGTCCATGCAGTTGGGTTTTGGAAAAAGTCTTTCCTAAGTTGGTTAAGGGGAAGAAATTCTTTTAAGGAAAGGTTGGAAGTGCATTTTCTTCTTCTTCTTCTTCGTTTTTTTTTTTTTTNTATAATTAATTAATGAACATTTATTTTTTCTGGACGAGAAGCTTAGCTTTTATTATCAGAAGCGAGTGAAAAAATGGAGGGAGACTTTGTTCTAAGTGGTTCTTGCAAACTTCTCCAACTAAAATCCGTCTTTCCTTACCGTCTATCTTGCTTGAGGATGGGTAGTGTAGAAAATTGACCTGTAAATTTGAGATCTTGGCTGCATATATTTGATTTCCAGCCGTTGCTTAAAAGTCATTTATCCTAAATCTCCCTGCAGGTGTGGAGAATGCGCTCCTTTGTATGTAGTGTTTTCTTATTATCTCGAAGTTCTATATTAGGGAGTAATTACATATTTTAGAGCAATGGAAGTTCTTTTTTGCCTCATAATGGGAAGCATTGTTATTCATTTCTGTCTCAATACGGAGAAGTGATTGCATTCTAATATCTCTATCATCTCTTTCACGTAGGGAAACTTTAACAGATAATTTGGGAGTCGTAGGTGGAGAAAAGATTGATGAAAACCTGAACAAGCCTAAGAGAAAAGATGTAACCAGGAAGCTTGGAACTCAAAACAATCAAGACGATGCTACTACAAACAATATGAATAAAGGCACTAGCATTTATGATGCAGGATTGGAGAGATTGCCCGATAAAACGAACTGTATGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTCCTAGTGTGGATGACTTGGTTCCCTCCAATCCAATGGAGAAGAGAAAGGGTGTACCGACTCCCACCCAAGTTATCATGTCATATGTAAAGAAAATACATGGTAGTCCAGTTTACAATCACTATGAAGCAACTATCCCATGTTCGGTGACTGGTAGGCAAGTTTACAATCACTATGAAGCAACTATCCCATGTACGGTGAATGAATCGAAGGCTTCAGAGAGTGGTATCAAAGTTGAGGTAAGATGTTGTGCGTCTCTTCATATACTTAGTCTTGATTTTCTATTTATCTTTAGGATCTTAAGTTACATATTTTGCCTGAATTTTTATCAAACATTAGGACTCCTGATGCTTATGGGGACATTGACTGACATTGACTCGGATTTTATTCTCTCCTATAAAGTTTGGCTTAAAGCATGTCTATATTTTAAGAATTATTGTGGGTTCATCCCTTGAAGGATCTGGACATGATAATTTTTGGACCAGTTTTTGCATGCTTTATTCACTCAAGATGAGATTCTCTGTTGGGACCTATGGTCCAAGGGACAGTCTTGAGGAGGGGTCAACGTGGTTACTCTATGTTTTTGACACCACAAGAGATACTATTTGGCTTTAGGTATCATTCTTTTAATGGATTAATTGAAGAATTTGCACATTTAAATCCCTGACCTTTTTTGATAATGTAAAGAGTTATGTCAAGTGTAGCAAGGTAAATAATTAAAAAAATTGAAGTTTTTAAAGAATTGTTAGTCCTTAAAAAATGATTCATTGCTTCTCAATTAATATAGGTACACAACTTTGAAAATACTTGAAGGAAACTGGAAAAAGAAAACATCTAGAGTACAAGATAACCCCTGTAAGCATAGGACATGACCAGATACTCTTTATACGTCCCAGTAAATTACAAGATAACCAAGTAATCCTCTATCTTCAACTGCATGAACTTTGATATAATTAAACAGAAAAACTCTGTTATAGAACTAGTGGAAAAATTAAACCATTGTAAATGCATTGAAATGAAGGATGGTGAAGACGTAATAGTGCAATCTAGCGCAAAAGAGTTGGGAACTAGAACTCTCGACACAGGATCCTTATTATTATTCATTTAAGTTCATTGACAAATCTATTATAAATGAAAACCAATTAATTCCAACTATTATGCAAGTTTTCGGATTCAGTATTCTTTTAACATGGAACCAAGGAGTTATTGAATTCAATTTTTTGCGTGCCACATCTTTACTTCAAAATTAATTATCTGTATGATGAGCTTGTTAACCCAAGGAAAATTCGATCTGTACTTGACAGAATGCTAGTTATTAATGTTGTAAATTAATAATTGTCATAATAACTCAGATTTTTCAGATAGTACAATCTGTAGCTTTTAATGAATTAATTTTTGCCTTATTTTATTATATTTATAATTCAGCTGATAGAATAATTTACTTTCCAGGATGGAATACTAGCAACAAACCCGTGTATTGCTGAAGGCAGTGGTGAAAAGGTTGCATCTGGCAATCTCTCTGACAATATTTCAGATCAAAATAGGAATGATGATCATGTTCTCATCACCTGTCAATCGAACACAAAGCATCTTTCCAAGATGCAGGCAATTATTTCGAAAGAAACAGCATTGTCACAAGCTGCCATTAAAGCTCTAATTAGAAAGAGGGATAAACTGGTACAAACACACATCTTATATTGTGAAGTTTTGGGCTTATCCTTGTCTCATGAATTTATTTTGAACCAGAAACTTTTAAAGATAGCGTGATGCAAAAAAATGAGAACGCGTAATATATAATAAATAGCTGAGAGCTTCCACTTCCACCATAATTGTAAAATAGAAATGGGCGACGTGATTTAAATCTCACGTCTCCTGCACCTTTGTTGATGAGATGGGTAGGAAATTAATTAAGCTTGCTCTGGTTTCTTTTTGTATGTCTCACCTTACAGTGGTATTCAGCTGGCCCTTTCTCTCTGACCAAGGGTTTGTCATAAATTGAGCCCTTTTGAATCATCTTCCATAATCGTTATTTCTGACAGTGTAATCCAATCATTGTCTGCAGTCTCATCAACAGCGCATAATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTATGTTCTATTTTCTTTTTTCTTTTTTTGTTCCTCCTGTTCTTGTTTTGTTTCTACAGTTTATTCTATTCGATATGCTTCATGTATGAAAGCCTCTAGAAATTTAGATGATTTCCTTCTTCAAGTACTAGCAGTCCTCGGTCCTCTTTTTTGCTTCATTTTTACTTGATGAACTCTTACCCCAGCTGTCTATGATTTATGGTCTTATAAGCCTGTGGTGCATGCTATAAAATGATTTTTCAGTATTGGCTTGAGCCTATGACTCTAAATGTCGTTATAATTGGTAGTCTAAGGATTTTGTAGATCCTGAGCTTCAAATGGAAGTAAAATGCATATTCTAGGATTTTGTATTTTGTATTTGTCATTGCTTTAATCATTTGATGTTTATACTGGAGCTAGATTTTCAATTTTATATGTAAACTGATGAATTGTGTTATATTAGAACAGGTAATATTAGAAACTCATTATTGAATTAAAATATGCATTGAAAGAAGTTTGAAATTGTTAGTGGTGCATCCTAGTATTTTAGATAAATGAAGAAAAAAAAATGTGGGGTGCTAAGTACTCATGTTACTGTTTTTTTTAAAAAAAAAATTAATTAGCACCAATGTACAAGTAGCTAACACCATTGGAGATGCTTTATAACGTTAAGGAGATGCTTTAAATTTGAACTTGAGTTTTTACTTTATTCCCTAGTTTTGCTTATTTGATTCCATAATTATTTTTAGATTTTATTTTGATATTTTTTCATCAGATCAAGTTAATCTATGTCATCATAACTAACCAAAGAGAATGTCTGAATTACATTCACATTTTGGTTGCTAAAGGTGATGAAGATGATTTTGTTGTAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGTCTAAGAAGCGCAGCTGAAGATAAACCTTATCAATACTCTGAAGAAAACTGCTCATCTCAACTTGTCACAAGGAAGAGATTGTCAGAAGAAATTCTCTGCATACGGAATCCATGTCAGGTGGGTTAACCATTGAAGATAATAATATTCAAAATTTTTATATTTGAGGGTATTGTGGTTGTATTCGTATATTTATGTCTTAATGCTGCCCAAGTTATATTTTTAAAAATAATAGGATCTCATCATTCCTTACAATAGATTTAAGTAAAAGAGATCAGAGGGCCTTGAAAGGAACTCTTCAAAAGAATTGCCATGCTAAAATTATACATCACAAATAACACAGAAGACATTATTTCGAACAACTGATTTGGAAGAATAGTGATGTTAAATAGCTTTATTATGGCTTAGTATACGAGTTGGAAATATGTGGACTTAAAACATCGAGATTATGATGTCACTTTTTAAAAATTTGAGATACACATAAGAACTTATCATTAAAAGTTGAAAGTAGGACTTATATAGACAACTGGAAAATTTTTTCTTCTTTCATTTTTTTTTTTTGGGGTGGGGAGGGTGTGAATTTTTGAGTTCCTATTTTTGTCTACTTTATACATATGACTAATCTATGCAAGGAAAATGCTGCCTCAATATTCTTTATGCTACTGGTCAGCACACCTACCAGTACACCTATTTTTCGAGATCCTAGTTTTGTCTACATGTTTATATTTATGTTTTAAACTTCTTTATGGTCCCTCTCATACGAAAGAGATTTCCAACTATTCTCTTTTTGACTTTTTCTTTCTGTAGATTTTCCTCTCAATATCTTTTAGAACTGGGAAAGGACACCTACTATGTCTTCACTTAAAAAAAAAAAAAAAAATCTTTATCATGGATGGAATGTGGTCCTCAAAGAAGAAAAATTGAGAACTTAGTCTATGAGCTTAGTTATAAATACAAATTTTATCTTTCTATGGGTGACGTTAGGCCTCCAATAGTAAGAGTGTACTGGAGGAGGGGTAAAAAGTGAAACATCCTTTGGAAGTTTAATAGGATGAATTGCCCTCGTATTTATAGGGGAGTGATGTAAGTGTGAGGGAGAAGATGAATCTTGTGAAGTTTGTATAGGGAGGGAAGCTGGCCCTCAAGTTTTGTAAGGTGCTTGGTTCACCTCTTTCTTTGTGAATAGTAATAATATTAGTGTTCATGCCTACTTTCCTTAAAATATACTTTCTTGTATAAACAATAATTGTCGAAGGATATTTGTGTAATTCCATAAAATAAGGGTTAGCGATTATTATTTAGATTTACTGTTACGTAGTATTTAGGGATTTATTTAAATTTAATAGAGTATTTAGATTATTTAGTGATTTAGATTTAGTTAGTCTCTCCCTCCCCTTGTAAAAGAGACATGTAATTCTCTGAAAATAAATAAGAAAAAACAGCAAGCTAATTTAGACTCTCAACATGGTATGAGAGCATCGTATTAGATCAGAACATCGAAGGGAATTTTCTTGATGTGTTAAATAAGGATATGAAGACGTTGAAGTTCCGGATTCCTTTGTACAAAAAGTTGTCCTTGGCTGGAAGCACCATCCATTTGATTGTCGGGAAGTGAAATGGCGAGGAAGAATGAGCTCGAGTTGCCAAATTTGTATGGCCATAAGTGTTTTTTATATTTTGCTGCATTTTTTGAAAATGTAGTGTTAACGAAGCAGTTACTTACACACTGGTGGATTGGAGAAGGCCGTCTGGACCCTTCAGGTAGTGGGGATCAAACACCTGAAGTTAAAGAAATTTACAATGCCCCCTATTTTGTATTCTGCTGCAATTATACTGGCCAAAGAAGAGATGTTTTTTTAATTACGATTCCGGAGGACAACTCATTTGGTAAATCTTTTGGCTGCTCATGATATTACAATCTATGGCAACCTATCTGCACTTGCAACTCTTGGACCAACAGAGTTAAAGAGTTTAGTTTTGGATATTGAGAGTTTTAGTTTTGGATATTGAGAGTTTTCAGTTTTATTCGGAGCCTTCATAGCTACCGGTCTTCCACTTGACCTAATTTATTCAGAGCTCTCATAGCTACCGATCCTTTCCCTCGACCAAATTTATCAGGTGTCATTGACAAATTTATTCAGAGCTTTAGCTAGTTGACAAATGTATTCGGAGCCTTCATAGCTACCGTTGAAGTCATGTCTTGATGTTGAAGTCGTGTCATCGAACTCGTGCAATCAATATTGCACATTAACCTAACCAGTCCAAGCTAAGAACGAATGATATTTATTACTCAGCTTGAGAAGGTGTCGAAGGAATGATATCTATTACTCAGCTTGACAAGGGGGTGTCAAAATTTATGTGTAATTCATAAAATAAGGGAAATAGTTGACCGTTGGATTTATTTAGATTTAGTTACCTACTATCTCTTTACATTTGTATTTGGATTATTTAGATTATCTAGTGATTTAGATTTAGTTAGTTGTTCTCCCTTATAAAGAGGACAGGTAAGTCTCTGAAAATAAATAAACAGCAGTCTAATTTAGACTTCAACAATAACGGTAGATTTCAACTTGCTCTCTTGCCTGTAATTCAAATGCTAACTTTTGAAATTGGTTGAACAGATCCTTGTCTTCTGGGTTCTTTAATTTAAACTGTAAAATCCATGTGGATTCAGTGGATTTTCATTTCGTTAAGAGAGAGTTTCCTTTCTAGGACAAGAGACACGTATTAAATTCAAGTTGCATTGTCCTGAGTAACAAATCTTCATATTTTTCCCAGAATGCAGCAAATGGATCACAAAAACTTGTAACTTGTAATACCATTGATTAGATACGCTGTTGAGGATGCTGCTAAGATGGTATTTGTTATGTATGGCAAACTTAGGCTAATAAAATGTGTAGTGTATTGATATCTCTAATGCGTTAACGATATCATGTACGTCAAACTTATTGGGACCGGTTATCAATTTGGACTTGCCTAATGCACAGGGAACATTGAACTTCAAAACTTTTGAAAAGCAAGTAGCATAAGCTTTCAGATACTGTTTCAAATAGTGTTGGGATGTAAATTTGATGTTAGTCTGCTATACATTTTTATTTCATTATTTCCACATTTGCTAACATTTGCAGCTGTAAATGTAATGATGCTCTGTTGGTTGGCTGGCAGGAACTGGACGATATATGTCATAAGAATAATTGGATATTGCCAGTCTATGGAGTTTCGTCATCAGATGGTAAGATCACATGTTTTTTCTCGCCTGCAAGTTATTCTTCTTCTTTTTGTTTTTTTGTTTTTAATAAATAAACTGCATTGCAAATCATTTGGATTTATATGGTGGATTCTCCCTGTTGCATCTCGAAATATTCCACAAATATGATGTCTCTTTCCTTCTTTCCCCTCTCCTTCCCCATAAGAAAATTTCAATGGTGTTTCATCCTATTCGCTTCCATTCATCCGTGGGTATGTTGCCGCTTCTCCCTTCGTGGAGACGGCCACCGCCGTCGAGTGTGATCAGGCTGCCACCTCAGTTCTCGGTTTAAAGCCTTGCCATCTCAGGTTGGACATGAGTTTGACCTTCATAGGTTGGAGAAGGGGTATCCTTTGATTATTGGGAGAATTAGTATTCTCCATGATAGAGGTTGCGAGGGTCATTTTGATAGGGGATGTTCCACTTCGCTGTGTGGTTGATGCCATTTTGAGTGCCTTGGGGCTTCTTGATATTGGACATATCTTTCCAGATTCAAATCCCAAATAG

mRNA sequence

TTTATACATCACAAGCACATTTTTTAAACTCATATATCAGACGACGTAATTTATTGACGGAATCACAAGCAGTTGACATGCAGTGTTCGGTCCACTCACACAGTGCGCAGTCACCTCCTCCGCCTTCCGGACACTTGCAGACTGATTCCTGTACTCTCGAACTTCGCATTAAAACTTTCATGGCTGATCGCCCATCCTCTCAACTACTCCCCCTCCTCTCCGAACCCTAATCTTCGCGCTTCAGCTTCCATTCGAGTTCTTTTCGTGGGCTCCCCAATTTTCATTCCAAGAAGGAAGTTTGAATCCTTTGTTATTTTCCTCCGTTGATGGAAAGGTCTTGCTCCTGATTGTGGGAAGAGGATATCATATATAAGAAACTGAAAAAGCAGTAGTGGTAAAAAAAACACCCTTACCATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATACTTGCATTATTGGATTATTTAGTTGAACCTATGCTTCCTTCAAAGTCATCTTCGATAGAAAATCCACCACTAGCTCTACTGCAATCAGTTGCAAAACAGATGCATGCCGTTGTTTTGTTATACAACTACTACCACAGGAAACAACACCCACATCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAATTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGCTCTGATGATATTGAATTGGAAAATCCCGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCATGTGGTTTAGCCACTTGTCTATATACATCAAAAGATGAAAATATAGAGGGCTGGCCCCTTTCCAAGGTTGCCGTTTTTTTGATCGACTCCAAGAAGGAGCATTGCCATTTGCTGTTTAGTTCCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAAATTTAGATACCTCTGAATGCCAACCAAAAAGCGTGGAAGAGGAAAAACATGTAAATAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGTTGGAACTAAGACTCAGCAGCTTGCATATTCAGCAGTCAAGGAAGCAACTGGCATGATTTGGGACAATATCAGGAGGCTCTTGAAGCAAATGGCATGCATTAATCAACACGATCTCAAAATTTTAGAAAGTCACGTTGCCTACTCTCTAAGTAAAGAAAAATCAGCAGTCTACTTTTATATGATGCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCCATAAAAGATGCCGTTGACAGTTTGCAGGATTCGTTGTTTAAAAAAAATGGTAGGAGATGGAGCGTTACTTCAAAAGTTGAGTACTACCACATTCTTCCTTATGTGAAGATGGTGCTAACCTGGTTTCACAGGGAAACTTTAACAGATAATTTGGGAGTCGTAGGTGGAGAAAAGATTGATGAAAACCTGAACAAGCCTAAGAGAAAAGATGTAACCAGGAAGCTTGGAACTCAAAACAATCAAGACGATGCTACTACAAACAATATGAATAAAGGCACTAGCATTTATGATGCAGGATTGGAGAGATTGCCCGATAAAACGAACTGTATGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTCCTAGTGTGGATGACTTGGTTCCCTCCAATCCAATGGAGAAGAGAAAGGGTGTACCGACTCCCACCCAAGTTATCATGTCATATGTAAAGAAAATACATGGTAGTCCAGTTTACAATCACTATGAAGCAACTATCCCATGTTCGGTGACTGGTAGGCAAGTTTACAATCACTATGAAGCAACTATCCCATGTACGGTGAATGAATCGAAGGCTTCAGAGAGTGGTATCAAAGTTGAGGATGGAATACTAGCAACAAACCCGTGTATTGCTGAAGGCAGTGGTGAAAAGGTTGCATCTGGCAATCTCTCTGACAATATTTCAGATCAAAATAGGAATGATGATCATGTTCTCATCACCTGTCAATCGAACACAAAGCATCTTTCCAAGATGCAGGCAATTATTTCGAAAGAAACAGCATTGTCACAAGCTGCCATTAAAGCTCTAATTAGAAAGAGGGATAAACTGTCTCATCAACAGCGCATAATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTGATGAAGATGATTTTGTTGTAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGTCTAAGAAGCGCAGCTGAAGATAAACCTTATCAATACTCTGAAGAAAACTGCTCATCTCAACTTGTCACAAGGAAGAGATTGTCAGAAGAAATTCTCTGCATACGGAATCCATGTCAGGAACTGGACGATATATGTCATAAGAATAATTGGATATTGCCAGTCTATGGAGTTTCGTCATCAGATGGGGATGTTCCACTTCGCTGTGTGGTTGATGCCATTTTGAGTGCCTTGGGGCTTCTTGATATTGGACATATCTTTCCAGATTCAAATCCCAAATAG

Coding sequence (CDS)

ATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATACTTGCATTATTGGATTATTTAGTTGAACCTATGCTTCCTTCAAAGTCATCTTCGATAGAAAATCCACCACTAGCTCTACTGCAATCAGTTGCAAAACAGATGCATGCCGTTGTTTTGTTATACAACTACTACCACAGGAAACAACACCCACATCTTGAATTTTTGAGTTTTGAGGCATTTTGCAAATTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAGCTCTGATGATATTGAATTGGAAAATCCCGAGAAGCAGCTTTCTCCAGCAGAAAAAGCAATTATGGATGCATGTGGTTTAGCCACTTGTCTATATACATCAAAAGATGAAAATATAGAGGGCTGGCCCCTTTCCAAGGTTGCCGTTTTTTTGATCGACTCCAAGAAGGAGCATTGCCATTTGCTGTTTAGTTCCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAAATTTAGATACCTCTGAATGCCAACCAAAAAGCGTGGAAGAGGAAAAACATGTAAATAAAAAGAAAAGAGTGATTAAGAAACCTTCAAAAGAGGGGCTAGTTGTTGTTGGAACTAAGACTCAGCAGCTTGCATATTCAGCAGTCAAGGAAGCAACTGGCATGATTTGGGACAATATCAGGAGGCTCTTGAAGCAAATGGCATGCATTAATCAACACGATCTCAAAATTTTAGAAAGTCACGTTGCCTACTCTCTAAGTAAAGAAAAATCAGCAGTCTACTTTTATATGATGCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCCATAAAAGATGCCGTTGACAGTTTGCAGGATTCGTTGTTTAAAAAAAATGGTAGGAGATGGAGCGTTACTTCAAAAGTTGAGTACTACCACATTCTTCCTTATGTGAAGATGGTGCTAACCTGGTTTCACAGGGAAACTTTAACAGATAATTTGGGAGTCGTAGGTGGAGAAAAGATTGATGAAAACCTGAACAAGCCTAAGAGAAAAGATGTAACCAGGAAGCTTGGAACTCAAAACAATCAAGACGATGCTACTACAAACAATATGAATAAAGGCACTAGCATTTATGATGCAGGATTGGAGAGATTGCCCGATAAAACGAACTGTATGAGTAGTTTGCATGATGCGATCTGCAGGCCCCAGAGTCCTAGTGTGGATGACTTGGTTCCCTCCAATCCAATGGAGAAGAGAAAGGGTGTACCGACTCCCACCCAAGTTATCATGTCATATGTAAAGAAAATACATGGTAGTCCAGTTTACAATCACTATGAAGCAACTATCCCATGTTCGGTGACTGGTAGGCAAGTTTACAATCACTATGAAGCAACTATCCCATGTACGGTGAATGAATCGAAGGCTTCAGAGAGTGGTATCAAAGTTGAGGATGGAATACTAGCAACAAACCCGTGTATTGCTGAAGGCAGTGGTGAAAAGGTTGCATCTGGCAATCTCTCTGACAATATTTCAGATCAAAATAGGAATGATGATCATGTTCTCATCACCTGTCAATCGAACACAAAGCATCTTTCCAAGATGCAGGCAATTATTTCGAAAGAAACAGCATTGTCACAAGCTGCCATTAAAGCTCTAATTAGAAAGAGGGATAAACTGTCTCATCAACAGCGCATAATTGAAGATGAGATAGCTCAGTGTGATAAAAATATGCAGACAATATTAAGGGGTGATGAAGATGATTTTGTTGTAAAGCTGGATTCTGTGATTGAATGTTGTAATGATGTATGTCTAAGAAGCGCAGCTGAAGATAAACCTTATCAATACTCTGAAGAAAACTGCTCATCTCAACTTGTCACAAGGAAGAGATTGTCAGAAGAAATTCTCTGCATACGGAATCCATGTCAGGAACTGGACGATATATGTCATAAGAATAATTGGATATTGCCAGTCTATGGAGTTTCGTCATCAGATGGGGATGTTCCACTTCGCTGTGTGGTTGATGCCATTTTGAGTGCCTTGGGGCTTCTTGATATTGGACATATCTTTCCAGATTCAAATCCCAAATAG

Protein sequence

MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRKQHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLATCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSVEEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQHDLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETALSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK
Homology
BLAST of Cp4.1LG12g05070 vs. ExPASy Swiss-Prot
Match: Q9CAK8 (2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ISPF PE=1 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 7.5e-07
Identity = 27/33 (81.82%), Postives = 28/33 (84.85%), Query Frame = 0

Query: 660 SDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK 693
           SDGDV L CVVDAIL ALGL DIG IFPDS+PK
Sbjct: 109 SDGDVLLHCVVDAILGALGLPDIGQIFPDSDPK 141

BLAST of Cp4.1LG12g05070 vs. ExPASy Swiss-Prot
Match: Q9M4W3 (2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, chloroplastic OS=Catharanthus roseus OX=4058 GN=ISPF PE=2 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 1.7e-06
Identity = 26/33 (78.79%), Postives = 28/33 (84.85%), Query Frame = 0

Query: 660 SDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK 693
           SDGDV L CVVDAIL ALGL DIG IFPD++PK
Sbjct: 114 SDGDVLLHCVVDAILGALGLPDIGQIFPDTDPK 146

BLAST of Cp4.1LG12g05070 vs. ExPASy Swiss-Prot
Match: Q6EPN6 (2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=ISPF PE=1 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 1.7e-06
Identity = 26/33 (78.79%), Postives = 28/33 (84.85%), Query Frame = 0

Query: 660 SDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK 693
           SDGDV L CVVDAIL ALGL DIG IFPDS+P+
Sbjct: 100 SDGDVLLHCVVDAILGALGLPDIGQIFPDSDPR 132

BLAST of Cp4.1LG12g05070 vs. ExPASy Swiss-Prot
Match: Q72HP8 (2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase OS=Thermus thermophilus (strain ATCC BAA-163 / DSM 7039 / HB27) OX=262724 GN=ispF PE=3 SV=1)

HSP 1 Score: 48.9 bits (115), Expect = 2.7e-04
Identity = 21/40 (52.50%), Postives = 29/40 (72.50%), Query Frame = 0

Query: 653 PVYGVSSSDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK 693
           PV  ++ SDGD  L  + DA+LSA GL DIG +FPD++P+
Sbjct: 28  PVGALAHSDGDAALHALTDALLSAYGLGDIGLLFPDTDPR 67

BLAST of Cp4.1LG12g05070 vs. ExPASy Swiss-Prot
Match: Q8RQP5 (2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase OS=Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8) OX=300852 GN=ispF PE=1 SV=1)

HSP 1 Score: 48.9 bits (115), Expect = 2.7e-04
Identity = 21/40 (52.50%), Postives = 29/40 (72.50%), Query Frame = 0

Query: 653 PVYGVSSSDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK 693
           PV  ++ SDGD  L  + DA+LSA GL DIG +FPD++P+
Sbjct: 28  PVGALAHSDGDAALHALTDALLSAYGLGDIGLLFPDTDPR 67

BLAST of Cp4.1LG12g05070 vs. NCBI nr
Match: XP_023548856.1 (uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023548857.1 uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023548858.1 uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1273 bits (3293), Expect = 0.0
Identity = 647/662 (97.73%), Postives = 647/662 (97.73%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA
Sbjct: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 647

Query: 661 DG 662
           DG
Sbjct: 661 DG 647

BLAST of Cp4.1LG12g05070 vs. NCBI nr
Match: XP_022929885.1 (uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_022929886.1 uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_022929887.1 uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_022929888.1 uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1263 bits (3269), Expect = 0.0
Identity = 643/673 (95.54%), Postives = 648/673 (96.29%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAE SGEKVASGNLSDNISDQNRNDDH LITCQSNTK+LSKMQAIISKETA
Sbjct: 481 DGILATNPCIAECSGEKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 658

Query: 661 DGDVPLRCVVDAI 673
           DG      ++  +
Sbjct: 661 DGGFQANVILKGL 658

BLAST of Cp4.1LG12g05070 vs. NCBI nr
Match: XP_022929889.1 (uncharacterized protein LOC111436360 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1260 bits (3261), Expect = 0.0
Identity = 642/673 (95.39%), Postives = 647/673 (96.14%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAE SGEKVASGNLSDNISDQNRNDDH LITCQSNTK+LSKMQAIISKETA
Sbjct: 481 DGILATNPCIAECSGEKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 658

Query: 661 DGDVPLRCVVDAI 673
           DG      ++  +
Sbjct: 661 DGGFQANVILKGL 658

BLAST of Cp4.1LG12g05070 vs. NCBI nr
Match: KAG6575356.1 (hypothetical protein SDJN03_25995, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1252 bits (3240), Expect = 0.0
Identity = 638/673 (94.80%), Postives = 643/673 (95.54%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DL+ILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKK+GRRWS
Sbjct: 241 DLRILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKSGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNP EKRKGV 
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPTEKRKGVS 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPC VTGRQVYNHY ATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCLVTGRQVYNHYGATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDH LITCQSNTKHLSKMQAIISKETA
Sbjct: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHALITCQSNTKHLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 658

Query: 661 DGDVPLRCVVDAI 673
           DG      ++  +
Sbjct: 661 DGGFQANVILKGL 658

BLAST of Cp4.1LG12g05070 vs. NCBI nr
Match: XP_022992174.1 (uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima] >XP_022992175.1 uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima] >XP_022992176.1 uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1248 bits (3228), Expect = 0.0
Identity = 637/662 (96.22%), Postives = 638/662 (96.37%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDA LERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV 
Sbjct: 361 DDATTNNMNKGTSIYDAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVL 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDH LITCQSN KHLSKMQAIISKETA
Sbjct: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 647

Query: 661 DG 662
           DG
Sbjct: 661 DG 647

BLAST of Cp4.1LG12g05070 vs. ExPASy TrEMBL
Match: A0A6J1EPE2 (uncharacterized protein LOC111436360 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)

HSP 1 Score: 1263 bits (3269), Expect = 0.0
Identity = 643/673 (95.54%), Postives = 648/673 (96.29%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAE SGEKVASGNLSDNISDQNRNDDH LITCQSNTK+LSKMQAIISKETA
Sbjct: 481 DGILATNPCIAECSGEKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 658

Query: 661 DGDVPLRCVVDAI 673
           DG      ++  +
Sbjct: 661 DGGFQANVILKGL 658

BLAST of Cp4.1LG12g05070 vs. ExPASy TrEMBL
Match: A0A6J1EQ29 (uncharacterized protein LOC111436360 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)

HSP 1 Score: 1260 bits (3261), Expect = 0.0
Identity = 642/673 (95.39%), Postives = 647/673 (96.14%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAE SGEKVASGNLSDNISDQNRNDDH LITCQSNTK+LSKMQAIISKETA
Sbjct: 481 DGILATNPCIAECSGEKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 658

Query: 661 DGDVPLRCVVDAI 673
           DG      ++  +
Sbjct: 661 DGGFQANVILKGL 658

BLAST of Cp4.1LG12g05070 vs. ExPASy TrEMBL
Match: A0A6J1JUZ0 (uncharacterized protein LOC111488583 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488583 PE=4 SV=1)

HSP 1 Score: 1248 bits (3228), Expect = 0.0
Identity = 637/662 (96.22%), Postives = 638/662 (96.37%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDA LERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV 
Sbjct: 361 DDATTNNMNKGTSIYDAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVL 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDH LITCQSN KHLSKMQAIISKETA
Sbjct: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
           LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 647

Query: 661 DG 662
           DG
Sbjct: 661 DG 647

BLAST of Cp4.1LG12g05070 vs. ExPASy TrEMBL
Match: A0A6J1ENX1 (uncharacterized protein LOC111436360 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)

HSP 1 Score: 1199 bits (3102), Expect = 0.0
Identity = 618/673 (91.83%), Postives = 623/673 (92.57%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDAGLERLP+KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP
Sbjct: 361 DDATTNNMNKGTSIYDAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAE SGEKVASGNLSDNISDQNRNDDH LITCQSNTK+LSKMQAIISKETA
Sbjct: 481 DGILATNPCIAECSGEKVASGNLSDNISDQNRNDDHALITCQSNTKNLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILR                     
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILR--------------------- 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
               AEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 ----AEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 633

Query: 661 DGDVPLRCVVDAI 673
           DG      ++  +
Sbjct: 661 DGGFQANVILKGL 633

BLAST of Cp4.1LG12g05070 vs. ExPASy TrEMBL
Match: A0A6J1JP19 (uncharacterized protein LOC111488583 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488583 PE=4 SV=1)

HSP 1 Score: 1183 bits (3061), Expect = 0.0
Identity = 612/662 (92.45%), Postives = 613/662 (92.60%), Query Frame = 0

Query: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
           MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK
Sbjct: 1   MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60

Query: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
           QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSP+EKAIMDAC LA
Sbjct: 61  QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPSEKAIMDACDLA 120

Query: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
           TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDT ECQPKSV
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTYECQPKSV 180

Query: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQH 240
           EEEKHVNKKKRVIKKPSKEGLVVV TKTQQLAYSAVKEATG               INQH
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVETKTQQLAYSAVKEATG---------------INQH 240

Query: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWS 300
           DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDA DSLQDSLFKKNGRRWS
Sbjct: 241 DLKILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAFDSLQDSLFKKNGRRWS 300

Query: 301 VTSKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360
           VTSKVEYYHILPYVKMVLTWF RETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ
Sbjct: 301 VTSKVEYYHILPYVKMVLTWFRRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQ 360

Query: 361 DDATTNNMNKGTSIYDAGLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVP 420
           DDATTNNMNKGTSIYDA LERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGV 
Sbjct: 361 DDATTNNMNKGTSIYDAVLERLPDKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVL 420

Query: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480
           TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE
Sbjct: 421 TPTQVIMSYVKKIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVE 480

Query: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQAIISKETA 540
           DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDH LITCQSN KHLSKMQAIISKETA
Sbjct: 481 DGILATNPCIAEGSGEKVASGNLSDNISDQNRNDDHALITCQSNAKHLSKMQAIISKETA 540

Query: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVC 600
           LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILR                     
Sbjct: 541 LSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILR--------------------- 600

Query: 601 LRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 660
               AEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS
Sbjct: 601 ----AEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSS 622

Query: 661 DG 662
           DG
Sbjct: 661 DG 622

BLAST of Cp4.1LG12g05070 vs. TAIR 10
Match: AT1G05950.1 (unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 303.9 bits (777), Expect = 3.3e-82
Identity = 236/669 (35.28%), Postives = 363/669 (54.26%), Query Frame = 0

Query: 4   TGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRKQHP 63
           T  CPTEDAI ALL+ LV+P+LPSK +  + P  ++ +SVAKQ+HAVVLLYNYYHRK +P
Sbjct: 14  TDSCPTEDAIRALLESLVDPLLPSKPTD-DLPSTSIRESVAKQVHAVVLLYNYYHRKDNP 73

Query: 64  HLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLATCL 123
           HLE LSFE+F  LA V+KPALL H+K        E      Q    EK I+DAC L+  L
Sbjct: 74  HLECLSFESFRSLATVMKPALLQHLK--------EDGGVSGQTVLLEKVIVDACSLSMSL 133

Query: 124 YTSKDENI-EGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSVEE 183
             S D  I    P+ +VAV L+DS+K+ C+L  SSITQGVWS++E          K +E+
Sbjct: 134 DASSDLFILNKCPIRRVAVLLVDSEKKSCYLQHSSITQGVWSLLE----------KPIEK 193

Query: 184 EKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGMIWDNIRRLLKQMACINQHDL 243
           EK   + ++      +EG+       Q++A++ VKEATG               +N  D+
Sbjct: 194 EKAARENQK------EEGVF------QKVAFAVVKEATG---------------VNHKDI 253

Query: 244 KILESHVAYSLSKEKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVT 303
            ILE H+  SLS+EK+AV FY+M+CT S  +   + P+++ +  +Q  LF+K+   W++ 
Sbjct: 254 VILERHLVCSLSEEKTAVRFYIMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMN 313

Query: 304 SKVEYYHILPYVKMVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDD 363
           S VEY+H+LPY  ++  WF R   T+ +     E + +++              ++N+ D
Sbjct: 314 SIVEYFHVLPYATLIEDWFSRRGDTEFVIEKEPEAVCDDI--------------ESNKVD 373

Query: 364 ATTNNMNKGTSIYD----AGLERLPD-KTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRK 423
           AT    ++ + I++    A L+R  + K   +++L                 S+P  + K
Sbjct: 374 ATKE--SEVSDIFERREKAALKRRYEIKAKKVAAL----------------LSHPGARGK 433

Query: 424 GVPTPTQVIMSYVK-KIHGSPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESG 483
                T++   Y+K  + G+   N +  T+  ++  + V N      PC  N S   + G
Sbjct: 434 AT---TRLQNRYLKGSMSGAKEPNVHSETV-VALKAKNVGNEMS---PCKDNYSNGEKGG 493

Query: 484 IKVEDGILATNP--CIAEGSGEKVASGNLSDNISDQNRNDDHVLITCQSNTKHLSKMQ-A 543
            +V     A++P      G   K A  +  ++I   N        +  ++  +L ++Q +
Sbjct: 494 FEV-----ASDPKELKERGLQRKKAVPDRLNSIHKLNSTP----ASAHNSNPNLEELQTS 553

Query: 544 IISKETALSQAAIKALIRKRDKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVI 603
           ++SK T+LS+ A+K L+ KRDKL+ QQR IEDEIA+CDK +Q I    + D+ ++L++V+
Sbjct: 554 LLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAKCDKCIQNI----KGDWELQLETVL 578

Query: 604 ECCNDVCLRSAAEDKPYQYSEENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILP 663
           ECCN+   R     +  Q S +  + Q   R +LSE +   ++ CQ LDDIC  NNW+LP
Sbjct: 614 ECCNETYPR-----RNLQESLDKSACQSNKRLKLSETLPSTKSLCQRLDDICLMNNWVLP 578

BLAST of Cp4.1LG12g05070 vs. TAIR 10
Match: AT1G63970.1 (isoprenoid F )

HSP 1 Score: 57.4 bits (137), Expect = 5.3e-08
Identity = 27/33 (81.82%), Postives = 28/33 (84.85%), Query Frame = 0

Query: 660 SDGDVPLRCVVDAILSALGLLDIGHIFPDSNPK 693
           SDGDV L CVVDAIL ALGL DIG IFPDS+PK
Sbjct: 109 SDGDVLLHCVVDAILGALGLPDIGQIFPDSDPK 141

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9CAK87.5e-0781.822-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, chloroplastic OS=Arabidop... [more]
Q9M4W31.7e-0678.792-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, chloroplastic OS=Catharan... [more]
Q6EPN61.7e-0678.792-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, chloroplastic OS=Oryza sa... [more]
Q72HP82.7e-0452.502-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase OS=Thermus thermophilus (s... [more]
Q8RQP52.7e-0452.502-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase OS=Thermus thermophilus (s... [more]
Match NameE-valueIdentityDescription
XP_023548856.10.097.73uncharacterized protein LOC111807382 isoform X1 [Cucurbita pepo subsp. pepo] >XP... [more]
XP_022929885.10.095.54uncharacterized protein LOC111436360 isoform X1 [Cucurbita moschata] >XP_0229298... [more]
XP_022929889.10.095.39uncharacterized protein LOC111436360 isoform X2 [Cucurbita moschata][more]
KAG6575356.10.094.80hypothetical protein SDJN03_25995, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022992174.10.096.22uncharacterized protein LOC111488583 isoform X1 [Cucurbita maxima] >XP_022992175... [more]
Match NameE-valueIdentityDescription
A0A6J1EPE20.095.54uncharacterized protein LOC111436360 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1EQ290.095.39uncharacterized protein LOC111436360 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JUZ00.096.22uncharacterized protein LOC111488583 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1ENX10.091.83uncharacterized protein LOC111436360 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JP190.092.45uncharacterized protein LOC111488583 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT1G05950.13.3e-8235.28unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bac... [more]
AT1G63970.15.3e-0881.82isoprenoid F [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR0365712-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase superfamilyGENE3D3.30.1330.50coord: 654..692
e-value: 9.0E-9
score: 37.4
IPR0365712-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase superfamilySUPERFAMILY69765IpsF-likecoord: 657..692
IPR0035262-C-methyl-D-erythritol 2,4-cyclodiphosphate synthasePFAMPF02542YgbBcoord: 658..691
e-value: 5.2E-7
score: 30.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 400..419
NoneNo IPR availablePANTHERPTHR33913ALEURONE LAYER MORPHOGENESIS PROTEINcoord: 1..663

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG12g05070.1Cp4.1LG12g05070.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016114 terpenoid biosynthetic process
molecular_function GO:0008685 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase activity