Cp4.1LG14g05630 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG14g05630
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionEmbryo-specific 3
LocationCp4.1LG14 : 764462 .. 774304 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGGGAACGTTTCCACTAAGAGGCTGAAGGATGGGCTCGACCTGTAACGCCATGACCTTTATAAAAACCGACACATCATCAGTTTCCCATTTTCGTTTTCGTGTGTGACGTTTTAGAGAAAATGGAGAGGTCCTCGTCGCCTCCTCTGTTGTTGCTACTGCTGCTCTCTTCCCTCGCCGTCGTCTACTCCGCTGTCGATCCGCCTCAGATCACCGTGAAACCCGACGTTGCTCCCGAGTCCCTCCCGATTAACTACATCCAGGTCCGTGTTTCAACTCTCTTTACTCTCAACCGTCGCTTTTGTATTTTTCCTTTTAACTTTTGCGTAATATACTTCTCTTTTTCTTTTGTGTTGCGAAGGAAGTTGGAAGCTGTTCTTACGAGGTTACAGTAGCAACAAGCTGTTCATCTCCTTCATTTATCACTGACGAAATCGGTGTGTTGTTTGGAGATTCTCATGGAAATCAGGTGTTATTTCTTTTCTGTCATTTTCCTTTTTAAAATTATTAACTCTAGGGTTTTTACTTTTATTTACCTATTTCTGAAATTAATTCCCCTGAATCCGAACAAGAGTATGGATTTTGTTTCTAATCTTACGCCAGGGTGATTCTGGCCTGAATAATAAACTTCATTATTATTATTTTTTTGTTAATCAACCACTTATCAACTCTAGGTGTTCTTTCTTTTCTGTCATTTTCCCTTTTAAAATTATTAACTCTAGGGTTTTTATTTTTATTTACCTATTTCTGAAATTAATTCCCCTGAATCCGAACAAGAGTATGGATTTTGTTTCTAATCTTACGCCAGGGTGATTCTGGCCTGAATAATCAACTTCATTATTATTATTTTTTTGTTAATCAACCACTTATCAACTCTAGGTGTTCTTTCTTTTCTGTCATTTTCCCTTTTAAAATTATTAACTCTAGGGTTTTTATTTTTATTTACCTATTTCTGAAATTAATTCCCCTGAATCCGAACAAGAGTATGGATTTTGTTTCTAATCTTACGCCAGGGTGATTCTGGCCTGAATAATCAACTTCATTATTATTATTTTTTTGTTAATCTAGCTCTTAATATATGATCTGATTCACGGATTTTTGTTTTGTTTTGATTGAATGCTTCACATGTGTAACGAATAGATTGTGGAGAAGAAGTTGAGCAATGGAGACAAGGTGTTNTGATGATTCTGGCCTGAATAATCAACTTCATTATTATTATTTTTTTGTTAATCAAGCTCTCAATATATGATCTGATCCACGGATTTTTGTTTTGTTTTGTTTTGATTGAATGCTTCACATGTGTAACAAATAGATTGTGGAGAAGAAGTTGAGCAATGGAGACAAGGTGTTTGATAGTTGTAATACAGATAGGTTTGTTCTAAATGATCGACCCTGCAGTGTCCAAATCAATTATATGTATATCTATAAAGACGGAGCTGACGATTGGCTTCCGAACTCTGTTGAAATCTCTGGCTCTGGAATCAAACCCCTTTTGTTCATCTTCAAATCCTCCATCCCAAGAAACACTTGGTTTGGCTTTGATTTACGCCAATATCCTTTTCCATCGCCGCCGTCTTACAACCCGTACCCTCCTTTTCCATCGTTGTCTCCTCCTCCTCCTCCTCCTCCCCATCCCCTGCCACCGCCGCCGGAGCCAATTCTTTCTCCTCCGCCTCCACCTAAGCCTGTTCTTCCCCCACCGTCTCCTCCTCCTCCTCCTCCCCATCCCCTGCCACCGCCGCCGGAGCCAATTTTTTCTCCTCCGCCTCCACCTAAGCCTGTTCTTCCCCCACCGCCTCCTCCTCCACCATCATCCTCCAGCAAAATCTCTGGTCTGAAATTGGCTAAGGTATCTGTGGTTTTGGGATTGCTGTTTGCTCTTCTGTGATTCTGCAGTAGCTGCTTGCTTTATGTAGTAGTTTGATTTTGTAGTGGTGGGTGGGTGATGTTAATGTTCAAGTTTCATCCCCCATTCCACCATGTTGTAATACCAATGTATGCTTTCAAAGCAATAGTGAAACTTTATCTTTTCAGCTGTATGGCTATAGGCCTGGTGCAATTNGTACCAATGTATGCTTTCAAAGCATTAGTGGAACTTTATCTTTTCAGCTGTATGGCTATAGGCCTGGTGCAATTCTGCAGCCACTGTTTGTTCTATGTGGGTGGCGGTTGAGAATGGTAGCTCTATGGTGGTGGGTGATACTATATTTACGATTCATCTCCTTTTCCACCATGTTTTAAGGTCTAGACAAGCAAGTTTTACCGAACCTCTTTATGTTATCGTCTTTCAGTTGTATTGGATACATGCCTTCTCTATGAAAGTCGGCCAAGCTATCTATGTCGAAAGTTGTATGGTGTTTTATTTGTCAATAATGTGGGATGGAGTGGATGTGTTAAATTTTGATTTTTGAGTATTTTTTGTTTTTATTTCAACCATAATTATTATAAATAAAAGAACATTGGTTAGAACATTATTAATTTTGAGAATTATACCAAACTTAAGCTACACCTAAAAGTACTTAATAATCTGACATTTCAACTTTGTATTTAATATGGTTTTTAATATCTTAAGTAATTAACAAAATGGGCTATTATACCATCTTAATTCGATTATTTGTCCAGATATTTCTAAAATTACACTAAAAGGTTTTTCCCTCCATGAAAATTTTATAACGAATATCAATATTTAGTGTAAAAAAAAACTACTTTTTAAATTTTATTTACATTTTAATTTCTTTTTTTATTTCCTTAAATATTGTCAACCTGAACAAAAGCAACACCATCTTCCGTTCACATATAGAAGAGGATAGAAATATCTAGTGATAGTTTCGTATTTTTTAAATAGGTCACTTTCGGAGTTACAACCAAACACATGAAGTCTAACAAGTAAATCGACATAGTATTGTGTTAGAAAAAACCCCACTTACCCCGTCCCCAATAATATTCTCATCTCAAGTTCAAACCTTCAAATTAAGCGACTCCAAAAGGGAGAATTTCATTTCGGCCTATTCATCTTTTTAAGTGTAATCGAGACTAAATATTCTTTAAAATTTTAGATTTAAAATAGATTGATGTTGATTTAAAAAGGTGTAATTTTAGTTGGCATTAGTAAGAAAAGGAAAATAGTTTGTTGGTTACAAAAGATTGGGGATCCCTTGCATTTTGAGTTTGGTATAAATATTTTATTTATTTATTTGCTTTTTTTTTTTTTTTTTTCGACTTCCCGGCAGTGAAAACCTACCTACTCCATCACCGATTGACCATATATTTGATTGGAGCCAAGACTCTGGTTCTGCACCTCTTCAACCAAGCTTCTTGTACTCTCGTTTCAGTGTAGCCAAACGAAGGTCTGTAAAAATGGCGATGTCTCTGCTTTCTTTCTTCATCTTTACCTTCTTCTCCATTTCTGCCGCCGTATCATCCGGTTCTCTGCCTCAACCTGCTCCAAAATCCTTAACAATCGGCTATATTCAGGTTCCGTTTTATCATTTTACCTTTTACTTTCGTTACTCAATCGATGTGCTTATCTGAGGTTCTCTGTGAATCAAACTGACTCTGTTATCTTCTTCTTTGGTACTTTCGGGTGAAGAAAGCGGGAGATTGCAACTACAGGGTCAATATAACAACAAGCTGCTCTTCGCCTTTCCATATTACTGCTGAAATCGGCGTTCTGTTTGGGGATGCGCATGGAAATCAGGTTCGTTTTTCGGGCATTTTAATTTAAATGATTTTGCAATTAGTTCATCTGCTGAATATTGTCGACTCTAGAGGAGAAAATTGAAAGCATATTGATTAACAATTAAGCATCCAAGGCCAAACAGAAGTCTATTCCTGATTTTTCATTGAATATGGCTGTTTTCCTGCTGTGCTCTATATTATAGTCAAATTTTTTATATTGGAGTTTTGAATTGCAAGAAGAAAAAAAAAAAAACAAATAAACTAGAATTAGAAGCTTACGATCTGTGATTCTATTGAAAGGATACATGGTTGGTTCACTTAAACTGGAATTAGAAGCTTATAATCAGTTTTCAAATATGTTTATCATTACAGATTTGAAACCAAAAACAAGAAATTGTTTGTAGAATAACAGGTAGTAGCACTGTTAGTTTCTAAACTTTCAAATGCTTTATTAGTTCCTCAACTTTAAAAGTTTGTGTTAGTCTCTACAGTTTACTTACATTCATAAGCTTGATATTTAATCTGTTACACAGTGACTTTAAACACTTAGAAATGTGGGTATGGTAAGACTTCATTAAGTAATGTTTAAAAGGATGAATATGGCTGATTTAGAAGTTGGAAAGCAGATATACGAGCCAAAGCTGGAAGTTGAAAGCAGCAAAGCATTTGCAAAATGCAGCAAAGACATTTTTGAATTGACAGGGCCATGCACAGACCAGATATGTTTCTTTTACCTTTACAAGAATGGATCGGACGATTGGATTCCAGAGACTGTAGAAATCTCCAGCCCTGATATTGACACTGTTAAATACAAATACAACTCCTCAATTCCAGACGACACATGGGATGGCTTTGACGACTGCCAATACTTCACATCGCCACCGCCACCGCCACCATCTCCGCCAGCCCCCTCCACAGCCGGCCATCTGCGGAGGTTGAAAGGGCTTGCTTATGTGATCCCTGTGCTTCTTAGCAGCGCTGTGCTGTGAGTGTGATTCTGCATCTTGGCCATGGGCGTGGTTGTATTTTTGATCCACCGCTGTGCCTGTATTTGCTTTGTGTTTTTTGATCTTTTGATGATCATATCAAGGTCAAAGGAATGTAGTTCATCTTTGATATGCAGAATGGGATTTGATGGAGGAATATCTGTTATGGTCAGTACCTTTTTCTTTCTCGGCTTTGAGAGTAGAATATTAATTTTTGTTTAAATATTTGCTTATGCAGAGATATATGAAACATACTAAATTGTTTTTATTTATTTATTTATTTTTTAAGTATGAGAGACCCATTTGAGAGTTAAAAACATTTAAACCATTGAATTTATAACTTTCCCAAAATAAGAAAAAAAGAAAAAAAAAAAGGAAAAGAAAAGAAAATTTGTGTACCTTTCAATGGTTGACATAAATATGGGTGGCGTATAATTCGTACGTCTCTGTGTTCTTCCGTATTCTTTTTCTTCTTCAAAAGCTCCGCAACGCTAAAGCCAAACACACAAACAAGGAGAAATGAAACGGGTTCTCTGTTTTCTGCTCACCTCCGCCTTTCTCTTGGCTCTCTTGGAAGCCACGGAATTGCTGCCCGAATCCGCTGAATCCTTCAATTTAACCTATATTCAGGTATCAGCGTTCCGTATCGGTTTTTTTCCATTTATGCTGATTTTCAATCTTGTATCTTACAACCATTTCGTGTTTCTCTTCTTCTTTCTTGTTTGGGGATTTCTGATGTTGAAGCAACTCGCGAGTTGTTCTTATTCGGTCGTTATATCAACTAGCTGTTCTTCGCCTACATACACAAGGGATCAGATCAGCGTTTCTTTCGGCGATGCTTATGGCAACCAGGTTTTCTTCGTTTAAATCTTCATTTTTCCCCTTCATTTTATCAGAATACTAAGCCCTGGGTTTCTCGATCGTTCGCGATTTAACTAATTCAGTTATTCGAAATGGTCGATGAACGAAATCGTAGTTGAATTTTGAATCTGAGTTTGGGAATAATTTTACATTTGGCGAGGGATATAGAAACATATTAATATCGTTAGGTTAAAGTAATTGATTGGTCATCCTTTATCTTTCGATGCAAAATGAAAGAGAAATCGAGTTGGGTTTGTATTTCTCGATTATGATGAGATGATAGCCTGTTGATTTATATGTTTCAGAACTTCAAATCAGATATTTATTTATCCATAGTCATTAGCAAGTTGAGCAATATCTTGTACCGTTTAGAACTTCCTAAAACATTTTTAAAGAGCGTCATGCCATCGTTCGATCATTCAAAAACAATAAAAATTGTCTTTACTTTCAGATATATTTAGGAATTTGACGTGTTACCATTAATGATTGAAGAAACTTGTGTACCAGATTTATGTGCCTAGGATTGATGATCCATCCAGAAGGATATTTGAAAGATGTTCCTCTGATACATTTAGCGTAAGTGGACCTTGTGCTTACCAAATATGCTATGTCTATCTTTATCGCTCTGGACCGGATGCCTGGATCCCAACAACGGTGAAGATCTCTGGTCCTAATTCTCGACCTGTCACGTTTAACTACAACACTGCCATACCAAACGACGTATGGTTCGGGTTTAACTTGTGTGGTCATTCTTCATCTTCTAACCGTATTTCAAGTTGCATATGGTTCTTATACGTTAGTTTCGTGTCTATTCTTCTGCTTTTTTTGTAATTTTATTGTATGTCAAAATGCTACTCATGTTCTTTTCTTGGTAAGAACACATTTTGATGTAGCACTCTTGTAGCATAGAGAACATGAGTATCTCACTATAAAGCAGCATCAGGAGTGTACTTATGTATATTTAGTGATTATACACAGGCAGAGGCAATTCTTGGTGATTGAGCCTCTTGACTTGGGATAAGCTTTTGGATGTCTTGTTATGGAACTCAGAAACAGAGGATTTTTCTGGTTTTATTTACCTTTTAGATATATTTGTCTTTGATTCATGTGAGATTCTCCTATTGTTAGCTTTGTCCTATAAAAATCTGTTTTATCAACCATTAAGGTGGGTAATAGCCCCAGTCTCTGATATATTGGAGGTTGGGGGACGGGATTTGTTCTTGAACGGGCCAATTACTATCATTATTATTATTTGAAGTAAGTTCATAAACGGTGCCTAATCATAACTCAATCGAATAAAGTAAAAAAAAATTATTTCATTTCATTTTTCTTTTGTAACAATTTTATTTTGATTGATATTTAATATATTTTATATACTAATTGTATATTAATGATATATTTACTAATGTTGTTTCAAAATTCATTTGAGGTATAAGGTTGATGTACCCATATTACTCTATAATATACAAAGAATGTAAAAATTTATATTAGATGATCTTCAAATTTGTGCTTGAAGGAAATTTGACATAAGTTTTTATAACATTTGAATCATCAAAAAAGAAAAAAAGAAATTCTTGGGGAGCAAGGAAATTGGTAGAAGATTCTGGACCTGAAACCGCCCCATTTGGCTTGGAATTTGGGTCTATCCGATATTAAGATGATGAGTCTGCTGATGCCCCAGCCCCGCCAAGCTGTCGATGCATCGGCTCTCCATGGCCGCCACGGTACGCATCTTAACTTCTACATAAATCCGGATCAACCCATTTCACTTTCTGCCTTGTTGCTTCTTCCAACTTCTTTTCTCCCCTTTTGATTCTACTTTCCCCTTCTCGAGAACTTTTATACATATATTCATGAGTGAGGTAGTTTCTGATTGATTCTCATCTATTGGATTACACATATGTTTTACTCTAGAGATATGAGTTGCTATAGAAAATCCAAGTCTGCATTTGATGCATTCCGGAACTTGTCTTCAAAGCTTTTTCCCGAGTTAATTCGAGATTCTAAGTCAAGAATTTCCCGCGGTGGGCATTCGTTTACGGCTCGAAAAATGTCTAATTCTTATGGGTTTCAATCGTCTTCTCCAATTATACAAAGATTTGGAAGACAAGTTCGAGAGAAGAGGAGGCTATACGATCCCTTCTTCGGTGATTCCAAGAGATTTTACTATGTCGATCACTACCGTGTCCAGCATTTTAAGTCCAGAGGACCTCGGCGATGGTTTCAGGATCCAAGAACCGTATTGGTTGTTGTGTTTGCGGGTTCTGGGGTTTTTATCACCGTGTATTATGGGAATCTAGAAACCATACCTTATACTAAACGAAGGCATTTCGTACTCTTGTCTAGAGCTATGGAGAGGAGCCTCGGGGAGTCGCAATTTGAGCAAATGAAGGCAGCTTTCAAGGGTAAAATATTGCCTGCTGTACATCCAGAAAGTATTAGAGTAAGATTGATAGCTAAGGATATAATTGATGCATTACAAAGAGGGTTGAAGCAAGAGAATGTTTGGAGTGATTTAGGGTATGCATCAGAGGCTGCGATTGGAGCCCCTGAAGGGAGTGGCAATGAGACATTGATGGCGCTTAGGGACTCTGGGGCTGGGAAGATGGAAGCTAAATGGTACCATGAAGACGAGATTCTTGATGACAAATGGGTCGAGCGCAGTAGAAAGAAGGGTCAGAAACAGGGGTCCCAAGCAGATATCTCGCATTTGGATGGATTGAAATGGGAGGTTTTGGTGGTGAATGAGGCAGTTGTTAATGCATTTTGCTTGCCTGGTGGGAAGATCGTTGTTTTCACGGGCTTGCTCGATCACTTCAGAAGTGATGCAGAAATCGCAACTATTATTGGTCATGAGGTATGTTATAAAGATTTGTTCTTCAACTTACAACCTTTCACTTGGCTCAGCCTTAAGGTGTTGTTTTTATTAATTAGATTGGGCATGCTGTGGCACGACATGCTGCAGAGGGCATCACAAAGAACCTTTGGTTTGCCGTTTTGCAACTCATCCTTTATCAATTCGTGATGCCTGATATTGTGAACACTGTGTCCACGCTTTTCTTGAGGCTTCCTTTCTCTAGAAGGTAAGAGAAGAGCTATATGGTATGGGTAATTTGTTGAATGAAGATATAAAGGGGTAGTGTAATTCATTTTGATGGGTTCAGGATGGAAATGGAAGCGGATTACATTGGTCTGCTTTTGATTGCCTCTGCTGGATACGACCCGAGGGTTGCACCCACTGTGTATGAGAGGTTGGGTAAGGTGACGGGGGATTCCGCGCTGAGGGATTATCTTTCTACTCATCCATCGGGAAAGAAAAGAGCTCAGTTGCTAGCTCAAGCTAAGGTTATGGAGGAAGCACTTAGTGTTTACAGAGAAGTAAGAGCCGGACGTGGGGTTGAAGGCTTCCTATAAGACACACAAAGCTGCCCATGCCATGGGAACTTAAAGAGAAAAGCCTCCCTCCTGAATTTAAGGTATGGTTTTGATATGTTACTCACCCTTGAATTTGTCGAGTTCATTGAGCTTTCTCCTCTATAGCTTCCACATTACCTTCCCCTGTTTCACATCCATGAATGGAGTCCCATTTCCATCTCTTCCTCCTACGTTGGACGGGCGCAGTCTGTGTTTCATGGACAAAACACACTGTTTATCCAAATTTTGGCCCAGAGTCCAACAATCCATGGGCCGAATGAATCGATGGGCATCCAATTTGAGGCCTATTTCAACAATCCACGGGCCGCATTGATGACCCTTCTCGGTTCTTTAATCTTGCCGGTGTCGGAGAAACTATGACACGTCGAGGGTATGAAAGCGATTCGTGGGCAAATCGCAAGAAATTTACGTACCCAATTTCAAGAGCGTGATTGAGCATTTAAGCTTGCCGTTATCGGGGGGGTCCAGTAATAAAGGCGGTCGGAGAAGTGTTGAAGCTTAATGAAAAGGACGTGGAAGTGACATTGATGACTCTGCATAGATTTGGTAACCACTCGTCTTCTTCGTTGTGGTATGAACTTTAGCGTATTTGGAAGCTAAACCTAAAGAAAGAGAGGTATCATTGATATGTTAGATGATAAAAGTCCCATGTTAGCTAATATAGGAAATGATCACGAGTTTATAGGTAAGGAATGCTCTCTCCATTGGTACGAGAAAGAGTTTTTAATCTTAGTTTACAAGTTACAAAATGGATTATAAGTCATGTTTTTAGAGAGTCGTGTTTTTAGAGAATCTAAAGTGCAACTAGACAATGATGTCAGGTTGCAAGAAATTGGATGGAAAAGGTTGATAAAGTTACTGATAAAGTAAATTTTTACAGCTCAA

mRNA sequence

CGGGAACGTTTCCACTAAGAGGCTGAAGGATGGGCTCGACCTGTAACGCCATGACCTTTATAAAAACCGACACATCATCAGTTTCCCATTTTCGTTTTCGTGTGTGACGTTTTAGAGAAAATGGAGAGGTCCTCGTCGCCTCCTCTGTTGTTGCTACTGCTGCTCTCTTCCCTCGCCGTCGTCTACTCCGCTGTCGATCCGCCTCAGATCACCGTGAAACCCGACGTTGCTCCCGAGTCCCTCCCGATTAACTACATCCAGGAAGTTGGAAGCTGTTCTTACGAGGTTACAGTAGCAACAAGCTGTTCATCTCCTTCATTTATCACTGACGAAATCGGTGTGTTGTTTGGAGATTCTCATGGAAATCAGATTGTGGAGAAGAAGTTGAGCAATGGAGACAAGGTGTTTGATAGTTGTAATACAGATAGGTTTGTTCTAAATGATCGACCCTGCAGTGTCCAAATCAATTATATGTATATCTATAAAGACGGAGCTGACGATTGGCTTCCGAACTCTGTTGAAATCTCTGGCTCTGGAATCAAACCCCTTTTGTTCATCTTCAAATCCTCCATCCCAAGAAACACTTGGTTTGGCTTTGATTTACGCCAATATCCTTTTCCATCGCCGCCGTCTTACAACCCGTACCCTCCTTTTCCATCGTTGTCTCCTCCTCCTCCTCCTCCTCCCCATCCCCTGCCACCGCCGCCGGAGCCAATTCTTTCTCCTCCGCCTCCACCTAAGCCTGTTCTTCCCCCACCGTCTCCTCCTCCTCCTCCTCCCCATCCCCTGCCACCGCCGCCGGAGCCAATTTTTTCTCCTCCGCCTCCACCTAAGCCTGTTCTTCCCCCACCGCCTCCTCCTCCACCATCATCCTCCAGCAAAATCTCTGGTCTGAAATTGGCTAAGCTGTATGGCTATAGGCCTGGTGCAATTNGTACCAATGTATGCTTTCAAAGCATTAGTGGAACTTTATCTTTTCAGCTGTATGGCTATAGGCCTGGTGCAATTCTGCAGCCACTGTTTGTTCTATGTGGGTGGCGGTTGAGAATGGTAGCTCTATGGTGTGAAAACCTACCTACTCCATCACCGATTGACCATATATTTGATTGGAGCCAAGACTCTGGTTCTGCACCTCTTCAACCAAGCTTCTTGTACTCTCGTTTCAGTGTAGCCAAACGAAGGTCTGTAAAAATGGCGATGTCTCTGCTTTCTTTCTTCATCTTTACCTTCTTCTCCATTTCTGCCGCCGTATCATCCGGTTCTCTGCCTCAACCTGCTCCAAAATCCTTAACAATCGGCTATATTCAGAAAGCGGGAGATTGCAACTACAGGGTCAATATAACAACAAGCTGCTCTTCGCCTTTCCATATTACTGCTGAAATCGGCGTTCTGTTTGGGGATGCGCATGGAAATCAGATATACGAGCCAAAGCTGGAAGTTGAAAGCAGCAAAGCATTTGCAAAATGCAGCAAAGACATTTTTGAATTGACAGGGCCATGCACAGACCAGATATGTTTCTTTTACCTTTACAAGAATGGATCGGACGATTGGATTCCAGAGACTGTAGAAATCTCCAGCCCTGATATTGACACTGTTAAATACAAATACAACTCCTCAATTCCAGACGACACATGGGATGGCTTTGACGACTGCCAATACTTCACATCGCCACCGCCACCGCCACCATCTCCGCCAGCCCCCTCCACAGCCGGCCATCTGCGGAGGTTGAAAGGGCTTGCTTATGTGATCCCTGTGCTTCTTAGCAGCGCTGTGCTGTCAAAGGAATGTAGTTCATCTTTGATATGCAGAATGGGATTTGATGGAGGAATATCTGTTATGGTATCAGCGTTCCGTATCGGTTTTTTTCCATTTATGCTGATTTTCAATCTTGTATCTTACAACCATTTCGTGTTTCTCTTCTTCTTTCTTGTTTGGGGATTTCTGATGTTGAAGCAACTCGCGAGTTGTTCTTATTCGGTCGTTATATCAACTAGCTGTTCTTCGCCTACATACACAAGGGATCAGATCAGCGTTTCTTTCGGCGATGCTTATGGCAACCAGATTTATGTGCCTAGGATTGATGATCCATCCAGAAGGATATTTGAAAGATGTTCCTCTGATACATTTAGCGTAAGTGGACCTTGTGCTTACCAAATATGCTATGTCTATCTTTATCGCTCTGGACCGGATGCCTGGATCCCAACAACGGTGAAGATCTCTGGTCCTAATTCTCGACCTGTCACGTTTAACTACAACACTGCCATACCAAACGACGTATGAGATATGAGTTGCTATAGAAAATCCAAGTCTGCATTTGATGCATTCCGGAACTTGTCTTCAAAGCTTTTTCCCGAGTTAATTCGAGATTCTAAGTCAAGAATTTCCCGCGGTGGGCATTCGTTTACGGCTCGAAAAATGTCTAATTCTTATGGGTTTCAATCGTCTTCTCCAATTATACAAAGATTTGGAAGACAAGTTCGAGAGAAGAGGAGGCTATACGATCCCTTCTTCGGTGATTCCAAGAGATTTTACTATGTCGATCACTACCGTGTCCAGCATTTTAAGTCCAGAGGACCTCGGCGATGGTTTCAGGATCCAAGAACCGTATTGGTTGTTGTGTTTGCGGGTTCTGGGGTTTTTATCACCGTGTATTATGGGAATCTAGAAACCATACCTTATACTAAACGAAGGCATTTCGTACTCTTGTCTAGAGCTATGGAGAGGAGCCTCGGGGAGTCGCAATTTGAGCAAATGAAGGCAGCTTTCAAGGGTAAAATATTGCCTGCTGTACATCCAGAAAGTATTAGAGTAAGATTGATAGCTAAGGATATAATTGATGCATTACAAAGAGGGTTGAAGCAAGAGAATGTTTGGAGTGATTTAGGGTATGCATCAGAGGCTGCGATTGGAGCCCCTGAAGGGAGTGGCAATGAGACATTGATGGCGCTTAGGGACTCTGGGGCTGGGAAGATGGAAGCTAAATGGTACCATGAAGACGAGATTCTTGATGACAAATGGGTCGAGCGCAGTAGAAAGAAGGGTCAGAAACAGGGGTCCCAAGCAGATATCTCGCATTTGGATGGATTGAAATGGGAGGTTTTGGTGGTGAATGAGGCAGTTGTTAATGCATTTTGCTTGCCTGGTGGGAAGATCGTTGTTTTCACGGGCTTGCTCGATCACTTCAGAAGTGATGCAGAAATCGCAACTATTATTGGTCATGAGATTGGGCATGCTGTGGCACGACATGCTGCAGAGGGCATCACAAAGAACCTTTGGTTTGCCGTTTTGCAACTCATCCTTTATCAATTCGTGATGCCTGATATTGTGAACACTGTGTCCACGCTTTTCTTGAGGCTTCCTTTCTCTAGAAGGATGGAAATGGAAGCGGATTACATTGGTCTGCTTTTGATTGCCTCTGCTGGATACGACCCGAGGGTTGCACCCACTGTGTATGAGAGGTTGGGTAAGGTGACGGGGGATTCCGCGCTGAGGGATTATCTTTCTACTCATCCATCGGGAAAGAAAAGAGCTCAGTTGCTAGCTCAAGCTAAGGTTATGGAGGAAGCACTTAGTGTTTACAGAGAAGTAAGAGCCGGACGTGGGGTTGAAGGCTTCCTATAAGACACACAAAGCTGCCCATGCCATGGGAACTTAAAGAGAAAAGCCTCCCTCCTGAATTTAAGGTATGGTTTTGATATGTTACTCACCCTTGAATTTGTCGAGTTCATTGAGCTTTCTCCTCTATAGCTTCCACATTACCTTCCCCTGTTTCACATCCATGAATGGAGTCCCATTTCCATCTCTTCCTCCTACGTTGGACGGGCGCAGTCTGTGTTTCATGGACAAAACACACTGTTTATCCAAATTTTGGCCCAGAGTCCAACAATCCATGGGCCGAATGAATCGATGGGCATCCAATTTGAGGCCTATTTCAACAATCCACGGGCCGCATTGATGACCCTTCTCGGTTCTTTAATCTTGCCGGTGTCGGAGAAACTATGACACGTCGAGGGTATGAAAGCGATTCGTGGGCAAATCGCAAGAAATTTACGTACCCAATTTCAAGAGCGTGATTGAGCATTTAAGCTTGCCGTTATCGGGGGGGTCCAGTAATAAAGGCGGTCGGAGAAGTGTTGAAGCTTAATGAAAAGGACGTGGAAGTGACATTGATGACTCTGCATAGATTTGGTAACCACTCGTCTTCTTCGTTGTGGTATGAACTTTAGCGTATTTGGAAGCTAAACCTAAAGAAAGAGAGGTATCATTGATATGTTAGATGATAAAAGTCCCATGTTAGCTAATATAGGAAATGATCACGAGTTTATAGGTAAGGAATGCTCTCTCCATTGGTACGAGAAAGAGTTTTTAATCTTAGTTTACAAGTTACAAAATGGATTATAAGTCATGTTTTTAGAGAGTCGTGTTTTTAGAGAATCTAAAGTGCAACTAGACAATGATGTCAGGTTGCAAGAAATTGGATGGAAAAGGTTGATAAAGTTACTGATAAAGTAAATTTTTACAGCTCAA

Coding sequence (CDS)

ATGGAGAGGTCCTCGTCGCCTCCTCTGTTGTTGCTACTGCTGCTCTCTTCCCTCGCCGTCGTCTACTCCGCTGTCGATCCGCCTCAGATCACCGTGAAACCCGACGTTGCTCCCGAGTCCCTCCCGATTAACTACATCCAGGAAGTTGGAAGCTGTTCTTACGAGGTTACAGTAGCAACAAGCTGTTCATCTCCTTCATTTATCACTGACGAAATCGGTGTGTTGTTTGGAGATTCTCATGGAAATCAGATTGTGGAGAAGAAGTTGAGCAATGGAGACAAGGTGTTTGATAGTTGTAATACAGATAGGTTTGTTCTAAATGATCGACCCTGCAGTGTCCAAATCAATTATATGTATATCTATAAAGACGGAGCTGACGATTGGCTTCCGAACTCTGTTGAAATCTCTGGCTCTGGAATCAAACCCCTTTTGTTCATCTTCAAATCCTCCATCCCAAGAAACACTTGGTTTGGCTTTGATTTACGCCAATATCCTTTTCCATCGCCGCCGTCTTACAACCCGTACCCTCCTTTTCCATCGTTGTCTCCTCCTCCTCCTCCTCCTCCCCATCCCCTGCCACCGCCGCCGGAGCCAATTCTTTCTCCTCCGCCTCCACCTAAGCCTGTTCTTCCCCCACCGTCTCCTCCTCCTCCTCCTCCCCATCCCCTGCCACCGCCGCCGGAGCCAATTTTTTCTCCTCCGCCTCCACCTAAGCCTGTTCTTCCCCCACCGCCTCCTCCTCCACCATCATCCTCCAGCAAAATCTCTGGTCTGAAATTGGCTAAGCTGTATGGCTATAGGCCTGGTGCAATTNGTACCAATGTATGCTTTCAAAGCATTAGTGGAACTTTATCTTTTCAGCTGTATGGCTATAGGCCTGGTGCAATTCTGCAGCCACTGTTTGTTCTATGTGGGTGGCGGTTGAGAATGGTAGCTCTATGGTGTGAAAACCTACCTACTCCATCACCGATTGACCATATATTTGATTGGAGCCAAGACTCTGGTTCTGCACCTCTTCAACCAAGCTTCTTGTACTCTCGTTTCAGTGTAGCCAAACGAAGGTCTGTAAAAATGGCGATGTCTCTGCTTTCTTTCTTCATCTTTACCTTCTTCTCCATTTCTGCCGCCGTATCATCCGGTTCTCTGCCTCAACCTGCTCCAAAATCCTTAACAATCGGCTATATTCAGAAAGCGGGAGATTGCAACTACAGGGTCAATATAACAACAAGCTGCTCTTCGCCTTTCCATATTACTGCTGAAATCGGCGTTCTGTTTGGGGATGCGCATGGAAATCAGATATACGAGCCAAAGCTGGAAGTTGAAAGCAGCAAAGCATTTGCAAAATGCAGCAAAGACATTTTTGAATTGACAGGGCCATGCACAGACCAGATATGTTTCTTTTACCTTTACAAGAATGGATCGGACGATTGGATTCCAGAGACTGTAGAAATCTCCAGCCCTGATATTGACACTGTTAAATACAAATACAACTCCTCAATTCCAGACGACACATGGGATGGCTTTGACGACTGCCAATACTTCACATCGCCACCGCCACCGCCACCATCTCCGCCAGCCCCCTCCACAGCCGGCCATCTGCGGAGGTTGAAAGGGCTTGCTTATGTGATCCCTGTGCTTCTTAGCAGCGCTGTGCTGTCAAAGGAATGTAGTTCATCTTTGATATGCAGAATGGGATTTGATGGAGGAATATCTGTTATGGTATCAGCGTTCCGTATCGGTTTTTTTCCATTTATGCTGATTTTCAATCTTGTATCTTACAACCATTTCGTGTTTCTCTTCTTCTTTCTTGTTTGGGGATTTCTGATGTTGAAGCAACTCGCGAGTTGTTCTTATTCGGTCGTTATATCAACTAGCTGTTCTTCGCCTACATACACAAGGGATCAGATCAGCGTTTCTTTCGGCGATGCTTATGGCAACCAGATTTATGTGCCTAGGATTGATGATCCATCCAGAAGGATATTTGAAAGATGTTCCTCTGATACATTTAGCGTAAGTGGACCTTGTGCTTACCAAATATGCTATGTCTATCTTTATCGCTCTGGACCGGATGCCTGGATCCCAACAACGGTGAAGATCTCTGGTCCTAATTCTCGACCTGTCACGTTTAACTACAACACTGCCATACCAAACGACGTATGA

Protein sequence

MERSSSPPLLLLLLLSSLAVVYSAVDPPQITVKPDVAPESLPINYIQEVGSCSYEVTVATSCSSPSFITDEIGVLFGDSHGNQIVEKKLSNGDKVFDSCNTDRFVLNDRPCSVQINYMYIYKDGADDWLPNSVEISGSGIKPLLFIFKSSIPRNTWFGFDLRQYPFPSPPSYNPYPPFPSLSPPPPPPPHPLPPPPEPILSPPPPPKPVLPPPSPPPPPPHPLPPPPEPIFSPPPPPKPVLPPPPPPPPSSSSKISGLKLAKLYGYRPGAIXTNVCFQSISGTLSFQLYGYRPGAILQPLFVLCGWRLRMVALWCENLPTPSPIDHIFDWSQDSGSAPLQPSFLYSRFSVAKRRSVKMAMSLLSFFIFTFFSISAAVSSGSLPQPAPKSLTIGYIQKAGDCNYRVNITTSCSSPFHITAEIGVLFGDAHGNQIYEPKLEVESSKAFAKCSKDIFELTGPCTDQICFFYLYKNGSDDWIPETVEISSPDIDTVKYKYNSSIPDDTWDGFDDCQYFTSPPPPPPSPPAPSTAGHLRRLKGLAYVIPVLLSSAVLSKECSSSLICRMGFDGGISVMVSAFRIGFFPFMLIFNLVSYNHFVFLFFFLVWGFLMLKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDTFSVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV
BLAST of Cp4.1LG14g05630 vs. TrEMBL
Match: A0A0A0L8G7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177970 PE=4 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 9.1e-76
Identity = 148/199 (74.37%), Postives = 164/199 (82.41%), Query Frame = 1

Query: 355 SVKMAMSLLSFFIFTFFSISAAVSSGSLPQPAPKSLTIGYIQKAGDCNYRVNITTSCSSP 414
           S+ MA  LL FF F+FFSISAA S+ S      KSL+I YI++AGDCNYRVNITTSCSSP
Sbjct: 7   SLHMATPLLPFFFFSFFSISAAQSTAS------KSLSIAYIREAGDCNYRVNITTSCSSP 66

Query: 415 FHITAEIGVLFGDAHGNQIYEPKLEVESSKAFAKCSKDIFELTGPCTDQICFFYLYKNGS 474
           F+I++EIGVLFGDA GNQIYEPKLEVES  AF KC KDIFEL GPC DQICFFYLYKNGS
Sbjct: 67  FYISSEIGVLFGDAQGNQIYEPKLEVESGNAFRKCRKDIFELIGPCIDQICFFYLYKNGS 126

Query: 475 DDWIPETVEISSPDIDTVKYKYNSSIPDDTWDGFDDCQYFTSPPPPPPSPPA-PSTAGHL 534
           D+WIPETVEISSPDIDTVKY YNSSIP+DTW GF+DCQYF SP PPPP PP+ PSTAG L
Sbjct: 127 DNWIPETVEISSPDIDTVKYTYNSSIPNDTWYGFEDCQYFPSPSPPPPPPPSVPSTAGSL 186

Query: 535 RRLKGLAYVIPVLLSSAVL 553
            R K +A +IPVL S  +L
Sbjct: 187 PRWKWIASLIPVLFSCFLL 199

BLAST of Cp4.1LG14g05630 vs. TrEMBL
Match: A0A0A0L699_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177960 PE=4 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 3.2e-73
Identity = 175/257 (68.09%), Postives = 187/257 (72.76%), Query Frame = 1

Query: 6   SPP--LLLLLLLSSLAVVYSAVDPPQITVKPDVAPESLPINYIQEVGSCSYEVTVATSCS 65
           +PP  L+LLLLLS LAVV SAV P  IT+ PDV P SLPI+YIQEVGSCSYEVTV TSC+
Sbjct: 3   TPPSFLMLLLLLSFLAVVCSAVVPTPITLDPDVPPSSLPIDYIQEVGSCSYEVTVETSCA 62

Query: 66  SPSFITDEIGVLFGDSHGNQIVEKKLSNGDKVFDSCNTDRFVLNDRPCSVQINYMYIYKD 125
           SPS IT EIGVLFGD++GNQI+EKKL  GDKVF SC TD FVL DRPC +QI+YMYIYKD
Sbjct: 63  SPSSITSEIGVLFGDTYGNQIIEKKLGTGDKVFGSCKTDSFVLKDRPCIIQISYMYIYKD 122

Query: 126 GADDWLPNSVEISGSGIKPLLFIFKSSIPRNTWFGFDLRQYPFPSPPSYNPYPPFPSLSP 185
           GADDWLPNSVEISGSGI PLLFIFKSSIP NTWFGFDLRQY FP PPS         + P
Sbjct: 123 GADDWLPNSVEISGSGINPLLFIFKSSIPTNTWFGFDLRQYTFPPPPS---------VFP 182

Query: 186 PPPPPPHPLPPPPEPIL-SPPPPPKPVLPPPSPPPPPPHPLPPPPEPIFSPPPPPKPVLP 245
            PPPP HPL PPPEPI  SPPPPPKP+LPPP                            P
Sbjct: 183 APPPPSHPLVPPPEPIFSSPPPPPKPILPPPP---------------------------P 223

Query: 246 PPPPPPPSSSSKISGLK 260
           PP P PPSSSSKISG K
Sbjct: 243 PPQPTPPSSSSKISGQK 223

BLAST of Cp4.1LG14g05630 vs. TrEMBL
Match: A0A0A0L5P4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177980 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 3.5e-51
Identity = 96/110 (87.27%), Postives = 104/110 (94.55%), Query Frame = 1

Query: 610 LKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDTF 669
           ++QL SCSYSVVISTSC SP YTRDQIS+SFGDAYGNQIYVPR+DDPSRRIFERCSSDTF
Sbjct: 38  IQQLGSCSYSVVISTSCLSPAYTRDQISLSFGDAYGNQIYVPRLDDPSRRIFERCSSDTF 97

Query: 670 SVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV 720
            ++GPCAYQICYVYLYR+GPDAWIPTTV+ISG NSRPVTFNYNTAIP DV
Sbjct: 98  GINGPCAYQICYVYLYRTGPDAWIPTTVRISGDNSRPVTFNYNTAIPGDV 147

BLAST of Cp4.1LG14g05630 vs. TrEMBL
Match: A0A061G0A9_THECC (DEAD-box ATP-dependent RNA helicase 7 OS=Theobroma cacao GN=TCM_015015 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 2.3e-42
Identity = 83/110 (75.45%), Postives = 94/110 (85.45%), Query Frame = 1

Query: 610 LKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDTF 669
           ++ L SCSYSVVISTSCSS +YTRDQIS++FGDAYGNQIYVPR+DDPS R FE+CSSDTF
Sbjct: 251 IQNLGSCSYSVVISTSCSSTSYTRDQISIAFGDAYGNQIYVPRLDDPSTRTFEQCSSDTF 310

Query: 670 SVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV 720
            + GPCAYQICYVYLYRSGPD W P +VKI G NSR VTF Y+T IP D+
Sbjct: 311 EIYGPCAYQICYVYLYRSGPDGWKPESVKIYGYNSRAVTFYYDTFIPGDI 360

BLAST of Cp4.1LG14g05630 vs. TrEMBL
Match: A0A067JQT7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26201 PE=4 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 1.1e-41
Identity = 79/110 (71.82%), Postives = 93/110 (84.55%), Query Frame = 1

Query: 609 MLKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDT 668
           +++ + SCSY V++STSCSSP YTRDQIS+SFGDAYGNQIY PR+DDPS   FERCSSDT
Sbjct: 56  LIQNVGSCSYRVIVSTSCSSPKYTRDQISLSFGDAYGNQIYAPRLDDPSSNTFERCSSDT 115

Query: 669 FSVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPND 719
           F +SGPCAYQICYVYLYR+GPD W P +VKI G NS PV F+YNT IP++
Sbjct: 116 FQISGPCAYQICYVYLYRTGPDGWKPESVKIYGYNSNPVRFDYNTFIPSN 165

BLAST of Cp4.1LG14g05630 vs. TAIR10
Match: AT5G62200.1 (AT5G62200.1 Embryo-specific protein 3, (ATS3))

HSP 1 Score: 169.1 bits (427), Expect = 1.0e-41
Identity = 74/105 (70.48%), Postives = 86/105 (81.90%), Query Frame = 1

Query: 615 SCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDTFSVSGP 674
           +C+Y+V+ISTSCSS  YTRDQISV+FGD YGNQIY PR+DDPS + FE+CSSDTF ++GP
Sbjct: 47  TCAYTVIISTSCSSTRYTRDQISVAFGDGYGNQIYAPRLDDPSTKTFEQCSSDTFQINGP 106

Query: 675 CAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV 720
           C YQICYVYLYRSGPD WIP TVKI    S+ VTF YNT +P  V
Sbjct: 107 CTYQICYVYLYRSGPDGWIPNTVKIYSHGSKAVTFPYNTYVPESV 151

BLAST of Cp4.1LG14g05630 vs. TAIR10
Match: AT2G41475.1 (AT2G41475.1 Embryo-specific protein 3, (ATS3))

HSP 1 Score: 151.8 bits (382), Expect = 1.7e-36
Identity = 66/110 (60.00%), Postives = 83/110 (75.45%), Query Frame = 1

Query: 610 LKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDTF 669
           L+  A+CSY+V+I TSCSS +YTRD+IS+SFGD YGN++YV R+DDPS R FE+CSSDT+
Sbjct: 42  LENAAACSYTVIIKTSCSSVSYTRDKISISFGDVYGNEVYVKRLDDPSSRTFEKCSSDTY 101

Query: 670 SVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV 720
            +SGPC   +CY+YL R G D W P  VKI G + R VTF YN  +PN V
Sbjct: 102 KISGPCMRDVCYLYLLRQGSDGWKPENVKIYGSSIRSVTFYYNLFLPNSV 151

BLAST of Cp4.1LG14g05630 vs. TAIR10
Match: AT5G62210.1 (AT5G62210.1 Embryo-specific protein 3, (ATS3))

HSP 1 Score: 117.5 bits (293), Expect = 3.5e-26
Identity = 54/114 (47.37%), Postives = 76/114 (66.67%), Query Frame = 1

Query: 608 LMLKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDP--SRRIFERCS 667
           L L +  SC Y+V+++TSC SP ++RDQ++++ GDA  NQ+  PR+D P      FE+CS
Sbjct: 32  LDLHEEESCPYTVIVTTSCFSPDWSRDQVTIALGDADDNQVVAPRLDKPLSGGGGFEKCS 91

Query: 668 SDTFSVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV 720
           SDTF V G C   IC VY+YRSG D WIP TV+I    S+ V F++N  +P ++
Sbjct: 92  SDTFQVKGKCLNTICSVYIYRSGTDGWIPETVEIYKEGSKSVKFDFNKNVPENI 145

BLAST of Cp4.1LG14g05630 vs. TAIR10
Match: AT5G07190.1 (AT5G07190.1 seed gene 3)

HSP 1 Score: 108.6 bits (270), Expect = 1.6e-23
Identity = 72/215 (33.49%), Postives = 108/215 (50.23%), Query Frame = 1

Query: 363 LSFFIFTFFSISAAVSSGSLPQPAPKSLTIGYIQKAGDCNYRVNITTSCSSPFHITAEIG 422
           +SF  F F  ++ A             L+I  +Q+ G C Y V + TSC SP     +I 
Sbjct: 8   VSFLFFAFIFVTHAFD-----------LSIIQMQQ-GTCPYTVVVMTSCLSPESTRDQIS 67

Query: 423 VLFGDAHGNQIYEPKLE--VESSKAFAKCSKDIFELTGPC-TDQICFFYLYKNGSDDWIP 482
           ++FGDA GN++Y PKL   V       KCS + F++ G C  D IC  Y+ +NG D W+P
Sbjct: 68  IVFGDADGNKVYAPKLGGLVRGPGGLGKCSTNTFQVRGQCLNDPICSLYINRNGPDGWVP 127

Query: 483 ETVEISSPDIDTVKYKYNSSIPD-DTWDGFDDCQ-------------YF---------TS 542
           E++EI S    +VK+ ++ S+P  +TW G ++C              +F         T+
Sbjct: 128 ESIEIYSEGSKSVKFDFSKSVPQLNTWYGHNNCNTTGRPSSPDLPPPHFPPEFPPETPTT 187

Query: 543 PPPPPPSPPAPSTAGHLRRLKGLAYVIPVLLSSAV 552
           PPPPPP P A S  G+   +  LA+ I   +++ V
Sbjct: 188 PPPPPPRPSAASRLGNGESV-FLAFAIATAIAAMV 209

BLAST of Cp4.1LG14g05630 vs. NCBI nr
Match: gi|659077284|ref|XP_008439124.1| (PREDICTED: uncharacterized protein LOC103484014 isoform X1 [Cucumis melo])

HSP 1 Score: 323.2 bits (827), Expect = 1.2e-84
Identity = 160/202 (79.21%), Postives = 174/202 (86.14%), Query Frame = 1

Query: 352 KRRSVKMAMSLLSFFIFTFFSISAAVSSGSLPQPAPKSLTIGYIQKAGDCNYRVNITTSC 411
           K  S+ MA  LL FFIF+ FSISA+ S+GSLPQPA KSL+I YI++AGDCNYRVNITTSC
Sbjct: 4   KSSSLNMATPLLPFFIFSLFSISASQSTGSLPQPASKSLSIAYIREAGDCNYRVNITTSC 63

Query: 412 SSPFHITAEIGVLFGDAHGNQIYEPKLEVESSKAFAKCSKDIFELTGPCTDQICFFYLYK 471
           SSPF+I++EIGVLFGDAHGNQIYEPKLEVES  AF KC KDIFEL GPCTDQICFFYLYK
Sbjct: 64  SSPFYISSEIGVLFGDAHGNQIYEPKLEVESGNAFGKCRKDIFELIGPCTDQICFFYLYK 123

Query: 472 NGSDDWIPETVEISSPDIDTVKYKYNSSIPDDTWDGFDDCQYFTSPPPPPPSPPA-PSTA 531
           NGSDDWIPETVEISSPDIDTVKY YNSSIP+DTW GFDDCQYF SP PPPP PP+ PSTA
Sbjct: 124 NGSDDWIPETVEISSPDIDTVKYTYNSSIPNDTWYGFDDCQYFPSPSPPPPPPPSVPSTA 183

Query: 532 GHLRRLKGLAYVIPVLLSSAVL 553
           G L R K LA +IPVL SS +L
Sbjct: 184 GCLPRWKWLASLIPVLFSSFLL 205

BLAST of Cp4.1LG14g05630 vs. NCBI nr
Match: gi|449460842|ref|XP_004148153.1| (PREDICTED: uncharacterized protein LOC101215783 [Cucumis sativus])

HSP 1 Score: 293.1 bits (749), Expect = 1.3e-75
Identity = 148/199 (74.37%), Postives = 164/199 (82.41%), Query Frame = 1

Query: 355 SVKMAMSLLSFFIFTFFSISAAVSSGSLPQPAPKSLTIGYIQKAGDCNYRVNITTSCSSP 414
           S+ MA  LL FF F+FFSISAA S+ S      KSL+I YI++AGDCNYRVNITTSCSSP
Sbjct: 7   SLHMATPLLPFFFFSFFSISAAQSTAS------KSLSIAYIREAGDCNYRVNITTSCSSP 66

Query: 415 FHITAEIGVLFGDAHGNQIYEPKLEVESSKAFAKCSKDIFELTGPCTDQICFFYLYKNGS 474
           F+I++EIGVLFGDA GNQIYEPKLEVES  AF KC KDIFEL GPC DQICFFYLYKNGS
Sbjct: 67  FYISSEIGVLFGDAQGNQIYEPKLEVESGNAFRKCRKDIFELIGPCIDQICFFYLYKNGS 126

Query: 475 DDWIPETVEISSPDIDTVKYKYNSSIPDDTWDGFDDCQYFTSPPPPPPSPPA-PSTAGHL 534
           D+WIPETVEISSPDIDTVKY YNSSIP+DTW GF+DCQYF SP PPPP PP+ PSTAG L
Sbjct: 127 DNWIPETVEISSPDIDTVKYTYNSSIPNDTWYGFEDCQYFPSPSPPPPPPPSVPSTAGSL 186

Query: 535 RRLKGLAYVIPVLLSSAVL 553
            R K +A +IPVL S  +L
Sbjct: 187 PRWKWIASLIPVLFSCFLL 199

BLAST of Cp4.1LG14g05630 vs. NCBI nr
Match: gi|659077282|ref|XP_008439123.1| (PREDICTED: sulfated surface glycoprotein 185-like [Cucumis melo])

HSP 1 Score: 284.6 bits (727), Expect = 4.6e-73
Identity = 180/264 (68.18%), Postives = 191/264 (72.35%), Query Frame = 1

Query: 1   MERSSSPPLLLLLLLSSLAVVYSAVDPPQITVKPDVAPESLPINYIQEVGSCSYEVTVAT 60
           M+   SP  LLLL LS LAVV SAV P  IT+ P+VAP SLPI+YIQEVGSCSY VTVAT
Sbjct: 1   MDTPPSPFFLLLLPLSFLAVVCSAVLPAPITLDPEVAPSSLPIHYIQEVGSCSYYVTVAT 60

Query: 61  SCSSPSFITDEIGVLFGDSHGNQIVEKKLSNGDKVFDSCNTDRFVLNDRPCSVQINYMYI 120
           SC+SPS I  EIGVLFGD++GNQI+EKKLSNGDKVF SC TD FVL DRPC VQI+YMYI
Sbjct: 61  SCASPSSIASEIGVLFGDTYGNQIIEKKLSNGDKVFGSCKTDSFVLKDRPCIVQISYMYI 120

Query: 121 YKDGADDWLPNSVEISGSGIKPLLFIFKSSIPRNTWFGFDLRQYPFPSPPSYNPYPPFPS 180
           YKDG DDWLPNSVEISGSGI PLLFIFKSSIP NTWFGFDLRQY FP             
Sbjct: 121 YKDGDDDWLPNSVEISGSGINPLLFIFKSSIPTNTWFGFDLRQYTFP------------- 180

Query: 181 LSPPPPPPPHPLPPPPEPILSPPPPPKPVLPPPSPPPPPPHPLPPPPEPIF-SPPPPPKP 240
                PPPP   P PP P LSPPPPP P +PPP              EP+F  PPPPPKP
Sbjct: 181 -----PPPPSVFPAPPLPWLSPPPPPYPTVPPP--------------EPVFPPPPPPPKP 232

Query: 241 VLPPPPPP----PPSSSSKISGLK 260
           VLPPPPPP    PPSSSSKISG K
Sbjct: 241 VLPPPPPPAHPTPPSSSSKISGQK 232

BLAST of Cp4.1LG14g05630 vs. NCBI nr
Match: gi|449460780|ref|XP_004148123.1| (PREDICTED: WAS protein family homolog 1-like [Cucumis sativus])

HSP 1 Score: 284.6 bits (727), Expect = 4.6e-73
Identity = 175/257 (68.09%), Postives = 187/257 (72.76%), Query Frame = 1

Query: 6   SPP--LLLLLLLSSLAVVYSAVDPPQITVKPDVAPESLPINYIQEVGSCSYEVTVATSCS 65
           +PP  L+LLLLLS LAVV SAV P  IT+ PDV P SLPI+YIQEVGSCSYEVTV TSC+
Sbjct: 3   TPPSFLMLLLLLSFLAVVCSAVVPTPITLDPDVPPSSLPIDYIQEVGSCSYEVTVETSCA 62

Query: 66  SPSFITDEIGVLFGDSHGNQIVEKKLSNGDKVFDSCNTDRFVLNDRPCSVQINYMYIYKD 125
           SPS IT EIGVLFGD++GNQI+EKKL  GDKVF SC TD FVL DRPC +QI+YMYIYKD
Sbjct: 63  SPSSITSEIGVLFGDTYGNQIIEKKLGTGDKVFGSCKTDSFVLKDRPCIIQISYMYIYKD 122

Query: 126 GADDWLPNSVEISGSGIKPLLFIFKSSIPRNTWFGFDLRQYPFPSPPSYNPYPPFPSLSP 185
           GADDWLPNSVEISGSGI PLLFIFKSSIP NTWFGFDLRQY FP PPS         + P
Sbjct: 123 GADDWLPNSVEISGSGINPLLFIFKSSIPTNTWFGFDLRQYTFPPPPS---------VFP 182

Query: 186 PPPPPPHPLPPPPEPIL-SPPPPPKPVLPPPSPPPPPPHPLPPPPEPIFSPPPPPKPVLP 245
            PPPP HPL PPPEPI  SPPPPPKP+LPPP                            P
Sbjct: 183 APPPPSHPLVPPPEPIFSSPPPPPKPILPPPP---------------------------P 223

Query: 246 PPPPPPPSSSSKISGLK 260
           PP P PPSSSSKISG K
Sbjct: 243 PPQPTPPSSSSKISGQK 223

BLAST of Cp4.1LG14g05630 vs. NCBI nr
Match: gi|659077288|ref|XP_008439126.1| (PREDICTED: uncharacterized protein LOC103484016 [Cucumis melo])

HSP 1 Score: 212.2 bits (539), Expect = 2.9e-51
Identity = 97/110 (88.18%), Postives = 105/110 (95.45%), Query Frame = 1

Query: 610 LKQLASCSYSVVISTSCSSPTYTRDQISVSFGDAYGNQIYVPRIDDPSRRIFERCSSDTF 669
           ++QL SCSYSVVISTSC SPTYTRDQIS+SFGDAYGNQIYVPR+DDPSRRIFERCSSDTF
Sbjct: 38  IQQLGSCSYSVVISTSCLSPTYTRDQISLSFGDAYGNQIYVPRLDDPSRRIFERCSSDTF 97

Query: 670 SVSGPCAYQICYVYLYRSGPDAWIPTTVKISGPNSRPVTFNYNTAIPNDV 720
            ++GPCAYQICYVYLYR+GPDAWIPTTV ISG NSRPVTFNYNTAIP+DV
Sbjct: 98  GINGPCAYQICYVYLYRTGPDAWIPTTVTISGHNSRPVTFNYNTAIPSDV 147

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L8G7_CUCSA9.1e-7674.37Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177970 PE=4 SV=1[more]
A0A0A0L699_CUCSA3.2e-7368.09Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177960 PE=4 SV=1[more]
A0A0A0L5P4_CUCSA3.5e-5187.27Uncharacterized protein OS=Cucumis sativus GN=Csa_3G177980 PE=4 SV=1[more]
A0A061G0A9_THECC2.3e-4275.45DEAD-box ATP-dependent RNA helicase 7 OS=Theobroma cacao GN=TCM_015015 PE=4 SV=1[more]
A0A067JQT7_JATCU1.1e-4171.82Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26201 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G62200.11.0e-4170.48 Embryo-specific protein 3, (ATS3)[more]
AT2G41475.11.7e-3660.00 Embryo-specific protein 3, (ATS3)[more]
AT5G62210.13.5e-2647.37 Embryo-specific protein 3, (ATS3)[more]
AT5G07190.11.6e-2333.49 seed gene 3[more]
Match NameE-valueIdentityDescription
gi|659077284|ref|XP_008439124.1|1.2e-8479.21PREDICTED: uncharacterized protein LOC103484014 isoform X1 [Cucumis melo][more]
gi|449460842|ref|XP_004148153.1|1.3e-7574.37PREDICTED: uncharacterized protein LOC101215783 [Cucumis sativus][more]
gi|659077282|ref|XP_008439123.1|4.6e-7368.18PREDICTED: sulfated surface glycoprotein 185-like [Cucumis melo][more]
gi|449460780|ref|XP_004148123.1|4.6e-7368.09PREDICTED: WAS protein family homolog 1-like [Cucumis sativus][more]
gi|659077288|ref|XP_008439126.1|2.9e-5188.18PREDICTED: uncharacterized protein LOC103484016 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR010417Embryo-specific_ATS3
IPR001024PLAT/LH2_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005730 nucleolus
cellular_component GO:0005575 cellular_component
molecular_function GO:0005524 ATP binding
molecular_function GO:0004386 helicase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004222 metalloendopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g05630.1Cp4.1LG14g05630.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001024PLAT/LH2 domainunknownSSF49723Lipase/lipooxygenase domain (PLAT/LH2 domain)coord: 49..158
score: 7.91E-11coord: 613..718
score: 4.68E-17coord: 400..508
score: 5.4
IPR010417Embryo-specific 3PFAMPF06232ATS3coord: 48..160
score: 6.0E-31coord: 610..719
score: 5.7E-45coord: 396..511
score: 4.3
NoneNo IPR availablePRINTSPR01217PRICHEXTENSNcoord: 163..179
score: 3.2E-14coord: 181..198
score: 3.2E-14coord: 204..229
score: 3.2E-14coord: 26..38
score: 3.2
NoneNo IPR availablePANTHERPTHR31718FAMILY NOT NAMEDcoord: 566..715
score: 3.5
NoneNo IPR availablePANTHERPTHR31718:SF8EMBRYO-SPECIFIC PROTEIN 3coord: 566..715
score: 3.5

The following gene(s) are paralogous to this gene:

None