Cp4.1LG16g01000 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG16g01000
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPeptidyl-prolyl cis-trans isomerase G isoform 1
LocationCp4.1LG16 : 1884469 .. 1905106 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CATACTCACACAGACTTGAGGGTGGAGAGATTGAGAGTCCCACATCTTCATTGTTCAGCGGCAGTGAGTGTCGCCGTTTCTCCTTCCACTGTTGATTGGCCCACTCACTGAATTTGGGTCCCCAATTTTGCCTACTTACCTTTCGACTCTCAACCAATTCTATGCATATTCCCACGCAAACCCCACACCAACACCCCCCGCCAGGGCTCCATGTTCCACGACCCAAATCTTCGTAAGGAATTTAATTTTCCTTCGTTCGGCCACTCGGAGTTTGTGGCATCTATTGGAGTCCAAGGTGATGGTTTTTCTTGCCTTTTCCAATCTATTTCTGCTTGTTCTTTGCCTGCAATCTGTTTGCGATAATTCCCCTGTGGAATTTGTTCTTGCTTTTGAAGGTTTTGATCTTTGTTCTTTCTAGCAGGTTGATTTGGACCACTGGGTTGAATTTTAGAAACATTTTTGTTGTGTTGTGTTGCTGCTGCTGTTTGTTTGATTTAATCGGCTGAACTGCGTTATTCGCGATGAGTGGCGCCCCGAAGAGGTCCCATGAGGATGGGGGTCACTCCTCATCTTCCAAATATCCACACGATGATTCCAGTCCCTATCGCAAGCTTTCTTCGTCACTTCCAATGCAGTATCGTCCTTCGTTTGAGATGGGTCAAGACGCCCCAATGTCGAAGATACCCCGCACGGAATCCCGTGAGGGAGATAGGAGATCCCCCCTGCACTCGATTTTTCGAATGCCTTCGTCTTCTAATGATCCTCATGTAGATCACTCTGTTGCTTCGGAAAGCAGGCCAGAGCTGTGGGACTCCAAGGACGGTGGGGACAATAGATTTGAGAATCGAGATTCAAGAGTGGAGGCTCGAGAGCTGTTTGGTGATGAAAGGAGGGATTCTCAAGCTGTGAAGTTGGAGAAGGAAATGAGATACGAAGGCAGATTGGATGATATCAAGGAAATGAAATATGATAGAGATGGTTATCACGATTACAAGGGTGAATTGAAGTCAGAAAAAGAGATGTATGGATCAGCAACCAATCACTTAAACTGGAAAGAATCAAAAGATTACCATAGAGGGAAAAGATATCCTGAAGCTTCTGTTGGAAGCTTGGAACCCTGGCATGTTTCACGCACCAGTTCACAGAGTGCAGCCGAGGCTGTAAAAGAAGCCCTAACTACTGAGGAGAAAGACTATGTTGAAACAAGGGAGGCTGTTGGAGAGAATAAAATAGATTCAAAAGGGGAGGATAAATTTAAAGAGAAGGACAGGAAAAGGAAAGATACAAAGCAGAGGGATTGGGGAGATAAAGATAAGGAAAGAAATGATCATAGGAGCAGTACTCAAGCAACGAACAGCAATGTTGAGCCCAAAGACTTGTCAAAGGAAGAGAGAGATTCGGAAAGGTGGGAGAGGGACCGGAAAGATACCTCGAAAGACAAGGAAAGGCCTCGTGAAAGGGAAAAAGACCATGTGAAGAGAGAATCATGGAATGGGATGGATAAGGAGACTGCACACCTTGAAAAGGAGTCAGGTGAAGTGTCAGCGAGAATGTTAGAGCAGGAAAATCCAATTTCAGAGCAAAAAAAGCAAAAGGATTTTGATAGCTGGAAAAATGCTGATAGAGAGGGTAGAGACAGGAAGAAAGAAAGGGATACAGACATTGAGGGAGATCGACCAGAAAAGCGTAGTAGATGTCATGAAAAAGAGTCAGATGAGGGATGTGTAGATGTTGAAGGGACTTTAGACAGAGAGAGAGAAGTTTATAATTATGGTGTTCAGCATCGGAGAAGGATGCAACGTTCAAGGGGAAGCCCTCAAGTGGCAAATAGAGAACCTCGCTTCAGGTCTCGTGCTCAAGATAATGAAGGGTACACCATATGATTTTCTTTGCAGTTTTCATTTTCCCTTGCTACCTAAATTATTGTTTTCAAATAGCGTTCATTGTTGGTGGTTCTCCCCCCCCCCCCCTGGTACCATGTGTTTCTAGGTGTTGCTTTTTCAGAGTGAATTAAGGGATTTTATTTCATCCGATCATTTCATTATGTATGTATTGTTGGGCAATTCAGTTGCATTTCCTGATTTATGTTCACTTTATGGACTTGGACTGGTATCTCCCTGCGAGGTAATTCAATGGACTATGCATGATGAGGGAGAGTCATTCCATTTGAACTCACATTTGCTATGCCTATATGATGGGAAGTTTGAACTCGATAAAAAAGACATCTAATCTATCAAATGGATACATGAATCTGATTAAGTTATATCCTATTTATTTATGCTGATATGATGAGTTTAATTGCCTGCTGGTATGTGATTTCGTGTGATAAAATGTTCACACGTCTCAACTACTTGGTTTGTTTGATAACAAATTGAGGGGAAGCTCTTCCTATCCAATTAAAAAGAGGAGGGGAGGACTTGTTCACCAAAAGGTTCTAGGTAGGCAATTTTTTGTCTTCTCTTCTCTATGGCATTATTTCTGATGTGTTGACCTGAATGCTGGTGAAAGGGGATGAGTTTGAACTCTGGGGAGTTCTAGAGTTGGAAGAGATCAAGTTTACGCTACTCATTTTCATTTTGAAGATGATACCATAATTATTATGTTTTTTGGCTGTAGGCCCTAGATATTACATAAAAGATGAGCGAAATAACAGCCTAAGGATGGAGGCTGAGGTAGAGGAAACTCCCTCCCAAAAGCTACTTGACTAGCTTCGAGTCTGATAATCATGAGGGCCTGTAGTTACAAAAGAATTTAGATCGTGAAGAGGCTCACCAAGAAGCCATATTTTTATGAAATTACAAAAAAGTCAAAANAACTCTTCCTATCCAATTAAAAAGAGGCGGGGAGGATTTGTTAACCAAAAGGTTCTAGGTAGGAAATTTTTTGTCTTCTCTTCTCTTTGGCATTATTTCTGACGTGTTGACCTGAATGCTGGTGAAAGGGGATGAGTTTGAACTCTGGGGAGTTCTAGAGTTGGAAGAGATCAAGTTTACGCTACTCATTTTCATTTTGAAGATGATACCATAATTATTATGTTTTTTGGCTGTAGGCCCTAGATATTACATAAAAGATGAGCGAAATAACAGCCTAAGGATGGAGGCTGAGGTAGAGGAAACTCCCTCCCAAAAGCTACTTGACTAGCTTCGAGTCTGATAATCATGAGGGCCTGTAGTTACAAAAGAATTTAGATCGTGAAGAGGCTCACCAAGAAGCCATATTTTTATGAAATTACAAAAAAGTCAAAATAAAGAGTCTTATCATGAAAAGATCTATCATTTCATTTGAGCCATAAGTGACACAAAAGAGATCTAGGAGTGTAACTTCCCAAAATTTAGGCCCGGTGAATGTTGGAATAAGATTATCTGTATTCTCATATGTGCTTCAGGTCCCCATGGGAATAGTCTTTCATTGTAGTTAGCTATCAACAACAAGTGAGAATTATTTTTTAATAAGAAACTAAGTTTTCATCAGGAAATAGGAAAGAATGCACAAAGGCATACAACTTTATTCTCCCTGGTACCGCATCATATTGACTCAGAAAAATGCTTTGATACTTTTTTTTCTATCCATAACTTTATTCACAGTAATATTTCTTTTTCAATTAAAGAATGGTTGTGTCACGTAATGATCCTTCTCCTGTATCTTGCAGATTACAAGGTATGCTGTCTTGTTACACTGTTCTCCTTGGTGTAAAATTCTTCTCAACTGTGACTGCTGCGATCATGCTACCTTTGCTGGATAGCTTGCGAGCTACATTATTCTAGTTTTCACTCTCTCCAAAGGCTTTGAAGCTACTTTTCTTTTCTTTTTTCTTTTTATCATGCTCTATCAATCTGTTTGACATAGAAGTATCTTTTTAACTTGGATTTATTACTCTTGCTNATAAAGAGTCTTATCTTGAAAAGATCTATTATTTCATTAGAGCCATAAGTGACACAAAAGAGATCTAGGAGTGTAACTTCCCAAGGCCCGGTGAATGTTGGAATAAGATTATCTGTATTCTCATAAGTGCTTCAGGTCCCCATGGGAATAGTCTTTCATTGTAGTTAGCTATCAACAACAAATGAGAATTATTTTTTAAATAAGAAACTAAGTTTTCATCAGGAAATATGAAAGAATGCACAAAGGCATACAACTTTATTCTCCCTGGTACCGCATCATATTGACTCAGAAAAATGCTTTGATACTTTTTTTTCTATCCATAACTTTATTCACAGTAATATTTCTTTTTCAATTAAAGAATGGTTGTGTCACGTAATGATCCTTCTCCTGTATCTTGCAGATTACAAGGTATGCTGTCTTGTTACACTGTTCTCCTTGGTGTAAAATTCTTCTCAACTGTGACTGCTGCGATCATGCTACCTTTGCTGGATAGCTTGCGAGCTACATTATTCTAGTTTTCACTCTCTCCAAAGGCTTTGAAGCTACTTTTCTTTTCTTTTTTCTTTTTATCATGCTCTATCAATCTGTTTGACATAGAAGTATCTTTTTAACTTGGATTTATTACTCTTGCTTTTTCAATTTAGGGCAGTTCTTAACAGCTCTTTAGGGAACAATAAGACAAGGAAGAACAGTTTGGGAATAAGGATTTTTATTTTAGAAACATATTTCTTTGATGTCTGATATTTGTAGAAGGGAGAAGATTTAGAGTTAATGCTAGGAGAATAGTTACAGAAATGTTCATTAAATAGCTCTGTTGTATTGAGCAATAATAACCCCAGCTTTCATTTTAATGATAGATCCACAAAGTAAACTAATCACCCTTATTGGTCTGGGTTTTATTTATTTATCTACTTATTTTATTGCTCTGGTCTATTCCTTTCTTCATTATTTCTTTCTTTTATAAGTTGTTTTTTCACTAAATAATATTGTTTCATGCATTATTATATTATTTTTTGTAAATACAATTATGCTCTATAATTGTTTCTATTTGTAACTTCAATTAAAGATAAGTTTGTTCTTCATATATATATATATATATATATATATGTATGTATATATGCATGTATGTATGTATCTATTAGTTTCCCATGCGTGTTTTTTTACAGCATGTGCTTCAACTTTAAAGAACAATTGTACCTCGGTGCATTTTATGAGTTATAAAAAAACAGATTCATGGCGTTAATTATCTTTTAAGGGGAAAAACCAGCTTAATGAAACCCATCTTGTCTTGATGATGCTATGTGTATGTATTGCTGATGCTTGATACTTTAAGATAAGCATGTCTTATCTACTAAATCTTAGTGCCTGTATTACATGCAAAATGTCTTTTCAGGAAAACCGGAAGTCTCATCTGTGGTTTATAAAGTTGGTGAGTGCGTGCAAGAACTAATAAAGTTGTGGAAGGAACATGAATCGTCACAGATAGAGAAAAATGGTGAAAGCTCACAGAATATCCCCACTCTAGAAATTCGAATACCAGCTGAACACGTTATTGCTACAAATAGGCAGGTAAAAAGCATATTTCTTCATTATTTTATCATAAAAGCATTTAAAAGCACTACCGATATGTTTTAAACTATCAATGCTGACGGCAGTGTCTCTTTATATACATAAAAAAGTATCGATAAAAGAAACAAAGGAGAGAAGCAGCAAACAGGGAAGAAGAGCTTTTTAATTGGGGGGGAGGGGGGGAAGCTGACTTTCAAGTGAACCAACAAAGAAGGGCAGCACATTGATTAGTCGGTATCAGTTTTATCTGCAATGAAAACTGATAATTTTGTTTGGTTGGTTGAAAACCAGCCATTTTTAAGATTTGTATAAATTGACTGATCGTTTATTGGTGTGGTCGGTCAATTTTGGCCAAACTCAAGATTGAGAGCGAGTGAGGTCAAGCAAGGCTCTGATGGACGTTGGAGGTAGACAAATGTACAGGGTGCTAACTACTGCGTTATTGAAAGGGTTGAGGTGCTATGAGTAGGGGAAAAATGAAAGATGGAGAGAGTTGGAAGGAAAAGAAAAAGTGGAGGGGAAAATGATACATGGGCTACGTTTTGAGCACTAGTCTCTTTTTTTAATATTATTTTAAAATTAAAAGAATATCCTTCGTTCTTTTCAACCTAGCGTGCAGAAACATTATTCTTTTTATAGGAAATAGATATATTCATGGGGGGATCCTTGGGGACAGGTAGAGAGGGAAACCGCCCAAAGGGCTACAAGAAAGAGCTTTTTATCTGCTAATACTCATTATAAGGTTGTTACTACAAAATTATTTCCTATGGTTTGAAAGAGATTATGATTATAAAACTCTTTTCTTTTTTTGCATAGCAACAATTTTTGGGAGAGTTGACCCTTAGGATTTCATCCAACGACTTTCTCCCTTTGTGTTGCATCTACAATGGTGTGTCTTTTACAAACATCATGTAAAGGATCTTGATTATCTCTTATGGGGTTGTCAGTTTGCTCATTTCTTTTGAAGCCAGTGTTTTGACGTGTTTAGGATTGCATAGGATTGTAACAGTGGTTGTACTCTCCCTTTCGAAATAAAGGAGAAGTGTTGTGGTAAGCTTGCTCTTTTTCTATCTTATGCAGTTTGTTTCGAGAGGAACAGTAAGATCTTTAGAGGGTTAGGAGAGGTCTTTGGATGATTTGTGGGTGGGAAGTATAGATGGAAGATTTTGCCTCAAGGTGTCTTACGCAAGGGGACCCTTTTCACCCTTCCTGTCCATCTAGGTGATGGATGTTCTAAGAAGACTTGTTGCTAGGTGTGGAGAAGGGGGTGATTAAGGGGTCAAATGTGGATAAGGAGTTCACTTGTTTAATCTTAAATTTGCTGATGACGCCCTCAGTTTATGTTTGGAGAAAGAGTNAGAAAAAGTGGAGGGGAAAATGATACATGGGCTACGTTTTGAGCACTAGTCTCTTTTTTTAATATTATTTTAAAATTAAAAGAATATCCTTCGTTCTTTTCAACCTAGCGTGCAGAAACATTATTCTTTTTATAGGAAATAGATATATTCATGGGGGGATCCTTGGGGACAGGTAGAGAGGGAAACCGCCCAAAGGGCTACAAGAAAGAGCTTTTTATCTGCTAATACTCATTATAAGGTTGTTACTACAAAATTATTTCCTATGGTTTGAAAGAGATTATGATTATAAAACTCTTTTCTTTTTTTGCATAGCAACAATTTTTGGGAGAGTTGACCCTTAGGATTTCATCCAACGACTTTCTCCCTTTGTGTTGCATCTACAATGGTGTGTCTTTTACAAACATCATGTAAAGGATCTTGATTATCTCTTATGGGGTTGTCAGTTTGCTCATTTCTTTTGAAGCCAGTGTTTTGGCGTGTTTAGGATTTGCATAGGATTGTAACAGTGGTTGAACTCTCCCTTTCAAAATAAAGGAGAAGTGTTGTGGTAAGCTTGCTCTTTTTCTATCTTATGCAGTTTGTTTCGAGAGGAACAGTAAGATCTTTAGAGGGTTAGGAGAGGTCTTTGGATGATTTGTGGGTGGGAAGTATAGATGGAAGATTTTGCCTCAAGGTGTCTTACGCAAGGGGACCCTTTTCACCCTTCCTGTCCATCTAGGTGATGGATGTTCTAAGAAGACTTGTTGCTAGGTGTGGAGAAGGGGGTGATTAAGGGGTCAAATGTGGATAAGGAGTTCACTTGTTTAATCTTAAATTTGCTGATGACGCCCTCAGTTTATGTTTGGAGAAAGAGTGCTCCTTTGTGAATCTTAATGGTTTTATTTTTGTTTTTATTTGAAGTTTCTTGGCCTTAAGTTCAATAGGGGTAAGTGTTCCATCTTTCTTCAACGGTTGCCTTTTTGCTCTTGAGTGTCTCTGATTGGGTGTGAGGTTAATCATCTCTTCTCTTAGTAGCTTGGCCTTTCGTTGGGTGGTAACCCAAGGAGTAAGGTCTTTTGGAACCCGATCTAGGCTAAGATTCAAAAACGTTTGTCTTCTTGGAGGAAAGTTTTTTTTTTTTTTTTTTTTTTTTCCTCTCTTAAAGAGGAAGATTCACTCATATAAAGGAAGAAGACTCACTCCTATACAATAGGTTTTCAATGGAATTCCAGGTTAGGGGGATCTTGAGAGGGCTATGAGAAATTTTATGTGGGAAGAGGTTCATGAGGGGGTGAGTCCTATCTGGTCAAGTGGGAGGTGGTTGTTAAGCCTGTGGAGCTAGGGGTCTAGGAATTGGGAACTTAACAGCCATCTAGTGAACTGGTTGAAAGTAGCACCCTTATCTTCGGGTGGCCTTGGCACTGGAAATTTTAGGAGGAATGCTGCTCTTCTAGCTAAATGGCGTTGGAGATACACTCAAAAGGAGCCAGCTATGTGGAAAACAACTGTAAGGATCAAGTATGGTCAAACAGACAACAGTTGGCTCCTAGATGATTAAAGAATCTTTGAGTACAAGTCTCTTGATTGGGTGGATCATCTGGAGATTATCAAGATCAAAGCTTTCTGCTGGTGTTTACAGCTCATTATCATATCTTTGCCAGCCTAGTTATCTGTTTCAAATGGGGCATTCTTGCAAAGGGTAAAGGGATTCTTTTGAGGATTATCTTTTTGTTGTTTCTTTATTGGGCATCGTTTGCTGAATTTTTCGTTTCCCAAGTATAATCTTTTTAAACTTCATCTTCTTGTATTTTCTTTTTTCTTTTCTACTCTAATGAAACGTCCTGTTCTTGTTAAAAAATGGTCTCTTATCAAAAAAAAAAAAAAAANGAGAAGTGTTGTGGTAAGCTTGCTCTTTTTCTATCTTATGCAGTTTGTTTCGAGAGGAACAGTAAGATCTTTAGAGGGTTAGGAGAGGTCTTTGGATGATTTGTGGGTGGGAAGTATAGATGGAAGATTTTGCCTCAAGGTGTCTTACGCAAGGGGACCCTTTTCACCCTTCCTGTCCATCTAGGTGATGGATGTTCTAAGAAGACTTGTTGCTAGGTGTGGAGAAGGGGGTGATTAAGGGGTCAAATGTGGATAAGGAGTTCACTTGTTTAATCTTAAATTTGCTGATGACGCCCTCAGTTTATGTTTGGAGAAAGAGTGCTCCTTTGTGAATCTTAATGGTTTTATTTTTGTTTTTATTTGAAGTTTCTTGGCCTTAAGTTCAATAGGGGTAAGTGTTCCATCTTTCTTCAACGGTTGCCTTTTTGCTCTTGAGTGTCTCTGATTGGGTGTGAGGTTAATCATCTCTTCTCTTAGTAGCTTGGCCTTTCGTTGGGTGGTAACCCAAGGAGTAAGGTCTTTTGGAACCCGATCTAGGCTAAGATTCAAAAACGTTTGTCTTCTTGGAGGAAAGTTTTTTTTTTTTTTTTTTTTTTTTCCTCTCTTAAAGAGGAAGATTCACTCATATAAAGGAAGAAGACTCACTCCTATACAATAGGTTTTCAATGGAATTCCAGGTTAGGGGGATCTTGAGAGGGCTATGAGAAATTTTATGTGGGAAGAGGTTCATGAGGGGGTGAGTCCTATCTGGTCAAGTGGGAGGTGGTTGTTAAGCCTGTGGAGCTAGGGGTCTAGGAATTGGGAACTTAACAGCCATCTAGTGAACTGGTTGAAAGTAGCACCCTTATCTTCGGGTGGCCTTGGCACTGGAAATTTTAGGAGGAATGCTGCTCTTCTAGCTAAATGGCGTTGGAGATACACTCAAAAGGAGCCAGCTATGTGGAAAACAACTGTAAGGATCAAGTATGGTCAAACAGACAACAGTTGGCTCCTAGATGATTAAAGAATCTTTGAGTACAAGTCTCTTGATTGGGTGGATCATCTGGAGATTATCAAGATCAAAGCTTTCTGCTGGTGTTTACAGCTCATTATCATATCTTTGCCAGCCTAGTTATCTGTTTCAAATGGGGCATTCTTGCAAAGGGTAAAGGGATTCTTTTGAGGATTATCTTTTTGTTGTTTCTTTATTGGGCATCGTTTGCTGAATTTTTCGTTTCCCAAGTATAATCTTTTTAAACTTCATCTTCTTGTATTTTCTTTTTTCTTTTCTGTAATAAAAGAATCTTGGTTAATATTTCCTTTATAAAATTTTAANGATATTCTCCTAGTGATCTGTTTCAAATGGGGCATTCTTGCAAAGGGTAAAGGGATCCTTTTGAGGATTATCTTTTTGTTGTTTCTTTATTGGGCATCGCTTGCTGAATTTTTCGTTTCCCAAGTATAATCTTTTTAGACTTCATCTTCTTGTATTCTCTTTTTGCTTTTCTATCTCTAATGAAAAGTCCTGTTCTTGTTAAAAAATGGTCTCTTATCAAAAAAAAAANAAAAATAAGATTTTTGGGAGGATAGGTGCGTCAATTCCAGACCCCTGAGCACCATGTTCCACTCCCTCTGTACAATTGCAAACTTCAGATTTTCTGTCTGTAAGGACTTCTGGTCCAATCTAACAAAGCTTGGAATTTCTTTTTTGAAGAGGTCTTTTAGACAGAGAATAGGATGAATGATTAACTCTCACTCACTATGACCAATGATGGTATTCACCCGAACCATTCCATGAACTCTATTGCTTGAAATTTAACTCCTCTGGAGCCTTTTTGCTAAATTAGCCACCAAGACCCTCCATTCAGGCCATCTTTTCTGGACACTTATCTACTAAACAGACTTGGGAAGGCAACAGCTTCAAGAAAGTGTAATTTTTTTTGTGGTCCACAACCCATGCCAGCCTAAATGTCATGGACAGAGTGCAAAAGAGAAATCCTAACTTGATTTCATCCTTGAATATATATTTTTTTAATGTATAGCTAGTAGTCATCTACCCATTCACTAAGGATTGAATTTCATCCTCAATCGTTTTGATATTGTGTGGTGTATTCCAAAGAATATAGAAGATGGCCTTATTGAATTACTCTACGGCTGGTGGTTCGAAGGAAGATTCAAAGTGATTCGGTCCAACGCAGTCAGTCTTTGGTATATTTGGAAAGAAAGAAATTCTAAACTTTCTCTAATGGGTATGATTCTTTTATCAATTTTTCAGACCACTTACAGTAGTCGGCATCAAAATGGAGTGCCCTTCACAATTCCTTTGTAACTCACCTCTTTTGTTGATCAACTTAGATATGAGGCCTTTTGTTTAATTCCTCCTTTTTGGCGCAGGGACTCTTTGTCCTTTTTGTTGATAATATATATCTTTCATCACTTCTTCTTAAAGAAAAAGTAGCAGGAGAAAAACACCCTTTTTCTTTTTTCTTTTAGTTTATTTATCTAAAGCTGGAACTCCAATTTTGCTTCTTCTTTGTTGATACAAACTGTGATCCTGTATTGTAACTCTAGGTGTGTTGTCTGTTGTTTCTTCTTCCCTGGTGTTTTTAATACCATTAGAGGCTGCTTCTATTTCTAAAAAGCAAAGAAAAAAAGTAATACTATTAAGGTCTTAAAGTAATGTGTGCAGGTCAGGGGTGGACAGCTGTGGGGAACAGATGTGTATACATATGATTCAGATCTTGTTGCTGGTATGTAATGTCCCACTGTTTTCTAGTTAATTGAATTGATGTGGATCTATGAGAAATAACACAGGTTGTTATATTTTGTTTTCTAGTTCTCATGCATACAGGCTACTGTCGACTAACAGCTTCTCCACCTCCACCTGCGATCCAGGAGTTGCGTGCAACCATCAAAGTATTACCTCCGCAAGATTGTAAGCCTGTTTTTCTTTATCTTGTTTGATAGATGGCTAAGTTTTATGATGGGGTTACAAAAAATCAAGCCCAATAAAAAAGGAGTCCACAACTTAGTAGTTGTATATCTTTGAGAAACAAACACTTTCATTGTGTTAACGAAAGGTAAATTCCAAAAGCAATCTTTGACACCCGCACCCTTAGAACTTGTAGAAAATCAAGTGGGTTTCATTGCAAAAAGAAGAAAGGAACATCAACTTGAACTAATAAATCTTGCCAAATCAGCAAACTCATTCTGATTTTTTTCGGTTATCACCAAGTTTTTTTTTTTTTTTTTTCTAAGATAGATTTTTTATTGATGAAATGAAAAACTACATGAGTTCAAACAATCATACGAAGAATAAAAAAGAAATCAGCTACTAAGGTATCATAGGAAAATGTAAAAGAAGTTTGAGAAGCCTTCTCCTCTCTCCTCCTAACCCTTCCCCTTCCAAAACCCTAGTTACCTCACTGTTTAGCCTCAAACTGGGCCACATCTTTGGTCGTCACTCCTCGTTGATTTCGTCTATCGCATGTGCGGGCCACTTCCCTCTCGGTTTCATCATCCCTTTGTTTTCTTTTAACCTTAGTCACTCCTCCTCCTTGTCGGTTCTATCTGGCGCCATTTGTTCATACTCTCTTGTAGACCAACGTTTTGGGTGAGGAAAGAAAAGGAAGTGGTGGAAATGAATTTCGAACTCAATCTGGGTGTTTTCTGCTAGCCAACAACTCTTGAAAAGACATTCATAATGCTTTAGAGGTTTATTTTCAATCCAAGATTTCTTTTAATCCCTTCATGGAAAATAAGACGACGTTGAAATAGAGTGTTGATTTTTCTGATTTGGAGTTTGATGATAAGTGGAGGAAAATTGGAAATTTTCAACCAAGAATTAAGAAATGGTATTGGATTTTTTTGAAGAATTTACGAAACTGTATTGGAAATGTTACTTGTTTTGAAGCAATTGGTCCACATTTTGGAGGTTTAGTCNGAAAATTCCAAAAGCAATCTTTGACACCTGCACCCTTAGAACTTATAGAAAATCAAGTGGGTTTCATTGCAAAAAGAAGAAAGGAACATCAACCTGAACTAATAAATCTTGCCAAATCAGCAAACTCATTCTGATTTTTTTCGGTTATCACCAAGTTTTTTTATTTTTATTTTCTAAGATAGATTTTTTATTGATGAAATGAAAAACTACATGAGTTCAAACAATCATACGAAGAATAAAAAAGAAATCAGCTACTAAGGTATCATAGGAAAATGTAAAAGAAGTTTGAGAAGCCTTCTCCTCTCTCCTCCTAACCCTTCCCCTTCCAAAACCCTAGTTACCTCACTGTTTAGCCTCAAACTGGGCCACATCTTTGGTCGTCACTCCTCGTTGATTTCGTCTATCGCATGTGCGGGCCACTTCCCTCTCGGTTTCATCATCCCTTTGTTTTCTTTTAACCTTAGTCACTCCTCCTCCTTGTCGGTTCTATCTGGCGCCATTTGTTCATACTCTCTTGTAGACCAACGTTTTGGGTGAGGAAAGAAAAGGAAGTGGTGGAAATGAATTTCGAACTCAATCTGGGTGTTTTCTGCTAGCCAACAACTCTTGAAAAGACATTCATAATGCTTTAGAGGTTTATTTTCAATCCAAGATTTCTTTTAATCCCTTCATGGAAAATAAGACGACGTTGAAATAGAGTGTTGATTTTTCTGATTTGGAGTTTGATGATAAGTGGAGGAAAATTGGAAATTTTCAACCAAGAATTAAGAAATGGTATTGGATTTTTTTGAAGAATTTACGAAACTGTATTGGAAATGTTACTTGTTTTGAAGCAATTGGTCCACATTTTGGAGGTTTAGTCAATATTCCTCCCAGACTCTTAATTTTATTGATCGCCCAACTTCTAGAATTGAAGTTAGGAAAAATATTTATGGTTCCATTCAGGCTAAGATTAAGGTAACAGATGAATTTGAAAGTCTCTCTCCACATTTTGGGGATATATCTCCTTTAGATCCTCCAAGCATTGACTATGGGGACTTGTCAATTCTTAATTTTACAAGTTCAGTAGATCGAAAGAGAATAAATCAAGTTAGGATGAATAAAGGCGCAACCATTACTTCAAGTTCCCATCATTTCCAAACCTACGCTTCATGTCTCTAAAATGAAAAATAAACTTATTGAATCTCAACCTCTCCATTATACTCTGAAGAAAGACTACTTTGGTATCCCTACTAATTCAGAGATTAATTAAACATTTTGGAAGAAGTTTATGTTTTGCTCCTCACTTCCTTTCCAAAGTTACACAACCCCCTTCACAACCTACTAACTCTTTTATTCTTTGAGTGATCACTCCCCCAAGTTCAAATGTCACTTTAACCAAAGGTATCGTTAGCTCTTCCACCACTAAGTATCCTATCTCCGCTCCAAGTGTTCACAATATTGAACCTAATGAAGCATCTGATGTTAATATGAGCAGTGAAGAATTTGAAGCACATTTAATTGATGTAGAGGTCGATGAATTTTTTGACAAGGACTCTTTTTTCATAAGCATTCAACAGTGTTCGTTTGTTGATAAAAGGAATTGAATTGAAGTTTCGTACTCAAAAACATCATTAGCAAAATATACTTCAATTCCTCCAAAATTTCATTCCTTAATTGAAGTTCGTGGATTACACATGAAAGAAATCACTCCACACTCATCACAAATTAAGATCTATTTGTGTTTCACTCATTTGTTTTGATTATCGAGCAATGAGAATTAAATATTGATAATCGAGAGGTTTTAGGGGCTTTTTTTTGGTGTTGCCTTGAAGATCATTAAGGTTCAATTTTTATTCATTTTGGAGTTCAAAGGTTGTTGGTTGTTCATATGAAGCATATTTTGGAAGATTACAAGTATGTTTAATTTGGTGGTTTGGAGTTGTGCGGTTAAGCGTATTCGAATTGTTTCTCAATTGTCAGATCTCTTTTCTCATTTTTGGAATTTGACTTTGAATATAATGTGCAAAAAGGATTGTTCTCAATTTTCAGATCTCTTCCATCCTCTATTAGTCGTTTTTGTTCAGAGCCTACTGTTCTTGGCCTTTATTTCGGTGTTTTGATTAGTTAATTATTTTAGGCTTTTTGTTGAAAAGGATTTAGTATTATTTGGATTGTTGTCTATATTCTCTTTGTAATTCTTCTTAGTTGTTTATTTTCAATGTGGAGATTTGTAACTTTTGAACATTTTAGTCTCTTTTCATTATTTCAATGAAAATTTTAATTCTGACTTGACTTAAGTCTATAAAAGCACCCCAATTAAATGCCTTTTTTCCATGCTAATGCACCTGCTAAGAGGAATAAATGTACAAAAAGACCTTCTGACGCCTTATGGTCAGTATTTAAAAAGAAAATAATAGGGTTTTTTTTTTCTTCTTTATTAATTGAAGAAAAATAAAATTTATTAACTCTAAATGCAAAAAATTGTGTTAGGGTTTTTTTTTTTTCATATTCCCACTTCAATTATGCTCTCTTCTTTATGTATATTTATATATACATACATATTTATAGTGTGCCCCACAAAAAAGGCAATGTTTTTCTTTGTGCCTTGCGCTTAAACCCTTGAAGACTATTGCACTTTATTGCACCTTGAACTTTAAACAACACTGGTCTCCAAAAATATTTGTAAATTATGTCAAATTGGTTTAGGCATGGGTTCAAATCCTTGCCTCCATTTGTTGTTGAAGTCAAAAAAAAAAAATGCAAATTATTTGTAATCATAGGAAGGGAAATTTTTGAAAATATATAGAACTTCACTCTTCAGCTATAGATAGTTTCTAATAGTTTCTCTTAGAAAATTCCAAATATCTTAATTTTACTTGGTTTGGATTACTCATTTCTTATTTATGTTGTTGAAAAACATTAATTTTTCTCTCCTTTTTTTCCCCATATCTTTTACTTGCAATGCTCTTCTTTTAAATGCATGATTTCTTGTTTTTTTCCATCAATTTTTATTGTAATTATCTTTATTTGTTGATGTTACTTCTAATTGTGTGTATTAGGTTACATTTCTACTCTTCGAAATAATGTTCGATCTCGAGCTTGGGGAGCTGCTATTGGTTGTAGCTATTGTGTTGAGAGATGCTGCATTGTGAAGGTAATGAAACTTCAAATTTTTTGGAGATAGGTTCTAAAACTTAAATAGTAGTGTAGGATTTATATATTTATATATATACATACATTAAATCCTACACTAATATTTAAGTTTTAGTATATTTATATATCTTGTGGAAAGAATAAATGGAAAGAGTAGATGGAAACCACTTGCCGTTAGTCAAGGGTAATACTTTCTCAGATGTGAGAAAAATCATGGGTATGTAGTCAAATATCATAGATCTGTGATCCTAACTACGCGTATACGAAATCATTATTATTATTATTATTATTATTATTTTTTATAGAAGACAATTTCATTGATGATTGAAATTTACGAAAGGGATGTATAATCCATGGTGTTTACCAGAGACCTCAATTTTGCAATTTGGGAGGTATACTCTAGGAAGTAAAAATATTAGACAGTTTACACCAAAATATAGCTTGGTAAACAACATTGTTGAAAAAAAGTTCTTGTATACAAAATAATTATCAGAATCTTCTTGTAATAACAGAGTACTTTTTGTTGAGTTTCAATCGGTTTTGGCAATATATTCAGGGGATTGTAGTGAAGTCGCTACTATGGACGGATGGTAAGAAGGTCCCATCCGAGGGCTAGAGAGTGATGACTGGTCCAATGATTATTCTATGTTTTATGTCATTTGGAGTGTCTTTTGGTGGTTTCTCTTGATTTCGGGAATCCCACCCCCNTTTTTATAAGAGAATTTTCGTGTGCAATTAGAGGTCATCTTTCCTTACAGAAAAAAAGTGAATATATTGATTTTAGAGCCAATTGCAACATATTGATCCTTGTATTTTGAAGGCTTTTTCCAACTTTGTTATTTTCTTCTCTCTTCAGAAAGGAGGTGGATCCATTGACCTGGAGCCGTGTCTTACACATACATCTGCAGTTGAGCCGACCCTTGCTCCTGTGGCTGTTGAGCGGACGATGACTACAAGAGCTGCAGCTTCTGTAATTACTTGACATAATATTTAACCCGTTTTCTTTTTATGTTAACTTCCTAGATTTTTTCTTTCTATCTTATAATACAATAATAGAAGTATTTTCCTGTTCATACGGTAAGATTGCTTCCTTTTCGGTATTTAGGAAATGAATACAAATTTGGCTGATGATATTTGATTAATTCAAAGATATAAAGATAATGCCCTGCTTACTGTCTGTTTCATAGTATATACTCACTATTCTCCTAACTCATTTAAAAATTTCAAAACTGAACATATTATTAGGCACTTCAAATGAATAACATATCTACCATTGTTATGATTTTTCATTTTCGTTGTAATCTGTTTTTATTTTAAATGAACTGACGGGGATTTACCATGTTATGCTATATTTTTCAGAATGCATTGCGTCAACAGAGGTTTGTTAGAGAAGTTACAATTCAGTACAACCTTTGCAACGAACCATGGTATACTTCAAACCCTCTCTATTAATACATGCATTATTTGGAACTATCATGCTTTGTTTTGTTTTATCTTGTTTTTCTGTTTACCTTGGCTTTTATGCATGTCATACTAGATCTATATATTTTCTGTTTACCTAAGAGTAACCTATATATTTTCTATTATTATTTCGATCTAACACAATACTCCATACGCATAAATTATATTTGTAGTTTGTTGGAATGCATACGCATCCCATAGCGCTGATATATGAGATAGTTTATCCTTCATACAGTTTGATCAATGAATCCTCCGAACTCCTAAACCTTATCCNACACAGATCTATATATTTTCTGTTTACCTACGCTAACCTATATATCCTCTATTATTATTTCGATCTAACACAATACTGCATACGCATAAATAATATTTGTAGTTTGTTGGAATTCATACGCATCCCATAGCGCTGATATATGAGATAGTTTATCCTTCAGACAGTTTGATCAATGAATCCTCCGAACTCCTAAACCTTATCCAATTATTATCAATGTTCTTAAAAAGACTTTTCAGGCTTTTCCCTTGCATCAAGTCAAGGCTCAATACTTGATGATACTTGCCCAAGTTCACCACTAGCAAATATTGTCCTATTTGGGCTTTCCCTTTCGGACTTTTCCTCAAGGTTTTTAAAACGTGTCTGCTATAGGGAGAGGTTTCCACACTCTTATAAAGAATGAATGATTTGTTCACCTCCCTGACCAATGTGAGATCGCACAATCCACCCACCTTCGGCGCCCAGTGTCCTCGCTGGGCCTCGTTTCCTTCTCCAATCAATGTGGGACCCTCCAATCCACCCTCCTTCGAGGCCCAGCGTCCATACTGGCACACCGCCTCATGTCCCCCCTTTCGGGGCTCCGCTTCCTCACTGACACATTGCCCGGTGTCTGGCTCTGATACTATTTGTAACAGCCCAAGCTCACTGCAAGTAGATATTGTCCTCTTTGGGCTCCCACATCGGTCGGGGAGGAGAACGAATCATTCTTTATAAGGGTGGGAAAACCTCTCACTAGCAGACACATTTTAAAAACTTTGAGGGAAAGCCCAAAGAGGACAATATTTGCTAGCAGTGGGCTCGGGTTGTTACAACACTAATATGCAAGCTCATTTACAATGTTATTTTGATGATTTGTAAAGAACGTCTATCACCATGGGGGATCTTAGTGGTCGAGAGAGGCCAGGTCAAGTAGTAGTGAACACTTGGATGTTTTCTAGATGTTTGTAGGATTCAACAAATGTCACAAGTGATTAGTGAATAGCATATATCTATGAGATAGTTTGGATTCCCATGTTTATCCAAAAACAGGAAAGTAGGATACATTAGTGCCTTGCTTTTAAGAATTTTTTTGGCAGTCACTGGCATTGTCATGACATATTTCATATGTTCTTGTACAATACAAAACATAGCCATAATTGAATCTTGCTTAGTGTGATATCCTATGTTTCTTGAACATTACATAACATACCCATATTTGAATTTTGCATTGTGTGCTTTCTTATGTTTCTTGTACACTACATAACATAAAAAATTCACTTTGTAGTGTTAATTAAATCACAGATGCATATTGCCCCTGGATTTACTCTAGATGGTTGCACAATGTTCGTATTAAATGTTCCATCTACAAAACTCTGCAGGATTAAATACAGTATTAGTATTGTTGCGGATAAAGGTCTAAAGAAACCTCTCTACACATCTGCGCGGTTGAAGAAGGGAGAAGTTTTATATCTAGAAACGCATTCATGCAGGTATGTCAAGTCCCATGGTGTTTTCCTTACCTCTCATAAACAACAAATAAGTTGATTTCTGAAACACATGATGTGGATATTTCACGAGCTACATTGTGAAAGCCTCATTAAGAAGAAAAAACTTCCTTTAGAATATTTCAGGCATCAATGAAATATTTTTCATATAAAATAATTTGTCTGAAAGAACTACTTTGTGAAGGTATGAACTCTGCTTTTCCGGGGAGAAAATGGTGAAAGCCATTGCAGCATCTCAGGGGCATGAATCTGAAGCTGAGAAATCTCAGAATCACATTATGCACTGCCCAAATGGTGAAAGAACCGATAATGATAATACTTTGATCGATGTGTTTCGGTGGTCTCGCTGTAAGAAGCCACTGCCTCAGAAGGTTATGCGTTCTATTGGGATTCCACTGCCTTCTGAACACATGGAGGTACTTTTCTCCAGCTTTCTTTCAATCTTTCAAAGTTAAATTCTATCAGAATGTCTTGCATCTTCATTGATTTATTTACTTCAAAGATTGCTGTGTAGTGGGCTTTCTGGTGCCTATTTTAGTACATTTTTCTGCCTAAAGTTGCCATGACTTCACTTTCACGTTTTCACTTTCACGTTTCCAGTTAGCGGCATGTTATATCTCTCGTTAAATCCTAGTAGATAATAATATGGGGTTGCTATGGCCCATTCTTTAAATGTAGTCTAATGGCACGCTGTAATAGACCCTTAGAGAGAAAGAGGATGAAACTTCTCTTTCTGTGATGATTTCATTTATTTTTCACGACCTATATTAGGTCACATATTAAGGTAGAGATTGGCAAAGATCAATAGTACAGTCATCAGGAGGTGTGCCAGCGAAGACGCTGGGGCCCTAAGAGGGGTAGATTGTGANTGGTTACATATTAAGGTCATTTCATAGAAGAGATTGGCAAAGATCAATAGTACAGTCATCAGGAGGTGTGCCAGCGAGGACGCTGAGTCCCTAAGAGGGGTAGATTGTGAGATCCCACATCGGTTGGGAAAGGGGAACGAAGCATTCCTTATAAGGGTGTAGAAACCTCTCTCTTGCTAACACGTTTTAAAACTTTGAGGGAAAGCCCAGACGGGAAAGTCTAAAGAATACAATATCAAAGCTAGCTGTTACAAATATTCCCGTCTAAAGAATACAATATCAAAGCTAGCTGTTACAAATATATTGACTAAGATAAATAATACACCATGGCTTTGAGCATTTTCATCTTTATGGATAAAATTGAATTGCCAACTTTCCTTGTGGTTGGAGCTTCGAATCTCGACTCCTATATTTACTATAAGCATCTGATTGAATTACTTTTCTTTTTTAGGTACTTGAGGATAATTTGGACTGGGAAGATGTACGGTGGTCACAGACTGGAGTTTGGATAGCTGGGAAAGAATATGTACTTGCTCGTGTGCATTTCCTATCCATGAATTAGTTGCATTTTTAGGATCGATCTTGTTACTTTTAGATGTATGAGAGTATATATATTTCATAATCATATTTCAGAGTAGTAGGTGGGATGCTATCACAATAGTGCTGATTCTTTTCTAGTTGATATTATTATATTGGATGTGGAAAACTTGGTTCTTTGATCTTGCATCTCATCATATCAATACTCTATCATACTCTTCATCAGTACTAGGAAAATCATTTAGATACACTCAGCACTTGTAACTTCTTGATTTCATGAATGT

mRNA sequence

CATACTCACACAGACTTGAGGGTGGAGAGATTGAGAGTCCCACATCTTCATTGTTCAGCGGCAGTGAGTGTCGCCGTTTCTCCTTCCACTGTTGATTGGCCCACTCACTGAATTTGGGTCCCCAATTTTGCCTACTTACCTTTCGACTCTCAACCAATTCTATGCATATTCCCACGCAAACCCCACACCAACACCCCCCGCCAGGGCTCCATGTTCCACGACCCAAATCTTCGTAAGGAATTTAATTTTCCTTCGTTCGGCCACTCGGAGTTTGTGGCATCTATTGGAGTCCAAGGTTGATTTGGACCACTGGGTTGAATTTTAGAAACATTTTTGTTGTGTTGTGTTGCTGCTGCTGTTTGTTTGATTTAATCGGCTGAACTGCGTTATTCGCGATGAGTGGCGCCCCGAAGAGGTCCCATGAGGATGGGGGTCACTCCTCATCTTCCAAATATCCACACGATGATTCCAGTCCCTATCGCAAGCTTTCTTCGTCACTTCCAATGCAGTATCGTCCTTCGTTTGAGATGGGTCAAGACGCCCCAATGTCGAAGATACCCCGCACGGAATCCCGTGAGGGAGATAGGAGATCCCCCCTGCACTCGATTTTTCGAATGCCTTCGTCTTCTAATGATCCTCATGTAGATCACTCTGTTGCTTCGGAAAGCAGGCCAGAGCTGTGGGACTCCAAGGACGGTGGGGACAATAGATTTGAGAATCGAGATTCAAGAGTGGAGGCTCGAGAGCTGTTTGGTGATGAAAGGAGGGATTCTCAAGCTGTGAAGTTGGAGAAGGAAATGAGATACGAAGGCAGATTGGATGATATCAAGGAAATGAAATATGATAGAGATGGTTATCACGATTACAAGGGTGAATTGAAGTCAGAAAAAGAGATGTATGGATCAGCAACCAATCACTTAAACTGGAAAGAATCAAAAGATTACCATAGAGGGAAAAGATATCCTGAAGCTTCTGTTGGAAGCTTGGAACCCTGGCATGTTTCACGCACCAGTTCACAGAGTGCAGCCGAGGCTGTAAAAGAAGCCCTAACTACTGAGGAGAAAGACTATGTTGAAACAAGGGAGGCTGTTGGAGAGAATAAAATAGATTCAAAAGGGGAGGATAAATTTAAAGAGAAGGACAGGAAAAGGAAAGATACAAAGCAGAGGGATTGGGGAGATAAAGATAAGGAAAGAAATGATCATAGGAGCAGTACTCAAGCAACGAACAGCAATGTTGAGCCCAAAGACTTGTCAAAGGAAGAGAGAGATTCGGAAAGGTGGGAGAGGGACCGGAAAGATACCTCGAAAGACAAGGAAAGGCCTCGTGAAAGGGAAAAAGACCATGTGAAGAGAGAATCATGGAATGGGATGGATAAGGAGACTGCACACCTTGAAAAGGAGTCAGGTGAAGTGTCAGCGAGAATGTTAGAGCAGGAAAATCCAATTTCAGAGCAAAAAAAGCAAAAGGATTTTGATAGCTGGAAAAATGCTGATAGAGAGGGTAGAGACAGGAAGAAAGAAAGGGATACAGACATTGAGGGAGATCGACCAGAAAAGCGTAGTAGATGTCATGAAAAAGAGTCAGATGAGGGATGTGTAGATGTTGAAGGGACTTTAGACAGAGAGAGAGAAGTTTATAATTATGGTGTTCAGCATCGGAGAAGGATGCAACGTTCAAGGGGAAGCCCTCAAGTGGCAAATAGAGAACCTCGCTTCAGGTCTCGTGCTCAAGATAATGAAGGATTACAAGGTATGCTGTCTTGTTACACTGTTCTCCTTGGTGTAAAATTCTTCTCAACTGTGACTGCTGCGATCATGCTACCTTTGCTGGATAGCTTGCGAGCTACATTATTCTAGTTTTCACTCTCTCCAAAGGCTTTGAAGCTACTTTTCTTTTCTTTTTTCTTTTTATCATGCTCTATCAATCTGTTTGACATAGAAGTATCTTTTTAACTTGGATTTATTACTCTTGCTTTTTCAATTTAGGGCAGTTCTTAACAGCTCTTTAGGGAACAATAAGACAAGGAAGAACAGTTTGGGAATAAGGATTTTTATTTTAGAAACATATTTCTTTGATGTCTGATATTTGTAGAAGGGAGAAGATTTAGAGTTAATGCTAGGAGAATAGTTACAGAAATGTTCATTAAATAGCTCTGTTGTATTGAGCAATAATAACCCCAGCTTTCATTTTAATGATAGATCCACAAAGTAAACTAATCACCCTTATTGGTCTGGGTTTTATTTATTTATCTACTTATTTTATTGCTCTGGTCTATTCCTTTCTTCATTATTTCTTTCTTTTATAAGTTGTTTTTTCACTAAATAATATTGTTTCATGCATTATTATATTATTTTTTGTAAATACAATTATGCTCTATAATTGTTTCTATTTGTAACTTCAATTAAAGATAAGTTTGTTCTTCATATATATATATATATATATATATATGTATGTATATATGCATGTATGTATGTATCTATTAGTTTCCCATGCGTGTTTTTTTACAGCATGTGCTTCAACTTTAAAGAACAATTGTACCTCGGTGCATTTTATGAGTTATAAAAAAACAGATTCATGGCGTTAATTATCTTTTAAGGGGAAAAACCAGCTTAATGAAACCCATCTTGTCTTGATGATGCTATGTGTATGTATTGCTGATGCTTGATACTTTAAGATAAGCATGTCTTATCTACTAAATCTTAGTGCCTGTATTACATGCAAAATGTCTTTTCAGGAAAACCGGAAGTCTCATCTGTGGTTTATAAAGTTGGTGAGTGCGTGCAAGAACTAATAAAGTTGTGGAAGGAACATGAATCGTCACAGATAGAGAAAAATGGTGAAAGCTCACAGAATATCCCCACTCTAGAAATTCGAATACCAGCTGAACACGTTATTGCTACAAATAGGCAGGTCAGGGGTGGACAGCTGTGGGGAACAGATGTGTATACATATGATTCAGATCTTGTTGCTGTTCTCATGCATACAGGCTACTGTCGACTAACAGCTTCTCCACCTCCACCTGCGATCCAGGAGTTGCGTGCAACCATCAAAGTATTACCTCCGCAAGATTGTTACATTTCTACTCTTCGAAATAATGTTCGATCTCGAGCTTGGGGAGCTGCTATTGGTTGTAGCTATTGTGTTGAGAGATGCTGCATTGTGAAGAAAGGAGGTGGATCCATTGACCTGGAGCCGTGTCTTACACATACATCTGCAGTTGAGCCGACCCTTGCTCCTGTGGCTGTTGAGCGGACGATGACTACAAGAGCTGCAGCTTCTAATGCATTGCGTCAACAGAGGTTTGTTAGAGAAGTTACAATTCAGTACAACCTTTGCAACGAACCATGGATTAAATACAGTATTAGTATTGTTGCGGATAAAGGTCTAAAGAAACCTCTCTACACATCTGCGCGGTTGAAGAAGGGAGAAGTTTTATATCTAGAAACGCATTCATGCAGGTATGAACTCTGCTTTTCCGGGGAGAAAATGGTGAAAGCCATTGCAGCATCTCAGGGGCATGAATCTGAAGCTGAGAAATCTCAGAATCACATTATGCACTGCCCAAATGGTGAAAGAACCGATAATGATAATACTTTGATCGATGTGTTTCGGTGGTCTCGCTGTAAGAAGCCACTGCCTCAGAAGGTTATGCGTTCTATTGGGATTCCACTGCCTTCTGAACACATGGAGGTACTTGAGGATAATTTGGACTGGGAAGATGTACGGTGGTCACAGACTGGAGTTTGGATAGCTGGGAAAGAATATGTACTTGCTCGTGTGCATTTCCTATCCATGAATTAGTTGCATTTTTAGGATCGATCTTGTTACTTTTAGATGTATGAGAGTATATATATTTCATAATCATATTTCAGAGTAGTAGGTGGGATGCTATCACAATAGTGCTGATTCTTTTCTAGTTGATATTATTATATTGGATGTGGAAAACTTGGTTCTTTGATCTTGCATCTCATCATATCAATACTCTATCATACTCTTCATCAGTACTAGGAAAATCATTTAGATACACTCAGCACTTGTAACTTCTTGATTTCATGAATGT

Coding sequence (CDS)

ATGCAAAATGTCTTTTCAGGAAAACCGGAAGTCTCATCTGTGGTTTATAAAGTTGGTGAGTGCGTGCAAGAACTAATAAAGTTGTGGAAGGAACATGAATCGTCACAGATAGAGAAAAATGGTGAAAGCTCACAGAATATCCCCACTCTAGAAATTCGAATACCAGCTGAACACGTTATTGCTACAAATAGGCAGGTCAGGGGTGGACAGCTGTGGGGAACAGATGTGTATACATATGATTCAGATCTTGTTGCTGTTCTCATGCATACAGGCTACTGTCGACTAACAGCTTCTCCACCTCCACCTGCGATCCAGGAGTTGCGTGCAACCATCAAAGTATTACCTCCGCAAGATTGTTACATTTCTACTCTTCGAAATAATGTTCGATCTCGAGCTTGGGGAGCTGCTATTGGTTGTAGCTATTGTGTTGAGAGATGCTGCATTGTGAAGAAAGGAGGTGGATCCATTGACCTGGAGCCGTGTCTTACACATACATCTGCAGTTGAGCCGACCCTTGCTCCTGTGGCTGTTGAGCGGACGATGACTACAAGAGCTGCAGCTTCTAATGCATTGCGTCAACAGAGGTTTGTTAGAGAAGTTACAATTCAGTACAACCTTTGCAACGAACCATGGATTAAATACAGTATTAGTATTGTTGCGGATAAAGGTCTAAAGAAACCTCTCTACACATCTGCGCGGTTGAAGAAGGGAGAAGTTTTATATCTAGAAACGCATTCATGCAGGTATGAACTCTGCTTTTCCGGGGAGAAAATGGTGAAAGCCATTGCAGCATCTCAGGGGCATGAATCTGAAGCTGAGAAATCTCAGAATCACATTATGCACTGCCCAAATGGTGAAAGAACCGATAATGATAATACTTTGATCGATGTGTTTCGGTGGTCTCGCTGTAAGAAGCCACTGCCTCAGAAGGTTATGCGTTCTATTGGGATTCCACTGCCTTCTGAACACATGGAGGTACTTGAGGATAATTTGGACTGGGAAGATGTACGGTGGTCACAGACTGGAGTTTGGATAGCTGGGAAAGAATATGTACTTGCTCGTGTGCATTTCCTATCCATGAATTAG

Protein sequence

MQNVFSGKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQVRGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRNNVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHSCRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKPLPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN
BLAST of Cp4.1LG16g01000 vs. TrEMBL
Match: A0A0A0LJZ2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G337260 PE=4 SV=1)

HSP 1 Score: 709.1 bits (1829), Expect = 2.7e-201
Identity = 340/355 (95.77%), Postives = 351/355 (98.87%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GK EVSSV+YKVGEC+QELIKLWKEHESSQI+KNGESSQNIPTLEIRIPAEHVIATNRQV
Sbjct: 453 GKSEVSSVIYKVGECMQELIKLWKEHESSQIDKNGESSQNIPTLEIRIPAEHVIATNRQV 512

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATI+VLPPQDCYISTLRN
Sbjct: 513 RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIRVLPPQDCYISTLRN 572

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGAAIGCSYCVERCCIVKKGGG+IDLEPCLTHTSAVEPTLAPVAVERTMTTRAA
Sbjct: 573 NVRSRAWGAAIGCSYCVERCCIVKKGGGAIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 632

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS
Sbjct: 633 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 692

Query: 247 CRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKP 306
           CRYELCFSGEKMVK IA+SQGHE+EAEKSQNH ++CPNGERTDNDNTLIDVFRWSRCKKP
Sbjct: 693 CRYELCFSGEKMVKTIASSQGHETEAEKSQNHFLNCPNGERTDNDNTLIDVFRWSRCKKP 752

Query: 307 LPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           LPQKVMRSIGIPLPSEH+EVLEDNLDWEDV+WSQTGVWIAGKEY LARVHFLSMN
Sbjct: 753 LPQKVMRSIGIPLPSEHVEVLEDNLDWEDVQWSQTGVWIAGKEYQLARVHFLSMN 807

BLAST of Cp4.1LG16g01000 vs. TrEMBL
Match: A0A0D2QAY5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G106900 PE=4 SV=1)

HSP 1 Score: 648.3 bits (1671), Expect = 5.6e-183
Identity = 311/357 (87.11%), Postives = 333/357 (93.28%), Query Frame = 1

Query: 5   FSGKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNR 64
           F GKPEVS+VVYKVGEC+QELIKLW+E+E+SQ +KNGESSQN PTLEI+IPAEHV ATNR
Sbjct: 460 FQGKPEVSTVVYKVGECMQELIKLWEEYEASQADKNGESSQNGPTLEIQIPAEHVTATNR 519

Query: 65  QVRGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTL 124
           QVRGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPPAIQELRATI+VLPPQDCY S L
Sbjct: 520 QVRGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCYTSKL 579

Query: 125 RNNVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTR 184
           RNNVRSRAWGA I CSY VERCCIVKKGGG+IDLEPCLTH+S VEPTLAPVAVERT+TTR
Sbjct: 580 RNNVRSRAWGAGISCSYRVERCCIVKKGGGTIDLEPCLTHSSTVEPTLAPVAVERTITTR 639

Query: 185 AAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLET 244
           AAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLET
Sbjct: 640 AAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLET 699

Query: 245 HSCRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCK 304
           HSCRYELCF+GEKMVKA  ASQ HE++AEKSQNH  H  NGE+ D++NTLIDVFRWSRCK
Sbjct: 700 HSCRYELCFTGEKMVKATVASQAHEADAEKSQNHHSHSSNGEKNDSENTLIDVFRWSRCK 759

Query: 305 KPLPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           KPLPQKVMRSIGIPLP EH+EVL +N+DWEDV+WSQTGVWIAGKEY LARVHFLS N
Sbjct: 760 KPLPQKVMRSIGIPLPLEHVEVLVENVDWEDVQWSQTGVWIAGKEYTLARVHFLSPN 816

BLAST of Cp4.1LG16g01000 vs. TrEMBL
Match: A0A061FAS4_THECC (Peptidyl-prolyl cis-trans isomerase G isoform 1 OS=Theobroma cacao GN=TCM_033022 PE=4 SV=1)

HSP 1 Score: 647.9 bits (1670), Expect = 7.3e-183
Identity = 311/355 (87.61%), Postives = 330/355 (92.96%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GKPEVS VVYKVGEC+QELIKLWKE E+SQ +KNGESSQN PTLEIRIPAEHV ATNRQV
Sbjct: 463 GKPEVSCVVYKVGECMQELIKLWKEFEASQADKNGESSQNGPTLEIRIPAEHVTATNRQV 522

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPPAIQELRATI+VLPPQDCY S LRN
Sbjct: 523 RGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCYTSKLRN 582

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGA IGCSY VERCCIVKKGGG+IDLEPCLTH+S VEPTLAPVAVERTMTTRAA
Sbjct: 583 NVRSRAWGAGIGCSYRVERCCIVKKGGGTIDLEPCLTHSSTVEPTLAPVAVERTMTTRAA 642

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS
Sbjct: 643 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 702

Query: 247 CRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKP 306
           CRYELCF+GEKMVKA  ASQ +E++ EKSQNH  H  NGE+ D+DN +IDVFRWSRCKKP
Sbjct: 703 CRYELCFTGEKMVKATPASQAYETDTEKSQNHHSHSSNGEKNDSDNIMIDVFRWSRCKKP 762

Query: 307 LPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           LPQK+MRSIGIPLP EH+EVLE+N+DWEDV+WSQTGVWIAGKEY LARVHFLS N
Sbjct: 763 LPQKIMRSIGIPLPLEHVEVLEENIDWEDVQWSQTGVWIAGKEYTLARVHFLSSN 817

BLAST of Cp4.1LG16g01000 vs. TrEMBL
Match: A0A061FAF0_THECC (Peptidyl-prolyl cis-trans isomerase G isoform 4 OS=Theobroma cacao GN=TCM_033022 PE=4 SV=1)

HSP 1 Score: 643.3 bits (1658), Expect = 1.8e-181
Identity = 311/356 (87.36%), Postives = 330/356 (92.70%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNR-Q 66
           GKPEVS VVYKVGEC+QELIKLWKE E+SQ +KNGESSQN PTLEIRIPAEHV ATNR Q
Sbjct: 463 GKPEVSCVVYKVGECMQELIKLWKEFEASQADKNGESSQNGPTLEIRIPAEHVTATNRQQ 522

Query: 67  VRGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLR 126
           VRGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPPAIQELRATI+VLPPQDCY S LR
Sbjct: 523 VRGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCYTSKLR 582

Query: 127 NNVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRA 186
           NNVRSRAWGA IGCSY VERCCIVKKGGG+IDLEPCLTH+S VEPTLAPVAVERTMTTRA
Sbjct: 583 NNVRSRAWGAGIGCSYRVERCCIVKKGGGTIDLEPCLTHSSTVEPTLAPVAVERTMTTRA 642

Query: 187 AASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH 246
           AASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH
Sbjct: 643 AASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH 702

Query: 247 SCRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKK 306
           SCRYELCF+GEKMVKA  ASQ +E++ EKSQNH  H  NGE+ D+DN +IDVFRWSRCKK
Sbjct: 703 SCRYELCFTGEKMVKATPASQAYETDTEKSQNHHSHSSNGEKNDSDNIMIDVFRWSRCKK 762

Query: 307 PLPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           PLPQK+MRSIGIPLP EH+EVLE+N+DWEDV+WSQTGVWIAGKEY LARVHFLS N
Sbjct: 763 PLPQKIMRSIGIPLPLEHVEVLEENIDWEDVQWSQTGVWIAGKEYTLARVHFLSSN 818

BLAST of Cp4.1LG16g01000 vs. TrEMBL
Match: F6HGJ4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0130g00270 PE=4 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 8.9e-181
Identity = 314/358 (87.71%), Postives = 330/358 (92.18%), Query Frame = 1

Query: 6   SGKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQ 65
           +GKPEVS+VVYKVGEC+QELIKLWKE+ESSQ +KNGESS N PTLEIRIPAEHV ATNRQ
Sbjct: 461 AGKPEVSTVVYKVGECMQELIKLWKEYESSQADKNGESSSNGPTLEIRIPAEHVTATNRQ 520

Query: 66  VRGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLR 125
           VRGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPPAIQELRATI+VLPPQDCYISTLR
Sbjct: 521 VRGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCYISTLR 580

Query: 126 NNVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRA 185
           NNVRSRAWGAAIGCSY VERCCIVKKGGG+IDLEPCLTHTS VEPTLAPVAVERTMTTRA
Sbjct: 581 NNVRSRAWGAAIGCSYRVERCCIVKKGGGTIDLEPCLTHTSTVEPTLAPVAVERTMTTRA 640

Query: 186 AASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH 245
           AASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH
Sbjct: 641 AASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH 700

Query: 246 SCRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGER--TDNDNTLIDVFRWSRC 305
           S RYELCF GEKMVKA  A  GHE+E EKSQ H +H  NGER  TD DN +IDVFRWSRC
Sbjct: 701 SRRYELCFIGEKMVKATTALHGHETETEKSQTHSLHSTNGERNSTDGDNIMIDVFRWSRC 760

Query: 306 KKPLPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           K+ LPQKVMRS+GIPLP EH+EVLE+NLDWEDV+WSQTGV IAGKEY LARVHFLS N
Sbjct: 761 KRALPQKVMRSLGIPLPLEHLEVLEENLDWEDVQWSQTGVCIAGKEYALARVHFLSPN 818

BLAST of Cp4.1LG16g01000 vs. TAIR10
Match: AT5G08450.1 (AT5G08450.1 Histone deacetylation protein Rxt3 (InterPro:IPR013951))

HSP 1 Score: 586.3 bits (1510), Expect = 1.3e-167
Identity = 288/363 (79.34%), Postives = 320/363 (88.15%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GK EVS VVYKVGEC+QELIKLWKE++ S  +K+G+ + N PTLE+RIPAEHV ATNRQV
Sbjct: 559 GKSEVSIVVYKVGECMQELIKLWKEYDLSHPDKSGDFANNGPTLEVRIPAEHVTATNRQV 618

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPP +QELR TI+VLP QD Y S LRN
Sbjct: 619 RGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPTMQELRTTIRVLPSQDYYTSKLRN 678

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGA IGCSY VERC I+KKGGG+I+LEP LTH+S VEPTLAP+AVER+MTTRAA
Sbjct: 679 NVRSRAWGAGIGCSYRVERCYILKKGGGTIELEPSLTHSSTVEPTLAPMAVERSMTTRAA 738

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPL+TSARLKKGEVLYLETHS
Sbjct: 739 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLFTSARLKKGEVLYLETHS 798

Query: 247 CRYELCFSGEKMVKAIAASQ---GHE-----SEAEKSQNHIMHCPNGERTDNDNTLIDVF 306
           CRYELCF+GEK +KAI ASQ    HE     +   KSQNH+    NG++TD+DN+LIDVF
Sbjct: 799 CRYELCFAGEKTIKAIQASQQQSSHEAMETDNNNNKSQNHL---TNGDKTDSDNSLIDVF 858

Query: 307 RWSRCKKPLPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFL 362
           RWSRCKKPLPQK+MRSIG PLP++H+EVLE+NLDWEDV+WSQTGVWIAGKEY LARVHFL
Sbjct: 859 RWSRCKKPLPQKLMRSIGFPLPADHIEVLEENLDWEDVQWSQTGVWIAGKEYTLARVHFL 918

BLAST of Cp4.1LG16g01000 vs. NCBI nr
Match: gi|659121850|ref|XP_008460844.1| (PREDICTED: uncharacterized protein LOC103499596 isoform X2 [Cucumis melo])

HSP 1 Score: 712.6 bits (1838), Expect = 3.5e-202
Identity = 342/355 (96.34%), Postives = 351/355 (98.87%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GKPEVSSVVYKVGEC+QELIKLWKEHE SQI+KNGESSQNIPTLEIRIPAEHVIATNRQV
Sbjct: 453 GKPEVSSVVYKVGECMQELIKLWKEHELSQIDKNGESSQNIPTLEIRIPAEHVIATNRQV 512

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATI+VLPPQDCYISTLRN
Sbjct: 513 RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIRVLPPQDCYISTLRN 572

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGAAIGCSYCVERCCIVKKGGG+IDLEPCLTHTSAVEPTLAPVAVERTMTTRAA
Sbjct: 573 NVRSRAWGAAIGCSYCVERCCIVKKGGGAIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 632

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS
Sbjct: 633 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 692

Query: 247 CRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKP 306
           CRYELCFSGEKMVK IA+SQGHE+EAEKSQNH +HCPNGERTDNDNTLIDVFRWSRCKKP
Sbjct: 693 CRYELCFSGEKMVKTIASSQGHETEAEKSQNHFVHCPNGERTDNDNTLIDVFRWSRCKKP 752

Query: 307 LPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           LPQKVMRSIGIPLPSEH+EVLEDNLDWEDV+WSQTGVWIAGKEY LARVHFLSMN
Sbjct: 753 LPQKVMRSIGIPLPSEHVEVLEDNLDWEDVQWSQTGVWIAGKEYQLARVHFLSMN 807

BLAST of Cp4.1LG16g01000 vs. NCBI nr
Match: gi|778670329|ref|XP_011649442.1| (PREDICTED: peptidyl-prolyl cis-trans isomerase G [Cucumis sativus])

HSP 1 Score: 709.1 bits (1829), Expect = 3.8e-201
Identity = 340/355 (95.77%), Postives = 351/355 (98.87%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GK EVSSV+YKVGEC+QELIKLWKEHESSQI+KNGESSQNIPTLEIRIPAEHVIATNRQV
Sbjct: 453 GKSEVSSVIYKVGECMQELIKLWKEHESSQIDKNGESSQNIPTLEIRIPAEHVIATNRQV 512

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATI+VLPPQDCYISTLRN
Sbjct: 513 RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIRVLPPQDCYISTLRN 572

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGAAIGCSYCVERCCIVKKGGG+IDLEPCLTHTSAVEPTLAPVAVERTMTTRAA
Sbjct: 573 NVRSRAWGAAIGCSYCVERCCIVKKGGGAIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 632

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS
Sbjct: 633 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 692

Query: 247 CRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKP 306
           CRYELCFSGEKMVK IA+SQGHE+EAEKSQNH ++CPNGERTDNDNTLIDVFRWSRCKKP
Sbjct: 693 CRYELCFSGEKMVKTIASSQGHETEAEKSQNHFLNCPNGERTDNDNTLIDVFRWSRCKKP 752

Query: 307 LPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           LPQKVMRSIGIPLPSEH+EVLEDNLDWEDV+WSQTGVWIAGKEY LARVHFLSMN
Sbjct: 753 LPQKVMRSIGIPLPSEHVEVLEDNLDWEDVQWSQTGVWIAGKEYQLARVHFLSMN 807

BLAST of Cp4.1LG16g01000 vs. NCBI nr
Match: gi|659121842|ref|XP_008460840.1| (PREDICTED: uncharacterized protein LOC103499596 isoform X1 [Cucumis melo])

HSP 1 Score: 700.3 bits (1806), Expect = 1.8e-198
Identity = 342/376 (90.96%), Postives = 351/376 (93.35%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GKPEVSSVVYKVGEC+QELIKLWKEHE SQI+KNGESSQNIPTLEIRIPAEHVIATNRQV
Sbjct: 453 GKPEVSSVVYKVGECMQELIKLWKEHELSQIDKNGESSQNIPTLEIRIPAEHVIATNRQV 512

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATI+VLPPQDCYISTLRN
Sbjct: 513 RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIRVLPPQDCYISTLRN 572

Query: 127 NVRSRAWGAAIGCSYCVERCCIVK---------------------KGGGSIDLEPCLTHT 186
           NVRSRAWGAAIGCSYCVERCCIVK                     KGGG+IDLEPCLTHT
Sbjct: 573 NVRSRAWGAAIGCSYCVERCCIVKEFGCSSAGCSVDDRGVSSPKEKGGGAIDLEPCLTHT 632

Query: 187 SAVEPTLAPVAVERTMTTRAAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLK 246
           SAVEPTLAPVAVERTMTTRAAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLK
Sbjct: 633 SAVEPTLAPVAVERTMTTRAAASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLK 692

Query: 247 KPLYTSARLKKGEVLYLETHSCRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNG 306
           KPLYTSARLKKGEVLYLETHSCRYELCFSGEKMVK IA+SQGHE+EAEKSQNH +HCPNG
Sbjct: 693 KPLYTSARLKKGEVLYLETHSCRYELCFSGEKMVKTIASSQGHETEAEKSQNHFVHCPNG 752

Query: 307 ERTDNDNTLIDVFRWSRCKKPLPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWI 362
           ERTDNDNTLIDVFRWSRCKKPLPQKVMRSIGIPLPSEH+EVLEDNLDWEDV+WSQTGVWI
Sbjct: 753 ERTDNDNTLIDVFRWSRCKKPLPQKVMRSIGIPLPSEHVEVLEDNLDWEDVQWSQTGVWI 812

BLAST of Cp4.1LG16g01000 vs. NCBI nr
Match: gi|1009125605|ref|XP_015879699.1| (PREDICTED: zinc finger CCCH domain-containing protein 13-like [Ziziphus jujuba])

HSP 1 Score: 651.0 bits (1678), Expect = 1.2e-183
Identity = 312/355 (87.89%), Postives = 332/355 (93.52%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GK EVSSV+YKVGEC+QELIKLWKE+E+SQ EKNGESS + PTLEIRIPAEHV ATNRQV
Sbjct: 451 GKAEVSSVIYKVGECMQELIKLWKEYEASQAEKNGESSHSGPTLEIRIPAEHVTATNRQV 510

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPPAIQELRATI+VLPPQDCY+STLRN
Sbjct: 511 RGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCYVSTLRN 570

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGAAIGCSY VERCCIVKKGGG+IDLEPCLTHTS VEPTLAPVAVERTMTTRAA
Sbjct: 571 NVRSRAWGAAIGCSYRVERCCIVKKGGGTIDLEPCLTHTSTVEPTLAPVAVERTMTTRAA 630

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS
Sbjct: 631 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 690

Query: 247 CRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKP 306
           CRYELCF+GEKMVKA  +SQ HE+E EKSQNH  H  NG+R + DN +ID FRWSRCK+P
Sbjct: 691 CRYELCFTGEKMVKATQSSQAHEAETEKSQNHHSHSTNGDRIECDNVVIDAFRWSRCKRP 750

Query: 307 LPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           LPQKVMRSIGIPLP EH+EVLE+NLDWEDV+WSQTGVW+AGKEY LARVHFLSMN
Sbjct: 751 LPQKVMRSIGIPLPLEHVEVLEENLDWEDVQWSQTGVWVAGKEYTLARVHFLSMN 805

BLAST of Cp4.1LG16g01000 vs. NCBI nr
Match: gi|1009171478|ref|XP_015866759.1| (PREDICTED: uncharacterized protein LOC107404327 [Ziziphus jujuba])

HSP 1 Score: 649.8 bits (1675), Expect = 2.8e-183
Identity = 311/355 (87.61%), Postives = 332/355 (93.52%), Query Frame = 1

Query: 7   GKPEVSSVVYKVGECVQELIKLWKEHESSQIEKNGESSQNIPTLEIRIPAEHVIATNRQV 66
           GK EVSSV+YKVGEC+QELIKLWKE+E+SQ EKNGESS + PTLEIRIPAEHV ATNRQV
Sbjct: 179 GKAEVSSVIYKVGECMQELIKLWKEYEASQAEKNGESSHSGPTLEIRIPAEHVTATNRQV 238

Query: 67  RGGQLWGTDVYTYDSDLVAVLMHTGYCRLTASPPPPAIQELRATIKVLPPQDCYISTLRN 126
           RGGQLWGTD+YT DSDLVAVLMHTGYCR TASPPPPAIQELRATI+VLPPQDCY+STLRN
Sbjct: 239 RGGQLWGTDIYTDDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCYVSTLRN 298

Query: 127 NVRSRAWGAAIGCSYCVERCCIVKKGGGSIDLEPCLTHTSAVEPTLAPVAVERTMTTRAA 186
           NVRSRAWGAAIGCSY VERCCIVKKGGG+IDLEPCLTHTS VEPTLAPVAVERTMTTRAA
Sbjct: 299 NVRSRAWGAAIGCSYRVERCCIVKKGGGTIDLEPCLTHTSTVEPTLAPVAVERTMTTRAA 358

Query: 187 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHS 246
           ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETH+
Sbjct: 359 ASNALRQQRFVREVTIQYNLCNEPWIKYSISIVADKGLKKPLYTSARLKKGEVLYLETHT 418

Query: 247 CRYELCFSGEKMVKAIAASQGHESEAEKSQNHIMHCPNGERTDNDNTLIDVFRWSRCKKP 306
           CRYELCF+GEKMVKA  +SQ HE+E EKSQNH  H  NG+R + DN +ID FRWSRCK+P
Sbjct: 419 CRYELCFTGEKMVKATQSSQAHEAETEKSQNHHSHSTNGDRIECDNVVIDAFRWSRCKRP 478

Query: 307 LPQKVMRSIGIPLPSEHMEVLEDNLDWEDVRWSQTGVWIAGKEYVLARVHFLSMN 362
           LPQKVMRSIGIPLP EH+EVLE+NLDWEDV+WSQTGVW+AGKEY LARVHFLSMN
Sbjct: 479 LPQKVMRSIGIPLPLEHVEVLEENLDWEDVQWSQTGVWVAGKEYTLARVHFLSMN 533

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LJZ2_CUCSA2.7e-20195.77Uncharacterized protein OS=Cucumis sativus GN=Csa_2G337260 PE=4 SV=1[more]
A0A0D2QAY5_GOSRA5.6e-18387.11Uncharacterized protein OS=Gossypium raimondii GN=B456_002G106900 PE=4 SV=1[more]
A0A061FAS4_THECC7.3e-18387.61Peptidyl-prolyl cis-trans isomerase G isoform 1 OS=Theobroma cacao GN=TCM_033022... [more]
A0A061FAF0_THECC1.8e-18187.36Peptidyl-prolyl cis-trans isomerase G isoform 4 OS=Theobroma cacao GN=TCM_033022... [more]
F6HGJ4_VITVI8.9e-18187.71Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0130g00270 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT5G08450.11.3e-16779.34 Histone deacetylation protein Rxt3 (InterPro:IPR013951)[more]
Match NameE-valueIdentityDescription
gi|659121850|ref|XP_008460844.1|3.5e-20296.34PREDICTED: uncharacterized protein LOC103499596 isoform X2 [Cucumis melo][more]
gi|778670329|ref|XP_011649442.1|3.8e-20195.77PREDICTED: peptidyl-prolyl cis-trans isomerase G [Cucumis sativus][more]
gi|659121842|ref|XP_008460840.1|1.8e-19890.96PREDICTED: uncharacterized protein LOC103499596 isoform X1 [Cucumis melo][more]
gi|1009125605|ref|XP_015879699.1|1.2e-18387.89PREDICTED: zinc finger CCCH domain-containing protein 13-like [Ziziphus jujuba][more]
gi|1009171478|ref|XP_015866759.1|2.8e-18387.61PREDICTED: uncharacterized protein LOC107404327 [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013951Rxt3
IPR004043LCCL domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0080186 developmental vegetative growth
biological_process GO:0009908 flower development
biological_process GO:0016575 histone deacetylation
biological_process GO:0009737 response to abscisic acid
biological_process GO:1902074 response to salt
biological_process GO:0009845 seed germination
biological_process GO:0008150 biological_process
cellular_component GO:0005634 nucleus
cellular_component GO:0005575 cellular_component
molecular_function GO:0016853 isomerase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG16g01000.1Cp4.1LG16g01000.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004043LCCL domainunknownSSF69848LCCL domaincoord: 68..136
score: 9.1
IPR013951Histone deacetylation protein Rxt3PFAMPF08642Rxt3coord: 51..94
score: 3.2

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG16g01000CmaCh20G010440Cucurbita maxima (Rimu)cmacpeB565
Cp4.1LG16g01000CmoCh20G010510Cucurbita moschata (Rifu)cmocpeB517
Cp4.1LG16g01000Carg24296Silver-seed gourdcarcpeB1010
The following gene(s) are paralogous to this gene:

None