Cp4.1LG14g02170 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g02170
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Short-chain dehydrogenase
Location: Cp4.1LG14 : 3196950 .. 3207623 (+)
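
The Location field packs a scaffold name, a start and end coordinate, and a strand into one string. The sketch below shows one way to split it and derive the span length. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG14 : 3196950 .. 3207623 (+)")
print(seqid, strand, span_length(start, end))
```

Under a 0-based, half-open convention the `+ 1` would be dropped, so the result is only as reliable as the coordinate assumption.
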
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAAGATTGCTAATGATGAACCACCCAAAGAAAATAGCCATCTACCACAAGAAGATGGCAAAATTGTAATTTCATAGTCTAAAAAAACCTTGATTTCTTTTCAAAAGAGGGTTTATTCTAAAAGAAAACAAAACTTTTAGAGTGCAGTTTCAACGAGTACTCAAACCACAGAGTCTGGAGCAACACTAAGACGAGGGCCTTGCCGGATTCACCAATCTGTATTTGTATCGATCTCAACTATGTGGATCTTTGGATGGACCGGACCATCTGGGTTCTCAGCCCGTTCTACAGCAGAAGAAGTCACAGAAGGAATTGATGGGAATGGCCTCACTGCTATTGTTACAGGTTTCGTTTCATCCCCTTCCCCTTTTTCTCCTTCTTCCCCTCTCTTTTAATCCCTTCTTTGATCTGGGTTTTGTTGTTTACAAAATCGATGGTGATTTAGTTTATTTATAGCAACCTGGATACCTTGTCTTTTGATTGGATTAAGGTTGTTTTGTTATTGACTCACTACTTGATTGCTATTTGTTTTGCAAATCTTTCTTGGGTATAATATGAATTCTGTGTTTTCTGAGCCCGTTCTGTAGCTGTTCTTGGACTGTTCAGTCTCAATCAAGTGTATATTTTGTATCCTTTATATGTAGAATGCAGAAATCTATGGAGTAATGGGGTTTTCTTTGAGGATTTATGTATAGATTGAGCTTTATTTGTTAGGTTTCTATACTTAGCATAATGTTTGTATGCTTAGAAACTGAAGCAGAAATGGAACTAGAATGTCCATTTGCAAGAATGTGCAAGTATATCAATCAAAGCATCACAAGTTTGGATGGACTTCTAAAAAAGTTTCTAGACATGGAGTTTGAAGCATGTTTGCTTTCAAAAGTTTCATTAATACTCGTAAACGGTTAAGAAATATCCATAACCTTACCGGTAGGATATGAATGGAAAACTTTTCCATTTTTCCCTCTTTTTTTACCATATCTTATTCAATATCCTCTTACTCCCCTTCTTTTGTTCTCCGATATCTAAAACGGCCCCTTCATTCAAGGTTATAGACTATAACAAGAAGAAGAAGAAAAAAATGGAATCACCAAGGGACCTTAGGACATAAATGTTTAACATGGTTATACGTTAGTAGTTGAGTGAGAGTTGATACGTTTACGACATAATTTTGAGGACAACTAGTGGGAGGAAAAAAGAAAGAAAACGTGTGCAATACCAAATTTGTAGAGATGAACTTTTCTTTGTAGTTCAAATTAGTCTATTTGTTGTCGTCGATTCAGAAGGTTTCTCAATAATCTAAGATGATATTTCTTTGTCAGCCTTTTTTGTATTCGACAATTCATCGTTTTGTTGTGTTTTGAGGGGAGATTCTGATATTGGGACTTGGTGGGGTTTTGTATAAGAAATTGTTAGGTTATAGATTGAAGCAAAGGAGAGAGGAAACGAAGGAGGGAGTTGATGGGTAAACGGTTTTCATCCATATACTAATCGTAAATGCATTTTGAGCTATTTTTTTTTGGAAAGGTTAAGGGTAATCAAACTCTTTTCGAGCTGATTTGGGAAGGAAACTGTAACGGCTCAAGCCCACCGCTAACAGATATTGCTTCCCTTCAACGCTTTAAAACTCGTCTGCTAGGGAAAGGTTTCCACACCCTTATAAAGGGTGTTTCGTTCTCCTCCCCAACTAATGTGGGATATCACAATCCACCCCCCTTCAGGGCCCAGCGTCCTCAGTGACACTCGTTCCTTTCTCCAATCGATGTGGGACCCCCACCAAATCCACCCCCTTCAAGGCCCAGTGTCCTTACTGGCCCACTGCCTCGTGTCTACTCCCCTTCGGGGAACAACCTCCTCCCTGGCATATCGCTCGGTACTTGACTCTGATACCATCTGTAACGGCCCAAGCCCATCGCTAGCAGATATTGTCCTCTTTGAGTTTTCTCTTTCAGGCTTCCCCTCAAGGCTTTAAAACGCGTCTGCTAGGGAGAGATTTCCA
CACCCTTATAAAGGGTGTTTCGTTCTCCTCCCCAACCAATTTAAATATTTTTATAGTTCAAATTTAGATATAAAATAGAAAACTGCAAAAGTTTTTTGAAGTTAAAAAATCAAGTAAAAAAAAAAACACTAAATGATTTTAATCAAAGTGTCAACTGTCAAGTCTAGATCTCTTCCAACCATGTGGGTGAAAGGATAGTTCCAAGAACATGGGAAAGGAGGAAACCGAACGACAATAAATGTCTTATTAAAAAACCAAGGCTTGTTTTTGCAGTACTAGGAGCAACCAAAGTACTGTTTTACAACGTTGTCTTGGCTGCAATTAATAAGTTACGTCAAAGTGCACTAAAATCAGCCATCAATATTTCAACAATTAATTTTGGACTTTCAAACATTATATTTCATCTAAAAATACTTCATTAAATTCAGATTGGATGTCAAGAAAATTCAAACTTCGATAGAAAATCTATAATTATTGATCAAATTCCTAAATGTCAACTAAAAAACGGGTTTAGACTCAATAAAAAAAAAAAAAAACTAAATGATTATAATCAAAGTGTCAAACTGTCGAGTCTAGATCTCTTCCAACCATGTGGGTGAAAGGATAGTTCCAAGAACATGGGAAAGGAGGAAACCAAACGACAATAAATGTCTTAATCAAAAACCAAGGCTTGTTTTTGCAGTACTAGGAGCAACCAAAGTACTGTTTTACAATGTTGTCTTGGCTGCAATTAATAAGTTAGGTCAAAGTGGCACTAAAATCAGCCATCAATATTTCAACAATTAATTTTAGACTTTGAAACATTATATTTCATCTAAAAATACTTCTTCGGATTGGATGTCATGAAAATTCAAATTTCTATAGAAAATCTATAATTATTGACCAAATTCCTAAATGTCAACTAAAAAACGGGTTTAGACTCAAAATTGAGAACCTCCAACTATAGATAGAAAAATTCTCATATGAAAAACGATGATTCTCGATCCTAATATTATGTTAAGCATACCATCAAATAAAATCAATCTAAAATACGGAGAGAAGATCAAGAAATGAACCCATTTGATCCGCTCTTAGCCACTAAAGTTTTGATTTTTTTGGCTCACAAATAGACTGAAAAATCAGTGAAACCAATCAGAAACAACTATAGATAGAGAAATTTTCATATGAAAAACATGATTCTCGATCTTAATATTATGTTAAGCAGACCATCAAATAAAATCAATCTAAAACACGGATAAAAAATCAAGAAATCAACTTATTTGCCTCCCTACTCTTAGTCACTAAAATTTTGATTTTTTTGGCTCACCCATAATCATAAACAACTAACCCAGTTGTAATTCATGATAAATTTTCCAAATCACAGTTTTATTCAGAGAAAGATTGCTAATGATGAACCACCCAAAGAAAATAGCCATCTACCACAAGAAGATGGCAAAATTGTAATTTCATAGTCTAAAAAAACCTTGATTTCTTTTCAAAAGAGGGTTTATTCTAAAAGAAAACAAAACTTTTAGAGTGCAGTTTCAACGAGTACTCAAACCACAGAGTCTGGAGCAACACTAAGACGAGGGCCTTGCCGGATTCACCAATTTGTATTTGTATCGATCTCAACCATGTGGATCTTTGGATGGACCGGACCATCTGGGTTCTCAGCCCGTTCTACAGCAGAAGAAGTCACAGAAGGAATTGATGGGAATGGCCTCACTGCTATTGTTACAGGTTTCGTTTCATCCCCTTCCCCTTTTTCTCCTTCTTCCCCTCTCTTTTAATCCCTTCTTTGATCTGGGTTTTGTTGTTTACAAAATCGATGGTGATTTAGTTTATTTATAGCAACCTGGATACCTTGTCTTTTGATTGGATTAAGGTTGTTTTGTTATTGACTCACTACTTGATTGCTATTTGTTTTGCAAATCTTTCTTGGGTATAATATGAATTCTGTGTTTTCTGAGCCCGTTCTGTAGCTGTTCTTGGACTGTTCAGTCTCAATCAAGTGTATATTT
TGTATCCTTTATATGTAGAATGCAGAAATCTATGGAGTAATGGGGTTTTCTTTGAGGATTTATGTATAGATTGAGCTTTATTTGTTAGGTTTCTATACTTAGCATAATGTTTGTATGCTTAGAAACTGAAGCAGAAATGGAACTAGAATGTCCATTTGCAAGAATGTGCAAGTATATCAATCAAAGCATCACAAGTTTGGATGGACTTCTAAAAAAGTTTCTAGACATGGAGTTTGAAGCATGTTTGCTTTCAAAAGTTTCATTAATACTCGTAAACGGTTAAGAAATATCCATAACCTTACCGGTACTATATGAATGGAAAACTTTTCCATTTTTCCCTCTTTTTTTACCATATCTTATTCAATATCCTCTTACTCCCCTTCTTTCGTTCTCCGATATCTAAAACGGCCCCTTCATTCAAGGTTATAGACTATAACAAGAAGAAGAAGAAAAAAATGGAATCACCAAGGGACCTTAGGACATAAATGTTTAACATGGTTATACGTTAGTAGTTGAGTGAGAGTTGATACGTTTACGACATAATTTTGAGGACAACTAGTGGGAGGAAAAAAGAAAGAAAACGTGTGCAATACCAAATTTGTAGAGATGAACTTTTCTTTGTAGTTCAAATTAGTCTATTTGTTGTCGTCGATTCAGAAGGTTTCTCAATAATCTAAGATGATATTTCTTTGTCAGCCTTTTTTGTATTCGACAATTCATCGTTTTGTTGTGTTTTGAGGGGAGATTCTGATATTGGGACTTGGTGGGGTTTTGTATAAGAAATTGTTAGGTTATAGATTGAAGCAAAGGAGAGAGGAAACGAAGGAGGGAGTTGATGGGTAAACGGTTTTCATCCATATACTAATCGTAAATGCATTTTGAGCTATTTTTTTTTGGAAAGGTTAAGGGTAATCAAACTCTTTTCGAGCTGATTTGGGAAGGAAACTGTAACGGCTCAAGCCCACCGCTAACAGATATTGCTTCCCTTCAACGCTTTAAAACTCGTCTGCTAGGGAAAGGTTTCCACACCCTTATAAAGGGTGTTTCGTTCTCCTCCCCAACTAATGTGGGATATCACAATCCACCCCCCTTCAGGGCCCAGCGTCCTCAGTGACACTCGTTCCTTTCTCCAATCGATGTGGGACCCCCACCAAATCCACCCCCTTCAAGGCCCAGTGTCCTTACTGGCCCACTGCCTCGTGTCTACTCCCCTTCGGGGAACAACCTCCTCCCTGGCATATCGCTCGGTACTTGACTCTGATACCATCTGTAACGGCCCAAGCCCATCGCTAGCAGATATTGTCCTCTTTGAGTTTTCTCTTTCAGGCTTCCCCTCAAGGCTTTAAAACGCGTCTGCTAGGGAGAGATTTCCACACCCTTATAAAGGGTGTTTCGTTCTCCTCCCCAACCAATGTGGGATACCACAGAAACAGTCCAATGAATTTCTAGATTTAGTTCTCATATGTTCGATTTTTCTATATACTTTTCTCTAATCATTCACTTCAAATGTTTCATTTTGATCTATAGAAGATCTTAATCTAAAGAACTGTTGAAGTTTTCCAACTTCGAAACATGTAGTTCATTTCCCTTTCCTTCCTCATGATTTCAGGAGCTTCAAGTGGTCTCGGTGAAGAGACGACACGTGTTCTTGCATTACGAGGAGTCTATGTCATTATGGCTGTAAGAAATGTTGAAGCAGGGAAAAAAGTAAAAGAAGCAGTACTGAAAGAATCCCCTTCAGCCAAAATTGATGTCATGGAGTTAGATCTTAGTTCAATGGAATCTGTAAGGAAATTTGCATCAGATTACATTGCATCAGGCCGTGCACTAAATATTCTCATGTAAGAACTGAATTAAAGCTGGTAGACTAAAATTCTTTGCCTGGGAAATCTGCTTATTTGATAATAATACAACTCTATATCTTCTTTGATTGAAGGAACAATGCGGGCGTTATGGCGACGCCTTTTATGCTTTCGCATGACGGCATAGAGTTGCAGTTT
GCAACAAACCATATAGGTATGTTTCAAATAAAAGATTTGTAATGGGTGCAGTTCTTATTATTGTTCTTTTGTAGGACATTTTCTTCTGACGAACCTTCTGCTGGAAACTATGAAAAAAACTGTGGTTGAAAGCAAAAAGGAGGGAAGGATTGTTAATCTGTCGTCGGAGGGGCACCGATTGACATATGGCGAAGGAGTTCGTTTCGATAAAATCAATGACGAATCAGAGTAACGATCTACGTTGCTTATTTGATGATTACTTCCTCAAAAGCTTCCTTGAATGTTATGTAGTGATTCATTGTATGATGTGAAATTTTAGGTACAGAACTATCTTTGCTTATGGACAATCAAAGCTTTCCAACATATTGCATGCCAAAGAGCTTGCCAGGCGGTTGAAGGTGCTTATTGGTTTAGTTATTGAATGTATAACCAGAGCTTTAGAACTTGAACCTTAAATGAGGCAGGATTTTTATATGTACGATAATCGGGGTAGTCGAAGGGGGAATAATCAAGTGAACCTTGAAACTTATATCCTTTCGAGATTGAGTCGTTTTTCAGAATCGAGAAACTTTAGTCTTCGTCTGTATAGATGTTCAACTGGACACTCTAATTGTGAGATCACACATTGGTTGGAGAAGGAAACGAAGCATTCCTTATATGGGTTTGGAAACCTCTCCTTAGTAGACGCGTTTTAAAATCTTGAGGGGAAGCCTAAAGAGGACAATATTTGCTAACGGTGGGCTTGGGATGTTACAAATGATATCAGAGCCAGGCACCAAACGGTATACCAACGAGGACGTTGGACCCCCAAGGGGGTGGATTGTGAGATCCCCACATTGGTTGGAGAGGGGAACGAAGCATTCCTTATAAGAATGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCTTGAGGGGAAGCCTAAAGAGGACAATATCTGCTCGCGGTAGGCTTGGGATGTTACAAATGATATCAGAGCCAGACACCAAACAGTATGCCAGCGAAGACTTTGAGGGAGCCTAAAAGGGAAAGCTCAAAGAGGACAATATCCGCTAGCGGTGGACTTGGGGTGTTACAAATGGTATCAGAGCGAGGCACTGAACAGTGTGCCAGCGAAGACGTTGGGCCCCCAAGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGGGGAACGAAACATTCCTTATAAGGGTGTGGAAACCTATCCCTAGTAGACGCGTTTTAAAACTTTGAGGGAAAGCCTGAAAGGGAAAGCCCAAAAAGGACAATATCTGCTAGCGGTGAGCTTGGGCTGTTACACAATGGTTTACATTACTGATTTCGCTGCAGTTCCCTTTGTTCTTTCTTATTGATTTTCCAAGATATCTTATGAAGTCAAATTGAGTAAGTGACTATCATTTCTCTACTTTTTTTAAATAGGAAGAAGGGGTGGAAATAACAGCCAACGCCCTTCATCCTGGAGCAATTGCTGCTACCAACCTACTACGCTTCCATGGTCTCATTAATGGTAACTTTCTCGCGTATTACATGTATAAGCCAAATCTGGTTTACTAGATGCAAGACGTAAGGTCTGCCTTTTTACTTAAACTAGCACGAGTCAGATGTTTTTTTTATTTTTACATTAATCAGATGCTTGTTTAGACTGTAGAATATAGGGCATTTTAATTTGCATTTCGACCGAGGTGGTAGCTTATGGTTTGTCCGAGTATGTTTCGTTCCAAAGAAGGGTGCTAGCACCTGGACTCTAATATCTACATATATGGGTAGGAGATTTGAAAATCTCTAATCTTTTGGTCGAAGACATGTCTTAACTAGTTTTTACCTTAACTAGTTGAACTATGTTTAAATTGACGTATACATATTTAGTTTGACAAAATGTGTTTTGAACAAAGCTCCGACTCCCGGGTATTAAGAACAAAAAGTTTTTTGAACAAATATAAACATCTACCAGATTGCTATCTATGCTTCATTACCTTTTCTTATTTCCTGCTTATTTC
CTGCCCTTGGTTTTCTGTCATCTGTTGGTGTCTGAGTATATGAGTTGTTTTCTTTTAAACAGCTGTTGCTACTTCCATTGCTACATTTGTGCTTAAAAACGTCCAGCAGGTATGTTATGAATTCATTTCCCCAAATTTTGTTCAAGACTGTTTATTTAGAGTTTCTCTCAGGTGGGTTTCTATGTGAATAAATTTTCATTGGATGTAGGGAGCGGCGACTCAGTGCTATGTAGCATTGAATCCCCAAGTCAAAGGGGTAAGTGGAGAGTATTTTGTGGATAGTAATATAGCCAACCCAACCAATCATGCCAAAGATATGGAATTGGCCAAGAAACTGTGGGATTTCAGTATGGATCTAACCACCCCTAAATAATTGTATCTCTTCTGAAAACCCTTCCTTACTCTTAGCTTCTCATCTCAGGCTCTTCATGTTAGAGATATATGGACAGTTTAATGGTGGAAGTAGATGAACCATTGTTTGTAGTTTTAAGTGCCACTTGAGAAACAGATACGAGTATAAGTTTGTAATGCTTTGTTTTGAAACAGATACCATAGTCATTCTTGTAGATTTCATATTGCTGTGTTTTAGAGTGTATTTGGAAAGGTGTTCATAATAAAGGCATATCCAACCAGCTCATGTAAAGCATTGCAAAGAAGTGTTAAATATAGTAATCAACTTTTTTGTGATTTCTACAAACTAATTCAAAATCAATAAAAGAAATAACTTTGAAAAGATCAAACACTAATCATTAACATATTCACACACTTTCTATCTATAGAGTAGTAATCTCTATTACAATTCAAATGAGGAGCCGAAACTCTGTGTTAATAGGACCCTGTTGAAGCTACACATTTCTATCAATCATTCTAATGATATCTGGCTTTGTTTTAGGTCATATTGTTGAAAAGAAAATGGCTTCTGGTTTGGCTCTTTCATGATTTGACCTCTATTTTACCTTCCAAACCCATCATTCCTCTAGCTGTTTGTCTAACATATCATGCAACACATTATGGGCAAGAGGAGATTCCAGAGGAAAAGCACCAGTTTGCTCAACTTCGCTGTGTCGTTGGGAAGAATGACCAGGCCCTAGTGGTCGTGCCTTCTGTCTCTATATCCTTTGCATTCGTCCACCGGTCTGAATTTACATATGGCTTAAGAGTATCACATCCATTGCCATGATCAAAATTGAATTCCTTCACAAACCCAATGGTAAGGGAGTGGTGTTGTTGATGCTGATGTGCTTGTTCACCGTTTGCTTTTGCATATTCTTCTGTACCATTTGAGGTATGACAGACAGCAGGAGAATTTTCTTTAGCCGTTTCAAATGGTTCATGTAATGGAGATGATGCAACTTCCAGTTCATTTGATTCCAGTGGCTTACTTCCTACGCTTCTTAATAAAGCATCTCCATCAGATAACTCAAATGAGAATCTATGATTAGCAGCAACAGGCTTCCTCCTTTCGGCTTCCATCAATGAGAATTCGAATACTTTGGGATTCATTTGTTGCGTGGTGATCTGACATAGATTCTGAAGTTTGGGGATTCAAAACAAAATCATTACTTGATTTGAATCCTATAGAATCTTGAGAGTAAGAATCAGTGCTTCGCCGTTGTTGCCAGCTATAAGTGGAACATTTGTCAAGGTCCAATAATGTAGGTGGAACTTCTAAATGGGAAATTCGAAAACTGAGAGCCAGAGGAAGCAAAATCACAATCAGGCAAAAGCGACTACGACCCAGAACGAGAAATGACTGACTGTGGTGGTATGCGGTGACTGATCGGGCTGCCAGGATAGGTAGTCATCATTAGGAAATGAACATTGATGATCAGGCTCAGGTTTCTGAAGGCTAGGTGGAAGAACCTAAGCAAAAGGAACTTCAAGGGAAGAAGGCCTAATCAAGTGGGTAGACAAAGGAGTGAAGGGAGCAGTTGATTCAAAGGTGGAGACACTAGCTATGTTTCATGAGCAAATGGGCCAATGGAAAAAATGGAGGAAGG
CCCATCAGGAGAACACATGTTGCTGTAGGTGAATGAGTAGCAGAAGGTGGCTCAGATTGAAGGAAGGATACAGGGGAAGAGGGAGGGGTGCAGCAAGTGGAAGCTCAATGTCGGGTGAGTGCAATGAATCTTCTTCATGAGCCTCAGCTGAAGGAGGACTAGGTTCTGGTACCAACACACAGCGTGCACAACAACTCTTTTCCTCTGTTTGAGAGATCCAAAGCACCAATAAATACTCCAACAGCTACCCCATCTCCTTTTCTGAGGGCAAGCAAACAAAGAGAAAACGAGGAAAGTGATAATGACATCAGTACAGAACAGTGGAAAGAGCTTGGACAAATGAGAGCCCTATTTGTTGGAATCTTTATATAAAAACCAATACTCTTCCTCTATAGAGAATGACATTAATTTAAAGAATGGAGAGATGCCTTTCTGGTTAGAAAGAGCAGCGTTATTACAAGAAGACAAACAACATTGGGCTAGAACAGCGCATTGCTTCCTGACACAGTCGTCCAACCCAATGCACAAATCTTGGTACCTTCCAAACCCTAAGTATCTAATAATCATGCGAGGAGAAAGCCAAAAGCAGAAAAAGCAGAAAATCAGAAACGAAGCCAGCTCGTGATTCCGTGTATTCATGTAACGATTCGCAGTCCCTAAGCAAACAAACAGCA

mRNA sequence

AGAAAGATTGCTAATGATGAACCACCCAAAGAAAATAGCCATCTACCACAAGAAGATGGCAAAATTAGTGCAGTTTCAACGAGTACTCAAACCACAGAGTCTGGAGCAACACTAAGACGAGGGCCTTGCCGGATTCACCAATCTGTATTTGTATCGATCTCAACTATGTGGATCTTTGGATGGACCGGACCATCTGGGTTCTCAGCCCGTTCTACAGCAGAAGAAGTCACAGAAGGAATTGATGGGAATGGCCTCACTGCTATTGTTACAGTTTCAACGAGTACTCAAACCACAGAGTCTGGAGCAACACTAAGACGAGGGCCTTGCCGGATTCACCAATTTGTATTTGTATCGATCTCAACCATGTGGATCTTTGGATGGACCGGACCATCTGGGTTCTCAGCCCGTTCTACAGCAGAAGAAGTCACAGAAGGAATTGATGGGAATGGCCTCACTGCTATTGTTACAGGAGCTTCAAGTGGTCTCGGTGAAGAGACGACACGTGTTCTTGCATTACGAGGAGTCTATGTCATTATGGCTGTAAGAAATGTTGAAGCAGGGAAAAAAGTAAAAGAAGCAGTACTGAAAGAATCCCCTTCAGCCAAAATTGATGTCATGGAGTTAGATCTTAGTTCAATGGAATCTGTAAGGAAATTTGCATCAGATTACATTGCATCAGGCCGTGCACTAAATATTCTCATGAACAATGCGGGCGTTATGGCGACGCCTTTTATGCTTTCGCATGACGGCATAGAGTTGCAGTTTGCAACAAACCATATAGGACATTTTCTTCTGACGAACCTTCTGCTGGAAACTATGAAAAAAACTGTGGTTGAAAGCAAAAAGGAGGGAAGGATTGTTAATCTGTCGTCGGAGGGGCACCGATTGACATATGGCGAAGGAGTTCGTTTCGATAAAATCAATGACGAATCAGAGTACAGAACTATCTTTGCTTATGGACAATCAAAGCTTTCCAACATATTGCATGCCAAAGAGCTTGCCAGGCGGTTGAAGGAAGAAGGGGTGGAAATAACAGCCAACGCCCTTCATCCTGGAGCAATTGCTGCTACCAACCTACTACGCTTCCATGGTCTCATTAATGCTGTTGCTACTTCCATTGCTACATTTGTGCTTAAAAACGTCCAGCAGGGAGCGGCGACTCAGTGCTATGTAGCATTGAATCCCCAAGTCAAAGGGGTAAGTGGAGAGTATTTTGTGGATAGTAATATAGCCAACCCAACCAATCATGCCAAAGATATGGAATTGGCCAAGAAACTGTGGGATTTCAGTATGGATCTAACCACCCCTAAATAATTGTATCTCTTCTGAAAACCCTTCCTTACTCTTAGCTTCTCATCTCAGGCTCTTCATGTTAGAGATATATGGACAGTTTAATGGTGGAAGTAGATGAACCATTGTTTGTAGTTTTAAGTGCCACTTGAGAAACAGATACGAGTATAAGTTTGTAATGCTTTGTTTTGAAACAGATACCATAGTCATTCTTGTAGATTTCATATTGCTGTGTTTTAGAGTGTATTTGGAAAGGTGTTCATAATAAAGGCATATCCAACCAGCTCATGTAAAGCATTGCAAAGAAGTGTTAAATATAGTAATCAACTTTTTTGTGATTTCTACAAACTAATTCAAAATCAATAAAAGAAATAACTTTGAAAAGATCAAACACTAATCATTAACATATTCACACACTTTCTATCTATAGAGTAGTAATCTCTATTACAATTCAAATGAGGAGCCGAAACTCTGTGTTAATAGGACCCTGTTGAAGCTACACATTTCTATCAATCATTCTAATGATATCTGGCTTTGTTTTAGGTCATATTGTTGAAAAGAAAATGGCTTCTGGTTTGGCTCTTTCATGATTTGACCTCTATTTTACCTTCCAAACCCATCATTCCTCTAGCTGTTTGTCTAACATATCATGCAACACATTATGGGCAAGAGGAGATTCCAGAGGAAAAGCACCAGTTTGCTCAACTTCG
CTGTGTCGTTGGGAAGAATGACCAGGCCCTAGTGGTCGTGCCTTCTGTCTCTATATCCTTTGCATTCGTCCACCGGTCTGAATTTACATATGGCTTAAGAGTATCACATCCATTGCCATGATCAAAATTGAATTCCTTCACAAACCCAATGGTAAGGGAGTGGTGTTGTTGATGCTGATGTGCTTGTTCACCGTTTGCTTTTGCATATTCTTCTGTACCATTTGAGGTATGACAGACAGCAGGAGAATTTTCTTTAGCCGTTTCAAATGGTTCATGTAATGGAGATGATGCAACTTCCAGTTCATTTGATTCCAGTGGCTTACTTCCTACGCTTCTTAATAAAGCATCTCCATCAGATAACTCAAATGAGAATCTATGATTAGCAGCAACAGGCTTCCTCCTTTCGGCTTCCATCAATGAGAATTCGAATACTTTGGGATTCATTTGTTGCGTGGTGATCTGACATAGATTCTGAAGTTTGGGGATTCAAAACAAAATCATTACTTGATTTGAATCCTATAGAATCTTGAGAGTAAGAATCAGTGCTTCGCCGTTGTTGCCAGCTATAAGTGGAACATTTGTCAAGGTCCAATAATGTAGGTGGAACTTCTAAATGGGAAATTCGAAAACTGAGAGCCAGAGGAAGCAAAATCACAATCAGGCAAAAGCGACTACGACCCAGAACGAGAAATGACTGACTGTGGTGGTATGCGGTGACTGATCGGGCTGCCAGGATAGGTAGTCATCATTAGGAAATGAACATTGATGATCAGGCTCAGGTTTCTGAAGGCTAGGTGGAAGAACCTAAGCAAAAGGAACTTCAAGGGAAGAAGGCCTAATCAAGTGGGTAGACAAAGGAGTGAAGGGAGCAGTTGATTCAAAGGTGGAGACACTAGCTATGTTTCATGAGCAAATGGGCCAATGGAAAAAATGGAGGAAGGCCCATCAGGAGAACACATGTTGCTGTAGGTGAATGAGTAGCAGAAGGTGGCTCAGATTGAAGGAAGGATACAGGGGAAGAGGGAGGGGTGCAGCAAGTGGAAGCTCAATGTCGGGTGAGTGCAATGAATCTTCTTCATGAGCCTCAGCTGAAGGAGGACTAGGTTCTGGTACCAACACACAGCGTGCACAACAACTCTTTTCCTCTGTTTGAGAGATCCAAAGCACCAATAAATACTCCAACAGCTACCCCATCTCCTTTTCTGAGGGCAAGCAAACAAAGAGAAAACGAGGAAAGTGATAATGACATCAGTACAGAACAGTGGAAAGAGCTTGGACAAATGAGAGCCCTATTTGTTGGAATCTTTATATAAAAACCAATACTCTTCCTCTATAGAGAATGACATTAATTTAAAGAATGGAGAGATGCCTTTCTGGTTAGAAAGAGCAGCGTTATTACAAGAAGACAAACAACATTGGGCTAGAACAGCGCATTGCTTCCTGACACAGTCGTCCAACCCAATGCACAAATCTTGGTACCTTCCAAACCCTAAGTATCTAATAATCATGCGAGGAGAAAGCCAAAAGCAGAAAAAGCAGAAAATCAGAAACGAAGCCAGCTCGTGATTCCGTGTATTCATGTAACGATTCGCAGTCCCTAAGCAAACAAACAGCA

Coding sequence (CDS)

AGAAAGATTGCTAATGATGAACCACCCAAAGAAAATAGCCATCTACCACAAGAAGATGGCAAAATTAGTGCAGTTTCAACGAGTACTCAAACCACAGAGTCTGGAGCAACACTAAGACGAGGGCCTTGCCGGATTCACCAATCTGTATTTGTATCGATCTCAACTATGTGGATCTTTGGATGGACCGGACCATCTGGGTTCTCAGCCCGTTCTACAGCAGAAGAAGTCACAGAAGGAATTGATGGGAATGGCCTCACTGCTATTGTTACAGTTTCAACGAGTACTCAAACCACAGAGTCTGGAGCAACACTAAGACGAGGGCCTTGCCGGATTCACCAATTTGTATTTGTATCGATCTCAACCATGTGGATCTTTGGATGGACCGGACCATCTGGGTTCTCAGCCCGTTCTACAGCAGAAGAAGTCACAGAAGGAATTGATGGGAATGGCCTCACTGCTATTGTTACAGGAGCTTCAAGTGGTCTCGGTGAAGAGACGACACGTGTTCTTGCATTACGAGGAGTCTATGTCATTATGGCTGTAAGAAATGTTGAAGCAGGGAAAAAAGTAAAAGAAGCAGTACTGAAAGAATCCCCTTCAGCCAAAATTGATGTCATGGAGTTAGATCTTAGTTCAATGGAATCTGTAAGGAAATTTGCATCAGATTACATTGCATCAGGCCGTGCACTAAATATTCTCATGAACAATGCGGGCGTTATGGCGACGCCTTTTATGCTTTCGCATGACGGCATAGAGTTGCAGTTTGCAACAAACCATATAGGACATTTTCTTCTGACGAACCTTCTGCTGGAAACTATGAAAAAAACTGTGGTTGAAAGCAAAAAGGAGGGAAGGATTGTTAATCTGTCGTCGGAGGGGCACCGATTGACATATGGCGAAGGAGTTCGTTTCGATAAAATCAATGACGAATCAGAGTACAGAACTATCTTTGCTTATGGACAATCAAAGCTTTCCAACATATTGCATGCCAAAGAGCTTGCCAGGCGGTTGAAGGAAGAAGGGGTGGAAATAACAGCCAACGCCCTTCATCCTGGAGCAATTGCTGCTACCAACCTACTACGCTTCCATGGTCTCATTAATGCTGTTGCTACTTCCATTGCTACATTTGTGCTTAAAAACGTCCAGCAGGGAGCGGCGACTCAGTGCTATGTAGCATTGAATCCCCAAGTCAAAGGGGTAAGTGGAGAGTATTTTGTGGATAGTAATATAGCCAACCCAACCAATCATGCCAAAGATATGGAATTGGCCAAGAAACTGTGGGATTTCAGTATGGATCTAACCACCCCTAAATAA

Protein sequence

RKIANDEPPKENSHLPQEDGKISAVSTSTQTTESGATLRRGPCRIHQSVFVSISTMWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTVSTSTQTTESGATLRRGPCRIHQFVFVSISTMWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAVRNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMATPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLRFHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMELAKKLWDFSMDLTTPK
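
The protein sequence above corresponds to a codon-by-codon reading of the coding sequence. As a minimal sketch, assuming the standard nuclear genetic code and reading frame 1, the following translates a nucleotide string; `translate` is a hypothetical helper written for illustration, and the example input is just the first 36 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, stop_at_stop=True):
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

print(translate("AGAAAGATTGCTAATGATGAACCACCCAAAGAAAAT"))
```

The printed prefix can be compared by eye with the start of the protein sequence; translating the full CDS the same way is a quick consistency check on a downloaded record.
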
BLAST of Cp4.1LG14g02170 vs. Swiss-Prot
Match: TIC32_ARATH (Short-chain dehydrogenase TIC 32, chloroplastic OS=Arabidopsis thaliana GN=TIC32 PE=2 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 4.2e-117
Identity = 220/314 (70.06%), Positives = 258/314 (82.17%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MW FG  G SGFS+RSTAEEVT G+DG GLTAIVTGASSG+G ET RVL+LRGV+V+MAV
Sbjct: 1   MWFFGSKGASGFSSRSTAEEVTHGVDGTGLTAIVTGASSGIGVETARVLSLRGVHVVMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN ++G KVKE ++K+ P AK+DVMELDLSSM+SVRKFAS+Y ++G  LN+L+NNAG+MA
Sbjct: 61  RNTDSGAKVKEDIVKQVPGAKLDVMELDLSSMQSVRKFASEYKSTGLPLNLLINNAGIMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
            PFMLS D IELQFATNH+GHFLLT LLL+TMK T  ESK+EGRIVNLSSE HR +Y EG
Sbjct: 121 CPFMLSKDNIELQFATNHLGHFLLTKLLLDTMKSTSRESKREGRIVNLSSEAHRFSYPEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           VRFDKIND+S Y ++ AYGQSKL N+LHA EL ++LKE+GV ITAN+LHPGAI  TNL R
Sbjct: 181 VRFDKINDKSSYSSMRAYGQSKLCNVLHANELTKQLKEDGVNITANSLHPGAI-MTNLGR 240

Query: 362 FHGLINAVAT-SIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDM 421
           +     AVA  ++A ++LK+V QGAAT CYVALNPQV GVSGEYF DSNIA P    KD 
Sbjct: 241 YFNPYLAVAVGAVAKYILKSVPQGAATTCYVALNPQVAGVSGEYFQDSNIAKPLPLVKDT 300

Query: 422 ELAKKLWDFSMDLT 435
           ELAKK+WDFS  LT
Sbjct: 301 ELAKKVWDFSTKLT 313
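
The identity and positives percentages in each HSP header are the match counts divided by the alignment length (gap columns included). A minimal sketch of that arithmetic, using the counts reported for the TIC32_ARATH hit above; `percent` is a hypothetical helper name.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * count / aligned_length:.2f}"

# Counts taken from the HSP header: 220 identical and 258 positive
# columns out of 314 aligned columns.
print(percent(220, 314), percent(258, 314))
```
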

BLAST of Cp4.1LG14g02170 vs. Swiss-Prot
Match: TIC32_PEA (Short-chain dehydrogenase TIC 32, chloroplastic OS=Pisum sativum GN=TIC32 PE=1 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 3.9e-115
Identity = 215/317 (67.82%), Positives = 252/317 (79.50%), Query Frame = 1

Query: 122 MWIFGWT-GPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMA 181
           MW F    G SGFS  STAE+VT GID  GLTAIVTGASSG+G ETTRVLALRG +VIM 
Sbjct: 1   MWPFSSKKGVSGFSGSSTAEQVTHGIDATGLTAIVTGASSGIGAETTRVLALRGAHVIMG 60

Query: 182 VRNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVM 241
           VRN+ A K VK+ +LK+ PSAK+D +ELDLSS++SV+KFAS++ +SGR LNIL+NNAG+M
Sbjct: 61  VRNMVAAKDVKDTILKDIPSAKVDAIELDLSSLDSVKKFASEFNSSGRPLNILINNAGIM 120

Query: 242 ATPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGE 301
           A PF LS D IELQFATNHIGHFLLTNLLL+TMKKT  ESKKEGRIVN++SE HR  Y E
Sbjct: 121 ACPFKLSKDNIELQFATNHIGHFLLTNLLLDTMKKTTRESKKEGRIVNVASEAHRFAYPE 180

Query: 302 GVRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLL 361
           G+RFDKIND+S Y    AYGQSKL+N+LHA +L + LKE+GV ITAN+LHPG I  TNL 
Sbjct: 181 GIRFDKINDQSSYNNWRAYGQSKLANVLHANQLTKHLKEDGVNITANSLHPGTI-VTNLF 240

Query: 362 RFHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDM 421
           R +  +N +   I   VLKNVQQGAAT CYVAL+PQVKGVSGEYF DSN+   T H KD+
Sbjct: 241 RHNSAVNGLINVIGKLVLKNVQQGAATTCYVALHPQVKGVSGEYFSDSNVYKTTPHGKDV 300

Query: 422 ELAKKLWDFSMDLTTPK 438
           +LAKKLWDFS++L   K
Sbjct: 301 DLAKKLWDFSINLVKQK 316

BLAST of Cp4.1LG14g02170 vs. Swiss-Prot
Match: RDH11_MOUSE (Retinol dehydrogenase 11 OS=Mus musculus GN=Rdh11 PE=1 SV=2)

HSP 1 Score: 188.3 bits (477), Expect = 1.7e-46
Identity = 113/291 (38.83%), Positives = 172/291 (59.11%), Query Frame = 1

Query: 143 TEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAVRNVEAGKKVKEAVLKESPSAK 202
           T  +   G  AIVTGA++G+G+ET + LA RG  V +A R+V+ G+     +   + +++
Sbjct: 31  TSNVQLPGKVAIVTGANTGIGKETAKDLAQRGARVYLACRDVDKGELAAREIQAVTGNSQ 90

Query: 203 IDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMATPFMLSHDGIELQFATNHIGH 262
           + V +LDL+  +S+R FA D++A  + L++L+NNAGVM  P+  + DG E+    NH+GH
Sbjct: 91  VFVRKLDLADTKSIRAFAKDFLAEEKHLHLLINNAGVMMCPYSKTADGFEMHIGVNHLGH 150

Query: 263 FLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVRFDKINDESEYRTIFAYGQS 322
           FLLT+LLLE +K++        RIVNLSS GH L     + F  +  E  Y    AY  S
Sbjct: 151 FLLTHLLLEKLKESA-----PSRIVNLSSLGHHL---GRIHFHNLQGEKFYSAGLAYCHS 210

Query: 323 KLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLRFHGLINAVATSIATFVLKNVQ 382
           KL+NIL  KELA+RLK  GV  T  ++HPG +  + L R+  ++  +      F+ K  Q
Sbjct: 211 KLANILFTKELAKRLKGSGV--TTYSVHPGTV-HSELTRYSSIMRWLWQLFFVFI-KTPQ 270

Query: 383 QGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMELAKKLWDFSMDL 434
           +GA T  Y AL   ++ +SG +F D  +A  +   ++  +A++LWD S DL
Sbjct: 271 EGAQTSLYCALTEGLESLSGSHFSDCQLAWVSYQGRNEIIARRLWDVSCDL 309

BLAST of Cp4.1LG14g02170 vs. Swiss-Prot
Match: RDH11_HUMAN (Retinol dehydrogenase 11 OS=Homo sapiens GN=RDH11 PE=1 SV=2)

HSP 1 Score: 185.3 bits (469), Expect = 1.5e-45
Identity = 112/284 (39.44%), Positives = 170/284 (59.86%), Query Frame = 1

Query: 150 GLTAIVTGASSGLGEETTRVLALRGVYVIMAVRNVEAGKKVKEAVLKESPSAKIDVMELD 209
           G   +VTGA++G+G+ET + LA RG  V +A R+VE G+ V + +   + + ++ V +LD
Sbjct: 41  GKVVVVTGANTGIGKETAKELAQRGARVYLACRDVEKGELVAKEIQTTTGNQQVLVRKLD 100

Query: 210 LSSMESVRKFASDYIASGRALNILMNNAGVMATPFMLSHDGIELQFATNHIGHFLLTNLL 269
           LS  +S+R FA  ++A  + L++L+NNAGVM  P+  + DG E+    NH+GHFLLT+LL
Sbjct: 101 LSDTKSIRAFAKGFLAEEKHLHVLINNAGVMMCPYSKTADGFEMHIGVNHLGHFLLTHLL 160

Query: 270 LETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVRFDKINDESEYRTIFAYGQSKLSNILH 329
           LE +K++        RIVN+SS  H L     + F  +  E  Y    AY  SKL+NIL 
Sbjct: 161 LEKLKESA-----PSRIVNVSSLAHHL---GRIHFHNLQGEKFYNAGLAYCHSKLANILF 220

Query: 330 AKELARRLKEEGVEITANALHPGAIAATNLLRFHGLINAVATSIATFVLKNVQQGAATQC 389
            +ELARRLK  GV  T  ++HPG +  + L+R H         + +F +K  QQGA T  
Sbjct: 221 TQELARRLKGSGV--TTYSVHPGTV-QSELVR-HSSFMRWMWWLFSFFIKTPQQGAQTSL 280

Query: 390 YVALNPQVKGVSGEYFVDSNIANPTNHAKDMELAKKLWDFSMDL 434
           + AL   ++ +SG +F D ++A  +  A++  +A++LWD S DL
Sbjct: 281 HCALTEGLEILSGNHFSDCHVAWVSAQARNETIARRLWDVSCDL 312

BLAST of Cp4.1LG14g02170 vs. Swiss-Prot
Match: RDH14_HUMAN (Retinol dehydrogenase 14 OS=Homo sapiens GN=RDH14 PE=1 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 9.5e-45
Identity = 117/298 (39.26%), Positives = 164/298 (55.03%), Query Frame = 1

Query: 149 NGLTAIVTGASSGLGEETTRVLALRGVYVIMAVRNVEAGKKVKEAVLKESPSA------- 208
           +G T ++TGA+SGLG  T   L   G  VIM  R+    ++    + +E   A       
Sbjct: 42  HGKTVLITGANSGLGRATAAELLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEP 101

Query: 209 ------KIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMATPFMLSHDGIELQF 268
                 ++ V ELDL+S+ SVR F  + +     L++L+NNAG+   P+M + DG E+QF
Sbjct: 102 GVSGVGELIVRELDLASLRSVRAFCQEMLQEEPRLDVLINNAGIFQCPYMKTEDGFEMQF 161

Query: 269 ATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVRFDKINDESEYRT 328
             NH+GHFLLTNLLL  +K     S    RIV +SS+ ++  YG+ + FD +N E  Y  
Sbjct: 162 GVNHLGHFLLTNLLLGLLK-----SSAPSRIVVVSSKLYK--YGD-INFDDLNSEQSYNK 221

Query: 329 IFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNL---LRFHGLINAVATS 388
            F Y +SKL+NIL  +ELARRL  EG  +T N LHPG I  TNL   +    L+  +   
Sbjct: 222 SFCYSRSKLANILFTRELARRL--EGTNVTVNVLHPG-IVRTNLGRHIHIPLLVKPLFNL 281

Query: 389 IATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMELAKKLWDFS 431
           ++    K   +GA T  Y+A +P+V+GVSG YF D         A D  +A+KLWD S
Sbjct: 282 VSWAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEEELLPKAMDESVARKLWDIS 328

BLAST of Cp4.1LG14g02170 vs. TrEMBL
Match: A0A0A0KUV5_CUCSA (Short-chain dehydrogenase OS=Cucumis sativus GN=Csa_4G047990 PE=4 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 2.9e-157
Identity = 288/316 (91.14%), Positives = 303/316 (95.89%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEE+TRVLALRGVYVIMAV
Sbjct: 1   MWIFGWKGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEESTRVLALRGVYVIMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN+EAG+KVKEAVLKESPSAKIDVMELDLSSMESVRKFA+DYIASG  LNILMNNAGVMA
Sbjct: 61  RNIEAGRKVKEAVLKESPSAKIDVMELDLSSMESVRKFAADYIASGLPLNILMNNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLSHDGIELQFATNH+GHFLLTNLLLETMKKTV+ESKKEGRIVNLSSEGHR+TYGEG
Sbjct: 121 TPFMLSHDGIELQFATNHLGHFLLTNLLLETMKKTVLESKKEGRIVNLSSEGHRITYGEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RF+KIN+ESEYRTI AYGQSKLSNILHAKELARRLK EGVEITANALHPG+I ATNLLR
Sbjct: 181 IRFNKINNESEYRTILAYGQSKLSNILHAKELARRLKVEGVEITANALHPGSI-ATNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
           FH  INAV   +A +VLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDM+
Sbjct: 241 FHSTINAVTNLVAKYVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMD 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS+DLT PK
Sbjct: 301 LAKKLWDFSVDLTNPK 315

BLAST of Cp4.1LG14g02170 vs. TrEMBL
Match: M5WB87_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008889mg PE=4 SV=1)

HSP 1 Score: 483.4 bits (1243), Expect = 2.9e-133
Identity = 248/316 (78.48%), Positives = 277/316 (87.66%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW G SGFSARSTAEEVT+GIDG GLTAIVTGASSGLG ETTRVLALR V+VIMAV
Sbjct: 1   MWIFGWKGQSGFSARSTAEEVTQGIDGTGLTAIVTGASSGLGLETTRVLALRRVHVIMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN EAG+ V+ A+LKE P+A I+VMELDLSSM SVRKFAS+Y +SG  LNIL+NNAGVMA
Sbjct: 61  RNTEAGRDVRTAILKEIPTANINVMELDLSSMASVRKFASEYNSSGLPLNILINNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLS D IELQFATNH+GHFLLTNLLLETMKKT  ESKKEGRIVNLSSE HR  Y EG
Sbjct: 121 TPFMLSQDNIELQFATNHLGHFLLTNLLLETMKKTTRESKKEGRIVNLSSEAHRFAYSEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y +I+AYGQSKL+NILHA EL +RLKEEGV ITAN+LHPG+I ATNLLR
Sbjct: 181 IRFDKINDESGYSSIYAYGQSKLANILHANELTKRLKEEGVAITANSLHPGSI-ATNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
           +H  IN +A+++   +LKNVQQGAAT+CYVAL+PQVKGVSGEYF+DSN ANPT+ AKD E
Sbjct: 241 YHSYINVIASTLGRLMLKNVQQGAATECYVALHPQVKGVSGEYFMDSNKANPTSQAKDPE 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS+ LT PK
Sbjct: 301 LAKKLWDFSLSLTDPK 315

BLAST of Cp4.1LG14g02170 vs. TrEMBL
Match: A0A0D2MPJ0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G157700 PE=3 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 2.4e-132
Identity = 243/316 (76.90%), Positives = 274/316 (86.71%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSA STAEEVT+GIDG+ LTAIVTGASSG+G ETTRVLALRGV+V+MAV
Sbjct: 1   MWIFGWKGPSGFSASSTAEEVTQGIDGSALTAIVTGASSGIGVETTRVLALRGVHVVMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN +AG+ VKE++LKE PSAKIDVM+LDLSSM SVRKFAS Y +S   LN+L+NNAGVMA
Sbjct: 61  RNADAGQNVKESILKEIPSAKIDVMDLDLSSMASVRKFASQYQSSNLPLNLLINNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           +PFMLS D IELQFATNH+GHFLLT+LLLETMK+T  ES  EGRIVN+SSEGHR+ Y EG
Sbjct: 121 SPFMLSQDKIELQFATNHLGHFLLTDLLLETMKRTARESDIEGRIVNVSSEGHRIAYSEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y T +AYGQSKL+NILHAKELARRLKEEGVEITAN+LHPGAI +TNL+R
Sbjct: 181 IRFDKINDESGYYTWYAYGQSKLANILHAKELARRLKEEGVEITANSLHPGAIISTNLMR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
            HGLIN V   +  + LKN+ QGAAT CYVALNPQVKGVSGEYF+DSNI NP+  AKD +
Sbjct: 241 HHGLINTVGQMLGKYFLKNIPQGAATTCYVALNPQVKGVSGEYFLDSNIGNPSAKAKDAD 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS  LT PK
Sbjct: 301 LAKKLWDFSCTLTNPK 316

BLAST of Cp4.1LG14g02170 vs. TrEMBL
Match: A0A0B0PI19_GOSAR (Short-chain dehydrogenase TIC 32, chloroplastic-like protein OS=Gossypium arboreum GN=F383_28545 PE=3 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 2.1e-131
Identity = 242/316 (76.58%), Positives = 272/316 (86.08%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSA STAEEVT+GIDG+ L AIVTGASSG+G ETTRVLALRGV+V+MAV
Sbjct: 1   MWIFGWKGPSGFSASSTAEEVTQGIDGSALAAIVTGASSGIGVETTRVLALRGVHVVMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN +AG+ VKE++LKE PSAKIDVMELDLSSM SVRKFAS Y +S   LN+L+NNAGVMA
Sbjct: 61  RNADAGRNVKESILKEIPSAKIDVMELDLSSMASVRKFASQYQSSNLPLNLLINNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLS D IELQFATNH+GHFLLT+LLLETMK+T  ES  EGRIVN+SSEGHR+ Y EG
Sbjct: 121 TPFMLSQDKIELQFATNHLGHFLLTDLLLETMKRTARESNIEGRIVNVSSEGHRIAYSEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y T +AYGQSKL+NILHAKELA+RLKEE VEITAN+LHPGAI +TNL+R
Sbjct: 181 IRFDKINDESGYYTWYAYGQSKLANILHAKELAQRLKEEEVEITANSLHPGAIISTNLMR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
            HGLIN V   +  + LKN+ QGAAT CYVALNPQVKGVSGEYF+DSNI NP+  AKD +
Sbjct: 241 HHGLINTVGQMLGRYFLKNIPQGAATTCYVALNPQVKGVSGEYFLDSNIGNPSAKAKDAD 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS  LT PK
Sbjct: 301 LAKKLWDFSCTLTNPK 316

BLAST of Cp4.1LG14g02170 vs. TrEMBL
Match: Q0VH86_GOSHI (3-ketoacyl-CoA reductase 3 OS=Gossypium hirsutum PE=2 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 3.5e-131
Identity = 242/316 (76.58%), Positives = 272/316 (86.08%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSA STAEEVT+GIDG+ L AIVTGASSG+G ETTRVLALRGV+V+MAV
Sbjct: 13  MWIFGWKGPSGFSASSTAEEVTQGIDGSALAAIVTGASSGIGVETTRVLALRGVHVVMAV 72

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN +AG+ VKE++LKE PSAKIDVMELDLSSM SVRKFAS Y +S   LN+L+NNAGVMA
Sbjct: 73  RNADAGRNVKESILKEIPSAKIDVMELDLSSMASVRKFASQYQSSNLPLNLLINNAGVMA 132

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLS D IELQFATNH+GHFLLT+LLLETMK+T  ES  EGRIVN+SSEGHR+ Y EG
Sbjct: 133 TPFMLSQDKIELQFATNHLGHFLLTDLLLETMKRTARESNIEGRIVNVSSEGHRIAYREG 192

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y T +AYGQSKL+NILHAKELA+RLKEE VEITAN+LHPGAI +TNL+R
Sbjct: 193 IRFDKINDESGYYTWYAYGQSKLANILHAKELAQRLKEEEVEITANSLHPGAIISTNLMR 252

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
            HGLIN V   +  + LKN+ QGAAT CYVALNPQVKGVSGEYF+DSNI NP+  AKD +
Sbjct: 253 HHGLINTVGQMLGKYFLKNIPQGAATTCYVALNPQVKGVSGEYFLDSNIGNPSAKAKDAD 312

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS  LT PK
Sbjct: 313 LAKKLWDFSCTLTNPK 328

BLAST of Cp4.1LG14g02170 vs. TAIR10
Match: AT4G11410.1 (AT4G11410.1 NAD(P)-binding Rossmann-fold superfamily protein)

HSP 1 Score: 430.6 bits (1106), Expect = 1.1e-120
Identity = 216/313 (69.01%), Positives = 257/313 (82.11%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MW F W G SGFSARSTAEEVT GIDG GLTAIVTGASSG+GEETTRVLALRGV+V+MAV
Sbjct: 1   MWPFWWKGASGFSARSTAEEVTHGIDGTGLTAIVTGASSGIGEETTRVLALRGVHVVMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN ++G +V++ +LKE P AKIDVM+LDLSSM SVR FAS+Y +    LN+L+NNAG+MA
Sbjct: 61  RNTDSGNQVRDKILKEIPQAKIDVMKLDLSSMASVRSFASEYQSLDLPLNLLINNAGIMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
            PF+LS D IELQFATNH+GHFLLTNLLLE MKKT  ES +EGRIV +SSEGHR  Y EG
Sbjct: 121 CPFLLSSDNIELQFATNHLGHFLLTNLLLERMKKTASESNREGRIVIVSSEGHRFAYREG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           V+FDKINDE+ Y T+ AYGQSKL NILHA ELAR  KE+GV ITAN+LHPG+I  TNLLR
Sbjct: 181 VQFDKINDEARYNTLQAYGQSKLGNILHATELARLFKEQGVNITANSLHPGSI-MTNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
           +H  IN +  ++  +VLK++ QGAAT CY AL+PQ KGVSGEY +D+NI++P +  KD +
Sbjct: 241 YHSFINTIGNAVGKYVLKSIPQGAATTCYAALHPQAKGVSGEYLMDNNISDPNSQGKDKD 300

Query: 422 LAKKLWDFSMDLT 435
           LAKKLW+FS+ LT
Sbjct: 301 LAKKLWEFSLRLT 312

BLAST of Cp4.1LG14g02170 vs. TAIR10
Match: AT4G23430.2 (AT4G23430.2 NAD(P)-binding Rossmann-fold superfamily protein)

HSP 1 Score: 422.9 bits (1086), Expect = 2.3e-118
Identity = 220/314 (70.06%), Positives = 258/314 (82.17%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MW FG  G SGFS+RSTAEEVT G+DG GLTAIVTGASSG+G ET RVL+LRGV+V+MAV
Sbjct: 1   MWFFGSKGASGFSSRSTAEEVTHGVDGTGLTAIVTGASSGIGVETARVLSLRGVHVVMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN ++G KVKE ++K+ P AK+DVMELDLSSM+SVRKFAS+Y ++G  LN+L+NNAG+MA
Sbjct: 61  RNTDSGAKVKEDIVKQVPGAKLDVMELDLSSMQSVRKFASEYKSTGLPLNLLINNAGIMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
            PFMLS D IELQFATNH+GHFLLT LLL+TMK T  ESK+EGRIVNLSSE HR +Y EG
Sbjct: 121 CPFMLSKDNIELQFATNHLGHFLLTKLLLDTMKSTSRESKREGRIVNLSSEAHRFSYPEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           VRFDKIND+S Y ++ AYGQSKL N+LHA EL ++LKE+GV ITAN+LHPGAI  TNL R
Sbjct: 181 VRFDKINDKSSYSSMRAYGQSKLCNVLHANELTKQLKEDGVNITANSLHPGAI-MTNLGR 240

Query: 362 FHGLINAVAT-SIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDM 421
           +     AVA  ++A ++LK+V QGAAT CYVALNPQV GVSGEYF DSNIA P    KD 
Sbjct: 241 YFNPYLAVAVGAVAKYILKSVPQGAATTCYVALNPQVAGVSGEYFQDSNIAKPLPLVKDT 300

Query: 422 ELAKKLWDFSMDLT 435
           ELAKK+WDFS  LT
Sbjct: 301 ELAKKVWDFSTKLT 313

BLAST of Cp4.1LG14g02170 vs. TAIR10
Match: AT4G23420.3 (AT4G23420.3 NAD(P)-binding Rossmann-fold superfamily protein)

HSP 1 Score: 411.8 bits (1057), Expect = 5.4e-115
Identity = 212/304 (69.74%), Positives = 248/304 (81.58%), Query Frame = 1

Query: 131 SGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAVRNVEAGKKV 190
           SGFS+RSTAEEVT G+DG GLTAIVTGASSG+G ET RVLALRGV+V+MAVRN  AG KV
Sbjct: 27  SGFSSRSTAEEVTHGVDGTGLTAIVTGASSGIGVETARVLALRGVHVVMAVRNTGAGAKV 86

Query: 191 KEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMATPFMLSHDG 250
           KE ++K+ P AK+DVMEL+LSSMESVRKFAS+Y ++G  LN+L+NNAG+MA PFMLS D 
Sbjct: 87  KEDIVKQVPGAKVDVMELELSSMESVRKFASEYKSAGLPLNLLINNAGIMACPFMLSKDN 146

Query: 251 IELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVRFDKINDE 310
           IELQFATNH+GHFLLT LLL+TMK T  ESK+EGRIVN+SSE HR +Y EGVRFDKINDE
Sbjct: 147 IELQFATNHLGHFLLTKLLLDTMKNTSRESKREGRIVNVSSEAHRYSYPEGVRFDKINDE 206

Query: 311 SEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLRFHGLINAVA 370
           S Y +I AYGQSKL N+LHA ELA++LKE+GV ITAN+LHPGAI       F+  +    
Sbjct: 207 SSYSSIRAYGQSKLCNVLHANELAKQLKEDGVNITANSLHPGAIMTNLWGYFNSYLAGAV 266

Query: 371 TSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMELAKKLWDFS 430
            ++A +++K+V QGAAT CYVALNPQV GV+GEYF DSNIA P    KD ELAKKLWDFS
Sbjct: 267 GAVAKYMVKSVPQGAATTCYVALNPQVAGVTGEYFSDSNIAKPIELVKDTELAKKLWDFS 326

Query: 431 MDLT 435
             LT
Sbjct: 327 TKLT 330

BLAST of Cp4.1LG14g02170 vs. TAIR10
Match: AT5G02540.1 (AT5G02540.1 NAD(P)-binding Rossmann-fold superfamily protein)

HSP 1 Score: 330.1 bits (845), Expect = 2.1e-90
Identity = 170/310 (54.84%), Positives = 224/310 (72.26%), Query Frame = 1

Query: 124 IFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAVRN 183
           I G  GPSGF + STAEEVT+GID   LTAI+TG + G+G ET RVL+ RG +V++  RN
Sbjct: 7   ITGRRGPSGFGSASTAEEVTQGIDATNLTAIITGGTGGIGMETARVLSKRGAHVVIGARN 66

Query: 184 VEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMATP 243
           + A +  K  +L+++ +A++ +++LDLSS++S++ F  ++ A    LN+L+NNAGVM  P
Sbjct: 67  MGAAENAKTEILRQNANARVTLLQLDLSSIKSIKAFVREFHALHLPLNLLINNAGVMFCP 126

Query: 244 FMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVR 303
           + LS DGIELQFATNHIGHFLLTNLLL+TMK T   S  EGRI+N+SS  H  TY EG++
Sbjct: 127 YQLSEDGIELQFATNHIGHFLLTNLLLDTMKNTAKTSGVEGRILNVSSVAHIYTYQEGIQ 186

Query: 304 FDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLRFH 363
           FD IND   Y    AYGQSKL+NILHA EL+R+L+EEGV ITAN++HPG I  TNL +  
Sbjct: 187 FDSINDICSYSDKRAYGQSKLANILHANELSRQLQEEGVNITANSVHPGLI-LTNLFQHT 246

Query: 364 GLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMELA 423
            L+       + ++ KN+ QGAAT CYVAL+P VKGV+G+YF D N   P+  A+D  LA
Sbjct: 247 ALLMRFLKFFSFYLWKNIPQGAATTCYVALHPSVKGVTGKYFADCNEVTPSKLARDETLA 306

Query: 424 KKLWDFSMDL 434
           +KLWDFS+ L
Sbjct: 307 QKLWDFSVKL 315

BLAST of Cp4.1LG14g02170 vs. TAIR10
Match: AT2G37540.1 (AT2G37540.1 NAD(P)-binding Rossmann-fold superfamily protein)

HSP 1 Score: 325.5 bits (833), Expect = 5.1e-89
Identity = 169/308 (54.87%), Positives = 221/308 (71.75%), Query Frame = 1

Query: 126 GWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAVRNVE 185
           G  G SGF + STAE+VT+ ID + LTAI+TG +SG+G E  RVLA+RG +VI+A RN +
Sbjct: 9   GKKGKSGFGSASTAEDVTQAIDASHLTAIITGGTSGIGLEAARVLAMRGAHVIIAARNPK 68

Query: 186 AGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMATPFM 245
           A  + KE +L+ +P+A++D +++D+SS++SVR F   ++A    LNIL+NNAGVM  PF 
Sbjct: 69  AANESKEMILQMNPNARVDYLQIDVSSIKSVRSFVDQFLALNVPLNILINNAGVMFCPFK 128

Query: 246 LSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEGVRFD 305
           L+ DGIE QFATNHIGHFLLTNLLL+ MK T  ES  +GRIVNLSS  H  TY EG++F 
Sbjct: 129 LTEDGIESQFATNHIGHFLLTNLLLDKMKSTARESGVQGRIVNLSSIAHTYTYSEGIKFQ 188

Query: 306 KINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLRFHGL 365
            IND + Y    AYGQSKLSN+LH+  L+RRL+EEGV IT N++HPG +  TNL R+ G 
Sbjct: 189 GINDPAGYSERRAYGQSKLSNLLHSNALSRRLQEEGVNITINSVHPG-LVTTNLFRYSGF 248

Query: 366 INAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMELAKK 425
              V  ++     KN+ QGAAT CYVAL+P ++GV+G+YF D NI  P+  A +  LA K
Sbjct: 249 SMKVFRAMTFLFWKNIPQGAATTCYVALHPDLEGVTGKYFGDCNIVAPSKFATNNSLADK 308

Query: 426 LWDFSMDL 434
           LWDFS+ L
Sbjct: 309 LWDFSVFL 315

BLAST of Cp4.1LG14g02170 vs. NCBI nr
Match: gi|659102252|ref|XP_008452031.1| (PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic-like [Cucumis melo])

HSP 1 Score: 565.5 bits (1456), Expect = 8.3e-158
Identity = 290/316 (91.77%), Positives = 302/316 (95.57%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEE+TRVLALRGVYVIMAV
Sbjct: 1   MWIFGWKGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEESTRVLALRGVYVIMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RNVEAG+KVKEAVLKESPSAKIDVMELDLSSMESVRKFA+DYIASGR LNILMNNAGVMA
Sbjct: 61  RNVEAGRKVKEAVLKESPSAKIDVMELDLSSMESVRKFAADYIASGRPLNILMNNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPF LSHDGIELQFATNH+GHFLLTNLLLETMKKTV+ESKKEGRIVNLSSEGHR+ YGEG
Sbjct: 121 TPFTLSHDGIELQFATNHLGHFLLTNLLLETMKKTVLESKKEGRIVNLSSEGHRMAYGEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKIN+E+EYRTI AYGQSKLSNILHAKELARRLKEEGVEITANALHPGAI ATNLLR
Sbjct: 181 IRFDKINNEAEYRTILAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAI-ATNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
           FH  INAV   +A FVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPT HAKDM+
Sbjct: 241 FHSTINAVTNLVAKFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTKHAKDMD 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS+DLT PK
Sbjct: 301 LAKKLWDFSVDLTNPK 315

BLAST of Cp4.1LG14g02170 vs. NCBI nr
Match: gi|449457572|ref|XP_004146522.1| (PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic [Cucumis sativus])

HSP 1 Score: 563.1 bits (1450), Expect = 4.1e-157
Identity = 288/316 (91.14%), Positives = 303/316 (95.89%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEE+TRVLALRGVYVIMAV
Sbjct: 1   MWIFGWKGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEESTRVLALRGVYVIMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN+EAG+KVKEAVLKESPSAKIDVMELDLSSMESVRKFA+DYIASG  LNILMNNAGVMA
Sbjct: 61  RNIEAGRKVKEAVLKESPSAKIDVMELDLSSMESVRKFAADYIASGLPLNILMNNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLSHDGIELQFATNH+GHFLLTNLLLETMKKTV+ESKKEGRIVNLSSEGHR+TYGEG
Sbjct: 121 TPFMLSHDGIELQFATNHLGHFLLTNLLLETMKKTVLESKKEGRIVNLSSEGHRITYGEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RF+KIN+ESEYRTI AYGQSKLSNILHAKELARRLK EGVEITANALHPG+I ATNLLR
Sbjct: 181 IRFNKINNESEYRTILAYGQSKLSNILHAKELARRLKVEGVEITANALHPGSI-ATNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
           FH  INAV   +A +VLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDM+
Sbjct: 241 FHSTINAVTNLVAKYVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDMD 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS+DLT PK
Sbjct: 301 LAKKLWDFSVDLTNPK 315

BLAST of Cp4.1LG14g02170 vs. NCBI nr
Match: gi|1009156147|ref|XP_015896091.1| (PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 486.9 bits (1252), Expect = 3.7e-134
Identity = 254/316 (80.38%), Positives = 275/316 (87.03%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFG  GPSGFSA STAEEVT GIDG GLTAIVTGASSGLG ETTRVLALRGV+VIMAV
Sbjct: 1   MWIFGRKGPSGFSACSTAEEVTHGIDGAGLTAIVTGASSGLGVETTRVLALRGVHVIMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RNV +GK V+E VLKE P+AKIDVMELDLSSM SVRKFAS+Y ASG  LN+L+NNAGVMA
Sbjct: 61  RNVNSGKDVRETVLKEIPTAKIDVMELDLSSMASVRKFASEYSASGLPLNLLINNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLS D IELQFATNHIGHFLLTNLLLETMK T  ES KEGRIV++SSEGHR  Y EG
Sbjct: 121 TPFMLSQDNIELQFATNHIGHFLLTNLLLETMKNTARESNKEGRIVHVSSEGHRFAYREG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y +I+AYGQSKL+NILHAKEL+R LK EGV+ITANALHPGAI  TNLLR
Sbjct: 181 IRFDKINDESSYNSIYAYGQSKLANILHAKELSRILKAEGVDITANALHPGAI-VTNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
            HG  N +A     FVLKNVQQGAATQCYVALNPQVKGVSGEYF+DSN ANP+N AKD E
Sbjct: 241 HHGFFNVIANMFGKFVLKNVQQGAATQCYVALNPQVKGVSGEYFMDSNKANPSNLAKDAE 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS++LT+PK
Sbjct: 301 LAKKLWDFSLNLTSPK 315

BLAST of Cp4.1LG14g02170 vs. NCBI nr
Match: gi|595847686|ref|XP_007209377.1| (hypothetical protein PRUPE_ppa008889mg [Prunus persica])

HSP 1 Score: 483.4 bits (1243), Expect = 4.1e-133
Identity = 248/316 (78.48%), Positives = 277/316 (87.66%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW G SGFSARSTAEEVT+GIDG GLTAIVTGASSGLG ETTRVLALR V+VIMAV
Sbjct: 1   MWIFGWKGQSGFSARSTAEEVTQGIDGTGLTAIVTGASSGLGLETTRVLALRRVHVIMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN EAG+ V+ A+LKE P+A I+VMELDLSSM SVRKFAS+Y +SG  LNIL+NNAGVMA
Sbjct: 61  RNTEAGRDVRTAILKEIPTANINVMELDLSSMASVRKFASEYNSSGLPLNILINNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           TPFMLS D IELQFATNH+GHFLLTNLLLETMKKT  ESKKEGRIVNLSSE HR  Y EG
Sbjct: 121 TPFMLSQDNIELQFATNHLGHFLLTNLLLETMKKTTRESKKEGRIVNLSSEAHRFAYSEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y +I+AYGQSKL+NILHA EL +RLKEEGV ITAN+LHPG+I ATNLLR
Sbjct: 181 IRFDKINDESGYSSIYAYGQSKLANILHANELTKRLKEEGVAITANSLHPGSI-ATNLLR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
           +H  IN +A+++   +LKNVQQGAAT+CYVAL+PQVKGVSGEYF+DSN ANPT+ AKD E
Sbjct: 241 YHSYINVIASTLGRLMLKNVQQGAATECYVALHPQVKGVSGEYFMDSNKANPTSQAKDPE 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS+ LT PK
Sbjct: 301 LAKKLWDFSLSLTDPK 315

BLAST of Cp4.1LG14g02170 vs. NCBI nr
Match: gi|823143979|ref|XP_012471805.1| (PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic-like [Gossypium raimondii])

HSP 1 Score: 480.3 bits (1235), Expect = 3.5e-132
Identity = 243/316 (76.90%), Positives = 274/316 (86.71%), Query Frame = 1

Query: 122 MWIFGWTGPSGFSARSTAEEVTEGIDGNGLTAIVTGASSGLGEETTRVLALRGVYVIMAV 181
           MWIFGW GPSGFSA STAEEVT+GIDG+ LTAIVTGASSG+G ETTRVLALRGV+V+MAV
Sbjct: 1   MWIFGWKGPSGFSASSTAEEVTQGIDGSALTAIVTGASSGIGVETTRVLALRGVHVVMAV 60

Query: 182 RNVEAGKKVKEAVLKESPSAKIDVMELDLSSMESVRKFASDYIASGRALNILMNNAGVMA 241
           RN +AG+ VKE++LKE PSAKIDVM+LDLSSM SVRKFAS Y +S   LN+L+NNAGVMA
Sbjct: 61  RNADAGQNVKESILKEIPSAKIDVMDLDLSSMASVRKFASQYQSSNLPLNLLINNAGVMA 120

Query: 242 TPFMLSHDGIELQFATNHIGHFLLTNLLLETMKKTVVESKKEGRIVNLSSEGHRLTYGEG 301
           +PFMLS D IELQFATNH+GHFLLT+LLLETMK+T  ES  EGRIVN+SSEGHR+ Y EG
Sbjct: 121 SPFMLSQDKIELQFATNHLGHFLLTDLLLETMKRTARESDIEGRIVNVSSEGHRIAYSEG 180

Query: 302 VRFDKINDESEYRTIFAYGQSKLSNILHAKELARRLKEEGVEITANALHPGAIAATNLLR 361
           +RFDKINDES Y T +AYGQSKL+NILHAKELARRLKEEGVEITAN+LHPGAI +TNL+R
Sbjct: 181 IRFDKINDESGYYTWYAYGQSKLANILHAKELARRLKEEGVEITANSLHPGAIISTNLMR 240

Query: 362 FHGLINAVATSIATFVLKNVQQGAATQCYVALNPQVKGVSGEYFVDSNIANPTNHAKDME 421
            HGLIN V   +  + LKN+ QGAAT CYVALNPQVKGVSGEYF+DSNI NP+  AKD +
Sbjct: 241 HHGLINTVGQMLGKYFLKNIPQGAATTCYVALNPQVKGVSGEYFLDSNIGNPSAKAKDAD 300

Query: 422 LAKKLWDFSMDLTTPK 438
           LAKKLWDFS  LT PK
Sbjct: 301 LAKKLWDFSCTLTNPK 316
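
The percentages on each HSP summary line follow directly from the match counts, for example 243/316 = 76.90% identity in the alignment above. The Python sketch below recomputes them as a consistency check. The function name `parse_hsp_summary` and the regular expression are illustrative assumptions and are not part of the database export; the pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Hypothetical helper, not part of the database export.
# Matches e.g. "Identity = 243/316 (76.90%), Postives = 274/316 (86.71%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos[a-z]*ves = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return recomputed and reported identity/positive percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the raw counts, rounded to two decimals.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


if __name__ == "__main__":
    summary = "Identity = 243/316 (76.90%), Postives = 274/316 (86.71%), Query Frame = 1"
    print(parse_hsp_summary(summary))
```

Comparing the recomputed and reported values is a quick way to catch lines damaged during extraction.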

The following BLAST results are available for this feature:
Match Name         E-value    Identity  Description
TIC32_ARATH        4.2e-117   70.06     Short-chain dehydrogenase TIC 32, chloroplastic OS=Arabidopsis thaliana GN=TIC32...
TIC32_PEA          3.9e-115   67.82     Short-chain dehydrogenase TIC 32, chloroplastic OS=Pisum sativum GN=TIC32 PE=1 S...
RDH11_MOUSE        1.7e-46    38.83     Retinol dehydrogenase 11 OS=Mus musculus GN=Rdh11 PE=1 SV=2
RDH11_HUMAN        1.5e-45    39.44     Retinol dehydrogenase 11 OS=Homo sapiens GN=RDH11 PE=1 SV=2
RDH14_HUMAN        9.5e-45    39.26     Retinol dehydrogenase 14 OS=Homo sapiens GN=RDH14 PE=1 SV=1

Match Name         E-value    Identity  Description
A0A0A0KUV5_CUCSA   2.9e-157   91.14     Short-chain dehydrogenase OS=Cucumis sativus GN=Csa_4G047990 PE=4 SV=1
M5WB87_PRUPE       2.9e-133   78.48     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008889mg PE=4 SV=1
A0A0D2MPJ0_GOSRA   2.4e-132   76.90     Uncharacterized protein OS=Gossypium raimondii GN=B456_003G157700 PE=3 SV=1
A0A0B0PI19_GOSAR   2.1e-131   76.58     Short-chain dehydrogenase TIC 32, chloroplastic-like protein OS=Gossypium arbore...
Q0VH86_GOSHI       3.5e-131   76.58     3-ketoacyl-CoA reductase 3 OS=Gossypium hirsutum PE=2 SV=1

Match Name         E-value    Identity  Description
AT4G11410.1        1.1e-120   69.01     NAD(P)-binding Rossmann-fold superfamily protein
AT4G23430.2        2.3e-118   70.06     NAD(P)-binding Rossmann-fold superfamily protein
AT4G23420.3        5.4e-115   69.74     NAD(P)-binding Rossmann-fold superfamily protein
AT5G02540.1        2.1e-90    54.84     NAD(P)-binding Rossmann-fold superfamily protein
AT2G37540.1        5.1e-89    54.87     NAD(P)-binding Rossmann-fold superfamily protein

Match Name                          E-value    Identity  Description
gi|659102252|ref|XP_008452031.1|    8.3e-158   91.77     PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic-like [Cucumis melo]
gi|449457572|ref|XP_004146522.1|    4.1e-157   91.14     PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic [Cucumis sativus]
gi|1009156147|ref|XP_015896091.1|   3.7e-134   80.38     PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic [Ziziphus jujuba]
gi|595847686|ref|XP_007209377.1|    4.1e-133   78.48     hypothetical protein PRUPE_ppa008889mg [Prunus persica]
gi|823143979|ref|XP_012471805.1|    3.5e-132   76.90     PREDICTED: short-chain dehydrogenase TIC 32, chloroplastic-like [Gossypium raimo...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR016040   NAD(P)-bd_dom
IPR002347   SDR_fam
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015995 chlorophyll biosynthetic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0019685 photosynthesis, dark reaction
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016630 protochlorophyllide reductase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG14g02170.1   Cp4.1LG14g02170.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                           Source    Source Term         Source Description                     Alignment
IPR002347   Short-chain dehydrogenase/reductase SDR   PRINTS    PR00081             GDHRDH                                 coord: 278..294 score: 1.5E-15; coord: 152..169 score: 1.5E-15; coord: 319..338 score: 1.5E-15; coord: 342..359 score: 1.5E-15; coord: 228..239 score: 1.5
IPR002347   Short-chain dehydrogenase/reductase SDR   PFAM      PF00106             adh_short                              coord: 152..293 score: 2.2
IPR016040   NAD(P)-binding domain                     GENE3D    G3DSA:3.40.50.720                                          coord: 149..407 score: 1.6
IPR016040   NAD(P)-binding domain                     unknown   SSF51735            NAD(P)-binding Rossmann-fold domains   coord: 150..411 score: 2.53
None        No IPR available                          PANTHER   PTHR24320           FAMILY NOT NAMED                       coord: 99..435 score: 1.6E
None        No IPR available                          PANTHER   PTHR24320:SF71      SUBFAMILY NOT NAMED                    coord: 99..435 score: 1.6E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene              Organism                     Block
Cp4.1LG14g02170   Cucumber (Gy14) v2           cgybcpeB324
Cp4.1LG14g02170   Cucumber (Gy14) v2           cgybcpeB619
Cp4.1LG14g02170   Melon (DHL92) v3.6.1         cpemedB217
Cp4.1LG14g02170   Melon (DHL92) v3.6.1         cpemedB230
Cp4.1LG14g02170   Silver-seed gourd            carcpeB1301
Cp4.1LG14g02170   Cucumber (Chinese Long) v3   cpecucB0254
Cp4.1LG14g02170   Cucumber (Chinese Long) v3   cpecucB0277
Cp4.1LG14g02170   Cucurbita pepo (Zucchini)    cpecpeB196
Cp4.1LG14g02170   Cucurbita pepo (Zucchini)    cpecpeB233
Cp4.1LG14g02170   Cucurbita maxima (Rimu)      cmacpeB714
Cp4.1LG14g02170   Bottle gourd (USVL1VR-Ls)    cpelsiB187