Chy7G142440 (gene) Cucumber (hystrix) v1

Overview
Name: Chy7G142440
Type: gene
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: prolyl 4-hydroxylase 1
Location: chrH07: 18662484 .. 18675127 (-)
RNA-Seq Expression: Chy7G142440
Synteny: Chy7G142440
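The Location field packs the chromosome, start, end, and strand into one string. The following is a minimal sketch of how it could be split into parts and how a span length could be derived from it. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'chrH07: 18662484 .. 18675127 (-)'.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Inclusive span, valid only under the 1-based assumption above.
        "length": end - start + 1,
    }

location = parse_location("chrH07: 18662484 .. 18675127 (-)")
print(location)
```

Under that assumption the feature spans 12,644 bp on the minus strand of chrH07.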
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTTCCGCTCAGATGAGGATTGTCTTCGGTCTCTTGACATTTGTCACCGTCGGCATGATCATCGGTACGCATTTTCGTTTCTCTCTGTCGGTTTTTGTTGATTTTATGTGTATGTGTGGCGTGTGGCGTGTGGCGTGTTATTTACTCATTCGATGCCGAAATGGAGATTAACAAGAACTTGAGTGATTAGTGGCTTTATGCTTTGTTGCGGTATTCGTCAATGTATTTTTAAATCTGAGTTGTGATGTTTTTCTGTGAGCTGATTTCTTTAGAGGAGCGAGGAATTGTTGGTACCATCAGGGAATTACGTTTTCTTCTTCTTCTTCTTCTTCTCATACTTTGTAATGGTGCACTTGCCGGCTCAGTCAGAGATTGGCAGCTACACAGTGTTGTGTTGACGACTACTTGTAAATGTTCATTTGCTTTTCATTTTCTGGAATTTAGCTCTTCCTTGGATTTGATTTAACGGAAAGGAAAACACTCTTGACTCTCCTTTTTTGAGTTTTTCTATGATTAAACGTTGCAGTTTTTTATGTTTGCTATGAGTTGAACTATTTGAATGATGAATCTTTGACCATCTGGCTGGATTCTTTGTTCGCATATGGTAACATTTTGGTGACTTGATTGTGTTCCTGTACTTCTAGAGAATTATGCAAAGTTGTGATTGATAAGGTGCTTTCTTGAATGGTAATTGCAGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATCGGTACTCTTCTTGGTTTATTTGTCATCTAGGACTTGAAATTATAATGACAGTATCTTATATTTGGAAAGACGATGAGCACTTTTGGGGTTGGATTTGAATCTCTTTGAAATTTTTATTTCGATTTTATTTTCATTCATGTCTTTGGGGAAGTGTTGTCATCATGTCCACCCATAACTAAATAGATATGAGTAGACATGATGACAACAAACGTTCATTAATAAAAGCATCTGTAAAATGTCGAGGAAATAGCACATGGTAACAGTCAATATGCAGTATGAGTTAACTTATTTTGCTTCAAACTAATAGAAGAAGAAGTTGTTCATGAACATATTTGAAATCTCTTAGAAACTCAATTCTGTAGCATGCACCATAGGCATACAGATATTGCAAGGCCAGGTTCCATGAAAAAGACAGAAAAATGTTCAGGTGTAAAAACTTTCCCGCCAGTATTGTAAGTTTCTATTTTCTTCAATTGGACTAAAAAAATTTCTACGGTGAGTATCTGGATTGGACCAAACACTTGGACTTTGGAGCACATAGGAGATAGTTTAATTTATCTAAGTTGCTGCATCATGACAGTTCAATTTGAAATTCATTGAAGAAAAAGTTCTTACATAGATATTTTAAAATTATCTGAAGTAGTTTATGTATCTCACGAGGATTTGAAATATTGTTTGCATACCATAAAACTTATGACTTCCTGGTAATTGTTTGGAAGAATGACCAAACGATGTAGACACTTTGCTTAGCCCAGAGCTCTCGACGTATCCATGCTTAAATAATATTACCTCTCTTTGCTGACAGCATGACATTTACTGACTGGTGGATGTAACCTATATGTTTATGGGGCATGGTTTGTTTTGGTATTACTACAATGATTGACTTTTGTATGATCCATGTCTTTGCAGGCACGGAGTTTCTACCTGCTGGAAGGTTACATAAAGCTCAGTATGATAGCCAACATCAATTGCCCCGAGGTTGGAAAGATATTATAGGGTCAAATTTTACACAAAAATTTTCCCCTATTAAATAACATTAGAATTGTTTTAAAGGGAACATATCTTTCTTCGATCATTTCTTTGATTTAGCATGGCTGAAAGTTGTTCATCTATGTTTTTGCTTCAAAAGTGGTTTTTAACGTCTGTTTATTTCATATGATGAAGAAAAGACTTGATGGAAATTTGACTTAAATTACAACTTCCAATTTCTTTAGTCATTTTTGTTCTGTTCTAGATTATTAATATAGGAATAA
TTCATCTACAGTTTTTTACTGAAAGTTGTTTCCATCTTCATGTTTATAAAAAAATATGATTTCATCATTTTTTTATTTGAAAGCATATTATATTTTCATTAAAGGACTATACATAACTTTTGTTTGGGATAATCATCATGAGTTGGCCTAGTGACTATAAAGGATTAGTGACTGTAAAGGATCGTGAATTTAGTAAAGAACTTAGAGGGAATCAATTCAAACCATGGTAGGCACATACCTATATATACTACAATTTTCTTCGGCATCCAATATTGTAAGGTCAAACATATTATCTCGTGAGATCTGTCGAGGTTGCCGTAAGTTGGTCATACAGATATAAAAAATGTTTGTTATTGCCTTACTTTTACTCTGGACCAATTAGAGAAGCTCCTTTTTTATTTTTTATTATTTAAAACTGACTCTGGTTGGTTGTACTCCTTATCCTATCTTTGTAACTTTTGTTTCCTAATTTTAAACAATATTTCTTATCTAGAAAGCATCAATTCTTGTTGGCCTATTGGACAATAATACTGACTACTTATTTATTTGTTATTATGTTTTTAGAAGATTGATATTGGTATTAAGGTTCTCATCGAATTAAGTTCATATTAGAACGAGATTTAGCAAACATAATGAAATTGTTCCATTATCAGCATGTTTTCCACACTGATTTATTTATTTGTTTTTATTTTGTCTTGATGATTCTCCTTTTGCTAATTTTCAGGCTTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTATGCTTATCACTATTTTTCTATTTTTGACCGTCAACGTTTTTTTTGTTTTGGGATCTTTGGATGCCTTGTTATTAAAAAAGATAGGATAATTTTATACAAAAGAAGTTTGTTTATCCTTCGTAAACCAAGTAAAAGTTAAGAACCAAGCAGATTGGGGGAAAACATACAATACAGTTGTGTCGAAGAAGAAATGAGTAAGTAGGCTAAAGAATACCATAATTCGACAGTGTCCTGGACACAGCCACCCACACACACAATAGCCTTGGAATCAAGACAAGGGATTAATTACCAATAGCTTGAAGAATCATTCCATCTAGAAGCACGAGAGAAGAAAAAACTCGCATATTATATGTATACGCAAGTGCTCGATTCATTCAAATATGAATGTAAAATAATTGAAACATTGTTTCATATCTCATGACATTTAAATATCCATGTGCAAAACAAAATGAGAAATATTCTCTTGAAGCGAAAAAGATAGTCAACTAATGATTTGGAAGCAATTAAGAAAATTTGGAGTATTTATATTGTGTCTTTTCCAGTAACACGTCTGCTCTTTTACATTGGTGGACAGTCTTTTCTAACATTTAAGTGAAATTTATTGATCCTACACGTGTTTTTGGGAATAATTTGTAGGCAAGTTTTCCATGTGATCAATTAAAGTTTCCTGTCCTAAAAGAAAGTTTCCCTTTTGTTTTGTGGAATAGGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATAGTATTGCATAATTTTTTGAGCACAGAGGTGAGCATAATCCCATCGGAATGGTAATTCTTATTCTGAATGTTATCGTTTCTCTGCAGATTCTTTCCTTTACATAGCTTTGAATATAGAAGAACTTTCAACTAAGTGACTTATATTTGTATATATGTATTATCTATATATATATATAAAGAAACAGTTTATTGATGAATGAAATTTATGGAAAATCTACAGGATTACATAAAGCTTTCCCAATTAGCTAAATGGGAAGCGTACCTGTAGGCAGTAAAGTGGATTGGCAATTTACCTCAGATTGTGTTGGATTTTTATCATTTTCTCACCAGGAGACCTTTGGATGGCTCCTTACCATGGTGATGTGGTACTCGACTTTTAAAAATGGATCGATTTGTAGGTTAGGGTGGCCTTTGAAGATGAATGTATTTGTACTTCCATCATGCCAAGGCCAAAGCTGTACACTTCACCTTGATG
CTTGCACTTTGATAGTCTCTAGTTTGGACGAAGTTTTTTTTTCGGTGGATCAACATAGCCTCACGAGTCTTTTTGTCCGTATCAGGGGGGTCCCACTGGTCTTTTTATATTAATCCAACCTCCGTGGACAGAACCTGGGTTTCTTCTTCTATAGGAATTTCATGTTCAAGTTTTAGGAGGAAGAATCCAACTGCATACAAATTCTCTTTGGTGCATAGGTGGCTTGCTTGCTTTTCATTTTTATATTGGCGGAGGGTTTGGTCCCTTCAGAGGCATCTGATAAAATTGGAAGAAAGAGAGAAGATAAGAATGACCCTTTCGAAGGATCACATTTATGCATTCGAGGACGGCTTCATTGGATCGAAAAGGATTCAAGAAGCGGAAACCAACTGCACATGATTTTGTTCCAATCATCATGAAAGTGCTTTATCTACCACAATGGAGGAGTCCCACTGGGTTGAGGCATGCTGACTGGAAGGATTGGGATCCTCCATAGACATTATTGTTGCTCATGGTTAAAGACCAATTGGATGACGACCATACATGAACAGGAGTTGTTCTATAGGCTATACAATTGTGTTCTTTCCACTTTGATGGTAATGACAACAATGTTCCTAACACTTCTGTAAGATAAATGGCCATTTTCAATCGAAGTCCATATAATTGGAAATGATCATGGGATATTTGCCCTTCTTTCCCTTTTACACAAGATAGTTGCAAGACTACTTGGTTCCTTGTACTTTTTGGCTGGAAAGGAATTAAGAAATGTAGGATTTTAGAAAGTGGATTAAGATTAGGATTCTAGGATAGAATAAGACATGAGTTGGGACTTGGGATTTTGTAGTCTTGATGCATAGAAATAAGACCACCAGTTATTGAACAATCATCAATGCCTAGACAAAAGCTGGTAGGGGAGGTTTTTGAAGATTCTAGAAAGGAAACCTTAATTACCAAATTTATTGAACAATCACCAGCTGGTGGAGTTAAACAAACTGCCAAAAAAAAAATGATGGTGGTTTTCAGCAATTTGGTTGCGAAGACAACACCAGTTAGGCCAGATGCTTAGATTGAAATTCTCTTAAGAACTTGTTCATTTACATTGATGCCAACCGAAGCCAAACACCATGAAAATAATTTAATGTGTTGAAGATATTCTATATTTAGTTTGCATTTTGATAATTTCTTTCTGGAGAAGTTGGTAACCTCTAGACATTTGTCTAAAGAGAGACTTGCTTGTAAAACTTCCTAACTTATCATCCAGCCACCACAGCTGGGTATCATGCTGAGGGATAGGTTTTTACCCTCAAACTTCTTAAGAAAGAGATATCCTTTCAGGAATTTCCTACAGAACAAGATTTTATCATTGTGGTTTATTTTTTTTCGCTTTTAAATCTCAATCCCAATGGAATTGCATCTTCTTTTTTTGTTCCAAAAAAATAAATCTCAATTCTAATGGAATTGCATCAGTTTTAGTTCAAGGAGCATATATTTGGATAAGAGGGACATGATAGCTTCTGTTTTTACTAATTTATTAATTGTATAGGAGTGTGACTACCTTAAGGGAATAGCACTTGCTCGCCTTGAAATTTCCACTGTCGTGGATACGAAAACCGGGAAGGTATGCACTTTTTCCCCCTTCCTTTGTATGATCACCTGTCTATTAGTTTAAGGCATGATAGATAATTACTTGTCTATACACTATTTTTGCTCTCAACTTCTATGAATGTTATCTGTTGAAATAGACCCTACTTACTGAAATGAAAATGAATTAACTATGGTCATATATTGATATTTGGTTCTACTCCAATTCTCTACACACTTTAAAGTTTGAAGTATCTGTGTTTCATAGATAGACGAAGTAAAAAGAAAAAAAAAAGAAAAAGGGAAAAGAAGAAAATGAAGAAAAAACATCAATGCAGTGTCTGGTGTTTGGTTTAGAAATGGAATTGGGTTTTTGAAAACCAAAAATTTTCATTTCTGAAATTTCTTTATAAATTTAA
AGATGTTTCTAAAATGAAAAAACCAAAAACCGAATTTGTTAGCTAATAAACTTCTTTATCAACAACCAAAAAAAAAAAAAAAACTAATTTGTTATTTATTATTATGTTTTTTTGTTAAAAAAACTAATTTGCTATCAAGCTACCAGTTCGTAATGTTTTTAGGATTTGGGAAAAGAACAGATGGTATAGGTAAAACAATCTCAAACTTTCATCTTTCTGGCAATTTTGGGATTAAAACATTGTTTCAACATCAAACATTAGGAAATTACTAGATTGTATACCCCTTTTGTGAATTTCATACATCAATGAAATTGTTTCTTATAAAGAAAACCTTAGGCAAGGATCTTGTGGGGAGGGGGTTGTCTCAACCCGTTACCTTTTGTCTGTTTCCCCTTGTTTTTGGTGAATATATATACATACACACACACACATTCTTGTTTCCTATAAAAAAAAAGTCAGCATAGTTTCAAACCTGAGAGAGAGGAGGACCCATCCAGGGAAATCAAAGCTAGAATGTTGCAAATTTGTGCCGACAGTTATGATAAGATTAAGCACCACTGTATTTTCAGCAAAGAAAGAAGATAAAATATTGTTGTTTATGAGTAGCTATATAAATATATTGAAATGCTAATACTTCTACATTTAAGGTTACTTTATCTTGCGTGGTTTTTTTTTTTATCTCTTGATAGATTCTATATTTCAAGATTGTGATTGATAGTTTTGCTTCGTTTCTTTTGTAAATGGCATTATTTTTTCGTATTGGAGGACCCATTTATTAATGTTCTTTTGGTTGAATGATATATATATTTTTCCTATTAGAAAGATATGTTAAATAAATGTGTTCGCTCCCTTGTACATTTATTTTCAGGGCGTTAAAAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTTTCCAATGGTCCAGGTATTAATTACAAAGAAAGTGTTTGGTATTTTGATGTTTCAAGTAATGGATATGGATGTGCTCAATAGTAAACTTTATGGAAAAAAAGCTTTCAATTTAGTTAAAGGTATTCTTCCGGATAAATTTACAAATTTGTTCCACTTGTTTTTGACAAGATTTTTATTTAAAGAAACATAAATGATCATGTACGACTGATTTACAGTATAGTCAATAGGGATGCTAAGCAAATGTAGTATATAGTTTGTTTTTAAACTATAGTCTATTTCACAAACTTCAAATTCTCATTAACTTCAAAGTTCAAATAGCCAAGTTCAGTCCTTAGAGTGACAAAAAACAGTAGCAGATTTTAAACTAGTAGTCACTTAACAGTAAATAAGTGTTATTTCTATCCAAAAAAAAGAAAAGATAAAACGGACAAGGAAAAGAAAAAAAAGAGTAAATAAGTGGCTGACCAAAAACTTAAAATTTACAGCATCAGTTACCCAACATTTACCCCACGGCCAAACACAAGAATCTAAAGATTTATTAAAAGATCTTGGAAGAAATTTTAAAGAAACAGTTCAAAAATTTCGAAAGCAACAGATCGCCTTATTTCCTTCCCTCCAAAGGAAAGCTTGTCCTTAAGTTTTAGGCCATCAAATAATAGGTTTCAGGAGATTATTTAAAATTCAGCAACATTGAAGGTGGGTTTGATCTTGAGATTTTGAGTTAGTTACTTTATATGCACTGCCACCAAATCTCTTAAGATTTTGAGAAGGCCTAATCTTCTTAGGATGAAGCTTAGAGTAGAACCAGAAGAAAATCTTGTTAACTTGTCATCACAAGTTATCAAATTTCCAAACTTAAACTATTGTTTGCCCTCCATGTTGGAAATTTACGAGATATAAATAAATGCGGAATAAACCAAATTATATTTCAAACTCCCACTTTATAGGTCAGAGCCAAAATAAGTGATTGATCATACAATAACAAATAATAAAAAGTAGGGAAAAGAAGAATGATACCGAAAAATATACTGGTTCGTCTCAAACTCAGAACTACGTCGAGTCTTCTACTA
CTGCATGTTCTGCTACTATGGAGATTGCAAAGAACCCTCAAACGTTGCTTACAAATTCTTCTAAATAGCTTAGAGAAACTCACGACCCAACAAGAAATTATACTCTGTAGTTTTCTTTTCCTCATTGCCACTCTAAAACGACACACTGAAATTTGAATAACTTCAAACTGAAGTCTTCAAATTGCCTTCTGGTGCATTTTCTCCACTTCCGGCGTGAAGGACCTCAGGGCAGACCTCCTGCAGTTCTTCTTCTTCAAACACAAACATATATTTATACGTGTGTTGCTGCCAAATAAGCCAAAGCTCAATAATTATTTGGTGTCTCTATACCGCCCCAAATTCTTCTTTTAAAAAAAAAGTCACCTTTATTATTATAATAGCTAATTAAACAATTTTGCTCACCACTCACATTACCCAGTTAAAAACAAGCTTATGGGTTAGTAAATGAGTTGGATTTGGAGTTTTGCCATTCCTAAAACGCAAGTAATGCTGCAAAAATTATTCTAAACTCGGTGCGCTAACCACACTGCATTTCCTTGATGTATTTTTACTGTTCTGTCTCACAATGCGTTGTGAAAATGCCAACTATTTGAGATTAAACATTTCCTCAAGGCACCAATTTTTGCCTTTGCCAATTTCACACGGCTCTCTTATTACAGTTCTTCCTGAAAGTTAACATACTTTCTTCATCCTTCATATTACAGCCCCATGTTCTTATTTTTTGGGGTTCTCTCTCTCGGTTGGTGATAGTCTGAGACTCTGATAGATTTCCTGTGTTTTGGAATTTGTGAAAGCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATTCCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTGTGGATATAATTTTGTTTCTGTAAGCACGTGAACACAAAAAGTGGTTTGTATTTCAGTTTTACGTCTAACAGTTTAAGCAGTACAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTGTAAGATTTTTGTTATTTCTTCAGCCATGAGCCCTAGTATTCTGAAATCAGTATTTACTTATTTTTTTCTTTAAAGTAATGTGATCTGGTGCTCTCGCTGATGGCACGTGTTGTTTCTTCCATGTCTTCTTGTTACAGTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACTTACTTTCCGAAGGTATTTACCATTTTTCTTGCTGAATTAAATTTCAGTATTTAGATGGAGTGCTATGACACACGGCAGAATTTAACATGAATTTGAAAGACAAAAAAATGGCAGCTTTTCCCATTTTAAATCTTGTTTTCTTGTCGGATCCAAGTTTCGTGGAAGTGTCGTGTCCTAAGTTTTCCCTTTGCATATTTTCTTCAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTCAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGTATGTACCTCTTGCCACTTAGAATGACATTAGTATGATGTGAAAATAAAATGATTATGATTTTGACTGCTAAATCAAGAAATCGACACCCTAAGGGATGCCTTATGCTAGATAAGGGCTTAAGATTTGTTTCAGAGTTACCCTAGAAGTAGAAGTTGTGGCCTAGGACCCCAGCCTCAATTCAAATTTCAAAAACTTAATAACTAGGAGATAGTTTGCCTATGTTTCATTCTCATTTTTCGTATTTCTTTCCAAAATTGTGTAGAAATTTTGGAAGTAGCAAATTTGAGTCTCTTTTTTCTTGTTTATGTCTCTGGTTTTTCTCTAAAACTAAAAATTAAATTACAGCATTATTTTAACATTCTTGACTTATAAAAGGGATAATAATCAAATAATTAACATAGTTTTTCGTTAGGATATTTAACTTTGAAAACCAGTTGTCTCGAACATATATTACTTTTTTAATCTAAAATGTTTTA
GAACAAGAAATGATTAAGAAAACAAGAAACGTTTACCAAACACAATTTTGTTTTTCTTTATTAAAAAAAAAAGTAAAAATGACAACAAGAACATGTAAATTGGAAATAAAAAACAGGAACTTCACCAAACGAACAAGCCCTTACGTTCTTTCTGGACTTTTGGAGTTGATCTTAAACTCAAAGTAGATCAAGAGATCAAAGTAAGAGTAATCCTTGAGTATACTAATAATAATGAAAAAGGAGGTACCCTGTTCAAGAACATAAAACCAACCAATGCTTAAAGCCTGTTTGGAATGAACTCCAAATGGAAGGCTGTTGTTTTAAACCAATTCCTTTGATATGTAAATGGTTCAGGGGTTAGATGGGCAATCAGATCCAAAGAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAATTCATACTTTCTAGTTTCATTTGTATTGTATATCACAATTGAATATTTTGTTACACATATCAACTAATAAATTTATATATATATATAGAGAGAGAGAGAAAGAAAATGGAGAGGCTAATTTAGATAGCGTCTTAACATAATTAATCACACTTAGAATGAATCAATTTATTTAATGAATAATACAGTAGAAGCAACATCTGATGTTTCTTTGTTCTTTTGTCAAAGTGTGGAATTTCTCTATATTTTTCATGCATGATTGTTATAAGGTTAAGTCTCAATAACCAACAAAAGTTTCTCCAAGAACTATTGAGAGGATGATTTAGAAATTGCGATCACATGTAATGTAACAGAATGTGTGAAGGGTCCAGGGGAAATATTGTATGATATTTATATAATATGTCATTTTATGTGGGAGTTATTTTTTATTTGGTGGTTTTAATTATATTATCAGACTGAGAAAGCTTCATTTTAGGCTGGAAGTTTGTTTCAAAATAATCTTGTAGGTTTTTGTGAGTTCAATTAATAGGTGTCTAGTATGGTTTGTACTTTGTTTTTAAACGTTTTTAGATTTTGTAACCTAAATTTAGACAACCAACTTTATGGTTTCTATAGTTGAGTTATACTCACTTAAGCTTTCTTTGAGAACTACAACTAATACCTTATAAGAAAAAAAAAACAAAAAATTAAAGAGCATATCTCAAATGATATAGTGTTAGTTTTAACTCTTGAGATTTTGACAAACTAAAACAAAATTGTAAAGTGTGAAGTTAATCATCGCACACTTGAGATCAAGGACAATGGAACTTTTGTCACACAGATGATATTAATTTTGTTCATATCTTCAGTGATACAATTTTGTTTTACCTAAATGGTAATTCATTTTCATATGTTTTTTCGTGTTCTTTTCGATCATGCTTATAAATCGAGCAACATGTTAAAACATCTTAACTTTTAAATGATTTCACAAAGTTTAAACTTGGAACATTTTTGGTATAGATGTTTTGGTGTTATATGAAGAAACATAGGTAGAAAGTGATTCACTTCAAATGGAGGAGCAAAGTGGGTATTTCACTTCTTCAAAGATAATAAAACATGTTTTAGGCAAATAGGTCAATTACATATAAGCTTTTGGGTTTTGGGTTAGTTTTTCATAATTTTCTTCCCAATGATACTTACCTTTCACATAGATCTTCAACAAATCAAATTGATCTCACATGCTCTATCGTGTTAGAGATGTTTACGAGTAGGGTGGGGTTGGAGATGGCTTCCATTTTCCCAACTCTGCTCTCCATTTTATTTTTCATCCCCCCAACATTACTTGTCCGTCCAAATTAGTGTGATTGGGTAGGGAATTGAGAAGTGGGTTCCCCATGTTTATGTAAGAACTCTTACTTCCTTTCTCTACTTTTTGCATTCACTTCTTCCTCATTTTTTCTTTCCTCTTTTCTCTCGTTTGTTCTCTTTTGATCCTAAAATCATTATGTGGAATGAATCTGAAAGGTCATATCTAAAACGAAGGTCAAACTTAAGAA
AGATGTTTTAGATAGAGAAAAAGAAGACAAGAGGGAGAAGAGGAAAAAAAAAAGGAAAGAGTAATTTAATGAGATACATGGCCTCAAATTTAATAATTTAGAATTTTAATTTATCTTAATCAAAGTTTTATGAGTCAGAAAACTAAAAATTCCACACCACCACAAAATCAACCAAACATTTACTCAAATATGGAGCATAAAAACTAGTAAATAATAGGAAAGCTGAGGTACAAACAAACAATAATACTACTAATCCAACACTAAACTATGGTTCAAACTTTAAAGGAAAGAGACACACATATCCTTTTACAGCAATTACTATAAGCTTTTTAACTTATTGATCAATCACGACCATAATAAAACATAAAAAAAAAAAAAAAAAAACAGTTCCACAATTGAACACACACTTCCTCTAGCAATTTGGAGACTTAGAAATGTTGCAACGCTTAGGCAGCTCCATTGCAAGTTCAGGGTTAATGCCAAAAGAAGAAAGAGCTCCCGAGTTCCGGTACGCACAAAAGCAGTGCAAGTCGGCGTGCATAAGTGCCTTGCAGCATTGGGTGGTCGGCGGTGTCGGGTTTGGAGGTGTAACCGACGGTCGACATTCATACAAATCCGCAATTGGCATGTTGCAAATGGATTGA

mRNA sequence

ATGGTTTCCGCTCAGATGAGGATTGTCTTCGGTCTCTTGACATTTGTCACCGTCGGCATGATCATCGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATCGGCACGGAGTTTCTACCTGCTGGAAGGTTACATAAAGCTCAGTATGATAGCCAACATCAATTGCCCCGAGGCTTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATAGTATTGCATAATTTTTTGAGCACAGAGGAGTGTGACTACCTTAAGGGAATAGCACTTGCTCGCCTTGAAATTTCCACTGTCGTGGATACGAAAACCGGGAAGGGCGTTAAAAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTTTCCAATGGTCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATTCCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACTTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTCAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGGGTTAGATGGGCAATCAGATCCAAAGAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGCAATTTGGAGACTTAGAAATGTTGCAACGCTTAGGCAGCTCCATTGCAAGTTCAGGGTTAATGCCAAAAGAAGAAAGAGCTCCCGAGTTCCGGTACGCACAAAAGCAGTGCAAGTCGGCGTGCATAAGTGCCTTGCAGCATTGGGTGGTCGGCGGTGTCGGGTTTGGAGGTGTAACCGACGGTCGACATTCATACAAATCCGCAATTGGCATGTTGCAAATGGATTGA

Coding sequence (CDS)

ATGGTTTCCGCTCAGATGAGGATTGTCTTCGGTCTCTTGACATTTGTCACCGTCGGCATGATCATCGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATCGGCACGGAGTTTCTACCTGCTGGAAGGTTACATAAAGCTCAGTATGATAGCCAACATCAATTGCCCCGAGGCTTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATAGTATTGCATAATTTTTTGAGCACAGAGGAGTGTGACTACCTTAAGGGAATAGCACTTGCTCGCCTTGAAATTTCCACTGTCGTGGATACGAAAACCGGGAAGGGCGTTAAAAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTTTCCAATGGTCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATTCCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACTTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTCAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGGGTTAGATGGGCAATCAGATCCAAAGAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGCAATTTGGAGACTTAGAAATGTTGCAACGCTTAGGCAGCTCCATTGCAAGTTCAGGGTTAATGCCAAAAGAAGAAAGAGCTCCCGAGTTCCGGTACGCACAAAAGCAGTGCAAGTCGGCGTGCATAAGTGCCTTGCAGCATTGGGTGGTCGGCGGTGTCGGGTTTGGAGGTGTAACCGACGGTCGACATTCATACAAATCCGCAATTGGCATGTTGCAAATGGATTGA

Protein sequence

MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLQFGDLEMLQRLGSSIASSGLMPKEERAPEFRYAQKQCKSACISALQHWVVGGVGFGGVTDGRHSYKSAIGMLQMD*
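The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates that relationship on the first and last few codons of the CDS. It assumes the standard genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical and not part of any tool used to build this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven and last four codons of the CDS listed above.
print(translate("ATGGTTTCCGCTCAGATGAGG"))  # MVSAQMR
print(translate("CAAATGGATTGA"))           # QMD*
```

Passing the full CDS string to `translate` would be the way to check the whole protein sequence against the record.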
Homology
BLAST of Chy7G142440 vs. ExPASy Swiss-Prot
Match: Q9ZW86 (Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 1.1e-121
Identity = 211/282 (74.82%), Positives = 246/282 (87.23%), Query Frame = 0

Query: 6   MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPN 65
           M+IVFGLLTFVTVGM+IG+LLQLAF+ RLEDS GT F P+ R  + Q     +  R    
Sbjct: 5   MKIVFGLLTFVTVGMVIGSLLQLAFINRLEDSYGTGF-PSLRGLRGQ---NTRYLRDVSR 64

Query: 66  WINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTG 125
           W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS EEC+YLK IA  RL++STVVD KTG
Sbjct: 65  WANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTG 124

Query: 126 KGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPH 185
           KGVKSD RTSSGMFL+H E+++P++QAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPH
Sbjct: 125 KGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPH 184

Query: 186 HDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKG 245
           HDYF+DTFNLKRGGQR+ATMLMYL++++EGGETYFP AG G+C+CGGK + G+SVKP KG
Sbjct: 185 HDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKG 244

Query: 246 DAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST 288
           DAVLFWSMGLDGQSDP+SIHGGCEVLSGEKWSATKWMRQK+T
Sbjct: 245 DAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQKAT 282

BLAST of Chy7G142440 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.0e-55
Identity = 115/209 (55.02%), Positives = 137/209 (65.55%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           EV+SW PR  V HNFLS EEC+YL  +A   +  STVVD++TGK   S  RTSSG FL  
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR 136

Query: 143 HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 202
                 +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+
Sbjct: 137 GRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRM 196

Query: 203 ATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSM 262
           ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM
Sbjct: 197 ATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSM 256

Query: 263 GLDGQSDPKSIHGGCEVLSGEKWSATKWM 283
             D   DP S+HGGC V+ G KWS+TKWM
Sbjct: 257 RPDATLDPTSLHGGCPVIRGNKWSSTKWM 280

BLAST of Chy7G142440 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 1.8e-55
Identity = 114/212 (53.77%), Positives = 137/212 (64.62%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           E++SW PR  V HNFL+ EEC YL  +A   +E STVVD KTGK   S  RTSSG FL+ 
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLAR 138

Query: 143 HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 202
                  ++ IEKRIS ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRI
Sbjct: 139 GRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRI 198

Query: 203 ATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFW 262
           AT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFW
Sbjct: 199 ATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFW 258

Query: 263 SMGLDGQSDPKSIHGGCEVLSGEKWSATKWMR 284
           SM  D   DP S+HGGC V+ G KWS+TKW+R
Sbjct: 259 SMTPDATLDPSSLHGGCAVIKGNKWSSTKWLR 283

BLAST of Chy7G142440 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 1.9e-54
Identity = 114/208 (54.81%), Positives = 139/208 (66.83%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           EV+SW PR  V HNFL+ EEC++L  +A   +  S VVD KTGK + S  RTSSG FL+ 
Sbjct: 81  EVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNR 140

Query: 143 -HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 202
            H++   +V+ IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQR
Sbjct: 141 GHDE---IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQR 200

Query: 203 IATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSM 262
           IAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM
Sbjct: 201 IATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSM 260

Query: 263 GLDGQSDPKSIHGGCEVLSGEKWSATKW 282
             D   DP S+HGGC V+ G KWS+TKW
Sbjct: 261 KPDASLDPSSLHGGCPVIKGNKWSSTKW 283

BLAST of Chy7G142440 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 3.7e-53
Identity = 113/208 (54.33%), Positives = 138/208 (66.35%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           EV+SW PR +V HNFL+ EEC++L  +A   +  STVVD KTG    S  RTSSG FL  
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRR 140

Query: 143 -HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 202
            H++   +V+ IEKRIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN K GGQR
Sbjct: 141 GHDE---VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 200

Query: 203 IATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSM 262
           IAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M
Sbjct: 201 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNM 260

Query: 263 GLDGQSDPKSIHGGCEVLSGEKWSATKW 282
             D   DP S+HGGC V+ G KWS+TKW
Sbjct: 261 RPDASLDPSSLHGGCPVVKGNKWSSTKW 283

BLAST of Chy7G142440 vs. ExPASy TrEMBL
Match: A0A0A0KU17 (Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G017130 PE=4 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 1.0e-162
Identity = 285/288 (98.96%), Positives = 288/288 (100.00%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           MVS+QMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP
Sbjct: 1   MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST+ECDYLKGIALARLEISTVV
Sbjct: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 289
           KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. ExPASy TrEMBL
Match: A0A1S3BXE6 (prolyl 4-hydroxylase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 6.1e-160
Identity = 281/288 (97.57%), Positives = 283/288 (98.26%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLP
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIAL RLEISTVV
Sbjct: 61  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 289
           KPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. ExPASy TrEMBL
Match: A0A1S4DZZ9 (prolyl 4-hydroxylase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 1.8e-156
Identity = 281/307 (91.53%), Positives = 283/307 (92.18%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLP
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  R-------------------GFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTE 120
           R                   G PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTE
Sbjct: 61  RVTVKNREFSKELGGNQFNPGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTE 120

Query: 121 ECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYS 180
           ECDYLKGIAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYS
Sbjct: 121 ECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYS 180

Query: 181 QIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFP 240
           QIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFP
Sbjct: 181 QIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFP 240

Query: 241 KAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKW 289
           KAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKW
Sbjct: 241 KAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW 300

BLAST of Chy7G142440 vs. ExPASy TrEMBL
Match: A0A6J1CBS4 (prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1)

HSP 1 Score: 547.7 bits (1410), Expect = 3.6e-152
Identity = 262/288 (90.97%), Positives = 274/288 (95.14%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           M SA MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHK QYD   QLP
Sbjct: 1   MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RGFPNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYL+ +AL RLE+STVV
Sbjct: 61  RGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSH EKN+PM+QAIEKRISVYSQIP+ENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 289
           KP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. ExPASy TrEMBL
Match: A0A6J1GWV8 (prolyl 4-hydroxylase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457888 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 7.4e-150
Identity = 261/288 (90.62%), Positives = 268/288 (93.06%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           M SA MRI FGLLTFVTVGMIIGAL QLAF+RRLEDS G EFLPAGRLHK QYDSQHQLP
Sbjct: 1   MASAPMRIAFGLLTFVTVGMIIGALFQLAFIRRLEDSTGNEFLPAGRLHKTQYDSQHQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS EECDYLK IAL  LEISTVV
Sbjct: 61  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSPEECDYLKAIALPHLEISTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFL H +K FPMVQAIEKRISVYSQIP+ENGE IQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLGHRDKKFPMVQAIEKRISVYSQIPIENGEFIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDT+NL  GGQRIAT+LMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 289
           KP +GDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. NCBI nr
Match: XP_004152082.1 (prolyl 4-hydroxylase 1 [Cucumis sativus] >KGN53125.1 hypothetical protein Csa_014405 [Cucumis sativus])

HSP 1 Score: 580 bits (1494), Expect = 2.31e-207
Identity = 285/288 (98.96%), Positives = 288/288 (100.00%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           MVS+QMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP
Sbjct: 1   MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST+ECDYLKGIALARLEISTVV
Sbjct: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 288
           KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. NCBI nr
Match: XP_008453925.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo])

HSP 1 Score: 570 bits (1470), Expect = 1.05e-203
Identity = 281/288 (97.57%), Positives = 283/288 (98.26%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLP
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIAL RLEISTVV
Sbjct: 61  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 288
           KPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. NCBI nr
Match: XP_016901567.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 559 bits (1440), Expect = 7.85e-199
Identity = 281/307 (91.53%), Positives = 283/307 (92.18%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLP
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  R-------------------GFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTE 120
           R                   G PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTE
Sbjct: 61  RVTVKNREFSKELGGNQFNPGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTE 120

Query: 121 ECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYS 180
           ECDYLKGIAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYS
Sbjct: 121 ECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYS 180

Query: 181 QIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFP 240
           QIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFP
Sbjct: 181 QIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFP 240

Query: 241 KAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKW 288
           KAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKW
Sbjct: 241 KAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW 300

BLAST of Chy7G142440 vs. NCBI nr
Match: XP_038904320.1 (prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 555 bits (1429), Expect = 1.84e-197
Identity = 275/288 (95.49%), Positives = 277/288 (96.18%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           M SA MRIVFGLLTFVTVGMIIGALLQLAF+RRLEDSIGTEFL AGRLHK QYDSQ QL 
Sbjct: 1   MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIGTEFLSAGRLHKTQYDSQRQLS 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLK IAL RLEISTVV
Sbjct: 61  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSH EKN+PMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 288
           KPAKGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTL 288
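
The percentages in each HSP header are the identity and positive counts divided by the alignment length. As a minimal sketch of that arithmetic (the helper names `percent` and `midline_counts` are illustrative and not part of any BLAST tool), the values can be recomputed from the header or counted from a midline, assuming the usual convention that a letter marks an identical pair, `+` marks a conservative substitution, and a space marks a mismatch or gap:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


def midline_counts(midline: str) -> tuple[int, int]:
    """Count (identities, positives) in one BLAST midline string.

    Assumed convention: a letter is an identical residue pair, '+' is a
    conservative substitution, and a space is a mismatch or gap.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Header values of the XP_038904320.1 hit: 275 identities and 277 positives
# over 288 aligned columns.
print(percent(275, 288))  # 95.49
print(percent(277, 288))  # 96.18

# Last midline block of the same hit: 48 columns with a single K/N mismatch.
print(midline_counts("KPAKGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL"))  # (47, 47)
```

Summing the per-block counts over every midline of a hit should give the totals reported in its header line.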

BLAST of Chy7G142440 vs. NCBI nr
Match: XP_022137963.1 (prolyl 4-hydroxylase 1 [Momordica charantia])

HSP 1 Score: 545 bits (1403), Expect = 1.68e-193
Identity = 262/288 (90.97%), Positives = 274/288 (95.14%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60
           M SA MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHK QYD   QLP
Sbjct: 1   MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLP 60

Query: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVV 120
           RGFPNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYL+ +AL RLE+STVV
Sbjct: 61  RGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVV 120

Query: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQ 180
           DTKTGKGVKSDFRTSSGMFLSH EKN+PM+QAIEKRISVYSQIP+ENGELIQVLRYEKNQ
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ 180

Query: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240
           FYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTL 288
           KP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTL
Sbjct: 241 KPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTL 288

BLAST of Chy7G142440 vs. TAIR 10
Match: AT2G43080.1 (P4H isoform 1 )

HSP 1 Score: 438.0 bits (1125), Expect = 7.6e-123
Identity = 211/282 (74.82%), Positives = 246/282 (87.23%), Query Frame = 0

Query: 6   MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPN 65
           M+IVFGLLTFVTVGM+IG+LLQLAF+ RLEDS GT F P+ R  + Q     +  R    
Sbjct: 5   MKIVFGLLTFVTVGMVIGSLLQLAFINRLEDSYGTGF-PSLRGLRGQ---NTRYLRDVSR 64

Query: 66  WINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTG 125
           W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS EEC+YLK IA  RL++STVVD KTG
Sbjct: 65  WANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTG 124

Query: 126 KGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPH 185
           KGVKSD RTSSGMFL+H E+++P++QAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPH
Sbjct: 125 KGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPH 184

Query: 186 HDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKG 245
           HDYF+DTFNLKRGGQR+ATMLMYL++++EGGETYFP AG G+C+CGGK + G+SVKP KG
Sbjct: 185 HDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKG 244

Query: 246 DAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST 288
           DAVLFWSMGLDGQSDP+SIHGGCEVLSGEKWSATKWMRQK+T
Sbjct: 245 DAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQKAT 282

BLAST of Chy7G142440 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 218.8 bits (556), Expect = 7.3e-57
Identity = 115/209 (55.02%), Positives = 137/209 (65.55%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           EV+SW PR  V HNFLS EEC+YL  +A   +  STVVD++TGK   S  RTSSG FL  
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR 136

Query: 143 HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 202
                 +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+
Sbjct: 137 GRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRM 196

Query: 203 ATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSM 262
           ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM
Sbjct: 197 ATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSM 256

Query: 263 GLDGQSDPKSIHGGCEVLSGEKWSATKWM 283
             D   DP S+HGGC V+ G KWS+TKWM
Sbjct: 257 RPDATLDPTSLHGGCPVIRGNKWSSTKWM 280

BLAST of Chy7G142440 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 218.0 bits (554), Expect = 1.2e-56
Identity = 114/212 (53.77%), Positives = 137/212 (64.62%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           E++SW PR  V HNFL+ EEC YL  +A   +E STVVD KTGK   S  RTSSG FL+ 
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLAR 138

Query: 143 HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 202
                  ++ IEKRIS ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRI
Sbjct: 139 GRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRI 198

Query: 203 ATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFW 262
           AT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFW
Sbjct: 199 ATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFW 258

Query: 263 SMGLDGQSDPKSIHGGCEVLSGEKWSATKWMR 284
           SM  D   DP S+HGGC V+ G KWS+TKW+R
Sbjct: 259 SMTPDATLDPSSLHGGCAVIKGNKWSSTKWLR 283

BLAST of Chy7G142440 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 214.5 bits (545), Expect = 1.4e-55
Identity = 114/208 (54.81%), Positives = 139/208 (66.83%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           EV+SW PR  V HNFL+ EEC++L  +A   +  S VVD KTGK + S  RTSSG FL+ 
Sbjct: 81  EVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNR 140

Query: 143 -HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 202
            H++   +V+ IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQR
Sbjct: 141 GHDE---IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQR 200

Query: 203 IATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSM 262
           IAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM
Sbjct: 201 IATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSM 260

Query: 263 GLDGQSDPKSIHGGCEVLSGEKWSATKW 282
             D   DP S+HGGC V+ G KWS+TKW
Sbjct: 261 KPDASLDPSSLHGGCPVIKGNKWSSTKW 283

BLAST of Chy7G142440 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 210.3 bits (534), Expect = 2.6e-54
Identity = 113/208 (54.33%), Positives = 138/208 (66.35%), Query Frame = 0

Query: 83  EVVSWSPRIIVLHNFLSTEECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
           EV+SW PR +V HNFL+ EEC++L  +A   +  STVVD KTG    S  RTSSG FL  
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRR 140

Query: 143 -HEKNFPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 202
            H++   +V+ IEKRIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN K GGQR
Sbjct: 141 GHDE---VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 200

Query: 203 IATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSM 262
           IAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M
Sbjct: 201 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNM 260

Query: 263 GLDGQSDPKSIHGGCEVLSGEKWSATKW 282
             D   DP S+HGGC V+ G KWS+TKW
Sbjct: 261 RPDASLDPSSLHGGCPVVKGNKWSSTKW 283

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
Q9ZW86          1.1e-121   74.82     Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1
Q9LN20          1.0e-55    55.02     Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=...
F4JZ24          1.8e-55    53.77     Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S...
F4JNU8          1.9e-54    54.81     Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=...
Q24JN5          3.7e-53    54.33     Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1

Match Name      E-value    Identity  Description
A0A0A0KU17      1.0e-162   98.96     Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G...
A0A1S3BXE6      6.1e-160   97.57     prolyl 4-hydroxylase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S...
A0A1S4DZZ9      1.8e-156   91.53     prolyl 4-hydroxylase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S...
A0A6J1CBS4      3.6e-152   90.97     prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1
A0A6J1GWV8      7.4e-150   90.63     prolyl 4-hydroxylase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457888 ...

Match Name      E-value    Identity  Description
XP_004152082.1  2.31e-207  98.96     prolyl 4-hydroxylase 1 [Cucumis sativus] >KGN53125.1 hypothetical protein Csa_01...
XP_008453925.1  1.05e-203  97.57     PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo]
XP_016901567.1  7.85e-199  91.53     PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo]
XP_038904320.1  1.84e-197  95.49     prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida]
XP_022137963.1  1.68e-193  90.97     prolyl 4-hydroxylase 1 [Momordica charantia]

Match Name      E-value    Identity  Description
AT2G43080.1     7.6e-123   74.82     P4H isoform 1
AT1G20270.1     7.3e-57    55.02     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT5G66060.1     1.2e-56    53.77     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT4G35810.1     1.4e-55    54.81     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT2G17720.1     2.6e-54    54.33     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (hystrix) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                                    Source   Source Term      Source Description                  Alignment
IPR006620  Prolyl 4-hydroxylase, alpha subunit                                SMART    SM00702          p4hc                                coord: 89..283; e-value: 7.8E-57; score: 204.8
None       No IPR available                                                   GENE3D   2.60.120.620     q2cbj1_9rhob like domain            coord: 82..283; e-value: 2.1E-66; score: 225.6
None       No IPR available                                                   PANTHER  PTHR10869:SF179  BNAA04G24820D PROTEIN               coord: 1..287
IPR044862  Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain  PFAM     PF13640          2OG-FeII_Oxy_3                      coord: 171..283; e-value: 1.5E-20; score: 73.8
IPR045054  Prolyl 4-hydroxylase                                               PANTHER  PTHR10869        PROLYL 4-HYDROXYLASE ALPHA SUBUNIT  coord: 1..287
IPR005123  Oxoglutarate/iron-dependent dioxygenase                            PROSITE  PS51471          FE2OG_OXY                           coord: 167..284; score: 11.197351

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Chy7G142440.1  Chy7G142440.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0000137 Golgi cis cisterna
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen