CsGy5G009980 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy5G009980
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionProcollagen-proline 3-dioxygenase
LocationGy14Chr5: 8862185 .. 8873909 (+)
RNA-Seq ExpressionCsGy5G009980
SyntenyCsGy5G009980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGGATATTGAAAAAGAAAAGAAAGGAAGGTTTTAATGAAAGGGAGCAGTGGCATGTTTCTTTTCAAAATAGTATTTTCTTCTATAGAAGAACTTTTGTTCGTATTTATAGATGAAATGAAATGGAATGAAAAATTATGTAAAAGTAATGGATGAGTTTACCATAAATTAAAGTTTTTCAAATTTGTTGGTAACTCCAATTCTTCATTTCCAATCAAGCTAAGTCCAATCCTTCATTTCAAGTTTCTTAGGTTATTTTGGACCATTTTCCCACCCGTCTCGCTGAAACGGGAGAGACGGAGAACTGGACGCCAAAATGGTAGATGGAGCTGAGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGTGGGGTATAGACCAAACGTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGGTAAAGCTGCCCCTTGGTCGATTTATATCCTTAGTTTTTCTTGGGTAATCTCTATGTTTAATTTTGATTTGACTCCAATTTTTGCGACAGAGAAGTTGAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGGTTCTTCTTCTTCCTCATGTGCTAAATGCCGAATTGGGTTGTTGTCTCTTTCTACATATAATTCCAATTTTTGAAGAATGAACATAGTTAGTAATTGATGCAGACCTGTGAGGAAACAAATTAAAAGGATAGCAGTAAGTTTTTCTTTTTTGTCTCTGGGAGTCTGACGGTTGTGGTATAGTGGTCATTTAGAAATTGAAGTGCACTACTTATCGGTGTCCATAACTTGTATGATTAACATGTCAAGTAATACAATTAGTTTAGATGATGTATCAATTTGAATTGAGCCTTAAACAAGTGTTCCATGAGCATTTTTCTCCACAAAGAGAAAGACCTTAAAATTTCTCACTTGGTGACAAAATGATGATCTTATCTCTCACTACGCTGAAAACTGATCATCTTGATAAACTCCAAAGTTGATCTTCTTGGTGAACTCGAAGGTTAAAAAAACCTATAAACTCTAGGAAAAAAAAGAAAGAAAGGAAAACATTAGAAATATAACTGTAAATCATTAGTACTTGGCCTGAAGCATGAGCAAGCTATGTGAACTTTTCCCCCAAACATAAGAGACAATCAAGTACTCCCACTTTGACGTAACACTCGCCATTTCTTCCAACTCAAAAACTTGTTCCTAAAGAAGATAGATTCTATAGCTTTTGTTAAGGCTCCTAAGCTACTTATTCAAGCTCTTCCTCGACATTATTCTCGAAAAGGTGCTCTCTCCCTTGCATCCAAGTTATCGAATGCTCATAATGATCTTATTGATGTTGCTTGTGTATGAGTCAAAGTCCCTAATTTGGCTTTGTCTAAGTCCAATCACAATTCTTCTACGCTTATTCCCCCTTCACAAAACTCCAAGCACCAATGAATATGGACTGTTTCTCAACAAGTTAGCTCTTATTGTCACACAATTCAAATTGATAGAGAAGATGATTTAGGTGCAAGTGTTGGCAAGTAAAGAGCTAGAGTTAAATTCTTTGAATCCAAACTTAGAGGATTCCCTTCTTGAGGAGAACTTTGGAGAAGATTGATAGACCCCCGATCTTATACCAATATAGCAGGAGAGAGAAAAAGAAGTAACACGTGTGGGTGAGTAACACGTGGGAATAACATAAGGGTTAGGAGTTAGTATAAATAAAAGCCAATAGGAGGGTGGGAAAGTCAGTTAGAATCTGTGAAGTGAGAGACCTTCTCTGTTCTTCCTTGAAAGAAAGGATAACAGTAGAATTGAGAGGAGAGATTGTAACCAGGAATTCATTCTAATACTAACTATCAATAATACTAGATTTACAATACCATATCCGATTTCGAAAAAATTGGTATCAGAGAATGTGATCTCGGGCAAAGAGGGCAAAAATGGTGCAAACTCGGATCGAAGAGAAGATGGAAATGTTAGCTAAGGAATTACAGAAAATTAAGAAAGAAATCAGAAAGTTACCGGCTATTGAGAAAACGCTGAATGAAATATCAAAGAATATGGAGGGACAGAACCAATTGATATTGTGAATTATGGAATCGACAGCGCAGGAGAGATCGACAATGAATGAAAATTTAATCGAATTATTGATGCGGAATTTTCCGGTGAAGAATTTAGCCGAAAGCGAAGGATCTTCAAGACGAGAGAGCGAAACAAAGAACAAAGAAAAGAAAGTGAACGAAGAAGGCATCAATGACCGTAACAAGTTTAAGAAGGTGGAGATGCCAATATTCAACGGAGATGATCCCGACTCGTGGCTCTTTCGCGCCGAAAGGTATTTTCAAATTCGTAAACTTACTGAATATGAAAAGGCGACTGTTTCTACCGTTAGTTTTGAAGGACCAACATTAAATTGGTATCGTTCCCAAGAGGAACGAGAGAAGTTTGTTGACCGGGCAAACATGAAGGAAAAACTATTAGTTCGATTCCAATCTATGAGGGAGGGATTGTTATATGGAAGGTTTTTACGTATTCAACAGAAGACAAGGGTGGAAGAATATCGAAATTTATTCGATAAATGGGTAGCACCTTTATCAGACTTATCCGAAAAAGTAGTGGAAGAAATATTTATGTTTGGATTGAAGCCTCAGATTCAAGCTGAGATGGTCTTTTGTGAATCGAAGGGATTGACGCAAATGATGAGAATAGCACAGAAAGTAGAGAATAGGGAAGATATTTGAGAATAGGGAAGATATTTGACGAGAAGCAAATCTTCCTGGGTATTCCAGTGGAAAGTTAACTAACTCGTATAATAGTGTTAAGACTAATGCGAATGCGAATTCTGGAGAAAATAAAAGGAGTATGAGTTGGGCGATGAGGACAATCACGTTACGAGGGACCTCGAATGAAGAGGTTCGGAAAGAGGGTCCAACAAAGCGGCTGTCTGATGTCGAATTTCATTCTCTAAAGGAGAAAGGCCTATGTTTTCGATGTAATGAGAAATATTCTCACGATCACAACTGTAAAACTAAGGAGTAGAGAGAATTGCGTATGGTTGTAGTGAGGGGAGAAAATGAGGAGTACAAGATTATTGAAGAAGGAGGTAGTGAACGGAAAGAGTTGAATGTCATTGAGATCATAGGGGAAAATCAAACTGTGGAGTTGTCTATCAATTCTGTGGTAGGCCTATCCAATCCGGGAACGATGAAGGTGAAAGGAATGATACAGGGAAGGGAAATTATAGTGTTGATAGACTGCAGAGCGACACATAACTTTGTTTCAGAAAAACTGGTGAAGGAGCTACAACTGAATACACAAGATACATCAAACTATGGGGTCATCTTGGGTTCCGGTACTGCCATTAAAGGGAAGGGAATTTGTGAGGCTGTTGAATTGATGTTAGGAGACTAGAAGGTGATTGATGAATTTTTACCCTTGGAATTGGGGGACGTGGATACCATATTAGGAATGCAATGGCTATATTCTTTGGGCATAACTGAAGTAGACTGGAAGAACCTGATATTGTCCTTTATGCATTAGGGAAAGAAGATTATCATACAAGGAGATCCAAGTCTGACCAAGGCCAGGGTTAGTTTGGAAAACTTAATGAGGACCTGGGGAGAAGAAGACCAAGTATTCTTAGTTGAATGCAGAGCCTTGGAAAGGAGATAATCGTCTGAGGAAGAAGACTTGATTGAGGAAGAAGTAACTGTAGAGGAATCACTGGCAGTGGTATTAAAGAGTTTTGAGCACGTCTTTGAATGGCCTGAGACATTACCTCCTCGGAGAATGATAGAACACCATATACACTTAAAAAGGGAGTGAATCCCGTGAACGTGAGACCTTATCGCTATGCATATCAGCAGAAAACTGAAATGGAAAAACTTGTTGAAGAAATGCTGTCATCAGGGATAATACGACCAAGCACGAGTCGTTATTCGAGCCCTATATTGCTGGTAAGAAAGAAGGATGGAAGCTGGCAATTTTGTGTAGATTACCAAGTTCTGAATAATGTAACTGTGCTAGATAAATTTCCGATACCGATGATTGAAGAACTTTTCGATGAATTAAATGGAGCTGCTATGTTTACTAAGATTGATCTAAAATCAGAGTACCACCAGATCAGAATGTGTGCAGAAGACATTGAAAAGACAGCATTTAGAACTCATGAAGGCCATTGTGAGTTTACGGTGATGTCCTTTGGGTTAATAAATGCACCATCAACATTTCAGACATTGATGAATGCTATCTTTAAGCCGTACCTCAGGAAGTTTGTCCTGGTGTTCTTTGATGATATACTGATCTATAATAGGGATTTGAAAGCTCATTTGAATCATATGAGGGCAGTGTTGGAGGTGTTGAGAAAAAATGAGTTGTATGCGAACAAGAAGAAATGCAGTTTTGCTAGATCAAGAGTGGATTATCTTGGGCATATTATTTCTGGAGAAGGAGTGGAGGTGGATCCGGAGAAAATTCGAGCCATCAAAGAGTGGCCCATTCCAGCCAATGTGAGGGAAGTTCGGAGATTCTTGGGCTTAACTGATTATTACCGAAAATTTGTGCAACATTATGGCATGATTGTGGCTGCTTTGACACAACTATTGAAGATAGGAGGATTTAAGTGGTTTATGGAAGCTCAAGAGGCTTTTATCAAGTTGCAGCAAGCTATGGTGTCTCTCCCTGTTTTGGCACTACCAGATTTCAGTATTCCTTTTGAGATAGAGACAGATGCATCTGGATATGGGTTGGGTGCAGTGCTCGTACAAAAAAACGGTCAATTGCTTACTACAGTCACACTTTAGCAGTCAGAGATAGGGCTAAACCGGTATATGAAAGAGAATTAATGGTGGTGGTCATGGCAATATAGAGGTGGCGTGCCCATTTGTTGGGTAAGAAGTTCATGGTGAAAACTGATCAGCGGTCATTGAAGTTCTTATTGGAACAACGAGTGATTCAACCTCAATATAAAAAATGGATATCCAAACTCCTTGGGTATTCGTTCGAAGTGGTATATAAGCCGGGGCTTGAGAACAAAGCTGCTGATGCTTTGTCTAGAATGCCACCCACAGTTCACTTAAATCAGTTGACAGCTCCAAATCTGATTGATGTGGCAGTAATAAAGGAAGAGGTAGACCAAGATGAGAAGTTGCAGAAAATTAAAGAAGAGTTGGAAGAAAAAGGAGAGGATCAGGACAACAAATATTCAGTGAAACAAGGGATGTTGATGTATAAGGATCGAATGGTGATATCCAAAACATCTAAACTGATTCTTATGATTTTACACTTACCATGACTCGGTATTCGGAGGACATTCAGGTTTTTTGCGAACATACAAAAAACTGACTGGAAAGCTGTTTTGGGAAGGGATGAAACAAGATGTGAAAAAGTATTGTGAGGAGTGTATGATTTGTCAACGAAATAAAACGTTGGCATTATCTCTAGCTGGTTTATTGACACCTTTGGAAATTCCTAATAGAGTATGGGAGGACATCTCAATGGACTTCATAGAAGGATTGCCGAAAGCAAATGGGTTTGAAGTCATCTTTGTTGTGGTTGATCGATTCAGCAAATATGGGCGTTTTCTACCGTTGAAACATTCATATATTGCAAAGACTGTATCTGAATTGTTTGTGAAGGAAGTAGTATGGTTGCACAGTTTTCCTAAATCTATAGTCTCAGACAGAGATAAGTTGTTCTTGAGCCATTTTTAGAGGGAGTTGTTCAGATTGGCGGGTACAAGATTAAATCACAGTACTACATATCATCCTCAATCTGATGGGCAAACGGAAGTAGTTAATCGAGTAGTGGAAAGTTATTTACGTTGCTTTTGTGGAGAAAAACCGAAGAAGTGGGTAAAATGGATACCTTGGACGGAATATTGGTATAATACTACTTACCAATGCTCACTGGGAGTTACCCCATTTCAAGTAGTTTATGGCCGGTTACCTCCTCCATTGATATAACATGGAGATAGAGATACTTCAAGTTCAACATTAGATGAACAGGTGAAAGAAAGGGGAGCTCTGAAGGAGCATTTGAGAGTGGCACAAGATAAAATGAAGAAATATGCCGATTTAAAGAGGTGGGAAGTGCATTATCAGGTGGGGGACTTGCTTCTGTTGAAAATAAGACCCTACCGGCAAGTTACATTGAGAAGAAAAAGGAACAAGAAGCTTTCTCCTAAATTTTTCGGGCCCTACAAAGTGATAGAGAAGATTGGGCCAGTAGCGTATAAACTGGAGTTGCCCGATAATGCCGCTATACATCCCGTGTTTCATGTATCCTAGCTAAAGAAAGTATTTGGAACACATGATGAGAATCAGAATGATATTTCCTGTTTGACAGAAAACCACGAGTGGAGGGCTGTACCTGAAGAAGTGTATGGATATTTGAAGAATAAGGCGGGAGGTTGGGATGTGTTAGTGAGGTGGAAGGGACTACCTCGACATGAGGCAACGTGGGAGTTATATGAGGATATGCAACATCATTTTCCAGATTTTCACCTTCAGGACAAGGTGCATTTGGAGGAGTGTAATGATAGACGCCCGATCTTATACCAATATAAGAGAGAAAAAAAAGTAACACGTGTGTGTGAGTAACACGTGGGAATAACATGAGGGTTAGGAGTTAGTATAAAAAAAAGCCAATAGGAGGGCGGGAAAGTCAGTTAGAATCTGTGAAGTGAGAGACCTTCTCTGTTCTTCCTTGAAAGAAAGGATAACAGTAGAATTGAGAGGAGAGATTGTAACCAGGAATTCATTCCAATACTACTATCAATAATACTAGATTTACAATACCATATCTAGTTTCCATCAAAGATCATGTTGTCTTGTTTGATTGTTGAGTTGAAAAGAAGGTTAGGGAGGCCAATTTAGTTTGCTTTTCTCCATCTCAAATACCCTCCAAGTTCTTCTCCCTGCCCTAGTCGAAACTTGTGGCCTTCAATTTTGCAAAGCATCCCACCTTTCAACTTGTTGCGTTTTGAGTAGGATTTTGGTAATCTTGAAGATTTTGATTTTTAATGTCCGAAAGAGAGTCAGAAGTCTAAAGCAGTGGTTTTTGGCTTTTTCAGAATTCAAGTTGTCCTGGAAGAAGGCTCATTTGATGTCAAGTTTTTATCTTTTTTTTTTATTTTTTTAATTTGTTATCTTGGTCTAGTTTTGAATCTCTTCTTGTACTGGAAAGCTTTTGATACCTTTATGTCTAGTACCCTCATTTGTACTTTGCATTAGTCTCATTTCATTATTTCATTGAAAAGTTATGCTTCCATTTATTAAAAGAAATAATCTATATGTGGCTTAAAAAATTCAAACTTTCTGCTAAACTAAAGACTTGAACTTTGACCTTGGCAATTACCTTGGTCCTTTTCACTTTCATGCTTCAATTCACCTAGGACATCATCAACTTATTAGAATGTTTTTCAATTGGGGTTTGGGTAGTGACACATTGAAATCAACTATCTCTCTTAGACTTCCCAATTGTTTTTGGGTTGATAGATCATCTGACTGATTGAATTGGTTTTCATTGCTACTGGTTTAACTATACACCTACTAAAACCGTTTGGAACTACGCATCCATTGTCTCATTGATCCTTTGACAATATTTAAGTGCTATTTCTAGAATGTAGATTTTCTGTGGCCAACTACAACTGCTTGGGATTCTTGATCCAGTTTACATTCCAAAGCACACTTGCAACCCTCATCATCCAACTTAGGGTAAAGATATCTTTTGTAGCCTTGGGATTCTTTCTACTGCTTTGTGTTGGTGCAAATCTAAGCAACCTTTTTATCGTTTGAGCCTTTTTTATTTAGTTTCCAATTGGAAATTTCCTTTGTATCACCTTTAAGTGCTCTGGGGTCTCCCCTTTTACTTCATTTATTCAATTAAATGTTCACTCTAAAAAAACCATCCAACTTAGGGTAGAGATGAGATTCATCCTTTAAATGATTACAAATGTGACATGGTGTGTAGGAAGCATGGACATGTGTTTGACATGCAAGAAAATTCGTGTCTTCTTCTTTTCTTTTTTTTCCCGTCCCCCTCTCTGATTTCGGACACATGAAGACATGCACTAGACACACCATTTGCCCATAAAAAAGGAAAACCGAACCTTTTTTATTTTTTATTTTTATACGGACACAACGTTCAAGTTCATAAAAAAAGATGAAAGAATACAAGGGTATATGAAAAGCCAACCCCAAAAAAGGAACTCCCTCTACAAAAATGAATCGAAATATGCAAAATAGTGCCTATAGAATAATTACAAAAGGTCTTTGAAATTGGAGCTCACAGAGAAGCATGAAAACAAACGAGAGACTAAATTTCATGAGGGTTCCTTTCCACCTCGCTAAACACCTTACCCTACTATACTATTAGGATTTCTTTCAAGAAATATAGACTTTTAATAATTGTTATTTTGTGGATTTTAATATACCTTTTTCTCTATTTGGTTTAAATGAGCTTTTGTTTCCATTTATGTTTTATCCAATACTTCATTTGATTTACTCTTTAGTCTTTAATAGGGTAGTAGAACTTGTATTTTTCTTCTAACATTTTGTCATAATTCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGTATGTGTACCAATCTTTCCTAAGAAAGTTGACGTCATTCTTCCTACTCTGGATAGGCTATTATTTAATTTCAATTGTTGTCTCATATTACTAATATATTTTTTCTACAGTATGTTCTCGAGTTTTTCTTTTCAATTTTTCTTTCTGGCAAGGGGAATGGTGGCTTTGTATTTTTTAACAATTCCTTTACCTTCTCTCTGAAATTCATTTGGTATGTCAGCTTTATTATTACAGTACTCTAGCTGATAGTTATCTTCGTGTGTTTGGCAAAAGGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGTAAGTTAGATGAAAGCTGTTTACCTCCTTGACTTATTGTTATTATTTAATGTTGACGGTACATCGTAACTTTATCTTATCGAAAATGGTACATATGCGATTGCTACCAATCTGAGTTGTAGCTATGCTGGTCAAGACATTATAATGTTGACCAAAAGGTTAAAGGTTTTAATCCTCTCCATATATTGTTGTGCTCGAAAATGGTACATTTTCTATTTTGAACTCTGAAGAATCTGCTTGTGCTCTTTTGAGCCAAGAAGTCTGCAGCATTTTATGGTAAATAGAAAAAATAAGTGGAGCTAGTCCTTTACATTTTGAATCTAGACCCATGTTTCATTGCTCTTCTTGTTGTTACCTTTCTAATTATTTTAGTAATTATGCCTTATTTATAAATTCTAATCAGTTCTAGTCTTGGGGGATGATGAACTGTTGGGTTTCTTGTTTGATGTCAATATATATAAGTAATGTGGAACAGGGCACATTTTCGGAATTGTTTGATTTTGGTGGTGGATTTTTAATTCATTCGACGGTTTTCCTTTTTACAAGAAATGCTTGCGACTAAATTAGTCTGAATAATTGATGATGTTGGATCATAGTTCCTAGTAGTTTGAAATTTGAAACAATGACAGATCAACTATAGTTAATACTCATTTCACTTTATCAGGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGGTATGCCCAAGCACTGAGACCCTCTGAATCTGTGTGAAACCATTTTACAAAAAGGAAAACCGTTTGATAGACTAATCACAGCCTTTCTAAAATTAATGCTAAATATTGGCTAGGCAAAAAGATGTTGCATCCGTTCACTGAGTTTATATTTTAATATAATGGAGTAATTTCATGTACATCCATGACCCACAAGTCCAGTTACCACTATCATTTTAAGATAACAGTGGATAACAAAAAGGGTCTAATTGTTTCTAGAACTTCTATAGGTGCTTGTAAAGAAGATCGCAATTCTTAATTTCTTAACTTGGAGACAGCAACACATGCTTGTTGACCCTTTCTGGATGCGGCTCTAAATGAGGCTAAGCCTCGATGCCTAGTGTTTGATAGACTAATCACAGCCTTTCTAAAATTAATTGGCATCAGTAGGATGAGGATCATATACAAAAAATTGCTCTTCACAGTTAACACAAACATTTTCTTAATATAAGCAATTAATGGGTGGGGATCTGAACTTTGATCGATTCTTTAACCCTAACCAAATCTTAGTCAAGTTTTCTATTGAGTCTTTTTTTGGTTATTTTTTGGAAAGCAAGATAAATAGGAGCACATTAATTCTTTCTTGCTTTTTTGCTTTTTACAGGTAACTGAGGAATCATGCATTTGATTAATCCTCAAGAGAGTTGCTAGTAGACCTCTTAGCACAGTTGTTCTTCATCTTGCTTACTTCATGGTTTCCTTGGATAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTATTCCATCGAAATTACCCTTTAACATTTATTTCCCATTCTTTATAGGGGAAACTGGGGGATTGTAGTGTGACGAAAAACATGCTATATCATCAATCGTGAAAGGCTACTTTGAATAACTTGCTTTGTTTTATAGGGTTTTCTCATTGAAAAATTCTACTGAAACTAAAAGTAAATTGCGTTTACTACATATGCTTAATGGTTAAACAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAACAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGAATATCCCAAATACTTAAACTAGCAGTAGCTTCGAGGTTAGTTCTGGGCCTTTGTATAGCTTAACTTTTACAATATCAAATAAAGTTCCGTTTGGTAACAATTTTGTTTTTTGTTTTGGAAATTTATGTTTGTTTCATTCCAAATTTCCAATTATAGTTTTCAATTTTGTTAATTAAACACTAGGGTCCTGTTTGGTAATCATTTTGTTTTTTGTTTTTGAAAAATTAAGTCTATTGACACACTACTTAAAGAACC

mRNA sequence

AGGATATTGAAAAAGAAAAGAAAGGAAGGTTTTAATGAAAGGGAGCAGTGGCATGTTTCTTTTCAAAATAGTATTTTCTTCTATAGAAGAACTTTTGTTCGTATTTATAGATGAAATGAAATGGAATGAAAAATTATGTAAAAGTAATGGATGAGTTTACCATAAATTAAAGTTTTTCAAATTTGTTGGTAACTCCAATTCTTCATTTCCAATCAAGCTAAGTCCAATCCTTCATTTCAAGTTTCTTAGGTTATTTTGGACCATTTTCCCACCCGTCTCGCTGAAACGGGAGAGACGGAGAACTGGACGCCAAAATGGTAGATGGAGCTGAGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGTGGGGTATAGACCAAACGTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAACAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGAATATCCCAAATACTTAAACTAGCAGTAGCTTCGAGGTTAGTTCTGGGCCTTTGTATAGCTTAACTTTTACAATATCAAATAAAGTTCCGTTTGGTAACAATTTTGTTTTTTGTTTTGGAAATTTATGTTTGTTTCATTCCAAATTTCCAATTATAGTTTTCAATTTTGTTAATTAAACACTAGGGTCCTGTTTGGTAATCATTTTGTTTTTTGTTTTTGAAAAATTAAGTCTATTGACACACTACTTAAAGAACC

Coding sequence (CDS)

ATGGTAGATGGAGCTGAGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGTGGGGTATAGACCAAACGTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAACAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGA

Protein sequence

MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS*
Homology
BLAST of CsGy5G009980 vs. ExPASy Swiss-Prot
Match: Q8CG71 (Prolyl 3-hydroxylase 2 OS=Mus musculus OX=10090 GN=P3h2 PE=1 SV=1)

HSP 1 Score: 68.2 bits (165), Expect = 2.4e-10
Identity = 60/221 (27.15%), Postives = 96/221 (43.44%), Query Frame = 0

Query: 11  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL------- 70
           +R++L+N LS+E+CREL  +      VG         + PN     +T L  L       
Sbjct: 458 QRVLLDNVLSQEQCRELHSVANGIMLVGDGYRGKTSPHTPNEKFEGATVLKALKFGYEGR 517

Query: 71  VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWT---------RGASIGW 130
           V   SA L   F  I EK ++  E +F  +  L+  +T ++  T            S   
Sbjct: 518 VPLKSARL---FYDISEKARKIVESYFMLNSTLYFSYTHMVCRTALSGQQDRRNDLSHPI 577

Query: 131 HSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISP 188
           H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P
Sbjct: 578 HADNCLLDPEANECWKEPPAYTFRDYSALLYMND---DFDGGEFIFTEMDAKTVTASIKP 637

BLAST of CsGy5G009980 vs. ExPASy Swiss-Prot
Match: Q4KLM6 (Prolyl 3-hydroxylase 2 OS=Rattus norvegicus OX=10116 GN=P3h2 PE=1 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 4.2e-10
Identity = 60/221 (27.15%), Postives = 95/221 (42.99%), Query Frame = 0

Query: 11  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL------- 70
           +R++L+N LS E+CREL  +      VG         + PN     +T L  L       
Sbjct: 458 QRVLLDNVLSEEQCRELHSVASGIMLVGDGYRGKTSPHTPNEKFEGATVLKALKFGYEGR 517

Query: 71  VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWT---------RGASIGW 130
           V   SA L   F  I EK ++  E +F  +  L+  +T ++  T            S   
Sbjct: 518 VPLKSARL---FYDISEKARKIVESYFMLNSTLYFSYTHMVCRTALSGQQDRRNDLSHPI 577

Query: 131 HSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISP 188
           H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P
Sbjct: 578 HADNCLLDPEANECWKEPPAYTFRDYSALLYMND---DFEGGEFIFTEMDAKTVTASIKP 637

BLAST of CsGy5G009980 vs. ExPASy Swiss-Prot
Match: Q8IVL5 (Prolyl 3-hydroxylase 2 OS=Homo sapiens OX=9606 GN=P3H2 PE=1 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 7.1e-10
Identity = 60/221 (27.15%), Postives = 94/221 (42.53%), Query Frame = 0

Query: 11  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL------- 70
           +R++L+N LS E+CREL  +      VG         + PN     +T L  L       
Sbjct: 463 QRVLLDNVLSEEQCRELHSVASGIMLVGDGYRGKTSPHTPNEKFEGATVLKALKSGYEGR 522

Query: 71  VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWT---------RGASIGW 130
           V   SA L   F  I EK +   E +F  +  L+  +T ++  T            S   
Sbjct: 523 VPLKSARL---FYDISEKARRIVESYFMLNSTLYFSYTHMVCRTALSGQQDRRNDLSHPI 582

Query: 131 HSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISP 188
           H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P
Sbjct: 583 HADNCLLDPEANECWKEPPAYTFRDYSALLYMND---DFEGGEFIFTEMDAKTVTASIKP 642

BLAST of CsGy5G009980 vs. ExPASy Swiss-Prot
Match: Q5XGE0 (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 OS=Xenopus tropicalis OX=8364 GN=ogfod3 PE=2 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 4.3e-07
Identity = 26/85 (30.59%), Postives = 48/85 (56.47%), Query Frame = 0

Query: 102 WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDN 161
           WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ ++N
Sbjct: 225 WHPHIDKVTYGSFDYTSLLYLSDYSQDFGGGRFVFIDEGANRTVEPRTGRLSFFTSGSEN 284

Query: 162 VHSVDEITNGERLTLTLWFTRDSSH 186
           +H V++++ G R  +T+ FT +  H
Sbjct: 285 LHRVEKVSWGTRYAITISFTCNPEH 309

BLAST of CsGy5G009980 vs. NCBI nr
Match: XP_004140463.1 (prolyl 3-hydroxylase 1 [Cucumis sativus] >KAE8648112.1 hypothetical protein Csa_004690 [Cucumis sativus])

HSP 1 Score: 831 bits (2147), Expect = 1.38e-304
Identity = 396/396 (100.00%), Postives = 396/396 (100.00%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
           YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
Sbjct: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVA 360
           TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVA
Sbjct: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVA 360

Query: 361 AAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396

BLAST of CsGy5G009980 vs. NCBI nr
Match: TYK14443.1 (prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa])

HSP 1 Score: 796 bits (2057), Expect = 7.56e-291
Identity = 381/397 (95.97%), Postives = 386/397 (97.23%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
           YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of CsGy5G009980 vs. NCBI nr
Match: XP_008456831.1 (PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] >KAA0032195.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa])

HSP 1 Score: 794 bits (2051), Expect = 6.21e-290
Identity = 380/397 (95.72%), Postives = 386/397 (97.23%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
           YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of CsGy5G009980 vs. NCBI nr
Match: XP_038893062.1 (uncharacterized protein LOC120081945 [Benincasa hispida] >XP_038893063.1 uncharacterized protein LOC120081945 [Benincasa hispida])

HSP 1 Score: 735 bits (1898), Expect = 1.34e-266
Identity = 356/398 (89.45%), Postives = 371/398 (93.22%), Query Frame = 0

Query: 1   MVDGAESRQRRR--LILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAH 60
           M D  ESR+RRR  LILENFL+REECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAH
Sbjct: 1   MGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAH 60

Query: 61  LIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSA 120
           LI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQR+FSA
Sbjct: 61  LIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSA 120

Query: 121 VCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLW 180
           VCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTAD+DNVHSVDEITNGERLTLTLW
Sbjct: 121 VCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLW 180

Query: 181 FTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWAR 240
            TRDSSHDED+KLLSLLSQS LHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWAR
Sbjct: 181 LTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDDPNFKSGFDICWAR 240

Query: 241 LRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL 300
           L ALGYD+YF GDH FSEYPDLF +DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Sbjct: 241 LHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKEL 300

Query: 301 DSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL 360
           DSTN+ EDSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA SDGKENQ WLGWDKL
Sbjct: 301 DSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKL 360

Query: 361 VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
            AAAAAWE YASILRRELLGS S+WRN QSIYSVSL S
Sbjct: 361 AAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS 398

BLAST of CsGy5G009980 vs. NCBI nr
Match: XP_008456833.1 (PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo])

HSP 1 Score: 707 bits (1824), Expect = 7.35e-256
Identity = 348/397 (87.66%), Postives = 354/397 (89.17%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFS   
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFS--- 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
                                        DCVMYTAD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 -----------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 365

BLAST of CsGy5G009980 vs. ExPASy TrEMBL
Match: A0A0A0KMN7 (Procollagen-proline 3-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G289640 PE=3 SV=1)

HSP 1 Score: 821 bits (2121), Expect = 1.09e-300
Identity = 396/411 (96.35%), Postives = 396/411 (96.35%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS---------------WTRGASIGWHSD 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS               WTRGASIGWHSD
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISLHSKAHLQPSSSNLGWTRGASIGWHSD 120

Query: 121 DNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVD 180
           DNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVD
Sbjct: 121 DNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVD 180

Query: 181 EITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDD 240
           EITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDD
Sbjct: 181 EITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDD 240

Query: 241 PNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL 300
           PNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL
Sbjct: 241 PNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL 300

Query: 301 QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASD 360
           QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASD
Sbjct: 301 QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASD 360

Query: 361 GKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           GKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 GKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 411

BLAST of CsGy5G009980 vs. ExPASy TrEMBL
Match: A0A5D3CRE9 (Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold186G001030 PE=3 SV=1)

HSP 1 Score: 796 bits (2057), Expect = 3.66e-291
Identity = 381/397 (95.97%), Postives = 386/397 (97.23%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
           YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of CsGy5G009980 vs. ExPASy TrEMBL
Match: A0A5A7SSL8 (Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold452G001280 PE=3 SV=1)

HSP 1 Score: 794 bits (2051), Expect = 3.01e-290
Identity = 380/397 (95.72%), Postives = 386/397 (97.23%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
           YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of CsGy5G009980 vs. ExPASy TrEMBL
Match: A0A1S3C486 (Procollagen-proline 3-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103496668 PE=3 SV=1)

HSP 1 Score: 794 bits (2051), Expect = 3.01e-290
Identity = 380/397 (95.72%), Postives = 386/397 (97.23%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
           YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of CsGy5G009980 vs. ExPASy TrEMBL
Match: A0A1S3C4U3 (uncharacterized protein LOC103496668 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496668 PE=4 SV=1)

HSP 1 Score: 707 bits (1824), Expect = 3.56e-256
Identity = 348/397 (87.66%), Postives = 354/397 (89.17%), Query Frame = 0

Query: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60
           MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120
           IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFS   
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFS--- 120

Query: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180
                                        DCVMYTAD+DNVHSVDEITNGERLTLTLWFT
Sbjct: 121 -----------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240
           RDSSHDEDAKLLSLLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300
           ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV- 360
           TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV 
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 361 AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396
           AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 365

BLAST of CsGy5G009980 vs. TAIR 10
Match: AT1G68080.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 418.3 bits (1074), Expect = 6.8e-117
Identity = 213/391 (54.48%), Postives = 269/391 (68.80%), Query Frame = 0

Query: 8   RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIR 67
           ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IR
Sbjct: 4   KEHPRLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIR 63

Query: 68  EKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGV 127
           E+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDNR YLKQR+F+AVCYLNSY  
Sbjct: 64  ERLKEKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAAVCYLNSYEK 123

Query: 128 EFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDE 187
           +F GGLF FQ GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDE
Sbjct: 124 DFIGGLFRFQSGEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDE 183

Query: 188 DAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDL 247
           D+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D+
Sbjct: 184 DSKLLSRLSQCTSH----EVCLPLPASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDV 243

Query: 248 Y-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE 307
           +   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  
Sbjct: 244 HSLQGEDHSTDASEQLMGPLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVEN 303

Query: 308 DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAA 367
           D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +
Sbjct: 304 DTLEEVKAMSHSQLETINALKSVFLLDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTS 363

Query: 368 WEHYASILRRELLGSFSHWRNCQSIYSVSLD 396
           WE Y+  L +ELL S   W+  Q+I+ V  D
Sbjct: 364 WEEYSCKLLKELLSSLPQWKTYQTIHKVESD 389

BLAST of CsGy5G009980 vs. TAIR 10
Match: AT1G68080.3 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 376.3 bits (965), Expect = 3.0e-104
Identity = 198/391 (50.64%), Postives = 254/391 (64.96%), Query Frame = 0

Query: 8   RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIR 67
           ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IR
Sbjct: 4   KEHPRLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIR 63

Query: 68  EKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGV 127
           E+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDNR YLKQR+F++         
Sbjct: 64  ERLKEKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAS--------- 123

Query: 128 EFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDE 187
                      GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDE
Sbjct: 124 -----------GEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDE 183

Query: 188 DAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDL 247
           D+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D+
Sbjct: 184 DSKLLSRLSQCTSH----EVCLPLPASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDV 243

Query: 248 Y-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE 307
           +   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  
Sbjct: 244 HSLQGEDHSTDASEQLMGPLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVEN 303

Query: 308 DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAA 367
           D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +
Sbjct: 304 DTLEEVKAMSHSQLETINALKSVFLLDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTS 363

Query: 368 WEHYASILRRELLGSFSHWRNCQSIYSVSLD 396
           WE Y+  L +ELL S   W+  Q+I+ V  D
Sbjct: 364 WEEYSCKLLKELLSSLPQWKTYQTIHKVESD 369

BLAST of CsGy5G009980 vs. TAIR 10
Match: AT1G68080.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 336.3 bits (861), Expect = 3.4e-92
Identity = 183/390 (46.92%), Postives = 237/390 (60.77%), Query Frame = 0

Query: 8   RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIR 67
           ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IR
Sbjct: 4   KEHPRLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIR 63

Query: 68  EKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGV 127
           E+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDNR YLKQR+F++         
Sbjct: 64  ERLKEKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAS--------- 123

Query: 128 EFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDE 187
                      GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDE
Sbjct: 124 -----------GEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDE 183

Query: 188 DAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY 247
           D+KLLS LSQ                                  FD+C ARL  LG+D++
Sbjct: 184 DSKLLSRLSQC---------------------------------FDVCVARLHLLGFDVH 243

Query: 248 -FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSED 307
              G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  D
Sbjct: 244 SLQGEDHSTDASEQLMGPLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVEND 303

Query: 308 S-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAW 367
           +    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +W
Sbjct: 304 TLEEVKAMSHSQLETINALKSVFLLDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSW 339

Query: 368 EHYASILRRELLGSFSHWRNCQSIYSVSLD 396
           E Y+  L +ELL S   W+  Q+I+ V  D
Sbjct: 364 EEYSCKLLKELLSSLPQWKTYQTIHKVESD 339

BLAST of CsGy5G009980 vs. TAIR 10
Match: AT1G68080.4 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 192.2 bits (487), Expect = 8.0e-49
Identity = 106/246 (43.09%), Postives = 148/246 (60.16%), Query Frame = 0

Query: 153 MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQP 212
           MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P
Sbjct: 1   MYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKLLSRLSQCTSH----EVCLPLP 60

Query: 213 PSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWG 272
            S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G
Sbjct: 61  ASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQGEDHSTDASEQLMGPLQLAKG 120

Query: 273 DKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFS 332
            K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F 
Sbjct: 121 GKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEEVKAMSHSQLETINALKSVFL 180

Query: 333 KNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSI 392
            ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I
Sbjct: 181 LDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYSCKLLKELLSSLPQWKTYQTI 240

Query: 393 YSVSLD 396
           + V  D
Sbjct: 241 HKVESD 241

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8CG712.4e-1027.15Prolyl 3-hydroxylase 2 OS=Mus musculus OX=10090 GN=P3h2 PE=1 SV=1[more]
Q4KLM64.2e-1027.15Prolyl 3-hydroxylase 2 OS=Rattus norvegicus OX=10116 GN=P3h2 PE=1 SV=1[more]
Q8IVL57.1e-1027.15Prolyl 3-hydroxylase 2 OS=Homo sapiens OX=9606 GN=P3H2 PE=1 SV=1[more]
Q5XGE04.3e-0730.592-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 OS=Xenop... [more]
Match NameE-valueIdentityDescription
XP_004140463.11.38e-304100.00prolyl 3-hydroxylase 1 [Cucumis sativus] >KAE8648112.1 hypothetical protein Csa_... [more]
TYK14443.17.56e-29195.97prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa][more]
XP_008456831.16.21e-29095.72PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] >KAA00... [more]
XP_038893062.11.34e-26689.45uncharacterized protein LOC120081945 [Benincasa hispida] >XP_038893063.1 unchara... [more]
XP_008456833.17.35e-25687.66PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo][more]
Match NameE-valueIdentityDescription
A0A0A0KMN71.09e-30096.35Procollagen-proline 3-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G289640 PE=... [more]
A0A5D3CRE93.66e-29195.97Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5A7SSL83.01e-29095.72Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A1S3C4863.01e-29095.72Procollagen-proline 3-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103496668 PE=3 S... [more]
A0A1S3C4U33.56e-25687.66uncharacterized protein LOC103496668 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT1G68080.16.8e-11754.482-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G68080.33.0e-10450.642-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G68080.23.4e-9246.922-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G68080.48.0e-4943.092-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 10..180
e-value: 0.0045
score: 3.0
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 95..179
e-value: 8.0E-11
score: 42.7
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 4..197
e-value: 1.3E-20
score: 75.9
NoneNo IPR availablePANTHERPTHR14049:SF92-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEINcoord: 6..392
IPR039575Prolyl 3-hydroxylasePANTHERPTHR14049LEPRECAN 1coord: 6..392
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 73..181
score: 8.919938

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy5G009980.1CsGy5G009980.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032963 collagen metabolic process
biological_process GO:0019511 peptidyl-proline hydroxylation
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0019797 procollagen-proline 3-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen