CcUC02G034210 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC02G034210
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
LocationCicolChr02: 29999187 .. 30004003 (-)
RNA-Seq ExpressionCcUC02G034210
SyntenyCcUC02G034210
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CACAAATCCAATCCCCCAATTACCCACCAAAAATTCTCTCCCACTTTCCTCCATGGCCAAATCTCCCAAACCCCACTTAGTATATATTCATCTTCGTTTCAATTCCAGCAGCAGTGTTTCAGAATCAGCTTGTTTTGATTCTTTTCTAGCATCTTCGTTCTGTCAATGGCGGTATCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACATTTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCATACTTGGGTTTGTCATGCTTCTTGCTCTCCGGTTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCACCGTCTTGCTTCCCTCCGATATACAGCCCTTCAAAGGTACTCCGGAAACGTAATGGCTGTGATTCTTTAAGCTTGTTACAGGAATCAACTGATTTTTGATTTTGGTACTTTTGTTGTTGTTGTAGTAGTGATGGGTTAGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCACAATTTCTTGGTAAGTTTCTAGAATTGTGGTTTCCATTTGGCTTCTTGTTTGATTTTGTCATTTTTTGAAAAAAAATAAAAGAAAATGCGTTTGGTTAAAATGTTGGCTGATGTTGGAATGTTGTTGGACAGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCACATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGTCTGTATTTGTCACTTTCATGGTTGTCTTGTAAAAAAAAAATGTTAATATTTACTGTAACCCACTAGCTTAACCTTTTAAGTTGATTGCTAATTTAAAATAGTATTGTAGTAGGAGATCTCAAATCTTGGTGCTGTCATTTCCTCTTTACGTTTCCTATTAGCTTAAGCTTTTGGGTTGGTGGGTAATTTAACTCCGTGAATTAAATTGTTTAGGACTGAATTACATTGTTTAGGACTGACAATAGTGTACGAATATACTTTTCTTCCCTTTGTTTTGATTGGTTTACATGAGATGGCTAGATTGGTTGGTTCGGTAAAAAACCAACCTGAACCGACGACCACAATGGAATGGTTATGGATAAAGCGCAATAGAAGGGTATAACAATACAATACAATGAACAAGGGAAACTTATAATTATTTGCTATACAATGTCATGTCCACAGGGTGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTATTGGTAAAGAATTCAAAAACTTTCATATTACTATGGCCAATTTCTGAAGCATTTTGTAACAATTCCTGTCTATAAGGGCTTGTGTAGGAATACTATTCAAAGCAGTTTTTGAAAACATGTTTGAATCACTGTTTTTCAATAGTTGTTTTATTTTGTGTGTGTTTTAATCTTGTCAAACGAGACAAGGGTGGTGATTTCATTCAATATATCAATCTTTACAAGGGATTCAACTGTAGTTCTATTGAATAGTTCTTCAAAATAGAGTTGCAACCAGAACATAATTTCAAACAAACCTTGGCAAACATTGATGAAAAATTAGTTAAATTTCATGTGAGATCTGTTCACCATTCTTGTTTTTAGGCTCAAACCAAACAGGGGTGTGGTGAATGAGCCATCATTTGCTGCTTGTTGCTTTGTTGATGTCACTGACATTGTTGGTCTTAATACATTTTGGTATTAATGTTGATATATTCCGAACAGAGCATACAGAGCATGCGCTGTAGGAGAAATGCTAGATAATTAGTATAATTTAGTTAATCTTGGTTTAGTAGATAATTTTAGATTAGTTAGTGCTTAATTATTAGTTTATTTGTCTTCTTTTCTTGCAAGGCTATAAATAGTCAATTTTAGGTTTGTATTGGGAGTTTTCGGAATCATTTTGAAGTAGAACTTTGTTCTAGAGAGTTTTCTCTCAACTAAGAGACGAATTTCTCTTATTTGGCCATAGATTTGGTCTAATCTCAAATCCACTTCATCAGCATGGAGAATGTGGGTTCGTTTTCCATTCTAGGTTTGAGGCCCGAACCAAACAGAGATTTGGTGAACAACATTCATCTGCTTGTTGATGTCTCTGTCATTCTTGATCTTAATACATTTTGATACTGATATTCACGATTCCAAACAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCATGTATTTGTAAGATTCTACTTTGACATATCTCAAACTCATCTGCTTCTCTCTCAAGGTTATCTCTCTGAATCGGACCAATACGAATGCAGGTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCAGCTGCGAAAGGAAACTTTAGCTCTGTGCCACAGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTACCTTAGACCCTACAAGTTTGCATGGTTAGTGTTTTTTCACTGTCTATCTTTTATAGTTTAGCCTTGGAAGACTGCTTACATCTAAATTTTCTTGTTTAGTTCTTTTAACCGTTTGAATTTTTTTCCAAAACCTTAATGGATTTCCGAAAAGGGAAAAAAAAAAAAAAAAGGATTTTGGAGCTTACATTTATATAAAGCATGGTTTCTTAAAATAAATAACTAAAAAGGAGGGGTTTTACTTAAATAATTAGGATTTTTTTTTCTTTGATATTTGTGAGTGTCTTGGCTACGGAAATTCGATATTAAATCCTAGGTAGTTGGCCACCATTGATTGAACCAATGTCTTCTTAGCCATTTATATGTCTCATTTTTTCCCACTATGCCAACCTATGATGGTTAATAGTTAAAAGTCGGTTGGATACCATTTTGGTCCCGGTTGGAAACAATTTTGTTTTTAATTTTTGGTTTTTAGTTTATAAAATTTATGTTTGTCTTCCCTTAAATTCTATGCGATGATTTTCATATTTCTTGAAAAAACAATAAATCTCAAAAATAAAAATGAGTTTTTTTAAAAAAAATTTTTTGGTTTGATTTTTGAAAATGTGGTGGAAAGTGGATAACAAAATAAAGAAACTTCCACGTAAAAGTGGTGTTTAGTTTAATTTTTAAAAACTCAATAGTTATCAAACAAAAATTTTTTTTTTTTTTTTTTTTAATTTTTAAAAATTAAACATTTTCTTAATTTTGTTGTCTAGGTGTGTTTTCAAAATCTAAGCTAAATTTTTATGGTCCCGAAGGGCTTATTTGGGCCCCTTCTTGTAGAGAGTGTCTGTCCCTTTTTGTGGGCTTGTTTTTGTATGCCAGTGTGTTTTTTTAATTTTATTTTCAATAAAATTCATTATTTTCATTTAAGAAAATATTAGCTAAGTTTTGAAAACTGAAAATAAAAAAGTATTTTTTTATTTATTTGGAATTTGACTATGAATTAAACATTTTCTTTAGAAAGATAAAAATCATAGCAATAAATTTGTGAGAAAAAAAAAAGTTTATCAAATAGAATTGTAATTTTAGAAAATTTTGTTTTTAAAATTTGACTAAGAATTAAGAATTCAAATTTAAGAAAGATGAAAACATTTGCAATTAAATGGTGAGAAAATAAGCATAATCTTAAAAACTAAAAGAGATATCTAGACGACGACAAAAAAAAAAAAAAAAAAAAAAAACTTAATTTTAGAAATGTCCACAAATTTGAAAAATTTCATCTTTAGATTTTATCTTAAAATTTAAACATTTTTAACCTGAGATAATTGGCTGGAGTGGAGGGTGATTCAATGCATATTTCTATTTGTATGATCTTATAGGGTAATAGATTAACTCACTAATTTTGAATTCAATTTTGATTGTTTTAATGTGGCTTTGGTGACAGGGGTGGGATCATTTCTTTGATTTGTGAAGTAGAATTAATTTTGTATTTTTGTGATATATAGGTGCTTGCCCTGTCATAAGTGGGAACAAATGGTCATGTACAAAGTGGATGCATGTTAATAAATATATGTAACTCTAAGGTAACTGATACATTTCTTTTTAATTTTTCTATATACGTCATTTATATTTTCACAAGCTAAGCCGTATTTAAATTTTCCTTTTCATCATTAAAAAAAAAAAAAATCCTTTTCTTATTCTTCAAAATAATCCTCACCAATAAACTGGAAAAATAGTGTCTCAATTGAGTGCCTTTCTTTTTTTTAAAAAATTAATTTGTGTTAAATTTGTACGTTTATTGTGAGACATTTTGAAGTGCAATAGTTTTCTTTTTATTATCATTTATTATTTTTATTTTAAAAGTCAGTGGAATAGTTTTCTTCAACTTAATGTGCAATAATCATATATAAAAACGAGGTAATAATAAATTTGAGAAAACCATTATTAATGCAATGTAACATATTTTAAATTCCACCTTTTTTTTTTTGTTTTGAGGAAAAATTTTAAAATCCACTTATCCTCTTGATTTTATAGTTTATACCATATTTGATATGATTTAAAGAGAGATTATTTCCAAATTGCAGGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAGAAATAGAAGACGACGGCTGATTGTTTATTTATTTTTTTATTATATTTTTGAGCTTGATTTATAATTCACCCTCTGAAAATTTTCATATTTAGTTGTTAATATATACATAGAGAAGTTCCTTTTTTCTAGGCAGAAG

mRNA sequence

CACAAATCCAATCCCCCAATTACCCACCAAAAATTCTCTCCCACTTTCCTCCATGGCCAAATCTCCCAAACCCCACTTAGTATATATTCATCTTCGTTTCAATTCCAGCAGCAGTGTTTCAGAATCAGCTTGTTTTGATTCTTTTCTAGCATCTTCGTTCTGTCAATGGCGGTATCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACATTTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCATACTTGGGTTTGTCATGCTTCTTGCTCTCCGGTTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCACCGTCTTGCTTCCCTCCGATATACAGCCCTTCAAAGTAGTGATGGGTTAGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCACATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGGTGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTATTGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCATGTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCAGCTGCGAAAGGAAACTTTAGCTCTGTGCCACAGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTACCTTAGACCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGTGGGAACAAATGGTCATGTACAAAGTGGATGCATGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAGAAATAGAAGACGACGGCTGATTGTTTATTTATTTTTTTATTATATTTTTGAGCTTGATTTATAATTCACCCTCTGAAAATTTTCATATTTAGTTGTTAATATATACATAGAGAAGTTCCTTTTTTCTAGGCAGAAG

Coding sequence (CDS)

ATGGCGGTATCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACATTTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCATACTTGGGTTTGTCATGCTTCTTGCTCTCCGGTTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCACCGTCTTGCTTCCCTCCGATATACAGCCCTTCAAAGTAGTGATGGGTTAGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCACATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGGTGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTATTGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCATGTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCAGCTGCGAAAGGAAACTTTAGCTCTGTGCCACAGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTACCTTAGACCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGTGGGAACAAATGGTCATGTACAAAGTGGATGCATGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAGAAATAGAAGACGACGGCTGA

Protein sequence

MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMSDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMHGVLEVLCWGRREIEDDG
Homology
BLAST of CcUC02G034210 vs. NCBI nr
Match: XP_038889689.1 (probable prolyl 4-hydroxylase 3 [Benincasa hispida])

HSP 1 Score: 539.3 bits (1388), Expect = 2.1e-149
Identity = 262/284 (92.25%), Postives = 272/284 (95.77%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MA+SKGKY K+Q KKWSTF+LSKMIMALV  LGF MLLALRFFSPPETSHRNLPH LAS+
Sbjct: 1   MALSKGKYTKIQGKKWSTFELSKMIMALVLALGFFMLLALRFFSPPETSHRNLPHHLASV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R++A++ SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK
Sbjct: 61  RHSAVE-SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH
Sbjct: 121 TGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YDYFVDEYNIKKGGQRMATLLM  SDVEEGGETVFPAA+GNFSSVP WNELSECGKGGLS
Sbjct: 181 YDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAARGNFSSVPWWNELSECGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Sbjct: 241 VKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 283

BLAST of CcUC02G034210 vs. NCBI nr
Match: XP_008456388.1 (PREDICTED: probable prolyl 4-hydroxylase 3 isoform X2 [Cucumis melo])

HSP 1 Score: 530.8 bits (1366), Expect = 7.6e-147
Identity = 264/284 (92.96%), Postives = 267/284 (94.01%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+
Sbjct: 1   MAVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRLPSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK
Sbjct: 61  RRTAFQ-SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH
Sbjct: 121 TGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YDYFVDEYNIKKGGQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLS
Sbjct: 181 YDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Sbjct: 241 VKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279

BLAST of CcUC02G034210 vs. NCBI nr
Match: XP_011648735.2 (probable prolyl 4-hydroxylase 3 [Cucumis sativus] >KAE8651549.1 hypothetical protein Csa_019368 [Cucumis sativus])

HSP 1 Score: 523.9 bits (1348), Expect = 9.3e-145
Identity = 258/284 (90.85%), Postives = 266/284 (93.66%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MA+SKGKYIKLQ +KWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HR +S+
Sbjct: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRFSSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK
Sbjct: 61  RHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +G+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH
Sbjct: 121 TGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YDYFVDEYNIKKGGQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLS
Sbjct: 181 YDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Sbjct: 241 VKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279

BLAST of CcUC02G034210 vs. NCBI nr
Match: XP_016901368.1 (PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis melo])

HSP 1 Score: 503.1 bits (1294), Expect = 1.7e-138
Identity = 250/284 (88.03%), Postives = 260/284 (91.55%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVS GKYIKLQ KKWSTFQLSKMIMALV  LGF ML AL FFSPPETSH    HRL+S+
Sbjct: 1   MAVSIGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLSALWFFSPPETSH----HRLSSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS+
Sbjct: 61  RHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSE 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKSVDS VRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGE +QILHY VGQKYDAH
Sbjct: 121 TGKSVDSSVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEDIQILHYAVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YD+FVDEYN+K  GQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGK GLS
Sbjct: 181 YDFFVDEYNLKSVGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKSGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           +KPKMGDALLFWSMKPDTTLDPTSLHGACPVI GNKWSCTKW+H
Sbjct: 241 IKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 279

BLAST of CcUC02G034210 vs. NCBI nr
Match: XP_011650099.2 (probable prolyl 4-hydroxylase 3 [Cucumis sativus] >XP_031736348.1 probable prolyl 4-hydroxylase 3 [Cucumis sativus])

HSP 1 Score: 497.7 bits (1280), Expect = 7.2e-137
Identity = 245/284 (86.27%), Postives = 261/284 (91.90%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVS  KYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRF SPPETSH    HR +S+
Sbjct: 1   MAVSIRKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFLSPPETSH----HRFSSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD++
Sbjct: 61  RHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNE 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GK+V+  VRTSSGMFLNRGQDKI+SNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH
Sbjct: 121 TGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YD+F DE+N+K+ GQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELS+CGKGGLS
Sbjct: 181 YDFFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKPKMGDALLFWSMKPDTTLDPTSLHGACPVI GNKWSCTKW+H
Sbjct: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 279

BLAST of CcUC02G034210 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 2.9e-112
Identity = 199/284 (70.07%), Postives = 230/284 (80.99%), Query Frame = 0

Query: 3   VSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRF--FSPPETSHRNLPHRLASL 62
           ++K ++ + Q +KWST  L   ++ ++F+L  V+L+ L F  FS P  +  + P  L+  
Sbjct: 1   MAKLRHSRFQARKWSTLML---VLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYF 60

Query: 63  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 122
           R  A + S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC YLISLAKPHM KSTVVDS+
Sbjct: 61  RRAATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSE 120

Query: 123 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 182
           +GKS DSRVRTSSG FL RG+DKII  IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ H
Sbjct: 121 TGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPH 180

Query: 183 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 242
           YDYFVDE+N K GGQRMAT+LM  SDVEEGGETVFPAA  NFSSVP +NELSECGK GLS
Sbjct: 181 YDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLS 240

Query: 243 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKP+MGDALLFWSM+PD TLDPTSLHG CPVI GNKWS TKWMH
Sbjct: 241 VKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMH 281

BLAST of CcUC02G034210 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 4.6e-102
Identity = 180/265 (67.92%), Postives = 210/265 (79.25%), Query Frame = 0

Query: 22  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQW 81
           S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++W
Sbjct: 18  STLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERW 77

Query: 82  VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLN 141
           VE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD K+GKS DSRVRTSSG FL 
Sbjct: 78  VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLA 137

Query: 142 RGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA 201
           RG+DK I  IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+A
Sbjct: 138 RGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIA 197

Query: 202 TLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDT 261
           T+LM  SDVEEGGETVFPAAKGN+S+VP WNELSECGKGGLSVKPKMGDALLFWSM PD 
Sbjct: 198 TVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDA 257

Query: 262 TLDPTSLHGACPVISGNKWSCTKWM 282
           TLDP+SLHG C VI GNKWS TKW+
Sbjct: 258 TLDPSSLHGGCAVIKGNKWSSTKWL 282

BLAST of CcUC02G034210 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 3.7e-96
Identity = 174/285 (61.05%), Postives = 220/285 (77.19%), Query Frame = 0

Query: 5   KGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSH-RNLPHRLASLRYT 64
           K K ++ +P+K  + Q   +++ ++F++  ++L+ L  FS P T+   ++P  L ++  T
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVI--LILVGLGIFSLPSTNKTSSMPMDLTTIVQT 63

Query: 65  ALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 124
            +Q  +  G      GD+W+E ISWEPRAFVYHNFL+ EEC +LISLAKP M KS VVD 
Sbjct: 64  -IQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDV 123

Query: 125 KSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 184
           K+GKS+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ 
Sbjct: 124 KTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEP 183

Query: 185 HYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGL 244
           H+DYF DE+N++KGGQR+AT+LM  SDV+EGGETVFPAAKGN S VP W+ELS+CGK GL
Sbjct: 184 HHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGL 243

Query: 245 SVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           SV PK  DALLFWSMKPD +LDP+SLHG CPVI GNKWS TKW H
Sbjct: 244 SVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFH 285

BLAST of CcUC02G034210 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 3.9e-93
Identity = 172/288 (59.72%), Postives = 216/288 (75.00%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFILGFVMLLALRFFSPPETS-HRNLPHR 60
           MA    ++++ QP+K    ST   + +I+ LV IL   +LL L   S P  + + +  + 
Sbjct: 1   MASKSKQHLRYQPRKSVSRSTQAFTVLILLLVVIL---ILLGLGILSLPNANRNSSKTND 60

Query: 61  LASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTV 120
           L ++   +  SS      G++WVE ISWEPRA VYHNFL+ EEC +LISLAKP M KSTV
Sbjct: 61  LTNIVRKSETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTV 120

Query: 121 VDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQK 180
           VD K+G S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQK
Sbjct: 121 VDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQK 180

Query: 181 YDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGK 240
           Y+ HYDYF+DE+N K GGQR+AT+LM  SDV++GGETVFPAA+GN S+VP WNELS+CGK
Sbjct: 181 YEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK 240

Query: 241 GGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
            GLSV PK  DALLFW+M+PD +LDP+SLHG CPV+ GNKWS TKW H
Sbjct: 241 EGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFH 285

BLAST of CcUC02G034210 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 4.2e-63
Identity = 113/203 (55.67%), Postives = 153/203 (75.37%), Query Frame = 0

Query: 82  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQ 141
           +SW PR F+Y  FLS EEC + I LAK  +EKS V D+ SG+SV+S VRTSSGMFL++ Q
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQ 118

Query: 142 DKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLL 201
           D I+SN+E ++A +TF+P E+GE +QILHYE GQKY+ H+DYF D+ N++ GG R+AT+L
Sbjct: 119 DDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVL 178

Query: 202 M--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLD 261
           M  S+VE+GGETVFP  KG  + +   +  +EC K G +VKP+ GDALLF+++ P+ T D
Sbjct: 179 MYLSNVEKGGETVFPMWKGKATQLKD-DSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 238

Query: 262 PTSLHGACPVISGNKWSCTKWMH 283
             SLHG+CPV+ G KWS T+W+H
Sbjct: 239 SNSLHGSCPVVEGEKWSATRWIH 260

BLAST of CcUC02G034210 vs. ExPASy TrEMBL
Match: A0A1S3C2P6 (probable prolyl 4-hydroxylase 3 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496331 PE=4 SV=1)

HSP 1 Score: 530.8 bits (1366), Expect = 3.7e-147
Identity = 264/284 (92.96%), Postives = 267/284 (94.01%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+
Sbjct: 1   MAVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRLPSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK
Sbjct: 61  RRTAFQ-SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH
Sbjct: 121 TGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YDYFVDEYNIKKGGQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLS
Sbjct: 181 YDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Sbjct: 241 VKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279

BLAST of CcUC02G034210 vs. ExPASy TrEMBL
Match: A0A1S4DZG7 (probable prolyl 4-hydroxylase 3 OS=Cucumis melo OX=3656 GN=LOC103496316 PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 8.3e-139
Identity = 250/284 (88.03%), Postives = 260/284 (91.55%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVS GKYIKLQ KKWSTFQLSKMIMALV  LGF ML AL FFSPPETSH    HRL+S+
Sbjct: 1   MAVSIGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLSALWFFSPPETSH----HRLSSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS+
Sbjct: 61  RHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSE 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKSVDS VRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGE +QILHY VGQKYDAH
Sbjct: 121 TGKSVDSSVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEDIQILHYAVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YD+FVDEYN+K  GQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGK GLS
Sbjct: 181 YDFFVDEYNLKSVGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKSGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           +KPKMGDALLFWSMKPDTTLDPTSLHGACPVI GNKWSCTKW+H
Sbjct: 241 IKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 279

BLAST of CcUC02G034210 vs. ExPASy TrEMBL
Match: A0A6J1FBR3 (probable prolyl 4-hydroxylase 3 OS=Cucurbita moschata OX=3662 GN=LOC111444174 PE=4 SV=1)

HSP 1 Score: 496.5 bits (1277), Expect = 7.7e-137
Identity = 237/283 (83.75%), Postives = 261/283 (92.23%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVSKGKY+K Q +KWSTF+LSK+IMA +  LG  ML+A RFFSPPE+SH NL HR+AS+
Sbjct: 1   MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           ++ A+  SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHM KSTVVD+K
Sbjct: 61  QHRAVH-SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNK 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKS+DSRVRTSSGMFL RGQ+KI+SNIEKRIADFTFIP+EHGE LQILHYEVGQKYDAH
Sbjct: 121 TGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           +DYF DE+NIK+GGQRMATLLM  SDVEEGGETVFPAA+GNFSS+P WNELSECGKGGLS
Sbjct: 181 HDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM 282
           VKPKMGDALLFWSMKPD T+DPTSLHGACPVI GNKWSCTKWM
Sbjct: 241 VKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWM 282

BLAST of CcUC02G034210 vs. ExPASy TrEMBL
Match: A0A1S3C367 (probable prolyl 4-hydroxylase 3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496331 PE=4 SV=1)

HSP 1 Score: 493.4 bits (1269), Expect = 6.5e-136
Identity = 254/294 (86.39%), Postives = 258/294 (87.76%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASL 60
           MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+
Sbjct: 1   MAVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRLPSV 60

Query: 61  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120
           R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK
Sbjct: 61  RRTAFQ-SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 120

Query: 121 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180
           +GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH
Sbjct: 121 TGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 180

Query: 181 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 240
           YDYFVDEYNIKKGGQRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLS
Sbjct: 181 YDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLS 240

Query: 241 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMHGVLEVLCWGR 293
           VKPKMGDALLFWSMKPD TLDPTSLHG           C +        LCWGR
Sbjct: 241 VKPKMGDALLFWSMKPDATLDPTSLHG-----------CNE------NTLCWGR 272

BLAST of CcUC02G034210 vs. ExPASy TrEMBL
Match: A0A6J1CNS9 (probable prolyl 4-hydroxylase 3 OS=Momordica charantia OX=3673 GN=LOC111013165 PE=4 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 2.5e-135
Identity = 242/286 (84.62%), Postives = 260/286 (90.91%), Query Frame = 0

Query: 1   MAVSKGKY--IKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLA 60
           MAVSKGKY  I    +KWST +LSK+IMALV  LGF MLLALRFFSPPE+S  NLP RLA
Sbjct: 1   MAVSKGKYGKINFHGRKWSTLELSKIIMALVLALGFSMLLALRFFSPPESSDPNLPDRLA 60

Query: 61  SLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD 120
           S+R  A++ S+GLGKRG+QWVE ISWEPRAF+YHNFLSKEECLYLISLAKP M KSTV+D
Sbjct: 61  SVRRKAVE-SEGLGKRGEQWVEVISWEPRAFIYHNFLSKEECLYLISLAKPRMVKSTVID 120

Query: 121 SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYD 180
           S++GKS+DSRVRTSSGMFL+RGQD+II NIEKRIADFTFIPIEHGEGLQILHYEVGQKYD
Sbjct: 121 SETGKSMDSRVRTSSGMFLSRGQDRIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYD 180

Query: 181 AHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGG 240
           AH+DYFVDEYNIKKG QRMATLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGG
Sbjct: 181 AHHDYFVDEYNIKKGSQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGG 240

Query: 241 LSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           LSVKPKMGDALLFWSMKPD +LDPTSLHGACPVI GNKWSCTKWMH
Sbjct: 241 LSVKPKMGDALLFWSMKPDASLDPTSLHGACPVIKGNKWSCTKWMH 285

BLAST of CcUC02G034210 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 406.4 bits (1043), Expect = 2.0e-113
Identity = 199/284 (70.07%), Postives = 230/284 (80.99%), Query Frame = 0

Query: 3   VSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRF--FSPPETSHRNLPHRLASL 62
           ++K ++ + Q +KWST  L   ++ ++F+L  V+L+ L F  FS P  +  + P  L+  
Sbjct: 1   MAKLRHSRFQARKWSTLML---VLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYF 60

Query: 63  RYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 122
           R  A + S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC YLISLAKPHM KSTVVDS+
Sbjct: 61  RRAATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSE 120

Query: 123 SGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH 182
           +GKS DSRVRTSSG FL RG+DKII  IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ H
Sbjct: 121 TGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPH 180

Query: 183 YDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLS 242
           YDYFVDE+N K GGQRMAT+LM  SDVEEGGETVFPAA  NFSSVP +NELSECGK GLS
Sbjct: 181 YDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLS 240

Query: 243 VKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           VKP+MGDALLFWSM+PD TLDPTSLHG CPVI GNKWS TKWMH
Sbjct: 241 VKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMH 281

BLAST of CcUC02G034210 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 372.5 bits (955), Expect = 3.2e-103
Identity = 180/265 (67.92%), Postives = 210/265 (79.25%), Query Frame = 0

Query: 22  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQW 81
           S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++W
Sbjct: 18  STLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERW 77

Query: 82  VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLN 141
           VE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD K+GKS DSRVRTSSG FL 
Sbjct: 78  VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLA 137

Query: 142 RGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA 201
           RG+DK I  IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+A
Sbjct: 138 RGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIA 197

Query: 202 TLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDT 261
           T+LM  SDVEEGGETVFPAAKGN+S+VP WNELSECGKGGLSVKPKMGDALLFWSM PD 
Sbjct: 198 TVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDA 257

Query: 262 TLDPTSLHGACPVISGNKWSCTKWM 282
           TLDP+SLHG C VI GNKWS TKW+
Sbjct: 258 TLDPSSLHGGCAVIKGNKWSSTKWL 282

BLAST of CcUC02G034210 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 352.8 bits (904), Expect = 2.7e-97
Identity = 174/285 (61.05%), Postives = 220/285 (77.19%), Query Frame = 0

Query: 5   KGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSH-RNLPHRLASLRYT 64
           K K ++ +P+K  + Q   +++ ++F++  ++L+ L  FS P T+   ++P  L ++  T
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVI--LILVGLGIFSLPSTNKTSSMPMDLTTIVQT 63

Query: 65  ALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 124
            +Q  +  G      GD+W+E ISWEPRAFVYHNFL+ EEC +LISLAKP M KS VVD 
Sbjct: 64  -IQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDV 123

Query: 125 KSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 184
           K+GKS+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ 
Sbjct: 124 KTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEP 183

Query: 185 HYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGL 244
           H+DYF DE+N++KGGQR+AT+LM  SDV+EGGETVFPAAKGN S VP W+ELS+CGK GL
Sbjct: 184 HHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGL 243

Query: 245 SVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
           SV PK  DALLFWSMKPD +LDP+SLHG CPVI GNKWS TKW H
Sbjct: 244 SVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFH 285

BLAST of CcUC02G034210 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 342.8 bits (878), Expect = 2.8e-94
Identity = 172/288 (59.72%), Postives = 216/288 (75.00%), Query Frame = 0

Query: 1   MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFILGFVMLLALRFFSPPETS-HRNLPHR 60
           MA    ++++ QP+K    ST   + +I+ LV IL   +LL L   S P  + + +  + 
Sbjct: 1   MASKSKQHLRYQPRKSVSRSTQAFTVLILLLVVIL---ILLGLGILSLPNANRNSSKTND 60

Query: 61  LASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTV 120
           L ++   +  SS      G++WVE ISWEPRA VYHNFL+ EEC +LISLAKP M KSTV
Sbjct: 61  LTNIVRKSETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTV 120

Query: 121 VDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQK 180
           VD K+G S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQK
Sbjct: 121 VDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQK 180

Query: 181 YDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGK 240
           Y+ HYDYF+DE+N K GGQR+AT+LM  SDV++GGETVFPAA+GN S+VP WNELS+CGK
Sbjct: 181 YEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK 240

Query: 241 GGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH 283
            GLSV PK  DALLFW+M+PD +LDP+SLHG CPV+ GNKWS TKW H
Sbjct: 241 EGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFH 285

BLAST of CcUC02G034210 vs. TAIR 10
Match: AT5G66060.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 281.6 bits (719), Expect = 7.5e-76
Identity = 142/220 (64.55%), Postives = 169/220 (76.82%), Query Frame = 0

Query: 22  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQW 81
           S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++W
Sbjct: 18  STLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERW 77

Query: 82  VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLN 141
           VE ISWEPRA VYHNFL  EEC YLI LAKPHMEKSTVVD K+GKS DSRVRTSSG FL 
Sbjct: 78  VEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLA 137

Query: 142 RGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA 201
           RG+DK I  IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+A
Sbjct: 138 RGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIA 197

Query: 202 TLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGG 237
           T+LM  SDVEEGGETVFPAAKGN+S+VP WNELSECGKGG
Sbjct: 198 TVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGG 235

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038889689.12.1e-14992.25probable prolyl 4-hydroxylase 3 [Benincasa hispida][more]
XP_008456388.17.6e-14792.96PREDICTED: probable prolyl 4-hydroxylase 3 isoform X2 [Cucumis melo][more]
XP_011648735.29.3e-14590.85probable prolyl 4-hydroxylase 3 [Cucumis sativus] >KAE8651549.1 hypothetical pro... [more]
XP_016901368.11.7e-13888.03PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis melo][more]
XP_011650099.27.2e-13786.27probable prolyl 4-hydroxylase 3 [Cucumis sativus] >XP_031736348.1 probable proly... [more]
Match NameE-valueIdentityDescription
Q9LN202.9e-11270.07Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
F4JZ244.6e-10267.92Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S... [more]
F4JNU83.7e-9661.05Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=... [more]
Q24JN53.9e-9359.72Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1[more]
Q8L9704.2e-6355.67Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A1S3C2P63.7e-14792.96probable prolyl 4-hydroxylase 3 isoform X2 OS=Cucumis melo OX=3656 GN=LOC1034963... [more]
A0A1S4DZG78.3e-13988.03probable prolyl 4-hydroxylase 3 OS=Cucumis melo OX=3656 GN=LOC103496316 PE=4 SV=... [more]
A0A6J1FBR37.7e-13783.75probable prolyl 4-hydroxylase 3 OS=Cucurbita moschata OX=3662 GN=LOC111444174 PE... [more]
A0A1S3C3676.5e-13686.39probable prolyl 4-hydroxylase 3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC1034963... [more]
A0A6J1CNS92.5e-13584.62probable prolyl 4-hydroxylase 3 OS=Momordica charantia OX=3673 GN=LOC111013165 P... [more]
Match NameE-valueIdentityDescription
AT1G20270.12.0e-11370.072-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G66060.13.2e-10367.922-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G35810.12.7e-9761.052-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G17720.12.8e-9459.722-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G66060.27.5e-7664.552-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 86..282
e-value: 1.1E-56
score: 204.3
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 166..282
e-value: 8.8E-17
score: 61.8
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 79..282
e-value: 6.9E-75
score: 253.3
NoneNo IPR availablePANTHERPTHR10869:SF169PROLYL 4-HYDROXYLASE 3-RELATEDcoord: 4..282
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 4..282
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 162..283
score: 12.052887

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC02G034210.1CcUC02G034210.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0051213 dioxygenase activity
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen